Patent application title: METHODS AND COMPOSITIONS FOR PROGNOSTIC AND/OR DIAGNOSTIC SUBTYPING OF PANCREATIC CANCER
Inventors:
IPC8 Class: AC12Q16886FI
USPC Class:
Class name:
Publication date: 2022-04-28
Patent application number: 20220127676
Abstract:
Methods for generating a prognostic and/or subtype signature for a
subject with pancreatic ductal adenocarcinoma (PDAC) are provided. In
some embodiments, the methods include determining expression levels for
one or more genes listed in Tables 2-5, 9, 10, or 11, and/or the DE-S
and/or DE-T subset of genes in PDAC cells obtained from the subject,
wherein the determining provides a prognostic and/or subtype signature
for the subject. Also provided are methods for classifying a subject
diagnosed with pancreatic ductal adenocarcinoma (PDAC) as having an
activated stroma subtype or a normal stroma subtype of PDAC and/or a
basal subtype or a classical subtype of PDAC; and methods for identifying
a differential treatment strategy for a subject diagnosed with pancreatic
ductal adenocarcinoma (PDAC).Claims:
1.-14. (canceled)
15. A method of assaying a biological sample obtained from a subject comprising measuring a nucleic acid expression level of gene A and gene B for a plurality of gene pairs selected from the group consisting of Table 9, 10 and 11 in the biological sample obtained from the subject, wherein the subject has been diagnosed a cancer.
16. The method of claim 15, wherein the cancer is selected from the group consisting of pancreatic cancer, breast cancer and bladder cancer and the biological samples is obtained from the pancreas, the breast or the bladder, respectively.
17. The method of claim 16, wherein the plurality of gene pairs are selected from Table 9.
18. The method of claim 17, wherein the plurality of gene pairs selected from Table 9 comprises all of the gene pairs from Table 9.
19. The method of claim 16, wherein the cancer is pancreatic cancer and the biological samples is obtained from the pancreas.
20. The method of claim 19, wherein the plurality of gene pairs are selected from Table 10 and 11.
21. The method of claim 20, wherein the plurality of gene pairs selected from Table 10 comprises all of the gene pairs from Table 10.
22. The method of claim 20, wherein the plurality of gene pairs selected from Table 11 comprises all of the gene pairs from Table 11.
23. The method of claim 15, wherein the subject is a human.
24. A method for treating cancer in a subject diagnosed with cancer, the method comprising: (a) measuring a nucleic acid expression level of gene A and gene B for a plurality of gene pairs selected from Table 9 in a biological sample obtained from the subject, wherein the subject has been diagnosed with either breast cancer, pancreatic cancer or bladder cancer; (b) classifying the subject as having a basal subtype of the cancer based on the nucleic acid expression levels of gene A and gene B in each gene pair from the plurality of gene pairs selected from Table 9, wherein the classifying comprises calculating a value d using EQUATION 1, P i = { 1 .times. .times. if .times. .times. A i > B i 0 .times. .times. if .times. .times. B i .gtoreq. A i .times. .times. d = I + i .times. P i .times. C i .times. .times. decision = { Basal .times. .times. if .times. .times. d > 0 Not .times. .times. Basal .times. .times. if .times. .times. d .ltoreq. 0 EQUATION .times. .times. 1 ##EQU00006## wherein A.sub.i and B.sub.i are measured expression levels of each Gene A and each Gene B of Table 9 in the i.sup.th row, respectively, C.sub.i is the i.sup.th coefficient, and I is the intercept, and further wherein if d is greater than 0, the subject is classified as having a basal subtype, and if d is less than or equal to 0, the subject is classified as having a not basal subtype; and (c) administering a treatment for the subject based on the subject being classified as having a basal subtype, wherein the treatment is selected from agents for treating the basal subtype listed in FIG. 20, agents for treating the basal subtype listed in FIG. 21, agents listed in Table 4, agents listed in Table 6 and combinations thereof.
25. The method of claim 24, wherein the plurality of gene pairs selected from Table 9 comprises all of the gene pairs from Table 9.
26. A method for treating pancreatic cancer in a subject diagnosed with pancreatic cancer, the method comprising: (a) measuring a nucleic acid expression level of gene A and gene B for a plurality of gene pairs selected from Table 10 or Table 11 in a biological sample comprising pancreatic cells obtained from the subject; (b) classifying the subject as having a normal stroma subtype of pancreatic cancer or an activated stroma subtype of pancreatic cancer based on the nucleic acid expression levels of gene A and gene B in each gene pair from the plurality of gene pairs selected from Table 10, wherein the classifying comprises calculating a value d using EQUATION 2, P i = { 1 .times. .times. if .times. .times. A i > B i 0 .times. .times. if .times. .times. B i .gtoreq. A i .times. .times. d = I + i .times. P i .times. C i .times. .times. decision = { Activated .times. .times. Stroma .times. .times. if .times. .times. d > 0 Normal .times. .times. Stroma .times. .times. if .times. .times. d .ltoreq. 0 EQUATION .times. .times. 2 ##EQU00007## wherein A.sub.i and B.sub.i are measured expression levels of each Gene A and each Gene B of the plurality of gene pairs selected from Table 10 in the i.sup.th row, respectively, C.sub.i is the i.sup.th coefficient, and I is the intercept, and further wherein if d is greater than 0, the subject is classified as having an activated stroma subtype, and if d is less than or equal to 0, the subject is classified as having a normal stroma subtype OR classifying the subject as having a basal-like subtype of pancreatic cancer or a classical subtype of pancreatic cancer based on the nucleic acid expression levels of gene A and gene B in each gene pair from the plurality of gene pairs selected from Table 11, wherein the classifying comprises calculating a value d using EQUATION 3, P i = { 1 .times. .times. if .times. .times. A i > B i 0 .times. .times. if .times. .times. B i .gtoreq. A i .times. .times. d = I + i .times. P i .times. C i .times. .times. decision = { Basal - like .times. .times. if .times. .times. d > 0 Classical .times. .times. if .times. .times. d .ltoreq. 0 EQUATION .times. .times. 3 ##EQU00008## wherein A.sub.i and B.sub.i are measured expression levels of each Gene A and each Gene B of the plurality of gene pairs selected from Table 11 in the i.sup.th row, respectively, C.sub.i is the i.sup.th coefficient, and I is the intercept, and further wherein if d is greater than 0, the subject is classified as having a basal-like subtype, and if d is less than or equal to 0, the subject is classified as having a classical subtype; and (c) administering a treatment for the subject based on the subject being classified as having a normal stroma subtype, an activated stroma subtype, a basal-like subtype or a classical subtype, wherein the treatment for the normal stroma subtype is surgery alone or surgery prior to treatment with agents selected from the agents listed in Table 3, wherein the treatment for the activated stroma subtype is selected from the groups consisting of radiation, stroma modulation therapies noted in FIG. 20 and agents listed in or directed against the genes listed in Table 2, wherein the treatment for the basal-like subtype is cisplatin, oxaliplatin-based therapies, gemcitabine, chemotherapy with the agents listed in FIG. 20 and/or with agents listed in Tables 4 and 6 and/or against the genes listed in Tables 4 and 6, and wherein the treatment for the classical subtype is 5-fluorouracil, platinum-based therapy, surgery or prior to surgery, treatment with one or more agents listed in Table 5 or Table 6 or directed against the genes listed in Tables 5 and 6.
27. The method of claim 26, wherein the plurality of gene pairs selected from Table 10 comprises all of the gene pairs from Table 10.
28. The method of claim 26, wherein the plurality of gene pairs selected from Table 11 comprises all of the gene pairs from Table 11.
Description:
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application is a continuation of U.S. patent application Ser. No. 15/518,900, filed Apr. 13, 2017 (pending), which itself is a United States National Stage Application filed under 35 U.S.C. .sctn. 371 of PCT International Patent Application Serial No. PCT/2015/055565, filed Oct. 14, 2015, which itself is based on and claims priority to U.S. Provisional Patent Application Ser. No. 62/201,793, filed Aug. 6, 2015 and U.S. Provisional Patent Application Ser. No. 62/063,719, filed Oct. 14, 2014. The disclosure of each of these applications is incorporated by reference herein in its entirety.
REFERENCE TO SEQUENCE LISTING
[0003] The Sequence Listing associated with the instant disclosure has been electronically submitted to the United States Patent and Trademark Office as the Receiving Office as a 521 kilobyte ASCII text file created on May 26, 2021 and entitled "421_357_2_PCT_US_CON_ST25.txt". The Sequence Listing submitted via EFS-Web is hereby incorporated by reference in its entirety.
TECHNICAL FIELD
[0004] The presently disclosed subject matter relates to compositions and methods for producing gene expression profiles for subjects that have or are suspected of having pancreatic cancer and employing the same to identify appropriate treatment approaches.
BACKGROUND
[0005] Pancreatic ductal adenocarcinoma (PDAC), comprising over 90% of all pancreatic cancers, remains a lethal disease with an estimated 232,000 new cases and an estimated 227,000 deaths per year worldwide in 2008 (Parkin et al., 2002; Boyle & Levin, 2008). Incremental improvements in the treatment of this cancer have been made in the last two decades, but the estimated five-year survival worldwide remains at less than 5% (Boyle & Levin, 2008).
[0006] Currently, the standard of care for the 20% of patients who are diagnosed with localized disease is surgery followed by chemotherapy with gemcitabine. Unfortunately, despite the use of adjuvant therapy, median survival remains at less than two years (Neuhaus et al., 2008), with only 12% of patients undergoing curative surgery surviving more than five years (Conlon et al., 1996; Ahmad et al., 2001; Cleary et al., 2004; Han et al., 2006; Winter et al., 2006; Ferrone et al., 2008; Schnelldorfer et al., 2008).
[0007] PDAC is thus characterized by a lack of effective targeted therapies, clinically useful biomarkers, and consensus subtypes. Therefore, understanding molecular mechanisms of disease underlying PDAC has the potential to facilitate the development of rationally designed therapies, and could assist in tailoring the use of the same to individual patients. Interestingly, in large retrospective studies examining actual long-term (five- and ten-year) survivors (Conlon et al., 1996; Ahmad et al., 2001; Cleary et al., 2004; Han et al., 2006; Winter et al., 2006; Ferrone et al., 2008; Schnelldorfer et al., 2008), only two studies (Ahmad et al., 2001; Winter et al., 2006) have found that adjuvant therapy was associated with improved survival, suggesting that the benefits of adjuvant therapy are still controversial. In addition, gene sequencing of rare long-term survivors suggests that gene mutations in those tumors are no different than PDAC patients with more aggressive disease. One possible conclusion from these studies is that tumor biology in PDAC is more complex than gene mutations. Unfortunately, previous work using gene expression has been hampered by the low cellularity of malignant epithelium in PDAC patient samples. The low cellularity of PDAC poses a diagnostic dilemma as well in that biopsies of the tumor many times is non-diagnostic.
[0008] Despite these difficulties, defining subtypes of PDAC that would dictate the type whether it be tumor extirpation, chemotherapy or molecular and immunotherapy and timing of those therapies for patients would be beneficial. For PDAC in particular, better diagnostic tests independent of tumor cellularity would be beneficial. Achieving these goals is the ultimate goal of precision medicine.
SUMMARY
[0009] This Summary lists several embodiments of the presently disclosed subject matter, and in many cases lists variations and permutations of these embodiments. This Summary is merely exemplary of the numerous and varied embodiments. Mention of one or more representative features of a given embodiment is likewise exemplary. Such an embodiment can typically exist with or without the feature(s) mentioned; likewise, those features can be applied to other embodiments of the presently disclosed subject matter, whether listed in this Summary or not. To avoid excessive repetition, this Summary does not list or suggest all possible combinations of such features.
[0010] In some embodiments, the presently disclosed subject matter provides methods for generating a prognostic and/or subtype signature for a subject with pancreatic ductal adenocarcinoma (PDAC). In some embodiments, the methods comprise determining expression levels for one or more genes selected from the group consisting of those genes listed in Tables 2-5 in PDAC cells obtained from the subject, wherein the determining provides a prognostic and/or subtype signature for the subject. In some embodiments, the methods comprise determining expression levels for one or more genes listed in Table 1 as corresponding to the DE-S or DE-T subset in PDAC cells obtained from the subject, wherein the determining provides a prognostic and/or subtype signature and/or subtype identification that can be a diagnostic, prognostic, and/or treatment-determinative call for the subject. In some embodiments, the methods comprise determining expression levels for all of the genes listed in Tables 2-5 and/or for all of the genes listed in Table 1 as corresponding to the DE-S or DE-T subset in PDAC cells obtained from the subject.
[0011] In some embodiments, the methods further comprise comparing a first prognostic and/or subtype signature determined for the genes in Table 2 to a second prognostic and/or subtype signature for the genes in Table 3, wherein the comparing classifies the subject as having a PDAC subtype that is associated with either normal or activated stroma.
[0012] In some embodiments, the methods further comprise comparing a first prognostic and/or subtype signature determined for the genes in Table 4 to a second prognostic and/or subtype signature for the genes in Table 5, wherein the comparing classifies the subject as having a PDAC subtype that is a classical subtype or a basal subtype.
[0013] The presently disclosed subject matter also provides methods for classifying a subject diagnosed with pancreatic ductal adenocarcinoma (PDAC) as having an activated stroma subtype or a normal stroma subtype of PDAC. In some embodiments, the methods comprise (a) determining expression levels of the genes listed in Table 2 or an informative subset thereof and in Table 3 or an informative subset thereof in a biological sample comprising PDAC cells obtained from the PDAC of the subject; (b) creating an expression profile, wherein the expression profile encompasses expression levels of the genes listed in Table 23 or the informative subset thereof and the genes listed in Table 3 or the informative subset thereof; and (c) using the expression profiles created in the form of analysis of top scoring pairs of genes, wherein the analysis employs a trained logistic model in which binary input from discriminatory gene pairs are input and classification odds results are produced, whereby the subject is classified as having an activated stroma subtype or a normal stroma subtype of PDAC. In some embodiments, the method comprises comparing the expression profiles created to a standard, wherein the comparing employs a Bayesian classification reflecting a distance from (1) an activated stroma centroid that is high magnitude for all activated stroma genes and low magnitude for all normal stroma discriminatory genes; and (2) a normal stroma centroid that is high magnitude for all normal stroma genes and low magnitude for all activated stroma discriminatory genes. In some embodiments, the comparing determines whether the expression profile is closer to the activated stroma centroid or the normal stroma centroid, whereby the subject is classified as having an activated stroma subtype or a normal stroma subtype of PDAC. In some embodiments, the expression profiles comprise expression levels for each of the genes listed in Table 10, and the using comprises calculating a value d using EQUATION 2,
P i = { 1 .times. .times. if .times. .times. A i > B i 0 .times. .times. if .times. .times. B i .gtoreq. A i .times. .times. d = I + i .times. P i .times. C i .times. .times. decision = { Activated .times. .times. Stroma .times. .times. if .times. .times. d > 0 Normal .times. .times. Stroma .times. .times. if .times. .times. d .ltoreq. 0 EQUATION .times. .times. 2 ##EQU00001##
wherein A.sub.i and B.sub.i are measured expression levels of each Gene A and each Gene B of Table 10 in the i.sup.th row, respectively, C.sub.i is the i.sup.th coefficient, and I is the intercept, and further wherein if d is greater than 0, the subject is classified as having an activated stroma subtype, and if d is less than or equal to 0, the subject is classified as having a normal stroma subtype of PDAC.
[0014] The presently disclosed subject matter also provides methods for classifying a subject diagnosed with pancreatic ductal adenocarcinoma (PDAC) as having a basal subtype or a classical subtype of PDAC. In some embodiments, the methods comprise (a) determining expression levels of the genes listed in Table 4 or an informative subset thereof and in Table 5 or an informative subset thereof in a biological sample comprising PDAC cells obtained from the PDAC of the subject; (b) creating an expression profile, wherein the expression profile encompasses expression levels of the genes listed in Table 4 or the informative subset thereof and the genes listed in Table 5 or the informative subset thereof; and (c) using the expression profiles created in the form of analysis of top scoring pairs of genes, wherein the analysis is composed of a trained logistic model in which binary input from discriminatory gene pairs are input and classification odds results are produced, whereby the subject is classified as having a basal subtype or a classical subtype of PDAC. In some embodiments, the method comprises (c) comparing the expression profiles created to a standard, wherein the comparing employs a Bayesian classification reflecting a distance from (1) a basal centroid that is high magnitude for all basal genes and low magnitude for all classical discriminatory genes; and (2) a classical centroid that is high magnitude for all classical genes and low magnitude for all basal discriminatory genes. In some embodiments, the comparing determines whether the expression profile is closer to the basal centroid or the classical centroid, whereby the subject is classified as having a basal subtype or a classical subtype of PDAC. In some embodiments, the expression profiles comprise expression levels for each of the genes listed in Table 11, and the using comprises calculating a value d using EQUATION 3,
P i = { 1 .times. .times. if .times. .times. A i > B i 0 .times. .times. if .times. .times. B i .gtoreq. A i .times. .times. d = I + i .times. P i .times. C i .times. .times. decision = { Basal - like .times. .times. if .times. .times. d > 0 Classical .times. .times. if .times. .times. d .ltoreq. 0 EQUATION .times. .times. 3 ##EQU00002##
wherein A.sub.i and B.sub.i are measured expression levels of each Gene A and each Gene B of Table 11 in the i.sup.th row, respectively, C.sub.i is the i.sup.th coefficient, and I is the intercept, and further wherein if d is greater than 0, the subject is classified as having a basal-like subtype, and if d is less than or equal to 0, the subject is classified as having a classical subtype of PDAC.
[0015] In some embodiments, the presently disclosed subject matter also provides methods for identifying a differential treatment strategy for a subject diagnosed with pancreatic ductal adenocarcinoma (PDAC) and/or for diagnosing PDAC on low cellularity biopsies. In some embodiments, the methods comprise (a) determining the expression levels of the genes listed in Tables 2-5 in a biological sample comprising PDAC cells obtained from the PDAC of the subject; (b) creating an expression profile for the subject based on the expression levels of the genes listed in Tables 2-5; (c) classifying the subject as having an activated stroma subtype or a normal stroma subtype of PDAC, a basal subtype or a classical subtype of PDAC, or both; and (d) selecting a treatment strategy for the subject based on the classification of the subject as having an activated stroma subtype or a normal stroma subtype of PDAC, a basal subtype or a classical subtype of PDAC, an activated stroma/basal subtype of PDAC, a normal stroma/basal subtype of PDAC, an activated stroma/classical subtype of PDAC, or a normal stroma/classical subtype of PDAC, wherein a differential treatment strategy for the subject is identified. In some embodiments, the method further comprises (e) diagnosing PDAC on a patient with inadequate tumor cells by classifying the subject as having an activated stroma subtype or a normal stroma subtype of PDAC.
[0016] In some embodiments of the instantly disclosed methods where the genes to be assayed are those set forth in Tables 2-5, the genes referred to herein as DE-S and/or DE-T can be employed rather than those in Tables 2-5.
[0017] In some embodiments of the presently disclosed methods, the subject is a human.
[0018] It is thus an object of the presently disclosed subject matter to provide methods for predicting outcomes of subjects with pancreatic cancer.
[0019] An object of the presently disclosed subject matter having been stated hereinabove, and which is achieved in whole or in part by the presently disclosed subject matter, other objects will become evident as the description proceeds when taken in connection with the accompanying Figures as best described herein below.
BRIEF DESCRIPTION OF THE FIGURES
[0020] The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.
[0021] FIGS. 1A-1D are representative hematoxylin and eosin (H&E) staining of patient tumor samples. FIG. 1A depicts liver metastases showing regions of tumor and normal tissue. FIG. 1B depicts a primary pancreatic tumor sample showing normal pancreatic tissue and tumor cells in the same field. FIG. 1C depicts a primary pancreatic tumor with high tumor cellularity. FIG. 1D depicts a primary pancreatic tumor with abundant tumor stroma. Black arrowheads show areas of tumor stroma. Black arrows show areas of tumor. White arrowheads show normal tissue. Scale bars, 200 .mu.m.
[0022] FIG. 2 depicts the percentage of tumor in primary pancreatic tumors in the UNC and International Cancer Genome Consortium (ICGC) cohorts.
[0023] FIGS. 3A-3D depict the results of Successful Deconvolution of Normal Tissue with NMF. FIG. 3A is a cartoon depicting the major cell types in primary tumor and liver metastasis samples. FIG. 3B (above) is an overlap of sample types (solid colors) with factor weights (grayscale heat maps), and (below) heat maps of five exemplar genes for all tumors and adjacent normal tissues. Gene expression shown in the heat map has been Z-normalized. FIG. 3C is a series of Box and Whiskers plots comparing NMF factor weights across tissue types and corresponding t-test result. FIG. 3D is a series of plots showing percent tumor cellularity versus NMF liver factor weight, and NMF basal tumor factor weight for metastases to the liver and adjacent liver samples. Linear regression lines are shown in red along with corresponding statistics.
[0024] FIGS. 4A-4D depict the results of a series of experiments that demonstrated that a dual action of stroma is described by distinct gene expression patterns which are not expressed in cell lines. FIG. 4A is a consensus clustered heat map of UNC primary tumor samples, metastases, and cell lines using genes from stromal factors. Samples clustered into 3 groups, describing samples with activated stroma, normal stroma, and samples with low or absent stromal gene expression. FIG. 4B is a Kaplan-Meier survival analysis of resected PDAC patients from the activated and normal stromal clusters shows that samples in the activated stroma group have worse prognosis, with a hazard ratio of 1.94 (CI=[1.11, 3.37], p=0.019). FIG. 4C shows gene expressions of various stromal signatures were overexpressed in cancer associated fibroblasts (CAFs) as compared to tumor cell lines. FIG. 4D is a series of plots showing that genes from both stromal signatures were specifically overexpressed by the mouse stroma in PDX tumors, and not expressed by the human tumor cells.
[0025] FIG. 5 depicts deconvolution of a large cohort of PDAC revealed distinct gene expression patterns from multiple tissue types. Solid color bars above the heat map show the tissue of origin and tumor status of the samples, which were used to order the samples horizontally. Factor weights derived by NMF for selected factors are shown as grayscale bars. Heat maps show Z-normalized gene expression of five exemplar genes from each factor. All tumors, cell lines, and adjacent normal tissues from the present cohort are shown.
[0026] FIG. 6 depicts a correlation of pathology assessments of tumor with factor weights in normal pancreas and primary tumors. Horizontal axes all show tumor cellularity, while vertical axes show factor weight. Red dashed lines show best linear fits. p values are given for each R.sup.2.
[0027] FIGS. 7A-7H depict the results of a series of experiments that showed that tumor specific gene expression suggested two subtypes of PDAC with similarities to other tumor types. FIG. 7A is a consensus clustered heat map of primary tumors, metastatic tumors, and cell line models of PDAC using correlation as the underlying distance function shows two subtypes of PDAC FIG. 7B is a Kaplan-Meier survival analysis of resected primary patients from each tumor subtype (36 basal-like, 89 classical) in FIG. 7A shows differential prognosis among subtypes with a hazard ratio of 1.89, and a 95% CI of [1.19, 3.02]. FIG. 7C is a consensus clustered heat map of tumors in the ICGC PDAC cohort split by basal and classical factor gene expression into basal-like (n=56) and classical (n=47) tumors. FIG. 7D is a plot showing that basal-like tumors in the ICGC data set had a hazard ratio of 2.11, with a 95% CI of [1.14, 3.89]. Median follow up was 20 months. FIG. 7E is a consensus clustered heat map of The Cancer Genome Atlas (TCGA) Bladder cancer (BLCA) samples split by basal and classical factor gene expression into basal-like (n=128) and classical-like (n=95) tumors strongly agrees with BASE47 basal calls shown above the heat map. FIG. 7F shows subtyping in the TCGA BLCA data set had a hazard ratio of 1.43, with a 95% CI of [0.84, 2.42] FIG. 7G is a consensus clustered heat map of the Perou breast cancer data set as split by basal factor genes (n=72 basal-like, n=223 not basal) strongly agrees with the division of samples into previously published basal and non-basal subtypes. FIG. 7H shows that basal-like breast cancer, as defined by the presently disclosed subject matter, had a hazard ratio of 3.52, with a 95% CI of [1.94, 6.38].
[0028] FIGS. 8A-8F are a series of immunofluorescence images of Cancer Associated Fibroblasts (CAFs). Staining using antibodies against EpCAM (FIG. 8A), vimentin (FIG. 8B), and SMA.alpha. (FIG. 8C). FIG. 8D shows staining of T3M4 cells as a positive control for EpCAM. FIG. 8E shows staining of T3M4 cells as a negative control for vimentin, and FIG. 8F shows staining of T3M4 cells as a negative control for SMA.alpha.. Scale bars are 50 .mu.m.
[0029] FIG. 9 is a hierarchical clustering of Spearman correlation of samples from UNC, TCGA Bladder, and Perou data sets showing similarities among basal-like subtype samples. Color bars above the heat map show subtype, either from original publication (Known Tumor Subtype), or from the cross-platform classifier (Pan-platform classification).
[0030] FIGS. 10A-10C depict comparisons to the subtypes disclosed in Collisson et al., 2011. FIG. 10A is a consensus clustered heat map of normalized data from UNC and Collisson et al. using Collisson et al.'s gene sets. Primary tumors, normal pancreas, and cell lines are shown. Collisson samples were previously classified as exocrine-like (magenta or black), classical (cyan or dark grey), and quasimesenchymal (yellow or light gray). FIG. 10B is a Kaplan-Meier plots of UNC samples classified by PAM into Collisson et al.'s subtypes. FIG. 10C is a series of plots of mouse and human specific gene expression of the Collisson et al. gene lists in PDX shown in log.sub.2(1+RPKM). Classical genes are expressed by tumor cells, quasimesenchymal genes are expressed by a mix of human and mouse, while exocrine-like genes are lowly expressed throughout.
[0031] FIGS. 11A-11E depict the results of multivariate survival analysis of tumor and stromal subtypes. FIG. 11A is a heat map of tumor samples using 25 genes from each of the tumor and stromal factors, with samples sorted horizontally by classification. Signature scores for selected gene sets appear above for each sample. FIG. 11B is a combined Kaplan-Meier survival analysis of resected primary patients from basal-like or classical tumor types and normal or activated stroma subtypes with differential survival (p<0.001 log-rank test). Differential prognosis among subtypes shows complementarity. Classical tumors with normal stroma subtypes (n=24) had the lowest hazard ratio of 0.39, and a 95% CI of [0.21, 0.73], while basal-like tumors with activated stroma subtypes (n=26) had the highest hazard ratio of 2.28 with a 95% CI of [1.34, 3.87]. FIG. 11C is a Kaplan-Meir survival analysis showing that patients with classical subtype tumors show less response to adjuvant therapy (HR=0.76, 95% CI [0.40, 1.43]) compared to FIG. 11D is a plot showing basal-like tumors (HR of 0.38, and a 95% CI of [0.14, 1.09]). FIG. 11E is a Kaplan-Meir survival analysis showing that African-Americans have worse overall survival in both basal-like and classical subtypes, with a Hazard ratio of 2.28 and a 95% CI of [1.16,4.5].
[0032] FIGS. 12A-12D are a series of immunohistochemical panels of Collagen I staining to define mouse stroma in PDX. FIG. 12A shows anti-mouse Collagen I staining of stroma in a representative PDX tumor. FIG. 12B is a corresponding H&E stain of the section adjacent to that shown in FIG. 12A. Anti-mouse Collagen I staining of mouse skin (FIG. 12C) and human skin (FIG. 12D) are also depicted. Black arrowheads show areas of tumor stroma. Black arrows show areas of tumor. Scale bars, 200 .mu.m.
[0033] FIGS. 13A and 13B depict the results of tumor gene expression in PDX models. FIG. 13A is a series of plots of mouse and human specific gene expression of basal-like and classical subtype gene lists in 37 PDX tumors shown in log.sub.2(1+RPKM). Both gene sets were robustly expressed by the human (tumor) but not the mouse (stroma) cells in PDX samples. FIG. 13B is a consensus clustering of these PDX tumors using basal-like and classical gene lists divides samples into 2 groups.
[0034] FIGS. 14A-14I depict associations between tumor and stroma subtypes, PDX tumors, KRAS mutations, and SMAD4 expression. FIG. 14A is a series of pie charts showing that tumor subtype was not associated with PDX graft success rate (p=0.417). FIG. 14B is a series of pie charts showing that activated stromal subtype samples engrafted with higher success rates than low or normal stromal subtype samples (p=0.019) FIG. 14C is a plot showing that basal-like tumor subtype PDX reached 200 mm.sup.3 faster than classical subtype PDX (p=0.032). FIG. 14D is a plot showing that PDX from samples with activated stroma subtype or normal stroma subtype did not have significantly different times to reach 200 mm.sup.3 (p=0.170). FIG. 14E is a plot showing that PDX tumors with faster growth rates were associated with earlier recurrences in patients (HR=0.31, 95% CI [0.10, 0.92]. FIG. 14F is a series of pie charts showing that KRAS mutation type was not uniformly distributed among race or subtype. KRAS G12D mutations were more prevalent in basal-like subtype tumors than classical tumors (p=0.030). FIG. 14G is a series of pie charts showing that African Americans had more G12V mutations, while Caucasians had more G12D mutations (p<0.001). FIG. 14H is a plot showing that SMAD4 staining in primary tumors was predictive of successful PDX engraftment (p=0.044). FIG. 14I is a plot showing that basal-like subtype PDX exhibited weaker SMAD4 staining than classical subtype PDX (p=0.015).
[0035] FIGS. 15A-15G are a series of immunohistochemical panels showing SMAD4 staining of representative patient and matched PDX tumors. Positive SMAD4 staining of a patient adenocarcinoma is shown in FIG. 15A, and the corresponding PDX at passage 4 is shown in FIG. 15B. SMAD4 loss in a patient adenocarcinoma is shown in FIG. 15C and corresponding PDX at passage 2 is shown in FIG. 15D. SMAD4 staining of control human skin is shown in FIG. 15E, and is shown in mouse skin in FIG. 15F and in human normal pancreas in FIG. 15G. Scale bars are 200 .mu.m.
[0036] FIG. 16 depicts a consensus clustered heat map of ICGC data for which genetic information was available. Color bars above the heat map show subtypes and genetic alterations for key genes in PDAC. Heat maps show Z-normalized gene expression of basal-like and classical tumor genes.
[0037] FIGS. 17A-17C are a series of plots showing Gene signature scores by subtype normalized across the cohort, and calculated as the mean expression across a panel of genes obtained from MsigDB. FIG. 17A shows that the basal-like subtype showed downregulation of GATA6. FIG. 17B shows that the classical subtype tumors were enriched in genes associated with mucinous ovarian cancer. FIG. 17C shows that basal-like subtype tumors were enriched in genes related to KRAS activation and STK11 loss.
[0038] FIGS. 18A-18C depict differences in extracellular mucin in classical and basal-like subtype tumors. FIG. 18A is a series of pie charts showing that number of samples with low (<10%) compared to high (.gtoreq.10%) extracelluar mucin content. Representative H&E stains of a sample with low degree (FIG. 18B) and high degree (FIG. 18C) of extracellular mucin content are also depicted. Scale bars are 200 .mu.m.
[0039] FIGS. 19A-19G depict the results of experiments showing that overcoming tumor cellularity revealed true heterogeneity among matched primary and metastatic sites. FIG. 19A shows that sample-sample correlations of matched primary and metastatic tumors using the 50 most differentially expressed genes across all samples ("DE50") caused samples to group by organ location. FIG. 19B shows that sample-sample correlations using 25 genes each from classical and basal-like tumor lists ("T50") caused samples to cluster instead by tumor subtype and patient of origin. FIG. 19C is a plot showing that the correlation of samples within the same patient was higher when using T50 genes than when using DE50 genes. FIG. 19D is a plot showing that correlation of samples originating in the same organ was higher when using DE50 than when using T50. FIG. 19E shows clustering of multiple samples from two patients using the DE50 divides samples by organ. Genes expressed highly in lung and liver tissue are noted with brackets. FIG. 19F shows clustering of the same samples from (e) using T50 genes separates samples by patient. Brackets note genes which differentiate the two patients. FIG. 19G is an diagram of sampled locations for these patients indicated by concentric circles and illustrating how samples simultaneously exhibit both patient (inner color) and organ (outer color) specific gene expression.
[0040] FIG. 20 is a summary of exemplary, non-limiting treatment strategy considerations for patients with non-metastatic disease based on stromal subtype as identified using EQUATION 2 or tumor subtype as identified using EQUATION 3 below.
[0041] FIG. 21 is a summary of exemplary, non-limiting treatment strategy considerations for patients with metastatic disease based on stromal subtype as identified using EQUATION 2 or tumor subtype as identified using EQUATION 3 below.
BRIEF DESCRIPTION OF THE SEQUENCES
[0042] The biosequences summarized in Table 1 are Accession Numbers for exemplary human nucleic acid sequences that are present in the GENBANK.RTM. biosequence database, the expression of which can be assayed in the practice of the presently disclosed methods. It is noted that the GENBANK.RTM. biosequence database Accession Numbers presented in Table 1 are exemplary only and that other nucleic acids including but not limited to other transcript variants that are also listed in the GENBANK.RTM. biosequence database under the corresponding Gene Names and/or that are derived from the listed loci can be employed for the analysis of subjects. Similarly, in the event that any of the sequences set forth in Table 1 are updated in the GENBANK.RTM. biosequence database, the updated sequences are also understood to be encompassed by the presently disclosed subject matter.
TABLE-US-00001 TABLE 1 Listing of GENBANK .RTM. Accession Numbers for Nucleic Acid Sequences of Exemplary Human Gene Products GENBANK .RTM. SEQ GENBANK .RTM. SEQ Gene Accession ID Accession ID Symbol No. No. Gene Symbol No. No. ABCA8.sup.N NM_001288985.1 1 COL1A2.sup.A NM_000089.3 27 ACTG2.sup.N NM_001615.3 2 COL3A1.sup.A NM_000090.3 28 ADAMTS1.sup.N NM_006988.3 3 COL5A1.sup.A NM_000093.4 29 AGR2.sup.C NM_006408.3 4 COL5A2.sup.A NM_000393.3 30 AGR3.sup.C NM_176813.3 5 COL10A1.sup.A NM_000493.3 31 ANGPTL7.sup.N NM_021146.3 6 COL11A1.sup.A NM_001854.3 32 ANXA8L2.sup.B NM_001098845.2 7 COMP.sup.A NM_000095.2 33 ANXA10.sup.C NM_007193.4 8 CST6.sup.B NM_001323.3 34 AREGB NM_001657.3 9 CTHRC1.sup.A NM_138455.3 35 ATAD4.sup.C NM_024320.3 10 CTSE.sup.C NM_001910.3 36 ATP1OB NM_025153.2 11 CTSL2.sup.B NM_001333.3 37 B3GNT5 NM_032047.4 12 CYP3A7C NM_000765.4 38 BCAS1 NM_003657.2 13 DCBLD2 NM_080927.3 39 BTNL8.sup.C NM_024850.2 14 DDC NM_001082971.1 40 C2ORF40.sup.N NM_032411.2 15 DES.sup.N NM_001927.3 41 C100RF116.sup.N NM_006829.2 16 DHRS9.sup.B NM_199204.1 42 C16orf74 NM_206967.2 17 FABP4.sup.N NM_001442.2 43 CAPN9 NM_006615.2 18 FAM3D.sup.C NM_138805.2 44 CD109 NM_133493.4 19 FAM83A.sup.B NM_032899.5 45 CDH11.sup.A NM_001797.2 20 FAP.sup.A NM_004460.3 46 CDH17.sup.C NM_004063.3 21 FGFBP1.sup.B NM_005130.4 47 CDH19.sup.N NM_021153.3 22 FN1.sup.A NM_212482.1 48 CEACAM6.sup.C NM_002483.6 23 FNDC1.sup.A NM_032532.2 49 CHST6 NM_021615.4 24 GPM6B.sup.N NM_001001995.1 50 CLRN3.sup.C NM_152311.3 25 GPR87.sup.B NM_023915.3 51 COL1A1.sup.A NM_000088.3 26 GPR160 NM_014373.2 52 GREM1.sup.A NM_013372.6 53 MYO1A.sup.C NM_001256041.1 82 HPGD NM_000860.5 54 NAB1 NM_005966.3 83 ID4.sup.N NM_001546.3 55 OGN.sup.N NM_033014.2 84 IGF1.sup.N NM_001111283.1 56 PLA2G10.sup.C NM_003561.1 85 IL20RB NM_144717.3 57 PLEKHA6 NM_014935.4 86 INHBA.sup.A NM_002192.2 58 PLP1.sup.N NM_000533.3 87 ITGA1l.sup.A NM_001004439.1 59 PLS1 NM_001145319.1 88 KCNE3 NM_005472.4 60 POSTN.sup.A NM_006475.2 89 KRT6A.sup.B NM_005554.3 61 PPP1R14.sup.C NM_030949.2 90 KRT6C.sup.B NM_173086.4 62 PTGES NM_004878.4 91 KRT7.sup.B NM_005556.3 63 PTX3.sup.N NM_002852.3 92 KRT15.sup.B NM_002275.3 64 RBPMS2.sup.N NM_194272.1 93 KRT16 NM_005557.3 65 REG4.sup.C NM_001159352.1 94 KRT17.sup.B NM_000422.2 66 RERGL.sup.N NM_024730.3 95 KRT20.sup.C NM_019010.2 67 RSPO3.sup.N NM_032784.4 96 LEMD1.sup.B NM_001199050.1 68 S100A2.sup.B NM_005978.3 97 LGALS4.sup.C NM_006149.3 69 SCEL.sup.B NM_144777.2 98 LMOD1.sup.N NM_012134.2 70 SCRG1.sup.N NM_007281.2 99 LOC400573.sup.C BC063383 71 SERPINB3.sup.B NM_006919.2 100 LPHN3.sup.N NM_015236.4 72 SERPINB4.sup.B NM_002974.3 101 LUM.sup.A NM_002345.3 73 SERPINB5 NM_002639.4 102 LY6D.sup.B NM_003695.2 74 SFRP2.sup.A NM_003013.2 103 LYZ.sup.C NM_000239.2 75 SLC2A1.sup.B NM_006516.2 104 MEOX2.sup.N NM_005924.4 76 5LC44A4 NM_025257.2 105 MET NM_001127500.1 77 SPARC.sup.A NM_003118.3 106 MMP11.sup.A NM_005940.3 78 SPINK4.sup.C NM_014471.1 107 MS4A8B NM_031457.1 79 SPRR1B.sup.B NM_003125.2 108 MSLN NM_005823.5 80 SPRR3.sup.B NM_005416.2 109 MYH11.sup.N NM_002474.2 81 ST6GALNAC1.sup.C NM_018414.4 110 SULF1.sup.A NM_001128205.1 111 TN54.sup.B NM_032865.5 119 SYNM.sup.N NM_145728.2 112 TSPAN8.sup.C NM_001168412.1 120 SYTL2 NM_001289610.1 113 UCA1.sup.B EU334869.1 121 TFF1.sup.C NM_003225.2 114 VCAN.sup.A NM_004385.4 122 TFF2.sup.C NM_005423.4 115 VGLL1.sup.B NM_016267.3 123 TFF3.sup.C NM_003226.3 116 VIT.sup.N NM_053276.3 124 THBS2.sup.A NM_003247.3 117 VSIG2.sup.C NM_014312.3 125 TMEM45B NM_138788.3 118 ZNF469.sup.A NM_001127464.1 126 .sup.AMember of the DE-S stromal subtype differentiation gene subset that is associated with the Activated stroma subtype .sup.BMember of the DE-T tumor subtype differentiation gene subset that is associated with the Basal tumor subtype .sup.CMember of the DE-T tumor subtype differentiation gene subset that is associated with the Classical tumor subtype .sup.NMember of the DE-S stromal subtype differentiation gene subset that is associated with the Normal stroma subtype
[0043] All of the nucleic acid sequences that correspond to the gene names listed in Table 1 and throughout the instant disclosure, including the corresponding GENBANK.RTM. biosequence database Accession Numbers, all annotations and references cited in the corresponding GENBANK.RTM. biosequence database entries, and all other nucleic acid sequences that correspond to the listed genetic loci that are present in the GENBANK.RTM. biosequence database and related annotations and references, are incorporated herein by reference in their entireties.
DETAILED DESCRIPTION
[0044] The present subject matter will be now be described more fully hereinafter with reference to the accompanying Examples, in which representative embodiments of the presently disclosed subject matter are shown. The presently disclosed subject matter can, however, be embodied in different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the presently disclosed subject matter to those skilled in the art.
I. General Considerations
[0045] Pancreatic ductal adenocarcinoma (PDAC) remains a lethal disease with a 5-year survival of 4%. Roughly half of PDAC patients present with metastases at the time of diagnosis, and metastatic disease remains the primary cause of mortality in patients. In this study, we set out to identify subtypes among PDAC patients, with a focus on understanding factors which contribute to patient outcome. A key hallmark of PDAC is the presence of extensive stromal and immune involvement, as well as the presence of endocrine, exocrine, and normal ductal pancreas cells. Additionally, metastatic samples often include cell types from the host organ. Thus, PDAC tumors are in fact complex mixtures in which malignant epithelial cells often represent only a minority of the bulk tumor. For this reason, normal and PDAC tissues often cluster separately from cell lines which are assumed to be purely neoplastic (Iacobuzio-Donahue et al., 2003).
[0046] Separating molecular signatures of tissue compartments from measurement of bulk tumor belongs to the general class of problems called blind source separation. Previous studies have used samples of chronic pancreatitis to control for the presence of desmoplastic stroma in tumor samples (Logsdon et al., 2003). In prostate cancer, Stuart et al. have used pathologist assessments of cell types to train models of gene expression signatures of tumor, stroma, and normal tissue (Stuart et al., 2004). In a follow up study, they used their learned gene lists for in silico estimation of tissue components in a larger set of data (Wang et al., 2010). A similar approach has also been used to quantify stromal content across multiple TCGA data sets (Yoshihara et al., 2013). Among source separation techniques, nonnegative matrix factorization (NMF) is especially well suited for biological data, because it constrains all sources to be positive in nature, reflecting the goal of identifying positive gene expression exemplars, rather than pairwise differences between tissue types. Alexandrov et al. have recently demonstrated that NMF is useful for a similar problem of identifying mutational signatures from the aggregate list of somatic mutations in human cancer samples (Alexandrov et al., 2013a,b).
[0047] As disclosed herein, NMF was applied to a large microarray data set of primary and metastatic samples of PDAC to evaluate tumor and stroma specific gene expression signatures. Briefly, NMF was defined as modeling the matrix X of expression for g genes and s samples, as the product of a matrix G of g gene weights for k factors and a matrix S of s sample weights for k factors. By looking at samples with mixed tumor and stroma cellularity, two tumor subtypes have been identified that were validated in multiple data sets, as well as important contributions from normal, immune, and stromal compartments.
II. Definitions
[0048] All technical and scientific terms used herein, unless otherwise defined below, are intended to have the same meaning as commonly understood by one of ordinary skill in the art. References to techniques employed herein are intended to refer to the techniques as commonly understood in the art, including variations on those techniques or substitutions of equivalent techniques that would be apparent to one of skill in the art. While the following terms are believed to be well understood by one of ordinary skill in the art, the following definitions are set forth to facilitate explanation of the presently disclosed subject matter.
[0049] Following long-standing patent law convention, the terms "a," "an," and "the" mean "one or more" when used in this application, including the claims. Thus, the phrase "a cell" refers to one or more cells, unless the context clearly indicates otherwise.
[0050] As used herein, the term "and/or" when used in the context of a list of entities, refers to the entities being present singly or in combination. Thus, for example, the phrase "A, B, C, and/or D" includes A, B, C, and D individually, but also includes any and all combinations and subcombinations of A, B, C, and D.
[0051] The term "comprising," which is synonymous with "including," "containing," and "characterized by," is inclusive or open-ended and does not exclude additional, unrecited elements and/or method steps. "Comprising" is a term of art that means that the named elements and/or steps are present, but that other elements and/or steps can be added and still fall within the scope of the relevant subject matter.
[0052] As used herein, the phrase "consisting of" excludes any element, step, and/or ingredient not specifically recited. For example, when the phrase "consists of" appears in a clause of the body of a claim, rather than immediately following the preamble, it limits only the element set forth in that clause; other elements are not excluded from the claim as a whole.
[0053] As used herein, the phrase "consisting essentially of" limits the scope of the related disclosure or claim to the specified materials and/or steps, plus those that do not materially affect the basic and novel characteristic(s) of the disclosed and/or claimed subject matter. For example, the presently disclosed subject matter in some embodiments can "consist essentially of" determining expression levels for one or more genes listed in Table 1 in PDAC cells present in a sample (e.g., a biopsy) obtained from a subject, which means that the recited gene(s) is/are the only genes for which an expression level or expression levels are determined. It is noted, however, that expression levels for various positive and/or negative control genes can also be determined, for example, to standardize and/or normalize the expression levels in PDAC cells of the genes employed, if desired, and still be within the scope of the phrase consist essentially of determining expression levels for one or more genes listed in Table 1.
[0054] With respect to the terms "comprising," "consisting essentially of," and "consisting of," where one of these three terms is used herein, the presently disclosed and claimed subject matter can include the use of either of the other two terms. For example, it is understood that the methods of the presently disclosed subject matter in some embodiments comprise the steps that are disclosed herein and/or that are recited in the claims, in some embodiments consist essentially of the steps that are disclosed herein and/or that are recited in the claims, and in some embodiments consist of the steps that are disclosed herein and/or that are recited in the claim.
[0055] The term "subject" as used herein refers to a member of any invertebrate or vertebrate species. Accordingly, the term "subject" is intended to encompass any member of the Kingdom Animalia including, but not limited to the phylum Chordata (i.e., members of Classes Osteichythyes (bony fish), Amphibia (amphibians), Reptilia (reptiles), Ayes (birds), and Mammalia (mammals)), and all Orders and Families encompassed therein. In some embodiments, the presently disclosed subject matter relates to human subjects.
[0056] Similarly, all genes, gene names, and gene products disclosed herein are intended to correspond to orthologs from any species for which the compositions and methods disclosed herein are applicable. Thus, the terms include, but are not limited to genes and gene products from humans. It is understood that when a gene or gene product from a particular species is disclosed, this disclosure is intended to be exemplary only, and is not to be interpreted as a limitation unless the context in which it appears clearly indicates. Thus, for example, the genes and/or gene products disclosed herein are also intended to encompass homologous genes and gene products from other animals including, but not limited to other mammals, fish, amphibians, reptiles, and birds.
[0057] The methods and compositions of the presently disclosed subject matter are particularly useful for warm-blooded vertebrates. Thus, the presently disclosed subject matter concerns mammals and birds. More particularly provided is the use of the methods and compositions of the presently disclosed subject matter on mammals such as humans and other primates, as well as those mammals of importance due to being endangered (such as Siberian tigers), of economic importance (animals raised on farms for consumption by humans) and/or social importance (animals kept as pets or in zoos) to humans, for instance, carnivores other than humans (such as cats and dogs), swine (pigs, hogs, and wild boars), ruminants (such as cattle, oxen, sheep, giraffes, deer, goats, bison, and camels), rodents (such as mice, rats, and rabbits), marsupials, and horses. Also provided is the use of the disclosed methods and compositions on birds, including those kinds of birds that are endangered, kept in zoos, as well as fowl, and more particularly domesticated fowl, e.g., poultry, such as turkeys, chickens, ducks, geese, guinea fowl, and the like, as they are also of economic importance to humans. Thus, also provided is the application of the methods and compositions of the presently disclosed subject matter to livestock, including but not limited to domesticated swine (pigs and hogs), ruminants, horses, poultry, and the like.
[0058] The term "about," as used herein when referring to a measurable value such as an amount of weight, time, dose, etc., is meant to encompass variations of in some embodiments .+-.20%, in some embodiments .+-.10%, in some embodiments .+-.5%, in some embodiments .+-.1%, and in some embodiments .+-.0.1% from the specified amount, as such variations are appropriate to perform the disclosed methods and/or to employ the presently disclosed arrays.
[0059] As used herein the term "gene" refers to a hereditary unit including a sequence of DNA that occupies a specific location on a chromosome and that contains the genetic instruction for a particular characteristic or trait in an organism. Similarly, the phrase "gene product" refers to biological molecules that are the transcription and/or translation products of genes. Exemplary gene products include, but are not limited to mRNAs and polypeptides that result from translation of mRNAs. Any of these naturally occurring gene products can also be manipulated in vivo or in vitro using well known techniques, and the manipulated derivatives can also be gene products. For example, a cDNA is an enzymatically produced derivative of an RNA molecule (e.g., an mRNA), and a cDNA is considered a gene product. Additionally, polypeptide translation products of mRNAs can be enzymatically fragmented using techniques well known to those of skill in the art, and these peptide fragments are also considered gene products.
[0060] It is understood that while exemplary nucleotide sequences for the human orthologs of the genes listed in Table 1 are disclosed herein, orthologs of these genes from other species are also included within the presently disclosed subject matter.
[0061] The term "isolated," as used in the context of a nucleic acid or polypeptide (including, for example, a nucleotide sequence, a polypeptide, and/or a peptide), indicates that the nucleic acid or polypeptide exists apart from its native environment. An isolated nucleic acid or polypeptide can exist in a purified form or can exist in a non-native environment.
[0062] Further, as used for example in the context of a cell, nucleic acid, polypeptide, or peptide, the term "isolated" indicates that the cell, nucleic acid, polypeptide, or peptide exists apart from its native environment. In some embodiments, "isolated" refers to a physical isolation, meaning that the cell, nucleic acid, polypeptide, or peptide has been removed from its native environment (e.g., from a subject).
[0063] The terms "nucleic acid molecule" and "nucleic acid" refer to deoxyribonucleotides, ribonucleotides, and polymers thereof, in single-stranded or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues of natural nucleotides that have similar properties as the reference natural nucleic acid. The terms "nucleic acid molecule" and "nucleic acid" can also be used in place of "gene," "cDNA," and "mRNA." Nucleic acids can be synthesized, or can be derived from any biological source, including any organism.
[0064] As used herein, the terms "peptide" and "polypeptide" refer to polymers of at least two amino acids linked by peptide bonds. Typically, "peptides" are shorter than "polypeptides," but unless the context specifically requires, these terms are used interchangeably herein.
[0065] As used herein, a cell, nucleic acid, or peptide exists in a "purified form" when it has been isolated away from some, most, or all components that are present in its native environment, but also when the proportion of that cell, nucleic acid, or peptide in a preparation is greater than would be found in its native environment. As such, "purified" can refer to cells, nucleic acids, and peptides that are free of all components with which they are naturally found in a subject, or are free from just a proportion thereof.
III. Methods for Generating Prognostic and/or Subtype Signatures
[0066] In some embodiments, the presently disclosed subject matter provides methods for generating prognostic and/or subtype signatures for a subject with cancer (e.g., pancreatic ductal adenocarcinoma (PDAC)). As used herein, the phrase "prognostic and/or subtype signature" refers to a gene expression profile comprising gene expression levels for one or more of the genes disclosed in Table 1 in PDAC cells obtained from the subject, wherein the determining provides a prognostic and/or subtype signature for the subject. In some embodiments, a gene expression profile of the presently disclosed subject matter can comprise gene expression levels for one, five, ten, 25, 50, or 100 of more of the genes listed in Tables 2-5. In some embodiments, a gene expression profile of the presently disclosed subject matter can comprise gene expression levels for all of the genes listed in Tables 2-5.
[0067] As disclosed herein, such gene expression profiles can be predictive of various clinical outcomes, for example, by comparing to appropriate standards.
[0068] In some embodiments, methods for generating prognostic and/or subtype signatures further comprise comparing the derived prognostic and/or subtype signatures to one or more standards. As used herein, the term "standard" refers to an entity to which another entity (e.g., a prognostic and/or subtype signature) can be compared such that the comparison provides information of interest. An exemplary standard that is described herein is a test set. Additional discussion of standards can be found herein below. Such a comparison can be carried out on an apparatus, such as a system comprising a suitably programmed computer.
[0069] Thus, a profile can be created once an expression level is determined for a gene. As used herein, the term "profile" (e.g., a "gene expression profile") refers to a repository of the expression level data that can be used to compare the expression levels of one or more genes, such as but not limited to one or more different genes among various subjects. For example, for a given subject, the term "profile" can encompass the expression levels of one or more of the genes disclosed herein detected in whatever units are chosen.
[0070] The term "profile" is also intended to encompass manipulations of the expression level data derived from a subject. For example, once relative expression levels are determined for a given set of genes in a subject, the relative expression levels for that subject can be compared to a standard to determine if the expression levels in that subject are higher or lower than for the same genes in the standard. Standards can include any data deemed to be relevant for comparison. Such a comparison can be carried out on an apparatus, such as a system comprising a suitably programmed computer. In some embodiments, an expression profile with respect to a plurality of the genes listed in Table 1 is presented such that a subject can be assigned into one particular treatment category (i.e., normal vs. activated stroma or classical vs. basal subtypes) based on the expression profile.
IV. Methods for Selecting a Treatment
[0071] The presently disclosed subject matter also provides methods for selecting a treatment for a subject diagnosed with pancreatic ductal adenocarcinoma (PDAC). In some embodiments, the methods comprise assigning the subject into a classification based on an analysis of a gene expression profile with respect to one or more of the genes listed in Table 1, wherein the analysis classifies the subject as having a tumor that corresponds to either a normal vs. an activated stroma subtype, or alternatively a classical vs. basal subtype.
[0072] In some embodiments a method for selecting a treatment comprises classifying a patient as being in a normal vs. an activated stroma subtype or a classical vs. basal subtype using one or more of Algorithms A-C described herein below.
[0073] IV.A. Overview of Exemplary Diagnostic Algorithms
[0074] The presently disclosed subject matter provides in some embodiments algorithms that can be employed for classifying PDAC subtypes in patient samples. In some embodiments, a particular algorithm is selected based on whether or not cytopathological assessment of the sample provides a reasonable basis for an initial diagnosis, and if so, whether the presence of metastatic disease is suggested thereby.
[0075] IV.A.1. Algorithm A: Diagnosing Pancreatic Cancer from a Non-Diagnostic Specimen on Traditional Cytopathology
[0076] Low tumor cellularity and high stroma content has long hampered the ability to diagnose pancreatic cancer on biopsies. According to pathology assessments, stroma comprises on average 39% of the primary tumor samples examined. At least 8% of endoscopic ultrasound biopsies are non-diagnostic (Gress et al., 2001). Biopsy results can alter the decision to proceed with surgery, which involves an operation that has an attendant postoperative complication and hospital readmission rates of 59% and mortality of 6% (DeOliveira et al., 2006; Eppsteiner et al., 2009; Yermilov et al., 2009). Therefore, clarity of biopsy results can be a key factor for correctly diagnosing patients and for assisting their physicians in determining appropriate treatment strategies.
[0077] The stroma subtypes disclosed herein have the potential to overcome the cellularity problem and provides a much needed diagnostic tool that leverages the most abundant component of tumor biopsies of pancreatic cancer. An example of the decision making process based on the genomic subtypes disclosed herein is described herein.
[0078] IV.A.2. Algorithm B: Diagnostic Specimen on Traditional Cytopathology or Diagnosis after Application of Algorithm A--Determining Tumor Subtype in the Non-Metastatic Setting
[0079] Despite curative operations, pancreatic cancer patients who have had their tumors fully resected only have a median survival of 23 months (Neuhaus et al., 2008). The majority of patients relapse with metastatic disease.
[0080] Thus, there has been much interest in using systemic therapies preoperatively in an attempt to treat micrometastatic disease that might be present at the time of surgery (i.e., neoadjuvant approaches). The tumor and stroma subtypes disclosed herein are independently prognostic and diagnostic, and can add value to prognosticating the outcome of patients. Algorithm B provides an exemplary treatment approach based on findings of specific subtype mixtures with classical/normal being the best and basal/activated the worst.
[0081] IV.A.3. Algorithm C: Determining Tumor Subtype in the Metastatic Setting
[0082] Recent studies have shown two promising chemotherapeutic regimens for patients with metastatic pancreatic cancer (Louvet et al., 2005; Conroy et al., 2011). However, promising targeted therapies have been lacking. Algorithm C provides an exemplary treatment approach dependent on subtype identified using the methods and compositions disclosed herein.
[0083] IV.B. Determination of Subtypes
[0084] Patient samples can be profiled for mRNA expression by any method that provides for an analysis of quantitative gene expression. Non-limiting examples of such techniques include whole transcriptome RNAseq, targeted RNAseq, SAGE, RT-PCR (particularly QRT-PCR), and cDNA microarray analyses. With respect to the presently disclosed methods, gene expression from the following lists are measured: (1) the four "core" expression lists for each of the four subtypes, which describe genes which are overexpressed in each subtype; and (2) the four "differential" expression lists, which define genes which are uniquely expressed in each subtype. Genes from the core lists are not mutually exclusive, as there are genes which are expressed by both tumor subtypes, and could be relevant targets for treatment in both groups. Genes from the core lists are used to select from among appropriate therapeutic targets for a particular subtype. Genes from differential lists are, by design, mutually exclusive and represent the most discriminatory biomarkers for subtype diagnosis. For classification purposes, the union of tumor subtype differential genes are referred to herein as "DE-T" (see Table 1), and the union of stromal subtype differentiation genes are referred to herein as "DE-S" (see Table 1).
[0085] Two classifiers, (one using DE-T, and one using DE-S), are used to classify new samples using a Bayesian framework that allows for incorporation of a priori evidence such as population prevalence, and allows for the assessment of confidence in each decision (Duda et al., 2012). For example, DE-S gene expression from an unknown sample is compared to the DE-S gene expression of each of two template centroids representing the two stromal subtypes. Or, for example, DE-T gene expression is assessed with a top-scoring-pairs logistic regression model to estimate probability of class membership. Samples are classified as the subtype with which they exhibit the highest degree of likelihood as formalized by maximum a posteriori probability and associated confidence level. Thus, each sample has both a stroma and a tumor classification type with associated confidences for clinical use.
[0086] Alternatively or in addition, the gene pairs disclosed in Tables 9-11 below can be employed for determining tumor and stromal subtypes in cancers including, but not limited to the breast, bladder, or pancreas. For example, cancers in these tissues can be identified as being basal-like or not basal-like using the gene pairs disclosed in Table 9 below. To classify each sample, gene expression from pairs of genes in Table 9 below can be compared such that for each gene pair, if Gene A expression is greater than Gene B expression, the coefficient for that gene pair was added to a running sum. If the sum of all such coefficients and the intercept from Table 9 below is greater than zero, the sample is classified as basal (see EQUATION 1).
[0087] Using the gene pairs in Table 9 below for breast, bladder, or pancreas, if A.sub.i and B.sub.i are the measured expression of Genes A and B of Table 9 in the i.sup.th row, C.sub.i is the i.sup.th coefficient, and I is the intercept, then a decision can be calculated as follows:
P i = { 1 .times. .times. if .times. .times. A i > B i 0 .times. .times. if .times. .times. B i .gtoreq. A i .times. .times. d = I + i .times. P i .times. C i .times. .times. decision = { Basal .times. .times. if .times. .times. d > 0 Not .times. .times. Basal .times. .times. if .times. .times. d .ltoreq. 0 EQUATION .times. .times. 1 ##EQU00003##
[0088] More particularly in the case of cancer of the pancreas, the gene pairs listed in Table 10 below can be employed for classifying a pancreas tumor as being of the activated stroma subtype or the normal stroma subtype. Using Table 10 below, if A.sub.i and B.sub.i are the measured expression of Genes A and B of Table 10 in the i.sup.th row, C.sub.i is the i.sup.th coefficient, and I is the intercept, then a decision can be calculated as in EQUATION 2:
P i = { 1 .times. .times. if .times. .times. A i > B i 0 .times. .times. if .times. .times. B i .gtoreq. A i .times. .times. d = I + i .times. P i .times. C i .times. .times. decision = { Activated .times. .times. Stroma .times. .times. if .times. .times. d > 0 Normal .times. .times. Stroma .times. .times. if .times. .times. d .ltoreq. 0 EQUATION .times. .times. 2 ##EQU00004##
[0089] Also more particularly in the case of cancer of the pancreas, the gene pairs listed in Table 11 below can be employed for classifying a pancreas tumor as being of the basal subtype or the classical subtype. Using Table 11 below, if A.sub.i and B.sub.i are the measured expression of Genes A and B of Table 11 in the i.sup.th row, C.sub.i is the i.sup.th coefficient, and I is the intercept, then a decision can be calculated as in EQUATION 3:
P i = { 1 .times. .times. if .times. .times. A i > B i 0 .times. .times. if .times. .times. B i .gtoreq. A i .times. .times. d = I + i .times. P i .times. C i .times. .times. decision = { Basal - like .times. .times. if .times. .times. d > 0 Classical .times. .times. if .times. .times. d .ltoreq. 0 EQUATION .times. .times. 3 ##EQU00005##
[0090] IV.C. Determination of Subtype-specific Treatment Strategies
[0091] Many of the genes that are descriptive for each subtype have yet to have an available drug. However, the majority are targetable and as drugs become available, and thus are expected to guide therapeutic decisions in the future.
[0092] At the current time, treatment of pancreatic cancer is limited to three regimens: gemcitabine, gemcitabine in combination with nab-paclitaxel (Von Hoff et al., 2013), and treatment with FOLFIRINOX (composed of folinic acid (leucovorin), fluorouracil, irinotecan, and oxaliplatin; Conroy et al., 2011). In those patients with non-metastatic disease, the subset of patients classified as classical/normal are offered surgery as the first stage of therapy. In those patients classified as classical/activated, the basal/activated subset and the basal/normal subset are offered chemotherapy (FOLFIRINOX or gemcitabine+nab-paclitaxel, dependent on oncologist and patient preference and patient tolerance) prior to surgery as outcome in patients with basal subtypes after surgery is poor, with 50% of patients relapsing and dying about 1 year after the surgery that had been intended to cure the disease. As therapies in trial become available, all patients with activated subtypes will be offered stroma modulating therapies (see examples described herein below) prior to surgery. In some embodiments, patients with basal subtypes derive greater benefit from chemotherapy after surgery as described herein.
[0093] For those patients with metastatic disease, the classical/normal subset of patients can proceed with currently available chemotherapies. For the subset of patients with other subtypes, therapies are tailored as described in more detail herein below. In some embodiments, different subtypes respond to different therapies, so as newer therapies develop the selected strategies can be altered.
[0094] Drug regimens can be further tailored by tumor and/or stroma subtype as drugs currently in early phase clinical trials become available. For instance, patients with activated stroma subtypes could benefit from extracellular matrix-associated therapies such as hyaluronidase treatment (currently in clinical trials) and/or collagenase treatment in combination with other therapies.
[0095] Patients with normal subtype tumors might not benefit from similar stroma-modulating agents, which conversely could be harmful. Rather, such patients' disease could be sensitive to anti-PDGFRB- or anti-TEK-directed therapy.
[0096] Patients with the basal subtype might benefit from AGS-14CD4, crizotinib, or erlotinib, or other kinase inhibitors that have anti-MET activity. Patients with classical subtypes might benefit from varespladib, cobicistat, traztuzumab, or other kinase inhibitors with anti-ERBB2 or anti-EGFR activity.
[0097] Finally, Table 6 shows a list of kinases that can be considered as therapeutic targets for patients with classical and basal subtype tumors.
[0098] Tables 2-5 list the genes that define each subtype and the currently known drugs and/or combination(s) of drugs that can be used based on the overall subtype. The gene lists in Tables 2-5 are descriptive for each subtype and are relevant to designing treatment regimens for each subtype, but are not necessarily mutually exclusive as multiple treatment possibilities can be considered for each subtype. For diagnostic purposes, subsets of these genes, which are unique to each subtype, were used (see DE-S and DE-T above).
[0099] Regardless of whether specific drugs have been effective in pancreatic cancer, the results disclosed herein suggested that pancreatic cancer is not one singular disease, and unless specific therapies are appropriately tailored, individual patients are unlikely to benefit from the current one size fits all approach to treatment. The findings disclosed herein can thus be used to personalize therapies to individual patients by reference to their tumor and/or stroma subtype.
V. Methods of Gene Expression/Transcriptome Analysis
[0100] V.A. Assay Formats
[0101] The genes identified as being differentially expressed in, for example, normal subtype vs. activated stroma subtype PDAC, or alternatively classical subtype vs. basal subtype PDAC, can be used in a variety of nucleic acid detection assays to detect and/or quantitate the expression level of a gene or multiple genes in a given sample. For example, Northern blotting, nuclease protection, RT-PCR (e.g., quantitative RT-PCR; QRT-PCR), and/or differential display methods can be used for detecting gene expression levels. In some embodiments, methods and assays of the presently disclosed subject matter are employed with array or chip hybridization-based methods and systems for detecting the expression of a plurality of genes. However, it is noted that any nucleotide analysis method can be employed with the presently disclosed subject matter, including in some embodiments RNA sequencing and transcriptome analysis.
[0102] Any hybridization assay format can be used, including solution-based and solid support-based assay formats. Representative solid supports containing oligonucleotide probes for differentially expressed genes of the presently disclosed subject matter can be filters, polyvinyl chloride dishes, silicon, glass based chips, etc. Such wafers and hybridization methods are widely available and include, for example, those disclosed in PCT International Patent Application Publication WO 1995/011755). Any solid surface to which oligonucleotides can be bound, either directly or indirectly, either covalently or non-covalently, can be used. An exemplary solid support is a high-density array or DNA chip. These contain a particular oligonucleotide probe in a predetermined location on the array. Each predetermined location can contain more than one molecule of the probe, but in some embodiments each molecule within the predetermined location has an identical sequence. Such predetermined locations are termed features. There can be any number of features on a single solid support including, for example, about 2, 10, 100, 1000, 10,000, 100,000, or 400,000 of such features on a single solid support. The solid support, or the area within which the probes are attached, can be of any convenient size (for example, on the order of a square centimeter).
[0103] Oligonucleotide probe arrays for differential gene expression monitoring can be made and employed according to any techniques known in the art (see e.g., Lockhart et al., 1996; McGall et al., 1996). Such probe arrays can contain at least two or more oligonucleotides that are complementary to or hybridize to two or more of the genes described herein. Such arrays can also contain oligonucleotides that are complementary or hybridize to at least about 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 50, 70, 100, or more of the nucleic acid sequences disclosed herein.
[0104] The genes that are assayed according to the presently disclosed subject matter are typically in the form of RNA (e.g., total RNA or mRNA) and/or reverse transcribed RNA (i.e., cDNA), including subsequences thereof. The genes can be cloned or not, and the genes can be amplified or not. In some embodiments, poly A.sup.+ RNA is employed as a source.
[0105] Probes based on the sequences of the genes described herein can be prepared by any commonly available method. Oligonucleotide probes for assaying the tissue or cell sample are in some embodiments of sufficient length to specifically hybridize only to appropriate complementary genes or transcripts. Typically, the oligonucleotide probes are at least 10, 12, 14, 16, 18, 20, or 25 nucleotides in length. In some embodiments, longer probes of at least 30, 40, 50, or 60 nucleotides are employed.
[0106] As used herein, oligonucleotide sequences that are complementary to one or more of the genes described herein are oligonucleotides that are capable of hybridizing under stringent conditions to at least part of the nucleotide sequence of said genes. Such hybridizable oligonucleotides will typically exhibit in some embodiments at least about 75% sequence identity, in some embodiments about 80% sequence identity, in some embodiments about 85% sequence identity, in some embodiments about 90% sequence identity, in some embodiments about 91% sequence identity, in some embodiments about 92% sequence identity, in some embodiments about 93% sequence identity, in some embodiments about 94% sequence identity, in some embodiments about 95% sequence identity, and in some embodiments greater than 95% sequence identity (e.g., 96%, 97%, 98%, 99%, or 100% sequence identity) at the nucleotide level to the nucleic acid sequences disclosed herein.
[0107] "Bind(s) substantially" refers to complementary hybridization between a probe nucleic acid and a target nucleic acid and embraces minor mismatches that can be accommodated by reducing the stringency of the hybridization media to achieve the desired detection of the target polynucleotide sequence.
[0108] The terms "background" or "background signal intensity" refer to hybridization signals resulting from non-specific binding, or other interactions, between the labeled target nucleic acids and components of the oligonucleotide array (e.g., the oligonucleotide probes, control probes, the array substrate, etc.). Background signals can also be produced by intrinsic fluorescence of the array components themselves. A single background signal can be calculated for the entire array, or a different background signal can be calculated for each target nucleic acid. In some embodiments, background is calculated as the average hybridization signal intensity for the lowest 5% to 10% of the probes in the array, or, where a different background signal is calculated for each target gene, for the lowest 5% to 10% of the probes for each gene. Of course, one of skill in the art will appreciate that where the probes to a particular gene hybridize well and thus appear to be specifically binding to a target sequence, they should not be used in a background signal calculation. Alternatively, background can be calculated as the average hybridization signal intensity produced by hybridization to probes that are not complementary to any sequence found in the sample (e.g., probes directed to nucleic acids of the opposite sense or to genes not found in the sample such as bacterial genes where the sample is mammalian nucleic acids). Background can also be calculated as the average signal intensity produced by regions of the array that lack probes.
[0109] Assays, methods, and systems of the presently disclosed subject matter can utilize available formats to simultaneously screen in some embodiments at least about 10, in some embodiments at least about 50, in some embodiments at least about 100, in some embodiments at least about 1000, in some embodiments at least about 10,000, and in some embodiments at least about 40,000 or more different nucleic acid hybridizations.
[0110] As used herein, a "probe" is defined as a nucleic acid that is capable of binding to a target nucleic acid of complementary sequence through one or more types of chemical bonds, usually through complementary base pairing, usually through hydrogen bond formation. As used herein, a probe can include natural (i.e., A, G, U, C, or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in probes can be joined by a linkage other than a phosphodiester bond, so long as it does not interfere with hybridization. Thus, probes can be peptide nucleic acids in which the constituent bases are joined by peptide bonds rather than phosphodiester linkages.
[0111] The terms "mismatch control" and "mismatch probe" refer to a probe comprising a sequence that is deliberately selected not to be perfectly complementary to a particular target sequence. For each mismatch (MM) control in a high-density array there typically exists a corresponding perfect match (PM) probe that is perfectly complementary to the same particular target sequence. The mismatch can comprise one or more bases.
[0112] While the mismatch(s) can be located anywhere in the mismatch probe, terminal mismatches are less desirable as a terminal mismatch is less likely to prevent hybridization of the target sequence. In some embodiments, the mismatch is located at or near the center of the probe such that the mismatch is most likely to destabilize the duplex with the target sequence under the test hybridization conditions.
[0113] The phrase "perfect match probe" refers to a probe that has a sequence that is perfectly complementary to a particular target sequence. The test probe is typically perfectly complementary to a portion (subsequence) of the target sequence. The perfect match (PM) probe can be a "test probe," a "normalization control" probe, an expression level control probe, or the like. A perfect match control or perfect match probe is, however, distinguished from a "mismatch control" or "mismatch probe."
[0114] V.B. Probe Design
[0115] Upon review of the present disclosure, one of skill in the art will appreciate that an enormous number of array designs are suitable for the practice of the presently disclosed subject matter. The high-density array typically includes a number of probes that specifically hybridize to the sequences of interest. See PCT International Patent Application Publication WO 1999/032660, incorporated herein by reference in its entirety, for methods of producing probes for a given gene or genes. In addition, in some embodiments, the array includes one or more control probes.
[0116] High-density array chips of the presently disclosed subject matter include in some embodiments "test probes." Test probes can be oligonucleotides that in some embodiments range from about 5 to about 500 or about 5 to about 50 nucleotides, in some embodiments from about 10 to about 40 nucleotides, and in some embodiments from about 15 to about 40 nucleotides in length. In some embodiments, the probes are about 20 to 25 nucleotides in length. In some embodiments, test probes are double or single strand DNA sequences. DNA sequences are isolated or cloned from natural sources and/or amplified from natural sources using natural nucleic acid as templates. These probes have sequences complementary to particular subsequences of the genes the expression of which they are designed to detect. Thus, the test probes are capable of specifically hybridizing to the target nucleic acid they are to detect.
[0117] In addition to test probes that bind the target nucleic acid(s) of interest, the high-density array can contain a number of control probes. The control probes fall into three categories referred to herein as (1) normalization controls; (2) expression level controls; and (3) mismatch controls.
[0118] Normalization controls are oligonucleotide or other nucleic acid probes that are complementary to labeled reference oligonucleotides or other nucleic acid sequences that are added to the nucleic acid sample. The signals obtained from the normalization controls after hybridization provide a control for variations in hybridization conditions, label intensity, "reading" efficiency and other factors that can cause the signal of a perfect hybridization to vary between arrays. In some embodiments, signals (e.g., fluorescence intensity) read from some or all other probes in the array are divided by the signal (e.g., fluorescence intensity) from the control probes, thereby normalizing the measurements.
[0119] Virtually any probe can serve as a normalization control. However, it is recognized that hybridization efficiency varies with base composition and probe length. Exemplary normalization probes can be selected to reflect the average length of the other probes present in the array; however, they can be selected to cover a range of lengths. The normalization control(s) can also be selected to reflect the (average) base composition of the other probes in the array; however, in some embodiments, only one or a few probes are used and they are selected such that they hybridize well (i.e., no secondary structure) and do not match any target-specific probes.
[0120] Expression level controls are probes that hybridize specifically with constitutively expressed genes in the biological sample. Virtually any constitutively expressed gene provides a suitable target for expression level controls. Typical expression level control probes have sequences complementary to subsequences of constitutively expressed "housekeeping genes" including, but not limited to, the (3-actin gene, the transferrin receptor gene, the GAPDH gene, and the like. Exemplary human housekeeping genes are disclosed in Eisenberg & Levanon, 2003. It is noted that certain of the genes listed in Eisenberg & Levanon, 2003 are also listed in one or more of Tables 2-5. In some embodiments, a gene that appears in Eisenberg & Levanon, 2003 and also in one or more of Tables 2-5 is not selected for use as an expression level control.
[0121] Mismatch controls can also be provided for the probes to the target genes, for expression level controls or for normalization controls. Mismatch controls are oligonucleotide probes or other nucleic acid probes identical to their corresponding test or control probes except for the presence of one or more mismatched bases. A mismatched base is a base selected so that it is not complementary to the corresponding base in the target sequence to which the probe would otherwise specifically hybridize. One or more mismatches are selected such that under appropriate hybridization conditions (e.g., stringent conditions) the test or control probe would be expected to hybridize with its target sequence, but the mismatch probe would not hybridize (or would hybridize to a significantly lesser extent). In some embodiments, mismatch probes contain one or more central mismatches. Thus, for example, where a probe is a 20-mer, a corresponding mismatch probe will have the identical sequence except for a single base mismatch (e.g., substituting a G, a C, or a T for an A) at any of positions 6 through 14 (the central mismatch).
[0122] Mismatch probes thus provide a control for non-specific binding or cross hybridization to a nucleic acid in the sample other than the target to which the probe is directed. Mismatch probes also indicate whether a given hybridization is specific or not. For example, if the target is present the perfect match probes should be consistently brighter than the mismatch probes. In addition, if all central mismatches are present, the mismatch probes can be used to detect a mutation. The difference in intensity between the perfect match and the mismatch probe (IBM)-I(MM)) provides a good measure of the concentration of the hybridized material.
[0123] V.C. Nucleic Acid Samples
[0124] A biological sample that can be analyzed in accordance with the presently disclosed subject matter comprises in some embodiments a nucleic acid. The terms "nucleic acid," "nucleic acids," and "nucleic acid molecules" each refer in some embodiments to deoxyribonucleotides, ribonucleotides, and polymers and folded structures thereof in either single- or double-stranded form. Nucleic acids can be derived from any source, including any organism. Deoxyribonucleic acids can comprise genomic DNA, cDNA derived from ribonucleic acid, DNA from an organelle (e.g., mitochondrial DNA or chloroplast DNA), or combinations thereof. Ribonucleic acids can comprise genomic RNA (e.g., viral genomic RNA), messenger RNA (mRNA), ribosomal RNA (rRNA), transfer RNA (tRNA), or combinations thereof.
[0125] V.C.1. Isolation of Nucleic Acid Samples
[0126] Nucleic acid samples used in the methods and assays of the presently disclosed subject matter can be prepared by any available method or process. Methods of isolating total mRNA are also known to those of skill in the art. For example, methods of isolation and purification of nucleic acids are described in detail in Chapter 3 of Tijssen, 1993. Such samples include RNA samples, but also include cDNA synthesized from an mRNA sample isolated from a cell or tissue of interest. Such samples also include DNA amplified from the cDNA, an RNA transcribed from the amplified DNA, and combinations thereof. One of skill in the art would appreciate that it can be desirable to inhibit or destroy RNase present in homogenates before homogenates are used as a source of RNA.
[0127] The presently disclosed subject matter encompasses use of a sufficiently large biological sample to enable a comprehensive survey of low abundance nucleic acids in the sample. Thus, the sample can optionally be concentrated prior to isolation of nucleic acids. Several protocols for concentration have been developed that alternatively use slide supports (Kohsaka & Carson, 1994; Millar et al., 1995), filtration columns (Bej et al., 1991), or immunomagnetic beads (Albert et al., 1992; Cousins et al., 1992). Such approaches can significantly increase the sensitivity of subsequent detection methods.
[0128] As one example, SEPHADEX.RTM. matrix (Sigma of St. Louis, Mo., United States of America) is a matrix of diatomaceous earth and glass suspended in a solution of chaotropic agents and has been used to bind nucleic acid material (Boom et al., 1990; Buffone et al., 1991). After the nucleic acid is bound to the solid support material, impurities and inhibitors are removed by washing and centrifugation, and the nucleic acid is then eluted into a standard buffer. Target capture also allows the target sample to be concentrated into a minimal volume, facilitating the automation and reproducibility of subsequent analyses (Lanciotti et al., 1992).
[0129] Methods for nucleic acid isolation can comprise simultaneous isolation of total nucleic acid, or separate and/or sequential isolation of individual nucleic acid types (e.g., genomic DNA, cDNA, organelle DNA, genomic RNA, mRNA, poly A.sup.+ RNA, rRNA, tRNA) followed by optional combination of multiple nucleic acid types into a single sample.
[0130] When RNA (e.g., mRNA) is selected for analysis, the disclosed methods allow for an assessment of gene expression in the tissue or cell type from which the RNA was isolated. RNA isolation methods are known to one of skill in the art. See Albert et al., 1992; Busch et al., 1992; Hamel et al., 1995; Herrewegh et al., 1995; Izraeli et al., 1991; McCaustland et al., 1991; Natarajan et al., 1994; Rupp et al., 1988; Tanaka et al., 1994; and Van Kerckhoven et al., 1994.
[0131] Simple and semi-automated extraction methods can also be used for nucleic acid isolation, including for example, the SPLIT SECOND.TM. system (Boehringer Mannheim of Indianapolis, Ind., United States of America), the TRIZOL.TM. Reagent system (Life Technologies of Gaithersburg, Md., United States of America), and the FASTPREP.TM. system (Bio 101 of La Jolla, Calif., United States of America). See also Smith 1998a; and Paladichuk 1999.
[0132] In some embodiments, nucleic acids that are used for subsequent amplification and labeling are analytically pure as determined by spectrophotometric measurements or by visual inspection following electrophoretic resolution. In some embodiments, the nucleic acid sample is free of contaminants such as polysaccharides, proteins, and inhibitors of enzyme reactions. When a biological sample comprises an RNA molecule that is intended for use in producing a probe, it is preferably free of DNase and RNase. Contaminants and inhibitors can be removed or substantially reduced using resins for DNA extraction (e.g., CHELEX.TM. 100 from Bio-Rad Laboratories of Hercules, Calif., United States of America) or by standard phenol extraction and ethanol precipitation.
[0133] V.C.2. Amplification of Nucleic Acid Samples
[0134] In some embodiments, a nucleic acid isolated from a biological sample is amplified prior to being used in the methods disclosed herein. In some embodiments, the nucleic acid is an RNA molecule, which is converted to a complementary DNA (cDNA) prior to amplification. Techniques for the isolation of RNA molecules and the production of cDNA molecules from the RNA molecules are known (see generally, Silhavy et al., 1984; Sambrook & Russell, 2001; Ausubel et al., 2002; and Ausubel et al., 2003). In some embodiments, the amplification of RNA molecules isolated from a biological sample is a quantitative amplification (e.g., by quantitative RT-PCR).
[0135] The terms "template nucleic acid" and "target nucleic acid" as used herein each refer to nucleic acids isolated from a biological sample as described herein above. The terms "template nucleic acid pool," "template pool," "target nucleic acid pool," and "target pool" each refer to an amplified sample of "template nucleic acid." Thus, a target pool comprises amplicons generated by performing an amplification reaction using the template nucleic acid. In some embodiments, a target pool is amplified using a random amplification procedure as described herein.
[0136] The term "target-specific primer" refers to a primer that hybridizes selectively and predictably to a target sequence, for example a subsequence of one of the six genes disclosed herein, in a target nucleic acid sample. A target-specific primer can be selected or synthesized to be complementary to known nucleotide sequences of target nucleic acids.
[0137] The term "random primer" refers to a primer having an arbitrary sequence. The nucleotide sequence of a random primer can be known, although such sequence is considered arbitrary in that it is not specifically designed for complementarity to a nucleotide sequence of the presently disclosed subject matter. The term "random primer" encompasses selection of an arbitrary sequence having increased probability to be efficiently utilized in an amplification reaction. For example, the Random Oligonucleotide Construction Kit (ROCK) is a macro-based program that facilitates the generation and analysis of random oligonucleotide primers (Strain & Chmielewski, 2001). Representative primers include but are not limited to random hexamers and rapid amplification of polymorphic DNA (RAPD)-type primers as described by Williams et al., 1990.
[0138] A random primer can also be degenerate or partially degenerate as described by Telenius et al., 1992. Briefly, degeneracy can be introduced by selection of alternate oligonucleotide sequences that can encode a same amino acid sequence.
[0139] In some embodiments, random primers can be prepared by shearing or digesting a portion of the template nucleic acid sample. Random primers so-constructed comprise a sample-specific set of random primers.
[0140] The term "heterologous primer" refers to a primer complementary to a sequence that has been introduced into the template nucleic acid pool. For example, a primer that is complementary to a linker or adaptor, as described below, is a heterologous primer. Representative heterologous primers can optionally include a poly(dT) primer, a poly(T) primer, or as appropriate, a poly(dA) or poly(A) primer.
[0141] The term "primer" as used herein refers to a contiguous sequence comprising in some embodiments about 6 or more nucleotides, in some embodiments about 10-20 nucleotides (e.g., 15-mer), and in some embodiments about 20-30 nucleotides (e.g., a 22-mer). Primers used to perform the methods of the presently disclosed subject matter encompass oligonucleotides of sufficient length and appropriate sequence so as to provide initiation of polymerization on a nucleic acid molecule.
[0142] U.S. Pat. No. 6,066,457 to Hampson et al. describes a method for substantially uniform amplification of a collection of single stranded nucleic acid molecules such as RNA. Briefly, the nucleic acid starting material is anchored and processed to produce a mixture of directional shorter random size DNA molecules suitable for amplification of the sample.
[0143] In accordance with the methods and systems of the presently disclosed subject matter, any PCR technique or related technique can be employed to perform the step of amplifying the nucleic acid sample. In addition, such methods can be optimized for amplification of a particular subset of nucleic acid (e.g., genomic DNA versus RNA), and representative optimization criteria and related guidance can be found in the art. See Cha & Thilly, 1993; Linz et al., 1990; Robertson & Walsh-Weller, 1998; Roux 1995; Williams 1989; and McPherson et al., 1995.
[0144] V.C.3. Labeling of Nucleic Acid Samples
[0145] Optionally, a nucleic acid sample (e.g., a quantitatively amplified RNA sample) further comprises a detectable label. In some embodiments of the presently disclosed subject matter, the amplified nucleic acids can be labeled prior to hybridization to an array. Alternatively, randomly amplified nucleic acids are hybridized with a set of probes, without prior labeling of the amplified nucleic acids. For example, an unlabeled nucleic acid in the biological sample can be detected by hybridization to a labeled probe. In some embodiments, both the randomly amplified nucleic acids and the one or more probes include a label, wherein the proximity of the labels following hybridization enables detection. An exemplary procedure using nucleic acids labeled with chromophores and fluorophores to generate detectable photonic structures is described in U.S. Pat. No. 6,162,603 to Heller.
[0146] In accordance with the methods and systems of the presently disclosed subject matter, the amplified nucleic acids and/or probes/probe sets can be labeled using any detectable label. It will be understood to one of skill in the art that any suitable method for labeling can be used, and no particular detectable label or technique for labeling should be construed as a limitation of the disclosed methods.
[0147] Direct labeling techniques include incorporation of radioisotopic or fluorescent nucleotide analogues into nucleic acids by enzymatic synthesis in the presence of labeled nucleotides or labeled PCR primers. A radio-isotopic label can be detected using autoradiography or phosphorimaging. A fluorescent label can be detected directly using emission and absorbance spectra that are appropriate for the particular label used. Any detectable fluorescent dye can be used, including but not limited to FITC (fluorescein isothiocyanate), FLUOR X.TM., ALEXA FLUOR.RTM. 488, OREGON GREEN.RTM. 488, 6-JOE (6-carboxy-4',5'-dichloro-2', 7'-dimethoxyfluorescein, succinimidyl ester), ALEXA FLUOR.RTM. 532, Cy3, ALEXA FLUOR.RTM. 546, TMR (tetramethylrhodamine), ALEXA FLUOR.RTM. 568, ROX (X-rhodamine), ALEXA FLUOR.RTM. 594, TEXAS RED.RTM., BODIPY.RTM. 630/650, and Cy5 (available from Amersham Pharmacia Biotech of Piscataway, N.J., United States of America or from Molecular Probes Inc. of Eugene, Oreg., United States of America). Fluorescent tags also include sulfonated cyanine dyes (available from Li-Cor, Inc. of Lincoln, Nebr., United States of America) that can be detected using infrared imaging. Methods for direct labeling of a heterogeneous nucleic acid sample are known in the art and representative protocols can be found in, for example, DeRisi et al., 1996; Sapolsky & Lipshutz, 1996; Schena et al., 1995; Schena et al., 1996; Shalon et al., 1996; Shoemaker et al., 1996; and Wang et al., 1989.
[0148] In some embodiments, nucleic acid molecules isolated from different cell types (e.g., primary versus metastatic PDAC) are labeled with different detectable markers, allowing the nucleic acids to be analyzed simultaneously on an array. For example, a first RNA sample can be reverse transcribed into cDNAs labeled with cyanine 3 (a green dye fluorophore; Cy3) while a second RNA sample to which the first RNA sample is to be compared can be labeled with cyanine 5 (a red dye fluorophore; Cy5).
[0149] The quality of probe or nucleic acid sample labeling can be approximated by determining the specific activity of label incorporation. For example, in the case of a fluorescent label, the specific activity of incorporation can be determined by the absorbance at 260 nm and 550 nm (for Cy3) or 650 nm (for Cy5) using published extinction coefficients (Randolph & Waggoner, 1995). Very high label incorporation (specific activities of >1 fluorescent molecule/20 nucleotides) can result in a decreased hybridization signal compared with probe with lower label incorporation. Very low specific activity (<1 fluorescent molecule/100 nucleotides) can give unacceptably low hybridization signals. See Worley et al., 2000. Thus, it will be understood to one of skill in the art that labeling methods can be optimized for performance in microarray hybridization assay, and that optimal labeling can be unique to each label type.
[0150] V.D. Forming High-Density Arrays
[0151] In some embodiments of the presently disclosed subject matter, probes or probe sets are immobilized on a solid support such that a position on the support identifies a particular probe or probe set. In the case of a probe set, constituent probes of the probe set can be combined prior to placement on the solid support or by serial placement of constituent probes at a same position on the solid support.
[0152] A microarray can be assembled using any suitable method known to one of skill in the art, and any one microarray configuration or method of construction is not considered to be a limitation of the presently disclosed subject matter. Representative microarray formats that can be used in accordance with the methods of the presently disclosed subject matter are described herein below and include, but are not limited to light-directed chemical coupling, and mechanically directed coupling (see U.S. Pat. No. 5,143,854 to Pirrung et al.; U.S. Pat. No. 5,800,992 to Fodor et al.; and U.S. Pat. No. 5,837,832 to Chee et al.).
[0153] V.D.1. Array Substrate and Configuration
[0154] The substrate for printing the array should be substantially rigid and amenable to DNA immobilization and detection methods (e.g., in the case of fluorescent detection, the substrate must have low background fluorescence in the region of the fluorescent dye excitation wavelengths). The substrate can be nonporous or porous as determined most suitable for a particular application. Representative substrates include but are not limited to a glass microscope slide, a glass coverslip, silicon, plastic, a polymer matrix, an agar gel, a polyacrylamide gel, and a membrane, such as a nylon, nitrocellulose or ANAPORE.TM. (Whatman of Maidstone, United Kingdom) membrane.
[0155] Porous substrates (membranes and polymer matrices) are preferred in that they permit immobilization of relatively large amount of probe molecules and provide a three-dimensional hydrophilic environment for biomolecular interactions to occur (Dubiley et al., 1997; Yershov et al., 1996). A BIOCHIP ARRAYER.TM. dispenser (Packard Instrument Company of Meriden, Conn., United States of America) can effectively dispense probes onto membranes such that the spot size is consistent among spots whether one, two, or four droplets were dispensed per spot (Englert, 2000).
[0156] A microarray substrate for use in accordance with the methods of the presently disclosed subject matter can have either a two-dimensional (planar) or a three-dimensional (non-planar) configuration. An exemplary three-dimensional microarray is the FLOW-THRU.TM. chip (Gene Logic, Inc. of Gaithersburg, Md., United States of America), which has implemented a gel pad to create a third dimension. Such a three-dimensional microarray can be constructed of any suitable substrate, including glass capillary, silicon, metal oxide filters, or porous polymers. See Yang et al., 1998.
[0157] Briefly, a FLOW-THRU.TM. chip (Gene Logic, Inc.) comprises a uniformly porous substrate having pores or microchannels connecting upper and lower faces of the chip. Probes are immobilized on the walls of the microchannels and a hybridization solution comprising sample nucleic acids can flow through the microchannels. This configuration increases the capacity for probe and target binding by providing additional surface relative to two-dimensional arrays. See U.S. Pat. No. 5,843,767 to Beattie.
[0158] V.D.2. Surface Chemistry
[0159] The particular surface chemistry employed is inherent in the microarray substrate and substrate preparation. Probe immobilization of nucleic acids probes post-synthesis can be accomplished by various approaches, including adsorption, entrapment, and covalent attachment. Typically, the binding technique is designed to not disrupt the activity of the probe.
[0160] For substantially permanent immobilization, covalent attachment is generally performed. Since few organic functional groups react with an activated silica surface, an intermediate layer is advisable for substantially permanent probe immobilization. Functionalized organosilanes can be used as such an intermediate layer on glass and silicon substrates (Liu & Hlady, 1996; Shriver-Lake 1998). A hetero-bifunctional cross-linker requires that the probe have a different chemistry than the surface, and is preferred to avoid linking reactive groups of the same type. A representative hetero-bifunctional cross-linker comprises gamma-maleimidobutyryloxy-succimide (GMBS) that can bind maleimide to a primary amine of a probe. Procedures for using such linkers are known to one of skill in the art and are summarized in Hermanson, 1990. A representative protocol for covalent attachment of DNA to silicon wafers is described by O'Donnell et al., 1997.
[0161] When using a glass substrate, the glass should be substantially free of debris and other deposits and have a substantially uniform coating. Pretreatment of slides to remove organic compounds that can be deposited during their manufacture can be accomplished, for example, by washing in hot nitric acid. Cleaned slides can then be coated with 3-aminopropyltrimethoxysilane using vapor-phase techniques. After silane deposition, slides are washed with deionized water to remove any silane that is not attached to the glass and to catalyze unreacted methoxy groups to cross-link to neighboring silane moieties on the slide. The uniformity of the coating can be assessed by known methods, for example electron spectroscopy for chemical analysis (ESCA) or ellipsometry (Ratner & Castner, 1997; Schena et al., 1995). See also Worley et al., 2000.
[0162] For attachment of probes greater than about 300 base pairs, noncovalent binding is suitable. A representative technique for noncovalent linkage involves use of sodium isothiocyanate (NaSCN) in the spotting solution. When using this method, amino-silanized slides are typically employed because this coating improves nucleic acid binding when compared to bare glass. This method works well for spotting applications that use about 100 ng/.mu.l (Worley et al., 2000).
[0163] In the case of nitrocellulose or nylon membranes, the chemistry of nucleic acid binding chemistry to these membranes has been well characterized (Southern, 1975; Sambrook & Russell, 2001).
[0164] V.D.3. Arraying Techniques
[0165] A microarray for the analysis of gene expression in a biological sample can be constructed using any one of several methods available in the art, including but not limited to photolithographic and microfluidic methods, further described herein below. In some embodiments, the method of construction is flexible, such that a microarray can be tailored for a particular purpose.
[0166] Exemplary arraying techniques include, but are not limited to light-directed synthesis (Fodor et al., 1991; Fodor et al., 1993), commercialized by Affymetrix of Santa Clara, Calif., United States of America; Digital Optical Chemistry (PCT International Patent Application Publication No. WO 1999/063385; Warrington et al., 2000); Contact Printing (Maier et al., 1994; Mace et al., 2000; Rose, 2000); Noncontact Ink-Jet Printing U.S. Pat. No. 5,965,352 to Stoughton & Friend; see also Theriault et al., 1999); Syringe-Solenoid Printing (U.S. Pat. Nos. 5,743,960 and 5,916,524, both to Tisone); Electronic Addressing (U.S. Pat. No. 6,225,059 to Ackley et al. and PCT International Patent Application Publication No. WO 2001/023082); and Nanoelectrode Synthesis (U.S. Pat. No. 6,123,819 to Peeters).
[0167] In addition to the foregoing, other methods that can be used to generate an array of oligonucleotides on a single substrate are described in PCT International Patent Application Publication WO 1993/009668. High-density nucleic acid arrays can also be fabricated by depositing pre-made and/or natural nucleic acids in predetermined positions. Synthesized or natural nucleic acids are deposited on specific locations of a substrate by light directed targeting and oligonucleotide directed targeting. A dispenser that moves from region to region to deposit nucleic acids in specific spots can also be employed.
[0168] V.E. Hybridization
[0169] V.E.1. General Considerations
[0170] The terms "specifically hybridizes" and "selectively hybridizes" each refer to binding, duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions when that sequence is present in a complex nucleic acid mixture (e.g., total cellular DNA or RNA).
[0171] The phrase "substantially hybridizes" refers to complementary hybridization between a probe nucleic acid molecule and a substantially identical target nucleic acid molecule as defined herein. Substantial hybridization is generally permitted by reducing the stringency of the hybridization conditions using art-recognized techniques.
[0172] "Stringent hybridization conditions" and "stringent hybridization wash conditions" in the context of nucleic acid hybridization experiments are both sequence- and environment-dependent. Longer sequences hybridize specifically at higher temperatures. Generally, highly stringent hybridization and wash conditions are selected to be about 5.degree. C. lower than the thermal melting point (T.sub.m) for the specific sequence at a defined ionic strength and pH. The T.sub.m is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected to be equal to the T.sub.m for a particular probe. Typically, under "stringent conditions" a probe hybridizes specifically to its target sequence, but to no other sequences.
[0173] An extensive guide to the hybridization of nucleic acids is found in Tijssen, 1993. In general, a signal to noise ratio of 2-fold (or higher) than that observed for a negative control probe in a same hybridization assay indicates detection of specific or substantial hybridization.
[0174] V.E.2. Hybridization on a Solid Support
[0175] In some embodiments of the presently disclosed subject matter, an amplified and/or labeled nucleic acid sample is hybridized to specific probes or probe sets that are immobilized on a continuous solid support comprising a plurality of identifying positions. Representative formats of such solid supports are described herein.
[0176] Examples of hybridization and wash conditions that can be employed are known to those of skill in the art (see Sambrook & Russell, 2001; Ausubel et al., 2002; and Ausubel et al., 2003; each of which is incorporated herein in its entirety).
[0177] For some high-density glass-based microarray experiments, hybridization at 65.degree. C. is too stringent for typical use, at least in part because the presence of fluorescent labels destabilizes the nucleic acid duplexes (Randolph & Waggoner, 1995). Alternatively, hybridization can be performed in a formamide-based hybridization buffer as described in Pietu et al., 1996.
[0178] A microarray format can be selected for use based on its suitability for electrochemical-enhanced hybridization. Provision of an electric current to the microarray, or to one or more discrete positions on the microarray facilitates localization of a target nucleic acid sample near probes immobilized on the microarray surface. Concentration of target nucleic acid near arrayed probe accelerates hybridization of a nucleic acid of the sample to a probe. Further, electronic stringency control allows the removal of unbound and nonspecifically bound DNA after hybridization. See U.S. Pat. No. 6,017,696 to Heller and U.S. Pat. No. 6,245,508 to Heller & Sosnowski.
[0179] V.E.3. Hybridization in Solution
[0180] In some embodiments of the presently disclosed subject matter, an amplified and/or labeled nucleic acid sample is hybridized to one or more probes in solution. Exemplary hybridization conditions are also disclosed in Sambrook & Russell, 2001; Ausubel et al., 2002; and Ausubel et al., 2003.
[0181] Alternate capture techniques can be used as will be understood to one of skill in the art, for example, purification by a metal affinity column when using probes comprising a histidine tag. As another example, the hybridized sample can be hydrolyzed by alkaline treatment wherein the double-stranded hybrids are protected while non-hybridizing single-stranded template and excess probe are hydrolyzed. The hybrids are then collected using any nucleic acid purification technique for further analysis.
[0182] To assess the expression of multiple genes and/or samples from multiple different sources simultaneously, probes or probe sets can be distinguished by differential labeling of probes or probe sets. Alternatively, probes or probe sets can be spatially separated in different hybridization vessels.
[0183] In some embodiments, a probe or probe set having a unique label is prepared for each gene or source to be detected. For example, a first probe or probe set can be labeled with a first fluorescent label, and a second probe or probe set can be labeled with a second fluorescent label. Multi-labeling experiments should consider label characteristics and detection techniques to optimize detection of each label. Representative first and second fluorescent labels are Cy3 and Cy5 (Amersham Pharmacia Biotech of Piscataway, N.J., United States of America), which can be analyzed with good contrast and minimal signal leakage.
[0184] A unique label for each probe or probe set can further comprise a labeled microsphere to which a probe or probe set is attached. A representative system is LabMAP (Luminex Corporation of Austin, Tex., United States of America). Briefly, LabMAP (Laboratory Multiple Analyte Profiling) technology involves performing molecular reactions, including hybridization reactions, on the surface of color-coded microscopic beads called microspheres. When used in accordance with the methods of the presently disclosed subject matter, an individual probe or probe set is attached to beads having a single color-code such that they can be identified throughout the assay. Successful hybridization is measured using a detectable label of the amplified nucleic acid sample, wherein the detectable label can be distinguished from each color-code used to identify individual microspheres. Following hybridization of the randomly amplified, labeled nucleic acid sample with a set of microspheres comprising probe sets, the hybridization mixture is analyzed to detect the signal of the color-code as well as the label of a sample nucleic acid bound to the microsphere. See Vignali 2000; Smith et al., 1998b; and PCT International Patent Application Publication Nos. WO 2001/013120; WO 2001/014589; WO 1999/019515; WO 1999/032660; and WO 1997/014028.
[0185] V.F. Detection
[0186] Methods and systems for detecting hybridization are typically selected according to the label employed.
[0187] In the case of a radioactive label (e.g., .sup.32P-dNTP) detection can be accomplished by autoradiography or by using a phosphorimager as is known to one of skill in the art. In some embodiments, a detection method can be automated and is adapted for simultaneous detection of numerous samples.
[0188] Common research equipment has been developed to perform high-throughput fluorescence detecting, including instruments from GSI Lumonics (Watertown, Mass., United States of America), Amersham Pharmacia Biotech/Molecular Dynamics (Sunnyvale, Calif., United States of America), Applied Precision Inc. (Issauah, Wash., United States of America), Genomic Solutions Inc. (Ann Arbor, Mich., United States of America), Genetic MicroSystems Inc. (Woburn, Mass., United States of America), Axon (Foster City, Calif., United States of America), Hewlett Packard (Palo Alto, Calif., United States of America), and Virtek (Woburn, Mass., United States of America). Most of the commercial systems use some form of scanning technology with photomultiplier tube detection. Criteria for consideration when analyzing fluorescent samples are summarized by Alexay et al., 1996.
[0189] In some embodiments, a nucleic acid sample or probe is labeled with far infrared, near infrared, or infrared fluorescent dyes. Following hybridization, the mixture of nucleic acids and probes is scanned photoelectrically with a laser diode and a sensor, wherein the laser scans with scanning light at a wavelength within the absorbance spectrum of the fluorescent label, and light is sensed at the emission wavelength of the label. See U.S. Pat. No. 6,086,737 to Patonay et al.; U.S. Pat. No. 5,571,388 to Patonay et al.; U.S. Pat. No. 5,346,603 to Middendorf & Brumbaugh; U.S. Pat. No. 5,534,125 to Middendorf et al.; U.S. Pat. No. 5,360,523 to Middendorf et al.; U.S. Pat. No. 5,230,781 to Middendorf & Patonay; U.S. Pat. No. 5,207,880 to Middendorf & Brumbaugh; and U.S. Pat. No. 4,729,947 to Middendorf & Brumbaugh. An ODYSSEY.TM. infrared imaging system (Li-Cor, Inc. of Lincoln, Nebr., United States of America) can be used for data collection and analysis.
[0190] If an epitope label has been used, a protein or compound that binds the epitope can be used to detect the epitope. For example, an enzyme-linked protein can be subsequently detected by development of a colorimetric or luminescent reaction product that is measurable using a spectrophotometer or luminometer, respectively.
[0191] In some embodiments, INVADER.RTM. technology (Third Wave Technologies of Madison, Wis., United States of America) is used to detect target nucleic acid/probe complexes. Briefly, a nucleic acid cleavage site (such as that recognized by a variety of enzymes having 5' nuclease activity) is created on a target sequence, and the target sequence is cleaved in a site-specific manner, thereby indicating the presence of specific nucleic acid sequences or specific variations thereof. See U.S. Pat. No. 5,846,717 to Brow et al.; U.S. Pat. No. 5,985,557 to Prudent et al.; U.S. Pat. No. 5,994,069 to Hall et al.; U.S. Pat. No. 6,001,567 to Brow et al.; and U.S. Pat. No. 6,090,543 to Prudent et al.
[0192] In some embodiments, target nucleic acid/probe complexes are detected using an amplifying molecule, for example a poly-dA oligonucleotide as described by Lisle et al., 2001. Briefly, a tethered probe is employed against a target nucleic acid having a complementary nucleotide sequence. A target nucleic acid having a poly-dT sequence, which can be added to any nucleic acid sequence using methods known to one of skill in the art, hybridizes with an amplifying molecule comprising a poly-dA oligonucleotide. Short oligo-dT.sub.40 signaling moieties are labeled with any suitable label (e.g., fluorescent, chemiluminescent, radioisotopic labels). The short oligo-dT.sub.40 signaling moieties are subsequently hybridized along the molecule, and the label is detected.
[0193] The presently disclosed subject matter also envisions use of electrochemical technology for detecting a nucleic acid hybrid according to the disclosed method. In this case, the detection method relies on the inherent properties of DNA, and thus a detectable label on the target sample or the probe/probe set is not required. In some embodiments, probe-coupled electrodes are multiplexed to simultaneously detect multiple genes using any suitable microarray or multiplexed liquid hybridization format. To enable detection, gene-specific and control probes are synthesized with substitution of the non-physiological nucleic acid base inosine for guanine, and subsequently coupled to an electrode. Following hybridization of a nucleic acid sample with probe-coupled electrodes, a soluble redox-active mediator (e.g., ruthenium 2,2'-bipyridine) is added, and a potential is applied to the sample. In the absence of guanine, each mediator is oxidized only once. However, when a guanine-containing nucleic acid is present, by virtue of hybridization of a sample nucleic acid molecule to the probe, a catalytic cycle is created that results in the oxidation of guanine and a measurable current enhancement. See U.S. Pat. No. 6,127,127 to Eckhardt et al.; U.S. Pat. No. 5,968,745 to Thorp et al.; and U.S. Pat. No. 5,871,918 to Thorp et al.
[0194] Surface plasmon resonance spectroscopy can also be used to detect hybridization. See e.g., Heaton et al., 2001; Nelson et al., 2001; and Guedon et al., 2000.
[0195] V.G. Data Analysis
[0196] Databases and software designed for use with microarrays is discussed in U.S. Pat. No. 6,229,911 to Balaban & Aggarwal, a computer-implemented method for managing information, stored as indexed tables, collected from small or large numbers of microarrays, and U.S. Pat. No. 6,185,561 to Balaban & Khurgin, a computer-based method with data mining capability for collecting gene expression level data, adding additional attributes and reformatting the data to produce answers to various queries. U.S. Pat. No. 5,974,164 to Chee, disclose a software-based method for identifying mutations in a nucleic acid sequence based on differences in probe fluorescence intensities between wild type and mutant sequences that hybridize to reference sequences.
[0197] Analysis of microarray data can also be performed using the method disclosed in Tusher et al., 2001, which describes the Significance Analysis of Microarrays (SAM) method for determining significant differences in gene expression among two or more samples.
VI. Devices, Systems, and Compositions for Use in the Presently Disclosed Methods
[0198] The presently disclosed subject matter also provides devices, systems, and compositions that can be employed in the practice of the methods disclosed herein.
[0199] The methods and systems disclosed herein relate in some embodiments to generating gene expression profiles from biological samples that comprise PDAC cells obtained from a subject. The gene expression profiles are then in some embodiments compared to standards such as, but not limited to gene expression profiles of metastatic PDAC cells and/or primary (i.e., non-metastatic) PDAC cells.
[0200] As such, the presently disclosed methods can employ various techniques to generate the gene expression profiles required for the comparisons. See e.g., PCT International Patent Application Publication Nos. WO 2004/046098; WO 2004/110244; WO 2006/089268; WO 2007/001324; WO 2007/056332; WO 2007/070252, each of which is incorporated herein by reference in its entirety.
[0201] Generally, a gene expression profile can be generated using the following basic steps:
[0202] (1) a biological sample such as, but not limited to a PDAC biopsy or resected PDAC cells are obtained; and
[0203] (2) the expression levels of one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 25, 50, 100, or all) of the genes listed in Tables 2-5 are determined.
[0204] As is known to one of ordinary skill in the art, gene expression levels can be assayed either at the level of RNA or at the level of protein. As such, in some embodiments RNA is extracted from the biological sample and analyzed by techniques that include, but are not limited to PCR analysis (in some embodiments, quantitative reverse transcription PCR) and/or array analysis. In each case, one of ordinary skill in the art would be aware of techniques that can be employed to determine the expression level of a gene product in the biological sample.
[0205] With respect to PCR analyses, the sequences of nucleic acids that correspond to one or more of the genes listed in Tables 2-5 are present within the GENBANK.RTM. biosequence database, and oligonucleotide primers can be designed for the purpose of determining expression levels.
[0206] Alternatively, arrays can be produced that include single-stranded nucleic acids that can hybridize to nucleic acids derived from one or more of the genes listed in Tables 2-5. Exemplary, non-limiting methods that can be used to produce and screen arrays are described herein above.
[0207] Therefore, in some embodiments the presently disclosed subject matter provides arrays comprising polynucleotides that are capable of hybridizing to one or more up to all of the genes listed in Tables 2-5 and/or comprising specific peptide or polypeptide gene products of one or more up to all of the genes listed in Tables 2-5.
[0208] Alternatively or in addition, gene expression can be assayed by determining the levels at which polypeptides are present in PDAC tissue. This can also be done using arrays, and exemplary methods for producing peptide and/or polypeptide arrays attached to nitrocellulose-coated glass slides (Espejo et al., 2002), alkanethiol-coated gold surfaces (Houseman et al., 2002), poly-L-lysine-treated glass slides (Haab et al., 2001), aldehyde-treated glass slides (MacBeath & Schreiber, 2000; Salisbury et al., 2002), silane-modified glass slides (Fang et al., 2002; Seong, 2002), and nickel-treated glass slides (Zhu et al., 2001), among others, have been reported.
[0209] In some embodiments, the presently disclosed subject matter provides arrays that comprise peptides or polypeptides that are correspond to one or more up to all of the genes listed in Tables 2-5. In these embodiments, arrays are produced from proteins isolated from PDAC tissue, and these arrays are then probed with molecules that specifically bind to the various gene products of interest, if present. Exemplary molecules that specifically bind to one or more up to all of the genes listed in Tables 2-5 include antibodies (as well as fragments and derivatives thereof that include at least one Fab fragment). Antibodies to many of the polypeptides that correspond to the genes listed in Tables 2-5 are commercially available, and antibodies that specifically bind to gene products that are not commercially available can be produced using routine techniques.
[0210] Peptide and/or polypeptide arrays can be designed quantitatively such that the amount of each individual peptide or polypeptide is reflective of the amount of that individual peptide or polypeptide in the PDAC tissue.
[0211] Further, the arrays can be designed such that specific peptide or polypeptide gene products that correspond to one or more of the genes listed in Tables 2-5 can be localized (sometimes referred to as "spotted") on the array such that the array can be interrogated with at least one antibody that specifically binds to one of the specific peptide or polypeptide gene products.
[0212] In some embodiments, gene expression at the level of protein is assayed without isolating the relevant peptides and/or polypeptides from the PDAC cells. For example, immunohistochemistry and/or immunocytochemistry can be employed, in which the expression levels of gene products that correspond to one or more of the genes listed in Tables 2-5 can be determined by incubating appropriate binding molecules to PDAC cells and/or tissue. In some embodiments, the PDAC cells and/or tissue is mounted in paraffin blocks before the immunohistochemistry and/or immunocytochemistry is performed.
[0213] As would be understood by one of ordinary skill in the art upon consideration of the present disclosure, many of the manipulations disclosed herein can be automated, and it is intended that such automation is encompassed by the presently disclosed subject matter.
EXAMPLES
[0214] The following Examples provide further illustrative embodiments. In light of the present disclosure and the general level of skill in the art, those of skill will appreciate that the following Example is intended to be exemplary only and that numerous changes, modifications, and alterations can be employed without departing from the scope of the presently disclosed subject matter.
TABLE-US-00002 TABLE 2 Exemplary Genes Associated with Activated Stroma Subtype and Exemplary Chemotherapeutics Applicable Thereto Gene symbol Possible drug(s) ANXA1 Hydrocortisone, hydrocortisone/prednisone, hydrocortisone/ mitoxantrone AOC3 Hydralazine, hydralazine/ hydrochlorothiazide/reserpine, hydralazine/hydrochlorothiazide, hydralazine/isosorbide dinitrate APP Bapineuzumab, florbetapir F18, florbetaben F ATP1A1 Digoxin, trichloromethiazide, ciclopirox olamine, ethacrynic acid, reserpine/trichloromethiazide, bretylium, perphenazine, ouabain, digitoxin AXL Cabozantinib, cabozantinib/erlotinib BDKRB2 Anatibant, icatibant C1S SERPING1 CCR5 Maraviroc, vicriviroc, ancriviroc CD52 Alemtuzumab, alemtuzumab/cyclosporin A, alemtuzumab/cyclophosphamide/ fludarabine phosphate/rituximab, alemtuzumab/fludarabine phosphate, alemtuzumab/rituximab, alemtuzumab/cyclophosphamide/ fludarabine phosphate/mitoxantrone, alemtuzumab/pentostatin, alemtuzumab/bendamustine CFTR Crofelemer, ivacaftor COL10A1 Collagenase clostridium histolyticum COL11A1 Collagenase clostridium histolyticum COL12A1 Collagenase clostridium histolyticum COL16A1 Collagenase clostridium histolyticum COL1A1 Collagenase clostridium histolyticum COL1A2 Collagenase clostridium histolyticum COL3A1 Collagenase clostridium histolyticum COL4A2 Collagenase clostridium histolyticum COL5A1 Collagenase clostridium histolyticum COL5A2 Collagenase clostridium histolyticum COL8A1 Collagenase clostridium histolyticum COL8A2 Collagenase clostridium histolyticum CSF1R Nilotinib, sunitinib, pazopanib CXCR4 Cladribine/cytarabine/filgrastim/idarubicin/ plerixafor, plerixafor EDNRA Bosentan, avosentan, clazosentan, ambrisentan, sitaxsentan, zibotentan, SB 234551, TBC 3214, BSF 302146, macitentan, fandosentan, atrasentan EPCAM Tucotuzumab celmoleukin, catumaxomab, adecatumumab ERBB2 Trastuzumab, BMS-599626, varlitinib, XL647, CP-724,714, afatinib, pertuzumab, sapitinib, trastuzumab emtansine, lapatinib/pazopanib, lapatinib/letrozole, paclitaxel/trastuzumab, capecitabine/lapatinib, cyclophosphamide/docetaxel/epirubicin/5-fluorouracil/trastuzumab, docetaxel/trastuzumab, paclitaxel/pertuzumab/trastuzumab, trastuzumab/vinorelbine, capecitabine/trastuzumab, lapatinib/paclitaxel, pertuzumab/trastuzumab, lapatinib/trastuzumab, neratinib, lap atinib, erlotinib F2R Chrysalin, argatroban, bivalirudin FCGR1B IgG FCGR2A IgG FN1 Ocriplasmin FYN Dasatinib GABRP Alphadolone, nitrazepam, adinazolam, sevoflurane, isoflurane, isoniazid, felbamate, etomidate, halothane, fluoxetine/olanzapine, estazolam, eszopiclone, quazepam, diazepam, temazepam, zolpidem, lorazepam, olanzapine, triazolam, flurazepam, midazolam, oxazepam, zaleplon, secobarbital, phenobarbital, pentobarbital, desflurane, methoxyflurane, enflurane HLA- Apolizumab DRB1 IL1R1 Anakinra ITGAV Abciximab, CNTO 95, cilengitide ITGB5 Cilengitide KCNJ8 Gliquidone, thiamylal KCNN4 Betamethasone/clotrimazole, clotrimazole, senicapoc KCNQ1 Dextromethorphan/quinidine, indapamide, quinidine KIT Dasatinib, sunitinib, pazopanib, tivozanib, motesanib, OSI-930, telatinib, tandutinib, cabozantinib, regorafenib, ponatinib, bortezomib/sorafenib, lapatinib/pazopanib, dexamethasone/lenalidomide/sorafenib, bevacizumab/sorafenib, imatinib/sirolimus, cabozantinib/erlotinib, imatinib, sorafenib MET Crizotinib, tivantinib, cabozantinib, INC280, cabozantinib/erlotinib MMP11 Marimastat MMP7 Marimastat MUC1 HuHMFG1 NNMT Atorvastatin/niacin, nicotinic acid/pioglitazone, nicotinic acid, lovastatin/niacin PDGFRA Sunitinib, pazopanib, axitinib, telatinib, regorafenib, lapatinib/pazopanib, imatinib/sirolimus, imatinib, becaplermin PDGFRB Nilotinib, dasatinib, sunitinib, pazopanib, axitinib, tivozanib, tandutinib, regorafenib, bortezomib/sorafenib, lapatinib/pazopanib, dexamethasone/lenalidomide/sorafenib, bevacizumab/sorafenib, imatinib/sirolimus, imatinib, sorafenib, becaplermin PLA2G7 Darapladib PLAT 6-aminocaproic acid PTGER2 Misoprostol, prostaglandin E2, prostaglandin E1, CP 533536, diclofenac/misoprostol RAMP1 Pramlintide SLC12A2 Bumetanide, quinethazone TEK Cabozantinib, regorafenib, ponatinib, cabozantinib/erlotinib, vandetanib TLR4 Resatorvid TLR7 UC-1V150, 5-fluorouracil/imiquimod, resiquimod, hydroxychloroquine, imiquimod
TABLE-US-00003 TABLE 3 Exemplary Genes Associated with Normal Stroma Subtype and Exemplary Chemotherapeutics Applicable Thereto Gene symbol Possible Drug(s) ACE2 Hydrochlorothiazide/lisinopril, hydrochlorothiazide/moexipril, moexipril, lisinopril ADH1A Caffeine/ethanol, 4-methylpyrazole (Fomepizole), ethanol ADH1C Caffeine/ethanol, 4-methylpyrazole (Fomepizole), ethanol ADRB2 Articaine/epinephrine, bupivacaine/epinephrine, carteolol, dipivefrin, meluadrine, epinephrine/prilocaine, epinephrine/lidocaine, bedoradrine, KUL 7211, arformoterol, indacaterol, myogane, budesonide/formoterol, nebivolol, vilanterol, olodaterol, formoterol/mometasone furoate, glycopyrrolate/indacaterol, fluticasone furoate/vilanterol, latanoprost/timolol, umeclidinium/vilanterol, fluticasone/salmeterol, albuterol/ipratropium, isoprenaline, carvedilol, ephedrine, guanethidine, levalbuterol, propranolol, pindolol, esmolol, metoprolol, alprenolol, salmeterol, dorzolamide/timolol, fluoxetine/olanzapine, guanadrel, bendroflumethiazide/nadolol, isoxsuprine, hydrochlorothiazide/propranolol, hydrochlorothiazide/timolol, isoproterenol, sotalol, bambuterol, nadolol, timolol, isoetharine, ritodrine, olanzapine, venlafaxine, labetalol, formoterol, bitolterol, albuterol, terbutaline, procaterol, pirbuterol, clenbuterol, fenoterol, norepinephrine, metaproterenol sulfate, epinephrine, dobutamine, droxidopa, arbutamine AGTR1 Amlodipine/olmesartan medoxomil, olmesartan, amlodipine/hydrochlorothiazide/valsartan, amlodipine/telmisartan, aliskiren/valsartan, azilsartan kamedoxomil, amlodipine/hydrochlorothiazide/olmesartan medoxomil, aspirin/dipyridamole/telmisartan, clopidogrel/telmisartan, amlodipine/valsartan, hydrochlorothiazide/losartan, hydrochlorothiazide/valsartan, candesartan, candesartan cilexetil, olmesartan medoxomil, irbesartan, losartan potassium, telmisartan, eprosartan, candesartan cilexetil/hydrochlorothiazide, hydrochlorothiazide/irbesartan, eprosartan/hydrochlorothiazide, hydrochlorothiazide/telmisartan, hydrochlorothiazide/olmesartan medoxomil, valsartan ANXA1 Hydrocortisone, hydrocortisone/prednisone, hydrocortisone/mitoxantrone AOC3 Hydralazine, hydralazine/hydrochloro-thiazide/ reserpine; hydralazine/ hydrochlorothiazide; hydralazine/ isosorbide dinitrate APP Bapineuzumab, florbetapir F18, florbetaben F ATP1A1 Digoxin, trichloromethiazide, ciclopirox olamine, ethacrynic acid, reserpine/trichloromethiazide, bretylium, perphenazine, ouabain, digitoxin ATP1A2 Digoxin, ethacrynic acid, perphenazine AXL Cabozantinib, cabozantinib/erlotinib BDKRB2 Anatibant, icatibant C1S Serpin peptidase inhibitor (SERPING1) CNR1 Trans-(A.+-.)-nabilone, SLV 319, rimonabant, BAY 38-7271, delta-8- tetrahydrocannabinol, delta-9-tetrahydrocannabinol CSF1R Nilotinib, sunitinib, pazopanib CXCR4 Cladribine/cytarabine/filgrastim/idarubicin/plerixafor, plerixafor ERBB2 Trastuzumab, BMS-599626, varlitinib, XL647, CP-724, 714, afatinib, pertuzumab, sapitinib, trastuzumab emtansine, lapatinib/pazopanib, lapatinib/letrozole, paclitaxel/trastuzumab, capecitabine/lapatinib, cyclophosphamide/docetaxel/epirubicin/5-fluorouracil/trastuzumab, docetaxel/trastuzumab, paclitaxel/pertuzumab/trastuzumab, trastuzumab/vinorelbine, capecitabine/trastuzumab, lapatinib/paclitaxel, pertuzumab/trastuzumab, lapatinib/trastuzumab, neratinib, lapatinib, erlotinib FYN Dasatinib GHR GH1, pegvisomant, somatrem HBB Iron dextran HLA-DRB1 Apolizumab IL1R1 Anakinra ITGAV Abciximab, CNTO 95, cilengitide ITGB5 Cilengitide KCNJ8 Gliquidone, thiamylal KCNK1 KCNMB4 Tedisamil KIT Dasatinib, sunitinib, pazopanib, tivozanib, motesanib, OSI-930, telatinib, tandutinib, cabozantinib, regorafenib, ponatinib, bortezomib/sorafenib, lapatinib/pazopanib, dexamethasone/lenalidomide/sorafenib, bevacizumab/sorafenib, imatinib/sirolimus, cabozantinib/erlotinib, imatinib, sorafenib LEPR Recombinant-methionyl human leptin LPL Atorvastatin/niacin, nicotinic acid/pioglitazone, nicotinic acid, tyloxapol, lovastatin/niacin PDGFRA Sunitinib, pazopanib, axitinib, telatinib, regorafenib, lapatinib/pazopanib, imatinib/sirolimus, imatinib, becaplermin PDGFRB Nilotinib, dasatinib, sunitinib, pazopanib, axitinib, tivozanib, tandutinib, regorafenib, bortezomib/sorafenib, lapatinib/pazopanib, dexamethasone/lenalidomide/sorafenib, bevacizumab/sorafenib, imatinib/sirolimus, imatinib, sorafenib, becaplermin PLA2G2A Varespladib methyl, varespladib, indomethacin RAMP1 Pramlinti de RAMP3 Pramlintide S1PR1 Fingolimod SCN7A Riluzole TEK Cabozantinib, regorafenib, ponatinib, cabozantinib/erlotinib, vandetanib
TABLE-US-00004 TABLE 4 Exemplary Genes Associated with Basal Subtype and Exemplary Chemotherapeutics Applicable Thereto Gene symbol Possible drug(s) ADORA2B Adenosine, enprofylline, dyphylline, aspirin/butalbital/caffeine, acetaminophen/caffeine/ dihydrocodeine, acetaminophen/ aspirin/caffeine, caffeine/ergotamine, aspirin/caffeine/propoxyphene, aspirin/butalbital/caffeine/codeine, aspirin/caffeine/dihydrocodeine, acetaminophen/butalbital/caffeine, aminophylline, aspirin/caffeine/ orphenadrine, acetaminophen/ butalbital/caffeine/codeine, theophylline, caffeine, acetaminophen/caffeine/chlorpheniramine/hydrocodone/phenylephrine ANXA1 Hydrocortisone, hydrocortisone/ prednisone, hydrocortisone/ mitoxantrone ATP1A1 Digoxin, trichloromethiazide, ciclopirox olamine, ethacrynic acid, reserpine/trichloromethiazide, bretylium, perphenazine, ouabain, digitoxin AXL Cabozantinib, cabozantinib/erlotinib BDKRB2 Anatibant, icatibant COL17A1 Collagenase clostridium histolyticum DDR1 Nilotinib EGFR Cetuximab, AEE 788, panitumumab, BMS-599626, varlitinib, XL647, bevacizumab/erlotinib, afatinib, sapitinib, cetuximab/irinotecan, lapatinib/pazopanib, irinotecan/panitumumab, erlotinib/vismodegib, erlotinib/gemcitabine, lapatinib/letrozole, capecitabine/lapatinib, bevacizumab/panitumumab, bevacizumab/cetuximab, capecitabine/erlotinib, lapatinib/paclitaxel, cabozantinib/erlotinib, lapatinib/trastuzumab, canertinib, gefitinib, neratinib, PD 153035, lapatinib, vandetanib, erlotinib EPCAM Tucotuzumab celmoleukin, catumaxomab, adecatumumab ERBB2 Trastuzumab, BMS-599626, varlitinib, XL647, CP-724,714, afatinib, pertuzumab, sapitinib, trastuzumab emtansine, lapatinib/pazopanib, lapatinib/letrozole, paclitaxel/trastuzumab, capecitabine/lapatinib, cyclophosphamide/docetaxel/epirubicin/5-fluorouracil/trastuzumab, docetaxel/trastuzumab, paclitaxel/pertuzumab/trastuzumab, trastuzumab/vinorelbine, capecitabine/trastuzumab, lapatinib/paclitaxel, pertuzumab/trastuzumab, lapatinib/trastuzumab, neratinib, lapatinib, erlotinib GABRP Alphadolone, nitrazepam, adinazolam, sevoflurane, isoflurane, isoniazid, felbamate, etomidate, halothane, fluoxetine/olanzapine, estazolam, eszopiclone, quazepam, diazepam, temazepam, zolpidem, lorazepam, olanzapine, triazolam, flurazepam, midazolam, oxazepam, zaleplon, secobarbital, phenobarbital, pentobarbital, desflurane, methoxyflurane, enflurane IFNAR1 Interferon alfacon-1, PEG-interferon alfa-2a, interferon beta-1a, recombinant interferon, PEG-interferon alfa-2a/telaprevir, pegintron/ribavirin, interferon alfa-n1, PEG-interferon alfa- 2a/ribavirin, IFNA2, hydroxyurea/recombinant interferon, interferon alfa-2b/ribavirin, pegintron, interferon beta-1b ITGAV Abciximab, CNTO 95, cilengitide ITGB5 Cilengitide KCNMB4 Tedisamil KCNN4 Betamethasone/clotrimazole, clotrimazole, senicapoc MET Crizotinib, tivantinib, cabozantinib, INC280, cabozantinib/erlotinib MMP7 Marimastat MST1R Crizotinib MUC1 HuHMFG1 NOXO1 Ecabet P2RY2 Suramin PLAT 6-aminocaproic acid PSCA AGS-1C4D4 PTK6 Vandetanib RAMP1 Pramlintide SCNN1A Hydrochlorothiazide/triamterene, amiloride, amiloride/ hydrochloro- thiazide, triamterene
TABLE-US-00005 TABLE 5 Exemplary Genes Associated with Classical Subtype and Exemplary Chemotherapeutic Applicable Thereto Gene symbol Possible drug(s) ACE2 Hydrochlorothiazide/lisinopril, hydrochlorothiazide/moexipril, moexipril, lisinopril ATP1A1 Digoxin, trichloromethiazide, ciclopirox olamine, ethacrynic acid, reserpine/trichloromethiazide, bretylium, perphenazine, ouabain, digitoxin BDKRB2 Anatibant, icatibant CFTR Crofelemer, ivacaftor CYP3A4 Cobicistat, cobicistat/elvitegravir/emtricitabine/ tenofovir disoproxil, ketoconazole CYP3A7 Cobicistat, cobicistat/elvitegravir/emtricitabine/ tenofovir disoproxil DDR1 Nilotinib EPCAM Tucotuzumab celmoleukin, catumaxomab, adecatumumab ERBB2 Trastuzumab, BMS-599626, varlitinib, XL647, CP-724,714, afatinib, pertuzumab, sapitinib, trastuzumab emtansine, lapatinib/pazopanib, lapatinib/letrozole, paclitaxel/trastuzumab, capecitabine/lapatinib, cyclophosphamide/docetaxel/epirubicin/5-fluorouracil/trastuzumab, docetaxel/trastuzumab, paclitaxel/pertuzumab/trastuzumab, trastuzumab/vinorelbine, capecitabine/trastuzumab, lapatinib/paclitaxel, pertuzumab/trastuzumab, lapatinib/trastuzumab, neratinib, lapatinib, erlotinib F5 Drotrecogin alfa, antithrombin alfa GABRP Alphadolone, nitrazepam, adinazolam, sevoflurane, isoflurane, isoniazid, felbamate, etomidate, halothane, fluoxetine/olanzapine, estazolam, eszopiclone, quazepam, diazepam, temazepam, zolpidem, lorazepam, olanzapine, triazolam, flurazepam, midazolam, oxazepam, zaleplon, secobarbital, phenobarbital, pentobarbital, desflurane, methoxyflurane, enflurane HLA- Apolizumab DRB1 ITGB5 Cilengitide KCNN4 Betamethasone/clotrimazole, clotrimazole, senicapoc KCNQ1 Dextromethorphan/quinidine, indapamide, quinidine MET Crizotinib, tivantinib, cabozantinib, INC280, cabozantinib/erlotinib MMP7 Marimastat MST1R Crizotinib MUC1 HuHMFG1 NOXO1 Ecabet P2RY2 Suramin PLA2G10 Varespladib methyl, varespladib PSCA AGS-1C4D4 RAMP1 Pramlintide SCNN1A Hydrochlorothiazide/triamterene, amiloride, amiloride/ hydrochlorothiazide, triamterene SLC12A2 Bumetanide, quinethazone
TABLE-US-00006 TABLE 6 Exemplary Kinases as Therapeutic Targets for Classical and Basal Subtype tumors Gene Name Description Subtype CDK1 Cyclin-dependent kinase 1 Basal CDK6 Cyclin-dependent kinase 6 Basal EPHA1 Ephrin type-A receptor 1 Basal EPHB2 Ephrin type-B receptor 2 Basal KAPCA cAMP-dependent protein kinase catalytic Classical subunit alpha KAPCB cAMP-dependent protein kinase catalytic Classical subunit beta KCC2D Calcium/calmodulin-dependent protein kinase Classical type II subunit delta KGP1 cGMP-dependent protein kinase 1 Classical LIMK1 LIM domain kinase 1 Basal PGFRB Platelet-derived growth factor receptor beta Classical RIPK2 Receptor-interacting serine/threonine-protein Basal kinase 2
[0215] By applying a computational approach to a large cohort of data, the presently disclosed subject matter overcame the low cellularity problem and generated new insights into the complex molecular composition of PDAC. The results disclosed herein and their prognostic values can thus provide decision support in a clinical setting for the choice and timing of treatment regimens.
[0216] Co-expression of stromal gene signatures was largely conserved across other large primary tumor datasets (The Cancer Genome Atlas Research Network, 2014a,b; Nones et al., 2014). Co-expression was particularly high in lung adenocarcinoma (The Cancer Genome Atlas Research Network, 2012b), which was previously shown to be low in purity (Carter et al., 2012) and high in stromal content (Yoshihara et al., 2013). Both expression and co-expression was low in primary acute myeloid leukemia (The Cancer Genome Atlas Research Network, 2013c), which lacks stroma.
Materials and Methods for Examples 1-8
[0217] Decomposition by factors and gene ranking. For all analyses in this manuscript, we used k=14 as the number of factors. Unsupervised NMF was performed on a gene-by-sample matrix X first with 20 randomly initialized instances of NMF using the MATLAB (MathWorks R2013a) multiplicative update NMF solver for 10 steps. The lowest-residual solution pair from these 20 instances was then used to seed NMF of X to convergence with the alternating least-squares solver. The result was a matrix of gene loadings, G, and a matrix of sample loadings, S. G and S were then scaled such that the mean of each column of G was 1 to facilitate cross-factor comparisons.
[0218] For each of the k factors, a set of distinct exemplar genes for the i.sup.th factor was established by ranking genes in descending order of the difference between the loading value in the i.sup.th column of matrix G and the largest loading value not in the i.sup.th column of matrix G.
[0219] 200 iterations of 5-fold resampling, i.e. training on a partition of approximately 80% of the samples, were performed to achieve stable NMF results. For each of these 200 data partitions, unsupervised NMF was performed, and the genes which appeared ranked in the top 50 of any factor together were recorded in a gene by gene consensus matrix. This gene factor-co-occurrence-consensus matrix was then used as the basis of a hierarchical clustering operation using correlation as a distance metric and an appropriate cutoff as to yield k gene clusters. These k gene-clusters were used to create a seed matrix, G.sub.0 such that the i.sup.th column of G.sub.0 contained 0.01 for all genes except those in gene cluster i, which were set to 1. G.sub.0 was then used to seed a final NMF using the multiplicative update solver to completion.
[0220] Gene set analysis was performed on the ranked list of genes for each factor with all sets available from MSigDB v3.1 (Subramanian et al., 2005). Sets were assessed for significance via Kolmogorov-Smirnov statistic with Benjamini-Hochberg correction. Due to the positive nature of the ranked gene list, only gene sets with positive enrichment were considered.
[0221] Patients and Samples. Multiple samples were obtained from 15 patients with metastatic PDAC from the University of Nebraska Medical Center Rapid Autopsy Pancreatic Program, and 17 patients from Johns Hopkins Medical Institutions and the Johns Hopkins Gastrointestinal Cancer Rapid Medical Donation Program. Informed consent was obtained from all subjects. To ensure minimal degradation of tissue, organs were harvested within 3 hours postmortem and the specimens flash frozen in liquid nitrogen. The cohort further included patients with resected PDAC and/or normal tissue from Johns Hopkins Medical Institutions, Northwestern Memorial Hospital, NorthShore Hospital, and the University of North Carolina (UNC) hospitals. All samples were collected between 1999 and 2009, flash frozen in liquid nitrogen at the time of operation after approval by each individual IRB. The UNC IRB approved use of all de-identified samples for this study. Some of these samples were previously published using a different normalization procedure as part of GSE21501 (Garrido-Laguna et al., 2011). All available samples were reviewed by a single pathologist (KEV).
[0222] The microarray cohort employed herein consisted of 145 primary (125 with survival data) and 61 metastatic PDAC tumors, 17 cell lines, 47 pancreas and 89 distant site adjacent normal samples, providing a rare diversity of tissue types with which to train our model. This data set represents an expansion from the 106 primary tumors in the previously published cohort GSE21501 (Garrido-Laguna et al., 2011) which was a bulk analysis of gene expression confined to primary tumors. The BxPC-3, MIA PaCa-2, HPAC, Panc 02.03, SW1990, HPAF-II, CFPAC-1, PANC-1, Capan-1, Capan-2, Panc 10.05, Hs 766T, Panc 03.27, and T3M4 PDAC cell lines were obtained from ATCC (Manassas, Va., United States of America). HuPT3 cells (obtained from Dan Billadeau, Mayo Clinic, Rochester, Minn., United States of America) and the immortalized human pancreatic duct-derived (HPNE) cells were described previously (Neel et al., 2014). All cell lines were authenticated via short tandem repeat profiling (Genetica), and all cell lines were mycoplasma negative by indirect staining. For survival analysis, only data from patients with localized resected tumors were used. RNA sequencing was performed on an additional 15 primary tumors, 37 pancreatic cancer patient-derived xenografts (PDX), 3 cell lines (HuPT3 plus 2 PDX-derived), and 6 cancer associated fibroblast (CAF) lines derived from deidentified patients with pancreatic cancer. Expression data have been uploaded to GEO.
[0223] PDX and derived cells. Fresh tumor samples from deidentified pancreatic ductal adenocarcinoma patients were obtained under protocols approved by the UNC IRB. All patient tissues were stained with hematoxylin and eosin (H&E) to confirm histology. The tumors were implanted subcutaneously into the flanks of 6-8 week old female NSG or NOD/SCID mice and subsequently passaged into other mice under protocols approved by the Institutional Animal Care and Use Committee.
[0224] Cell lines were derived from PDX as follows. At the time of passage, a section of the tumor was cut into approximately 3 mm pieces and rinsed with PBS containing penicillin and streptomycin (P/S). The tissue was minced with the GENTLEMACS.TM. Dissociator (Miltenyi Biotec) and incubated for 30 minutes in a Collagenase/Dispase (Roche 11097113001) solution. After incubation, mincing was repeated, the dissociation media was removed and the tissue was resuspended in DMEM/F12 media with 5 ng/ml EGF, 10 .mu.g/ml insulin (Life Technologies, 11330-032, PHG0311 and 12585-014 respectively), 10% FBS and 1.times.P/S and seeded onto tissue culture treated plates. Once culture was established, differential trypsinization was used to remove the fibroblasts and the cells were seeded on gelatin coated glass coverslips for immunofluorescence confirmation. Epithelial tumor cells were confirmed based on their expression of cytokeratin 18 or 19 and EpCAM (using Abcam ab133302, ab76539 and BioLegend 324209 antibodies).
[0225] Primary CAF cell lines from tumors of patients with PDAC were isolated using the outgrowth method as follows (Bachem et al., 2005). Fresh tumor was minced into pieces no larger than 1 mm.sup.3 and cultured with DMEM/Ham's F12 (1:1) media supplemented with 10% FBS. Immunofluorescence was used to confirm the presence of CAFs as defined by the presence of smooth muscle actin alpha (SMA.alpha. Santa Cruz Biotechnology 32251) and a mesenchymal marker, vimentin, (Cell Signaling 5741) as well as the absence of an epithelial marker, EpCAM (BioLegend 324209).
[0226] Statistical Analysis. For all analyses, sample size was limited to all appropriate cases with full data (i.e., no imputation was performed to estimate missing clinical information). Disease-specific survival or recurrence free survival was analyzed using the Kaplan-Meier product-limit method and the significance of clinicopathologic or subtype variables were measured by Cox proportional hazards regression. Multi-variable associations with survival were also performed using the Cox proportional hazards regression method. When more than 2 survival cohorts were compared, the log-rank test was used to assess global differences in survival. Fisher's exact test was used to analyze associations between 2 categorical variables. For continuous variables, e.g. stain intensity, factor weights, unpaired two-tailed two-sample t-tests were performed under the equal variance assumption. Box and whiskers plots show median, quartiles and range of continuous data to demonstrate variability of data and demonstrate degree of normality. Unless otherwise mentioned, sample to sample or gene to gene similarities were measured by correlation based on log.sub.2 transformed gene expression after normalizing each gene's expression to have a mean of zero and variance of one. Unless otherwise noted, clustering was done via consensus clustering of row-normalized gene expression. Consensus clustering consisted of 1000 iterations of k-means clustering, with 50% feature hold-out at each iteration, followed by hierarchical clustering of the consensus matrix with average linkage.
[0227] Microarray Data. All RNA isolation and hybridization was performed at UNC on Agilent human whole genome 4x44K microarrays (Agilent Technologies). RNA was extracted from macrodissected snap-frozen tumor samples using Allprep Kits (Qiagen) and quantified using nanodrop spectrophotometry (ThermoScientific). RNA quality was assessed with the use of the Bioanalyzer 2100 (Agilent Technologies). RNA was selected for hybridization using RNA integrity number and by inspection of the 18S and 28S ribosomal RNA. Similar RNA quality was selected across samples. One microgram of RNA was used as a template for cDNA preparations. cDNA was labeled with Cy5-dUTP and a reference control (Stratagene) was labeled with Cy3-dUTP using the Agilent low RNA input linear amplification kit (Agilent Technologies) and hybridized overnight at 65uC to Agilent 4x44 K whole human genome arrays (Agilent Technologies). Arrays were washed and scanned using an Agilent scanner (Agilent Technologies).
[0228] Arrays were annotated using GEO platform GPL4133, and analyzed using log.sub.2 background corrected Cy5 signal to maintain positivity. Multiple probes mapping to the same gene symbol were collapsed by mean probe expression. Samples were normalized to each other via quantile normalization.
[0229] RNAseq. 200-1000 ng of total RNA was used to prepare libraries with the TruSeq Stranded mRNA Sample Prep Kit (Illumina). 75b paired-end reads were sequenced on a NextSeq 500 Desktop Sequencer using a high output flow cell kit (Illumina). Reads were separated by species of origin using Xenome (Conway et al., 2012). Human or mouse specific reads were then aligned and quantified using Tophat2 (Kim et al., 2013), Cufflinks (Trapnell et al., 2012), hg19, mm10, and the UCSC knownGene transcript and gene definitions (<<genome>><<.>>ucsc<<.>>edu). mRNA gene expression was analyzed as log.sub.2(1+FPKM), and KRAS mutation status was determined by manual curation of aligned human reads.
[0230] Validation Data Sets. Gene expression array data from resected primary tumor samples from the Australian Pancreatic Cancer Genome Initiative and International Cancer Genome Consortium (ICGC) data were obtained from GSE50827 (Biton et al., 2014). Associated open access clinical data were obtained from the ICGC data portal: <<http>>://<<dcc>>.<<icgc>>.<<o- rg>>/release_16. Patients with death events before 30 days were assumed to have postoperative complications and were censored. Patients with metastases were excluded from survival analyses. Genomic subtypes, mutations, and amplifications were obtained from supplemental materials available from Waddell et al., 2015.
[0231] Normalized gene expression, survival data, and PAM50 (Stolze et al., 2015) classification from primary breast cancer (Perou) samples (n=295) as part of the UNC337 set were obtained from GSE18229 (Dal Molin et al., 2015).
[0232] Normalized RNAseq expression data of 845 primary tumor data were obtained as described by Hoadley et al., 2014 from TCGA <<https>>://<<tcga-data>>.<<nci>>.<- ;<nih>>.<<gov/tcga>> (Zhong et al., 2015),
[0233] Normalized RNAseq gene expression and partial survival data from 223 urothelial bladder carcinoma (BLCA) samples were obtained from TCGA (<<https>>://<<tcga-data>>.<<nci>>.&l- t;<nih>>.<<gov/tcga>>)<Alexandrov et al., 2013b). Samples were classified as basal or luminal with BASE47 classifications provided by Damrauer et al. (Isella et al., 2015).
Example 1
Virtual Microdissection of PDAC
[0234] Gene expression in a cohort of microarray data from 145 primary and 61 metastatic PDAC tumors, 17 cell lines, 47 pancreas and 89 distant site adjacent normal samples were analyzed using Agilent (Agilent Technologies) human whole genome 4x44K DNA microarrays (106 primary tumors were previously used in a separate analysis of gene expression (GSE2150115; Stratford et al., 2010). To validate the findings, further RNA sequencing was performed on 15 primary tumors, 37 pancreatic cancer patient-derived xenografts (PDX), 3 cells lines, and 6 cancer associated fibroblast (CAF) lines derived from deidentified patients with pancreatic cancer. Histology of all available samples was reviewed by a single blinded pathologist (KEV). Table 7 summarizes the demographic and clinical characteristics of patients in our cohorts.
TABLE-US-00007 TABLE 7 Demographics and Univariate Cox Analysis Resected Univariate with Cox p- Microarray RNAseq RNAseq All Survival value Primary Primary PDX Race Caucasian 128 121 0.507 99 9 25 African- 23 18 0.333 10 3 8 American Other 8 7 0.821 5 0 3 Gender F 90 83 0.348 67 5 23 M 80 68 0.348 55 8 14 T T1 4 4 0.420 2 1 2 Stage T2 22 20 0.530 20 2 5 T3 131 122 0.743 91 9 28 T4 1 1 0.115 1 0 0 N N0 49 43 0.068 36 7 10 Stage N1 112 106 0.068 80 5 25 M M0 160 149 -- 129 12 35 Stage M1 15 0 -- 14 0 Adjuvant Yes 74 70 0.055 44 5 21 Therapy No 30 28 0.055 27 3 7 Differentiation Well 16 13 0.940 16 0 1 Moderate 49 47 0.398 49 1 3 Poor 34 31 0.407 34 1 2 PDX Graft Success 44 37 0.164 11 8 37 Graft Failure 18 12 0.164 9 3 0 Margin Positive 58 52 0.026 34 5 17 Negative 93 88 0.026 75 7 17 TOTAL 193 163 143 15 37
Example 2
NMF Distinguishes Normal and Tumor Compartments
[0235] A key obstacle in the analysis of gene expression data, particularly in PDAC, is the removal of confounding normal or stroma gene expression from local and distant organ sites. FIGS. 1A-1D shows example histology of samples with both tumor, normal, and stromal tissue. NMF was employed to identify gene expression which we attribute to normal pancreas, liver, lung, muscle, and immune tissues. Expression of exemplar genes from these factors, i.e., genes with distinctly large weights in a single column of G, as well as factor weights for the samples, i.e., rows of S, showed excellent agreement with known tissue labels (see FIG. 3B, FIG. 3C, and FIG. 5). Investigation of the exemplar genes from these factors further confirmed their role as confounding normal tissue. For example, using the Kolmogorov-Smirnov test, the top-weighted genes from the liver factor showed significant (p<10.sup.-10) enrichment in the MSigDB term SU_LIVER, and the highest weighted gene, fibrinogen beta (FGB), was specifically expressed in normal human liver tissue.
[0236] In addition to normal tissue from distant organs, two factors were identified that were exclusive to pancreas tissue, but were differentiated from each other by their respective gene lists. One factor described endocrine function including expression of glucagon and insulin (GCG and INS), while the other factor described exocrine function including expression of digestive enzyme genes such as pancreatic lipase, PNLIP. This unsupervised discovery of two molecularly distinct yet highly co-localized factors related to normal pancreatic function represented an important proof of concept in the use of NMF to identify novel features without pre-defined expression knowledge.
[0237] To validate the normal expression signatures disclosed herein, all available samples were reviewed by a single pathologist to independently assess the amount of tumor, normal, and stroma cellularity. It was determined that many factor weights were correlated or anti-correlated to tumor cellularity (FIG. 6). Among normal and metastatic liver samples, for example, tumor-specific basal-like factor weights were correlated with cellularity, whereas the normal-specific liver factor weight was inversely related to the tumor content of a sample (FIG. 3D). These findings support the hypothesis that factor weights obtained from NMF were quantitatively indicative of underlying sample composition.
Example 3
Identification of Stroma-Specific Subtypes
[0238] Stroma is particularly important in PDAC. According to pathology assessments, stroma varies, and comprises on average 48% of the primary tumor samples employed herein, with a standard deviation of 30%. The instant analysis identified two factors which described gene expression from the stroma, which were distinctly different from the normal factors shown in FIGS. 3A-3D. Consensus clustering on exemplar genes from these two stroma factors divided tumor samples into two stromal subtypes, which were classified as "normal" and "activated" (FIG. 4A). Patients with samples with an activated stroma subtype had worse median survival (15 months) and 60% 1-year survival, when compared to patients with a normal stroma subtype (median 24 months, 1-year survival 82%; FIG. 4B). Both were notably absent in PDAC cell lines (FIG. 4C), which exhibited a distinct mitotic expression signature associated with mitotic checkpoints and DNA replication (Table 8). Whitfield et al., 2002. The fact that cell lines do not express these stromal factors and many metastatic samples do express them at low levels suggested that these genes were not expressed by the tumor epithelium. To further validate the stromal origin of these gene expression signatures, 6 CAF lines were isolated from primary tumors (FIGS. 8A-8F), and it was determined that they robustly overexpressed the stromal signatures disclosed herein as compared to PDAC tumor cell lines which had no expression of the stromal signatures (FIG. 4C).
[0239] The vast majority of collagen gene expression was attributable to stromal compartments, with the lone exception being COL17A1, which was high in tumors. "Normal" stroma was characterized by relatively high expression of known markers for pancreatic stellate cells, smooth muscle actin, vimentin, and desmin, (ACTA2, VEIL and DES). Stellate cells have been shown to promote cancer cell survival in vitro (Froeling et al., 2011), but at the same time may restrain PDAC in mouse models (Ozdemir et al., 2014; Rhim et al., 2014), or inhibit delivery of chemotherapy (Olive et al., 2009). In patients, the ratio of smooth muscle actin stained area to the collagen-stained area has been shown to be predictive of poor outcomes (Erkan et al., 2008). "Activated" stroma was characterized by a more diverse set of genes associated with macrophages, such as the integrin ITGAM, and the chemokine ligands CCL13 and CCL18. "Activated" stroma also expressed other genes which point to its role in tumor promotion, including the secreted protein SPARC, WNT family members WNT2, and WNT5A, gelatinase B (MMP9), and stromelysin 3 (WPM). The presence of fibroblast activation protein (FAP) in the activated stroma, which has previously been related to worse prognosis, suggested that an activated fibroblast state may be partially responsible for the poor outcomes for these patients (Cohen et al., 2008). This observation led to the hypothesis that the "normal" stroma factor may describe a "good" version of stroma and that "activated" stroma factor may describe the activated inflammatory stromal response that has been seen in previous studies to be responsible for disease progression (Hwang et al., 2008; Vonlaufen et al., 2008; Herrera et al., 2013). The multifactor analysis disclosed herein supported a complex, multi-gene model of stroma in PDAC, which may explain why single gene analysis has yielded mixed results.
Example 4
Identification of Tumor-Specific Subtypes
[0240] Independent of normal and stromal factors, it was determined that two tumor-specific factors define "classical" and "basal-like" subtypes of PDAC. When the presently disclosed samples were split into the two tumor subtypes (FIG. 7A), patients with basal-like subtype tumors had an overall worse median survival of 11 months and 44% 1-year survival compared to 19 months and 70% 1-year survival for those with classical subtype tumors (p=0.006, FIG. 7B). All cell lines assayed in this study (p<0.001), as well as a majority of metastatic samples (p=0.002), were classified as "basal-like", suggesting that cell line models represent only one subset of PDAC. These subtypes as well as their prognostic and/or diagnostic value were independently validated within the recently published International Cancer Genome Consortium (ICGC) PDAC microarray data set (FIGS. 7C and 7D; Nones et al., 2014). Genes from the "basal-like" factor, including laminins and keratins, were also consistent with basal subtypes previously defined in bladder (Rubio-Viqueira et al., 2006; Alexandrov et al., 2013b; Isella et al., 2015) and breast (Stolze et al., 2015) cancers (FIGS. 7E-7H). Interestingly, genes from the "basal-like" subtype reproduced subtype calls (p<0.001) in breast cancer, had prognostic value in breast cancer samples (p<0.001) and reproduced previous subtype calls in bladder cancer (p<0.001). Given these promising results, a single-sample cross-platform classifier of basal-like subtype which was trained on the presently disclosed microarray was developed, TCGA bladder, and Perou breast cancer data, with a 93% cross validation accuracy, which was able to classify TCGA breast cancer data with 92% accuracy during external validation (FIG. 9).
[0241] Potential subtypes of PDAC have previously been described by Collisson et al., 2011. The published exemplar genes were employed for "exocrine-like", "classical", and "quasimesenchymal" subtypes to cluster normal pancreas, cell lines, and primary PDAC tumors from the presently disclosed cohort (FIG. 10A). The three previous classifications were also observed in the data presented herein, but none held prognostic power either by cluster label or by supervised classification with PAM (FIG. 10B; Ihle et al., 2012). Furthermore, inclusion of the Collisson et al. subtypes into a multivariate Cox regression with the proposed tumor subtypes described herein did not remove the predictive power of the presently disclosed subtyping (p=0.014). By cross-referencing the genes from Collisson et al.'s model with the NMF model disclosed herein, three key findings were observed. First, "exocrine-like" genes overlapped with genes from the exocrine pancreas factor (17/17). Tumors in this cluster had expression indistinguishable from adjacent normal samples from the presently disclosed data set. Second, Collisson et al.'s "classical" genes overlapped with the "classical" subtype genes disclosed herein (20/22), for which the naming convention "classical" was retained herein. Third, the gene set associated with "quasimesenchymal" subtype appeared to be a mixed collection of genes from the presently disclosed "basal-like" tumor (6/20) and stromal subtypes (6/20). Thus, the appearance of stromal factors in the Collisson et al. list of "quasimesenchymal" class genes may explain the apparent mesenchymal-like gene expression that was observed.
[0242] "Basal-like" and "classical" tumors were found within both "normal" and "activated" stroma subtypes (FIG. 11A). Differential prognosis among tumor and stroma subtypes was cumulative, as "classical" subtype tumors with "normal" stroma subtypes (n=24) had the lowest hazard ratio of 0.39 with and a 95% CI of [0.21, 0.73], while the "basal-like" subtype tumors with "activated" stroma subtypes (n=26) had the highest hazard ratio of 2.28 with a 95% CI of [1.34, 3.87] (FIG. 11B). In a multivariate Cox regression model, which included tumor subtypes, stromal subtypes, and clinical variables (gender, race, T stage, N stage, margin status, adjuvant therapy, histological grade, and age), both classifications were independently associated with survival (stroma subtypes: p=0.037, tumor subtypes: p=0.003).
[0243] Although basal-like subtype tumors have a worse prognosis, patients with basal-like subtype tumors showed a strong trend towards better response to adjuvant therapy (p=0.072; FIG. 11C). Among basal-like subtype patients, adjuvant therapy provided a hazard ratio of 0.38, (95% CI of [0.14, 1.09]), while in patients with classical subtype tumors, adjuvant therapy is associated with a hazard ratio of only 0.76 (95% CI [0.40, 1.43]). In the presently disclosed cohort, there was no association of most clinical variables (race, gender, T stage, N stage, differentiation, or tumor cellularity) with survival, although positive nodal status trended towards significance, and positive margin status was significantly associated with worse survival (Table 7). Table 8 shows two-way associations of all subtype calls with clinical and pathological information from the presently disclosed cohort of PDAC patients. No association of tumor or stroma subtype with standard clinical or pathological variables was found, with the notable exception of mucinous features.
TABLE-US-00008 TABLE 8 Summary of Associations with Clinical Covariates and Subtypes Tumor Subtype Fischer's Fischer's Basal- Exact Stroma Subtype Exact Covariate Classical like p-value Normal Activated p-value Race Caucasian 90 27 0.521 26 65 1 African- 13 2 3 7 American Gender F 64 19 0.849 17 43 1 M 50 16 15 36 T T2 16 6 0.590 5 14 1 Stage T3 87 25 25 59 N N0 35 9 0.532 11 22 0.649 Stage N1 72 25 21 54 Margin Positive 38 8 0.385 7 22 0.629 Negative 65 22 22 49 Adjuvant Yes 48 13 0.437 10 30 0.769 Therapy No 21 9 5 19 Differentiation Poor 23 11 0.479 11 18 0.203 Well 49 16 13 44 Extracellular Low Mucin 49 24 0.042 18 43 0.792 Mucin High Mucin 23 3 6 19 Stroma Normal 31 8 0.144 Activated 57 31
Example 5
Tumor-Specific Subtypes Found in Patient-Derived Xenografts
[0244] To assess the tumor or stromal specificity of the presently disclosed signatures, RNAseq was performed on a group of 37 PDX tumors. PDX tumors were composed of human tumor cells surrounded by mouse stroma (FIGS. 12A-12D; Isella et al., 2015). Genes from both of the presently disclosed tumor signatures were expressed as human transcripts, whereas genes from both of the presently disclosed stromal signatures were expressed as mouse transcripts (FIG. 4D, FIG. 13A). PDX RNAseq expression was found to divide PDX into both classical and basal-like groupings (FIG. 13B) while predominantly expressing an activated stromal signature (FIG. 4D). Additionally, while tumor-specific subtype was not predictive of graft success (FIG. 14A), patient tumors with an activated stroma subtype had significantly higher graft success rates than those with normal stroma subtype or low amounts of stroma (FIG. 14B; p=0.019). Basal-like subtype tumors also exhibited faster growth rates than classical tumors (p=0.032) as measured by the length of time that tumors took to grow to 200 mm.sup.3 (TT200; FIGS. 14C and 14D), a previously used metric for PDX growth (Rubio-Viqueira et al., 2006). Retrospective analysis of patients who had matched PDX tumors found that a shorter TT200 was associated with an unfavorable recurrence-free survival (p=0.035; FIG. 14E), suggesting that PDX tumor growth rate may reflect patient biology.
[0245] Both mouse and human-specific expression of the Collisson et al. genes were measured in the presently disclosed PDX models. It was determined that while genes from the "classical" subtype were expressed by human cells in PDX, "quasimesenchymal" transcripts were expressed by a mixture of human and mouse cells, and "exocrine-like" transcripts were infrequently expressed (FIG. 10C). This supported the hypothesis that while the "classical" subtype was a bona fide group, the "quasimesenchymal" subtype was partially driven by non-tumor contributions of stroma and the "exocrine-like" subtype by normal pancreas.
Example 6
KRAS Codon Mutations, Tumor-Specific Subtypes, and Race
[0246] Studies of KRAS codon mutations have demonstrated that different codon mutations may have differential functions (Ihle et al., 2012; Stolze et al., 2015) and in some clinical studies, have been shown to be associated with differential outcome. Because PDX tumors are enriched for human-specific tumor cells, KRAS codon mutations were evaluated in the presently disclosed PDX cohort using manually curated RNAseq data. While the overall frequency of KRAS codon mutations was similar to a recent study of PDAC (Witkiewicz et al., 2015), it was noted that the KRAS G12D mutation was significantly overrepresented in the presently disclosed basal-like subtype while G12V was isolated to the classical subtype (FIG. 14F; p=0.030). Furthermore, an overrepresentation of KRAS G12V mutations was found in African-Americans (FIG. 14G; p<0.001). In contrast to basal-like breast cancers, which occur most frequently in African-American women and have a worse prognosis (Carey et al., 2010), African-American patients in the presently disclosed cohort tended to have mainly classical subtype tumors (13 vs 2). Similar to other cancers, African-Americans had a worse prognosis after adjusting for tumor subtype (FIG. 11E; p=0.017). African-American patients with classical subtype tumors had a mean survival of 13 months compared to Caucasian patients with classical subtype tumors, who had a median survival of 19 months.
Example 7
Other Commonly Mutated Genes and Altered Pathways in PDAC
[0247] Previously, loss of SMAD4 has been shown to promote tumor growth (Bardeesy et al., 2006; Haeger et al., 2015). Similar to previous PDX studies of PDAC, loss of SMAD4 was also found to be associated with graft success in PDX models (Garrido-Laguna et al., 2011; see FIG. 14H, FIG. 15A-15G; p=0.044). Furthermore, in the presently disclosed PDX cohort, SMAD4 expression was significantly higher in classical compared to basal-like subtype PDX tumors (FIG. 14I; p=0.015), consistent with the observation that SMAD4 loss confers a more aggressive phenotype.
[0248] Using mutation, genomic subtype (Waddell et al., 2015), and gene expression (Nones et al., 2014) data from publically available ICGC data in which recapitulation of the presently disclosed subtypes and prognosis were shown, significantly mutated genes and pathways in PDAC were also evaluated, including ones recently identified through whole-exome sequencing of microdissected primary PDAC tumors (Jones et al., 2008; Biankin et al., 2012; Waddell et al., 2015; Witkiewicz et al., 2015). No significant associations between the presently disclosed expression subtypes and these mutationally altered pathways, i.e., TGF.beta., RB, NOTCH, CTNNB1, SWI/SNF, and DNA repair, were found (FIG. 16). Furthermore, no overlap was found between the presently disclosed subtypes and recently identified genomic subtypes, or response to platinum therapy (Waddell et al., 2015). Consistent with this, a recent comprehensive study of somatic mutations in PDAC long-term survivors suggested that somatic mutations alone will not be sufficient to explain clinical outcome (Dal Molin et al., 2015).
[0249] Given the overlap of the presently disclosed classical subtype with that of Collisson et al., 2011, it was not surprising to find that the presently disclosed classical subtype was also enriched for genes associated with GATA6 overexpression (Zhang et al., 2008; FIG. 17A, FIG. 11A). GATA6 has been found to promote epithelial cell differentiation (Zhang et al., 2008; Zhong et al., 2015). More detailed histological markers of differentiation were evaluated in the presently disclosed samples, and it was found that samples with greater than 10% extracellular mucin, a marker of differentiation, comprised mostly of classical subtype tumors (88.5%, n=23) compared to only 11.5% (n=3) of basal-like subtype tumors (FIGS. 18A-18C, p=0.042; Table 9). Consistent with the increased presence of extracellular mucin, the presently disclosed classical subtype was enriched for genes upregulated in mucinous ovarian cancer (WAMAUNYOKOLI_OVARIAN CANCER_GRADES_1_2_UP; Wamunyokoli et al., 2006). Interestingly, the presently disclosed basal-like subtype was enriched for genes related to KRAS activation and STK11 loss in a lung cancer mouse model where STK11-deficient tumors demonstrated shorter latency and more frequent metastasis (Ji et al., 2007). One sample with STK11 inactivation was found in the ICGC data; this sample was a basal-like subtype (FIG. 16). Notably, the presently disclosed subtypes were not associated with other known signaling pathways in PDAC, including Fanconi anemia, DNA repair, chromatin remodeling, beta-catenin, RB, ARF, G1 (FIG. 11A). However, all of these pathways except for beta-catenin were considerably differentially expressed in cell lines compared to patient tumors, suggesting that gene expression in cell lines might be a deceptive representation of most tumors.
Example 8
Tumor-Specific Subtypes Suggested Low Intrapatient Heterogeneity Between Primary and Metastatic Lesions
[0250] It is likely that only a subset of genes are relevant to the question of intra- and inter-patient heterogeneity in PDAC. Many methods exist to pre-select genes for supervised analysis (Carey et al., 2010), but selection of the most differentially expressed genes is a common preprocessing step during unsupervised analysis (Bardeesy et al., 2006). When clustering matched samples of metastatic and primary lesions using the 50 most differentially expressed genes among all matched samples, samples separated primarily by organ site instead of by patient (FIGS. 19A and 19C). In contrast, when considering 25 top ranked exemplar genes each from the "basal-like" and "classical" factors, samples from the same patient clustered closer together, and were less dependent of organ site (FIGS. 19B and 19D).
[0251] This was further illustrated in a focused analysis of two patients (FIGS. 19A-19G), whose tumor samples appeared patient-specific when considering the presently disclosed tumor subtype gene list, but clustered by site when considering differentially expressed genes. Overall, it was found that the presently disclosed tumor subtype gene list showed higher similarity (mean Pearson's .rho.=0.53) between all other samples from the same patient than did the differentially expressed gene list (.rho.=0.32, t-test p.ltoreq.0.001). Furthermore, the presently disclosed tumor subtype gene list produced much lower similarity among all other samples from the same organ site across different patients (.rho.=0.04) than the differentially expressed gene list (.rho.=0.34, p.ltoreq.0.001). This observed similarity of tumor gene expression among tumors within the same patient suggested overall high inter-patient tumor heterogeneity and low heterogeneity between primary and metastatic sites. However, examples of intra-patient heterogeneity were not observed between metastatic sites. For example, lung metastases, even those from patients with "basal-like" tumors in other locations, clustered exclusively with the "classical" tumors, suggesting that some intra-patient heterogeneity may exist among metastatic sites, and supporting the previously reported divergent patterns of failure in PDAC (Haeger et al., 2015).
Discussion of Examples 1-8
[0252] The studies disclosed herein represent the largest investigation of primary and metastatic PDAC gene expression to date. NMF was used to identify novel prognostic and/or diagnostic subtypes of PDAC which may have been previously obscured by confounding normal and stromal tissue. The identification of normal-, tumor-, and stroma-specific gene expression signatures was supported by both their overlap with previously identified gene lists and their expression in appropriate tissue types. The presently disclosed tumor subtypes were further supported by their relationship to previously identified basal tumor subtypes in breast and bladder cancers and their prognostic and/or diagnostic relevance in external cohorts. The present findings of two different stroma subtypes may help explain the differential effects of stroma previously seen in preclinical models.
[0253] Tumor and stroma specific gene expression classified PDAC into four distinct subtypes with prognostic and/or diagnostic relevance. The orthogonal nature of tumor- and stroma-specific subtypes suggested an important interplay in patient tumors that will need to be taken into account as stroma and immune modulating therapies are studied. In the presently disclosed cohort, patients with basal-like tumors appeared to derive more benefit from adjuvant therapy. Whether basal-like and classical subtypes may be associated with response to specific therapies can be studied further as more effective therapies become available. One challenge will be defining preclinical model systems that recapitulate these subtypes as the presently disclosed results suggested that traditional cell lines are lacking in the classical subtype. Although it has been demonstrated that PDX models recapitulate tumor-specific subtypes, these models alone may not be sufficient due to either the lack of human stroma or overrepresentation of the activated stroma subtype in the tumors that are successfully grafted. Thus, more detailed characterization of genetically engineered mouse models of PDAC models can be employed to determine which models best reflect both our tumor- and stroma-specific subtypes.
[0254] Recent exome sequencing studies have confirmed commonly mutated genes in PDAC but have not uncovered mutations that clearly confer survival differences (Jones et al., 2008; Waddell et al., 2015; Witkiewicz et al., 2015). In fact, exome sequencing of a cohort of very long-term survivors of PDAC (Dal Molin et al., 2015) found no differences in somatic mutations to explain the improved biology of tumors from these rare patients compared to the majority of patients with PDAC, suggesting that examining somatic mutations alone may not be sufficient to understand the biological and clinical differences in PDAC tumors. Furthermore, exome sequencing studies and studies of microdissected samples are limited to the tumor compartment and overlook the stroma compartment which has been shown to be biologically critical in PDAC, with both tumor-promoting and tumor-inhibiting effects. The results provided herein suggested that RNA subtypes may better capture the molecular landscape of PDAC and its reflection on patient outcome. As such, the RNA subtypes disclosed herein may reflect the broad effect of somatic mutations while also capturing the importance of the neoplastic stroma.
[0255] These results provide new insight into the molecular composition of PDAC which may be used for precision medicine. Furthermore, knowledge of these subtypes and their prognostic and/or diagnostic value can provide decision support in a clinical setting where the choice and timing of therapies can be critical.
Example 9
Construction of a Cross-Platform Basal-Like Classifier
[0256] Having established a method for classifying cohorts of PDAC expression data into basal-like and classical samples, a more clinically applicable classification scheme that works on single samples was constructed. Such a single-sample classifier can be valuable in a clinical setting, where access to a large cohort of comparative cases is prohibitive. Furthermore, the ability of such a classifier to work across gene expression platforms and across relevant cancer types was assessed.
[0257] As such, a platform-independent classifier was developed and tested to discriminate between "basal-like" samples versus others across various cancers, given a sample's individual gene expression profile. Rank-based classifiers such as the Top Scoring Pair (TSP; Leek, 2009) and kTSP (Afsari et al., 2014) depend only on the relative ranks of the expression of genes within a sample, allowing such classifiers to be robust against platform-specific effects and study-to-study variations due to data normalization and preprocessing (Patil et al., 2015)
[0258] Briefly, the kTSP approach selects k pairs of genes A and B such that gene A expression>gene B expression implies sample membership to class 1, otherwise implying membership to class 2. The default decision rule in Afsari et al., 2015 following feature selection weights each TSP equally in their class prediction ("voting"), despite the fact that some TSPs may better discriminate between classes than others. The kTSP approach of Afsari et al., 2015 was extended as set forth herein by implementing a custom decision rule that inputs the selected k gene pairs into a penalized logistic regression classifier to estimate the relative contribution each of the k selected TSPs in predicting class membership (defined here as basal-like versus otherwise), similar to (Shi et al., 2011). In fitting the model, class membership was the binary outcome variable, and each covariate corresponded to a TSP, consisting of a binary integer vector which took on the value of 1 for a sample if gene A>gene B in expression for that TSP, and 0 otherwise for each sample.
[0259] A penalized logistic regression model was fit using the ncvreg package (Breheny & Huang, 2011) to account for potential correlation between TSPs (ridge penalty) and to remove TSPs unhelpful in prediction given the presence of other features in the model (MCP penalty). Given the fitted model and a new sample's expression profile, a predicted probability of basal-like class membership could be obtained.
[0260] To build the presently disclosed classifier to predict the basal-like class across various cancers, the presently disclosed classifier was trained on a "metadataset" consisting of the TCGA Bladder (RNA-seq, 20533 genes), UNC Pancreas (Microarray, 19749 genes), and Perou Breast Cancer (Microarray, 17631 genes) data sets, totaling 788 samples. Each data set was reduced to a common set of genes found across each study to the described 50 gene signature described herein. The Perou Breast Cancer data set was further filtered to remove genes that had missing values for more than 10 samples, leaving 11526 genes. The remaining missing data was imputed using the impute package (Hastie et al. impute: impute: Imputation for microarray data. R package version 1.42.0.) in R using default parameters. Only 29 of the 50 genes from the original gene signature remained for feature selection after filtering. Because of this small number, a larger 500 gene set encompassing the original 50 gene set, which was derived in a similar fashion, was utilized. From this larger gene set, 302 genes were found across all three training datasets.
[0261] Basal-like samples were identified in the TCGA bladder and Perou Breast Cancer data sets from their associated clinical annotation files, and in the UNC Pancreas data, the basal-like clustering calls from the present disclosure were utilized. Given the known classes (basal-like versus otherwise) and gene expression profiles in each data set, the presently disclosed feature selection was performed using the switchBox package (Afsari et al., 2015) to select the k TSPs from the 302 candidate genes, resulting in 16 TSPs being selected. The ncvreg function from (Breheny & Huang, 2011) was applied using the MCP penalty and an alpha parameter of 0.5, allowing for equal contribution of the ridge penalty to account for correlation between TSPs and the MCP penalty for feature selection. The appropriate penalty was chosen via leave-one-out cross validation using the cv.ncreg function (788 folds).
[0262] The final model described herein was found to contain 14 TSPs when derived from the larger 500 gene signature. The fitted estimates can be found in Table 9. Calculating the pair-wise spearman correlation between samples across the classifier's genes, it was determined that samples from the basal-like state (orange) tended to cluster together in terms of similarity (see FIG. 16). It was also determined that the predictions described herein tended to match the known classes for each sample regardless of platform or tumor type.
TABLE-US-00009 TABLE 9 Fitted Estimates for the Final Model Estimated Increase in Odds of Basal Class Gene A Gene B Coefficient Membership when A > B CD109 GPR160 0.87 2.38 SLC2A1 AGR2 1.22 3.39 KRT16 SLC44A4 0.52 1.68 CTSL2 TMEM45B 1.43 4.17 KRT6A BCAS1 0.70 2.01 B3GNT5 VSIG2 0.41 1.51 MET TFF3 0.72 2.06 CHST6 PLA2G10 0.80 2.24 SERPINB5 HPGD 0.76 2.13 DCBLD2 PLS1 1.40 4.07 IL20RB FAM3D 1.33 3.79 PPP1R14C SYTL2 1.58 4.85 NAB1 PLEKHA6 0.41 1.50 MSLN CAPN9 1.58 4.83 (Intercept) -7.16
[0263] To classify each sample, gene expression from pairs of genes in Table 9 were compared such that for each gene pair, if Gene A expression is greater than Gene B expression, the coefficient for that gene pair was added to a running sum. If the sum of all such coefficients and the intercept from Table 9 was greater than zero, the sample was classified as basal (see EQUATION 1).
[0264] To validate the 14 TSP classifier, the presently disclosed model was applied to two independent data sets: the TCGA Breast Cancer (RNAseq) data set and the ICGC pancreas cancer data sat (Microarray). It was determined that the predictions matched well in the independent TCGA data set, demonstrating a 92.3% classification accuracy. The only validation data set that did not have existing subtype calls is the ICGC pancreas data set. It was further determined that the presently disclosed TSP predictions did not match as well with the presently disclosed clustering results, with a match rate between clustering-based calls and classifier prediction of 85.5%. Finally, it was also determined that spearman correlation of gene expression as a whole was much worse between the ICGC platform and any of the various RNAseq or Agilent Microarray data described herein.
[0265] Accordingly, the present disclosure demonstrated excellent within-training set performance of the described classifier across multiple platforms, in addition to accurate prediction of the classifier in an independent RNAseq data set.
[0266] Extending the methodology described above, a stroma-specific (activated versus normal stroma; see EQUATION 2) and a tumor-specific (basal versus classical; see EQUATION 3) classifier was trained within only the pancreatic cancer data. Table 10 and Table 11 show the coefficients of the fitted model sufficient for classifying between activated and normal stroma subtypes, or between basal-like and classical subtypes, respectively.
TABLE-US-00010 TABLE 10 Fitted Estimates for the Pancreas-specific Stromal Model Estimated Increase in odds of activated stroma Gene A Gene B Coefficient class membership when A > B ITGA11 SCRG1 0.67 1.95 COL5A1 IGF1 1.25 3.48 COL11A1 ANGPTL7 3.23 25.37 MMP11 ACTG2 1.67 5.30 FNDC1 SYNM 1.43 4.18 ZNF469 MYH11 1.51 4.54 RBPMS2 RERGL 1.25 3.49 COL1A1 COL1A2 0.18 1.20 Intercept -6.17
TABLE-US-00011 TABLE 11 Fitted Estimates for the Pancreas-specific Tumor Subtype Model Estimated Increase in odds of basal class Gene A Gene B Coefficient membership when A > B GPR87 MS4A8B 1.084442 2.96 KRT6C BTNL8 2.622242 13.77 ANXA8L2 PLA2G10 2.73881 15.47 KRT6A KCNE3 1.891903 6.63 C16orf74 DDC 1.898285 6.67 SCEL MYO1A 2.161549 8.68 DCBLD2 PLS1 2.189532 8.93 FAM83A REG4 2.855056 17.38 PTGES ATP10B 1.674513 5.34 Intercept -9.255835
Example 10
Exemplary Clinical Approaches to Care
[0267] FIGS. 20 and 21 show exemplary, non-limiting clinical approaches to care based on tumor and stroma subtype determinations employing EQUATIONS 2 and 3 above.
[0268] In FIG. 20, exemplary treatment considerations for patients with a pancreatic mass that has no evidence of distant (metastatic) spread to other organs (i.e., tumor is confined to the pancreas) is presented. In this case, a patient undergoes a biopsy. If the stroma subtype is determined to be normal using EQUATION 2, the patient proceeds to surgery. However, as agents such as those listed in Table 3 become available or are developed against the genes in Table 3, a patient with normal stroma subtypes also considers neoadjuvant therapy using the Table 3 agents prior to surgery. If this patient is determined to have an activated stroma subtype, the patient considers radiation and other stroma modulation therapies noted in FIG. 20, including but not limited to hyaluronidase, hedgehog inhibition, modified vitamin D, vitamin D derivatives or compounds, anti-cytokine agents, or agents listed in or directed against the genes listed in Table 2. Additionally, the patient considers the therapies recommended based on tumor subtype (classical or basal-like) as described herein below.
[0269] If the biopsy shows classical subtype as determined using EQUATION 3, the patient is moved directly to surgery or prior to surgery, treatment with one or more agents listed in Table 5 or Table 6 or directed against the genes listed in Tables 5 and 6 is commenced. If the patient has a basal-like tumor, surgery alone would not be adequate. Therefore, this patient is recommended to undergo chemotherapy with the agents listed in FIG. 20 and/or with agents listed in Tables 4 and 6 and/or against the genes listed in Tables 4 and 6.
[0270] FIG. 21 shows exemplary treatment considerations for patients with a pancreatic mass that has evidence of distant (metastatic) spread to other organs. In this case, the patient would also undergo a biopsy. If the biopsy shows classical subtype as per EQUATION 3, the patient considers 5-fluorouracil or platinum based therapy. In some instances, other chemotherapies are also considered. However, as other agents such as those listed in Tables 5 and 6 become available or are developed against the genes listed in Tables 5 and 6, these therapies are considered in conjunction with the chemotherapy. If the patient has a basal-like tumor, cisplatin- or oxaliplatin-based therapies or gemcitabine as listed in FIG. 21 are considered. In some instances other chemotherapies are appropriate. In addition, the agents listed in Tables 4 and 6 or agents against the genes listed in Tables 4 and 6 are added to the chemotherapy.
[0271] If the patient has a normal stroma subtype as per EQUATION 2, no additional therapy besides those based on the tumor subtype is considered. However, immunotherapies to augment immune response can be considered. As additional agents such as those listed in Table 3 become available or are developed against the genes in Table 3, a patient with normal stroma subtypes considers using Table 3 agents in conjunction with the tumor subtype specific therapy regimen such as chemotherapy. For patients with activated stroma, radiation and other stroma modulation therapies listed in FIG. 21 are considered in conjunction with the tumor subtype specific therapy, including but not limited to hyaluronidase, hedgehog inhibition, modified vitamin D, vitamin D derivatives or compounds, and anti-cytokine agents. In addition, the agents listed in Table 2 and/or agents against the genes listed in Table 2 are also considered.
REFERENCES
[0272] The references listed below as well as all references cited in the specification including, but not limited to patents, patent application publications, journal articles, and database entries (e.g., GENBANK.RTM. biosequence database entries including all annotations and references cited therein) are incorporated herein by reference to the extent that they supplement, explain, provide a background for, or teach methodology, techniques, and/or compositions employed herein. With respect to GENBANK.RTM. biosequence database entries, if a sequence listed herein is or has been updated with a new sequence, it is understood that the instant disclosure also incorporates by reference to the sequence listed herein any such new sequences.
[0273] Afsari et al. (2014) Ann Appl Stat 8:1469-1491.
[0274] Afsari et al. (2015) Bioinformatics 31:273-274
[0275] Ahmad et al. (2001)Am J Gastroenterol 96:2609-2615.
[0276] Albert et al. (1992) J Virol 66:5627-5630.
[0277] Alexandrov eta. (2013a) Cell Rep 3:246-259.
[0278] Alexandrov et al. (2013b) Nature 500:415-421.
[0279] Alexay et al. (1996) Proc SPIE 2705, Fluorescence Detection IV, 6363.
[0280] Ausubel et al. (2002) Short Protocols in Molecular Biology, Fifth ed. Wiley, New York, N.Y., United States of America.
[0281] Ausubel et al. (2003) Current Protocols in Molecular Biology, John Wylie & Sons, Inc., New York, N.Y., United States of America.
[0282] Bachem et al. (2005) Gastroenterol 128:907-921.
[0283] Bardeesy et al. (2006) Genes Dev 20:3130-3146.
[0284] Bej et al. (1991) Appl Environ Microbiol 57:3529-3534.
[0285] Biankin et al. (2012) Nature 491:399-405.
[0286] Biton et al. (2014) Cell Rep 9:1235-1245.
[0287] Boom et al. (1990) J Clin Microbiol 28:495-503.
[0288] Boyle & Levin (2008) World Cancer Report 2008. Lyon, International Agency for Research on Cancer.
[0289] Breheny & Huang (2011) Ann Appl Stat 5:232-253.
[0290] Buffone et al. (1991) Clin Chem 37:1945-1949.
[0291] Busch et al. (1992) Transfusion 32:420-425.
[0292] Cancer Genome Atlas Research Network, The (2011) Nature 474:609-615.
[0293] Cancer Genome Atlas Research Network, The (2012a) Nature 487:330-337.
[0294] Cancer Genome Atlas Research Network, The (2012b) Nature 489:519-525.
[0295] Cancer Genome Atlas Research Network, The (2012c) Nature 490:61-70.
[0296] Cancer Genome Atlas Research Network, The (2013a) Nature 497:67-73.
[0297] Cancer Genome Atlas Research Network, The (2013b) Nature 499:43-49.
[0298] Cancer Genome Atlas Research Network, The (2013c) New Eng J Med 368:2059.
[0299] Cancer Genome Atlas Research Network, The (2014a) Nature 507:315-322.
[0300] Cancer Genome Atlas Research Network, The (2014b) Nature 511:543-550.
[0301] Carey et al. (2010) Nat Rev Clin Oncol 7:683-692.
[0302] Carter et al. (2012) Nat Biotechnol 30:413-421.
[0303] Cha & Thilly (1993) PCR Methods Appl 3:S18-S29.
[0304] Cleary et al. (2004) J Am Coll Surg 198:722-731.
[0305] Cohen et al. (2008) Pancreas 37:154-158.
[0306] Cohen et al. (2008) Pancreas 37:154-158.
[0307] Collisson et al. (2011) Nat Med 17:500-503.
[0308] Conlon et al. (1996) Ann Surg 223:273-279.
[0309] Conroy et al. (2011) New Eng J Med 364:1817-1825.
[0310] Conway et al. (2012) Bioinformatics 28:i172-i178.
[0311] Cousins et al. (1992) J Clin Microbiol 30:255-258.
[0312] Crnogorac-Jurcevic et al. (2002) Oncogene 21:4587-4594.
[0313] Dal Molin et al. (2015) Clin Cancer Res 21:1944-1950.
[0314] Damrauer et al. (2014) Proc Nat Acad Sci USA 111:3110-3115.
[0315] DeOliveira et al. (2006) Annals Surg 244:931-937.
[0316] DeRisi et al. (1996) Nat Genet 14:457-460.
[0317] Dubiley et al. (1997) Nucl Acids Res 25:2259-2265.
[0318] Duda et al. (2012) Pattern Classification. John Wiley & Sons, New York, N.Y., United States of America.
[0319] Eisenberg & Levanon (2003) Trends Genet 19:362-365.
[0320] Englert (2000) in Schena, ed., Microarray Biochip Technology, pp. 231-246, Eaton Publishing, Natick, Mass., United States of America.
[0321] Eppsteiner et al. (2009) Annals Surg 249:635-640.
[0322] Erkan et al. (2008) Clin Gastroenterol Hepatol 6:1155-1161.
[0323] Espejo et al. (2002) Biochem J 367:697-702.
[0324] Fang et al. (2002) Chembiochem 3:987-991.
[0325] Ferrone et al. (2008) J Gastrointest Surg 12:701-706.
[0326] Fodor et al. (1991) Science 251:767-773.
[0327] Fodor et al. (1993) Nature 364:555-556.
[0328] Froeling et al. (2011) Gastroenterol 141:1486-1497.
[0329] Garrido-Laguna et al. (2011) Clin Cancer Res 17:5793-5800.
[0330] Gress et al. (2001) Annals Internal Med 134:459-464.
[0331] Guedon et al. (2000) Anal Chem 72(24):6003-6009.
[0332] Haab et al. (2001) Genome Biol 2:RESEARCH0004.
[0333] Haeger et al. (2015) Oncogene Apr. 20. doi: 10.1038/onc.2015.112. [Epub ahead of print].
[0334] Hamel et al. (1995) J Clin Microbiol 33:287-291.
[0335] Han et al. (2006) Pancreas 32:271-275.
[0336] Heaton et al. (2001) Proc Natl Acad Sci USA 98(7):3701-3704.
[0337] Hermanson (1990) Bioconjugate Techniques, Academic Press, San Diego, Calif., United States of America.
[0338] Herrera et al. (2013) Clin Cancer Res 19:5914-5926.
[0339] Herrewegh et al. (1995) J Clin Microbiol 33:684-689.
[0340] Hoadley et al. (2014) Cell 158:929-944.
[0341] Houseman et al. (2002) Nat Biotechnol 20:270-274.
[0342] Hwang et al. (2008) Cancer Res 68:918-926.
[0343] Iacobuzio-Donahue et al. (2003) Am J Pathol 162:1151-1162.
[0344] Iacobuzio-Donahue et al. (2009) J Clin Oncol 27:1806-1813.
[0345] Ihle et al. (2012) J Natl Cancer Inst 104:228-239.
[0346] Isella et al. (2015) Nat Genet 47:312-319.
[0347] Izraeli et al. (1991) Nucl Acids Res 19:6051.
[0348] Ji et al. (2007) Nature 448:807-810.
[0349] Jones et al. (2008) Science 321:1801-1806.
[0350] Kim et al. (2013) Genome Biol 14:R36.
[0351] Kohsaka & Carson (1994) J Clin Lab Anal 8:452-455.
[0352] Krapp et al. (1998) Genes Dev 12:3752-3763.
[0353] Lanciotti et al. (1992) J Clin Microbiol 30:545-551.
[0354] Leek (2009) Bioinformatics 25:1203-1204.
[0355] Linz et al. (1990)J Clin Chem Clin Biochem 28:5-13.
[0356] Lisle et al. (2001) BioTechniques 30:1268-1272.
[0357] Liu & Hlady (1996) Colloids Surfaces B Biointerfaces 8:25-37.
[0358] Lockhart et al. (1996) Nat Biotechnol 14:1675-1680.
[0359] Logsdon et al. (2003) Cancer Res 63:2649-2657.
[0360] Louvet et al. (2005) J Clin Oncol 23:3509-3516.
[0361] MacBeath & Schreiber (2000) Science 289:1760-1763.
[0362] Mace et al. (2000) in Schena, ed., Microarray Biochip Technology, pp. 39-64, Eaton Publishing, Natick, Mass., United States of America.
[0363] Maier et al. (1994) J Biotechnol 35:191-203.
[0364] McCaustland et al. (1991) J Virol Methods 35:331-342.
[0365] McConkey et al. (2014) Eur Urol 66:609-910.
[0366] McGall et al. (1996) Proc Nat Acad Sci USA 93:13555-13460.
[0367] McLendon et al. (2008) Nature 455:1061-1068.
[0368] McPherson et al. (1995) PCR 2: A Practical Approach, IRL Press, New York, N.Y., United States of America.
[0369] Millar et al. (1995) Anal Biochem 226:325-330.
[0370] Natarajan et al. (1994) PCR Methods Appl 3:346-350.
[0371] Neel et al. (2014) Mol Cancer Ther 13:122-133.
[0372] Nelson et al. (2001) Anal Chem 73(1):1-7.
[0373] Neuhaus et al. (2008) J Clin Oncol May 20 Suppl; Abstr LBA4504.
[0374] Nones et al. (2014) Int J Cancer 135:1110-1118.
[0375] O'Donnell et al. (1997) Anal Chem 69:2438-2443.
[0376] Olive et al. (2009) Science 324:1457-1461.
[0377] Ozdemir et al. (2014) Cancer Cell 25:719-734.
[0378] Paladichuk (1999) The Scientist 13:20-23.
[0379] Parker et al. (2009) J Clin Oncol 27:1160-1167.
[0380] Parkin et al. (2005) CA Cancer J Clin 55:74-108.
[0381] Patil et al. (2015) Bioinformatics btv157 [Epub ahead of print].
[0382] PCT International Patent Application Publication Nos. WO 1993/009668; WO 1995/011755; WO 1997/014028; WO 1999/019515; WO 1999/032660; WO 1999/032660; WO 1999/063385; WO 2001/013120; WO 2001/014589; WO 2001/023082; WO 2004/046098; WO 2004/110244; WO 2006/089268; WO 2007/001324; WO 2007/056332; WO 2007/07025.
[0383] Pietu et al. (1996) Genome Res 6:492-503.
[0384] Prat et al. (2010) Breast Cancer Res 12:R68.
[0385] Randolph & Waggoner (1995) Nucl Acids Res 25:2923-2929.
[0386] Ratner & Castner (1997) in Vickerman, ed., Surface Analysis: The Principal Techniques, John Wiley & Sons, New York, N.Y., United States of America.
[0387] Rhim et al. (2014) Cancer Cell 16:735-747.
[0388] Robertson & Walsh-Weller (1998) Methods Mol Biol 98:121-154.
[0389] Rose (2000) in Schena, ed., Microarray Biochip Technology, pp. 19-38, Eaton Publishing, Natick, Mass., United States of America.
[0390] Roux (1995) PCR Methods Appl 4:S185-S194.
[0391] Rubio-Viqueira et al. (2006) Clin Cancer Res 12:4652-4661.
[0392] Rupp et al. (1988) BioTechniques 6:56-60.
[0393] Salisbury et al. (2002) J Am Chem Soc 124:14868-14870.
[0394] Sambrook & Russell (2001) Molecular Cloning: A Laboratory Manual, 3.sup.rd. Edition, Cold Spring Harbor Press, Cold Spring Harbor, N.Y., United States of America.
[0395] Sapolsky & Lipshutz (1996) Genomics 33:445-456.
[0396] Schena et al. (1995) Science 270:467-470.
[0397] Schena et al. (1996) Proc Natl Acad Sci USA 93:10614-10619.
[0398] Schnelldorfer et al. (2008) Ann Surg 247:456-462.
[0399] Seong (2002) Clin Diagn Lab Immunol 9:927-930.
[0400] Shalon et al. (1996) Genome Res 6:639-645.
[0401] Shi et al. (2010) Nat Biotechnol 28:827-838.
[0402] Shi et al. (2011) Bmc Bioinformatics 12:375.
[0403] Shoemaker et al. (1996) Nat Genet 14:450-456.
[0404] Shriver-Lake (1998) in Cass & Ligler, eds., Immobilized Biomolecules in Analysis, pp. 1-14, Oxford Press, Oxford, United Kingdom.
[0405] Silhavy et al. (1984) Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., United States of America.
[0406] Smith (1998a) The Scientist 12(14):21-24.
[0407] Smith et al. (1998b) Clin Chem 44(9):2054-2056.
[0408] Southern (1975) J Mol Biol 98:503-517.
[0409] Stolze et al. (2015) Sci Rep 5:8535.
[0410] Strain & Chmielewski (2001) BioTechniques 30(6):1286-1291.
[0411] Stratford et al. (2010) PLoS Med 7:e1000307.
[0412] Stuart et al. (2004) Proc Nat Acad Sci USA 101:615-620.
[0413] Subramanian et al. (2005) Proc Nat Acad Sci USA 102:15545-15550.
[0414] Tanaka et al. (1994) J Gen Virol 75:2691-2698.
[0415] Theriault et al. (1999) in Schena, ed., DNA Microarrays: A Practical Approach, pp. 101-120, Oxford University Press Inc., New York, N.Y., United States of America.
[0416] Tibshirani et al. (2002) Proc Nat Acad Sci USA 99:6567-6572.
[0417] Tijssen (ed.) (1993) Laboratory Techniques in Biochemistry and Molecular Biology: Hybridization With Nucleic Acid Probes, Part I: Part I. Theory and Nucleic Acid Preparation, Elsevier Press, New York, N.Y., United States of America.
[0418] Trapnell et al. (2012) Nature Protoc 7:562-578.
[0419] Tusher et al. (2001) Proc Natl Acad Sci USA 98:5116-5121.
[0420] U.S. Pat. Nos. 4,729,947; 5,143,854; 5,207,880; 5,230,781; 5,346,603; 5,360,523; 5,534,125; 5,571,388; 5,743,960; 5,800,992; 5,837,832; 5,843,767; 5,846,717; 5,871,918; 5,916,524; 5,965,352; 5,968,745; 5,974,164; 5,985,557; 5,994,069; 6,001,567; 6,017,696; 6,066,457; 6,086,737; 6,090,543; 6,123,819; 6,127,127; 6,162,603; 6,185,561; 6,225,059; 6,229,911; 6,245,508.
[0421] Van Kerckhoven et al. (1994) J Clin Microbiol 32:1669-1673.
[0422] Vignali (2000) J Immunol Methods 243(1-2):243-255.
[0423] Von Hoff et al. (2013) N Engl J Med 369:1691-703.
[0424] Vonlaufen et al. (2008) Cancer Res 68:2085-2093.
[0425] Waddell et al. (2015) Nature 518:495-501.
[0426] Wamunyokoli et al. (2006) Clin Cancer Res 12:690-700.
[0427] Wang et al. (1989) Proc Natl Acad Sci USA 86:9717-9721.
[0428] Wang et al. (2010) Cancer Res 70:6448-6455.
[0429] Whitfield et al. (2002)Mol Biol Cell 13:1977-2000.
[0430] Williams (1989) BioTechniques 7:762-769.
[0431] Williams et al. (1990) Nucl Acids Res 18(22):6531-6535.
[0432] Winter et al. (2006) J Gastrointest Surg 10:1199-1210; discussion 1210-1211.
[0433] Witkiewicz et al. (2015) Nature Commun 6:6744.
[0434] Worley et al. (2000) in Schena, ed., Microarray Biochip Technology, pp. 65-86, Eaton Publishing, Natick, Mass., United States of America.
[0435] Yachida et al. (2010) Nature 467:1114-1117.
[0436] Yang et al. (1998) Science 282:2244-2246.
[0437] Yermilov et al. (2009) Annals Surg Oncol 16:554-561.
[0438] Yershov et al. (1996) Proc Natl Acad Sci USA 93:4913-4918.
[0439] Yoshihara et al. (2013) Nat Commun 4:2612.
[0440] Zhang et al. (2008) Nat Genet 40:862-870.
[0441] Zhong et al. (2015) PLoS One 6:e22129.
[0442] Zhu et al. (2001) Science 293:2101-2105.
[0443] It will be understood that various details of the presently disclosed subject matter may be changed without departing from the scope of the presently disclosed subject matter. Furthermore, the foregoing description is for the purpose of illustration only, and not for the purpose of limitation.
Sequence CWU
1
1
12616012DNAHomo sapiens 1ataacctcca ctctgaaagc agtcttcaca gaaacttttc
acagaagtca aatagttaaa 60gcaaattcta gatacatggt agagaccagg agaaaatatg
aataactttc ttctaaacaa 120ggagctcagt ggataaacca tacctctaga ttccttgctt
ccattttccc agaagttttg 180gtagcaggat gatgttggcc tcataatgtg agttagagag
gagtccctct ttttcgactg 240tttggaattg tttcagaagg aatgctacca gctcctcttg
taccactggt agaattcagc 300tgtgaatctg tctggtcctg ggcttttttt gattgacaag
atgaggaaga gaaagatcag 360tgtgtgtcaa caaacttggg ccttattatg caagaacttt
cttaaaaaat ggagaatgaa 420aagagagtcc ttaatggaat ggctgaattc attgctccta
ctactttgtt tgtatatata 480tcctcatagt catcaagtaa atgatttttc ttcactgctt
accatggacc tgggacgggt 540agatacattt aatgaatcca gattttctgt tgtatacaca
cctgtcacca acacgaccca 600acagataatg aataaagtag cctctactcc cttcctggca
ggtaaagagg tcttgggact 660gccagatgag gaaagtatta aagaattcac agcaaattat
cctgaagaaa tagtaagagt 720cacctttact aatacatact catatcattt gaagttcttg
ctaggacatg gaatgccagc 780aaagaaggag cacaaggacc atacagctca ttgttatgaa
acaaatgaag atgtttactg 840tgaagtttca gtattttgga aggaaggttt tgtggctctt
caagctgcca ttaatgctgc 900tattatagaa atcacaacaa atcactcagt gatggaggag
ctgatgtcag ttactggaaa 960aaatatgaag atgcattcct tcattggtca atcaggagtt
ataactgatt tgtacctttt 1020ttcctgcatt atttcatttt cctcattcat ttactatgca
tctgttaatg tcacaagaga 1080gaggaaaagg atgaaggcct tgatgacaat gatgggtctt
cgggattcag cgttctggct 1140ctcctggggt ttgctctatg ctggtttcat cttcattatg
gcccttttct tggcacttgt 1200tataagatct acccagttta tcattttgtc tggcttcatg
gtagtcttca gcctctttct 1260cctgtatgga ttatctttgg tagctttggc tttcttaatg
agcatcttgg taaagaaatc 1320tttcctcacc ggcctggtcg tgttcctcct cactgtcttt
tgggggtgtc tggggttcac 1380atcactgtac agacaccttc ctgcatcctt ggagtggatt
ttaagcttgc ttagtccctt 1440tgccttcatg cttggaatgg cccagctttt acacttggac
tatgatttga attctaatgc 1500atttcctcat ccatcggacg gctcaaatct cattgtagca
acaaatttca tgttggcatt 1560tgacacttgc ctctatctgg cattggcgat ttactttgaa
aaaattttgc caaatgaata 1620tggacatcga cgtccacctt tgtttttcct gaagtcctca
ttttggtctc aaacacaaaa 1680gactgatcac gtggcccttg aagatgaaat ggatgccgat
ccttcatttc atgactcttt 1740tgaacaagcg cctccagaat tccaagggaa agaagccatc
agaatcagaa atgttacaaa 1800agaatataaa ggaaagcctg ataaaataga agccttgaaa
gatctggtat ttgacattta 1860cgaaggccaa atcactgcaa tacttggtca cagtggagct
ggaaagtcaa cactgctaaa 1920cattcttagt gggttgtctg ttcccaccaa aggttcagtc
accatctata acaataagct 1980ttcagaaatg gctgacctag aaaatctcag caagctgacc
ggagtttgtc cacaatccaa 2040tgtgcaattt gacttcctca ctgtaagaga aaacctcaga
ctctttgcta aaataaaagg 2100gattctgcca caagaagtgg ataaagagat acaaagggtt
ctgctggaat tggaaatgaa 2160aaatattcag gatgttcttg ctcaaaactt aagtggtgga
cagaaaagaa agctaacctt 2220tgggattgcc attttaggag atcctcagat tttcctgttg
gatgaaccaa ctgctggatt 2280ggatcccttt tcaagacacc aagtatggaa ccttctgaaa
gaacgcaaaa cagaccgcgt 2340gatcctcttc agtacccagt tcatggatga ggccgacatc
ctggcggaca ggaaagtatt 2400tctctcccaa gggaagctaa agtgcgcggg ctcttctttg
tttctaaaga agaaatgggg 2460gattggatat cacttaagct tgcagttaaa tgaaatatgt
gttgaggaaa acataacatc 2520acttgttaaa cagcacatcc ctgatgccaa attatcagcc
aaaagcgaag gaaaacttat 2580ttatacatta cccttagaaa gaacaaataa atttccagaa
ctttacaagg atcttgatag 2640ctatcctgac ctaggaattg agaattatgg tgtttccatg
acaactttga atgaagtatt 2700cctgaagcta gaaggaaaat ctacaattaa tgaatcggac
attgctattt tgggagaagt 2760acaagcggaa aaagctgacg acactgaaag gcttgttgag
atggaacaag tcctctcttc 2820acttaacaag atgagaaaga caataggtgg tgtggctctc
tggcgacagc aaatctgcgc 2880aattgcaagg gttcgcttgt taaagttaaa gcatgaaaga
aaagctcttt tagcactgct 2940attaattcta atggctggat tttgccctct tcttgtggag
tataccatgg tgaaaatata 3000tcaaaacagt tacacctggg aactttctcc tcatttgtat
ttccttgctc ctggacaaca 3060accacatgac cctctcactc aactactgat catcaataaa
acaggggcaa gcattgatga 3120ctttatacag tctgtggagc accagaacat agctttagaa
gtggatgcat ttggaactag 3180aaatggcaca gatgacccat cttataatgg agccatcaca
gtgtgttgta atgaaaagaa 3240ttacagcttt tcgttagcat gcaatgccaa aagattgaat
tgcttcccag ttcttatgga 3300cattgttagt aatgggctac ttggaatggt taaaccatca
gtacatatcc gaactgaaag 3360aagtacattt ttggagaatg gacaggacaa tccaatcgga
ttcctggcat atatcatgtt 3420ctggctggtt ttaacatcga gttgcccacc ttacattgcc
atgagcagca tcgatgatta 3480taagaacaga gctcggtccc agctacggat ttccggactc
tccccttctg cttactggtt 3540tgggcaggcg ctggtggatg tttccctgta cttcttggtc
ttcgttttta tatatttaat 3600gagctacatt tcaaacttcg aagacatgct acttacaata
attcatatta ttcaaatccc 3660atgtgctgtt ggttattcct tttccctcat cttcatgaca
tacgtgattt ccttcatctt 3720tcgcaagggg agaaaaaata gtggcatttg gtcattttgt
ttctatgttg tcactgtatt 3780ctctgtggct ggatttgcgt tcagtatctt cgaaagtgat
attccattta tcttcacttt 3840tttaatacca cctgccacaa tgattggctg tttgttctta
tcttctcatc ttctcttttc 3900ttctctcttt tctgaagaac gaatggatgt acagccattt
ctggtattcc taattccttt 3960ccttcatttt atcatttttc tttttactct tcgatgtctg
gaatggaagt ttggaaagaa 4020atcaatgaga aaggatcctt tctttagaat ttctccaaga
agtagtgatg tgtgtcaaaa 4080tccagaagaa ccagaaggag aggatgaaga tgttcagatg
gaaagagtga gaacagcaaa 4140tgccttgaat tctactaatt ttgatgagaa gccagtcatc
attgccagct gtctacgcaa 4200ggagtatgca gggaagagga aaggctgttt ttccaagagg
aagaataaga tagccacgag 4260aaatgtctcc ttctgtgtta gaaaaggtga agttttagga
ttattaggac acaatggagc 4320tggtaaaagc acatccatta aggtgataac tggagacaca
aaaccaactg ctggacaagt 4380gctactgaaa gggagcggtg gaggggatgc cctggagttc
ctggggtact gccctcagga 4440gaacgcgctg tggcccaacc tgacagtgag gcagcacctg
gaggtgtacg ccgccgtgaa 4500agggctgagg aaaggggatg ctgaggttgc catcacacgg
ttagtggatg cgctcaagct 4560gcaggaccag ctgaagtctc ccgtgaagac cttgtcagag
ggaataaaga gaaagctgtg 4620ctttgtcctg agcatactgg ggaacccgtc agtggtgctt
ctggatgagc cgtcgaccgg 4680gatggacccc gaggggcagc agcaaatgtg gcaggccatc
cgggccacct ttagaaacac 4740ggaaaggggt gccctcctaa ccacccacta catggcagag
gctgaggccg tgtgtgaccg 4800agtggccatc atggtatctg ggaggttgag atgtatcggt
tccatccaac acctgaaaag 4860caaatttggc aaagattacc tgctggagat gaaggtgaag
aacctggcac aagtggagcc 4920cctccatgca gagatcctga ggcttttccc ccaggctgct
cggcaggaaa ggtactcctc 4980tctgatggtt tataagttgc cagtggaaga tgtgcaacct
ttagcccaag ctttcttcaa 5040attagagaag gttaaacaga gctttgacct agaggagtac
agcctctcac agtctaccct 5100ggagcaggtt ttcctggagc tctccaagga gcaggagctg
ggtgattttg aggaggattt 5160tgatccctca gtgaagtgga agctcctccc ccaggaagag
ccttaaaacc ccaaattctg 5220tgttcctgtt taaacccgtg gtttttttta aatacattta
tttttatagc agcaatgttc 5280tatttttaga aactatatta taagtacaga aatggttctc
cgtgtggtgg gaggaggagg 5340ttcgggtgct gggtaagtgc catgtcagtg tggacagagg
catttgacta agccaacctc 5400ctctcacagc ctctgtatct ctgcaggcca tactggttcc
attgttctgt ataatactga 5460ataaataaat ttacttttac atgatcgtat aagtttctag
ataagataaa caaattttgt 5520ttaaattttt ttaataaaaa tcttaaaaca ctttttttct
aacctagact gagaaattca 5580tgtttacttt tctaggtgta tgatactttg taaagttgat
actttcctaa gaatttaaca 5640tgtcatattt ttgaaataga tttaagtgtg cttcttattg
ctaaaaatac taaatgtcat 5700gggtcatagt atctgatatc aatatcgttg ataacatatc
cacaggtaac accatgatgt 5760aggcataaat ggaaaacaaa aaccctacta tttcaaatat
attgtacttt tttatttctg 5820taagccaact gtgtgccatt ttcactggac ttttaaatct
agactttagt gatgtctaca 5880ttgtaaatga tcttttgtgg atatttgtca cttggtttca
gaaagttcac aaatgtagca 5940acagctcaca tgactgagta ggtagaaaat gtgaaataaa
tctcatatat atagttttga 6000aataaaaaaa aa
601221345DNAHomo sapiens 2gcctctgggg ttttatattg
ctctggtatt catgccaaag acacaccagc cctcagtcac 60tgggagaaga acctctcata
ccctcggtgc tccagtcccc agctcactca gccacacaca 120ccatgtgtga agaggagacc
accgcgctcg tgtgtgacaa tggctctggc ctgtgcaagg 180caggcttcgc aggagatgat
gccccccggg ctgtcttccc ctccattgtg ggccgccctc 240gccaccaggg tgtgatggtg
ggaatgggcc agaaagacag ctatgtgggg gatgaggctc 300agagcaagcg agggatccta
actctcaaat accccattga acacggcatc atcaccaact 360gggatgacat ggagaagatc
tggcaccact ccttctacaa tgagctgcgt gtagcacctg 420aagagcaccc caccctgctc
acagaggctc ccctaaatcc caaggccaac agggaaaaga 480tgacccagat catgtttgaa
accttcaatg tccctgccat gtacgtcgcc attcaagctg 540tgctctccct ctatgcctct
ggccgcacga caggcatcgt cctggattca ggtgatggcg 600tcacccacaa tgtccccatc
tatgaaggct atgccctgcc ccatgccatc atgcgcctgg 660acttggctgg ccgtgacctc
acggactacc tcatgaagat cctcacagag agaggctatt 720cctttgtgac cacagctgag
agagaaattg tgcgagacat caaggagaag ctgtgctatg 780tggccctgga ttttgagaat
gagatggcca cagcagcttc ctcttcctcc ctggagaaga 840gctatgagct gccagatggg
caggttatca ccattggcaa tgagcgcttc cgctgccctg 900agaccctctt ccagccttcc
tttattggca tggagtccgc tggaattcat gagacaacct 960acaattccat catgaagtgt
gacattgaca tccgtaagga cttatatgcc aacaatgtcc 1020tctctggggg caccaccatg
taccctggca ttgctgacag gatgcagaag gagatcacag 1080ccctggcccc cagcaccatg
aagatcaaga ttattgctcc cccagagcgg aagtactcag 1140tctggatcgg gggctctatc
ctggcctctc tctccacctt ccagcagatg tggatcagca 1200agcctgagta tgatgaggca
gggccctcca ttgtccacag gaagtgcttc taaagtcaga 1260acaggttctc caaggatccc
ctcgagacta ctctgttacc agtcatgaaa cattaaaacc 1320tacaagcctt aaaaaaaaaa
aaaaa 134534670DNAHomo sapiens
3gcactcgctg gaaagcggct ccgagccagg ggctattgca aagccagggt gcgctaccgg
60acggagaggg gagagccctg agcagagtga gcaacatcgc agccaaggcg gaggccgaag
120aggggcgcca ggcaccaatc tccgcgttgc ctcagccccg gaggcgcccc agagcgcttc
180ttgtcccagc agagccactc tgcctgcgcc tgcctctcag tgtctccaac tttgcgctgg
240aagaaaaact tcccgcgcgc cggcagaact gcagcgcctc cttttagtga ctccgggagc
300ttcggctgta gccggctctg cgcgcccttc caacgaataa tagaaattgt taattttaac
360aatccagagc aggccaacga ggctttgctc tcccgacccg aactaaaggt ccctcgctcc
420gtgcgctgct acgagcggtg tctcctgggg ctccaatgca gcgagctgtg cccgaggggt
480tcggaaggcg caagctgggc agcgacatgg ggaacgcgga gcgggctccg gggtctcgga
540gctttgggcc cgtacccacg ctgctgctgc tcgccgcggc gctactggcc gtgtcggacg
600cactcgggcg cccctccgag gaggacgagg agctagtggt gccggagctg gagcgcgccc
660cgggacacgg gaccacgcgc ctccgcctgc acgcctttga ccagcagctg gatctggagc
720tgcggcccga cagcagcttt ttggcgcccg gcttcacgct ccagaacgtg gggcgcaaat
780ccgggtccga gacgccgctt ccggaaaccg acctggcgca ctgcttctac tccggcaccg
840tgaatggcga tcccagctcg gctgccgccc tcagcctctg cgagggcgtg cgcggcgcct
900tctacctgct gggggaggcg tatttcatcc agccgctgcc cgccgccagc gagcgcctcg
960ccaccgccgc cccaggggag aagccgccgg caccactaca gttccacctc ctgcggcgga
1020atcggcaggg cgacgtcggc ggcacgtgcg gggtcgtgga cgacgagccc cggccgactg
1080ggaaagcgga gaccgaagac gaggacgaag ggactgaggg cgaggacgaa ggggctcagt
1140ggtcgccgca ggacccggca ctgcaaggcg taggacagcc cacaggaact ggaagcataa
1200gaaagaagcg atttgtgtcc agtcaccgct atgtggaaac catgcttgtg gcagaccagt
1260cgatggcaga attccacggc agtggtctaa agcattacct tctcacgttg ttttcggtgg
1320cagccagatt gtacaaacac cccagcattc gtaattcagt tagcctggtg gtggtgaaga
1380tcttggtcat ccacgatgaa cagaaggggc cggaagtgac ctccaatgct gccctcactc
1440tgcggaactt ttgcaactgg cagaagcagc acaacccacc cagtgaccgg gatgcagagc
1500actatgacac agcaattctt ttcaccagac aggacttgtg tgggtcccag acatgtgata
1560ctcttgggat ggctgatgtt ggaactgtgt gtgatccgag cagaagctgc tccgtcatag
1620aagatgatgg tttacaagct gccttcacca cagcccatga attaggccac gtgtttaaca
1680tgccacatga tgatgcaaag cagtgtgcca gccttaatgg tgtgaaccag gattcccaca
1740tgatggcgtc aatgctttcc aacctggacc acagccagcc ttggtctcct tgcagtgcct
1800acatgattac atcatttctg gataatggtc atggggaatg tttgatggac aagcctcaga
1860atcccataca gctcccaggc gatctccctg gcacctcgta cgatgccaac cggcagtgcc
1920agtttacatt tggggaggac tccaaacact gccccgatgc agccagcaca tgtagcacct
1980tgtggtgtac cggcacctct ggtggggtgc tggtgtgtca aaccaaacac ttcccgtggg
2040cggatggcac cagctgtgga gaagggaaat ggtgtatcaa cggcaagtgt gtgaacaaaa
2100ccgacagaaa gcattttgat acgccttttc atggaagctg gggaatgtgg gggccttggg
2160gagactgttc gagaacgtgc ggtggaggag tccagtacac gatgagggaa tgtgacaacc
2220cagtcccaaa gaatggaggg aagtactgtg aaggcaaacg agtgcgctac agatcctgta
2280accttgagga ctgtccagac aataatggaa aaacctttag agaggaacaa tgtgaagcac
2340acaacgagtt ttcaaaagct tcctttggga gtgggcctgc ggtggaatgg attcccaagt
2400acgctggcgt ctcaccaaag gacaggtgca agctcatctg ccaagccaaa ggcattggct
2460acttcttcgt tttgcagccc aaggttgtag atggtactcc atgtagccca gattccacct
2520ctgtctgtgt gcaaggacag tgtgtaaaag ctggttgtga tcgcatcata gactccaaaa
2580agaagtttga taaatgtggt gtttgcgggg gaaatggatc tacttgtaaa aaaatatcag
2640gatcagttac tagtgcaaaa cctggatatc atgatatcat cacaattcca actggagcca
2700ccaacatcga agtgaaacag cggaaccaga ggggatccag gaacaatggc agctttcttg
2760ccatcaaagc tgctgatggc acatatattc ttaatggtga ctacactttg tccaccttag
2820agcaagacat tatgtacaaa ggtgttgtct tgaggtacag cggctcctct gcggcattgg
2880aaagaattcg cagctttagc cctctcaaag agcccttgac catccaggtt cttactgtgg
2940gcaatgccct tcgacctaaa attaaataca cctacttcgt aaagaagaag aaggaatctt
3000tcaatgctat ccccactttt tcagcatggg tcattgaaga gtggggcgaa tgttctaagt
3060catgtgaatt gggttggcag agaagactgg tagaatgccg agacattaat ggacagcctg
3120cttccgagtg tgcaaaggaa gtgaagccag ccagcaccag accttgtgca gaccatccct
3180gcccccagtg gcagctgggg gagtggtcat catgttctaa gacctgtggg aagggttaca
3240aaaaaagaag cttgaagtgt ctgtcccatg atggaggggt gttatctcat gagagctgtg
3300atcctttaaa gaaacctaaa catttcatag acttttgcac aatggcagaa tgcagttaag
3360tggtttaagt ggtgttagct ttgagggcaa ggcaaagtga ggaagggctg gtgcagggaa
3420agcaagaagg ctggagggat ccagcgtatc ttgccagtaa ccagtgaggt gtatcagtaa
3480ggtgggatta tgggggtaga tagaaaagga gttgaatcat cagagtaaac tgccagttgc
3540aaatttgata ggatagttag tgaggattat taacctctga gcagtgatat agcataataa
3600agccccgggc attattatta ttatttcttt tgttacatct attacaagtt tagaaaaaac
3660aaagcaattg tcaaaaaaag ttagaactat tacaacccct gtttcctggt acttatcaaa
3720tacttagtat catgggggtt gggaaatgaa aagtaggaga aaagtgagat tttactaaga
3780cctgttttac tttacctcac taacaatggg gggagaaagg agtacaaata ggatctttga
3840ccagcactgt ttatggctgc tatggtttca gagaatgttt atacattatt tctaccgaga
3900attaaaactt cagattgttc aacatgagag aaaggctcag caacgtgaaa taacgcaaat
3960ggcttcctct ttcctttttt ggaccatctc agtctttatt tgtgtaattc attttgagga
4020aaaaacaact ccatgtattt attcaagtgc attaaagtct acaatggaaa aaaagcagtg
4080aagcattaga tgctggtaaa agctagagga gacacaatga gcttagtacc tccaacttcc
4140tttctttcct accatgtaac cctgctttgg gaatatggat gtaaagaagt aacttgtgtc
4200tcatgaaaat cagtacaatc acacaaggag gatgaaacgc cggaacaaaa atgaggtgtg
4260tagaacaggg tcccacaggt ttggggacat tgagatcact tgtcttgtgg tggggaggct
4320gctgaggggt agcaggtcca tctccagcag ctggtccaac agtcgtatcc tggtgaatgt
4380ctgttcagct cttctgtgag aatatgattt tttccatatg tatatagtaa aatatgttac
4440tataaattac atgtacttta taagtattgg tttgggtgtt ccttccaaga aggactatag
4500ttagtaataa atgcctataa taacatattt atttttatac atttatttct aatgaaaaaa
4560acttttaaat tatatcgctt ttgtggaagt gcatataaaa tagagtattt atacaatata
4620tgttactaga aataaaagaa cacttttgga aaaaaaaaaa aaaaaaaaaa
46704996DNAHomo sapiens 4aatcacttgg ggaaaggaag gttcgtttct gagttagcaa
caagtaaatg cagcactagt 60gggtgggatt gaggtatgcc ctggtgcata aatagagact
cagctgtgct ggcacactca 120gaagcttgga ccgcatccta gccgccgact cacacaaggc
aggtgggtga ggaaatccag 180agttgccatg gagaaaattc cagtgtcagc attcttgctc
cttgtggccc tctcctacac 240tctggccaga gataccacag tcaaacctgg agccaaaaag
gacacaaagg actctcgacc 300caaactgccc cagaccctct ccagaggttg gggtgaccaa
ctcatctgga ctcagacata 360tgaagaagct ctatataaat ccaagacaag caacaaaccc
ttgatgatta ttcatcactt 420ggatgagtgc ccacacagtc aagctttaaa gaaagtgttt
gctgaaaata aagaaatcca 480gaaattggca gagcagtttg tcctcctcaa tctggtttat
gaaacaactg acaaacacct 540ttctcctgat ggccagtatg tccccaggat tatgtttgtt
gacccatctc tgacagttag 600agccgatatc actggaagat attcaaatcg tctctatgct
tacgaacctg cagatacagc 660tctgttgctt gacaacatga agaaagctct caagttgctg
aagactgaat tgtaaagaaa 720aaaaatctcc aagcccttct gtctgtcagg ccttgagact
tgaaaccaga agaagtgtga 780gaagactggc tagtgtggaa gcatagtgaa cacactgatt
aggttatggt ttaatgttac 840aacaactatt ttttaagaaa aacaagtttt agaaatttgg
tttcaagtgt acatgtgtga 900aaacaatatt gtatactacc atagtgagcc atgattttct
aaaaaaaaaa ataaatgttt 960tgggggtgtt ctgttttctc caaaaaaaaa aaaaaa
9965763DNAHomo sapiens 5agaaacatcc agaatacatt
tccaacaaga gcactggcca agtcagcttc ttctgagaga 60gtctctagaa gacatgatgc
tacactcagc tttgggtctc tgcctcttac tcgtcacagt 120ttcttccaac cttgccattg
caataaaaaa ggaaaagagg cctcctcaga cactctcaag 180aggatgggga gatgacatca
cttgggtaca aacttatgaa gaaggtctct tttatgctca 240aaaaagtaag aagccattaa
tggttattca tcacctggag gattgtcaat actctcaagc 300actaaagaaa gtatttgccc
aaaatgaaga aatacaagaa atggctcaga ataagttcat 360catgctaaac cttatgcatg
aaaccactga taagaattta tcacctgatg ggcaatatgt 420gcctagaatc atgtttgtag
acccttcttt aacagttaga gctgacatag ctggaagata 480ctctaacaga ttgtacacat
atgagcctcg ggatttaccc ctattgatag aaaacatgaa 540gaaagcatta agacttattc
agtcagagct ataagagatg atagaaaaaa gccttcactt 600caaagaagtc aaatttcatg
aagaaaacct ctggcacatt gacaaatact aaatgtgcaa 660gtatatagat tttgtaatat
tactatttag tttttttaat gtgtttgcaa tagtcttatt 720aaaataaatg ttttttaaat
ctgagactga aaaaaaaaaa aaa 76362307DNAHomo sapiens
6cagccatggt aggggtggag gtacaggcag caaacaatat ttaagatgct gacttgtgga
60gcattcgggc ttggaaggaa agctataggc tacccattca gctcccctgt cagagactca
120agctttgaga aaggctagca aagagcaagg aaagagagaa aacaacaaag tggcgaggcc
180ctcagagtga aagcgtaagg ttcagtcagc ctgctgcagc tttgcagacc tcagctgggc
240atctccagac tcccctgaag gaagagcctt cctcacccaa acccacaaaa gatgctgaaa
300aagcctctct cagctgtgac ctggctctgc attttcatcg tggcctttgt cagccaccca
360gcgtggctgc agaagctctc taagcacaag acaccagcac agccacagct caaagcggcc
420aactgctgtg aggaggtgaa ggagctcaag gcccaagttg ccaaccttag cagcctgctg
480agtgaactga acaagaagca ggagagggac tgggtcagcg tggtcatgca ggtgatggag
540ctggagagca acagcaagcg catggagtcg cggctcacag atgctgagag caagtactcc
600gagatgaaca accaaattga catcatgcag ctgcaggcag cacagacggt cactcagacc
660tccgcagatg ccatctacga ctgctcttcc ctctaccaga agaactaccg catctctgga
720gtgtataagc ttcctcctga tgacttcctg ggcagccctg aactggaggt gttctgtgac
780atggagactt caggcggagg ctggaccatc atccagagac gaaaaagtgg ccttgtctcc
840ttctaccggg actggaagca gtacaagcag ggctttggca gcatccgtgg ggacttctgg
900ctggggaacg aacacatcca ccggctctcc agacagccaa cccggctgcg tgtagagatg
960gaggactggg agggcaacct gcgctacgct gagtatagcc actttgtttt gggcaatgaa
1020ctcaacagct atcgcctctt cctggggaac tacactggca atgtggggaa cgacgccctc
1080cagtatcata acaacacagc cttcagcacc aaggacaagg acaatgacaa ctgcttggac
1140aagtgtgcac agctccgcaa aggtggctac tggtacaact gctgcacaga ctccaacctc
1200aatggagtgt actaccgcct gggtgagcac aataagcacc tggatggcat cacctggtat
1260ggctggcatg gatctaccta ctccctcaaa cgggtggaga tgaaaatccg cccagaagac
1320ttcaagcctt aaaaggaggc tgccgtggag cacggataca gaaactgaga cacgtggaga
1380ctggatgagg gcagatgagg acaggaagag agtgttagaa agggtaggac tgagaaacag
1440cctataatct ccaaagaaag aataagtctc caaggagcac aaaaaaatca tatgtaccaa
1500ggatgttaca gtaaacagga tgaactattt aaacccactg ggtcctgcca catccttctc
1560aaggtggtag actgagtggg gtctctctgc ccaagatccc tgacatagca gtagcttgtc
1620ttttccacat gatttgtctg tgaaagaaaa taattttgag atcgttttat ctattttctc
1680tacggcttag gctatgtgag ggcaaaacac aaatcccttt gctaaaaaga accatattat
1740tttgattctc aaaggatagg cctttgagtg ttagagaaag gagtgaagga ggcaggtggg
1800aaatggtatt tctattttta aatccagtga aattatcttg agtctacaca ttatttttaa
1860aacacaaaaa ttgttcggct ggaactgacc caggctggac ttgcggggag gaaactccag
1920ggcactgcat ctggcgatca gactctgagc actgcccctg ctcgccttgg tcatgtacag
1980cactgaaagg aatgaagcac cagcaggagg tggacagagt ctctcatgga tgccggcaca
2040aaactgcctt aaaatattca tagttaatac aggtatatct atttttattt actttgtaag
2100aaacaagctc aaggagcttc cttttaaatt ttgtctgtag gaaatggttg aaaactgaag
2160gtagatggtg ttatagttaa taataaatgc tgtaaataag catctcactt tgtaaaaata
2220aaatattgtg gttttgtttt aaacattcaa cgtttctttt ccttctacaa taaacacttt
2280caaaatgtga aaaaaaaaaa aaaaaaa
230772100DNAHomo sapiens 7gagttgtgta attccccaga gcaggcctgg gcagtgtctg
ggtggggcct gggagccaca 60ggagacgccc aaagccaggc agagcccggg ggcgaggggg
cggcaggcag gtgtagcgct 120gccctgggag ggcttgcacc cccacaccca agtgagcggc
ctgctcactc ctcagctgca 180ggagccagac gtgtggagtc ccagcagagg ccaacctgtg
tctcttcatc tccgtgagaa 240aggtgccccc gaagtgaaag agatggcctg gtggaaagcc
tggattgaac aggagggtgt 300cacagtgaag agcagctccc acttcaaccc agaccctgat
gcagagaccc tctacaaagc 360catgaagggg atcgggacca acgagcaggc tatcatcgat
gtgctcacca agagaagcaa 420cacgcagcgg cagcagatcg ccaagtcctt caaggctcag
ttcggcaagg acctcactga 480gaccttgaag tctgagctca gtggcaagtt tgagaggctc
attgtggccc ttatgtatcc 540gccatacaga tacgaagcca aggagctgca tgacgccatg
aagggcttag gaaccaagga 600gggtgtcatc attgagatcc tggcctctcg gaccaagaac
cagctgcggg agataatgaa 660ggcgtatgag gaagactatg ggtccagcct ggaggaggac
atccaagcag acacaagtgg 720ctacctggag aggatcctgg tgtgcctcct gcagggcagc
agggatgatg tgagcagctt 780tgtggacccg gcactggccc tccaagacgc acaggatctg
tatgcggcag gcgagaagat 840tcgtgggact gatgagatga aattcatcac catcctgtgc
acgcgcagtg ccactcacct 900gctgagagtg tttgaagagt atgagaaaat tgccaacaag
agcattgagg acagcatcaa 960gagtgagacc catggctcac tggaggaggc catgctcact
gtggtgaaat gcacccaaaa 1020cctccacagc tactttgcag agagactcta ctatgccatg
aagggagcag ggacgcgtga 1080tgggaccctg ataagaaaca tcgtttcaag gagcgagatt
gacttaaatc ttatcaaatg 1140tcacttcaag aagatgtacg gcaagaccct cagcagcatg
atcatggaag acaccagcgg 1200cgactacaag aacgccctgc tgagcctggt gggcagcgac
ccctgaggca cagaagaaca 1260agagcaaaga ccatgaagcc agagtctcca ggactcctca
ctcaacctcg gccatggacg 1320caggttgggt gtgagggggg tcccagcctt tcggtcttct
atttccctat ttccagtgct 1380ttccagccgg gtttctgacc cagagggtgg aaccggcctg
gactcctctt cccaacttcc 1440tccaggtcat ttcccagtgt gagcacaatg ccaaccttag
tgtttctcca gccagacaga 1500tgcctcagca tgaagggctt ggggacttgt ggatcattcc
ttcctccctg caggagcttc 1560ccaagctggt cacagagtct cctgggcaca ggttatacag
accccagccc cattcccatc 1620tactgaaaca gggtctccac aagaggggcc agggaatatg
ggtttttaac aagcgtctta 1680caaaacactt ctctatcatg cagccggaga gctggctggg
agcccttttg ttttagaaca 1740cacatccttc agcagctgag aaatgaacac gaatccatcc
caaccgagat gccattaaca 1800ttcatctaaa aatgttaggc tctaaatgga cgaaaaattc
tctcgccatc ttaataacaa 1860aataaactac aaattcctga cccaaggaca ctgtgttata
agaggcgtgg gctcccctgg 1920tggctgacca ggtcagctgc cctggccttg cacccctctg
catgcagcac agaagggtgt 1980gaccatgccc tcagcaccac tcttgtcccc actgaacggc
aactgagact gggtacctgg 2040agattctgaa gtgcctttgc tgtggttttc aaaataataa
agatttgtat tcaactcaaa 210081447DNAHomo sapiens 8atccagattt gcttttacat
tttcttgcct gagtctgagg tgaacagtga acatatttac 60atttgattta acagtgaacc
ttaattcttt ctggcttcac agtgaaacaa gtttatgcaa 120tcgatcaaat attttcatcc
ctgaggttaa caattaccat caaaatgttt tgtggagact 180atgtgcaagg aaccatcttc
ccagctccca atttcaatcc cataatggat gcccaaatgc 240taggaggagc actccaagga
tttgactgtg acaaagacat gctgatcaac attctgactc 300agcgctgcaa tgcacaaagg
atgatgattg cagaggcata ccagagcatg tatggccggg 360acctgattgg ggatatgagg
gagcagcttt cggatcactt caaagatgtg atggctggcc 420tcatgtaccc accaccactg
tatgatgctc atgagctctg gcatgccatg aagggagtag 480gcactgatga gaattgcctc
attgaaatac tagcttcaag aacaaatgga gaaattttcc 540agatgcgaga agcctactgc
ttgcaataca gcaataacct ccaagaggac atttattcag 600agacctcagg acacttcaga
gatactctca tgaacttggt ccaggggacc agagaggaag 660gatatacaga ccctgcgatg
gctgctcagg atgcaatggt cctatgggaa gcctgtcagc 720agaagacggg ggagcacaaa
accatgctgc aaatgatcct gtgcaacaag agctaccagc 780agctgcggct ggttttccag
gaatttcaaa atatttctgg gcaagatatg gtagatgcca 840ttaatgaatg ttatgatgga
tactttcagg agctgctggt tgcaattgtt ctctgtgttc 900gagacaaacc agcctatttt
gcttatagat tatatagtgc aattcatgac tttggtttcc 960ataataaaac tgtaatcagg
attctcattg ccagaagtga aatagacctg ctgaccataa 1020ggaaacgata caaagagcga
tatggaaaat ccctatttca tgatatcaga aattttgctt 1080cagggcatta taagaaagca
ctgcttgcca tctgtgctgg tgatgctgag gactactaaa 1140atgaagagga cttggagtac
tgtgcactcc tctttctaga cacttccaaa tagagatttt 1200ctcacaaatt tgtactgttc
atggcactat taacaaaact atacaatcat attttctctt 1260ctatctttga aattattcta
agccaaagaa aactatgaat gaaagtatat gatactgaat 1320ttgcctacta tcctgaattt
gcctactatc taatcagcaa ttaaataaat tgtgcatgat 1380ggaataatag aaaaattgca
ttggaataga ttttatttaa atgtgaacca tcaacaacct 1440acaacaa
144791290DNAHomo sapiens
9ggctgagcct ataaagcggc aggtgcgcgc cgccctacag acgttcgcac acctgggtgc
60cagcgcccca gaggtcccgg gacagcccga ggcgccgcgc ccgccgcccc gagctcccca
120agccttcgag agcggcgcac actcccggtc tccactcgct cttccaacac ccgctcgttt
180tggcggcagc tcgtgtccca gagaccgagt tgccccagag accgagacgc cgccgctgcg
240aaggaccaat gagagccccg ctgctaccgc cggcgccggt ggtgctgtcg ctcttgatac
300tcggctcagg ccattatgct gctggattgg acctcaatga cacctactct gggaagcgtg
360aaccattttc tggggaccac agtgctgatg gatttgaggt tacctcaaga agtgagatgt
420cttcagggag tgagatttcc cctgtgagtg aaatgccttc tagtagtgaa ccgtcctcgg
480gagccgacta tgactactca gaagagtatg ataacgaacc acaaatacct ggctatattg
540tcgatgattc agtcagagtt gaacaggtag ttaagccccc ccaaaacaag acggaaagtg
600aaaatacttc agataaaccc aaaagaaaga aaaagggagg caaaaatgga aaaaatagaa
660gaaacagaaa gaagaaaaat ccatgtaatg cagaatttca aaatttctgc attcacggag
720aatgcaaata tatagagcac ctggaagcag taacatgcaa atgtcagcaa gaatatttcg
780gtgaacggtg tggggaaaag tccatgaaaa ctcacagcat gattgacagt agtttatcaa
840aaattgcatt agcagccata gctgccttta tgtctgctgt gatcctcaca gctgttgctg
900ttattacagt ccagcttaga agacaatacg tcaggaaata tgaaggagaa gctgaggaac
960gaaagaaact tcgacaagag aatggaaatg tacatgctat agcataactg aagataaaat
1020tacaggatat cacattggag tcactgccaa gtcatagcca taaatgatga gtcggtcctc
1080tttccagtgg atcataagac aatggaccct ttttgttatg atggttttaa actttcaatt
1140gtcacttttt atgctatttc tgtatataaa ggtgcacgaa ggtaaaaagt attttttcaa
1200gttgtaaata atttatttaa tatttaatgg aagtgtattt attttacagc tcattaaact
1260tttttaacca aacagaaaaa aaaaaaaaaa
1290101530DNAHomo sapiens 10actccaggag ctgcagcaga gcaggtaaca gctcttgcac
ctgtttctct tgcacctgac 60gtgcagctgc tcctacccac ctctcctggc tgagccttgc
ctgatacagc agcccggagg 120caccacttgc ttcccgagtc tcaccctccc aggcagctcc
tacactcaac tgcttctcta 180ggaaaggtct cacctccagc ctggagcagt cgggattaca
gaaagcccca tccttggctt 240agggagcgcc atgacgactg aaattggttg gtggaagctg
actttcctcc ggaaaaagaa 300atccactccc aaagtgctgt atgagatccc tgacacctat
gcccaaacag agggagatgc 360agaacccccg aggcctgacg ctggaggccc caacagcgac
tttaacaccc gcctggagaa 420gattgtggac aagagcacaa agggcaagca cgtcaaggtc
tccaactcag gacgcttcaa 480ggagaagaag aaagtgagag ccacgctggc agagaaccct
aacctctttg atgatcacga 540ggaaggacgg tcatcaaagt gaagggctga ggagggtgct
agcacctctt ggctccctgc 600catcagccag atctgagaca ggaccttgcc acgctggcct
ctttggccat agctgaagct 660gtggggccag ttgatacctg ctggcaggaa atggctgttt
tttaggtttg tatttatgtg 720ccgccacttt tgtaaggcct gggagatccc agggtcctcc
accctccccc tgaccacata 780caaaggcact ctagttcaag agtgaaaagt ctcacccagg
aggaacagcc ctccttgaag 840caatggcagg gccagcaggg aggtgggcat ggcagggaat
ggagagagtg agccagacag 900acttcacctc cttactggac acagggtcaa gggcgagttt
caattgctgc tccctttact 960ttctctacct gtgactactc cctggaccaa tcctgaggag
ggcacatttt ccagaagcca 1020cgtgataggg gctggtttct gtggagccag aggcagagac
actgaacttg agctcacctc 1080ctaacaccgg cagtaaactt cctggaactt tgccctcagg
tgcggagggg acagaggacc 1140ctggcactct gttagggtgc tgtagaagac tagattgatg
gtagtttggc ctgttagttc 1200ctgttttggc catgactttt gcagatggca agtcacacac
cctcaaaggg aagctacacg 1260ggccaaatcg ggggagtggg tggggaattt tctcctctcc
ctttcctact ataatagtat 1320ttaagacata tcagctccag agatgagtcc tggagccttg
aattttgttt aacaaaataa 1380ttgtaggttt ctctctgtaa taacaacgct ggaaaggcag
agaacctctt ttatgctcat 1440gtcttgcatt tattgagatg actgtttctc atgcctttat
gttccttcat gtaagtaaag 1500tggacctttg tgctcaaaaa aaaaaaaaaa
1530117582DNAHomo sapiens 11tctttcactc ttgggcctag
cgaggcagct tatttttcct ttttctttca ttggtcacag 60ttatttcttt tcagtgtgtc
atcttctcta agtttctgag ggagtgttag cttgaggcct 120tgccttgaca taatcagata
taatcagaaa aatgaaaaat tccataggaa agagaactat 180tttagccaag gtgtgcgaga
gaaataccgc cactttcaag cactgttttc ttctactgga 240gtctgctcaa tagggacgtc
agctttgctg gggcttcctt tgacaagaga atcagaaccg 300actggtgaca tttgtttcaa
tgaaagcaac agtgtgaaga gtaagtagtt tttcctatta 360ttcatctcac tcagtggaga
taatgaactc ctctccactg ccaagatgag aaactaccat 420ttcttacaca tggacacaaa
gatgagaaca ataaacactg agacttcaaa aaggagggaa 480ggagaaagag gaacaagact
tgaaatctac ctatcagaac ttgcaattta ttctgatcaa 540caatttgccc agctaaagta
ctacatctgc cccctcttcc tgtggctgta ggggcacagc 600aaaggtcact ggtctaacct
ccttaaaggg actccgctaa cagaaaccac caaatggagt 660ggagaaaaag aaaagggatc
ccctatcccc aactccagcc atctagatta agaaaagcca 720gctgactgga cagtagcaca
gcccagtcac ctcatggaca aatttcctag gaaagaacct 780ctcccatcta ttctacttat
cactctcctt tgaggttccg gccacagatc ttcgcctgct 840gctggaaatg gccctctcag
tggactcatc gtggcatcgg tggcagtgga gagtcagaga 900tggcttcccc cattgtccat
cggaaaccac accgctgctc tctccagaga aagggagaca 960gagctacaac ttgacacagc
agcgggtcgt gttccccaac aacagcatat tccatcaaga 1020ttgggaagag gtctccagga
gataccctgg caacagaacc tgcacaacca aatacaccct 1080cttcaccttc ctgccccgga
atctctttga gcaatttcat agatgggcta acctctattt 1140cctgttcctg gtgattttga
actggatgcc ctccatggaa gtcttccaca gagaaatcac 1200catgttacca ttggccattg
tcctgttcgt catcatgatc aaggatggca tggaggactt 1260caagagacac cgctttgata
aagcaataaa ctgctccaac attcgaattt atgaaagaaa 1320agagcagacc tatgtgcaga
agtgctggaa ggatgtgcgc gtgggagact tcatccaaat 1380gaaatgcaat gagattgtcc
cagcagacat actcctcctt ttttcctctg accccaatgg 1440gatatgccat ctggaaactg
ccagcttgga tggagagaca aacctcaagc aaagatgtgt 1500cgtgaagggc ttctcacagc
aggaggtaca gttcgaacca gagcttttcc acaataccat 1560cgtgtgtgag aaacccaaca
accacctcaa caaatttaag ggttatatgg agcatcctga 1620ccagaccagg actggctttg
gctgtgagag tcttctgctt cgaggctgca ccatcagaaa 1680caccgagatg gctgttggca
ttgtcatcta tgcaggccat gagacgaaag ccatgctgaa 1740caacagtggc ccccggtaca
aacgcagcaa gattgagcgg cgcatgaata tagacatctt 1800cttctgcatt gggatcctca
tcctcatgtg ccttattgga gctgtaggtc acagcatctg 1860gaatgggacc tttgaagaac
accctccctt cgatgtgcca gatgccaatg gcagcttcct 1920tcccagtgcc cttgggggct
tctacatgtt cctcacaatg atcatcctgc tccaggtgct 1980gatccccatc tctttgtatg
tctccattga gctggtgaag ctcgggcaag tgttcttctt 2040gagcaatgac cttgacctgt
atgatgaaga gaccgattta tccattcaat gtcgagccct 2100caacatcgca gaggacttgg
gccagatcca gtacatcttc tccgataaga cggggaccct 2160gacagagaac aagatggtgt
tccgacgttg caccatcatg ggcagcgagt attctcacca 2220agaaaatgct aagcgactgg
agaccccaaa ggagctggac tcagatggtg aagagtggac 2280ccaataccaa tgcctgtcct
tctcggctag atgggcccag gatccagcaa ctatgagaag 2340ccaaaaaggt gctcagcctc
tgaggaggag ccagagtgcc cgggtgccca tccagggcca 2400ctaccggcaa aggtctatgg
ggcaccgtga aagctcacag cctcctgtgg ccttcagcag 2460ctccatagaa aaagatgtaa
ctccagataa aaacctactg accaaggttc gagatgctgc 2520cctgtggttg gagaccttgt
cagacagcag acctgccaag gcttccctct ccaccacctc 2580ctccattgct gatttcttcc
ttgccttaac catctgcaac tctgtcatgg tgtccacaac 2640caccgagccc aggcagaggg
tcaccatcaa accctcaagc aaggctctgg ggacgtccct 2700ggagaagatt cagcagctct
tccagaagtt gaagctattg agcctcagcc agtcattctc 2760atccactgca ccctctgaca
cagacctcgg ggagagctta ggggccaacg tggccaccac 2820agactcggat gagagagatg
atgcatctgt gtgcagtgga ggtgactcca ctgatgacgg 2880tggctacagg agcagcatgt
gggaccaggg cgacatcctg gagtctgggt caggcacttc 2940cttggaggag gcattggagg
ccccagccac agacctggcc aggcctgagt tctgttacga 3000ggctgagagc cctgatgagg
ccgccctggt gcacgctgcc catgcctaca gcttcacact 3060agtgtcccgg acacctgagc
aggtgactgt gcgcctgccc cagggcacct gcctcacctt 3120cagcctcctc tgcaccctgg
gctttgactc tgtcaggaag agaatgtctg tggttgtgag 3180gcacccactg actggcgaga
ttgttgtcta caccaagggt gctgactcgg tcatcatgga 3240cctgctggaa gacccagcct
gcgtacctga cattaatatg gaaaagaagc tgagaaaaat 3300ccgagcccgg acccaaaagc
atctagactt gtatgcaaga gatggcctgc gcacactatg 3360cattgccaag aaggttgtaa
gcgaagagga cttccggaga tgggccagtt tccggcgtga 3420ggctgaggca tccctcgaca
accgagatga gcttctcatg gaaactgcac agcatctgga 3480gaatcaactc accttacttg
gagccactgg gatcgaagac cggctgcagg aaggagttcc 3540agatacgatt gccactctgc
gggaggctgg gatccagctc tgggtcctga ctggagataa 3600gcaggagaca gcggtcaaca
ttgcccattc ctgcagactg ttaaatcaga ccgacactgt 3660ttataccatc aatacagaga
atcaggagac ctgtgaatcc atcctcaatt gtgcattgga 3720agagctaaag caatttcgtg
aactacagaa gccagaccgc aagctctttg gattccgctt 3780accttccaag acaccatcca
tcacctcaga agctgtggtt ccagaagctg gattggtcat 3840cgatgggaag acattgaatg
ccatcttcca gggaaagcta gagaagaagt ttctggaatt 3900gacccagtat tgtcggtccg
tcctgtgctg ccgctccacg ccactccaga agagtatgat 3960agtcaagctg gtgcgagaca
agttgcgcgt catgaccctt tccataggtg atggagcaaa 4020tgatgtaagc atgattcaag
ctgctgatat tggaattgga atatctggac aggaaggcat 4080gcaggctgtc atgtccagcg
actttgccat cacccgcttt aagcatctca agaagttgct 4140gctcgtgcat ggccactggt
gttactcgcg cctggccagg atggtggtgt actacctcta 4200caagaacgtg tgctacgtca
acctgctctt ctggtatcag ttcttctgtg gtttctccag 4260ctccaccatg attgattact
ggcagatgat attcttcaat ctcttcttta cctccttgcc 4320tcctcttgtc tttggagtcc
ttgacaaaga catctctgca gaaacactcc tggcattgcc 4380tgagctatac aagagtggcc
agaactctga gtgctataac ctgtcgactt tctggatttc 4440tatggtggat gcattctacc
agagcctcat ctgtttcttt atcccttacc tggcctataa 4500gggctctgat atagatgtct
ttacctttgg gacaccaatc aacaccatct ccctcaccac 4560aatccttttg caccaggcaa
tggaaatgaa gacatggacc attttccacg gagtcgtgct 4620cctcggcagc ttcctgatgt
actttctggt atccctcctg tacaatgcca cctgcgtcat 4680ctgcaacagc cccaccaatc
cctattgggt gatggaaggc cagctctcaa accccacttt 4740ctacctcgtc tgctttctca
caccagttgt tgctcttctc ccaagatact ttttcctgtc 4800tctgcaagga acttgtggga
agtctctaat ctcaaaagct cagaaaattg acaaactccc 4860cccagacaaa agaaacctgg
aaatccagag ttggagaagc agacagaggc ctgcccctgt 4920ccccgaagtg gctcgaccaa
ctcaccaccc agtgtcatct atcacaggac aggacttcag 4980tgccagcacc ccaaagagct
ctaaccctcc caagaggaag catgtggaag agtcagtact 5040ccacgaacag agatgtggca
cggagtgcat gagggatgac tcatgctcag gggactcctc 5100agctcaactc tcatccgggg
agcacctgct gggacctaac aggataatgg cctactcaag 5160aggacagact gatatgtgcc
ggtgctcaaa gaggagcagc catcgccgat cccagagttc 5220actgaccata tgaggagctg
cagaaatctg tacaaactca acagaggcca cctagtcact 5280ggtccacata acccttgacc
ccttcttctt catagaggaa acaatgtgcc agtcttattc 5340ttttcttcaa caaccttgac
ttccatggag gaagtgctgg ccccaagggg tctgacacaa 5400agacgggaaa cccagtcggc
ctctagtttt ctgctgctct caggcagcac atcttgcaaa 5460cagtttggag aaggaggctg
tttttgttga atcgagttct caaatcggtt tagaccaaag 5520ccattcttct gaccctctag
ataagcgtag cctacaaccc agtgccgtaa gtttccaaga 5580ttcaagaagt gtatcaaccc
aggcaatatc tcaggatatg gaagtttctg ggtttattta 5640cccctcagtg cccagagtta
aagtttcaga agagacttgt gcacataagg gcttcatctc 5700aagtgtattg cagtaatggc
tgaatcgggg ttaacatccc ttccaggcac agcgagttgg 5760ttctgctttt tgcctgtaag
ccaaagaaaa gccacatcta aaaagctact actaaaagcc 5820agaaagaaaa gtggatttga
actcagtgtc acagactctt ctgagtgttt tagggtcaca 5880gctagtgtaa gaggcatgaa
gaatagacat gcaaaaggga acgggtgcac cagagacccc 5940tgttttggct gacagaccat
atgtcccacc agctggggaa tctgacaaga ggacataggt 6000ggcactcttt ttttaaagct
atttattgta tctattttta aataaaattg cccatcctca 6060ttcagctctt agaacaaaag
caaaaaaccc tgtaaatcag gagatataag cacatctgca 6120cccagaatag gcccatatga
tagggcaacc ctgagcttaa acaatgacat cttcaagggt 6180agaactaatc tgaaacccca
ttcagcctat tccagaatgg ggataggctg aaaccccctt 6240ccagcctctg gaagacactg
gcctgcatca gttagagtca gagcaagtgt cacttcacag 6300ggaaaagaag gattatatag
acttcctatc cctagagttt ataaatgtca actatataaa 6360aaaagctcaa aacagtgtta
aaggaatgaa cagtagaatt ttaataggct gtccaaagaa 6420gccaggtctg ctgtgggcaa
gtatagccta accctagtct tgtaaaataa gccagaaagg 6480gttactgagc caccttaagc
tagtacctat atagtaggca aaaagtacag aaatagatgc 6540aataagtgtg gtgagtcttt
gagcctacga gtcatgccac cagccataag ttgacctatc 6600acttgagaac ctcctcagca
aagatgccag aaaacattca atcaagttgg caaatgacac 6660agggagctgg ccctctgacc
atcttcctgg caaacctgga ctggaagggc catttgcagc 6720actgtcctgg agctaataca
ctgtttcact gcctctgcca tataatgatg ccagcactag 6780ccagctggtg ggtatttgga
ggaatcctgc atgaggattg cccaataagg ggcaggtaca 6840catacctggc aaagtgatga
tgatgtgaat tgtttccagt gaggggattg agtcaaaact 6900tggatctcag gtacctcaat
ttttccccca atttctggct actactaaaa gccagaaaga 6960acagaacagt ggcctcagga
gatctgagtt tgaatccttg ctctctagga tgcaggtggc 7020ttgaagcaga atgccacacc
tgcaagttga ttagaactgc ctttcttccc aggcttgaca 7080taggtattaa gtcaaaatta
catgaaaccc agtggtaaaa aagcctctga aagctgtaac 7140accctcagta ataacaaaag
ggatttttat ttcacagcta aagggaaaat aggtggagaa 7200gttaaaaaat aatgtctgat
cctgttccta agttccaaac tatagccaac actctgatgc 7260tgctcttttt cttgtaggac
caaccgtccc agtttgcctg ggactttctc atttttacag 7320agtcccaaat cctaggaaac
tggagcaact ggtacaactg gtcacctact cttgcccctc 7380tgtaaatcaa gccaactgtg
accatccaat gtgccatctt acagggaaaa gttataacca 7440ctattcccct ataacataat
gctaatgatt gtacttagta catttttata cttttatgat 7500attttactga ttggaaatgt
catcctttat taaaaataaa catggttttc catagttgcc 7560tgccaaaaaa aaaaaaaaaa
aa 7582124131DNAHomo sapiens
12atcccgcccg catacagccc gcatcccgcc ggggaagcga gcccagtcca gcgctgcccg
60tccagtcctc gcccaagatt taaagcccgc aagttttgtt cttgagacca gcgactttag
120ctccgatgcg ggaaggaaag ccgacctccg atttggacat ttaaagagct gggcttgaac
180ttcgtgagtt tcgctctaaa ctgcccttga aatgaagctg gacttggagg tggcatggaa
240tattcacatg ggagagccgc atgaggccgc ccaccacgct tcctgaagga tgcccgtgtg
300gaagaatttt gacgtgccag tgtcctcgtt ctacagggtg ttccattctt ccgcaatctc
360agaaaaatgg gactaaaaga aactattttg taaaataaga agacttccat ttttaatgac
420caacatgtat taagatggac acctactcta cgaaacacga agttctatgg tctcgaagaa
480gcccgtgcct gtttaaaact gatcctaact aaaaacagac ttgagtggat atgagaatgt
540tggttagtgg cagaagagtc aaaaaatggc agttaattat tcagttattt gctacttgtt
600ttttagcgag cctcatgttt ttttgggaac caatcgataa tcacattgtg agccatatga
660agtcatattc ttacagatac ctcataaata gctatgactt tgtgaatgat accctgtctc
720ttaagcacac ctcagcgggg cctcgctacc aatacttgat taaccacaag gaaaagtgtc
780aagctcaaga cgtcctcctt ttactgtttg taaaaactgc tcctgaaaac tatgatcgac
840gttccggaat tagaaggacg tggggcaatg aaaattatgt tcggtctcag ctgaatgcca
900acatcaaaac tctgtttgcc ttaggaactc ctaatccact ggagggagaa gaactacaaa
960gaaaactggc ttgggaagat caaaggtaca atgatataat tcagcaagac tttgttgatt
1020ctttctacaa tcttactctg aaattactta tgcagttcag ttgggcaaat acctattgtc
1080cacatgccaa atttcttatg actgctgatg atgacatatt tattcacatg ccaaatctga
1140ttgagtacct tcaaagttta gaacaaattg gtgttcaaga cttttggatt ggtcgtgttc
1200atcgtggtgc ccctcccatt agagataaaa gcagcaaata ctacgtgtcc tatgaaatgt
1260accagtggcc agcttaccct gactacacag ccggagctgc ctatgtaatc tccggtgatg
1320tagctgccaa agtctatgag gcatcacaga cactaaattc aagtctttac atagacgatg
1380tgttcatggg cctctgtgcc aataaaatag ggatagtacc gcaggaccat gtgttttttt
1440ctggagaggg taaaactcct tatcatccct gcatctatga aaaaatgatg acatctcatg
1500gacacttaga agatctccag gacctttgga agaatgctac agatcctaaa gtaaaaacca
1560tttccaaagg tttttttggt caaatatact gcagattaat gaagataatt ctcctttgta
1620aaattagcta tgtggacaca tacccttgta gggctgcgtt tatctaatag tacttgaatg
1680ttgtatgttt tcactgtcac tgagtcaaac ctggatgaaa aaaaccttta aatgttcgtc
1740tataccctaa gtaaaatgag gacgaaagac aaatattttg aaagcctagt ccatcagaat
1800gtttctttga ttctagaagc tgtttaatat cacttatcta cttcattgcc taagttcatt
1860tcaaagaatt tgtatttaga aaaggtttat attattagtg aaaacaaaac taaagggaag
1920ttcaagttct catgtaatgc cacatatata cttgaggtgt agagatgtta ttaagaagtt
1980ttgatgttag aataattgct tttggaaaat accaaatgaa cgtacagtac aacatttcaa
2040ggaaatgaat atattgttag accaggtaag caagtttatt tttgttaaag agcacttggt
2100ggaggtagta ggggcaggga aaggtcagca taggagagaa agttcatgaa tctggtaaaa
2160cagtctcttg ttcttaagag gagatgtaga aaaatgtgta caatgttatt ataaacagac
2220aaatcacgtc ttaccacatc catgtagcta ctggtgttag agtcattaaa ataccttttt
2280ttgcatcttt tttcaaagtt taatgtgaac ttttagaaaa gtgattaatg ttgccctaat
2340actttatatg tttttaatgg attttttttt aagtattaga aaatgacaca taacacgggc
2400agctggttgc tcatagggtc cttctctagg gagaaaccat tgttaattca aataagctga
2460ttttaatgac gttttcaact ggtttttaaa tattcaatat tggtctgtgt ttaagtttgt
2520tatttgaatg taatttacat agaggaatat aataatggag agacttcaaa tggaaagaca
2580gaacattaca agcctaatgt ctccataatt ttataaaatg aaatcttagt gtctaaatcc
2640ttgtactgat tactaaaatt aacccactcc tccccaacaa ggtcttataa accacagcac
2700tttgttccaa gttcagagtt ttaaattgag agcattaaac atcaaagtta taatatctaa
2760aacaatttat ttttcatcaa taactgtcag aggtgatctt tattttctaa atatttcaaa
2820cttgaaaaca gagtaaaaaa gtgatagaaa agttgccagt ttggggttaa agcattttta
2880aagctgcatg ttccttgtaa tcaaagagat gtgtctgaga tctaatagag taagttacat
2940ttattttaca aagcaggata aaaatgtggc tataatacac actacctccc ttcactacag
3000aaagaactag gtggtgtcta ctgctaggga gattatatga aggccaaaat aatgacttca
3060gcaagagtga ctgaactcac tctaaggcct ttgactgcag aggcacctgt tagggaaaat
3120cagatgtctc atataataag gtgatgtcgg aaacacgcaa aacaaaacga aaaaagattt
3180ctcagtatac acaactgaat gatgatactt acaattttta gcaggtagct ttttaatgtt
3240tacagaaatt ttaatttttt tctattttga aatttgaggc ttgtttacat tgcttagata
3300atttagaatt tttaactaat gtcaaaacta cagtgtcaaa cattctaggt tgtagttact
3360ttcagagtag atacagggtt ttagatcatt acagtttaag ttttctgacc aattaaaaaa
3420acatagagaa caaaagcata tttgaccaag caacaagctt ataattaatt tttattagtt
3480gattgattaa tgatgtattg ccttttgccc atatataccc tgtgtatcta tacttggaag
3540tgtttaaggt tgccattggt tgaaaacata agtgtctctg gccatcaaag tgatcttgtt
3600tacagcagtg cttttgtgaa acaattattt atttgctgaa agagctcttc tgaactgtgt
3660ccttttaatt tttgcttaga atagaatgga acaagtttaa atttcaagga aatatgaagg
3720cacttccttt ttttctaaga aggaagttgc tagatgattc cttcatcaca cttacttaaa
3780gtactgagaa gagtatctgt aaataaaagg gttccaacct tttaaaaaag aaggaaaaaa
3840ctttttggtg ctccagtgta gggctatctt tttaaaaaat gtcaacaaag ggaaaataaa
3900ctatcagctt ggatggtcac ttgaatagaa gatggttata cacagtgtta ttgttaaaat
3960ttttttacct tttggttggt ttgcatcttt tttccatatt gttaatttta taccaaaatg
4020ttaaatattt gtattacttg aattttgctc ttgtatggca aaataattag tgagtttaaa
4080aaaaatctat agtttccaat aaacaactga aaaattatca tgaaaaaaaa a
4131133475DNAHomo sapiens 13actgggtaga atacttgggg tgccagggag gcattaatgc
gagaggagtc aggtgctcag 60tttttattgg agttgggagg gcagccccac atcaggaaga
gaacctgttt ctgcaggatg 120gtccggggag aagggaggac tccacccagg cttgtgtttg
ccctgctctg tgtattcagc 180cagcaggctc tgcacaagga agcaaagtgc agggagccag
gctccaccga cagccaggca 240ctgggcagca cgcactggag acccaggacc ctgtgcagga
gcagctccgg gtgacacgag 300gggactgaag atactcccac aggggctcag caggagcaat
gggtaaccaa atgagtgttc 360cccaaagagt tgaagaccaa gagaatgaac cagaagcaga
gacttaccag gacaacgcgt 420ctgctctgaa cggggttcca gtggtggtgt cgacccacac
agttcagcac ttagaggaag 480tcgacttggg aataagtgtc aagacggata atgtggccac
ttcttccccc gagacaacgg 540agataagtgc tgttgcggat gccaacggaa agaatcttgg
gaaagaggcc aaacccgagg 600caccagctgc taaatctcgt tttttcttga tgctctctcg
gcctgtacca ggacgtaccg 660gagaccaagc cgcagattca tcccttggat cagtgaagct
tgatgtcagc tccaataaag 720ctccagcgaa caaagaccca agtgagagct ggacacttcc
ggtggcagct ggaccggggc 780aggacacaga taaaacccca gggcacgccc cggcccaaga
caaggtcctc tctgccgcca 840gggatcccac gcttctccca cctgagacag ggggagcagg
aggagaagct ccctccaagc 900ccaaggactc cagctttttt gacaaattct tcaagctgga
caagggacag gaaaaggtgc 960caggtgacag ccaacaggaa gccaagaggg cagagcatca
agacaaggtg gatgaggttc 1020ctggcttatc agggcagtcc gatgatgtcc ctgcagggaa
ggacatagtt gacggcaagg 1080aaaaagaagg acaagaactt ggaactgcgg attgctctgt
ccctggggac ccagaaggac 1140tggagactgc aaaggacgat tcccaggcag cagctatagc
agagaataat aattccatca 1200tgagtttctt taaaactctg gtttcaccta acaaagctga
aacaaaaaag gacccagaag 1260acacgggtgc tgaaaagtca cccaccactt cagctgacct
taagtcagac aaagccaact 1320ttacatccca ggagacccaa ggggctggca agaattccaa
aggatgcaac ccatcggggc 1380acacacagtc cgtgacaacc cctgaacctg cgaaggaagg
caccaaggag aaatcaggac 1440ccacctctct gcctctgggc aaactgtttt ggaaaaagtc
agttaaagag gactcagtcc 1500ccacaggtgc ggaggagaat gtggtgtgtg agtcaccagt
agagattata aagtccaagg 1560aagtagaatc agccttacaa acagtggacc tcaacgaagg
agatgctgca cctgaaccca 1620cagaagcgaa actcaaaaga gaagaaagca aaccaagaac
ctctctgatg gcgtttctca 1680gacaaatgtc agtgaaaggg gatggaggga tcacccactc
agaagaaata aatgggaaag 1740actccagctg ccaaacatca gactccacag aaaagactat
cacaccgcca gagcctgaac 1800caacaggagc accacagaag ggtaaagagg gctcctcgaa
ggacaagaag tcagcagccg 1860agatgaacaa gcagaagagc aacaagcagg aagccaaaga
accagcccag tgcacagagc 1920aggccacggt ggacacgaac tcactgcaga atggggacaa
gctccaaaag agacctgaga 1980agcggcagca gtcccttggg ggcttcttta aaggcctggg
accaaagcgg atgttggatg 2040ctcaagtgca aacagaccca gtatccatcg gaccagttgg
caaatccaag taaacaaatc 2100agcacggttc ccaccaggtt ctcctgccac caagatgtgt
tctccttact ccatctcctc 2160cccaaacacg ctccatgtat atattcttct gatggccagc
aaatgaaatt ctgcctagaa 2220attaagcccg agctgttgta tattgaggtg tattatttac
gtctctggtc cagtcttttc 2280tggcaaataa cagtaaagat ggtttagcag gtcacctagt
tgggtcagaa gagtcgatga 2340tcaccaagca ggaaagggag ggaatagagg aatgtgttcg
ggttaagtga tgaaaatggc 2400agtggtggcc gggcgtggtg gctctcgcct gtaatctcag
cactttggga ggccgaggca 2460ggtggatcac ctgaggtcag gagttcaaga ctagcctggc
caacatcatg aaaccccgtc 2520tctactaaaa atacaaaaat tagccaggca tggtggcaca
cacctgtagt cccagctact 2580cgggagccca acgcacgaga accgcttgta cccaggaggt
ggaggttgca gtgagccgaa 2640gttgcaccat tgcactccac cctgggcgac agagcaagat
tctatcaaaa aaaaaaaaag 2700gcagtggcaa gtaagttata gaagagaaat gctgctagaa
ggaattaagc gttgtagtaa 2760atgcgtgctt atcctctaag cttgaagaag ggagacgaaa
atccatttgt ttaaattcac 2820atctcaagga gggagaaccc gggctgtgtt gggtggttgc
caatttccta gaacggaatg 2880tgtggggtat agaaaaagga atgaataagc gttgtttttc
aaatagggtc cttgtaagtt 2940attgatgaga gggaaaagat tgactgggga gggcttaaaa
tgatttggga aaacaattgc 3000ttttgaggct cagtgacaac ggcaaagatt acaacttaaa
aaaaaaaaat aaataaaaaa 3060taaaggaagt tgcacggtta ttttgcaaca caagggggcg
gcaaggtccc catttttatc 3120ctgtaatact gtatccctaa caaagatttg gtctctgcta
tcttacatta ttaatgtttc 3180tcagatggct gaggggctcg cttcatctgt tccgtctgac
acttatctca agtgtgtctg 3240tcattcctaa tgttctcagg atgtgctctg ataaaaccct
ccccataacc tcagttaata 3300aaaatttaca gaagacttct caaatacctg agttgttttt
aatacctgta caaaggagta 3360aataggaccc tgagtctatt aaaatgtaat tcaaagtagc
atatgattga ctgacagtca 3420tgtaaactgt atctttcttt ttctgattta ataaaaaata
catttacttc taaag 3475142239DNAHomo sapiens 14tagacattta tgcagtggtt
caaagtctag agtccctaca cttctggtac atgacagctg 60tgtctcgatg gagtagactc
tcagaacagc gcagtttgcc ctccgctcac gcagagcctc 120tccgtggctt ccgcaccttg
agcattaggc cagttctcct cttctctcta atccatccgt 180cacctctcct gtcatccgtt
tccatgccgt gaggtccatt cacagaacac atccatggct 240ctcatgctca gtttggttct
gagtctcctc aagctgggat cagggcagtg gcaggtgttt 300gggccagaca agcctgtcca
ggccttggtg ggggaggacg cagcattctc ctgtttcctg 360tctcctaaga ccaatgcaga
ggccatggaa gtgcggttct tcaggggcca gttctctagc 420gtggtccacc tctacaggga
cgggaaggac cagccattta tgcagatgcc acagtatcaa 480ggcaggacaa aactggtgaa
ggattctatt gcggaggggc gcatctctct gaggctggaa 540aacattactg tgttggatgc
tggcctctat gggtgcagga ttagttccca gtcttactac 600cagaaggcca tctgggagct
acaggtgtca gcactgggct cagttcctct catttccatc 660acgggatatg ttgatagaga
catccagcta ctctgtcagt cctcgggctg gttcccccgg 720cccacagcga agtggaaagg
tccacaagga caggatttgt ccacagactc caggacaaac 780agagacatgc atggcctgtt
tgatgtggag atctctctga ccgtccaaga gaacgccggg 840agcatatcct gttccatgcg
gcatgctcat ctgagccgag aggtggaatc cagggtacag 900ataggagata cctttttcga
gcctatatcg tggcacctgg ctaccaaagt actgggaata 960ctctgctgtg gcctattttt
tggcattgtt ggactgaaga ttttcttctc caaattccag 1020tgtaagcgag agagagaagc
atgggccggt gccttattca tggttccagc agggacagga 1080tcagagatgc tcccacatcc
agctgcttct cttcttctag tcctagcctc caggggccca 1140ggcccaaaaa aggaaaatcc
aggcggaact ggactggaga agaaagcacg gacaggcaga 1200attgagagac gcccggaaac
acgcagtgga ggtgactctg gatccagaga cggctcaccc 1260gaagctctgc gtttctgatc
tgaaaactgt aacccataga aaagctcccc aggaggtgcc 1320tcactctgag aagagattta
caaggaagag tgtggtggct tctcagagtt tccaagcagg 1380gaaacattac tgggaggtgg
acggaggaca caataaaagg tggcgcgtgg gagtgtgccg 1440ggatgatgtg gacaggagga
aggagtacgt gactttgtct cccgatcatg ggtactgggt 1500cctcagactg aatggagaac
atttgtattt cacattaaat ccccgtttta tcagcgtctt 1560ccccaggacc ccacctacaa
aaataggggt cttcctggac tatgagtgtg ggaccatctc 1620cttcttcaac ataaatgacc
agtcccttat ttataccctg acatgtcggt ttgaaggctt 1680attgaggccc tacattgagt
atccgtccta taatgagcaa aatggaactc ccatagtcat 1740ctgcccagtc acccaggaat
cagagaaaga ggcctcttgg caaagggcct ctgcaatccc 1800agagacaagc aacagtgagt
cctcctcaca ggcaaccacg cccttcctcc ccaggggtga 1860aatgtaggat gaatcacatc
ccacattctt ctttagggat attaaggtct ctctcccaga 1920tccaaagtcc cgcagcagcc
ggccaaggtg gcttccagat gaagggggac tggcctgtcc 1980acatgggagt caggtgtcat
ggctgccctg agctgggagg gaagaaggct gacattacat 2040ttagtttgct ctcactccat
ctggctaagt gatcttgaaa taccacctct caggtgaaga 2100accgtcagga attcccatct
cacaggctgt ggtgtagatt aagtagacaa ggaatgtgaa 2160taatgcttag atcttattga
tgacagagtg tatcctaatg gtttgttcat tatattacac 2220tttcagtaaa aaaaaaaaa
223915793DNAHomo sapiens
15ggataacccg cggccgcgcc tgcccgctcg cacccctctc ccgcgcccgg ttctccctcg
60cagcacctcg aagtgcgccc ctcgccctcc tgctcgcgcc ccgccgccat ggctgcctcc
120cccgcgcggc ctgctgtcct ggccctgacc gggctggcgc tgctcctgct cctgtgctgg
180ggcccaggtg gcataagtgg aaataaactc aagctgatgc ttcaaaaacg agaagcacct
240gttccaacta agactaaagt ggccgttgat gagaataaag ccaaagaatt ccttggcagc
300ctgaagcgcc agaagcggca gctgtgggac cggactcggc ccgaggtgca gcagtggtac
360cagcagtttc tctacatggg ctttgacgaa gcgaaatttg aagatgacat cacctattgg
420cttaacagag atcgaaatgg acatgaatac tatggcgatt actaccaacg tcactatgat
480gaagactctg caattggtcc ccggagcccc tacggcttta ggcatggagc cagcgtcaac
540tacgatgact actaaccatg acttgccaca cgctgtacaa gaagcaaata gcgattctct
600tcatgtatct cctaatgcct tacactactt ggtttctgat ttgctctatt tcagcagatc
660ttttctacct actttgtgtg atcaaaaaag aagagttaaa acaacacatg taaatgcctt
720ttgatatttc atgggaatgc ctctcattta aaaatagaaa taaagcattt tgttaaaaag
780aaaaaaaaaa aaa
79316672DNAHomo sapiens 16caggccagcc ctggggcgcc ttaaaaaccg gagctggcgc
ttggcatcgc cactctgggc 60aggatccaac gtcgctccag ctgctcttga cgactccaca
gataccccga agccatggca 120agcaagggct tgcaggacct gaagcaacag gtggagggga
ccgcccagga agccgtgtca 180gcggccggag cggcagctca gcaagtggtg gaccaggcca
cagaggcggg gcagaaagcc 240atggaccagc tggccaagac cacccaggaa accatcgaca
agactgctaa ccaggcctct 300gacaccttct ctgggattgg gaaaaaattc ggcctcctga
aatgacagca gggagacttg 360ggtcggcctc ctgaaatgac agcagggaga cttgggtgac
cccccttcca ggcgccatct 420agcacagcct ggccctgatc tccgggcagc caccacctcc
tcggtctgcc ccctcattaa 480aattcacgtt cccaccctgt gtccacttca tgattcctcg
caagctgggc ccagtcctct 540catcccaaga gcagagccac cgtagccgga gtcctagcct
cccaaattcg gaaatccaat 600ccaacggtct caggaatgtt ttccatcccg ccacgcgcct
cccgaagctc ccagaccgga 660ggctcagccc cc
67217899DNAHomo sapiens 17cccgagcgcc ggccgggcca
tgacccccgc tgctctgtct tgcaggctcg tcgccgcggc 60cccccgagcc cgaccgccgc
cgccaccacc accagcgccc gggcgggcct cgcgcgcctc 120gggcgcggct ccgcagtgag
cccaccaaga aggaagcggc ctgcagaggt gccgacatgg 180ggcttaagat gtcctgcctg
aaaggctttc aaatgtgtgt cagcagcagc agcagcagcc 240acgacgaggc ccccgtcctg
aacgacaagc acctggacgt gcccgacatc atcatcacgc 300cccccacccc cacgggcatg
atgctgccga gggacttggg gagcacagtc tggctggatg 360agacagggtc gtgcccagat
gatggagaaa tcgacccaga agcctgagga ggtgtcctgg 420gtttggctgg ctggctcctg
ctccagcggc ccggcttcag gtgtccgggg gcgtggctgc 480ctggagcagg tgtgctgaat
accctggatg ggaactgagc gaacccgggc ctccgctcag 540agagacgtgg caggaccagc
gaggaatcca gcctgtccac ttccagaaca gtgtttccca 600ggccccgctg agtggaccgg
acctctgaca cctccaggtt cttgctgact ccggcctggt 660gaaagggagc gccatggtcc
tggctgttgg ggtcccaggg agaggctctc ttctggacaa 720acacaccctc ccagccccca
gggctgtgca aacacatgcc cctgccataa gcaccaacaa 780gaacttcttg caggtggagt
ggctgttttt tataagttgt tttacagata cggaaacagt 840ccaaaatggg atttataatt
tcttttttgc attataaata aagatcctct gtaacaaaa 899182362DNAHomo sapiens
18actcagccca gtggccctct gagctgttcc ttcttgaccg gcacacacag ctcgcttctt
60cactttcttt tccatccact gccggaccca agccagcctt ccagggagca gccatgcctt
120acctctaccg ggccccaggg cctcaggcac acccggttcc caaggacgcc cggatcaccc
180actcctcagg ccagagcttt gagcaaatga ggcaggagtg cctgcagaga ggcaccctgt
240ttgaggatgc agacttccca gccagcaatt cctccctgtt ctacagtgag aggccgcaga
300tcccctttgt gtggaaacga ccaggggaaa tcgtgaaaaa cccagaattc attcttggag
360gggccaccag gactgatatc tgccagggag agctgggaga ctgctggcta ttagccgcca
420tcgcctccct tacgcttaat caaaaagcac tggccagagt catcccccag gaccaaagct
480ttggccctgg ttatgccggg atattccatt tccagttctg gcagcacagt gagtggctgg
540acgtggtgat cgatgaccgc ctgcccacct tcagggaccg cttggttttc ctccactctg
600ccgaccacaa cgagttctgg agcgccttgc tggaaaaagc ctacgccaag ctaaatggga
660gctatgaagc tctgaaggga ggcagcgcca tcgaggccat ggaagacttc actgggggtg
720tggcagagac cttccaaact aaagaggccc ccgagaactt ctatgagatt ctagagaagg
780ctttgaagag aggctccctg ctgggctgct tcattgatac cagaagtgct gcagaatctg
840aggcccggac gccgtttggt cttattaagg gtcatgccta cagtgtaacg ggaattgacc
900aggtaagctt ccgaggccag agaatcgagc tcatccgaat ccggaaccct tggggccagg
960ttgagtggaa cgggtcgtgg agcgacagtt ctccggagtg gcgttctgtt ggtccagctg
1020agcagaagcg tctgtgtcac actgctctgg atgatgggga attctggatg gcatttaagg
1080acttcaaggc ccactttgat aaagtggaga tctgcaacct cactcccgat gccctggagg
1140aagacgcgat ccacaaatgg gaggtgacgg tccatcaggg aagctgggtt cgcggctcca
1200cggctggggg ctgccgcaat ttcctggata ccttttggac caatccacaa ataaaattgt
1260ctctgactga gaaagatgag gggcaggagg agtgtagttt ccttgtagcc ctgatgcaga
1320aagatagaag gaaactcaag agatttggtg ccaatgtgct gacaatcggc tatgccattt
1380atgagtgccc tgacaaagac gaacacctga acaaagactt cttcagatac cacgcttctc
1440gggccagaag caagacgttc atcaacctga gagaagtctc cgaccggttc aagctgcccc
1500ctggggagta catcctgatt cccagcactt ttgagcccca ccaggaagct gatttctgtc
1560tgagaatctt ttcagagaaa aaagccatta cccgggatat ggatggaaat gtagacattg
1620accttcctga gcctccaaag ccaactccac ctgaccagga gacagaggag gagcagcggt
1680ttcgggctct gtttgaacaa gtcgctggtg aggacatgga ggtgacagca gaggaacttg
1740agtatgtttt aaatgctgtg ctgcaaaaga aaaaggacat caaattcaag aagctaagcc
1800tgatctcctg taaaaacatc atttccctga tggacaccag cggcaatggg aagctggagt
1860ttgatgaatt caaagtgttc tgggacaagc tgaagcagtg gattaacctt ttccttcggt
1920ttgatgctga caagtccggc accatgtcta cctatgaact acggactgca ctgaaagctg
1980caggctttca gctgagcagc cacctcctgc agctgattgt gctcaggtat gcggatgagg
2040agctccagct ggacttcgat gacttcctca actgcctggt ccggctggag aatgcgagcc
2100gggtgttcca ggctctcagt acaaagaaca aggagttcat tcatctcaat ataaatgagt
2160tcatccattt gacaatgaac atctgaggct gccttgtaga gatgcagcct gcccagctga
2220atcttggctt ctggaccttg accttcagaa cttctcttgg tgtggaacca ttacgcccag
2280ggttcactcc cctctcatcg tccggccttc tcccttcatc ttgatctggg aagaatgaaa
2340tgaactcagc tacactctct ga
2362199170DNAHomo sapiens 19gcgcgcccat ttcagattac taaactcgaa ttaagaggga
aaaaaaatca gggaggaggt 60ggcaagccac accccacggt gcccgcgaac ttccccggca
gcggactgta gcccaggcag 120acgccgtcga gatgcagggc ccaccgctcc tgaccgccgc
ccacctcctc tgcgtgtgca 180ccgccgcgct ggccgtggct cccgggcctc ggtttctggt
gacagcccca gggatcatca 240ggcccggagg aaatgtgact attggggtgg agcttctgga
acactgccct tcacaggtga 300ctgtgaaggc ggagctgctc aagacagcat caaacctcac
tgtctctgtc ctggaagcag 360aaggagtctt tgaaaaaggc tcttttaaga cacttactct
tccatcacta cctctgaaca 420gtgcagatga gatttatgag ctacgtgtaa ccggacgtac
ccaggatgag attttattct 480ctaatagtac ccgcttatca tttgagacca agagaatatc
tgtcttcatt caaacagaca 540aggccttata caagccaaag caagaagtga agtttcgcat
tgttacactc ttctcagatt 600ttaagcctta caaaacctct ttaaacattc tcattaagga
ccccaaatca aatttgatcc 660aacagtggtt gtcacaacaa agtgatcttg gagtcatttc
caaaactttt cagctatctt 720cccatccaat acttggtgac tggtctattc aagttcaagt
gaatgaccag acatactatc 780aatcatttca ggtttcagaa tatgtattac caaaatttga
agtgactttg cagacaccat 840tatattgttc tatgaattct aagcatttaa atggtaccat
cacggcaaag tatacatatg 900ggaagccagt gaaaggagac gtaacgctta catttttacc
tttatccttt tggggaaaga 960agaaaaatat tacaaaaaca tttaagataa atggatctgc
aaacttctct tttaatgatg 1020aagagatgaa aaatgtaatg gattcttcaa atggactttc
tgaatacctg gatctatctt 1080cccctggacc agtagaaatt ttaaccacag tgacagaatc
agttacaggt atttcaagaa 1140atgtaagcac taatgtgttc ttcaagcaac atgattacat
cattgagttt tttgattata 1200ctactgtctt gaagccatct ctcaacttca cagccactgt
gaaggtaact cgtgctgatg 1260gcaaccaact gactcttgaa gaaagaagaa ataatgtagt
cataacagtg acacagagaa 1320actatactga gtactggagc ggatctaaca gtggaaatca
gaaaatggaa gctgttcaga 1380aaataaatta tactgtcccc caaagtggaa cttttaagat
tgaattccca atcctggagg 1440attccagtga gctacagttg aaggcctatt tccttggtag
taaaagtagc atggcagttc 1500atagtctgtt taagtctcct agtaagacat acatccaact
aaaaacaaga gatgaaaata 1560taaaggtggg atcgcctttt gagttggtgg ttagtggcaa
caaacgattg aaggagttaa 1620gctatatggt agtatccagg ggacagttgg tggctgtagg
aaaacaaaat tcaacaatgt 1680tctctttaac accagaaaat tcttggactc caaaagcctg
tgtaattgtg tattatattg 1740aagatgatgg ggaaattata agtgatgttc taaaaattcc
tgttcagctt gtttttaaaa 1800ataagataaa gctatattgg agtaaagtga aagctgaacc
atctgagaaa gtctctctta 1860ggatctctgt gacacagcct gactccatag ttgggattgt
agctgttgac aaaagtgtga 1920atctgatgaa tgcctctaat gatattacaa tggaaaatgt
ggtccatgag ttggaacttt 1980ataacacagg atattattta ggcatgttca tgaattcttt
tgcagtcttt caggaatgtg 2040gactctgggt attgacagat gcaaacctca cgaaggatta
tattgatggt gtttatgaca 2100atgcagaata tgctgagagg tttatggagg aaaatgaagg
acatattgta gatattcatg 2160acttttcttt gggtagcagt ccacatgtcc gaaagcattt
tccagagact tggatttggc 2220tagacaccaa catgggttac aggatttacc aagaatttga
agtaactgta cctgattcta 2280tcacttcttg ggtggctact ggttttgtga tctctgagga
cctgggtctt ggactaacaa 2340ctactccagt ggagctccaa gccttccaac catttttcat
ttttttgaat cttccctact 2400ctgttatcag aggtgaagaa tttgctttgg aaataactat
attcaattat ttgaaagatg 2460ccactgaggt taaggtaatc attgagaaaa gtgacaaatt
tgatattcta atgacttcaa 2520atgaaataaa tgccacaggc caccagcaga cccttctggt
tcccagtgag gatggggcaa 2580ctgttctttt tcccatcagg ccaacacatc tgggagaaat
tcctatcaca gtcacagctc 2640tttcacccac tgcttctgat gctgtcaccc agatgatttt
agtaaaggct gaaggaatag 2700aaaaatcata ttcacaatcc atcttattag acttgactga
caataggcta cagagtaccc 2760tgaaaacttt gagtttctca tttcctccta atacagtgac
tggcagtgaa agagttcaga 2820tcactgcaat tggagatgtt cttggtcctt ccatcaatgg
cttagcctca ttgattcgga 2880tgccttatgg ctgtggtgaa cagaacatga taaattttgc
tccaaatatt tacattttgg 2940attatctgac taaaaagaaa caactgacag ataatttgaa
agaaaaagct ctttcattta 3000tgaggcaagg ttaccagaga gaacttctct atcagaggga
agatggctct ttcagtgctt 3060ttgggaatta tgacccttct gggagcactt ggttgtcagc
ttttgtttta agatgtttcc 3120ttgaagccga tccttacata gatattgatc agaatgtgtt
acacagaaca tacacttggc 3180ttaaaggaca tcagaaatcc aacggtgaat tttgggatcc
aggaagagtg attcatagtg 3240agcttcaagg tggcaataaa agtccagtaa cacttacagc
ctatattgta acttctctcc 3300tgggatatag aaagtatcag cctaacattg atgtgcaaga
gtctatccat tttttggagt 3360ctgaattcag tagaggaatt tcagacaatt atactctagc
ccttataact tatgcattgt 3420catcagtggg gagtcctaaa gcgaaggaag ctttgaatat
gctgacttgg agagcagaac 3480aagaaggtgg catgcaattc tgggtgtcat cagagtccaa
actttctgac tcctggcagc 3540cacgctccct ggatattgaa gttgcagcct atgcactgct
ctcacacttc ttacaatttc 3600agacttctga gggaatccca attatgaggt ggctaagcag
gcaaagaaat agcttgggtg 3660gttttgcatc tactcaggat accactgtgg ctttaaaggc
tctgtctgaa tttgcagccc 3720taatgaatac agaaaggaca aatatccaag tgaccgtgac
ggggcctagc tcaccaagtc 3780ctgtaaagtt tctgattgac acacacaacc gcttactcct
tcagacagca gagcttgctg 3840tggtacagcc aacggcagtt aatatttccg caaatggttt
tggatttgct atttgtcagc 3900tcaatgttgt atataatgtg aaggcttctg ggtcttctag
aagacgaaga tctatccaaa 3960atcaagaagc ctttgattta gatgttgctg taaaagaaaa
taaagatgat ctcaatcatg 4020tggatttgaa tgtgtgtaca agcttttcgg gcccgggtag
gagtggcatg gctcttatgg 4080aagttaacct attaagtggc tttatggtgc cttcagaagc
aatttctctg agcgagacag 4140tgaagaaagt ggaatatgat catggaaaac tcaacctcta
tttagattct gtaaatgaaa 4200cccagttttg tgttaatatt cctgctgtga gaaactttaa
agtttcaaat acccaagatg 4260cttcagtgtc catagtggat tactatgagc caaggagaca
ggcggtgaga agttacaact 4320ctgaagtgaa gctgtcctcc tgtgaccttt gcagtgatgt
ccagggctgc cgtccttgtg 4380aggatggagc ttcaggctcc catcatcact cttcagtcat
ttttattttc tgtttcaagc 4440ttctgtactt tatggaactt tggctgtgat ttatttttaa
aggactctgt gtaacactaa 4500catttccagt agtcacatgt gattgttttg ttttcgtaga
agaatactgc ttctattttg 4560aaaaaagagt tttttttctt tctatggggt tgcagggatg
gtgtacaaca ggtcctagca 4620tgtatagctg catagatttc ttcacctgat ctttgtgtgg
aagatcagaa tgaatgcagt 4680tgtgtgtcta tattttcccc tctcaaaatc ttttagaatt
tttttggagg tgtttgtttt 4740ctccagaata aaggtattac tttagaatag gtattctcct
cattttgtga aagaaatgaa 4800cctagattct taagcattat tacacatcca tgtttgctta
aagatggatt tccctgggaa 4860tgggagaaaa cagccagcag gaggagcttc atctgttccc
ttcccacctc caacctagcc 4920ctactgccca ccccacccca acccacccca tgcccagtgg
tctcagtaga tacttcttaa 4980ctggaaattc tttcttttca gaatctaggt ggtgaatttt
ttttaagtgg cacggtcttt 5040ttctgcttga aatctgatca caccccccag ccattgccct
ccctctcttt ttcctctgta 5100gagaaatgtg aggggcagta catttactgt gcttttcaca
ccatctcaga ggttgaggag 5160catactgaaa attgccctgg ggggtgctgg gtgtgctgtc
tccttcccac atcctcagcc 5220ccacaccagc tctatttcag gggtgagagt cagagagcac
tgcaatatgt gcttcatggg 5280atttcgattc gaagatccta gaccagggag acactgtgag
ccagggatac aacaaaatac 5340taggtaagtc actgcagacc gacctccctg cagtttggga
aagaagctgg gtttgtggag 5400aatcagagca tcttgacatg actgctgacc taaagatccc
tggcattggc cagggatcct 5460gtggaacctc ttctagttca ggggtgtgag cattagactg
ccagttgtct agtgacatct 5520gatgcttgct gtgaactttt aagatccccg aatcctgagc
acctcaatct ttaattgccc 5580tgtattccga agggtaatat aatttatctg gatggaaatt
ttaaagatga atcccccttt 5640tttcttttct tctctctttt ctttccttct ccctttcttc
tttgccttct aaatatactg 5700aaatgattta gatatgtgtc aacaattaat gatcttttat
tcaatctaag aaatggttta 5760gtttttctct ttagctctat ggcatttcac tcaagtggac
aggggaaaaa gtaattgcca 5820tgggctccaa agaatttgct ttatgttttt agctatttaa
aaataaatcc atcaaaaata 5880aagtatgcaa atgtatcttt taaagttaat ttttaaaaat
gctcttattt tagtgaattt 5940tcagaaatta tagtggaatg gatgctcata tattgcttat
ggatattttg gataccaaag 6000taggaataac tgacattcag tattttaaag ctggcaaacc
tgtacataga aaatagatcc 6060ccagacagtg gtctatgaag agggcagtta agtatcaaat
acttaatttt cttgcctttt 6120tttcttaagt ggggaaaagt ttctagatct cttacacctc
tgacacaatc tgttctaaaa 6180caggcacttg taatgttggg gcctccttgt aaacgtgttt
ttgcccttta ctctctggga 6240gttctttaaa ggtgaaatca tcttacaaag aaattggggg
agggtcttgg caaaggactt 6300tcccctcctc tttcctggcc tgggaacctt atactgacaa
tcaatacttt atattttaaa 6360gtatataatt tatagttaac ttctagtgta atatattagg
aaacactaga atggaaaggc 6420cattggaaga caggttgtat cttttttaga ccatatttcc
ttgtttaaaa actatcattt 6480gaatactttt ttggtgaaga actccatgtt ttcaagttaa
aggtcacctc gtaggccagg 6540cgcagtggct catgcctgta atcccagcac tctgggaggc
tgaggcgggt gaatcacaag 6600gttaggagtt tgagaccagc ctggccaata tggtgaaacc
ccgtccctac taaaaataca 6660aaatttagcc aggcgtggtg gcatgcacct gtagtcccac
ctactcggga ggctgaggca 6720ggagaatcac ttgaacctga gagacagagg ttgcagtgag
ccgagatcac gccactgcac 6780tccagcctgg gggacagagt gagattctgt ctcaaaaaac
aaaaaacaaa aaagtcacct 6840tgtaactcat ctctttttat tgtaagttta ttaaaaatga
agaggacaac aatgagaagg 6900aacataaagg gttagctagc actgtctcct ggtgcatggg
gctgtgcaga tgtcccggcc 6960acttcttcct tcatacttcc cttagagaac ttgctctgct
acaagcagtg ggcttggact 7020aaaagtgatt aaaataccac aggcataagg agaaaaggag
tatatgtagt agtaataatt 7080actagtataa attattttct tcacatgcta tgagtaataa
tattaaaaaa ctcattttac 7140cattaagatt ccttatgctg aagctcttcc atttagaata
ctgtcaatgt catttactgg 7200tatgaactaa agtccccctt cttttccact cactgggaac
cttagtaaaa caccagcata 7260tcttacctct ctttctgact ggccgatgct tccagagact
gaatgttggg aaaacctagt 7320agccaaacaa ttctaggaca gaataacatt tttatatttg
gttccaccat cttattacat 7380ttagttatag ttttaaaaaa gaaattcaag cccattaaaa
tatgtctggt caatgaaatg 7440cttcctttta ttgtgttgtg ctattgtact ttgtttttca
aaacattgta aaaatagtat 7500ctttggttta gtattttgga ttatatatta taatctgagg
agtgttttgc ttatgtagaa 7560tccagatata tttctgttac ctaggagatg ttacttacat
atgtaatact gtatcctgca 7620cgtggaaata ttcagaattg tagatagcat aactctccct
gctcctattc ttttgagcct 7680aggtataatt tttttttttt ttttagaaaa agacatattt
agctttaatt tctatttatg 7740ctaaacatat ttataagtag tctgtcaata taataccaac
tatttttatt tttacataat 7800tcaattattt catttgacat gtctggcaga ctcaagacat
taagtaaaaa attggaacta 7860tgatttttct ttgtcatttt ttaaaaaaga attattttat
taacctgctg gcatataatc 7920tggagttctt ttcacaacct tactttttct gatttgcttt
attgaatgat tgaatactca 7980tttctttcta aaaatatgtt gtaaattctc ccttggcaag
atttctccct atgagggtag 8040ttattatttg agtctgccaa gtggttacca tggggcaagg
tgccatgatg tattcttggg 8100tgcattggtt ttttgcgcat tgtaaattta agacacttat
agtaagtgga ctcattcata 8160gatgagtttc agaacctttt acgttctcgg tagaggcttc
tgtcggacag gcagaagagt 8220gtattcctca cttttttttt tgtcttcaaa ttccagtaag
gcatagcact tttaagaaat 8280tagaattttt ctatcatcta tgcaaatgat atttatgtta
atattaaata tcttatgtta 8340cactgggagt aatttgaggt gcaattattt ttattactac
tttgaataga ggaccattat 8400ccttctttct tcagaaaact aagaagtaag tgtaactttt
aaagtaagta tatatcagtg 8460agagtaggct tgttttacaa ctatttctag ccagtgagtt
gtgttttcat gtctcatcaa 8520aagacaatac cacattgcat cattttacaa aatatgttgt
cattttcatt tcagttgtaa 8580cataggaaaa tagatatttc ctagatgatt tctgagtttc
ttactgcaaa gaacagttat 8640aaattggtat acatgtgtct ctgtaatagg gataatattg
atatatctgt tgctacatat 8700ttaagaatca ttctatctta tgttgtcttg aggccaagat
ttaccacgtt tgcccagtgt 8760attgaattgg tggtagaagg tagttccatg ttccatttgt
agatctttaa gattttatct 8820ttgataactt taatagaatg tggctcagtt ctggtccttc
aagcctgtat ggtttggatt 8880ttcagtaggg gacagttgat gtggagtcaa tctctttggt
acacaggaag ctttataaaa 8940tttcattcac gaatctctta ttttgggaag ctgttttgca
tatgagaaga acactgttga 9000aataaggaac taaagcttta tatattgatc aaggtgattc
tgaaagtttt aatttttaat 9060gttgtaatgt tatgttattg ttaattgtac tttattatgt
attcaataga aaatcatgat 9120ttattaataa aagcttaaat tctcatctat ttaaaaaaaa
aaaaaaaaaa 9170203654DNAHomo sapiens 20agatgccgcg ggggccgctc
gcagccgccg ctgacttgtg aatgggaccg ggactggggc 60cgggactgac accgcagcgc
ttgccctgcg ccagggactg gcggctcgga ggttgcgtcc 120accctcaagg gccccagaaa
tcactgtgtt ttcagctcag cggccctgtg acattccttc 180gtgttgtcat ttgttgagtg
accaatcaga tgggtggagt gtgttacaga aattggcagc 240aagtatccaa tgggtgaaga
agaagctaac tggggacgtg ggcagccctg acgtgatgag 300ctcaaccagc agagacattc
catcccaaga gaggtctgcg tgacgcgtcc gggaggccac 360cctcagcaag accaccgtac
agttggtgga aggggtgaca gctgcattct cctgtgccta 420ccacgtaacc aaaaatgaag
gagaactact gtttacaagc cgccctggtg tgcctgggca 480tgctgtgcca cagccatgcc
tttgccccag agcggcgggg gcacctgcgg ccctccttcc 540atgggcacca tgagaagggc
aaggaggggc aggtgctaca gcgctccaag cgtggctggg 600tctggaacca gttcttcgtg
atagaggagt acaccgggcc tgaccccgtg cttgtgggca 660ggcttcattc agatattgac
tctggtgatg ggaacattaa atacattctc tcaggggaag 720gagctggaac catttttgtg
attgatgaca aatcagggaa cattcatgcc accaagacgt 780tggatcgaga agagagagcc
cagtacacgt tgatggctca ggcggtggac agggacacca 840atcggccact ggagccaccg
tcggaattca ttgtcaaggt ccaggacatt aatgacaacc 900ctccggagtt cctgcacgag
acctatcatg ccaacgtgcc tgagaggtcc aatgtgggaa 960cgtcagtaat ccaggtgaca
gcttcagatg cagatgaccc cacttatgga aatagcgcca 1020agttagtgta cagtatcctc
gaaggacaac cctatttttc ggtggaagca cagacaggta 1080tcatcagaac agccctaccc
aacatggaca gggaggccaa ggaggagtac cacgtggtga 1140tccaggccaa ggacatgggt
ggacatatgg gcggactctc agggacaacc aaagtgacga 1200tcacactgac cgatgtcaat
gacaacccac caaagtttcc gcagagcgta taccagatgt 1260ctgtgtcaga agcagccgtc
cctggggagg aagtaggaag agtgaaagct aaagatccag 1320acattggaga aaatggctta
gtcacataca atattgttga tggagatggt atggaatcgt 1380ttgaaatcac aacggactat
gaaacacagg agggggtgat aaagctgaaa aagcctgtag 1440attttgaaac caaaagagcc
tatagcttga aggtagaggc agccaacgtg cacatcgacc 1500cgaagtttat cagcaatggc
cctttcaagg acactgtgac cgtcaagatc tcagtagaag 1560atgctgatga gccccctatg
ttcttggccc caagttacat ccacgaagtc caagaaaatg 1620cagctgctgg caccgtggtt
gggagagtgc atgccaaaga ccctgatgct gccaacagcc 1680cgataaggta ttccatcgat
cgtcacactg acctcgacag atttttcact attaatccag 1740aggatggttt tattaaaact
acaaaacctc tggatagaga ggaaacagcc tggctcaaca 1800tcactgtctt tgcagcagaa
atccacaatc ggcatcagga agccaaagtc ccagtggcca 1860ttagggtcct tgatgtcaac
gataatgctc ccaagtttgc tgccccttat gaaggtttca 1920tctgtgagag tgatcagacc
aagccacttt ccaaccagcc aattgttaca attagtgcag 1980atgacaagga tgacacggcc
aatggaccaa gatttatctt cagcctaccc cctgaaatca 2040ttcacaatcc aaatttcaca
gtcagagaca accgagataa cacagcaggc gtgtacgccc 2100ggcgtggagg gttcagtcgg
cagaagcagg acttgtacct tctgcccata gtgatcagcg 2160atggcggcat cccgcccatg
agtagcacca acaccctcac catcaaagtc tgcgggtgcg 2220acgtgaacgg ggcactgctc
tcctgcaacg cagaggccta cattctgaac gccggcctga 2280gcacaggcgc cctgatcgcc
atcctcgcct gcatcgtcat tctcctggtc attgtagtat 2340tgtttgtgac cctgagaagg
caaaagaaag aaccactcat tgtctttgag gaagaagatg 2400tccgtgagaa catcattact
tatgatgatg aagggggtgg ggaagaagac acagaagcct 2460ttgatattgc caccctccag
aatcctgatg gtatcaatgg atttatcccc cgcaaagaca 2520tcaaacctga gtatcagtac
atgcctagac ctgggctccg gccagcgccc aacagcgtgg 2580atgtcgatga cttcatcaac
acgagaatac aggaggcaga caatgacccc acggctcctc 2640cttatgactc cattcaaatc
tacggttatg aaggcagggg ctcagtggcc gggtccctga 2700gctccctaga gtcggccacc
acagattcag acttggacta tgattatcta cagaactggg 2760gacctcgttt taagaaacta
gcagatttgt atggttccaa agacactttt gatgacgatt 2820cttaacaata acgatacaaa
tttggcctta agaactgtgt ctggcgttct caagaatcta 2880gaagatgtgt aaacaggtat
ttttttaaat caaggaaagg ctcatttaaa acaggcaaag 2940ttttacagag aggatacatt
taataaaact gcgaggacat caaagtggta aatactgtga 3000aatacctttt ctcacaaaaa
ggcaaatatt gaagttgttt atcaacttcg ctagaaaaaa 3060aaaacacttg gcatacaaaa
tatttaagtg aaggagaagt ctaacgctga actgacaatg 3120aagggaaatt gtttatgtgt
tatgaacatc caagtctttc ttctttttta agttgtcaaa 3180gaagcttcca caaaattaga
aaggacaaca gttctgagct gtaatttcgc cttaaactct 3240ggacactcta tatgtagtgc
atttttaaac ttgaaatata taatattcag ccagcttaaa 3300cccatacaat gtatgtacaa
tacaatgtac aattatgtct cttgagcatc aatcttgtta 3360ctgctgattc ttgtaaatct
ttttgcttct actttcatct taaactaata cgtgccagat 3420ataactgtct tgtttcagtg
agagacgccc tatttctatg tcatttttaa tgtatctatt 3480tgtacaattt taaagttctt
attttagtat acgtataaat atcagtattc tgacatgtaa 3540gaaaatgtta cggcatcaca
cttatatttt atgaacattg tactgttgct ttaatatgag 3600cttcaatata agaagcaatc
tttgaaataa aaaaagattt ttttttaaaa aaaa 3654213698DNAHomo sapiens
21ggaagaggga gtgttcccgg gggagatact ccagtcgtag caagagtctc gaccactgaa
60tggaagaaaa ggacttttaa ccaccatttt gtgacttaca gaaaggaatt tgaataaaga
120aaactatgat acttcaggcc catcttcact ccctgtgtct tcttatgctt tatttggcaa
180ctggatatgg ccaagagggg aagtttagtg gacccctgaa acccatgaca ttttctattt
240atgaaggcca agaaccgagt caaattatat tccagtttaa ggccaatcct cctgctgtga
300cttttgaact aactggggag acagacaaca tatttgtgat agaacgggag ggacttctgt
360attacaacag agccttggac agggaaacaa gatctactca caatctccag gttgcagccc
420tggacgctaa tggaattata gtggagggtc cagtccctat caccataaaa gtgaaggaca
480tcaacgacaa tcgacccacg tttctccagt caaagtacga aggctcagta aggcagaact
540ctcgcccagg aaagcccttc ttgtatgtca atgccacaga cctggatgat ccggccactc
600ccaatggcca gctttattac cagattgtca tccagcttcc catgatcaac aatgtcatgt
660actttcagat caacaacaaa acgggagcca tctctcttac ccgagaggga tctcaggaat
720tgaatcctgc taagaatcct tcctataatc tggtgatctc agtgaaggac atgggaggcc
780agagtgagaa ttccttcagt gataccacat ctgtggatat catagtgaca gagaatattt
840ggaaagcacc aaaacctgtg gagatggtgg aaaactcaac tgatcctcac cccatcaaaa
900tcactcaggt gcggtggaat gatcccggtg cacaatattc cttagttgac aaagagaagc
960tgccaagatt cccattttca attgaccagg aaggagatat ttacgtgact cagcccttgg
1020accgagaaga aaaggatgca tatgtttttt atgcagttgc aaaggatgag tacggaaaac
1080cactttcata tccgctggaa attcatgtaa aagttaaaga tattaatgat aatccaccta
1140catgtccgtc accagtaacc gtatttgagg tccaggagaa tgaacgactg ggtaacagta
1200tcgggaccct tactgcacat gacagggatg aagaaaatac tgccaacagt tttctaaact
1260acaggattgt ggagcaaact cccaaacttc ccatggatgg actcttccta atccaaacct
1320atgctggaat gttacagtta gctaaacagt ccttgaagaa gcaagatact cctcagtaca
1380acttaacgat agaggtgtct gacaaagatt tcaagaccct ttgttttgtg caaatcaacg
1440ttattgatat caatgatcag atccccatct ttgaaaaatc agattatgga aacctgactc
1500ttgctgaaga cacaaacatt gggtccacca tcttaaccat ccaggccact gatgctgatg
1560agccatttac tgggagttct aaaattctgt atcatatcat aaagggagac agtgagggac
1620gcctgggggt tgacacagat ccccatacca acaccggata tgtcataatt aaaaagcctc
1680ttgattttga aacagcagct gtttccaaca ttgtgttcaa agcagaaaat cctgagcctc
1740tagtgtttgg tgtgaagtac aatgcaagtt cttttgccaa gttcacgctt attgtgacag
1800atgtgaatga agcacctcaa ttttcccaac acgtattcca agcgaaagtc agtgaggatg
1860tagctatagg cactaaagtg ggcaatgtga ctgccaagga tccagaaggt ctggacataa
1920gctattcact gaggggagac acaagaggtt ggcttaaaat tgaccacgtg actggtgaga
1980tctttagtgt ggctccattg gacagagaag ccggaagtcc atatcgggta caagtggtgg
2040ccacagaagt aggggggtct tccttgagct ctgtgtcaga gttccacctg atccttatgg
2100atgtgaatga caaccctccc aggctagcca aggactacac gggcttgttc ttctgccatc
2160ccctcagtgc acctggaagt ctcattttcg aggctactga tgatgatcag cacttatttc
2220ggggtcccca ttttacattt tccctcggca gtggaagctt acaaaacgac tgggaagttt
2280ccaaaatcaa tggtactcat gcccgactgt ctaccaggca cacagagttt gaggagaggg
2340agtatgtcgt cttgatccgc atcaatgatg ggggtcggcc acccttggaa ggcattgttt
2400ctttaccagt tacattctgc agttgtgtgg aaggaagttg tttccggcca gcaggtcacc
2460agactgggat acccactgtg ggcatggcag ttggtatact gctgaccacc cttctggtga
2520ttggtataat tttagcagtt gtgtttatcc gcataaagaa ggataaaggc aaagataatg
2580ttgaaagtgc tcaagcatct gaagtcaaac ctctgagaag ctgaatttga aaaggaatgt
2640ttgaatttat atagcaagtg ctatttcagc aacaaccatc tcatcctatt acttttcatc
2700taacgtgcat tataattttt taaacagata ttccctcttg tcctttaata tttgctaaat
2760atttcttttt tgaggtggag tcttgctctg tcgcccaggc tggagtacag tggtgtgatc
2820ccagctcact gcaacctccg cctcctgggt tcacatgatt ctcctgcctc agcttcctaa
2880gtagctgggt ttacaggcac ccaccaccat gcccagctaa tttttgtatt tttaatagag
2940acggggtttc gccatttggc caggctggtc ttgaactcct gacgtcaagt gatctgcctg
3000ccttggtctc ccaatacagg catgaaccac tgcacccacc tacttagata tttcatgtgc
3060tatagacatt agagagattt ttcatttttc catgacattt ttcctctctg caaatggctt
3120agctacttgt gtttttccct tttggggcaa gacagactca ttaaatattc tgtacatttt
3180ttctttatca aggagatata tcagtgttgt ctcatagaac tgcctggatt ccatttatgt
3240tttttctgat tccatcctgt gtccccttca tccttgactc ctttggtatt tcactgaatt
3300tcaaacattt gtcagagaag aaaaacgtga ggactcagga aaaataaata aataaaagaa
3360cagccttttc ccttagtatt aacagaaatg tttctgtgtc attaaccatc tttaatcaat
3420gtgacatgtt gctctttggc tgaaattctt caacttggaa atgacacaga cccacagaag
3480gtgttcaaac acaacctact ctgcaaacct tggtaaagga accagtcagc tggccagatt
3540tcctcactac ctgccatgca tacatgctgc gcatgttttc ttcattcgta tgttagtaaa
3600gttttggtta ttatatattt aacatgtgga agaaaacaag acatgaaaag agtggtgaca
3660aatcaagaat aaacactggt tgtagtcagt tttgtttg
3698226241DNAHomo sapiens 22atactgacag tgttgtggta ctaaaagcac aagcgtctgt
aactctgggc aatggggcac 60atcgagagtt tgctgagaag actgtgaagc aaaaagaaga
aagtttttcc tactcttcct 120tatgtgtcca acacgaagtt tgctgttcag ttttcacaga
acttctagaa gttgaagtta 180caaaggtata tagaaggtac acagaatcag aaaagattat
aaaagaaagc aagatttttg 240ttagtgacgt cctgtttcct ctgaagagta atagttggaa
tcaaaagagt caacgcaatg 300aactgttatt tactgctgcg ttttatgttg ggaattcctc
tcctatggcc ttgtcttgga 360gcaacagaaa actctcaaac aaagaaagtc aagcagccag
tgcgatctca tttgagagtg 420aagcgtggct gggtgtggaa ccaatttttt gtaccagagg
aaatgaatac gactagtcat 480cacatcggcc agctaagatc tgatttagac aatggaaaca
attctttcca gtacaagctt 540ttgggagctg gagctggaag tacttttatc attgatgaaa
gaacaggtga catatatgcc 600atacagaagc ttgatagaga ggagcgatcc ctctacatct
taagagccca ggtaatagac 660atcgctactg gaagggctgt ggaacctgag tctgagtttg
tcatcaaagt ttcggatatc 720aatgacaatg aaccaaaatt cctagatgaa ccttatgagg
ccattgtacc agagatgtct 780ccagaaggaa cattagttat ccaggtgaca gcaagtgatg
ctgacgatcc ctcaagtggt 840aataatgctc gtctcctcta cagcttactt caaggccagc
catatttttc tgttgaacca 900acaacaggag tcataagaat atcttctaaa atggatagag
aactgcaaga tgagtattgg 960gtaatcattc aagccaagga catgattggt cagccaggag
cgttgtctgg aacaacaagt 1020gtattaatta aactttcaga tgttaatgac aataagccta
tatttaaaga aagtttatac 1080cgcttgactg tctctgaatc tgcacccact gggacttcta
taggaacaat catggcatat 1140gataatgaca taggagagaa tgcagaaatg gattacagca
ttgaagagga tgattcgcaa 1200acatttgaca ttattactaa tcatgaaact caagaaggaa
tagttatatt aaaaaagaaa 1260gtggattttg agcaccagaa ccactacggt attagagcaa
aagttaaaaa ccatcatgtt 1320cctgagcagc tcatgaagta ccacactgag gcttccacca
ctttcattaa gatccaggtg 1380gaagatgttg atgagcctcc tcttttcctc cttccatatt
atgtatttga agtttttgaa 1440gaaaccccac agggatcatt tgtaggcgtg gtgtctgcca
cagacccaga caataggaaa 1500tctcctatca ggtattctat tactaggagc aaagtgttca
atatcaatga taatggtaca 1560atcactacaa gtaactcact ggatcgtgaa atcagtgctt
ggtacaacct aagtattaca 1620gccacagaaa aatacaatat agaacagatc tcttcgatcc
cactgtatgt gcaagttctt 1680aacatcaatg atcatgctcc tgagttctct caatactatg
agacttatgt ttgtgaaaat 1740gcaggctctg gtcaggtaat tcagactatc agtgcagtgg
atagagatga atccatagaa 1800gagcaccatt tttactttaa tctatctgta gaagacacta
acaattcaag ttttacaatc 1860atagataatc aagataacac agctgtcatt ttgactaata
gaactggttt taaccttcaa 1920gaagaacctg tcttctacat ctccatctta attgccgaca
atggaatccc gtcacttaca 1980agtacaaaca cccttaccat ccatgtctgt gactgtggtg
acagtgggag cacacagacc 2040tgccagtacc aggagcttgt gctttccatg ggattcaaga
cagaagtcat cattgctatt 2100ctcatttgca ttatgatcat atttgggttt atttttttga
ctttgggttt aaaacaacgg 2160agaaaacaga ttctatttcc tgagaaaagt gaagatttca
gagagaatat attccaatat 2220gatgatgaag ggggtggaga agaagataca gaggcctttg
atatagcaga gctgaggagt 2280agtaccataa tgcgggaacg caagactcgg aaaaccacaa
gcgctgagat caggagccta 2340tacaggcagt ctttgcaagt tggccccgac agtgccatat
tcaggaaatt cattctggaa 2400aagctcgaag aagctaatac tgatccgtgt gcccctcctt
ttgattccct ccagacctac 2460gcttttgagg gaacagggtc attagctgga tccctgagct
ccttagaatc agcagtctct 2520gatcaggatg aaagctatga ttaccttaat gagttgggac
ctcgctttaa aagattagca 2580tgcatgtttg gttctgcagt gcagtcaaat aattagggct
ttttaccatc aaaattttta 2640aaagtgctaa tgtgtattcg aacccaatgg tagtcttaaa
gagttttgtg ccctggctct 2700atggcgggga aagccctagt ctatggagtt ttctgatttc
cctggagtaa atactccatg 2760gttattttaa gctacctaca tgctgtcatt gaacagagat
gtggggagaa atgtaaacaa 2820tcagctcaca ggcatcaata caaccagatt tgaagtaaaa
taatttagga agatattaaa 2880agtagatgag aggacacaag atgtagtcga tccttatgcg
attatatcat tatttactta 2940ggaaagagta aaaataccaa acgagaaaat ttaaaggagc
aaaaatttgc aagtcaaata 3000gaaatgtaca aatcgagata acatttacat ttctatcata
ttgacatgaa aattgaaaat 3060gtatagtcag agaaattttc atgaattatt ccatgaagta
ttgtttcctt tatttaaaaa 3120aaaaaaaaaa aaaaaagaat gctaggtaat cttcgtagaa
aactagaaag tatgataaac 3180aacagttgga ggaatccatg gaaaatagac gagaaaatgt
aaataaggct ttctggggat 3240cacagaactt ttgtatgaat aaaagcctta taaaaccagc
tctggcatat gactaaaata 3300ttcctctact atttttcaat gtcatctaat gcaaacgctc
aaccttttta aatggttaat 3360gtgagaaagg cactgaatta agatatctta gtgttatttt
atttactcaa ttttcatctt 3420tcacactgta agaataaaat gtttttgtgg ttaagttcct
gatactatat agagataaac 3480tttaaaactt taacacctag gataatgtgg gtgattacat
aaatgtttaa aaacatttac 3540cttatatgga aagcagattt ctgactaaga tgccatgatt
cattctagtt tgatatttta 3600aaactttgcc aagagcactc tttaaaactg atttcaaatt
tgaacagata cttggtgtat 3660acaaattaat cagagttata tagtataaat ttgtattatt
tttgatatta caactttgta 3720ttaaaaagga aaatatattt aggtgtttgt catagatttt
ttttcatatg caactcaaaa 3780cacattctta tgtcaaaata caatagcaaa atacaagtca
atatttactc ataattaaat 3840gcaaacttta tgaaactggt ggaaagtgtg gaaaaattta
agtaaatttt tacctctact 3900aaaaattatt tagacaactt aaaggaatac tggattcaaa
cttcagttac tattcactgt 3960aaatattatg atcaactgaa ctgacttacc attcatggga
taaaacatga tacaattctc 4020agtatttgct tttctgttaa gagcttggaa tatgactgat
gcttcagtgt cattttttta 4080ttattgaata taaaattagg tgcaatagat acatttattg
tagccttaag aaggaaatca 4140atgctaagct gttgggcata aaactttaca gtaacattag
ataacaagat aataaaacac 4200tttaatttta atcgaccttt taagaatgat ctttattatt
ctatccaaat caaggtagac 4260ttcaaaattt atggtttgaa caggaagaaa gtagtgaata
caaaattctt aaaattggtt 4320catatgatca tttttttatt aatagcatgt gagactaagc
tgaaccaaac tggccatata 4380tgttaaatca aataatttga gtaattttga aatattgatg
aagtagttaa atggttataa 4440tatattttat tggtactttg aaatttaaac atctatatca
cataataaag aaagtttttc 4500atcaatctga aacaggcacc attaaattgg tgacacaaat
attatcattt cacaaaagct 4560tgagaaaacc agatttgaag tgaaataatt taggaagata
ttaaaagtgg atgagaggac 4620ataagatgta gtcaatcctt aacgcaatga taccattatt
aggaaagagg aaaaatacca 4680aacaagagaa aatttaaagg agcaaaaatt tgcaagtcaa
atagaaatgc acaaatcgag 4740agaacattta catttctatc acattgacat gaaaattgaa
aatgtatagt cagagaaatt 4800ttcatgaatt attccatgaa gtgttgtttc ctttatttaa
aaagaatgct aggtaatctt 4860agtagaaaac taaaaagtat gataaacaac agatggaaga
attcatggaa aatagacgag 4920aaaaagtaaa taaggctttc tggagatgac ggaaattttg
tatgaataaa agccttataa 4980acaatgaata catttggata aacaactcat taaaatgcac
attaatggaa tacctgtaca 5040aatagtttta atacttgtgt ccagtggcac taaagagata
tattaaaata gattctggtt 5100gagttgatta taagattgtc aatcatagga tacacaggga
aagcacagac acgattgctc 5160atctctttgg tagtctaaaa tatgttcctt ttaaccacat
caaaagatac tatctgattc 5220taatacagaa ggtgcaactt cctaggaatt atactttgta
ttatattcaa taattatttt 5280atgttttgca tcacatgggt gttgcaagaa cacagctctt
aaactaaaaa ggcattacat 5340taagaattta tttttgtaaa ttttaataaa ctctaaatta
atgcaaaata tttatgaaaa 5400tagcacagct tcacattaat acccacagta cttaaatatc
tcccttagtt gactcacagc 5460tttgtatgta agcttcaaat ttttaatggt aaaaataaga
aaattaggac tagcgaactg 5520ctagttgttc cctcagtgca tgtgcttgag cctaacagat
ctttgtttga attagttact 5580ttcaattaag cgaacatata atatttgtta aactgcttaa
tctttccaga tctcactttg 5640taaaatgacg tccaatcaac ctatttcaca tggttgttat
gaagaattaa taataagata 5700tgtaaaacca gcagcaccaa tagtgcctga caaaatataa
ctgctcaata aataacagtg 5760aattattata acaacagcta atggtattct attggaacac
atggtctgtt tcttttccaa 5820ttctgagtga tctcaacagc ctcacggcac attgaaaata
aaagtcaact tcctcacctt 5880ccccataggc catggacttt cccatctcta cccacttgtg
cagtttctcc tctttcctct 5940ctcctcctca gttactaaag caatcttcct tcaaacactt
ctattttcca ttacatcact 6000tcattcaggt tggtgatcag ctgcaacttc ctcagaaaga
ccactgctgc ttccagtgac 6060cctggctttg tcttgctttg tattagatta ctcagctccc
tggtattatt gttgtcttta 6120tcgcttatta gaatgtaacc cttaatgggc aagaaattgt
cagtcttgtt catttatcat 6180cgtataaaaa atgtttctag cacacagaaa gtgaccaata
aatatttctg gaattaatgt 6240a
6241232601DNAHomo sapiens 23gaggctcagc acagaaggag
gaaggacagc agggccaaca gtcacagcag ccctgaccag 60agcattcctg gagctcaagc
tcctctacaa agaggtggac agagaagaca gcagagacca 120tgggaccccc ctcagcccct
ccctgcagat tgcatgtccc ctggaaggag gtcctgctca 180cagcctcact tctaaccttc
tggaacccac ccaccactgc caagctcact attgaatcca 240cgccgttcaa tgtcgcagag
gggaaggagg ttcttctact cgcccacaac ctgccccaga 300atcgtattgg ttacagctgg
tacaaaggcg aaagagtgga tggcaacagt ctaattgtag 360gatatgtaat aggaactcaa
caagctaccc cagggcccgc atacagtggt cgagagacaa 420tataccccaa tgcatccctg
ctgatccaga acgtcaccca gaatgacaca ggattctata 480ccctacaagt cataaagtca
gatcttgtga atgaagaagc aaccggacag ttccatgtat 540acccggagct gcccaagccc
tccatctcca gcaacaactc caaccccgtg gaggacaagg 600atgctgtggc cttcacctgt
gaacctgagg ttcagaacac aacctacctg tggtgggtaa 660atggtcagag cctcccggtc
agtcccaggc tgcagctgtc caatggcaac atgaccctca 720ctctactcag cgtcaaaagg
aacgatgcag gatcctatga atgtgaaata cagaacccag 780cgagtgccaa ccgcagtgac
ccagtcaccc tgaatgtcct ctatggccca gatgtcccca 840ccatttcccc ctcaaaggcc
aattaccgtc caggggaaaa tctgaacctc tcctgccacg 900cagcctctaa cccacctgca
cagtactctt ggtttatcaa tgggacgttc cagcaatcca 960cacaagagct ctttatcccc
aacatcactg tgaataatag cggatcctat atgtgccaag 1020cccataactc agccactggc
ctcaatagga ccacagtcac gatgatcaca gtctctggaa 1080gtgctcctgt cctctcagct
gtggccaccg tcggcatcac gattggagtg ctggccaggg 1140tggctctgat atagcagccc
tggtgtattt tcgatatttc aggaagactg gcagattgga 1200ccagaccctg aattcttcta
gctcctccaa tcccatttta tcccatggaa ccactaaaaa 1260caaggtctgc tctgctcctg
aagccctata tgctggagat ggacaactca atgaaaattt 1320aaagggaaaa ccctcaggcc
tgaggtgtgt gccactcaga gacttcacct aactagagac 1380agtcaaactg caaaccatgg
tgagaaattg acgacttcac actatggaca gcttttccca 1440agatgtcaaa acaagactcc
tcatcatgat aaggctctta ccccctttta atttgtcctt 1500gcttatgcct gcctctttcg
cttggcagga tgatgctgtc attagtattt cacaagaagt 1560agcttcagag ggtaacttaa
cagagtgtca gatctatctt gtcaatccca acgttttaca 1620taaaataaga gatcctttag
tgcacccagt gactgacatt agcagcatct ttaacacagc 1680cgtgtgttca aatgtacagt
ggtccttttc agagttggac ttctagactc acctgttctc 1740actccctgtt ttaattcaac
ccagccatgc aatgccaaat aatagaattg ctccctacca 1800gctgaacagg gaggagtctg
tgcagtttct gacacttgtt gttgaacatg gctaaataca 1860atgggtatcg ctgagactaa
gttgtagaaa ttaacaaatg tgctgcttgg ttaaaatggc 1920tacactcatc tgactcattc
tttattctat tttagttggt ttgtatcttg cctaaggtgc 1980gtagtccaac tcttggtatt
accctcctaa tagtcatact agtagtcata ctccctggtg 2040tagtgtattc tctaaaagct
ttaaatgtct gcatgcagcc agccatcaaa tagtgaatgg 2100tctctctttg gctggaatta
caaaactcag agaaatgtgt catcaggaga acatcataac 2160ccatgaagga taaaagcccc
aaatggtggt aactgataat agcactaatg ctttaagatt 2220tggtcacact ctcacctagg
tgagcgcatt gagccagtgg tgctaaatgc tacatactcc 2280aactgaaatg ttaaggaaga
agatagatcc aattaaaaaa aattaaaacc aatttaaaaa 2340aaaaaagaac acaggagatt
ccagtctact tgagttagca taatacagaa gtcccctcta 2400ctttaacttt tacaaaaaag
taacctgaac taatctgatg ttaaccaatg tatttatttc 2460tgtggttctg tttccttgtt
ccaatttgac aaaacccact gttcttgtat tgtattgccc 2520agggggagct atcactgtac
ttgtagagtg gtgctgcttt aattcataaa tcacaaataa 2580aagccaatta gctctataac t
2601246885DNAHomo sapiens
24gacgacgttt gggagccttt gctgagtcca gggagagagg cgtcccccac cgtgccgctg
60cagctcgggc agagccgcca agctttgggg tgctgaggaa cctctaatca tctcccatgg
120atttgtgatc agcgttgcag ctctcccagc agccctggac agtggccccc agcagtcagc
180atgtggctgc cgcgcgtctc cagcacagca gtgaccgcgc tcctcctggc gcagaccttc
240ctcctcctct ttctggtttc ccggccaggg ccctcgtccc cagcaggcgg cgaggcgcgc
300gtgcatgtgc tggtgctgtc ctcgtggcgc tcgggctcgt ccttcgtggg ccaactcttc
360aaccagcacc ccgacgtctt ctacctaatg gagcccgcgt ggcacgtgtg gaccaccctg
420tcgcagggca gcgccgcaac gctgcacatg gctgtgcgcg acctggtgcg ctccgtcttc
480ctgtgcgaca tggacgtgtt tgatgcctat ctgccttggc gccgcaacct gtccgacctc
540ttccagtggg ccgtgagccg tgcactgtgc tcgccacccg cctgcagtgc ctttccccga
600ggcgccatca gcagcgaggc cgtgtgcaag ccactgtgcg cgcggcagtc cttcaccctg
660gcccgggagg cctgccgctc ctacagccac gtggtgctca aggaggtgcg cttcttcaac
720ctgcaggtgc tctacccgct gctcagcgac cccgcgctca acctacgcat cgtgcacctg
780gtgcgcgacc cgcgggccgt gctgcgctcc cgggagcaga cagccaaggc tctggcgcgt
840gacaacggca tcgtgctggg caccaacggc acgtgggtgg aggccgaccc cggcctgcgc
900gtggtgcgcg aggtgtgccg tagccacgta cgcatcgccg aggccgccac actcaagccg
960ccaccctttc tgcgcggccg ctaccgcctg gtgcgcttcg aggacctggc gcgggagccg
1020ctggcagaaa tccgtgcgct ctacgccttc actgggctca gtctcacgcc acagctcgag
1080gcctggatcc ataacatcac ccacggatct ggacctggtg cgcgccgcga agccttcaag
1140acttcgtcca ggaatgcgct caacgtctcc caggcctggc gccatgcgct gccctttgcc
1200aagatccgcc gcgtgcagga actgtgcgct ggtgcgctgc agctgctggg ctaccggcct
1260gtgtactctg aggacgagca gcgcaacctc gcccttgatc tggtgctgcc acgaggcctg
1320aacggcttca cttgggcatc atccaccgcc tcgcaccccc gaaattagtg gaggccacag
1380ttgtagcagg cgctaggccc gggaggagag tgcatggtgc agagggggct ggggcgcacg
1440gagaagcagg tccctatatt gaccaaggag tttgtggtac gacccctccc cctccccaag
1500taggcaagga ctgcacgttt ctttctctct tgattcttgg ttttcctttg agtcctctgg
1560agctgccttc tcatcaggtg cactcttcat ggaaagcaac tcttgcccct cctcctctgg
1620gcacagggtg tgcgttcaga tgacttggct cctactcaag ggctttcttc ccctttaact
1680ctctccttct ggtgacacat cctgcagcag ctgagggggt gccctggcac tggctgggag
1740tggagaggca ctgtggtgaa atggctccag aggtctgtac atcacataca tatgcacaca
1800ggcacacatg gcaaaactcg gaagtgaaag gacttgtctg aaatcacatg gtgagaagga
1860ggatgaaggg aggagagagc ttttgctctg ggtctccagt ggataggaga ggacctgcct
1920cctgggtgag aagggtcaga ttttcctatt ttaattgctt tagggaagag caagcagagt
1980catgaccagg gacacagctg agagatagag gaggctgtga atgctgagac cagagtttat
2040catgctggac aagcctggaa ggaggcaata agtgggaaag gtaggaggag agaaggctgg
2100ggagggctgg gcagcaagcc aggcacagtg agtggcagag caagaggggg aaagcaggat
2160cagtgcctgg aaggcaggtg tgcccgtcag cggggagtgg aactcatcag gcttgccaag
2220aggttggaag ggaaatggct ctgggctgga actgtcttcc cttggtcctt ctggtccagg
2280ccttggagga aagcagagga tgatccctgc ctgtgagcca cacctcctag ctctgggggc
2340aaaggggctt agtaaaggaa tgctggatgt gtagagggtt tagtcccgag ctcaggaaat
2400gagagcctat aagtgcccag tacatgttta aaagaagagc tcatggaacc tctggaaagg
2460acagggaagt tgagttagcc acataaatga acccaagtca cattggaaca cagagctggt
2520ctgggaactg tgttggctgc caacagaact tctgaccctg ttacctgtga aatgaggcag
2580tttccctcac gttgccatca gctaccagga gcgatgctgg tggtcactag cttctgatcc
2640tcatcctggg tgtggccaca gattggggga acctggattg tggagtcaca tcctccctgc
2700aaagcaagca gggcaaggga gatctggcat tttctgcttt acgtggaggg agaacaggca
2760cattagcctt gaagctgaag ctcattttag gttccttcca ggtttagaag cttcaaccaa
2820atgaaacttg aatctgtccc tcgtgacaat tataggagga aggtatttaa aaccccagat
2880ttatgaatgt gtactacatg gcttagagaa tgtctttgtt cttgttcagg tggttataac
2940aaaatacctt aagagtgggt aacttggctg gatgcagtgg ctcatgcctg taatcccagc
3000actgtgggag gccgaggggg atgaatcacc tgaggtcagg cattcaagaa cagcctggcc
3060aacatggcga agcccctcct ctactaaaaa tacaaaatta gcgaggcatg gtcgcacata
3120cctgtaatcc cagctcctcg ggaagctgag gcaggagaat cgcttgaacc caggaggcgg
3180aggttgcagt gagccaagat cacgccattg cactccagcc tgggtgacag agcaagactc
3240catctcaaaa aaaaaaaaaa gactgggtaa cttataaaca aatgttcttc tcacaagtct
3300ggagactggg aagtccaaga tcaagccacc agtgctgtct gatgagggcc cactttttca
3360aagacagtgc cttctagctg tgtcctctta tcgtagaaga tgggagacag ctctccaggg
3420ccattttttt tttttttttt ttttttttga gatggagtct ggctctgtcg cccaggctgg
3480agtgcagtgg cacaatctcg gctcactgca acctctgcct ctctctgcct cctgagttca
3540agcaattctc ctgcctcagc ctcctgagga gctgggacta cagggatgca ccaccatgcc
3600cagctaattt ttgtattttt gtagacactg ggtttcacca tattggccag gttggtctca
3660aactcctgac ctcaagtgat ctgcccacct cagcctccca aagtgctgag attacaggca
3720tgagccactg tacccagtct ccagggcctt ttaaagaatg tcactaatcc cattcttgag
3780gtctccacct tcattatcta atcacctccc aaaggctcca catcccaaca ccatcatatt
3840gtgggttaag atttcaacca caagccaggc gtggtggctc atgcctgtaa tcccagcatt
3900ttggaaggct gaggcaggtg gatcacttga ggtcaggagt ttgagaccag cctggccaac
3960atggtgaaac cccatctcta ctaaaaataa aaaaaattag ccgggtgtgg tggtgcacac
4020ctgtaattcc agctactcag gaggctgagg caggagaatc ccttgaatcc aggaggcgga
4080cagtgcagtg agccgagata atgccactgc actccagcct ggatgacaga gcaagactcc
4140atctcatgcc cagccagcat gcccaacaag cttcatttgc ccctgtttag gtcacaaatt
4200ttattgatgg ctgcaattaa tggcctcttg gtatccaagt cctttgttgt atgacccatc
4260cattctcccc tgactcccaa ggtgtcagga catgcttgac tggctcctga atttgctctc
4320tgcgcatggg cagtacagtc aagcctcaca gtgaacccag gtcagctttc aggacaaaga
4380aagtggcctg gctgactagg cacagtaaag ccagggctgg gtaggtacat acttgtgctg
4440atcacgtatg tcttatatct ctgtgagagt gcagtcccaa caggaaggtt taatcactgg
4500ggactgccca atgctgtgac agggcacaga gctctgggtt gctgtggggg tgactgcatt
4560gaccactgtt agtggtttgc tgtgttgaca ctctgtgctg tgtgaccatg gctcctgcca
4620tcaagaagta gagtctgttt ctccacctct gaatccaggc tggtcctgtg acttgctttg
4680tcctgtagac aagtgtagtg caacttcctg tgagccagtt tgaagcatag gccttggaag
4740caaaacttta cctccacctg tcttaggttt tcagctgggg ctctgctgtg atttgattgt
4800gtctcccaaa gttggaacct tgatccccag tgttgtgagg tgaggcttga tggaaagtaa
4860attacgccgt gcgggttatg cccttgtgaa tgggtagaga acattatttc tgggcgcagg
4920catggtagct catgtctgta atcccagcat tttgggaggt tgaggtgtgc ggattacttg
4980aggtcaggag tttgagacca gcctggccaa caaggtaaaa caccatttct agttaaaata
5040caaaaattag ccaggtgtgg tggcacatgc ctgtaaggcc agctacttgg gaggctgaga
5100caggagaatc gcttgaaccc aggaggcaga gattgcagtg agctgagatc gcaccactgt
5160actccagcct gggcaacaaa gcgagagtct gtcttaaaaa aaaccaccat tatttcagga
5220gtgagttggt tatcctgaga gtggtgcctt ttaaatgaag gagttcattc tttgtctttc
5280tctcgccctc actttgccct tctgccatat gatgccttcc atcatgctag gacacagcaa
5340gaaggctctc gctagatgct ggctccttga tcttgggctt cccagcctcc agaactgtaa
5400gccaatacac ttctatttat tatatatgac ccttgctggg ttcagtggct cacgcctgta
5460atcccaatac tttgcaaggc tgaggcagga ggatcacttg agaccaggca ctcaagacca
5520gcctgggcaa catagtgaga ccccatctct acaaagttaa aaaaaaatta gcagggcatg
5580gtgtcgtgca cctgtagtcc tagctacttg ggaggctgag ttgggaggac tgcttgaccc
5640tgggaggttg aggctacatt gaaccatgat catgccagtg cactccagcc tgagtgacag
5700agcaagacac ctatctctaa ataaatgacc ccatctgtgg tattgttata gcaaaacaaa
5760acagattaag agagactttt taatgaaaag acagattcac aaagaaaaac aatgtttttg
5820tttctgtttt tttgaggcag agtcttgctc ttgtccccca ggctggagtg cagtggcgcc
5880atcttggctc actgcaacct ccgcctccca gtttcaagcg attctcctgc ctcagcctcc
5940cgagtagctg ggattacaga tgtacaccac cacgcccggc taattttttt tgtattttta
6000gtagagatgg ggtttcacca tgtcgatcag gctgggctgg aactcctgac ctcaggtgat
6060ccacctgcct tggcctccca atgtgctagg attacaggca tgagccactg tacctggcga
6120aaaacagttt gttaacacag gcagccaaca tcactcagga taagcctcaa tgaaaagtaa
6180caaagtgatg gcttggaaca ctgtcttaca cagcattttt aaaaaataca ataaatttgt
6240agagatagga tgaccaagga caacagtttt aggcttccaa aggtggtaaa ctatgggatg
6300gtaaatatcc gagaggaagc tgatgcaaca ggatttgtct gcagcagcct ctggtaccac
6360ctctgagtca agggttgtgt ccagtgatgg agagtttata tcgtgccttt aggcagaaaa
6420ggggagggaa acctgaactt ttcctgcact ttctgcttct taattgcctt cagctgaaaa
6480tcatttttta tgtgaaaaag gcatagtctg agctgacgcc tctgctttcc tccacctgaa
6540gagaacctgc gtgctgctcc tttgcttcgg acctccgcct ctgcccggga gaaagcccag
6600gccagcctgc tggacaagca gagaccatga gaaggagagt tcaggggtcc caaaccaggc
6660catcctagac cagccagctc cagctgatcc gcacgcagcc acttcggcta ccttctactg
6720gccaaaggga gtcccagggc tcacccagat tcagaggtgg ggaaactgag tccaccactt
6780gagaagagta gctataaaga catatgagcg aggccagctg agcccagcac tgcggccaag
6840tcgaagactt taggagcaat aaaagtgctt attgtgtttc agtca
6885251174DNAHomo sapiens 25aatgtccatt agcataaccc ttcctcagga agagtgagat
tttatatttg acaataaagt 60gttagactcc atttctaaat accagacttc aaaagataag
gttcaaaagt gttataagaa 120gatattcctt tttttgtcct agagaactta ttttcctgtg
aaaatgccta ccacaaagaa 180gacattgatg ttcttatcaa gctttttcac cagccttggg
tccttcattg taatttgctc 240tattcttggg acacaagcat ggatcaccag tacaattgct
gttagagact ctgcttcaaa 300tgggagcatt ttcatcactt acggactttt tcgtggggag
agtagtgaag aattgagtca 360cggacttgca gaaccaaaga aaaagtttgc agttttagag
atactgaata attcttccca 420aaaaactctg cattcggtga ctatcctgtt cctggtcctg
agtttgatca cgtcgctgct 480gagctctggg tttaccttct acaacagcat cagcaaccct
taccagacat tcctggggcc 540gacgggggtg tacacctgga acgggctcgg tgcatccttc
gtttttgtga ccatgatact 600gtttgtggcg aacacgcagt ccaaccaact ctccgaagag
ttgttccaaa tgctttaccc 660ggcaaccacc agtaaaggaa cgacccacag ttacggatac
tcgttctggc tcatactgct 720cgtcattctt ctaaatatag tcactgtaac catcatcatt
ttctaccaga aggccagata 780ccagcggaag caggagcaga gaaagccaat ggaatatgct
ccaagggacg gaattttatt 840ctgaattctc tttcatctca ttttggcgtt gcatctattg
tacatcagcc ctgagtagta 900actggttagc ttctctggac aattcagcat ggtaacgtga
ctgtcatctg tgacagcatt 960tgtgtttcat gacactgtgt tcttcattga tgctgtactc
ctgaaaattt ttcccacaag 1020gttggggaaa tgaatgggaa atgtcgctgg tctgtgtggt
attcaaagca gtagtatcat 1080gatgagcgta acgacccttc tgacctggtc tcacgatctg
aaataataaa aggctgtgtc 1140atgcaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa
1174265927DNAHomo sapiens 26tcgtcggagc agacgggagt
ttctcctcgg ggtcggagca ggaggcacgc ggagtgtgag 60gccacgcatg agcggacgct
aaccccctcc ccagccacaa agagtctaca tgtctagggt 120ctagacatgt tcagctttgt
ggacctccgg ctcctgctcc tcttagcggc caccgccctc 180ctgacgcacg gccaagagga
aggccaagtc gagggccaag acgaagacat cccaccaatc 240acctgcgtac agaacggcct
caggtaccat gaccgagacg tgtggaaacc cgagccctgc 300cggatctgcg tctgcgacaa
cggcaaggtg ttgtgcgatg acgtgatctg tgacgagacc 360aagaactgcc ccggcgccga
agtccccgag ggcgagtgct gtcccgtctg ccccgacggc 420tcagagtcac ccaccgacca
agaaaccacc ggcgtcgagg gacccaaggg agacactggc 480ccccgaggcc caaggggacc
cgcaggcccc cctggccgag atggcatccc tggacagcct 540ggacttcccg gaccccccgg
accccccgga cctcccggac cccctggcct cggaggaaac 600tttgctcccc agctgtctta
tggctatgat gagaaatcaa ccggaggaat ttccgtgcct 660ggccccatgg gtccctctgg
tcctcgtggt ctccctggcc cccctggtgc acctggtccc 720caaggcttcc aaggtccccc
tggtgagcct ggcgagcctg gagcttcagg tcccatgggt 780ccccgaggtc ccccaggtcc
ccctggaaag aatggagatg atggggaagc tggaaaacct 840ggtcgtcctg gtgagcgtgg
gcctcctggg cctcagggtg ctcgaggatt gcccggaaca 900gctggcctcc ctggaatgaa
gggacacaga ggtttcagtg gtttggatgg tgccaaggga 960gatgctggtc ctgctggtcc
taagggtgag cctggcagcc ctggtgaaaa tggagctcct 1020ggtcagatgg gcccccgtgg
cctgcctggt gagagaggtc gccctggagc ccctggccct 1080gctggtgctc gtggaaatga
tggtgctact ggtgctgccg ggccccctgg tcccaccggc 1140cccgctggtc ctcctggctt
ccctggtgct gttggtgcta agggtgaagc tggtccccaa 1200gggccccgag gctctgaagg
tccccagggt gtgcgtggtg agcctggccc ccctggccct 1260gctggtgctg ctggccctgc
tggaaaccct ggtgctgatg gacagcctgg tgctaaaggt 1320gccaatggtg ctcctggtat
tgctggtgct cctggcttcc ctggtgcccg aggcccctct 1380ggaccccagg gccccggcgg
ccctcctggt cccaagggta acagcggtga acctggtgct 1440cctggcagca aaggagacac
tggtgctaag ggagagcctg gccctgttgg tgttcaagga 1500ccccctggcc ctgctggaga
ggaaggaaag cgaggagctc gaggtgaacc cggacccact 1560ggcctgcccg gaccccctgg
cgagcgtggt ggacctggta gccgtggttt ccctggcgca 1620gatggtgttg ctggtcccaa
gggtcccgct ggtgaacgtg gttctcctgg ccctgctggc 1680cccaaaggat ctcctggtga
agctggtcgt cccggtgaag ctggtctgcc tggtgccaag 1740ggtctgactg gaagccctgg
cagccctggt cctgatggca aaactggccc ccctggtccc 1800gccggtcaag atggtcgccc
cggaccccca ggcccacctg gtgcccgtgg tcaggctggt 1860gtgatgggat tccctggacc
taaaggtgct gctggagagc ccggcaaggc tggagagcga 1920ggtgttcccg gaccccctgg
cgctgtcggt cctgctggca aagatggaga ggctggagct 1980cagggacccc ctggccctgc
tggtcccgct ggcgagagag gtgaacaagg ccctgctggc 2040tcccccggat tccagggtct
ccctggtcct gctggtcctc caggtgaagc aggcaaacct 2100ggtgaacagg gtgttcctgg
agaccttggc gcccctggcc cctctggagc aagaggcgag 2160agaggtttcc ctggcgagcg
tggtgtgcaa ggtccccctg gtcctgctgg tccccgaggg 2220gccaacggtg ctcccggcaa
cgatggtgct aagggtgatg ctggtgcccc tggagctccc 2280ggtagccagg gcgcccctgg
ccttcaggga atgcctggtg aacgtggtgc agctggtctt 2340ccagggccta agggtgacag
aggtgatgct ggtcccaaag gtgctgatgg ctctcctggc 2400aaagatggcg tccgtggtct
gactggcccc attggtcctc ctggccctgc tggtgcccct 2460ggtgacaagg gtgaaagtgg
tcccagcggc cctgctggtc ccactggagc tcgtggtgcc 2520cccggagacc gtggtgagcc
tggtcccccc ggccctgctg gctttgctgg cccccctggt 2580gctgacggcc aacctggtgc
taaaggcgaa cctggtgatg ctggtgctaa aggcgatgct 2640ggtccccctg gccctgccgg
acccgctgga ccccctggcc ccattggtaa tgttggtgct 2700cctggagcca aaggtgctcg
cggcagcgct ggtccccctg gtgctactgg tttccctggt 2760gctgctggcc gagtcggtcc
tcctggcccc tctggaaatg ctggaccccc tggccctcct 2820ggtcctgctg gcaaagaagg
cggcaaaggt ccccgtggtg agactggccc tgctggacgt 2880cctggtgaag ttggtccccc
tggtccccct ggccctgctg gcgagaaagg atcccctggt 2940gctgatggtc ctgctggtgc
tcctggtact cccgggcctc aaggtattgc tggacagcgt 3000ggtgtggtcg gcctgcctgg
tcagagagga gagagaggct tccctggtct tcctggcccc 3060tctggtgaac ctggcaaaca
aggtccctct ggagcaagtg gtgaacgtgg tccccctggt 3120cccatgggcc cccctggatt
ggctggaccc cctggtgaat ctggacgtga gggggctcct 3180ggtgccgaag gttcccctgg
acgagacggt tctcctggcg ccaagggtga ccgtggtgag 3240accggccccg ctggaccccc
tggtgctcct ggtgctcctg gtgcccctgg ccccgttggc 3300cctgctggca agagtggtga
tcgtggtgag actggtcctg ctggtcccgc cggtcctgtc 3360ggccctgttg gcgcccgtgg
ccccgccgga ccccaaggcc cccgtggtga caagggtgag 3420acaggcgaac agggcgacag
aggcataaag ggtcaccgtg gcttctctgg cctccagggt 3480ccccctggcc ctcctggctc
tcctggtgaa caaggtccct ctggagcctc tggtcctgct 3540ggtccccgag gtccccctgg
ctctgctggt gctcctggca aagatggact caacggtctc 3600cctggcccca ttgggccccc
tggtcctcgc ggtcgcactg gtgatgctgg tcctgttggt 3660ccccccggcc ctcctggacc
tcctggtccc cctggtcctc ccagcgctgg tttcgacttc 3720agcttcctgc cccagccacc
tcaagagaag gctcacgatg gtggccgcta ctaccgggct 3780gatgatgcca atgtggttcg
tgaccgtgac ctcgaggtgg acaccaccct caagagcctg 3840agccagcaga tcgagaacat
ccggagccca gagggcagcc gcaagaaccc cgcccgcacc 3900tgccgtgacc tcaagatgtg
ccactctgac tggaagagtg gagagtactg gattgacccc 3960aaccaaggct gcaacctgga
tgccatcaaa gtcttctgca acatggagac tggtgagacc 4020tgcgtgtacc ccactcagcc
cagtgtggcc cagaagaact ggtacatcag caagaacccc 4080aaggacaaga ggcatgtctg
gttcggcgag agcatgaccg atggattcca gttcgagtat 4140ggcggccagg gctccgaccc
tgccgatgtg gccatccagc tgaccttcct gcgcctgatg 4200tccaccgagg cctcccagaa
catcacctac cactgcaaga acagcgtggc ctacatggac 4260cagcagactg gcaacctcaa
gaaggccctg ctcctccagg gctccaacga gatcgagatc 4320cgcgccgagg gcaacagccg
cttcacctac agcgtcactg tcgatggctg cacgagtcac 4380accggagcct ggggcaagac
agtgattgaa tacaaaacca ccaagacctc ccgcctgccc 4440atcatcgatg tggccccctt
ggacgttggt gccccagacc aggaattcgg cttcgacgtt 4500ggccctgtct gcttcctgta
aactccctcc atcccaacct ggctccctcc cacccaacca 4560actttccccc caacccggaa
acagacaagc aacccaaact gaaccccctc aaaagccaaa 4620aaatgggaga caatttcaca
tggactttgg aaaatatttt tttcctttgc attcatctct 4680caaacttagt ttttatcttt
gaccaaccga acatgaccaa aaaccaaaag tgcattcaac 4740cttaccaaaa aaaaaaaaaa
aaaaagaata aataaataac tttttaaaaa aggaagcttg 4800gtccacttgc ttgaagaccc
atgcgggggt aagtcccttt ctgcccgttg ggcttatgaa 4860accccaatgc tgccctttct
gctcctttct ccacaccccc cttggggcct cccctccact 4920ccttcccaaa tctgtctccc
cagaagacac aggaaacaat gtattgtctg cccagcaatc 4980aaaggcaatg ctcaaacacc
caagtggccc ccaccctcag cccgctcctg cccgcccagc 5040acccccaggc cctgggggac
ctggggttct cagactgcca aagaagcctt gccatctggc 5100gctcccatgg ctcttgcaac
atctcccctt cgtttttgag ggggtcatgc cgggggagcc 5160accagcccct cactgggttc
ggaggagagt caggaagggc cacgacaaag cagaaacatc 5220ggatttgggg aacgcgtgtc
aatcccttgt gccgcagggc tgggcgggag agactgttct 5280gttccttgtg taactgtgtt
gctgaaagac tacctcgttc ttgtcttgat gtgtcaccgg 5340ggcaactgcc tgggggcggg
gatgggggca gggtggaagc ggctccccat tttataccaa 5400aggtgctaca tctatgtgat
gggtggggtg gggagggaat cactggtgct atagaaattg 5460agatgccccc ccaggccagc
aaatgttcct ttttgttcaa agtctatttt tattccttga 5520tatttttctt tttttttttt
tttttttgtg gatggggact tgtgaatttt tctaaaggtg 5580ctatttaaca tgggaggaga
gcgtgtgcgg ctccagccca gcccgctgct cactttccac 5640cctctctcca cctgcctctg
gcttctcagg cctctgctct ccgacctctc tcctctgaaa 5700ccctcctcca cagctgcagc
ccatcctccc ggctccctcc tagtctgtcc tgcgtcctct 5760gtccccgggt ttcagagaca
acttcccaaa gcacaaagca gtttttcccc ctaggggtgg 5820gaggaagcaa aagactctgt
acctattttg tatgtgtata ataatttgag atgtttttaa 5880ttattttgat tgctggaata
aagcatgtgg aaatgaccca aacataa 5927275411DNAHomo sapiens
27gtgtcccata gtgtttccaa acttggaaag ggcgggggag ggcgggagga tgcggagggc
60ggaggtatgc agacaacgag tcagagtttc cccttgaaag cctcaaaagt gtccacgtcc
120tcaaaaagaa tggaaccaat ttaagaagcc agccccgtgg ccacgtccct tcccccattc
180gctccctcct ctgcgccccc gcaggctcct cccagctgtg gctgcccggg cccccagccc
240cagccctccc attggtggag gcccttttgg aggcacccta gggccaggga aacttttgcc
300gtataaatag ggcagatccg ggctttatta ttttagcacc acggcagcag gaggtttcgg
360ctaagttgga ggtactggcc acgactgcat gcccgcgccc gccaggtgat acctccgccg
420gtgacccagg ggctctgcga cacaaggagt ctgcatgtct aagtgctaga catgctcagc
480tttgtggata cgcggacttt gttgctgctt gcagtaacct tatgcctagc aacatgccaa
540tctttacaag aggaaactgt aagaaagggc ccagccggag atagaggacc acgtggagaa
600aggggtccac caggcccccc aggcagagat ggtgaagatg gtcccacagg ccctcctggt
660ccacctggtc ctcctggccc ccctggtctc ggtgggaact ttgctgctca gtatgatgga
720aaaggagttg gacttggccc tggaccaatg ggcttaatgg gacctagagg cccacctggt
780gcagctggag ccccaggccc tcaaggtttc caaggacctg ctggtgagcc tggtgaacct
840ggtcaaactg gtcctgcagg tgctcgtggt ccagctggcc ctcctggcaa ggctggtgaa
900gatggtcacc ctggaaaacc cggacgacct ggtgagagag gagttgttgg accacagggt
960gctcgtggtt tccctggaac tcctggactt cctggcttca aaggcattag gggacacaat
1020ggtctggatg gattgaaggg acagcccggt gctcctggtg tgaagggtga acctggtgcc
1080cctggtgaaa atggaactcc aggtcaaaca ggagcccgtg ggcttcctgg tgagagagga
1140cgtgttggtg cccctggccc agctggtgcc cgtggcagtg atggaagtgt gggtcccgtg
1200ggtcctgctg gtcccattgg gtctgctggc cctccaggct tcccaggtgc ccctggcccc
1260aagggtgaaa ttggagctgt tggtaacgct ggtcctgctg gtcccgccgg tccccgtggt
1320gaagtgggtc ttccaggcct ctccggcccc gttggacctc ctggtaatcc tggagcaaac
1380ggccttactg gtgccaaggg tgctgctggc cttcccggcg ttgctggggc tcccggcctc
1440cctggacccc gcggtattcc tggccctgtt ggtgctgccg gtgctactgg tgccagagga
1500cttgttggtg agcctggtcc agctggctcc aaaggagaga gcggtaacaa gggtgagccc
1560ggctctgctg ggccccaagg tcctcctggt cccagtggtg aagaaggaaa gagaggccct
1620aatggggaag ctggatctgc cggccctcca ggacctcctg ggctgagagg tagtcctggt
1680tctcgtggtc ttcctggagc tgatggcaga gctggcgtca tgggccctcc tggtagtcgt
1740ggtgcaagtg gccctgctgg agtccgagga cctaatggag atgctggtcg ccctggggag
1800cctggtctca tgggacccag aggtcttcct ggttcccctg gaaatatcgg ccccgctgga
1860aaagaaggtc ctgtcggcct ccctggcatc gacggcaggc ctggcccaat tggcccagct
1920ggagcaagag gagagcctgg caacattgga ttccctggac ccaaaggccc cactggtgat
1980cctggcaaaa acggtgataa aggtcatgct ggtcttgctg gtgctcgggg tgctccaggt
2040cctgatggaa acaatggtgc tcagggacct cctggaccac agggtgttca aggtggaaaa
2100ggtgaacagg gtccccctgg tcctccaggc ttccagggtc tgcctggccc ctcaggtccc
2160gctggtgaag ttggcaaacc aggagaaagg ggtctccatg gtgagtttgg tctccctggt
2220cctgctggtc caagagggga acgcggtccc ccaggtgaga gtggtgctgc cggtcctact
2280ggtcctattg gaagccgagg tccttctgga cccccagggc ctgatggaaa caagggtgaa
2340cctggtgtgg ttggtgctgt gggcactgct ggtccatctg gtcctagtgg actcccagga
2400gagaggggtg ctgctggcat acctggaggc aagggagaaa agggtgaacc tggtctcaga
2460ggtgaaattg gtaaccctgg cagagatggt gctcgtggtg ctcctggtgc tgtaggtgcc
2520cctggtcctg ctggagccac aggtgaccgg ggcgaagctg gggctgctgg tcctgctggt
2580cctgctggtc ctcggggaag ccctggtgaa cgtggtgagg tcggtcctgc tggccccaat
2640ggatttgctg gtcctgctgg tgctgctggt caacctggtg ctaaaggaga aagaggagcc
2700aaagggccta agggtgaaaa cggtgttgtt ggtcccacag gccccgttgg agctgctggc
2760ccagctggtc caaatggtcc ccccggtcct gctggaagtc gtggtgatgg aggcccccct
2820ggtatgactg gtttccctgg tgctgctgga cggactggtc ccccaggacc ctctggtatt
2880tctggccctc ctggtccccc tggtcctgct gggaaagaag ggcttcgtgg tcctcgtggt
2940gaccaaggtc cagttggccg aactggagaa gtaggtgcag ttggtccccc tggcttcgct
3000ggtgagaagg gtccctctgg agaggctggt actgctggac ctcctggcac tccaggtcct
3060cagggtcttc ttggtgctcc tggtattctg ggtctccctg gctcgagagg tgaacgtggt
3120ctaccaggtg ttgctggtgc tgtgggtgaa cctggtcctc ttggcattgc cggccctcct
3180ggggcccgtg gtcctcctgg tgctgtgggt agtcctggag tcaacggtgc tcctggtgaa
3240gctggtcgtg atggcaaccc tgggaacgat ggtcccccag gtcgcgatgg tcaacccgga
3300cacaagggag agcgcggtta ccctggcaat attggtcccg ttggtgctgc aggtgcacct
3360ggtcctcatg gccccgtggg tcctgctggc aaacatggaa accgtggtga aactggtcct
3420tctggtcctg ttggtcctgc tggtgctgtt ggcccaagag gtcctagtgg cccacaaggc
3480attcgtggcg ataagggaga gcccggtgaa aaggggccca gaggtcttcc tggcttaaag
3540ggacacaatg gattgcaagg tctgcctggt atcgctggtc accatggtga tcaaggtgct
3600cctggctccg tgggtcctgc tggtcctagg ggccctgctg gtccttctgg ccctgctgga
3660aaagatggtc gcactggaca tcctggtaca gttggacctg ctggcattcg aggccctcag
3720ggtcaccaag gccctgctgg cccccctggt ccccctggcc ctcctggacc tccaggtgta
3780agcggtggtg gttatgactt tggttacgat ggagacttct acagggctga ccagcctcgc
3840tcagcacctt ctctcagacc caaggactat gaagttgatg ctactctgaa gtctctcaac
3900aaccagattg agacccttct tactcctgaa ggctctagaa agaacccagc tcgcacatgc
3960cgtgacttga gactcagcca cccagagtgg agcagtggtt actactggat tgaccctaac
4020caaggatgca ctatggatgc tatcaaagta tactgtgatt tctctactgg cgaaacctgt
4080atccgggccc aacctgaaaa catcccagcc aagaactggt ataggagctc caaggacaag
4140aaacacgtct ggctaggaga aactatcaat gctggcagcc agtttgaata taatgtagaa
4200ggagtgactt ccaaggaaat ggctacccaa cttgccttca tgcgcctgct ggccaactat
4260gcctctcaga acatcaccta ccactgcaag aacagcattg catacatgga tgaggagact
4320ggcaacctga aaaaggctgt cattctacag ggctctaatg atgttgaact tgttgctgag
4380ggcaacagca ggttcactta cactgttctt gtagatggct gctctaaaaa gacaaatgaa
4440tggggaaaga caatcattga atacaaaaca aataagccat cacgcctgcc cttccttgat
4500attgcacctt tggacatcgg tggtgctgac caggaattct ttgtggacat tggcccagtc
4560tgtttcaaat aaatgaactc aatctaaatt aaaaaagaaa gaaatttgaa aaaactttct
4620ctttgccatt tcttcttctt cttttttaac tgaaagctga atccttccat ttcttctgca
4680catctacttg cttaaattgt gggcaaaaga gaaaaagaag gattgatcag agcattgtgc
4740aatacagttt cattaactcc ttcccccgct cccccaaaaa tttgaatttt tttttcaaca
4800ctcttacacc tgttatggaa aatgtcaacc tttgtaagaa aaccaaaata aaaattgaaa
4860aataaaaacc ataaacattt gcaccacttg tggcttttga atatcttcca cagagggaag
4920tttaaaaccc aaacttccaa aggtttaaac tacctcaaaa cactttccca tgagtgtgat
4980ccacattgtt aggtgctgac ctagacagag atgaactgag gtccttgttt tgttttgttc
5040ataatacaaa ggtgctaatt aatagtattt cagatacttg aagaatgttg atggtgctag
5100aagaatttga gaagaaatac tcctgtattg agttgtatcg tgtggtgtat tttttaaaaa
5160atttgattta gcattcatat tttccatctt attcccaatt aaaagtatgc agattatttg
5220cccaaatctt cttcagattc agcatttgtt ctttgccagt ctcattttca tcttcttcca
5280tggttccaca gaagctttgt ttcttgggca agcagaaaaa ttaaattgta cctattttgt
5340atatgtgaga tgtttaaata aattgtgaaa aaaatgaaat aaagcatgtt tggttttcca
5400aaagaacata t
5411285490DNAHomo sapiens 28ggctgagttt tatgacgggc ccggtgctga agggcaggga
acaacttgat ggtgctactt 60tgaactgctt ttcttttctc ctttttgcac aaagagtctc
atgtctgata tttagacatg 120atgagctttg tgcaaaaggg gagctggcta cttctcgctc
tgcttcatcc cactattatt 180ttggcacaac aggaagctgt tgaaggagga tgttcccatc
ttggtcagtc ctatgcggat 240agagatgtct ggaagccaga accatgccaa atatgtgtct
gtgactcagg atccgttctc 300tgcgatgaca taatatgtga cgatcaagaa ttagactgcc
ccaacccaga aattccattt 360ggagaatgtt gtgcagtttg cccacagcct ccaactgctc
ctactcgccc tcctaatggt 420caaggacctc aaggccccaa gggagatcca ggccctcctg
gtattcctgg gagaaatggt 480gaccctggta ttccaggaca accagggtcc cctggttctc
ctggcccccc tggaatctgt 540gaatcatgcc ctactggtcc tcagaactat tctccccagt
atgattcata tgatgtcaag 600tctggagtag cagtaggagg actcgcaggc tatcctggac
cagctggccc cccaggccct 660cccggtcccc ctggtacatc tggtcatcct ggttcccctg
gatctccagg ataccaagga 720ccccctggtg aacctgggca agctggtcct tcaggccctc
caggacctcc tggtgctata 780ggtccatctg gtcctgctgg aaaagatgga gaatcaggta
gacccggacg acctggagag 840cgaggattgc ctggacctcc aggtatcaaa ggtccagctg
ggatacctgg attccctggt 900atgaaaggac acagaggctt cgatggacga aatggagaaa
agggtgaaac aggtgctcct 960ggattaaagg gtgaaaatgg tcttccaggc gaaaatggag
ctcctggacc catgggtcca 1020agaggggctc ctggtgagcg aggacggcca ggacttcctg
gggctgcagg tgctcggggt 1080aatgacggtg ctcgaggcag tgatggtcaa ccaggccctc
ctggtcctcc tggaactgcc 1140ggattccctg gatcccctgg tgctaagggt gaagttggac
ctgcagggtc tcctggttca 1200aatggtgccc ctggacaaag aggagaacct ggacctcagg
gacacgctgg tgctcaaggt 1260cctcctggcc ctcctgggat taatggtagt cctggtggta
aaggcgaaat gggtcccgct 1320ggcattcctg gagctcctgg actgatggga gcccggggtc
ctccaggacc agccggtgct 1380aatggtgctc ctggactgcg aggtggtgca ggtgagcctg
gtaagaatgg tgccaaagga 1440gagcccggac cacgtggtga acgcggtgag gctggtattc
caggtgttcc aggagctaaa 1500ggcgaagatg gcaaggatgg atcacctgga gaacctggtg
caaatgggct tccaggagct 1560gcaggagaaa ggggtgcccc tgggttccga ggacctgctg
gaccaaatgg catcccagga 1620gaaaagggtc ctgctggaga gcgtggtgct ccaggccctg
cagggcccag aggagctgct 1680ggagaacctg gcagagatgg cgtccctgga ggtccaggaa
tgaggggcat gcccggaagt 1740ccaggaggac caggaagtga tgggaaacca gggcctcccg
gaagtcaagg agaaagtggt 1800cgaccaggtc ctcctgggcc atctggtccc cgaggtcagc
ctggtgtcat gggcttcccc 1860ggtcctaaag gaaatgatgg tgctcctggt aagaatggag
aacgaggtgg ccctggagga 1920cctggccctc agggtcctcc tggaaagaat ggtgaaactg
gacctcaggg acccccaggg 1980cctactgggc ctggtggtga caaaggagac acaggacccc
ctggtccaca aggattacaa 2040ggcttgcctg gtacaggtgg tcctccagga gaaaatggaa
aacctgggga accaggtcca 2100aagggtgatg ccggtgcacc tggagctcca ggaggcaagg
gtgatgctgg tgcccctggt 2160gaacgtggac ctcctggatt ggcaggggcc ccaggactta
gaggtggagc tggtccccct 2220ggtcccgaag gaggaaaggg tgctgctggt cctcctgggc
cacctggtgc tgctggtact 2280cctggtctgc aaggaatgcc tggagaaaga ggaggtcttg
gaagtcctgg tccaaagggt 2340gacaagggtg aaccaggcgg tccaggtgct gatggtgtcc
cagggaaaga tggcccaagg 2400ggtcctactg gtcctattgg tcctcctggc ccagctggcc
agcctggaga taagggtgaa 2460ggtggtgccc ccggacttcc aggtatagct ggacctcgtg
gtagccctgg tgagagaggt 2520gaaactggcc ctccaggacc tgctggtttc cctggtgctc
ctggacagaa tggtgaacct 2580ggtggtaaag gagaaagagg ggctccgggt gagaaaggtg
aaggaggccc tcctggagtt 2640gcaggacccc ctggaggttc tggacctgct ggtcctcctg
gtccccaagg tgtcaaaggt 2700gaacgtggca gtcctggtgg acctggtgct gctggcttcc
ctggtgctcg tggtcttcct 2760ggtcctcctg gtagtaatgg taacccagga cccccaggtc
ccagcggttc tccaggcaag 2820gatgggcccc caggtcctgc gggtaacact ggtgctcctg
gcagccctgg agtgtctgga 2880ccaaaaggtg atgctggcca accaggagag aagggatcgc
ctggtgccca gggcccacca 2940ggagctccag gcccacttgg gattgctggg atcactggag
cacggggtct tgcaggacca 3000ccaggcatgc caggtcctag gggaagccct ggccctcagg
gtgtcaaggg tgaaagtggg 3060aaaccaggag ctaacggtct cagtggagaa cgtggtcccc
ctggacccca gggtcttcct 3120ggtctggctg gtacagctgg tgaacctgga agagatggaa
accctggatc agatggtctt 3180ccaggccgag atggatctcc tggtggcaag ggtgatcgtg
gtgaaaatgg ctctcctggt 3240gcccctggcg ctcctggtca tccaggccca cctggtcctg
tcggtccagc tggaaagagt 3300ggtgacagag gagaaagtgg ccctgctggc cctgctggtg
ctcccggtcc tgctggttcc 3360cgaggtgctc ctggtcctca aggcccacgt ggtgacaaag
gtgaaacagg tgaacgtgga 3420gctgctggca tcaaaggaca tcgaggattc cctggtaatc
caggtgcccc aggttctcca 3480ggccctgctg gtcagcaggg tgcaatcggc agtccaggac
ctgcaggccc cagaggacct 3540gttggaccca gtggacctcc tggcaaagat ggaaccagtg
gacatccagg tcccattgga 3600ccaccagggc ctcgaggtaa cagaggtgaa agaggatctg
agggctcccc aggccaccca 3660gggcaaccag gccctcctgg acctcctggt gcccctggtc
cttgctgtgg tggtgttgga 3720gccgctgcca ttgctgggat tggaggtgaa aaagctggcg
gttttgcccc gtattatgga 3780gatgaaccaa tggatttcaa aatcaacacc gatgagatta
tgacttcact caagtctgtt 3840aatggacaaa tagaaagcct cattagtcct gatggttctc
gtaaaaaccc cgctagaaac 3900tgcagagacc tgaaattctg ccatcctgaa ctcaagagtg
gagaatactg ggttgaccct 3960aaccaaggat gcaaattgga tgctatcaag gtattctgta
atatggaaac tggggaaaca 4020tgcataagtg ccaatccttt gaatgttcca cggaaacact
ggtggacaga ttctagtgct 4080gagaagaaac acgtttggtt tggagagtcc atggatggtg
gttttcagtt tagctacggc 4140aatcctgaac ttcctgaaga tgtccttgat gtgcagctgg
cattccttcg acttctctcc 4200agccgagctt cccagaacat cacatatcac tgcaaaaata
gcattgcata catggatcag 4260gccagtggaa atgtaaagaa ggccctgaag ctgatggggt
caaatgaagg tgaattcaag 4320gctgaaggaa atagcaaatt cacctacaca gttctggagg
atggttgcac gaaacacact 4380ggggaatgga gcaaaacagt ctttgaatat cgaacacgca
aggctgtgag actacctatt 4440gtagatattg caccctatga cattggtggt cctgatcaag
aatttggtgt ggacgttggc 4500cctgtttgct ttttataaac caaactctat ctgaaatccc
aacaaaaaaa atttaactcc 4560atatgtgttc ctcttgttct aatcttgtca accagtgcaa
gtgaccgaca aaattccagt 4620tatttatttc caaaatgttt ggaaacagta taatttgaca
aagaaaaatg atacttctct 4680ttttttgctg ttccaccaaa tacaattcaa atgctttttg
ttttattttt ttaccaattc 4740caatttcaaa atgtctcaat ggtgctataa taaataaact
tcaacactct ttatgataac 4800aacactgtgt tatattcttt gaatcctagc ccatctgcag
agcaatgact gtgctcacca 4860gtaaaagata acctttcttt ctgaaatagt caaatacgaa
attagaaaag ccctccctat 4920tttaactacc tcaactggtc agaaacacag attgtattct
atgagtccca gaagatgaaa 4980aaaattttat acgttgataa aacttataaa tttcattgat
taatctcctg gaagattggt 5040ttaaaaagaa aagtgtaatg caagaattta aagaaatatt
tttaaagcca caattatttt 5100aatattggat atcaactgct tgtaaaggtg ctcctctttt
ttcttgtcat tgctggtcaa 5160gattactaat atttgggaag gctttaaaga cgcatgttat
ggtgctaatg tactttcact 5220tttaaactct agatcagaat tgttgacttg cattcagaac
ataaatgcac aaaatctgta 5280catgtctccc atcagaaaga ttcattggca tgccacaggg
gattctcctc cttcatcctg 5340taaaggtcaa caataaaaac caaattatgg ggctgctttt
gtcacactag catagagaat 5400gtgttgaaat ttaactttgt aagcttgtat gtggttgttg
atcttttttt tccttacaga 5460cacccataat aaaatatcat attaaaattc
5490298455DNAHomo sapiens 29ccgcactctc cgtccccgcg
gctggcgcag gacctcactc gagcggagcg cccacgggga 60gcgggtcgcg gggcggcggc
ggcgaggagg aggcgagaag gagttggagg aggaggagga 120ggaggcgagg gcgagctagc
ccagcggggt cccggccgcc ccgcgggcca aagtcgagcc 180ctcccgcccg tgggcgagcg
cgccagccgc cccttccaga acagccgccg ccacaaagaa 240gaacgggggg tgccgaggtc
cccatgacct cctaaagtgg tgcggtccct gctgagtgcg 300ctgcccgggc cgtgacccgc
gcccctgtgc gtccccgcgc gcctccgagc gcccctgtgc 360gccccggccc gcgccccgcc
ggcatggacg tccatacccg ctggaaagcg cgcagcgcgc 420tccgcccggg cgccccgctg
ctgcccccgc tgctgctgct gctgctgtgg gcgccgcctc 480cgagccgcgc agctcagcca
gcagatctcc tgaaggttct agattttcac aacttgcctg 540atggaataac aaagacaaca
ggcttttgcg ccacgcggcg atcttccaaa ggcccggatg 600tcgcttacag agtcaccaaa
gacgcgcagc tcagcgcacc caccaagcag ctgtaccctg 660cgtctgcatt tcccgaggac
ttctccatcc taacaactgt gaaagccaag aaaggcagcc 720aggccttcct ggtctccatc
tacaacgagc agggtatcca gcagattggg ctggagctgg 780gccgctctcc cgtcttcctc
tacgaggacc acacggggaa gcctggcccg gaagactacc 840ccctcttccg gggcatcaac
ctgtcagatg gcaagtggca cagaattgct ctcagcgtcc 900acaagaaaaa tgtcaccttg
atcctcgact gtaaaaagaa gaccaccaaa ttcctcgacc 960gcagcgacca ccccatgatc
gacatcaatg gcatcatcgt gtttggcacc cggatcctgg 1020atgaggaggt gtttgagggt
gacatccagc agctgctctt tgtctcggac caccgggcag 1080cttatgatta ctgtgagcac
tacagccctg actgtgacac cgcagtacct gacaccccac 1140agtcgcagga ccccaatcca
gatgaatatt acacggaagg agacggcgag ggtgagacct 1200attactacga atacccctac
tacgaagacc ccgaagacct agggaaggag cccaccccca 1260gcaagaagcc cgtggaagct
gccaaagaaa ccacagaggt ccccgaggag ctgaccccga 1320cccccacgga agctgctccc
atgcctgaaa ccagtgaagg ggctgggaag gaagaggacg 1380tcggcatcgg ggactatgac
tacgtgccca gtgaggacta ctacacgccc tcaccgtatg 1440atgacctcac ctatggcgag
ggggaggaga accccgacca gcccacagac ccaggcgctg 1500gggccgaaat tcccaccagc
accgccgaca cctccaactc ctccaatcca gctccgcctc 1560caggggaagg tgcggatgac
ttggaggggg agttcactga ggaaacgatc cggaaccttg 1620acgagaacta ctacgacccc
tactacgacc ccaccagctc cccgtcggag atcgggccgg 1680gaatgccggc gaaccaggat
accatctatg aagggattgg aggacctcgg ggcgagaaag 1740gccaaaaggg agaaccagcg
attatcgagc cgggcatgct catcgagggc ccgcctggcc 1800cagaaggccc cgcgggtctt
cccggacctc caggaaccat gggtcccact ggccaagtcg 1860gggaccctgg agaaaggggc
ccccctggac gcccaggcct tcctggggcc gatggcctgc 1920ccggtcctcc aggaaccatg
ctcatgctgc ccttccggtt tggaggtggc ggcgatgcgg 1980gctccaaagg ccccatggtc
tcagcccagg agtcccaggc gcaagccatt ctccagcagg 2040ccaggttggc actgagggga
ccagctggcc cgatgggtct cacagggaga cctggccctg 2100tgggtccccc tgggagcgga
ggtttgaagg gcgagccggg agacgtgggg cctcagggtc 2160ctcgaggtgt gcaaggcccg
cctggtccgg ccgggaagcc cggaagacgg ggtcgggctg 2220ggagtgatgg agccagagga
atgcctggac aaactggccc caagggtgac cggggtttcg 2280acggcctggc tgggttgcca
ggcgagaagg gccacagggg tgaccctggt ccttccggcc 2340caccaggacc tccgggagac
gatggagaaa ggggtgacga cggagaagtt gggcccaggg 2400ggctgcctgg ggagcccggg
ccacgtggtc tgcttgggcc gaaggggccc ccaggtcctc 2460ccggacctcc cggtgtcacg
ggtatggacg gccagccggg gccaaaagga aatgtgggtc 2520cccagggaga gcctggcccc
ccaggacagc agggtaatcc aggcgcccag ggtcttccag 2580gcccccaggg tgcaattggt
cctccaggag aaaagggtcc cttggggaaa ccaggccttc 2640caggaatgcc cggtgctgac
ggacccccgg gacaccctgg caaagaaggc cctccaggag 2700agaaaggagg tcagggtcca
cctggccccc agggtccgat tggctaccca ggtcctcgag 2760gagtcaaggg ggccgatggc
atccgtggtc tgaagggcac aaagggcgag aagggtgaag 2820acggctttcc tgggtttaaa
ggagacatgg gcatcaaggg tgatcggggg gagatcggcc 2880cacccggtcc caggggagaa
gatggccctg aaggcccaaa gggtcgcgga ggtcccaatg 2940gtgaccccgg tcctctggga
ccccctgggg agaagggaaa actcggagtc ccagggttac 3000cagggtatcc aggaagacaa
ggaccaaagg gctctattgg attccctgga tttcctggcg 3060ccaatggaga gaagggcggc
agggggaccc ctggaaagcc aggaccgcgg gggcagcgag 3120gcccaacggg tccgaggggt
gaaagaggcc cccggggcat cactgggaag cctggcccca 3180agggcaactc cggaggtgac
ggcccagctg gccctcctgg tgaacgggga cccaatggac 3240cccaaggacc cacaggattt
cctggaccaa agggcccccc tggccctcca ggcaaggatg 3300gactcccagg acaccctgga
cagagaggcg agactggttt ccaaggcaag accggccctc 3360caggcccccc cggcgtggtc
ggccctcagg gtcccacggg agaaacgggc ccaatgggtg 3420agcgtggcca ccctgggccc
cctggacccc ccggtgaaca ggggcttccg ggccttgctg 3480gaaaagaagg gacgaagggt
gacccaggcc ctgcaggcct ccctgggaaa gatggccctc 3540caggattacg tggtttccct
ggggaccgag ggcttcctgg tccagtggga gctcttggac 3600tgaaaggcaa tgaagggccc
cctggcccac caggccctgc gggatctcca ggggagagag 3660gtccagctgg agccgctggg
cccatcggaa ttccagggag acctgggccc cagggacccc 3720cagggccggc aggagagaaa
ggggctcctg gcgagaaagg cccacaaggc ccagctggcc 3780gagacggtct ccaggggcct
gtggggctcc cgggtccagc tggccctgtg ggtccccctg 3840gagaagacgg agataaggga
gagatcgggg agccggggca gaaaggaagc aagggggaca 3900aaggagaaca gggtcctcct
gggcctacag gtcctcaagg ccccatcgga cagccaggcc 3960cctctggagc tgacggcgag
ccggggcctc ggggccagca gggccttttc gggcagaaag 4020gtgatgaagg tcccagaggc
tttcctggac cccctgggcc agtggggctg cagggtttgc 4080caggacctcc aggcgagaag
ggtgagacag gagacgtggg ccagatgggc cccccgggtc 4140cccctggccc ccgaggaccc
tccggagctc caggtgctga tggcccacaa ggtcccccag 4200gtggaatagg aaaccctggt
gcagtgggag agaagggcga gcctggcgaa gcaggtgagc 4260ctggccttcc gggagaaggc
ggccccccgg gacccaaagg agaaagggga gagaagggcg 4320agtcaggccc ttcaggtgct
gccggacccc ctggacccaa aggccctccc ggagatgatg 4380gtcccaaagg cagccctggc
ccagtgggtt ttcctggaga tcctggcccc cccggagagc 4440ctggccccgc gggtcaagat
ggtccccctg gtgacaaagg agatgatggt gaacccgggc 4500agacgggatc ccccggccct
actggtgaac caggtccatc ggggcctcca ggaaaaaggg 4560gtcccccagg ccccgcaggc
cccgaaggca gacagggaga gaaaggggcc aagggagaag 4620ccggcttgga aggccctcct
gggaagactg gccccatcgg cccccagggg gcccctggga 4680agcccggacc ggatggcctt
cgagggatcc ctggccctgt gggagaacaa ggtctcccag 4740gatccccagg cccggacggt
ccccccggcc ccatgggtcc cccaggactt cccggcctca 4800aaggagattc tggtcccaaa
ggtgaaaagg gtcatccagg cctgatcggg ctcatcggtc 4860ctccgggtga acagggtgag
aagggcgacc gtggtctccc tggcccccag ggctcctccg 4920gtcctaaggg agaacagggt
atcactggtc cttctggccc gattgggcct cctgggcccc 4980ctggcctgcc gggtccgcct
ggtccaaaag gtgctaaggg ctcctcgggt ccaactggcc 5040cgaagggtga ggcaggccac
ccaggacccc caggcccccc gggccccccg ggagaggtca 5100tccagcccct gccaatccag
gcatccagga cgcggcggaa catcgacgcc agccagctgc 5160tggacgacgg gaatggcgag
aactacgtgg actacgcgga cggcatggaa gagatcttcg 5220gctctctcaa ctctctgaag
ctggagattg agcagatgaa acggcccctg ggcacgcagc 5280agaaccccgc ccgcacctgc
aaggacctgc agctctgcca ccccgacttc ccagatggtg 5340aatactgggt cgatcctaac
caaggatgct ccagggattc cttcaaggtt tactgcaact 5400tcacagccgg ggggtcgaca
tgcgtcttcc ctgacaagaa gtccgaaggg gccagaatca 5460cttcttggcc caaagaaaac
ccgggctcct ggttcagtga attcaagcgt gggaaactgc 5520tctcctatgt ggacgccgag
ggcaaccctg tgggtgtggt acagatgacc ttcctgcggc 5580tgctgagcgc ctctgcccac
cagaacgtca cctaccactg ctaccagtca gtggcctggc 5640aggacgcagc cacgggcagc
tacgacaagg ccctccgctt cctgggctcc aacgacgagg 5700agatgtccta tgacaacaac
ccctacatcc gcgccctggt ggacggctgt gctaccaaga 5760aaggctacca gaagacggtt
ctggagatcg acacccccaa agtggagcag gtgcccatcg 5820tggacatcat gttcaatgac
ttcggtgaag cgtcacagaa atttggattt gaagtggggc 5880cggcttgctt catgggctag
gagccgccga gcccgggctc ccgagagcaa cctcgtgacc 5940tcagcatgcc attcgttcgt
gagtgtcccg tgcacgtcct gaccctggac agtgaaggct 6000tctccctccc ctcccacctg
acttcatcta cgcctcggca ccacggggtg tgggacccca 6060gcccggagag aacagaggga
aggagccgcg cccccacctg gagctgaatc acatgaccta 6120gctgcacccc agcgcctggg
cccgccccac gctctgtcca cacccacgcg ccccgggagc 6180ggggccatgc ctccagcccc
ccagctcgcc cgacccatcc tgttcgtgaa taggtctcag 6240gggttggggg agggactgcc
agatttggac actatatttt tttctaaatt caacttgaag 6300atgtgtattt cccctgacct
tcaaaaaatg ttccaaggta agcctcgtaa aggtcatccc 6360accatcacca aagcctccgt
ttttaacaac ctccaacacg atccatttag aggccaaatg 6420tcattctgca ggtgccttcc
cgatggatta aaggtgctta tgtttttgtg agttttaagt 6480aaatatttgt attgtattgt
tataaatgtt aagtgtgcct ggctttcaat catgcacgga 6540aacccagtct cagtcccacg
gacagaatgg gcgaggcatg gattctgggt tgcagtaccg 6600ttctgattag aaataggaag
tctccccacc cccgccctgg ccaagaacgt gcaataaatt 6660ggaagtttgc cccggggcag
caagaattta tgctgccatt gaaaagcagg taccagtgcc 6720ccttttcaga cagtttttga
ttcgctctag actttttttt tttttaatag ggaaaaaatt 6780tgataatttt cttttttcta
catgcactta agactaaaac acaggtttgg attaatttta 6840tttgcttcct ttttccgctt
ttcttcccgc agagcctgat gggagaatgt ccagggcagg 6900gaaaccacat tttttgtagg
tgataactca atgaaaattg gtgcttattt tttacacttc 6960tctcttgtgg ctctcttgtg
gtgctatcta tctgttttaa ggtctccttg aaggcgcact 7020ggggaccctg gccatgcctc
gttctccctg ctttctttat cctgttattg cctccacagt 7080ctgttgccaa ggactctaag
atcaatgcac gtcactttcc tttccactgg gcaggatagc 7140caagcacact ccctcctgcg
ctctcccgcc ccggtgcgtc cactcccgag ggctgttatg 7200aggactgggt tgtgcctact
tgatttgaaa acacacacaa gcaataaaaa gcctcttcct 7260gcattgtctg tggtgtgacc
atagcagatt atatttggtt cctgaatgtt tgtggtgcta 7320atttctgtgt ttgttccaag
ccgttcagtc atgccatgcg ctgcctcggt agatggagta 7380atgtacaatg aactccatga
gtctctccag ggctgcctgc agcacgtctt ttccaagtag 7440cctatttgga ttcccatctc
aaatgtcctg gatgcgagcg tcagcggctc cagagctcgg 7500ggcgggtgag gtcccctttg
gggaaccctt tcctggccat cgaggtcggg gggctgccgt 7560ctgtgggcag gaggacccga
ggggcagcca ggaaaggcga tctcttcact gtgaaaagtt 7620gcccgggtgc agcgcctttt
ccttctacca tgggaaatgc aggctgggcc cttggggtga 7680gcctgcgggg ctctggtgct
gtccccgacc cccaccacca ccagaatgca gttccagctt 7740aggaagccac aaacaagcca
cccaggagga acaaaacacc gccagcgtgg attttccaaa 7800tttccctgga aagtaagtct
cgctcttgcc aaagaaaagt ctggcttgga gagtctctgg 7860agcccaggat gccagcatgt
gccaatgact gtcaccttca tctcttcaaa agaaaagcca 7920tagccgagga ctgtcccgcg
acccccgtgg actgcgtcta ggtcatgtga ttctgttttc 7980atttctcatc ccatccaatt
tgtccttttc tcctgtcatt ttcttcctct gtggtccctt 8040caaagttgtt ataatttgta
ctgaacttca aaatgtgtcc cgttctcccc agaccactct 8100agccacagta tattgcaata
aaattacttc ttatatttgc agaaattctt ttggtgtaat 8160tttatttttt cctctcaata
tatataattg gacaaacgct ggcaaaaaga aaaaaatggt 8220aagcaaaaaa cccaagataa
agtttcgagg acatcaggcc ttttgaaata caatgtcaaa 8280tgacacattg tacggtttca
aaaaatccgc tagacatgtc ataagtttta actgtaatgc 8340ccaggaaagg atatcttaaa
atattctaaa cttgtgtaac aaaggaataa ttaactgtaa 8400tagtttttca ataaatcgag
ttgggtgttt ccaccgtaaa aaaaaaaaaa aaaaa 8455306930DNAHomo sapiens
30gaccgttgct tggcagacac tggatggtta tgagcctgaa caagctgaaa aggggcagga
60aaagaagtgg aggcagcatt cttcctattt aaagctgcat cgcttgaaaa aagttttcgc
120agactgtgct ggagctggtg ctgaaaaagg gggtttgcag aggctgccct ggggctggtg
180ctgaaagaag agcccacagc tgacttcatg gtgctacaat aacctcagaa tctacttttc
240actctcagga gaacccacat gtctaatatt tagacatgat ggcaaactgg gcggaagcaa
300gacctctcct cattcttatt gttttattag ggcaatttgt ctcaataaaa gcccaggaag
360aagacgagga tgaaggatat ggtgaagaaa tagcctgcac tcagaatggc cagatgtact
420taaacaggga catttggaaa cctgcccctt gtcagatctg tgtctgtgac aatggagcca
480ttctctgtga caagatagaa tgccaggatg tgctggactg tgccgaccct gtaacgcccc
540ctggggaatg ctgtcctgtc tgttcacaaa cacctggagg tggcaataca aattttggta
600gaggaagaaa gggacaaaag ggagaaccag gattagtgcc tgttgtaaca ggcatacgtg
660gtcgtccagg accggcagga cctccaggat cacagggacc aagaggagag cgagggccaa
720aaggaagacc tggccctcgt ggacctcagg gaattgatgg agaaccaggt gttcctggtc
780aacctggtgc tccaggacct cctggacatc cgtcccaccc aggacccgat ggcttgagca
840ggccgttttc agctcaaatg gctgggttgg atgaaaaatc tggacttggg agtcaagtag
900gactaatgcc tggctctgtg ggtcctgttg gcccaagggg accacagggt ttacaaggac
960agcaaggtgg tgcaggacct acaggacctc ctggtgaacc tggtgatcct ggaccaatgg
1020gtccgattgg ttcacgtgga ccagagggcc ctcctggtaa acctggggaa gatggtgaac
1080ctggcagaaa tggaaatcct ggtgaagtgg gatttgcagg atctccggga gctcgtggat
1140ttcctggggc tcctggtctt ccaggtctga agggtcaccg aggacacaaa ggtcttgaag
1200gccctaaagg tgaagttgga gcacctggtt ccaagggtga agctggcccc actggtccaa
1260tgggtgccat gggtcctctg ggtccgaggg gaatgccagg agagagaggg agacttgggc
1320cacagggtgc tcctggacaa cgaggtgcac atggtatgcc tggaaaacct ggaccaatgg
1380gtcctcttgg gataccaggc tcttctggtt ttccaggaaa tcctggaatg aagggagaag
1440caggtcctac aggggcgcga ggccctgaag gtcctcaggg gcagagaggt gaaactgggc
1500ccccaggtcc agttggctct ccaggtcttc ctggtgcaat aggaactgat ggtactcctg
1560gtgccaaagg cccaacgggc tctccgggta cctctggtcc tcctggctca gcagggcctc
1620ctggatctcc aggacctcag ggtagcactg gtcctcaggg aattcgaggc caaccgggtg
1680atccaggagt tccaggtttc aaaggagaag ctggcccaaa aggggaacca gggccacatg
1740gtattcaggg tccgataggc ccacccggtg aagaaggcaa aagaggtccc agaggtgacc
1800caggaacagt tggtcctcca gggccagtgg gagaaagggg tgctcctggc aatcgtggtt
1860ttccaggctc tgatggttta cctgggccaa agggtgctca aggagaacgg ggtcctgtag
1920gttcttcagg acccaaagga agccaggggg atccaggacg tccaggggaa cctgggcttc
1980caggtgctcg gggtttgaca ggaaatcctg gtgttcaagg tcctgaagga aaacttggac
2040ctttgggtgc gccaggggaa gatggccgtc caggtcctcc aggctccata ggaatcagag
2100ggcagcccgg gagcatgggc cttccaggcc ccaaaggtag cagtggtgac cctgggaaac
2160ctggagaagc aggaaatgct ggagttcctg ggcagagggg agctcctgga aaagatggtg
2220aagttggtcc ttctggtcct gtgggcccgc cgggtctagc tggtgaaaga ggagaacaag
2280gacctccagg ccccacaggt tttcaggggc ttcctggtcc tccagggcct cctggagaag
2340gtggaaaacc aggtgatcaa ggtgttcctg gagatcccgg agcagttggc ccgttaggac
2400ctagaggaga acgaggaaat cctggggaaa gaggagaacc tgggataact ggactccctg
2460gtgagaaggg aatggctgga ggacatggtc ctgatggccc aaaaggcagt ccaggtccat
2520ctgggacccc tggagataca ggcccaccag gtcttcaagg tatgccggga gaaagaggaa
2580ttgcaggaac tcctggcccc aagggtgaca gaggtggcat aggagaaaaa ggtgctgaag
2640gcacagctgg aaatgatggt gcaagaggtc ttccaggtcc tttgggccct ccaggtccgg
2700caggtcctac tggagaaaag ggtgaacctg gtcctcgagg tttagttggc cctcctggct
2760cccggggcaa tcctggttct cgaggtgaaa atgggccaac tggagctgtt ggttttgccg
2820gaccccaggg tcctgacgga cagcctggag taaaaggtga acctggagag ccaggacaga
2880agggagatgc tggttctcct ggaccacaag gtttagcagg atcccctggc cctcatggtc
2940ctaatggtgt tcctggacta aaaggtggtc gaggaaccca aggtccgcct ggtgctacag
3000gatttcctgg ttctgcgggc agagttggac ctccaggccc tgctggagct ccaggacctg
3060cgggacccct aggggaaccc gggaaggagg gacctccagg tcttcgtggg gaccctggct
3120ctcatgggcg tgtgggagat cgaggaccag ctggcccccc tggtggccca ggagacaaag
3180gggacccagg agaagatggg caacctggtc cagatggccc ccctggtcca gctggaacga
3240ccgggcagag aggaattgtt ggcatgcctg ggcaacgtgg agagagaggc atgcccggcc
3300taccaggccc agcgggaaca ccaggaaaag taggaccaac tggtgcaaca ggagataaag
3360gtccacctgg acctgtgggg cccccaggct ccaatggtcc tgtaggggaa cctggaccag
3420aaggtccagc tggcaatgat ggtaccccag gacgggatgg tgctgttgga gaacgtggtg
3480atcgtggaga ccctgggcct gcaggtctgc caggctctca gggtgcccct ggaactcctg
3540gccctgtggg tgctccagga gatgcaggac aaagaggaga tccgggttct cggggtccta
3600taggaccacc tggtcgagct gggaaacgtg gattacctgg accccaagga cctcgtggtg
3660acaaaggtga tcatggagac cgaggcgaca gaggtcagaa gggccacaga ggctttactg
3720gtcttcaggg tcttcctggc cctcctggtc caaatggtga acaaggaagt gctggaatcc
3780ctggaccatt tggcccaaga ggtcctccag gcccagttgg tccttcaggt aaagaaggaa
3840accctgggcc acttgggcca attggacctc caggtgtacg aggcagtgta ggagaagcag
3900gacctgaggg ccctcctggt gagcctggcc cacctggccc tccgggtccc cctggccacc
3960ttacagctgc tcttggggat atcatggggc actatgatga aagcatgcca gatccacttc
4020ctgagtttac tgaagatcag gcggctcctg atgacaaaaa caaaacggac ccaggggttc
4080atgctaccct gaagtcactc agtagtcaga ttgaaaccat gcgcagcccc gatggctcga
4140aaaagcaccc agcccgcacg tgtgatgacc taaagctttg ccattccgca aagcagagtg
4200gtgaatactg gattgatcct aaccaaggat ctgttgaaga tgcaatcaaa gtttactgca
4260acatggaaac aggagaaaca tgtatttcag caaacccatc cagtgtacca cgtaaaacct
4320ggtgggccag taaatctcct gacaataaac ctgtttggta tggtcttgat atgaacagag
4380ggtctcagtt cgcttatgga gaccaccaat cacctaatac agccattact cagatgactt
4440ttttgcgcct tttatcaaaa gaagcctccc agaacatcac ttacatctgt aaaaacagtg
4500taggatacat ggacgatcaa gctaagaacc tcaaaaaagc tgtggttctc aaaggggcaa
4560atgacttaga tatcaaagca gagggaaata ttagattccg gtatatcgtt cttcaagaca
4620cttgctctaa gcggaatgga aatgtgggca agactgtctt tgaatataga acacagaatg
4680tggcacgctt gcccatcata gatcttgctc ctgtggatgt tggcggcaca gaccaggaat
4740tcggcgttga aattgggcca gtttgttttg tgtaaagtaa gccaagacac atcgacaatg
4800agcaccacca tcaatgacca ccgccattca caagaacttt gactgtttga agttgatcct
4860gagactcttg aagtaatggc tgatcctgca tcagcattgt atatatggtc ttaagtgcct
4920ggcctcctta tccttcagaa tatttatttt acttacaatc ctcaagtttt aattgatttt
4980aaatattttt caatacaaca gtttaggttt aagatgacca atgacaatga ccacctttgc
5040agaaagtaaa ctgattgaat aaataaatct ccgttttctt caatttattt cagtgtaatg
5100aaaaagttgc ttagtattta tgaggaaatt cttcttcctg gcaggtagct taaagagtgg
5160ggtatataga gccacaacac atgtttattt tgcttggctg cagttgaaaa atagaaatta
5220gtgccctttt gtgacctctc attccaagat tgtcaattaa aaatgagttt aaaatgttta
5280acttgtgatc gagacctaca tgcatgtctt gatattgtgt aactataata gagactcttt
5340aaggagaatc ttaaaaaaaa aaaaacgttt ctcactgtct taaatagaat ttttaaatag
5400tatatattca gtggcatttt ggagaacaaa gtgaatttac ttcgacttct taaatttttg
5460taaaagacta taagtttaga catctttctc attcaaattt aaagatatct ttctcctctt
5520gatcaatcta tcaatattga tagaagtcac actagtatat accatttaat acatttacac
5580tttcttattt aagaagatat tgaatgcaaa ataattgaca tatagaactt tacaaacata
5640tgtccaagga ctctaaattg agactcttcc acatgtacaa tctcatcatc ctgaagccta
5700taatgaagaa aaagatctag aaactgagtt gtggagctga ctctaatcaa atgtgatgat
5760tggaattaga ccatttggcc tttgaacttt cataggaaaa atgacccaac atttcttagc
5820atgagctacc tcatctctag aagctgggat ggacttacta ttcttgttta tattttagat
5880actgaaaggt gctatgcttc tgttattatt ccaagactgg agataggcag ggctaaaaag
5940gtattattat ttttccttta atgatggtgc taaaattctt cctataaaat tccttaaaaa
6000taaagatggt ttaatcacta ccattgtgaa aacataactg ttagacttcc cgtttctgaa
6060agaaagagca tcgttccaat gcttgttcac tgttcctctg tcatactgta tctggaatgc
6120tttgtaatac ttgcatgctt cttagaccag aacatgtagg tccccttgtg tctcaatact
6180ttttttttct taattgcatt tgttggctct attttaattt ttttctttta aaataaacag
6240ctgggaccat cccaaaagac aagccatgca tacaactttg gtcatgtatc tctgcaaagc
6300atcaaattaa atgcacgctt ttgtcatgtc agtggttttt gttttgtgaa attcctttga
6360ccatattaga tctatttcat ttccaatagt gaaaaggaga tgtggtggta tactttgttt
6420gccatttgtt taaaagatac aacggatacc ttctatcatg tatgtactgg cttataaatg
6480aaaatctatc tacaacatta cccacaaagg caacatgaca ccaattatca ctgcctctgc
6540ccttaaaaat gtcagagtag tattattgat aaaaagggca agcaatagat ttttcatgac
6600tgaataaact gtaataataa aacatatgtc tcaaagtgta tcacatatga atttagccta
6660attgttttca gtttcattct caatatttag tttacaacat cattttcccc taaactggtt
6720atattttgac ctgtatatct taaatttgag tatttatatg cctaaataca tgtgtgagtt
6780ttgtttgact tccaagtcca aactataaga ttatataagt tcatatagat gaatcagaaa
6840tatgtggtaa tactattaag tcacaaacac taacaatttc caactataga aataacagtt
6900cttatttgga ttttgggaat gctaccaata
6930313307DNAHomo sapiens 31caccttctgc actgctcatc tgggcagagg aagcttcaga
aagctgccaa ggcaccatct 60ccaggaactc ccagcacgca gaatccatct gagaatatgc
tgccacaaat accctttttg 120ctgctagtat ccttgaactt ggttcatgga gtgttttacg
ctgaacgata ccaaatgccc 180acaggcataa aaggcccact acccaacacc aagacacagt
tcttcattcc ctacaccata 240aagagtaaag gtatagcagt aagaggagag caaggtactc
ctggtccacc aggccctgct 300ggacctcgag ggcacccagg tccttctgga ccaccaggaa
aaccaggcta cggaagtcct 360ggactccaag gagagccagg gttgccagga ccaccgggac
catcagctgt agggaaacca 420ggtgtgccag gactcccagg aaaaccagga gagagaggac
catatggacc aaaaggagat 480gttggaccag ctggcctacc aggaccccgg ggcccaccag
gaccacctgg aatccctgga 540ccggctggaa tttctgtgcc aggaaaacct ggacaacagg
gacccacagg agccccagga 600cccaggggct ttcctggaga aaagggtgca ccaggagtcc
ctggtatgaa tggacagaaa 660ggggaaatgg gatatggtgc tcctggtcgt ccaggtgaga
ggggtcttcc aggccctcag 720ggtcccacag gaccatctgg ccctcctgga gtgggaaaaa
gaggtgaaaa tggggttcca 780ggacagccag gcatcaaagg tgatagaggt tttccgggag
aaatgggacc aattggccca 840ccaggtcccc aaggccctcc tggggaacga gggccagaag
gcattggaaa gccaggagct 900gctggagccc caggccagcc agggattcca ggaacaaaag
gtctccctgg ggctccagga 960atagctgggc ccccagggcc tcctggcttt gggaaaccag
gcttgccagg cctgaaggga 1020gaaagaggac ctgctggcct tcctgggggt ccaggtgcca
aaggggaaca agggccagca 1080ggtcttcctg ggaagccagg tctgactgga ccccctggga
atatgggacc ccaaggacca 1140aaaggcatcc cgggtagcca tggtctccca ggccctaaag
gtgagacagg gccagctggg 1200cctgcaggat accctggggc taagggtgaa aggggttccc
ctgggtcaga tggaaaacca 1260gggtacccag gaaaaccagg tctcgatggt cctaagggta
acccagggtt accaggtcca 1320aaaggtgatc ctggagttgg aggacctcct ggtctcccag
gccctgtggg cccagcagga 1380gcaaagggaa tgcccggaca caatggagag gctggcccaa
gaggtgcccc tggaatacca 1440ggtactagag gccctattgg gccaccaggc attccaggat
tccctgggtc taaaggggat 1500ccaggaagtc ccggtcctcc tggcccagct ggcatagcaa
ctaagggcct caatggaccc 1560accgggccac cagggcctcc aggtccaaga ggccactctg
gagagcctgg tcttccaggg 1620ccccctgggc ctccaggccc accaggtcaa gcagtcatgc
ctgagggttt tataaaggca 1680ggccaaaggc ccagtctttc tgggacccct cttgttagtg
ccaaccaggg ggtaacagga 1740atgcctgtgt ctgcttttac tgttattctc tccaaagctt
acccagcaat aggaactccc 1800ataccatttg ataaaatttt gtataacagg caacagcatt
atgacccaag gactggaatc 1860tttacttgtc agataccagg aatatactat ttttcatacc
acgtgcatgt gaaagggact 1920catgtttggg taggcctgta taagaatggc acccctgtaa
tgtacaccta tgatgaatac 1980accaaaggct acctggatca ggcttcaggg agtgccatca
tcgatctcac agaaaatgac 2040caggtgtggc tccagcttcc caatgccgag tcaaatggcc
tatactcctc tgagtatgtc 2100cactcctctt tctcaggatt cctagtggct ccaatgtgag
tacacacaga gctaatctaa 2160atcttgtgct agaaaaagca ttctctaact ctaccccacc
ctacaaaatg catatggagg 2220taggctgaaa agaatgtaat ttttattttc tgaaatacag
atttgagcta tcagaccaac 2280aaaccttccc cctgaaaagt gagcagcaac gtaaaaacgt
atgtgaagcc tctcttgaat 2340ttctagttag caatcttaag gctctttaag gttttctcca
atattaaaaa atatcaccaa 2400agaagtcctg ctatgttaaa aacaaacaac aaaaaacaaa
caacaaaaaa aaaattaaaa 2460aaaaaaacag aaatagagct ctaagttatg tgaaatttga
tttgagaaac tcggcatttc 2520ctttttaaaa aagcctgttt ctaactatga atatgagaac
ttctaggaaa catccaggag 2580gtatcatata actttgtaga acttaaatac ttgaatattc
aaatttaaaa gacactgtat 2640cccctaaaat atttctgatg gtgcactact ctgaggcctg
tatggcccct ttcatcaata 2700tctattcaaa tatacaggtg catatatact tgttaaagct
cttatataaa aaagccccaa 2760aatattgaag ttcatctgaa atgcaaggtg ctttcatcaa
tgaacctttt caaacttttc 2820tatgattgca gagaagcttt ttatataccc agcataactt
ggaaacaggt atctgaccta 2880ttcttattta gttaacacaa gtgtgattaa tttgatttct
ttaattcctt attgaatctt 2940atgtgatatg attttctgga tttacagaac attagcacat
gtaccttgtg cctcccattc 3000aagtgaagtt ataatttaca ctgagggttt caaaattcga
ctagaagtgg agatatatta 3060tttatttatg cactgtactg tatttttata ttgctgttta
aaacttttaa gctgtgcctc 3120acttattaaa gcacaaaatg ttttacctac tccttattta
cgacgcaata aaataacatc 3180aatagatttt taggctgaat taatttgaaa gcagcaattt
gctgttctca accattcttt 3240caaggctttt cattgttcaa agttaataaa aaagtaggac
aataaagtga aaaaaaaaaa 3300aaaaaaa
3307327291DNAHomo sapiens 32acacagtact ctcagcttgt
tggtggaagc ccctcatctg ccttcattct gaaggcaggg 60cccggcagag gaaggatcag
agggtcgcgg ccggagggtc ccggccggtg gggccaactc 120agagggagag gaaagggcta
gagacacgaa gaacgcaaac catcaaattt agaagaaaaa 180gccctttgac tttttccccc
tctccctccc caatggctgt gtagcaaaca tccctggcga 240taccttggaa aggacgaagt
tggtctgcag tcgcaatttc gtgggttgag ttcacagttg 300tgagtgcggg gctcggagat
ggagccgtgg tcctctaggt ggaaaacgaa acggtggctc 360tgggatttca ccgtaacaac
cctcgcattg accttcctct tccaagctag agaggtcaga 420ggagctgctc cagttgatgt
actaaaagca ctagattttc acaattctcc agagggaata 480tcaaaaacaa cgggattttg
cacaaacaga aagaattcta aaggctcaga tactgcttac 540agagtttcaa agcaagcaca
actcagtgcc ccaacaaaac agttatttcc aggtggaact 600ttcccagaag acttttcaat
actatttaca gtaaaaccaa aaaaaggaat tcagtctttc 660cttttatcta tatataatga
gcatggtatt cagcaaattg gtgttgaggt tgggagatca 720cctgtttttc tgtttgaaga
ccacactgga aaacctgccc cagaagacta tcccctcttc 780agaactgtta acatcgctga
cgggaagtgg catcgggtag caatcagcgt ggagaagaaa 840actgtgacaa tgattgttga
ttgtaagaag aaaaccacga aaccacttga tagaagtgag 900agagcaattg ttgataccaa
tggaatcacg gtttttggaa caaggatttt ggatgaagaa 960gtttttgagg gggacattca
gcagtttttg atcacaggtg atcccaaggc agcatatgac 1020tactgtgagc attatagtcc
agactgtgac tcttcagcac ccaaggctgc tcaagctcag 1080gaacctcaga tagatgagta
tgcaccagag gatataatcg aatatgacta tgagtatggg 1140gaagcagagt ataaagaggc
tgaaagtgta acagagggac ccactgtaac tgaggagaca 1200atagcacaga cggaggcaaa
catcgttgat gattttcaag aatacaacta tggaacaatg 1260gaaagttacc agacagaagc
tcctaggcat gtttctggga caaatgagcc aaatccagtt 1320gaagaaatat ttactgaaga
atatctaacg ggagaggatt atgattccca gaggaaaaat 1380tctgaggata cactatatga
aaacaaagaa atagacggca gggattctga tcttctggta 1440gatggagatt taggcgaata
tgatttttat gaatataaag aatatgaaga taaaccaaca 1500agccccccta atgaagaatt
tggtccaggt gtaccagcag aaactgatat tacagaaaca 1560agcataaatg gccatggtgc
atatggagag aaaggacaga aaggagaacc agcagtggtt 1620gagcctggta tgcttgtcga
aggaccacca ggaccagcag gacctgcagg tattatgggt 1680cctccaggtc tacaaggccc
cactggaccc cctggtgacc ctggcgatag gggcccccca 1740ggacgtcctg gcttaccagg
ggctgatggt ctacctggtc ctcctggtac tatgttgatg 1800ttaccgttcc gttatggtgg
tgatggttcc aaaggaccaa ccatctctgc tcaggaagct 1860caggctcaag ctattcttca
gcaggctcgg attgctctga gaggcccacc tggcccaatg 1920ggtctaactg gaagaccagg
tcctgtgggg gggcctggtt catctggggc caaaggtgag 1980agtggtgatc caggtcctca
gggccctcga ggcgtccagg gtccccctgg tccaacggga 2040aaacctggaa aaaggggtcg
tccaggtgca gatggaggaa gaggaatgcc aggagaacct 2100ggggcaaagg gagatcgagg
gtttgatgga cttccgggtc tgccaggtga caaaggtcac 2160aggggtgaac gaggtcctca
aggtcctcca ggtcctcctg gtgatgatgg aatgagggga 2220gaagatggag aaattggacc
aagaggtctt ccaggtgaag ctggcccacg aggtttgctg 2280ggtccaaggg gaactccagg
agctccaggg cagcctggta tggcaggtgt agatggcccc 2340ccaggaccaa aagggaacat
gggtccccaa ggggagcctg ggcctccagg tcaacaaggg 2400aatccaggac ctcagggtct
tcctggtcca caaggtccaa ttggtcctcc tggtgaaaaa 2460ggaccacaag gaaaaccagg
acttgctgga cttcctggtg ctgatgggcc tcctggtcat 2520cctgggaaag aaggccagtc
tggagaaaag ggggctctgg gtccccctgg tccacaaggt 2580cctattggat acccgggccc
ccggggagta aagggagcag atggtgtcag aggtctcaag 2640ggatctaaag gtgaaaaggg
tgaagatggt tttccaggat tcaaaggtga catgggtcta 2700aaaggtgaca gaggagaagt
tggtcaaatt ggcccaagag gggaagatgg ccctgaagga 2760cccaaaggtc gagcaggccc
aactggagac ccaggtcctt caggtcaagc aggagaaaag 2820ggaaaacttg gagttccagg
attaccagga tatccaggaa gacaaggtcc aaagggttcc 2880actggattcc ctgggtttcc
aggtgccaat ggagagaaag gtgcacgggg agtagctggc 2940aaaccaggcc ctcggggtca
gcgtggtcca acgggtcctc gaggttcaag aggtgcaaga 3000ggtcccactg ggaaacctgg
gccaaagggc acttcaggtg gcgatggccc tcctggccct 3060ccaggtgaaa gaggtcctca
aggacctcag ggtccagttg gattccctgg accaaaaggc 3120cctcctggac cacctgggaa
ggatgggctg ccaggacacc ctgggcaacg tggggagact 3180ggatttcaag gcaagaccgg
ccctcctggg ccagggggag tggttggacc acagggacca 3240accggtgaga ctggtccaat
aggggaacgt gggcatcctg gccctcctgg ccctcctggt 3300gagcaaggtc ttcctggtgc
tgcaggaaaa gaaggtgcaa agggtgatcc aggtcctcaa 3360ggtatctcag ggaaagatgg
accagcagga ttacgtggtt tcccagggga aagaggtctt 3420cctggagctc agggtgcacc
tggactgaaa ggaggggaag gtccccaggg cccaccaggt 3480ccagttggct caccaggaga
acgtgggtca gcaggtacag ctggcccaat tggtttacca 3540gggcgcccgg gacctcaggg
tcctcctggt ccagctggag agaaaggtgc tcctggagaa 3600aaaggtcccc aagggcctgc
agggagagat ggagttcaag gtcctgttgg tctcccaggg 3660ccagctggtc ctgccggctc
ccctggggaa gacggagaca agggtgaaat tggtgagccg 3720ggacaaaaag gcagcaaggg
tgacaaggga gaaaatggcc ctcccggtcc cccaggtctt 3780caaggaccag ttggtgcccc
tggaattgct ggaggtgatg gtgaaccagg tcctagagga 3840cagcagggga tgtttgggca
aaaaggtgat gagggtgcca gaggcttccc tggacctcct 3900ggtccaatag gtcttcaggg
tctgccaggc ccacctggtg aaaaaggtga aaatggggat 3960gttggtccca tggggccacc
tggtcctcca ggcccaagag gccctcaagg tcccaatgga 4020gctgatggac cacaaggacc
cccagggtct gttggttcag ttggtggtgt tggagaaaag 4080ggtgaacctg gagaagcagg
gaacccaggg cctcctgggg aagcaggtgt aggcggtccc 4140aaaggagaaa gaggagagaa
aggggaagct ggtccacctg gagctgctgg acctccaggt 4200gccaaggggc caccaggtga
tgatggccct aagggtaacc cgggtcctgt tggttttcct 4260ggagatcctg gtcctcctgg
ggaacctggc cctgcaggtc aagatggtgt tggtggtgac 4320aagggtgaag atggagatcc
tggtcaaccg ggtcctcctg gcccatctgg tgaggctggc 4380ccaccaggtc ctcctggaaa
acgaggtcct cctggagctg caggtgcaga gggaagacaa 4440ggtgaaaaag gtgctaaggg
ggaagcaggt gcagaaggtc ctcctggaaa aaccggccca 4500gtcggtcctc agggacctgc
aggaaagcct ggtccagaag gtcttcgggg catccctggt 4560cctgtgggag aacaaggtct
ccctggagct gcaggccaag atggaccacc tggtcctatg 4620ggacctcctg gcttacctgg
tctcaaaggt gaccctggct ccaagggtga aaagggacat 4680cctggtttaa ttggcctgat
tggtcctcca ggagaacaag gggaaaaagg tgaccgaggg 4740ctccctggaa ctcaaggatc
tccaggagca aaaggggatg ggggaattcc tggtcctgct 4800ggtcccttag gtccacctgg
tcctccaggt ttaccaggtc ctcaaggccc aaagggtaac 4860aaaggctcta ctggacccgc
tggccagaaa ggtgacagtg gtcttccagg gcctcctggg 4920tctccaggtc cacctggtga
agtcattcag cctttaccaa tcttgtcctc caaaaaaacg 4980agaagacata ctgaaggcat
gcaagcagat gcagatgata atattcttga ttactcggat 5040ggaatggaag aaatatttgg
ttccctcaat tccctgaaac aagacattga gcatatgaaa 5100tttccaatgg gtactcagac
caatccagcc cgaacttgta aagacctgca actcagccat 5160cctgacttcc cagatggtga
atattggatt gatcctaacc aaggttgctc aggagattcc 5220ttcaaagttt actgtaattt
cacatctggt ggtgagactt gcatttatcc agacaaaaaa 5280tctgagggag taagaatttc
atcatggcca aaggagaaac caggaagttg gtttagtgaa 5340tttaagaggg gaaaactgct
ttcatactta gatgttgaag gaaattccat caatatggtg 5400caaatgacat tcctgaaact
tctgactgcc tctgctcggc aaaatttcac ctaccactgt 5460catcagtcag cagcctggta
tgatgtgtca tcaggaagtt atgacaaagc acttcgcttc 5520ctgggatcaa atgatgagga
gatgtcctat gacaataatc cttttatcaa aacactgtat 5580gatggttgtg cgtccagaaa
aggctatgaa aagactgtca ttgaaatcaa tacaccaaaa 5640attgatcaag tacctattgt
tgatgtcatg atcaatgact ttggtgatca gaatcagaag 5700ttcggatttg aagttggtcc
tgtttgtttt cttggctaag attaagacaa agaacatatc 5760aaatcaacag aaaatatacc
ttggtgccac caacccattt tgtgccacat gcaagttttg 5820aataaggatg gtatagaaaa
caacgctgca tatacaggta ccatttagga aataccgatg 5880cctttgtggg ggcagaatca
catggcaaaa gctttgaaaa tcataaagat ataagttggt 5940gtggctaaga tggaaacagg
gctgattctt gattcccaat tctcaactct ccttttccta 6000tttgaatttc tttggtgctg
tagaaaacaa aaaaagaaaa atatatattc ataaaaaata 6060tggtgctcat tctcatccat
ccaggatgta ctaaaacagt gtgtttaata aattgtaatt 6120attttgtgta cagttctata
ctgttatctg tgtccatttc caaaacttgc acgtgtccct 6180gaattccatc tgactctaat
tttatgagaa ttgcagaact ctgatggcaa taaatatatg 6240tattatgaaa aaataaagtt
gtaatttctg atgactctaa gtccctttct ttggttaata 6300ataaaatgcc tttgtatata
ttgatgttga agagttcaat tatttgatgt cgccaacaaa 6360attctcagag ggcaaaaatc
tggaagactt ttggaagcac actctgatca actcttctct 6420gccgacagtc attttgctga
atttcagcca aaaatattat gcattttgat gctttattca 6480aggctatacc tcaaactttt
tcttctcaga atccaggatt tcacaggata cttgtatata 6540tggaaaacaa gcaagtttat
atttttggac agggaaatgt gtgtaagaaa gtatattaac 6600aaatcaatgc ctccgtcaag
caaacaatca tatgtatact ttttttctac gttatctcat 6660ctccttgttt tcagtgtgct
tcaataatgc aggttaatat taaagatgga aattaagcaa 6720ttatttatga atttgtgcaa
tgttagattt tcttatcaat caagttcttg aatttgattc 6780taagttgcat attataacag
tctcgaaaat tattttactt gcccaacaaa tattactttt 6840ttcctttcaa gataatttta
taaatcattt gacctaccta attgctaaat gaataacata 6900tggtggactg ttattaagag
tatttgtttt aagtcattca ggaaaatcta aacttttttt 6960tccactaagg tatttacttt
aaggtagctt gaaatagcaa tacaatttaa aaattaaaaa 7020ctgaattttg tatctatttt
aagtaatata tgtaagactt gaaaataaat gttttatttc 7080ttatataaag tgttaaatta
attgatacca gatttcactg gaacagtttc aactgataat 7140ttatgacaaa agaacatacc
tgtaatattg aaattaaaaa gtgaaatttg tcataaagaa 7200tttcttttat ttttgaaatc
gagtttgtaa atgtcctttt aagaagggag atatgaatcc 7260aataaataaa ctcaagtctt
ggctacctgg a 7291332471DNAHomo sapiens
33agaaagcgag cagccaccca gctccccgcc accgccatgg tccccgacac cgcctgcgtt
60cttctgctca ccctggctgc cctcggcgcg tccggacagg gccagagccc gttgggctca
120gacctgggcc cgcagatgct tcgggaactg caggaaacca acgcggcgct gcaggacgtg
180cgggagctgc tgcggcagca ggtcagggag atcacgttcc tgaaaaacac ggtgatggag
240tgtgacgcgt gcgggatgca gcagtcagta cgcaccggcc tacccagcgt gcggcccctg
300ctccactgcg cgcccggctt ctgcttcccc ggcgtggcct gcatccagac ggagagcggc
360gcgcgctgcg gcccctgccc cgcgggcttc acgggcaacg gctcgcactg caccgacgtc
420aacgagtgca acgcccaccc ctgcttcccc cgagtccgct gtatcaacac cagcccgggg
480ttccgctgcg aggcttgccc gccggggtac agcggcccca cccaccaggg cgtggggctg
540gctttcgcca aggccaacaa gcaggtttgc acggacatca acgagtgtga gaccgggcaa
600cataactgcg tccccaactc cgtgtgcatc aacacccggg gctccttcca gtgcggcccg
660tgccagcccg gcttcgtggg cgaccaggcg tccggctgcc agcggcgcgc acagcgcttc
720tgccccgacg gctcgcccag cgagtgccac gagcatgcag actgcgtcct agagcgcgat
780ggctcgcggt cgtgcgtgtg tgccgttggc tgggccggca acgggatcct ctgtggtcgc
840gacactgacc tagacggctt cccggacgag aagctgcgct gcccggagcg ccagtgccgt
900aaggacaact gcgtgactgt gcccaactca gggcaggagg atgtggaccg cgatggcatc
960ggagacgcct gcgatccgga tgccgacggg gacggggtcc ccaatgaaaa ggacaactgc
1020ccgctggtgc ggaacccaga ccagcgcaac acggacgagg acaagtgggg cgatgcgtgc
1080gacaactgcc ggtcccagaa gaacgacgac caaaaggaca cagaccagga cggccggggc
1140gatgcgtgcg acgacgacat cgacggcgac cggatccgca accaggccga caactgccct
1200agggtaccca actcagacca gaaggacagt gatggcgatg gtatagggga tgcctgtgac
1260aactgtcccc agaagagcaa cccggatcag gcggatgtgg accacgactt tgtgggagat
1320gcttgtgaca gcgatcaaga ccaggatgga gacggacatc aggactctcg ggacaactgt
1380cccacggtgc ctaacagtgc ccaggaggac tcagaccacg atggccaggg tgatgcctgc
1440gacgacgacg acgacaatga cggagtccct gacagtcggg acaactgccg cctggtgcct
1500aaccccggcc aggaggacgc ggacagggac ggcgtgggcg acgtgtgcca ggacgacttt
1560gatgcagaca aggtggtaga caagatcgac gtgtgtccgg agaacgctga agtcacgctc
1620accgacttca gggccttcca gacagtcgtg ctggacccgg agggtgacgc gcagattgac
1680cccaactggg tggtgctcaa ccagggaagg gagatcgtgc agacaatgaa cagcgaccca
1740ggcctggctg tgggttacac tgccttcaat ggcgtggact tcgagggcac gttccatgtg
1800aacacggtca cggatgacga ctatgcgggc ttcatctttg gctaccagga cagctccagc
1860ttctacgtgg tcatgtggaa gcagatggag caaacgtatt ggcaggcgaa ccccttccgt
1920gctgtggccg agcctggcat ccaactcaag gctgtgaagt cttccacagg ccccggggaa
1980cagctgcgga acgctctgtg gcatacagga gacacagagt cccaggtgcg gctgctgtgg
2040aaggacccgc gaaacgtggg ttggaaggac aagaagtcct atcgttggtt cctgcagcac
2100cggccccaag tgggctacat cagggtgcga ttctatgagg gccctgagct ggtggccgac
2160agcaacgtgg tcttggacac aaccatgcgg ggtggccgcc tgggggtctt ctgcttctcc
2220caggagaaca tcatctgggc caacctgcgt taccgctgca atgacaccat cccagaggac
2280tatgagaccc atcagctgcg gcaagcctag ggaccagggt gaggacccgc cggatgacag
2340ccaccctcac cgcggctgga tgggggctct gcacccagcc ccaaggggtg gccgtcctga
2400gggggaagtg agaagggctc agagaggaca aaataaagtg tgtgtgcagg gaaaaaaaaa
2460aaaaaaaaaa a
247134618DNAHomo sapiens 34gcggccgcaa gctcggcact cacggctctg agggctccga
cggcactgac ggccatggcg 60cgttcgaacc tcccgctggc gctgggcctg gccctggtcg
cattctgcct cctggcgctg 120ccacgcgacg cccgggcccg gccgcaggag cgcatggtcg
gagaactccg ggacctgtcg 180cccgacgacc cgcaggtgca gaaggcggcg caggcggccg
tggccagcta caacatgggc 240agcaacagca tctactactt ccgagacacg cacatcatca
aggcgcagag ccagctggtg 300gccggcatca agtacttcct gacgatggag atggggagca
cagactgccg caagaccagg 360gtcactggag accacgtcga cctcaccact tgccccctgg
cagcaggggc gcagcaggag 420aagctgcgct gtgactttga ggtccttgtg gttccctggc
agaactcctc tcagctccta 480aagcacaact gtgtgcagat gtgataagtc cccgagggcg
aaggccattg ggtttggggc 540catggtggag ggcacttcag gtccgtgggc cgtatctgtc
acaataaatg gccagtgctg 600cttcttgcaa aaaaaaaa
618351279DNAHomo sapiens 35gggagggaga gaggcgcgcg
ggtgaaaggc gcattgatgc agcctgcggc ggcctcggag 60cgcggcggag ccagacgctg
accacgttcc tctcctcggt ctcctccgcc tccagctccg 120cgctgcccgg cagccgggag
ccatgcgacc ccagggcccc gccgcctccc cgcagcggct 180ccgcggcctc ctgctgctcc
tgctgctgca gctgcccgcg ccgtcgagcg cctctgagat 240ccccaagggg aagcaaaagg
cgcagctccg gcagagggag gtggtggacc tgtataatgg 300aatgtgctta caagggccag
caggagtgcc tggtcgagac gggagccctg gggccaatgg 360cattccgggt acacctggga
tcccaggtcg ggatggattc aaaggagaaa agggggaatg 420tctgagggaa agctttgagg
agtcctggac acccaactac aagcagtgtt catggagttc 480attgaattat ggcatagatc
ttgggaaaat tgcggagtgt acatttacaa agatgcgttc 540aaatagtgct ctaagagttt
tgttcagtgg ctcacttcgg ctaaaatgca gaaatgcatg 600ctgtcagcgt tggtatttca
cattcaatgg agctgaatgt tcaggacctc ttcccattga 660agctataatt tatttggacc
aaggaagccc tgaaatgaat tcaacaatta atattcatcg 720cacttcttct gtggaaggac
tttgtgaagg aattggtgct ggattagtgg atgttgctat 780ctgggttggt acttgttcag
attacccaaa aggagatgct tctactggat ggaattcagt 840ttctcgcatc attattgaag
aactaccaaa ataaatgctt taattttcat ttgctacctc 900tttttttatt atgccttgga
atggttcact taaatgacat tttaaataag tttatgtata 960catctgaatg aaaagcaaag
ctaaatatgt ttacagacca aagtgtgatt tcacactgtt 1020tttaaatcta gcattattca
ttttgcttca atcaaaagtg gtttcaatat tttttttagt 1080tggttagaat actttcttca
tagtcacatt ctctcaacct ataatttgga atattgttgt 1140ggtcttttgt tttttctctt
agtatagcat ttttaaaaaa atataaaagc taccaatctt 1200tgtacaattt gtaaatgtta
agaatttttt ttatatctgt taaataaaaa ttatttccaa 1260caaccttaat atctttaaa
1279362322DNAHomo sapiens
36atcattcggc cctcagactg ggctgggcag gtctgagagt tagggaaagt ccgttcccac
60tgccctcggg gagagaagaa aggagggggc aagggagaag ctgctggtcg gactcacaat
120gaaaacgctc cttcttttgc tgctggtgct cctggagctg ggagaggccc aaggatccct
180tcacagggtg cccctcagga ggcatccgtc cctcaagaag aagctgcggg cacggagcca
240gctctctgag ttctggaaat cccataattt ggacatgatc cagttcaccg agtcctgctc
300aatggaccag agtgccaagg aacccctcat caactacttg gatatggaat acttcggcac
360tatctccatt ggctccccac cacagaactt cactgtcatc ttcgacactg gctcctccaa
420cctctgggtc ccctctgtgt actgcactag cccagcctgc aagacgcaca gcaggttcca
480gccttcccag tccagcacat acagccagcc aggtcaatct ttctccattc agtatggaac
540cgggagcttg tccgggatca ttggagccga ccaagtctct gtggaaggac taaccgtggt
600tggccagcag tttggagaaa gtgtcacaga gccaggccag acctttgtgg atgcagagtt
660tgatggaatt ctgggcctgg gatacccctc cttggctgtg ggaggagtga ctccagtatt
720tgacaacatg atggctcaga acctggtgga cttgccgatg ttttctgtct acatgagcag
780taacccagaa ggtggtgcgg ggagcgagct gatttttgga ggctacgacc actcccattt
840ctctgggagc ctgaattggg tcccagtcac caagcaagct tactggcaga ttgcactgga
900taacatccag gtgggaggca ctgttatgtt ctgctccgag ggctgccagg ccattgtgga
960cacagggact tccctcatca ctggcccttc cgacaagatt aagcagctgc aaaacgccat
1020tggggcagcc cccgtggatg gagaatatgc tgtggagtgt gccaacctta acgtcatgcc
1080ggatgtcacc ttcaccatta acggagtccc ctataccctc agcccaactg cctacaccct
1140actggacttc gtggatggaa tgcagttctg cagcagtggc tttcaaggac ttgacatcca
1200ccctccagct gggcccctct ggatcctggg ggatgtcttc attcgacagt tttactcagt
1260ctttgaccgt gggaataacc gtgtgggact ggccccagca gtcccctaag gaggggcctt
1320gtgtctgtgc ctgcctgtct gacagacctt gaatatgtta ggctggggca ttctttacac
1380ctacaaaaag ttattttcca gagaatgtag ctgtttccag ggttgcaact tgaattaaga
1440ccaaacagaa catgagaata cacacacaca cacacatata cacacacaca cacttcacac
1500atacacacca ctcccaccac cgtcatgatg gaggaattac gttatacatt catattttgt
1560attgattttt gattatgaaa atcaaaaatt ttcacatttg attatgaaaa tctccaaaca
1620tatgcacaag cagagatcat ggtataataa atccctttgc aactccactc agccctgaca
1680acccatccac acacggccag gcctgtttat ctacactgct gcccactcct ctctccagct
1740ccacatgctg tacctggatc attctgaagc aaattccgag cattacatca ttttgtccat
1800aaatatttct aacatcctta aatatacaat cggaattcaa gcatctccca ttgtcccaca
1860aatgtttggc tgtttttgta gttggattgt ttgtattagg attcaagcaa ggcccatata
1920ttgcatttat ttgaaatgtc tgtaagtctc tttccatcta cagagtttag cacatttgaa
1980cgttgctggt tgaaatcccg aggtgtcatt tgacatggtt ctctgaactt atctttccta
2040taaaatggta gttagatctg gaggtctgat tttgtggcaa aaatacttcc taggtggtgc
2100tgggtacttc ttgttgcatc ctgtcaggag gcagataatg ctggtgcctc tctattggta
2160atgttaagac tgctgggtgg gtttggagtt cttggcttta atcattcatt acaaagttca
2220gcattttaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
2280aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa
2322374542DNAHomo sapiens 37ggtaggcgcg cccagacctg agacgggttg ggactgggct
gcgtcacgcg cgggctctaa 60gcgcccgggg ccccgcccag tggccggcac agccaatcgc
agcgcgggaa ggcggtgggg 120gcggggaagg ccgcctggaa acttaaatcc cgaggcgggc
gaacctgcac cagaccgcgg 180acgtctgtaa tctcagaggc ttgtttgctg agggtgcctg
cgcagctgcg acggctgctg 240gttttgaaac atgaatcttt cgctcgtcct ggctgccttt
tgcttgggaa tagcctccgc 300tgttccaaaa tttgaccaaa atttggatac aaagtggtac
cagtggaagg caacacacag 360aagattatat ggcgcgaatg aagaaggatg gaggagagca
gtgtgggaaa agaatatgaa 420aatgattgaa ctgcacaatg gggaatacag ccaagggaaa
catggcttca caatggccat 480gaatgctttt ggtgacatga ccaatgaaga attcaggcag
atgatgggtt gctttcgaaa 540ccagaaattc aggaagggga aagtgttccg tgagcctctg
tttcttgatc ttcccaaatc 600tgtggattgg agaaagaaag gctacgtgac gccagtgaag
aatcagaaac agtgtggttc 660ttgttgggct tttagtgcga ctggtgctct tgaaggacag
atgttccgga aaactgggaa 720acttgtctca ctgagcgagc agaatctggt ggactgttcg
cgtcctcaag gcaatcaggg 780ctgcaatggt ggcttcatgg ctagggcctt ccagtatgtc
aaggagaacg gaggcctgga 840ctctgaggaa tcctatccat atgtagcagt ggatgaaatc
tgtaagtaca gacctgagaa 900ttctgttgct aatgacactg gcttcacagt ggtcgcacct
ggaaaggaga aggccctgat 960gaaagcagtc gcaactgtgg ggcccatctc cgttgctatg
gatgcaggcc attcgtcctt 1020ccagttctac aaatcaggca tttattttga accagactgc
agcagcaaaa acctggatca 1080tggtgttctg gtggttggct acggctttga aggagcaaat
tcgaataaca gcaagtattg 1140gctcgtcaaa aacagctggg gtccagaatg gggctcgaat
ggctatgtaa aaatagccaa 1200agacaagaac aaccactgtg gaatcgccac agcagccagc
taccccaatg tgtgagctga 1260tggatggtga ggaggaagga cttaaggaca gcatgtctgg
ggaaatttta tcttgaaact 1320gaccaaacgc ttattgtgta agataaacca gttgaatcat
tgaggatcca agttgagatt 1380ttaattctgt gacattttta caagggtaaa atgttaccac
tactttaatt attgttatac 1440acagctttat gatatcaaag actcattgct taattctaag
acttttgaat tttcattttt 1500taaaaagatg tacaaaacag tttgaaataa attttaattc
gtatataaag gtgggccttt 1560ttttaatgca ttggcttttt gtgtagtcag gaaatataat
taagctctga aataataact 1620tcatgtgcca atggtatgtt aggagaaaga tgctagcaga
aagctggctt cctggctctt 1680gagtagttat aaaaatacag actttatatc agctgcccag
ttactgtgtg ttacccaact 1740cctgaatcta ccaaaatttg ataaacaatt tggaaggaca
aacctacatt tttttttttt 1800atgagacgga gtctctctca ctctgtcacc aaggctggag
tgcagtagtg tgatctcagc 1860tcactgcaac ctctgcctcc cgggtttaag caattctctg
cctcagcctc ccgagtagct 1920gggattacag gcgcgtgcca ccacgcctag tgtattttta
gtagagagag ggtttcacca 1980tcttggccag gctggtcttg aactctcgac ctcgtgatcc
acctgcctca gcctcccaaa 2040gtgttgggat tacagatgtg agccactgcg cccagcccct
aaattgtttt agtaaagaag 2100aaagcagttg actttttaga aaaggaaatg tcttcttcct
ttgacacata acctcaagat 2160ggttaggtta cgtgataaaa agtgaagctg cttgttactg
gagcctctct gaattgctgg 2220tgaccacttt gcagtctgag gaagacgcag gtgcaagtaa
gttaccaagg gtattttgtt 2280ttcattccaa gtctctgatg acatattcca cctagctctt
ctagaaaagc ttatttgtta 2340actagtgtta gcaaaaataa atggtgtggt taacaaatct
caggcatttg acagtagctg 2400gagaaatttg ttcaaggtcc acaagagatg aggctgggct
acaaaaagag gcctctggcc 2460gaagatggat aagagatcac aagtgaaagg tcagaagtca
ctaggccaga ctctggctac 2520ctaatccctg aattaaaaat aaactcaaaa tggggggaaa
aaggctaagc atatagttac 2580acaggtagta aatacaaata aaacgatatt tttgcatatt
taaaatactg gaagaaagta 2640caccaaactt gtgggtgaat tatgggtggt tatttaactc
gtttagtgaa aagcatgtat 2700ttttatgagc aggagaaaat agtaaagata tacccacatt
ctgcaggcta aaggagagcc 2760ctgtgtgtgt tttaaaagcc catgtttcca gtcaaataaa
aaattaccgg ggatgagctg 2820gtgaaattta atcgaaaggt gatccattgt gaatgcaatg
ggagggaagg ggcatgtggg 2880actgtgtatc ccaaaaaccc tttgatagcc tatgtccaca
gccatctctg gaaaatccag 2940tctaccatat tcagtcttga gttttctctg agaaaggatt
ccttcgtctt tgcagcagca 3000ggctaagttg gtactctcca cagattcttg gaacaacaac
ccaggctcta aggaaacatc 3060taggcactaa attgaatgaa agagtgatgg ctttttattc
aaataaaaaa atttcaaatc 3120tgtcaaaaca gcacccctcg gaaaactaaa atagagatat
ttcaagattt tataattttc 3180aaagaccttt gaaatatttt aaacttgtga aaagttacaa
acctgatgtg ttgtctaaaa 3240gcgtttttca aacaagccag tagacttgaa aaatctagtc
tgaagcacag acttaaccaa 3300tatttgctgg ggatatgccc ccaaatctgg ccataaacaa
aatctctgca gtactgtgac 3360aggttcatga tggccatgac gccatgctga aggtttaccg
gaatgagagc aaggaacacc 3420tggaccaccc agggcgggaa aacggcttaa aggcgttcct
aaactacaaa caatagcatg 3480agcgatctgt gccttaagga catgttcctg ctgcagataa
ctagccagac cccatgcctt 3540tgtttcgttt gggaaggaat acttttagtt aatctataat
ctatagaaac aatgtttatc 3600actggcttgc tgtcaataaa catgggtaaa tctctgttcg
gggctctcag ctctgaaagc 3660tgtgagtccc ctgatttccc actctgcatg ctatatttct
gggtgtgtgt ctttaatttc 3720tctagcgcct ctgggttagg gtctccatga ccaagctggt
ctcggcaaat atttcctata 3780ctcgaacatc taaattgttt gttttcttac ttgaattgca
aaggaagtaa ggaagggtga 3840gatagaaatc actgagcata ttaagaggga ggagtttgct
gatgtgggag tgtagttatg 3900acacttgggc atcatactag aggctatgga cttagcaata
aggaaacatc cactcagccc 3960tatgtgtggt tcatgtgtat gttttgccta tgaaagttga
aactgttttc taaatgttat 4020ttccttttgt aactgataaa atttctaaaa agcaaagaag
aaacttttct gtattaaaaa 4080taacaaacat tagtaaatgt atatatacag tcatgtgttg
cttaatgatg gggatacatt 4140ctgagaaatg tgtcttgggt gatttcatcg tgacgtgaac
atcatagagt acactgacac 4200aaacctagat ggtatagcct attacacacc taggctatat
ggtacagcct gttgctccta 4260agctacgcac ctgtgcaaca catcactgtg ctgagtactg
tacacagttg taacacagtg 4320gtaagtattc gtgtatctga acagaaaagt acagtaaaaa
tataggatca taattttatg 4380ggaccactat catatatgca gtcactgacc aaaacatcac
acagcacatg acttgtgtgt 4440atggatatct caattgtaaa caatttcaca aatgttcatt
atctcttttt aagtaagtta 4500ataagtaaaa ctattactaa gctgcaaaaa aaaaaaaaaa
aa 4542382099DNAHomo sapiens 38aatcactgct gtgcagggca
ggaaagctcc acacacacag cccagcaaac agcagcacgc 60tgctgaaaaa aagactcaga
ggagagagat aaggaaggaa agtagtgatg gatctcatcc 120caaacttggc cgtggaaacc
tggcttctcc tggctgtcag cctgatactc ctctatctat 180atggaacccg tacacatgga
ctttttaaga agcttggaat tccagggccc acacctctgc 240cttttttggg aaatgctttg
tccttccgta agggctattg gacgtttgac atggaatgtt 300ataaaaagta tagaaaagtc
tggggtattt atgactgtca acagcctatg ctggctatca 360cagatcccga catgatcaaa
acagtgctag tgaaagaatg ttattctgtc ttcacaaacc 420ggaggccttt cgggccagtg
ggatttatga aaaatgccat ctctatagct gaggatgaag 480aatggaagag aatacgatca
ttgctgtctc caacattcac cagcggaaaa ctcaaggaga 540tggtccctat cattgcccag
tatggagatg tgttggtgag aaatctgagg cgggaagcag 600agacaggcaa gcctgtcacc
ttgaaacacg tctttggggc ctacagcatg gatgtgatca 660ctagcacatc atttggagtg
agcatcgact ctctcaacaa tccacaagac ccctttgtgg 720aaaacaccaa gaagctttta
agatttaatc cattagatcc attcgttctc tcaataaaag 780tctttccatt ccttacccca
attcttgaag cattaaatat cactgtgttt ccaagaaaag 840ttataagttt tctaacaaaa
tctgtaaaac agataaaaga aggtcgcctc aaagagacac 900aaaagcaccg agtggatttc
cttcagctga tgattgactc tcagaattca aaagactctg 960agacccacaa agctctgtct
gatctggagc tcatggccca atcaattatc tttatttttg 1020ctggctatga aaccacgagc
agtgttctct ccttcattat atatgaactg gccactcacc 1080ctgatgtcca gcagaaagtg
cagaaggaaa ttgatacagt tttacccaat aaggcaccac 1140ccacctatga tactgtgcta
cagttggagt atcttgacat ggtggtgaat gaaacactca 1200gattattccc agttgctatg
agacttgaga gggtctgcaa aaaagatgtt gaaatcaatg 1260ggatgtttat tcccaaaggg
gtggtggtga tgattccaag ctatgttctt catcatgacc 1320caaagtactg gacagagcct
gagaagttcc tccctgaaag gttcagtaaa aagaacaagg 1380acaacataga tccttacata
tacacaccct ttggaagtgg acccagaaac tgcattggca 1440tgaggtttgc tctcgtgaac
atgaaacttg ctctagtcag agtccttcag aacttctcct 1500tcaaaccttg taaagaaaca
cagatccccc tgaaattacg ctttggagga cttcttctaa 1560cagaaaaacc cattgttcta
aaggctgagt caagggatga gaccgtaagt ggagcctgat 1620ttccctaagg acttctggtt
tgctctttaa gaaagctgtg ccccagaaca ccagagacct 1680caaattactt tacaaataga
accctgaaat gaagacgggc ttcatccaat gtgctgcata 1740aataatcagg gattctgtac
gtgcattgtg ctctctcatg gtctgtatag agtgttatac 1800ttggtaatat agaggagatg
accaaatcag tgctggggaa gtagatttgg cttctctgct 1860tctcatagga ctatctccac
cacccccagt tagcaccatt aactcctcct gagctctgat 1920aacataatta acatttctca
ataatttcaa ccacaatcat taataaaaat aggaattatt 1980ttgatggctc taacagtgac
atttatatca tgtgttatat ctgtagtatt ctatagtaag 2040ctttatatta agcaaatcaa
taaaaacctc tttacaaaag taaaaaaaaa aaaaaaaaa 2099396093DNAHomo sapiens
39gcctgccagc tagccggagc cgcgggtgag cgcggcgagc ggcgaccctg gtgaggagcg
60cggcgcggga ggcacgttcc ttagctccgc cgcggccgtc ctccgcggct cgaggactcc
120gcttccttcc ctcccctccc ctgcgctccg gcctggggtc tcggcgcggg gagcggaggg
180aagggacgaa ggaggagtag gtgaaagcgg ggtgaggggc ggaagggtcc cggcgcgggg
240tgaggcgagg gctgcctctt gttctcccgc cgctgccgcc gtctcctggt cgggtgccgc
300ggccagaggc gcgcggggct gccgaggcac ccgcactatg caggcagact gccggccgcc
360gcgatggcga gccgggcggt ggtgagagcc aggcgctgcc cgcagtgtcc ccaagtccgg
420gccgcggccg ccgcccccgc ctgggccgcg ctccccctct cccgctccct ccctccctgc
480tccaactcct cctccttctc catgcctctg ttcctcctgc tcttacttgt cctgctcctg
540ctgctcgagg acgctggagc ccagcaaggt gatggatgtg gacacactgt actaggccct
600gagagtggaa cccttacatc cataaactac ccacagacct atcccaacag cactgtttgt
660gaatgggaga tccgtgtaaa gatgggagag agagttcgca tcaaatttgg tgactttgac
720attgaagatt ctgattcttg tcactttaat tacttgagaa tttataatgg aattggagtc
780agcagaactg aaataggcaa atactgtggt ctggggttgc aaatgaacca ttcaattgaa
840tcaaaaggca atgaaatcac attgctgttc atgagtggaa tccatgtttc tggacgcgga
900tttttggcct catactctgt tatagataaa caagatctaa ttacttgttt ggacactgca
960tccaattttt tggaacctga gttcagtaag tactgcccag ctggttgtct gcttcctttt
1020gctgagatat ctggaacaat tcctcatgga tatagagatt cctcgccatt gtgcatggct
1080ggtgtgcatg caggagtagt gtcaaacacg ttgggcggcc aaatcagtgt tgtaattagt
1140aaaggtatcc cctattatga aagttctttg gctaacaacg tcacatctgt ggtgggacac
1200ttatctacaa gtctttttac atttaagaca agtggatgtt atggaacact ggggatggag
1260tctggtgtga tcgcggatcc tcaaataaca gcatcatctg tgctggagtg gactgaccac
1320acagggcaag agaacagttg gaaacccaaa aaagccaggc tgaaaaaacc tggaccgcct
1380tgggctgctt ttgccactga tgaataccag tggttacaaa tagatttgaa taaggaaaag
1440aaaataacag gcattataac cactggatcc accatggtgg agcacaatta ctatgtgtct
1500gcctacagaa tcctgtacag tgatgatggg cagaaatgga ctgtgtacag agagcctggt
1560gtggagcaag ataagatatt tcaaggaaac aaagattatc accaggatgt gcgtaataac
1620tttttgccac caattattgc acgttttatt agagtgaatc ctacccaatg gcagcagaaa
1680attgccatga aaatggagct gctcggatgt cagtttattc ctaaaggtcg tcctccaaaa
1740cttactcaac ctccacctcc tcggaacagc aatgacctca aaaacactac agcccctcca
1800aaaatagcca aaggtcgtgc cccaaaattt acgcaaccac tacaacctcg cagtagcaat
1860gaatttcctg cacagacaga acaaacaact gccagtcctg atatcagaaa tactaccgta
1920actccaaatg taaccaaaga tgtagcgctg gctgcagttc ttgtccctgt gctggtcatg
1980gtcctcacta ctctcattct catattagtg tgtgcttggc actggagaaa cagaaagaaa
2040aaaactgaag gcacctatga cttaccttac tgggaccggg caggttggtg gaaaggaatg
2100aagcagtttc ttcctgcaaa agcagtggac catgaggaaa ccccagttcg ctatagcagc
2160agcgaagtta atcacctgag tccaagagaa gtcaccacag tgctgcaggc tgactctgca
2220gagtatgctc agccactggt aggaggaatt gttggtacac ttcatcaaag atctaccttt
2280aaaccagaag aaggaaaaga agcaggctat gcagacctag atccttacaa ctcaccaggg
2340caggaagttt atcatgccta tgctgaacca ctcccaatta cggggcctga gtatgcaacc
2400ccaatcatca tggacatgtc agggcacccc acaacttcag ttggtcagcc ctccacatcc
2460actttcaagg ctacggggaa ccaacctccc ccactagtgg gaacttacaa tacacttctc
2520tccaggactg acagctgctc ctcagcccag gcccagtatg ataccccgaa agctgggaag
2580ccaggtctac ctgccccaga cgaattggtg taccaggtgc cacagagcac acaagaagta
2640tcaggagcag gaagggatgg ggaatgtgat gtttttaaag aaatcctttg aagatgatgc
2700tgctttttac aaagcatcgt tttaaagcac atggcctttt ttttttaatt attagtggta
2760gtaatatata gaatgtatta cataactgtc actgaagtgg ttggggaaaa tgtggtgact
2820gaggtacagg aaactactaa tcttgccatc ttgctttaag gtgttatggt ggcacagtta
2880ctgctcgcct gttaaatttc aaatgtcctg tttgatacta ctgtagaaca ctatttttaa
2940tacagaaaaa gctccctata atgcacttca gagaaattaa aaatcacaga gtatttatta
3000ccaatgctgc aggtacatta atgaactcga gatggctctg taagcctgac tggcaataac
3060gcacggtact gttcttgaaa tacctaatgg cttgaaattc tagtctgttt gtgaaagatg
3120ggtactatca tgatttcctc ttctattcct atattctttt ctggattttt tttaataatt
3180agtgatataa gcattgtttt tattgcagcc atatccactt atccatctta agatctgtag
3240ctgggatttt ctgacttgta atgagcaggg ggattgcttt ttcactttgt gacactcttt
3300agagctttaa tgcttcacag tatatggcct ggtctcatcc ttgcgtgttc cacttgaggc
3360cctttggtgt cttgccccat tcttgtgttt ataaaatgtt tgagtatttc tgatgagtga
3420tgcttgcctt agtctcatga attcagatcc cttcatgtcc tttaagtatg ctcctcaatg
3480tgtaaacagg aacaacttta tgatttgaaa gctttaaagg agattcttct cccaccccca
3540actttatttg caatgggatt tttcctagga gagttatgaa aagttgaagg cttctaaggg
3600aatactgtaa acatgaccca cttatattta tcacagtgaa aggcaaaatt attcactcag
3660aagtaatata aattacctct ttaaaaagta accagaattt gtcctttttg gttttataca
3720ttcacaaaca tatacatttt tcttgagtct caaggtattt tatattttta gtcagaaaaa
3780ataatttttc atttcagttt tccataaact gttacacaaa atataaacct aacgtgtatt
3840tttcaggact gcgtgatcgt gcactttgtg tggtaagagg tttgagtagt cctatatgtc
3900acctagggaa cagacattat agcttactag caaatgaata ttcatgcctt gtttttgata
3960cctcctggca gcttccatgt caccacttgt tcatacctgc ccagagctag ttttagacat
4020ggcaaaatag aaatcatctg taatttatta gctaacaatg taaaaccatc ttttaaagcc
4080ttcagactgt caagacgaca tgagcagctc accatatgat aaaaatacat aaatttgaca
4140ttccctcttc cataaacctt tgtttgtaga tttaatgttg aacagtactt ttccataaag
4200ttctagtcac ttctgttggc ctgagccacc agattatgat gttgccagaa ttcactcaat
4260ttgaataaag atgaacagta tttgttttct tgtttccatg aattatatca gtattctaaa
4320acatcgcttc agaaagagaa ctgtttattt ctgcaggctt cctgtccttt tgtggtatgg
4380ttttttggcc ttattttcac tggcttttcc ttctccaaac tttgaggcgt gatttcattc
4440attgaagaat caatacatat tttgtttcaa aatgtttgaa acaaaagaca tagatggtag
4500acttttatta aaacatatat ggatgtggaa agcacatata ttaatgcagt catccctttt
4560caggtgggaa gagagcaaac cagttgattt tttaattcat ccttagtaca cagagaatat
4620acttttcctc aagtaatata cctgtttgaa gctttaagag agatgttttt ggtaactatt
4680tcattttccc aaagaagttt gctattcttg tgttaattgt gtatacctga ttgttttttc
4740ctggaggttt ttgttgttgt tgtttagttt tgggtttttt tttttttaag aggggcaagt
4800gttttctgaa atgatgcata ttttaagact cgattcatat tgccactgtg ctatccttga
4860actaccaata atttttataa aatatctagt ttttactact tttatataaa ctttactttc
4920cagatgaaga gctgagcctg attcaaatgg tttttctgct ttatacttct ttttagttca
4980ttggttttta tagtagaggt tttctatttt tttttttttt ttttttacta catttatatg
5040tctgatacat atacggcttt ggagacaatc aagtaacaac tgaaaatgtg aaagtaacca
5100tatctgacaa aattcccttg aatttttatc ctttgcttgc aacatttaag actcaaagtc
5160actggtatat tggattaagt tttttcctgt taatgcaatt atagaaatac atcggagaca
5220caacaaatgt ggccattaca ggtttcataa aattacactg acttggctgt tacttgatct
5280taggaaacag cacagtttaa gatattgtga attctgactt atactttatt aaatgctgta
5340aatctaaata gatcctgttg gatgtgatgg gtctagtcca gtttatttaa gttcatgttt
5400cactgtttgc actttgcatt gaacaatggg tttattcgct gatgtaaacg gttcgagtga
5460agaattaatg cagtaagtat gacaacacat acacacttgc ctctccccat ctccagaaga
5520ggggagcaga gtccgagctt atctaaatat gaatgtggcc acaaagctgt ggaaggtgac
5580aaagcttaaa cacctttgcc ctggctctgc attgtcacct agagagcaag aggtctatag
5640aaacatcatg tcacatgaaa cgattctctg ctttttggtt ctgaacttga agtccctaaa
5700ctgcaaaatc taagagttgg gtggttatta aaatgctttt aaagtcaact gtggcaccaa
5760ttctaatgta atccaacttg tgactgtttt tttttgtttt gttttgtttt tgtgtgtgtg
5820tgtgtggcac tgggaaaagt ggaaacaaac atgtattgaa atacatattg gaaataaaaa
5880tggtttgagc gtcagtgata ttctcccaga atgtacttat cttacctcgg catgtactgt
5940agtcactcag tatttgtata tgttgctaga atttagattg taaaatagtg aaattttaat
6000gtgttcattt gtttttaatg tatatatgtc ttgctcagat tatttggttt aaataaaaca
6060accttgaggt ttgtagcttt tccttatact ata
6093402090DNAHomo sapiens 40ggcacatgtg gctggaaatg caagaagagg gaaacgtgtg
gcttggagtt tcaagaagag 60tgactgtctc agtgccgagt gcctcagcag cttctccaca
tgctcttcag tccccaaagt 120tggagaatcc catcaaggag agtagccctg taaggaattc
gaatttccag catttttcac 180ctctgacaga gcccagacac catgaacgca agtgaattcc
gaaggagagg gaaggagatg 240gtggattacg tggccaacta catggaaggc attgagggac
gccaggtcta ccctgacgtg 300gagcccgggt acctgcggcc gctgatccct gccgctgccc
ctcaggagcc agacacgttt 360gaggacatca tcaacgacgt tgagaagata atcatgcctg
gggtgacgca ctggcacagc 420ccctacttct tcgcctactt ccccactgcc agctcgtacc
cggccatgct tgcggacatg 480ctgtgcgggg ccattggctg catcggcttc tcctgggcgg
caagcccagc atgcacagag 540ctggagactg tgatgatgga ctggctcggg aagatgctgg
aactaccaaa ggcatttttg 600aatgagaaag ctggagaagg gggaggagtg atccagggaa
gtgccagtga agccaccctg 660gtggccctgc tggccgctcg gaccaaagtg atccatcggc
tgcaggcagc gtccccagag 720ctcacacagg ccgctatcat ggagaagctg gtggcttact
catccgatca ggcacactcc 780tcagtggaaa gagctgggtt aattggtgga gtgaaattaa
aagccatccc ctcagatggc 840aacttcgcca tgcgtgcgtc tgccctgcag gaagccctgg
agagagacaa agcggctggc 900ctgattcctt tctttatggt tgccaccctg gggaccacaa
catgctgctc ctttgacaat 960ctcttagaag tcggtcctat ctgcaacaag gaagacatat
ggctgcacgt tgatgcagcc 1020tacgcaggca gtgcattcat ctgccctgag ttccggcacc
ttctgaatgg agtggagttt 1080gcagattcat tcaactttaa tccccacaaa tggctattgg
tgaattttga ctgttctgcc 1140atgtgggtga aaaagagaac agacttaacg ggagccttta
gactggaccc cacttacctg 1200aagcacagcc atcaggattc agggcttatc actgactacc
ggcattggca gataccactg 1260ggcagaagat ttcgctcttt gaaaatgtgg tttgtattta
ggatgtatgg agtcaaagga 1320ctgcaggctt atatccgcaa gcatgtccag ctgtcccatg
agtttgagtc actggtgcgc 1380caggatcccc gctttgaaat ctgtgtggaa gtcattctgg
ggcttgtctg ctttcggcta 1440aagggttcca acaaagtgaa tgaagctctt ctgcaaagaa
taaacagtgc caaaaaaatc 1500cacttggttc catgtcacct cagggacaag tttgtcctgc
gctttgccat ctgttctcgc 1560acggtggaat ctgcccatgt gcagcgggcc tgggaacaca
tcaaagagct ggcggccgac 1620gtgctgcgag cagagaggga gtaggagtga agccagctgc
aggaatcaaa aattgaagag 1680agatatatct gaaaactgga ataagaagca aataaatatc
atcctgcctt catggaactc 1740agctgtctgt ggcttcccat gtctttctcc aaagttatcc
agagggttgt gattttgtct 1800gcttagtatc tcatcaacaa agaaatatta tttgctaatt
aaaaagttaa tcttcatggc 1860catagctttt attcattagc tgtgattttt gttgattaaa
acattataga ttttcatgtt 1920cttgcagtca tcagaagtgg taggaaagcc tcactgatat
attttccagg gcaatcaatg 1980ttcacgcaac ttgaaattat atctgtggtc ttcaaattgt
cttttgtcat gtggctaaat 2040gcctaataaa caattcaagt gaaatactaa aaaaaaaaaa
aaaaaaaaaa 2090412268DNAHomo sapiens 41gtctcccctc gccgcatcca
ctctccggcc ggccgcctgc ccgccgcctc ctccgtgcgc 60ccgccagcct cgcccgcgcc
gtcaccatga gccaggccta ctcgtccagc cagcgcgtgt 120cctcctaccg ccgcaccttc
ggcggggccc cgggcttccc actcggctcc ccgctgagtt 180cgcccgtgtt cccgcgggcg
ggtttcggct ctaagggctc ctccagctcg gtgacgtccc 240gcgtgtacca ggtgtcgcgc
acgtcgggcg gggccggggg cctggggtcg ctgcgggcca 300gccggctggg gaccacccgc
acgccctcct cctacggcgc aggcgagctg ctggacttct 360cactggccga cgcggtgaac
caggagtttc tgaccacgcg caccaacgag aaggtggagc 420tgcaggagct caatgaccgc
ttcgccaact acatcgagaa ggtgcgcttc ctggagcagc 480agaacgcggc gctcgccgcc
gaagtgaacc ggctcaaggg ccgcgagccg acgcgagtgg 540ccgagctcta cgaggaggag
ctgcgggagc tgcggcgcca ggtggaggtg ctcactaacc 600agcgcgcgcg cgtcgacgtc
gagcgcgaca acctgctcga cgacctgcag cggctcaagg 660ccaagctgca ggaggagatt
cagttgaagg aagaagcaga gaacaatttg gctgccttcc 720gagcggacgt ggatgcagct
actctagctc gcattgacct ggagcgcaga attgaatctc 780tcaacgagga gatcgcgttc
cttaagaaag tgcatgaaga ggagatccgt gagttgcagg 840ctcagcttca ggaacagcag
gtccaggtgg agatggacat gtctaagcca gacctcactg 900ccgccctcag ggacatccgg
gctcagtatg agaccatcgc ggctaagaac atttctgaag 960ctgaggagtg gtacaagtcg
aaggtgtcag acctgaccca ggcagccaac aagaacaacg 1020acgccctgcg ccaggccaag
caggagatga tggaataccg acaccagatc cagtcctaca 1080cctgcgagat tgacgccctg
aagggcacta acgattccct gatgaggcag atgcgggaat 1140tggaggaccg atttgccagt
gaggccagtg gctaccagga caacattgcg cgcctggagg 1200aggaaatccg gcacctcaag
gatgagatgg cccgccatct gcgcgagtac caggacctgc 1260tcaacgtgaa gatggccctg
gatgtggaga ttgccaccta ccggaagctg ctggagggag 1320aggagagccg gatcaatctc
cccatccaga cctactctgc cctcaacttc cgagaaacca 1380gccctgagca aaggggttct
gaggtccata ccaagaagac ggtgatgatc aagaccatcg 1440agacacggga tggggaggtc
gtcagtgagg ccacacagca gcagcatgaa gtgctctaaa 1500gacagagacc ctctgccacc
agagaccgtc ctcacccctg tcctcactgc tccctgaagc 1560cagccttctt ccatcccagg
acaccacacc cagcctcagt cctcccctca cagcctctga 1620cccctcctca ctggccatcc
ctcgtggtcc ccaacagcga catagcccat ccctgcctgg 1680tcacagggca tgccccggcc
acctctgcgg accccagctg tgagccttgg ctgttggcag 1740tgagtgagcc tggctcttgt
gctggatgga gcccaggcgg gagcggtggc cctgtccctc 1800ccacctctgt gacctcaggc
actagccttt ggctctggag acagccccag agcagggtgt 1860tgggatactg cagggccagg
actgagcccc gcagacctcc ccagccccta gcccaggaga 1920gagaaagcca ggcaggtagc
cagggggact agcccctgtg gagactgggg ggcttgaaat 1980tgtccccgtg gtctcttact
ttcctttccc cagcccaggg tggacttaga aagcaggggc 2040tacaagaggg aatccccgaa
ggtgctggag gtgggagcag gagattgaga aggagagaaa 2100gtgggtgaga tgctggagaa
gagaggagag gagagaggca gagagcggtc tcaggctggt 2160gggaggggcg cccacctccc
cacgccctcc cctcccctgc tgcaggggct ctggagagaa 2220acaataaaga gattcacaca
caagccaaaa aaaaaaaaaa aaaaaaaa 2268421993DNAHomo sapiens
42aggaagaaaa tataaagtac acttttaaaa catgaagtct tcatagcagc ttatagtcgt
60tcagagaaac atgttccact gagaatgact tgagagagag gattacatca ttatgccaga
120aggaagaagc cactgtgcat gctctatcac cagcctcacc ctcctggtca gccttacaag
180agtgacactg gatatactcc agaagttgga cccaccacag cctgcacact ggacttcttg
240gcttttatga gctattcaag agatatttag tcatcacgtt gtgtcacaat gggagtgact
300cacagagcaa ggagagaacc tgaggattcc tcacacatgt agtactcaga gctctacgga
360aacccaggca cctcgacctc aagaggatca gcctggccag ggtggcacaa ctcttccttc
420cccgtgcaca gcaggaaagc tgccatcagc tgagcaagtc caccaacagt ttctgtgtcc
480cacttcatct ttaataagga caccatcttc ttgtattata caagaaagga gtgtacctat
540cacacacagg gggaaaaatg ctcttttggg tgctaggcct cctaatcctc tgtggttttc
600tgtggactcg taaaggaaaa ctaaagattg aagacatcac tgataagtac atttttatca
660ctggatgtga ctcgggcttt ggaaacttgg cagccagaac ttttgataaa aagggatttc
720atgtaatcgc tgcctgtctg actgaatcag gatcaacagc tttaaaggca gaaacctcag
780agagacttcg tactgtgctt ctggatgtga ccgacccaga gaatgtcaag aggactgccc
840agtgggtgaa gaaccaagtt ggggagaaag gtctctgggg tctgatcaat aatgctggtg
900ttcccggcgt gctggctccc actgactggc tgacactaga ggactacaga gaacctattg
960aagtgaacct gtttggactc atcagtgtga cactaaatat gcttcctttg gtcaagaaag
1020ctcaagggag agttattaat gtctccagtg ttggaggtcg ccttgcaatc gttggagggg
1080gctatactcc atccaaatat gcagtggaag gtttcaatga cagcttaaga cgggacatga
1140aagcttttgg tgtgcacgtc tcatgcattg aaccaggatt gttcaaaaca aacttggcag
1200atccagtaaa ggtaattgaa aaaaaactcg ccatttggga gcagctgtct ccagacatca
1260aacaacaata tggagaaggt tacattgaaa aaagtctaga caaactgaaa ggcaataaat
1320cctatgtgaa catggacctc tctccggtgg tagagtgcat ggaccacgct ctaacaagtc
1380tcttccctaa gactcattat gccgctggaa aagatgccaa aattttctgg atacctctgt
1440ctcacatgcc agcagctttg caagactttt tattgttgaa acagaaagca gagctggcta
1500atcccaaggc agtgtgactc agctaaccac aaatgtctcc tccaggctat gaaattggcc
1560gatttcaaga acacatctcc ttttcaaccc cattccttat ctgctccaac ctggactcat
1620ttagatcgtg cttatttgga ttgcaaaagg gagtcccacc atcgctggtg gtatcccagg
1680gtccctgctc aagttttctt tgaaaaggag ggctggaatg gtacatcaca taggcaagtc
1740ctgccctgta tttaggcttt gcctgcttgg tgtgatgtaa gggaaattga aagacttgcc
1800cattcaaaat gatctttacc gtggcctgcc ccatgcttat ggtccccagc atttacagta
1860acttgtgaat gttaagtatc atctcttatc taaatattaa aagataagtc aaacattaaa
1920aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
1980aaaaaaaaaa aaa
199343838DNAHomo sapiens 43gggtcacagc accctcctga aaactgcagc ttccttctca
ccttgaagaa taatcctaga 60aaactcacaa aatgtgtgat gcttttgtag gtacctggaa
acttgtctcc agtgaaaact 120ttgatgatta tatgaaagaa gtaggagtgg gctttgccac
caggaaagtg gctggcatgg 180ccaaacctaa catgatcatc agtgtgaatg gggatgtgat
caccattaaa tctgaaagta 240cctttaaaaa tactgagatt tccttcatac tgggccagga
atttgacgaa gtcactgcag 300atgacaggaa agtcaagagc accataacct tagatggggg
tgtcctggta catgtgcaga 360aatgggatgg aaaatcaacc accataaaga gaaaacgaga
ggatgataaa ctggtggtgg 420aatgcgtcat gaaaggcgtc acttccacga gagtttatga
gagagcataa gccaagggac 480gttgacctgg actgaagttc gcattgaact ctacaacatt
ctgtgggata tattgttcaa 540aaagatattg ttgttttcca tgatttagca agcaactaat
tttctcccaa gctgatttta 600ttcaatatgg ttacgttggt taaataaact ttttttagat
ttagaaggtg atgtaatgat 660gtattcattg tgcttatgat gtattcttag tcataactga
gtgaaggaaa tgggaaattt 720gcattatttc tttgttctga tatgaataat aacatatttc
ataataattc aaggtaaaaa 780gggatatcta tggatttccc taggtaggag ataacaagta
tgtaccatta ctgaatat 838441322DNAHomo sapiens 44tcctcaaagg aggggcagag
cctgcgcagg gcaggagcag ctggcccact ggcggcccgc 60aacactccgt ctcaccctct
gggcccactg catctagagg agggccgtct gtgaggccac 120tacccctcca gcaactggga
ggtgggactg tcagaagctg gcccagggtg gtggtcagct 180gggtcaggga cctacggcac
ctgctggacc acctcgcctt ctccatcgaa gcagggaagt 240gggagcctcg agccctcggg
tggaagctga ccccaagcca cccttcacct ggacaggatg 300agagtgtcag gtgtgcttcg
cctcctggcc ctcatctttg ccatagtcac gacatggatg 360tttattcgaa gctacatgag
cttcagcatg aaaaccatcc gtctgccacg ctggctggca 420gcctcgccca ccaaggagat
ccaggttaaa aagtacaagt gtggcctcat caagccctgc 480ccagccaact actttgcgtt
taaaatctgc agtggggccg ccaacgtcgt gggccctact 540atgtgctttg aagaccgcat
gatcatgagt cctgtgaaaa acaatgtggg cagaggccta 600aacatcgccc tggtgaatgg
aaccacggga gctgtgctgg gacagaaggc atttgacatg 660tactctggag atgttatgca
cctagtgaaa ttccttaaag aaattccggg gggtgcactg 720gtgctggtgg cctcctacga
cgatccaggg accaaaatga acgatgaaag caggaaactc 780ttctctgact tggggagttc
ctacgcaaaa caactgggct tccgggacag ctgggtcttc 840ataggagcca aagacctcag
gggtaaaagc ccctttgagc agttcttaaa gaacagccca 900gacacaaaca aatacgaggg
atggccagag ctgctggaga tggagggctg catgcccccg 960aagccatttt agggtggctg
tggctcttcc tcagccaggg gcctgaagaa gctcctgcct 1020gacttaggag tcagagcccg
gcaggggctg aggaggagga gcagggggtg ctgcgtggaa 1080ggtgctgcag gtccttgcac
gctgtgtcgc gcctctcctc ctcggaaaca gaaccctccc 1140acagcacatc ctacccggaa
gaccagcctc agagggtcct tctggaacca gctgtctgtg 1200gagagaatgg ggtgctttcg
tcagggactg ctgacggctg gtcctgagga aggacaaact 1260gcccagactt gagcccaatt
aaattttatt tttgctggtt ttgaatgaaa aaaaaaaaaa 1320aa
1322454596DNAHomo sapiens
45gatgacttgg agaacagtca cttcctcttt ctgggctaca gttttctcat cagtaactga
60agagcttgca gtaccttcaa cattccttca ggtgaggact tctctttgat cactgctatg
120gtttgaatgt gtctcctaaa gttcatgtgt tggaagcttg atccccagtg caaaagtgtt
180gggaggtggg gcctaatgag aagtaattag gccatgagta ttctgccctc atgtattcat
240gagattaatg tcattatcat gggagcgggt ttgttaaaat gagttcggcc ccttccttct
300ctctctttct tctgccatac agtatgggat gcacagtgca gaaggtctta ccagatactg
360gcaccatgct tttggacttc tcagccccca gaaccatgag ccaaataaat ttctgttcat
420tataaatcac ccagtctgtg gcatcctgtt agagcagcat aaatggacta agataatccc
480tataaagagt ggcaacagaa cagttcccag ctcactggca gaatcctatg atcaactagt
540aatgtctgcc agggaaggag gatgagtggc actaacatgt acggtgtgtt tgctcactgt
600tctacaggat tcaactagaa tctctgggtc tgtgtgaaga ccaagagctg ggaacagaag
660cagggctcta gagggaaaag tttctttcgg attccttttt tttgttttgt tttaaagagc
720tgctctgtga gaagacaagg agaaaatggt ccctacagga tactcacctc tgtctcaggg
780agacactcaa gcatttattc aaccaagaaa gagaattagt tccaggtgaa aggagaaccc
840cagaataccc acctactttt aaaattctcc cctatgcata ttcaggaacc aatcagagga
900tctgtaatgc gctgtgagta gaaagggaag gggaatggga aagaaaaaag tgaagcttgg
960aatggtggaa agtacatggg ctctgccatt taccagctaa gtgatcttgg gcaagtaact
1020tgacctttct gagcctcggt ttcctctttg gtgaaatgag gactaataat ccatttctcc
1080ctcagtacag acagccagca tgcagtgagc actcagcgat ggccaagtat gggagaagcc
1140atgctggaat gaacatgtgg gaccattttg tgcagtttct cagcgccaac actgactggt
1200cccctgggct cgtgggccgc cggcagcctc ggctcgttct ccagacagtg ttccaagaag
1260ccacttccag cgaggaagcg ttggcctgag aactggaacc tctgcggtct ctgcaaacac
1320gacaatgaca aacacttgag agggcatggg agaaaggagc tccttcatag ggcagggagg
1380ggtgggcact tgggtgtgac caaggagagg aggcgcgcct ggtcaacagc tctccctggc
1440ccgtgtccag ctccctcctc acacagagag gggggcgcat ctcagggatg gcatctttcc
1500cccccacagg gaaattctta tctttgaaac agcatgggaa tcgaggcacc caggagggga
1560gcagaggcag gcaggcctcc ttcaggccca tcctccagct gggctggtgg tgccagggag
1620gctccctgct tggtaacaaa ggcctgaggg agagttgcga aacccagcag gaaagccggc
1680tcaccttcgc ctccccctgc ggctgggagg agaggaaata tcccatggct gactgtgcca
1740aggaggtgtc tgagccagcc ctcccggccc gagggcaggg caggtggccc tgagagataa
1800gccaatcccg cagctgcaga tgaggagttc tgagaagcat tgctcaggac agcggtaaat
1860cacttcttgg aggtgccctg cacgccggtc ctgggagcag gcggcctccc gggggtgcgg
1920gagccccact cctccgtggt gtgttccatt tgcttcccac atctggagga gctgacgtgc
1980cagcctcccc cagcaccacc cagggacggg aggcatgagc cggtcaaggc acctgggcaa
2040aatccggaag cgtctggaag atgtcaagag ccagtgggtc cggccagcca gggctgactt
2100tagtgacaac gagagtgccc ggctggccac ggacgccctc ttggatgggg gttctgaagc
2160ctactggcgg gtgctcagcc aggaaggcga ggtggacttc ttgtcctcgg tggaggccca
2220gtacatccag gcccaggcca gggagccccc gtgtccccca gacaccctgg gaggggcgga
2280agcaggccct aagggactgg actccagctc cctacagtcc ggcacctact tccctgtggc
2340ctcagagggc agcgagccgg ccctactgca cagctgggcc tcagctgaga agccctacct
2400gaaggaaaaa tccagcgcca ctgtgtactt ccagaccgtc aagcacaaca acatcagaga
2460cctcgtccgc cgctgcatca cccggactag ccaggtcctg gtcatcctga tggatgtgtt
2520cacggatgtg gagatcttct gtgacattct agaggcagcc aacaagcgtg gggtgttcgt
2580ttgtgtgctc ctggaccagg gaggtgtgaa gctcttccag gagatgtgtg acaaagtcca
2640gatctctgac agtcacctca agaacatttc catccggagt gtggaaggag agatatactg
2700tgccaagtca ggcaggaaat tcgctggcca aatccgggag aagttcatca tctcggactg
2760gagatttgtc ctgtctggat cttacagctt cacctggctc tgcggacacg tgcaccggaa
2820catcctctcc aagttcacag gccaggcggt ggagctgttt gacgaggagt tccgccacct
2880ctacgcctcc tccaagcctg tgatgggcct gaagtccccg cggctggtcg cccccgtccc
2940gcccggagca gccccggcca atggccgcct tagcagcagc agtggctccg ccagtgaccg
3000cacgtcctcc aaccccttca gcggccgctc ggcaggcagc caccccggta cccgaagtgt
3060gtccgcgtct tcagggccct gtagccccgc ggccccacac ccgcctccac cgccccggtt
3120ccagccccac caaggccctt ggggagcccc gagtccccag gcccacctct ccccgcggcc
3180ccacgacggc ccgcccgccg ctgtctacag caacctgggg gcctacaggc ccacgcggct
3240gcagctggag cagctgggcc tggtgccgag gctgactcca acctggaggc ccttcctgca
3300ggcctcccct cacttctgaa ggtcccatcc cctgctgccc tccgcaggcc cagggctggg
3360cactccctga gacccaaaga cccacctcaa cgacgagtgg cgttgagcca cttccctttg
3420aaaagacact caaaatcact gccatggttc aatgttccca ggccccaggc catccacttg
3480ccggccccca ccagttcttg ggttccccgc tctagtttga cctgtgcagc acattccaga
3540aggttccagg gaggttgtgg ggcagctaga ggacaaaatc atgaaaacag agtccctgtc
3600ttccagagat catccggggc tttaatatta atggccccca aaactccgta agaagcagga
3660aatgcagccc aagttttaca aatgggtaaa cagaggcact gagagataga tggtagtttg
3720gtacttctgg ttcccagtgc ccaggaatgg tccactccca agaaattcag gaaagaaaga
3780ctgaggagaa ggtgtgggaa cattctggat gtttcgggag agttggggaa actcctcctc
3840ttaggaaagg ctaatactag ggtatccttg ggcccaatga attaggggtg aggccccaga
3900acccgttatc tatgagttgt atgggggagc catctgaagc tgtagccacc agggatgcag
3960ctagctgagg agtttggggt gttgggttgg acaaggcagg ttagtagact cagattcttg
4020cttcaaagag ccttgggctg gcctggaggt ccctggagtc tagactggac ctaggagctt
4080gagttgtcag gggccaggac tggccccact gcagtgccca ggccagtctt gagcagcagg
4140gagggctcag ctgtccccag atccaggtgc ctctgaccag cctggtcacc tcctgaggaa
4200taaatgctga acctcacaag ccccatcatt catttcttct caattcacag tgcccctctt
4260tgtttctggg gtggaactag gtcctgaggg cacagcctag ctgagtgcaa agaaatatag
4320gatgcttaga aagcatacag gaggggccag gcgtggtggc tcatgcctgt aatcccagaa
4380ctttgggatg ccaaggtggt tggattacct gagatcaggt ggattacctg gtctcgagac
4440cagcctgacc aatatggtga aaccccgtct ctactaaaaa tacaaaaatt aggctgagac
4500aggagaattg cttgaaccca ggaagcagag gttgcaatga gctgagattg catcactgca
4560ctccagcatg ggcaacaaag caagactccg tcacag
4596462815DNAHomo sapiens 46aagaacgccc ccaaaatctg tttctaattt tacagaaatc
ttttgaaact tggcacggta 60ttcaaaagtc cgtggaaaga aaaaaacctt gtcctggctt
cagcttccaa ctacaaagac 120agacttggtc cttttcaacg gttttcacag atccagtgac
ccacgctctg aagacagaat 180tagctaactt tcaaaaacat ctggaaaaat gaagacttgg
gtaaaaatcg tatttggagt 240tgccacctct gctgtgcttg ccttattggt gatgtgcatt
gtcttacgcc cttcaagagt 300tcataactct gaagaaaata caatgagagc actcacactg
aaggatattt taaatggaac 360attttcttat aaaacatttt ttccaaactg gatttcagga
caagaatatc ttcatcaatc 420tgcagataac aatatagtac tttataatat tgaaacagga
caatcatata ccattttgag 480taatagaacc atgaaaagtg tgaatgcttc aaattacggc
ttatcacctg atcggcaatt 540tgtatatcta gaaagtgatt attcaaagct ttggagatac
tcttacacag caacatatta 600catctatgac cttagcaatg gagaatttgt aagaggaaat
gagcttcctc gtccaattca 660gtatttatgc tggtcgcctg ttgggagtaa attagcatat
gtctatcaaa acaatatcta 720tttgaaacaa agaccaggag atccaccttt tcaaataaca
tttaatggaa gagaaaataa 780aatatttaat ggaatcccag actgggttta tgaagaggaa
atgcttgcta caaaatatgc 840tctctggtgg tctcctaatg gaaaattttt ggcatatgcg
gaatttaatg atacggatat 900accagttatt gcctattcct attatggcga tgaacaatat
cctagaacaa taaatattcc 960atacccaaag gctggagcta agaatcccgt tgttcggata
tttattatcg ataccactta 1020ccctgcgtat gtaggtcccc aggaagtgcc tgttccagca
atgatagcct caagtgatta 1080ttatttcagt tggctcacgt gggttactga tgaacgagta
tgtttgcagt ggctaaaaag 1140agtccagaat gtttcggtcc tgtctatatg tgacttcagg
gaagactggc agacatggga 1200ttgtccaaag acccaggagc atatagaaga aagcagaact
ggatgggctg gtggattctt 1260tgtttcaaca ccagttttca gctatgatgc catttcgtac
tacaaaatat ttagtgacaa 1320ggatggctac aaacatattc actatatcaa agacactgtg
gaaaatgcta ttcaaattac 1380aagtggcaag tgggaggcca taaatatatt cagagtaaca
caggattcac tgttttattc 1440tagcaatgaa tttgaagaat accctggaag aagaaacatc
tacagaatta gcattggaag 1500ctatcctcca agcaagaagt gtgttacttg ccatctaagg
aaagaaaggt gccaatatta 1560cacagcaagt ttcagcgact acgccaagta ctatgcactt
gtctgctacg gcccaggcat 1620ccccatttcc acccttcatg atggacgcac tgatcaagaa
attaaaatcc tggaagaaaa 1680caaggaattg gaaaatgctt tgaaaaatat ccagctgcct
aaagaggaaa ttaagaaact 1740tgaagtagat gaaattactt tatggtacaa gatgattctt
cctcctcaat ttgacagatc 1800aaagaagtat cccttgctaa ttcaagtgta tggtggtccc
tgcagtcaga gtgtaaggtc 1860tgtatttgct gttaattgga tatcttatct tgcaagtaag
gaagggatgg tcattgcctt 1920ggtggatggt cgaggaacag ctttccaagg tgacaaactc
ctctatgcag tgtatcgaaa 1980gctgggtgtt tatgaagttg aagaccagat tacagctgtc
agaaaattca tagaaatggg 2040tttcattgat gaaaaaagaa tagccatatg gggctggtcc
tatggaggat acgtttcatc 2100actggccctt gcatctggaa ctggtctttt caaatgtggt
atagcagtgg ctccagtctc 2160cagctgggaa tattacgcgt ctgtctacac agagagattc
atgggtctcc caacaaagga 2220tgataatctt gagcactata agaattcaac tgtgatggca
agagcagaat atttcagaaa 2280tgtagactat cttctcatcc acggaacagc agatgataat
gtgcactttc aaaactcagc 2340acagattgct aaagctctgg ttaatgcaca agtggatttc
caggcaatgt ggtactctga 2400ccagaaccac ggcttatccg gcctgtccac gaaccactta
tacacccaca tgacccactt 2460cctaaagcag tgtttctctt tgtcagacta aaaacgatgc
agatgcaagc ctgtatcaga 2520atctgaaaac cttatataaa cccctcagac agtttgctta
ttttattttt tatgttgtaa 2580aatgctagta taaacaaaca aattaatgtt gttctaaagg
ctgttaaaaa aaagatgagg 2640actcagaagt tcaagctaaa tattgtttac attttctggt
actctgtgaa agaagagaaa 2700agggagtcat gcattttgct ttggacacag tgttttatca
cctgttcatt tgaagaaaaa 2760taataaagtc agaagttcaa gtgctaaaaa aaaaaaaaaa
aaaaaaaaaa aaaaa 2815471369DNAHomo sapiens 47gaatcattgc actccctact
agagcggatg tgatgaggga aaaggagaac tcagcacttt 60ccctgcagga accggctccc
tcggaggggc gtggctggga ggagctgtga gtaacgtgcc 120acagtgttgt aaaaacccag
tgagtgttat aaaaacccag tcagcctggc tcctgttgaa 180tagtctaccc cccttgcact
ctacctgaca cagctgcagc ctgcaattca ctcgcactgc 240ctgggattgc actggatccg
tgtgctcaga acaaggtgaa cgcccagctg cagccatgaa 300gatctgtagc ctcaccctgc
tctccttcct cctactggct gctcaggtgc tcctggtgga 360ggggaaaaaa aaagtgaaga
atggacttca cagcaaagtg gtctcagaac aaaaggacac 420tctgggcaac acccagatta
agcagaaaag caggcccggg aacaaaggca agtttgtcac 480caaagaccaa gccaactgca
gatgggctgc tactgagcag gaggagggca tctctctcaa 540ggttgagtgc actcaattgg
accatgaatt ttcctgtgtc tttgctggca atccaacctc 600atgcctaaag ctcaaggatg
agagagtcta ttggaaacaa gttgcccgga atctgcgctc 660acagaaagac atctgtagat
attccaagac agctgtgaaa accagagtgt gcagaaagga 720ttttccagaa tccagtctta
agctagtcag ctccactcta tttgggaaca caaagcccag 780gaaggagaaa acagagatgt
cccccaggga gcacatcaaa ggcaaagaga ccaccccctc 840tagcctagca gtgacccaga
ccatggccac caaagctccc gagtgtgtgg aggacccaga 900tatggcaaac cagaggaaga
ctgccctgga gttctgtgga gagacttgga gctctctctg 960cacattcttc ctcagcatag
tgcaggacac gtcatgctaa tgaggtcaaa agagaacggg 1020ttcccttaag agatgtcatg
tcgtaagtcc ctctgtatac tttaaagctc tctacagtcc 1080ccccaaaata tgaacttttg
tgcttagtga gtgcaacgaa atatttaaac aagttttgta 1140ttttttgctt ttgtgttttg
gaatttgcct tatttttctt ggatgcgatg ttcagaggct 1200gtttcctgca gcatgtattt
ccatggccca cacagctatg tgtttgagca gcgaagagtc 1260tttgagctga atgagccaga
gtgataattt cagtgcaacg aactttctgc tgaattaatg 1320gtaataaaac tctgggtgtt
tttcagaaat acattcaaaa aaaaaaaaa 1369488815DNAHomo sapiens
48gcccgcgccg gctgtgctgc acagggggag gagagggaac cccaggcgcg agcgggaaga
60ggggacctgc agccacaact tctctggtcc tctgcatccc ttctgtccct ccacccgtcc
120ccttccccac cctctggccc ccaccttctt ggaggcgaca acccccggga ggcattagaa
180gggatttttc ccgcaggttg cgaagggaag caaacttggt ggcaacttgc ctcccggtgc
240gggcgtctct cccccaccgt ctcaacatgc ttaggggtcc ggggcccggg ctgctgctgc
300tggccgtcca gtgcctgggg acagcggtgc cctccacggg agcctcgaag agcaagaggc
360aggctcagca aatggttcag ccccagtccc cggtggctgt cagtcaaagc aagcccggtt
420gttatgacaa tggaaaacac tatcagataa atcaacagtg ggagcggacc tacctaggca
480atgcgttggt ttgtacttgt tatggaggaa gccgaggttt taactgcgag agtaaacctg
540aagctgaaga gacttgcttt gacaagtaca ctgggaacac ttaccgagtg ggtgacactt
600atgagcgtcc taaagactcc atgatctggg actgtacctg catcggggct gggcgaggga
660gaataagctg taccatcgca aaccgctgcc atgaaggggg tcagtcctac aagattggtg
720acacctggag gagaccacat gagactggtg gttacatgtt agagtgtgtg tgtcttggta
780atggaaaagg agaatggacc tgcaagccca tagctgagaa gtgttttgat catgctgctg
840ggacttccta tgtggtcgga gaaacgtggg agaagcccta ccaaggctgg atgatggtag
900attgtacttg cctgggagaa ggcagcggac gcatcacttg cacttctaga aatagatgca
960acgatcagga cacaaggaca tcctatagaa ttggagacac ctggagcaag aaggataatc
1020gaggaaacct gctccagtgc atctgcacag gcaacggccg aggagagtgg aagtgtgaga
1080ggcacacctc tgtgcagacc acatcgagcg gatctggccc cttcaccgat gttcgtgcag
1140ctgtttacca accgcagcct cacccccagc ctcctcccta tggccactgt gtcacagaca
1200gtggtgtggt ctactctgtg gggatgcagt ggctgaagac acaaggaaat aagcaaatgc
1260tttgcacgtg cctgggcaac ggagtcagct gccaagagac agctgtaacc cagacttacg
1320gtggcaactc aaatggagag ccatgtgtct taccattcac ctacaatggc aggacgttct
1380actcctgcac cacagaaggg cgacaggacg gacatctttg gtgcagcaca acttcgaatt
1440atgagcagga ccagaaatac tctttctgca cagaccacac tgttttggtt cagactcgag
1500gaggaaattc caatggtgcc ttgtgccact tccccttcct atacaacaac cacaattaca
1560ctgattgcac ttctgagggc agaagagaca acatgaagtg gtgtgggacc acacagaact
1620atgatgccga ccagaagttt gggttctgcc ccatggctgc ccacgaggaa atctgcacaa
1680ccaatgaagg ggtcatgtac cgcattggag atcagtggga taagcagcat gacatgggtc
1740acatgatgag gtgcacgtgt gttgggaatg gtcgtgggga atggacatgc attgcctact
1800cgcagcttcg agatcagtgc attgttgatg acatcactta caatgtgaac gacacattcc
1860acaagcgtca tgaagagggg cacatgctga actgtacatg cttcggtcag ggtcggggca
1920ggtggaagtg tgatcccgtc gaccaatgcc aggattcaga gactgggacg ttttatcaaa
1980ttggagattc atgggagaag tatgtgcatg gtgtcagata ccagtgctac tgctatggcc
2040gtggcattgg ggagtggcat tgccaacctt tacagaccta tccaagctca agtggtcctg
2100tcgaagtatt tatcactgag actccgagtc agcccaactc ccaccccatc cagtggaatg
2160caccacagcc atctcacatt tccaagtaca ttctcaggtg gagacctaaa aattctgtag
2220gccgttggaa ggaagctacc ataccaggcc acttaaactc ctacaccatc aaaggcctga
2280agcctggtgt ggtatacgag ggccagctca tcagcatcca gcagtacggc caccaagaag
2340tgactcgctt tgacttcacc accaccagca ccagcacacc tgtgaccagc aacaccgtga
2400caggagagac gactcccttt tctcctcttg tggccacttc tgaatctgtg accgaaatca
2460cagccagtag ctttgtggtc tcctgggtct cagcttccga caccgtgtcg ggattccggg
2520tggaatatga gctgagtgag gagggagatg agccacagta cctggatctt ccaagcacag
2580ccacttctgt gaacatccct gacctgcttc ctggccgaaa atacattgta aatgtctatc
2640agatatctga ggatggggag cagagtttga tcctgtctac ttcacaaaca acagcgcctg
2700atgcccctcc tgacccgact gtggaccaag ttgatgacac ctcaattgtt gttcgctgga
2760gcagacccca ggctcccatc acagggtaca gaatagtcta ttcgccatca gtagaaggta
2820gcagcacaga actcaacctt cctgaaactg caaactccgt caccctcagt gacttgcaac
2880ctggtgttca gtataacatc actatctatg ctgtggaaga aaatcaagaa agtacacctg
2940ttgtcattca acaagaaacc actggcaccc cacgctcaga tacagtgccc tctcccaggg
3000acctgcagtt tgtggaagtg acagacgtga aggtcaccat catgtggaca ccgcctgaga
3060gtgcagtgac cggctaccgt gtggatgtga tccccgtcaa cctgcctggc gagcacgggc
3120agaggctgcc catcagcagg aacacctttg cagaagtcac cgggctgtcc cctggggtca
3180cctattactt caaagtcttt gcagtgagcc atgggaggga gagcaagcct ctgactgctc
3240aacagacaac caaactggat gctcccacta acctccagtt tgtcaatgaa actgattcta
3300ctgtcctggt gagatggact ccacctcggg cccagataac aggataccga ctgaccgtgg
3360gccttacccg aagaggacag cccaggcagt acaatgtggg tccctctgtc tccaagtacc
3420cactgaggaa tctgcagcct gcatctgagt acaccgtatc cctcgtggcc ataaagggca
3480accaagagag ccccaaagcc actggagtct ttaccacact gcagcctggg agctctattc
3540caccttacaa caccgaggtg actgagacca ccattgtgat cacatggacg cctgctccaa
3600gaattggttt taagctgggt gtacgaccaa gccagggagg agaggcacca cgagaagtga
3660cttcagactc aggaagcatc gttgtgtccg gcttgactcc aggagtagaa tacgtctaca
3720ccatccaagt cctgagagat ggacaggaaa gagatgcgcc aattgtaaac aaagtggtga
3780caccattgtc tccaccaaca aacttgcatc tggaggcaaa ccctgacact ggagtgctca
3840cagtctcctg ggagaggagc accaccccag acattactgg ttatagaatt accacaaccc
3900ctacaaacgg ccagcaggga aattctttgg aagaagtggt ccatgctgat cagagctcct
3960gcacttttga taacctgagt cccggcctgg agtacaatgt cagtgtttac actgtcaagg
4020atgacaagga aagtgtccct atctctgata ccatcatccc agaggtgccc caactcactg
4080acctaagctt tgttgatata accgattcaa gcatcggcct gaggtggacc ccgctaaact
4140cttccaccat tattgggtac cgcatcacag tagttgcggc aggagaaggt atccctattt
4200ttgaagattt tgtggactcc tcagtaggat actacacagt cacagggctg gagccgggca
4260ttgactatga tatcagcgtt atcactctca ttaatggcgg cgagagtgcc cctactacac
4320tgacacaaca aacggctgtt cctcctccca ctgacctgcg attcaccaac attggtccag
4380acaccatgcg tgtcacctgg gctccacccc catccattga tttaaccaac ttcctggtgc
4440gttactcacc tgtgaaaaat gaggaagatg ttgcagagtt gtcaatttct ccttcagaca
4500atgcagtggt cttaacaaat ctcctgcctg gtacagaata tgtagtgagt gtctccagtg
4560tctacgaaca acatgagagc acacctctta gaggaagaca gaaaacaggt cttgattccc
4620caactggcat tgacttttct gatattactg ccaactcttt tactgtgcac tggattgctc
4680ctcgagccac catcactggc tacaggatcc gccatcatcc cgagcacttc agtgggagac
4740ctcgagaaga tcgggtgccc cactctcgga attccatcac cctcaccaac ctcactccag
4800gcacagagta tgtggtcagc atcgttgctc ttaatggcag agaggaaagt cccttattga
4860ttggccaaca atcaacagtt tctgatgttc cgagggacct ggaagttgtt gctgcgaccc
4920ccaccagcct actgatcagc tgggatgctc ctgctgtcac agtgagatat tacaggatca
4980cttacggaga gacaggagga aatagccctg tccaggagtt cactgtgcct gggagcaagt
5040ctacagctac catcagcggc cttaaacctg gagttgatta taccatcact gtgtatgctg
5100tcactggccg tggagacagc cccgcaagca gcaagccaat ttccattaat taccgaacag
5160aaattgacaa accatcccag atgcaagtga ccgatgttca ggacaacagc attagtgtca
5220agtggctgcc ttcaagttcc cctgttactg gttacagagt aaccaccact cccaaaaatg
5280gaccaggacc aacaaaaact aaaactgcag gtccagatca aacagaaatg actattgaag
5340gcttgcagcc cacagtggag tatgtggtta gtgtctatgc tcagaatcca agcggagaga
5400gtcagcctct ggttcagact gcagtaacca acattgatcg ccctaaagga ctggcattca
5460ctgatgtgga tgtcgattcc atcaaaattg cttgggaaag cccacagggg caagtttcca
5520ggtacagggt gacctactcg agccctgagg atggaatcca tgagctattc cctgcacctg
5580atggtgaaga agacactgca gagctgcaag gcctcagacc gggttctgag tacacagtca
5640gtgtggttgc cttgcacgat gatatggaga gccagcccct gattggaacc cagtccacag
5700ctattcctgc accaactgac ctgaagttca ctcaggtcac acccacaagc ctgagcgccc
5760agtggacacc acccaatgtt cagctcactg gatatcgagt gcgggtgacc cccaaggaga
5820agaccggacc aatgaaagaa atcaaccttg ctcctgacag ctcatccgtg gttgtatcag
5880gacttatggt ggccaccaaa tatgaagtga gtgtctatgc tcttaaggac actttgacaa
5940gcagaccagc tcagggagtt gtcaccactc tggagaatgt cagcccacca agaagggctc
6000gtgtgacaga tgctactgag accaccatca ccattagctg gagaaccaag actgagacga
6060tcactggctt ccaagttgat gccgttccag ccaatggcca gactccaatc cagagaacca
6120tcaagccaga tgtcagaagc tacaccatca caggtttaca accaggcact gactacaaga
6180tctacctgta caccttgaat gacaatgctc ggagctcccc tgtggtcatc gacgcctcca
6240ctgccattga tgcaccatcc aacctgcgtt tcctggccac cacacccaat tccttgctgg
6300tatcatggca gccgccacgt gccaggatta ccggctacat catcaagtat gagaagcctg
6360ggtctcctcc cagagaagtg gtccctcggc cccgccctgg tgtcacagag gctactatta
6420ctggcctgga accgggaacc gaatatacaa tttatgtcat tgccctgaag aataatcaga
6480agagcgagcc cctgattgga aggaaaaaga cagacgagct tccccaactg gtaacccttc
6540cacaccccaa tcttcatgga ccagagatct tggatgttcc ttccacagtt caaaagaccc
6600ctttcgtcac ccaccctggg tatgacactg gaaatggtat tcagcttcct ggcacttctg
6660gtcagcaacc cagtgttggg caacaaatga tctttgagga acatggtttt aggcggacca
6720caccgcccac aacggccacc cccataaggc ataggccaag accatacccg ccgaatgtag
6780gtgaggaaat ccaaattggt cacatcccca gggaagatgt agactatcac ctgtacccac
6840acggtccggg actcaatcca aatgcctcta caggacaaga agctctctct cagacaacca
6900tctcatgggc cccattccag gacacttctg agtacatcat ttcatgtcat cctgttggca
6960ctgatgaaga acccttacag ttcagggttc ctggaacttc taccagtgcc actctgacag
7020gcctcaccag aggtgccacc tacaacatca tagtggaggc actgaaagac cagcagaggc
7080ataaggttcg ggaagaggtt gttaccgtgg gcaactctgt caacgaaggc ttgaaccaac
7140ctacggatga ctcgtgcttt gacccctaca cagtttccca ttatgccgtt ggagatgagt
7200gggaacgaat gtctgaatca ggctttaaac tgttgtgcca gtgcttaggc tttggaagtg
7260gtcatttcag atgtgattca tctagatggt gccatgacaa tggtgtgaac tacaagattg
7320gagagaagtg ggaccgtcag ggagaaaatg gccagatgat gagctgcaca tgtcttggga
7380acggaaaagg agaattcaag tgtgaccctc atgaggcaac gtgttatgat gatgggaaga
7440cataccacgt aggagaacag tggcagaagg aatatctcgg tgccatttgc tcctgcacat
7500gctttggagg ccagcggggc tggcgctgtg acaactgccg cagacctggg ggtgaaccca
7560gtcccgaagg cactactggc cagtcctaca accagtattc tcagagatac catcagagaa
7620caaacactaa tgttaattgc ccaattgagt gcttcatgcc tttagatgta caggctgaca
7680gagaagattc ccgagagtaa atcatctttc caatccagag gaacaagcat gtctctctgc
7740caagatccat ctaaactgga gtgatgttag cagacccagc ttagagttct tctttctttc
7800ttaagccctt tgctctggag gaagttctcc agcttcagct caactcacag cttctccaag
7860catcaccctg ggagtttcct gagggttttc tcataaatga gggctgcaca ttgcctgttc
7920tgcttcgaag tattcaatac cgctcagtat tttaaatgaa gtgattctaa gatttggttt
7980gggatcaata ggaaagcata tgcagccaac caagatgcaa atgttttgaa atgatatgac
8040caaaatttta agtaggaaag tcacccaaac acttctgctt tcacttaagt gtctggcccg
8100caatactgta ggaacaagca tgatcttgtt actgtgatat tttaaatatc cacagtactc
8160actttttcca aatgatccta gtaattgcct agaaatatct ttctcttacc tgttatttat
8220caatttttcc cagtattttt atacggaaaa aattgtattg aaaacactta gtatgcagtt
8280gataagagga atttggtata attatggtgg gtgattattt tttatactgt atgtgccaaa
8340gctttactac tgtggaaaga caactgtttt aataaaagat ttacattcca caacttgaag
8400ttcatctatt tgatataaga caccttcggg ggaaataatt cctgtgaata ttctttttca
8460attcagcaaa catttgaaaa tctatgatgt gcaagtctaa ttgttgattt cagtacaaga
8520ttttctaaat cagttgctac aaaaactgat tggtttttgt cacttcatct cttcactaat
8580ggagatagct ttacactttc tgctttaata gatttaagtg gaccccaata tttattaaaa
8640ttgctagttt accgttcaga agtataatag aaataatctt tagttgctct tttctaacca
8700ttgtaattct tcccttcttc cctccacctt tccttcattg aataaacctc tgttcaaaga
8760gattgcctgc aagggaaata aaaatgacta agatattaaa aaaaaaaaaa aaaaa
8815496578DNAHomo sapiens 49ctcagtggcg gagcgcggct gccggtgtgc ggccgggagc
gatcgccgcg gggcaggggc 60gcggcgggca ccgcgcagag cgcgcagaac agacggacgg
cggcggggac ccgacggcgg 120cgcctcggca ctccccagac tccggccagc gcccccctgc
cagccgcaag cacccagccc 180cggcccaccc cgggctctcg atggcccccg aggccggggc
gaccctgcgc gcgccgcgcc 240ggctgtcctg ggcggcgctg ctgctcttgg ccgcgctgct
ccccgtcgcc tcctcggcgg 300cggcctcagt tgaccaccca ctgaagccaa ggcatgtgaa
actgctgtcc actaaaatgg 360gcctgaaagt cacgtgggac ccacccaaag atgctaccag
tagacctgtg gagcattaca 420acattgccta tgggaagtca ctgaaaagtc ttaaatacat
caaggtgaat gcggagacat 480actccttcct tattgaggat gtggagccgg gggtagtgta
ctttgtgctg cttactgcag 540aaaaccacag tggagtgagc cgtcctgttt acagagctga
aagcccacct ggaggtgaat 600ggatcgagat tgatggtttt cccattaagg gtccaggacc
atttaatgaa accgtcacag 660aaaaggaagt gcccaacaag cccttgcgtg tgcgtgtccg
gtcctcagat gacaggctgt 720ccgttgcgtg gaaggcacca cgcctgtctg gagccaagag
tccacgcaga tcacggggtt 780ttctcctggg ctacggggag agtggccgga agatgaatta
tgttccactg acaagagatg 840aacggacaca cgaaattaaa aagctagcct cggaatccgt
gtatgtggtc tccctgcagt 900ccatgaactc tcagggccgg agccaaccag tctacagggc
tgccctaaca aagcgaaaga 960tttcagaaga ggacgaattg gatgtacctg acgacatcag
cgtccgggtt atgtcatctc 1020agtctgtgct tgtgtcctgg gtggatcctg ttctggaaaa
acagaagaaa gttgttgcat 1080caagacagta caccgtgcgc tatcgagaga agggggaatt
ggccaggtgg gattataagc 1140agatcgctaa caggcgtgtg ctgattgaga acctgattcc
agacactgtg tatgaatttg 1200cagtccgtat ttcacagggt gaaagagatg gcaaatggag
tacgtcagtc ttccaaagaa 1260caccagaatc tgcccctacc acagctcctg aaaacttgaa
cgtctggcca gtcaatggca 1320aacctacagt tgtcgctgca tcttgggatg cgctaccaga
gactgagggg aaagtgaaag 1380aatacattct ttcatacgcc ccggctctca aaccatttgg
agcaaagtcc ctcacctatc 1440ctggagacac tacttctgcc ctggtggatg gtctgcagcc
tggggaacgc tatcttttca 1500aaatccgggc cacaaacagg agaggcctgg gacctcactc
caaagccttc attgtcgcta 1560tgccaacaac cagtaaggcg gatgttgagc agaacacgga
ggacaatggg aaacccgaaa 1620aacctgagcc ttcctcacct tctcccagag ctccagcttc
ctcccaacac ccctctgtgc 1680ctgcttctcc ccaagggaga aatgccaagg accttcttct
tgacttgaag aacaaaatat 1740tggctaatgg tggggcgccc cgaaaacccc agcttcgcgc
caagaaggca gaggagctgg 1800atcttcagtc gacagaaatc actggggagg aggagctggg
ttcccgggag gactcgccca 1860tgtcaccctc agacacccaa gaccagaaac ggaccctgag
gccgccaagt agacacggcc 1920actcggtggt tgctcccggc aggactgcag tgagggcccg
gatgccagcg ctgccccgaa 1980gggaaggcgt agataagcct ggcttttccc tggccacgca
gccccgccca ggggcgcccc 2040cctcggcttc ggcctctcct gcccaccacg cgtccaccca
gggcacctct catcgtcctt 2100ccctgcctgc cagcttgaat gacaacgact tggtggactc
agacgaagat gagcgcgctg 2160tgggctccct ccaccccaag ggcgccttcg cccagccccg
gccagccctg tcccccagcc 2220gccagtcccc gtccagcgtt ctccgcgaca gaagctctgt
gcaccccggc gcaaagccag 2280cctcgccggc ccggaggacc ccccattcag gggccgcaga
ggaagattcc agtgcctcag 2340ccccaccctc aagactttct ccaccccatg ggggatcatc
tcggctgctg cccacccagc 2400cacacctgag ctctccactt tccaagggcg ggaaggatgg
tgaggacgcc ccagccacca 2460actccaatgc gccatcacgg tccaccatgt cctcctccgt
ctcttctcat ctctcgtcca 2520ggacgcaggt ctctgaggga gcggaggctt ctgatggtga
aagccacggt gacggcgata 2580gggaagacgg cggaaggcag gcggaggcca cggcccagac
gctgcgggcc cggcctgcct 2640ctggacactt ccatttgctc agacacaaac cctttgctgc
caacgggagg tctccaagca 2700ggttcagcat tgggcgggga cctcggctgc agccctccag
ctccccacag tcgactgtgc 2760cctcccgagc ccaccccagg gttccctctc actctgattc
ccaccctaag cttagctcag 2820gtatccatgg agacgaggag gatgagaagc cgcttcctgc
caccgttgtc aatgaccacg 2880tgccttcctc ctccaggcag cccatctccc ggggctggga
ggacttaagg agaagcccgc 2940agagaggggc cagcctgcat cggaaggaac ccatcccaga
gaaccccaaa tccacagggg 3000cagatacaca tcctcagggc aagtactcct ccctggcctc
caaggctcag gatgttcaac 3060agagcacaga cgcggacacg gagggtcatt ctcccaaagc
acagccaggg tccacagacc 3120gccacgcgtc ccctgctcgt ccgcccgcag cacggtcaca
gcagcatccc agtgttccca 3180gaaggatgac acccggccgg gccccacaac agcagccccc
tcctcccgtc gccacgtccc 3240agcaccaccc gggaccccag agcagagacg cgggtcggtc
accttcccag cccaggctct 3300cactgaccca ggccgggcgg ccccgcccca cgtcgcaggg
ccgctcccac tcctcctcgg 3360acccttacac ggcgagctcc agagggatgc tccccacggc
cctccagaac caggacgagg 3420atgcccaggg cagctacgac gacgacagca cagaagtcga
ggcccaggat gtgcgggccc 3480ccgcgcacgc cgcgcgcgcc aaggaggcag ctgcgtccct
tcccaagcac cagcaggtgg 3540agtctcccac aggcgcaggg gcaggtggcg accacaggtc
ccagcgcgga catgcggcct 3600cccccgccag gcccagccga cccggcggcc cccagtcccg
cgcccgggta cccagcaggg 3660cagcgccggg gaagtcggag cctccttcca agcggcccct
gtcctccaag tcccagcagt 3720cggtctcagc cgaggacgac gaggaggagg acgcgggatt
ttttaaaggc gggaaagaag 3780accttctgtc ttcctctgtg ccaaagtggc cctcttcctc
cactcccagg ggcggcaaag 3840acgccgatgg gagcctcgcc aaggaagaga gggagcctgc
catcgcgctt gcccctcgcg 3900gagggagcct ggctcctgtg aagcgacctc tccccccacc
tccaggcagc tcccccaggg 3960cctcccacgt cccttcccga ctgccgcctc gcagcgctgc
caccgtgagc cccgtcgcgg 4020gcacccaccc ctggccgcag tacaccacgc gcgccccacc
tggccacttc tccaccaccc 4080cgatgctgtc cttgcgccag aggatgatgc atgccagatt
ccgtaaccct ctctcccgac 4140agcctgccag accctcttac agacaaggtt ataatggcag
accaaatgta gaagggaaag 4200tccttcctgg tagtaatgga aaaccgaatg gacagagaat
tatcaatggc cctcaaggaa 4260caaagtgggt tgtggacctt gatcgtgggt tagtattgaa
tgcagaagga aggtacctcc 4320aagattcaca tggaaatcct cttcggatta aactaggagg
agatggtcga accattgtag 4380atctggaagg gacccccgtg gtgagtcctg acggcctccc
actctttggg caggggcgac 4440atggcacacc tctggccaat gcccaagata agccaatttt
gagtcttgga ggaaagccgc 4500tggtgggctt ggaggtcatc aaaaaaacca cccatccccc
taccactacc atgcagccca 4560ccactactac gacgcccctg cctaccacta caaccccgag
gcccaccact gccaccaccc 4620gccgcacgac caccacccgc cgcacgacca ccaggcgtcc
aacaaccaca gtccgaacca 4680ctacgcggac aaccaccacc accaccccca cacccaccac
tcccatcccc acctgtcccc 4740ctgggacctt ggaacggcac gacgatgatg gcaacctgat
aatgagctcc aatgggatcc 4800cagagtgcta cgctgaagaa gatgagttct caggcttgga
gactgacact gcagtaccta 4860cggaagaggc ctacgttata tatgatgaag attatgaatt
tgagacgtca aggccaccaa 4920ccaccactga gccttcgacc actgctacca caccgagggt
gatcccagag gaaggcgcca 4980tcagttcctt tcctgaagaa gaatttgatc tggctggaag
gaaacgattt gttgctcctt 5040acgtgacgta cctaaataaa gacccatcag ccccgtgctc
tctgactgat gcactggatc 5100acttccaagt ggacagcctg gatgaaatca tccccaatga
cctgaagaag agtgacctgc 5160ctccccagca tgctccccgc aacatcaccg tggtggccgt
ggaaggttgc cactcatttg 5220tcattgtgga ctgggacaaa gccaccccag gagatgtggt
cacaggttac ttggtttaca 5280gtgcatccta tgaagacttc atcaggaaca agtggtccac
tcaagcttca tcagtaactc 5340acttgcccat tgagaaccta aagcccaaca cgaggtatta
ttttaaagtg caagcacaaa 5400atcctcatgg ctacggacct atcagccctt cggtctcatt
tgtcaccgaa tcagataatc 5460ctctgcttgt tgtgaggccc ccaggcggtg agcctatctg
gatcccattc gctttcaaac 5520atgatcccag ctacacggac tgccatggac ggcaatatgt
gaagcgcacg tggtatcgaa 5580agttcgtggg agttgttctt tgtaattcac tgaggtataa
aatctacctc agtgacaacc 5640tgaaagatac attctacagc attggagaca gctggggaag
aggtgaagac cattgccaat 5700ttgtggattc acaccttgat ggaagaacag ggcctcagtc
ctatgtagaa gccctcccta 5760ctattcaagg ctactatcgc cagtatcgtc aggagcctgt
caggtttggg aacatcggct 5820tcggaacccc ctactactat gtgggctggt acgagtgtgg
ggtctccatc cctggaaagt 5880ggtaatcaca ggaccgtcat gctgcaagct tgccctgccc
agccccacca actaagtcgc 5940actaggggct gtgagcaaag acagccagcg tgctcagccc
cgctgcccta ggtgccagga 6000aggtcataga tggacactgg ccattctggt catctcagtc
tggaactcag tcccacttct 6060tggcctggac aatgaacagg attcagtttt gctgttaact
ttgcttctct actttttttt 6120gtttgtttgt aatagcacat cccagagaca tcagaaacca
gcaactgatt cagtgtgatt 6180tccagacttt ttaggcatga aattcggaca cttcagtatt
tccaggaata gcatatgcac 6240gctgttcttg cttcatggaa tgctacatgc tttctgtttt
tctcattttg gatttctcca 6300aaactaactg aatttaagct tcaggtccct ttgtatgcag
tagaaaggaa ttattaaaaa 6360caccaccaaa gaaaataaat atatcctact tgaaatttac
tctatggact tacccactgc 6420tagaataaat gtatcaaatc ttatttgtaa attctcaatt
ttgatatata tatgtatata 6480tgcatataca tatccacact tgtctgcaag aatattgatt
aaaattgcta aatttgtact 6540tgttcaccag aaaaaaaaaa aaaaaaaaaa aaaaaaaa
6578503219DNAHomo sapiens 50actgtaggca ccaccgggcg
ccgaatggct gttttctaac tgggatcctc ggtgacgtat 60ggctgcctgc cccttggcag
ctgtctttat ggaccagtag gcagagcgaa attgacgctg 120acaagacttt tgcatcttgg
aagggactgt aatctactgt agtgaagaac agagcctctc 180aatcagacgg gtgtaaataa
gagacggagg ggagtccaaa agaaaaggaa gaggaggaaa 240aacaagtgtg tgttgggggg
aacaggggga aaagcatttt tggtggatgg tatgaagcca 300gccatggaaa ctgcagccga
ggaaaatact gaacaaagcc aagagagaaa agtgaacagc 360agagctgaaa tggaaattgg
caggtaccac tggatgtacc caggctcaaa gaaccaccag 420taccatcccg tgccaaccct
gggggacagg gctagcccct tgagcagtcc aggctgcttt 480gaatgctgca tcaagtgtct
gggaggagtc ccctacgcct ccctggtggc caccatcctc 540tgcttctccg gggtggcctt
attctgcggc tgtgggcatg tggctctcgc aggcaccgtg 600gcgattcttg agcaacactt
ctccaccaac gccagtgacc atgccttgct gagcgaggtg 660atacaactga tgcagtatgt
catctatgga attgcgtcct ttttcttctt gtatgggatc 720attctgttgg cagaaggctt
ttacaccaca agtgcagtga aagaactgca cggtgagttt 780aaaacaaccg cttgtggccg
atgcatcagt ggaatgttcg ttttcctcac ctatgtgctt 840ggagtggcct ggctgggtgt
gtttggtttc tcagcggtgc ccgtgtttat gttctacaac 900atatggtcaa cttgtgaagt
catcaagtca ccgcagacca acgggaccac gggtgtggag 960cagatctgtg tggatatccg
acaatacggt atcattcctt ggaatgcttt ccccggaaaa 1020atatgtggct ctgccctgga
gaacatctgc aacacaaacg agttctacat gtcctatcac 1080ctgttcattg tggcctgtgc
aggagctggt gccaccgtca ttgccctgat ccacttcctc 1140atgatactgt cttctaactg
ggcttactta aaggatgcga gcaaaatgca ggcttaccag 1200gatatcaaag caaaggaaga
acaggaactg caagatatcc agtctcggtc aaaagaacaa 1260ctcaattctt acacataaat
gtttgccaga gtgtttcggc cgacgtattt acagctctga 1320caaatcatca gacagctgct
ctgcagtaca gatgtgtatc ccaccaaact aatgtagatg 1380tacaaacact tcactgtctg
tctcaagctg ctgggatgta tctctaggaa aaccttccag 1440tgggtaaatc tttttcttta
gaacaaatat tggaggtttc atgttagcca ttttaaaagg 1500caacactttg acaaaatgat
cgttcatact ttgggaattt gtggcatgtt cacatttatt 1560gctagggcaa ttctaccaag
acactcaatg gaatatgtca cactccttaa tagggacctg 1620tgactcctta ataaggacct
gtgacatgcc cagcatcaag ggataagacc gtaaattcac 1680atatatgcca tctgtcctca
agtgttatct acataggaaa taaaatggaa ttgatgtaaa 1740gttccatttc tgacagctga
catttattaa actttggatc aaagataatg tgattcttat 1800gattgatttc tcaaactagc
ttttccctcc caagtccagg acccattaat ttcctgagcc 1860aatcagaaat atatttttca
ataatgctaa aattagctac aattctgctg accctactat 1920taaagaatct ggatgctgga
ctcactgaca agctttccag aagcaatttt ataacagatt 1980tcattttaac aaaatactga
tccaattttc attattcttg agaaatgtca gctttgcctt 2040aatgagtatt tgctttaaat
ttctaagaat ttatatcata actagagacc caaatatctt 2100tcacagaatt ttgttccata
aatgtttttc ttaattatta agaagtgtta ccttattaaa 2160atgaccacca ttctaaacca
tttttcagtg gtctggatac gaagtttaca gtttcatacc 2220aactatctaa aacctaattg
caaattgacc acagacctct aacctcctac ttttatagac 2280ttgaatactt aagtaattta
aattagggtt ggtatttcat ttttttctta tctaaatctt 2340agtttcctgg aataataaag
tttgatgttc agcaagagaa ctgcttgagt ttaagccatt 2400ttcaaaagaa acttgccttt
tacattattg tgttccagaa cattaagtga ctgtaggtac 2460tgggtattag tgatggtaaa
ctttgtgttg ctctttatga aatgatccat ataactgttg 2520ggtgcatcag tgcttttcaa
aggggctgct tactataggg ttaactatgt atattcattg 2580ttaagagtta acttgtggtt
tggctgtttc ctggatttta taacatacat gtgcagaaat 2640gtattcaaat gaaaggaagc
atacctttat caagatgcta ttaaaattga acatcaagta 2700taatatttca tttggattct
cttttttggt taatgcctaa aaatgcctat ttgggatttt 2760tttttttttt taaattaaga
gaagctctct tctgtgtaga acagttgttc caaaatagct 2820tagtgttttg ttttcctgtt
gcatgacaga tttaactatt ctttccagca gtggaggtgc 2880tgtcagagtc cagtgttcta
gaagaggcag tgtctaaagc ctaattttac ttttctaatt 2940ctggtagcta ttaccaggaa
tttttgaaag ttttgtttaa gtagtctaat attttttatg 3000taaagagcat taaattttgc
tatgtataaa tttttgtaac ctaacagtga atcaatattt 3060tctatcagtg ccaagggctt
cctgtagttc tattcaagtg ttacaataaa tatttgtaga 3120taatagtcaa tacttgtgta
tgcttatttt aaagatctat ttaggtggaa atagttgtgg 3180atgtactaag agtaatgaaa
taaaattata gcttcaaaa 3219511521DNAHomo sapiens
51ttacattagc aagagagcaa gttgttccag tagtcgcctg gcaggagaat ttgaaagggt
60gccccaaagg acaatctcta aaggggtaag ggagatacct accttgtctg gtaggggaga
120tgtttcgttt tcatgcttta ccagaaaatc cacttccctg ccgaccttag tttcaaagct
180tattcttaat tagagacaag aaacctgttt caacttgaag acaccgtatg aggtgaatgg
240acagccagcc accacaatga aagaaatcaa accaggaata acctatgctg aacccacgcc
300tcaatcgtcc ccaagtgttt cctgacacgc atctttgctt acagtgcatc acaactgaag
360aatggggttc aacttgacgc ttgcaaaatt accaaataac gagctgcacg gccaagagag
420tcacaattca ggcaacagga gcgacgggcc aggaaagaac accacccttc acaatgaatt
480tgacacaatt gtcttgccgg tgctttatct cattatattt gtggcaagca tcttgctgaa
540tggtttagca gtgtggatct tcttccacat taggaataaa accagcttca tattctatct
600caaaaacata gtggttgcag acctcataat gacgctgaca tttccatttc gaatagtcca
660tgatgcagga tttggacctt ggtacttcaa gtttattctc tgcagataca cttcagtttt
720gttttatgca aacatgtata cttccatcgt gttccttggg ctgataagca ttgatcgcta
780tctgaaggtg gtcaagccat ttggggactc tcggatgtac agcataacct tcacgaaggt
840tttatctgtt tgtgtttggg tgatcatggc tgttttgtct ttgccaaaca tcatcctaac
900aaatggtcag ccaacagagg acaatatcca tgactgctca aaacttaaaa gtcctttggg
960ggtcaaatgg catacggcag tcacctatgt gaacagctgc ttgtttgtgg ccgtgctggt
1020gattctgatc ggatgttaca tagccatatc caggtacatc cacaaatcca gcaggcaatt
1080cataagtcag tcaagccgaa agcgaaaaca taaccagagc atcagggttg ttgtggctgt
1140gttttttacc tgctttctac catatcactt gtgcagaatt ccttttactt ttagtcactt
1200agacaggctt ttagatgaat ctgcacaaaa aatcctatat tactgcaaag aaattacact
1260tttcttgtct gcgtgtaatg tttgcctgga tccaataatt tactttttca tgtgtaggtc
1320attttcaaga aggctgttca aaaaatcaaa tatcagaacc aggagtgaaa gcatcagatc
1380actgcaaagt gtgagaagat cggaagttcg catatattat gattacactg atgtgtaggc
1440cttttattgt ttgttggaat cgatatgtac aaagtgtaaa taaatgtttc ttttcattat
1500ccttgcttga gcccatcaaa a
1521522021DNAHomo sapiens 52ggcccggacg ggacgtgcgc gctcaaaggt tgcccgtctc
tgacgcccgc atttcctggt 60ctggagccgg ctgagccaca gcagggtcgc cgcggggtcc
cggggccgtg ctcccctgcc 120cctcccggga gcgcgcgggg cggggcgggg cggggcggga
ccaggcgggc gagctgggcc 180ctcgcccctc cctcgggcgg tcacctgggc acgggcgctg
caggtgtcgg ggcctcaacc 240ttgcggagcc gacagccatc gatcctcggg tggcctcgag
gtggtggcag ggccgccccc 300tgcagtccgg agacgaacgc acggaccggg cctccggagg
caggttcggc tggaaggaac 360cgctctcgct tcgtcctaca cttgcgcaaa tgtctccgag
cttactcaca tagcatattg 420gtatatcaaa atgaaatgca aggaaccaaa aataacataa
ttgaaggcag taaaagtgaa 480attaaatagg aagatcatca gtcaaggaag acccactgga
gaggacagaa aatgaagcag 540tgttttatca tgtgtatttc agcaggtctt cttgaaattt
aactaaaaat atgactgctc 600tctcttcaga gaactgctct tttcagtacc agttacgtca
aacaaaccag cccctagatg 660ttaactatct gctattcttg atcatacttg ggaaaatatt
attaaatatc cttacactag 720gaatgagaag aaaaaacacc tgtcaaaatt ttatggaata
tttttgcatt tcactagcat 780tcgttgatct tttacttttg gtaaacattt ccattatatt
gtatttcagg gattttgtac 840ttttaagcat taggttcact aaataccaca tctgcctatt
tactcaaatt atttccttta 900cttatggctt tttgcattat ccagttttcc tgacagcttg
tatagattat tgcctgaatt 960tctctaaaac aaccaagctt tcatttaagt gtcaaaaatt
attttatttc tttacagtaa 1020ttttaatttg gatttcagtc cttgcttatg ttttgggaga
cccagccatc taccaaagcc 1080tgaaggcaca gaatgcttat tctcgtcact gtcctttcta
tgtcagcatt cagagttact 1140ggctgtcatt tttcatggtg atgattttat ttgtagcttt
cataacctgt tgggaagaag 1200ttactacttt ggtacaggct atcaggataa cttcctatat
gaatgaaact atcttatatt 1260ttcctttttc atcccactcc agttatactg tgagatctaa
aaaaatattc ttatccaagc 1320tcattgtctg ttttctcagt acctggttac catttgtact
acttcaggta atcattgttt 1380tacttaaagt tcagattcca gcatatattg agatgaatat
tccctggtta tactttgtca 1440atagttttct cattgctaca gtgtattggt ttaattgtca
caagcttaat ttaaaagaca 1500ttggattacc tttggatcca tttgtcaact ggaagtgctg
cttcattcca cttacaattc 1560ctaatcttga gcaaattgaa aagcctatat caataatgat
ttgttaatat tattaattaa 1620aagttacagc tgtcataaga tcataatttt atgaacagaa
agaactcagg acatattaaa 1680aaataaactg aactaaaaca acttttgccc cctgactgat
agcatttcag aatgtgtctt 1740ttgaagggct atgataccag ttattaaata gtgttttatt
ttaaaaacaa aataattcca 1800agaagttttt atagttattc agggacacta tattacaaat
attactttgt tattaacaca 1860aaaagtgata agagttaaca tttggctata ctgatgtttg
tgttactcaa aaaaactact 1920ggatgcaaac tgttatgtaa atctgagatt tcactgacaa
ctttaagata tcaacctaaa 1980catttttatt aaatgttcaa atgaaagcaa gaaaaaaaaa a
2021534150DNAHomo sapiens 53actcggtgcg ccttccgcgg
accgggcgac ccagtgcacg gccgccgcgt cactctcggt 60cccgctgacc ccgcgccgag
ccccggcggc tctggccgcg gccgcactca gcgccacgcg 120tcgaaagcgc aggccccgag
gacccgccgc actgacagta tgagccgcac agcctacacg 180gtgggagccc tgcttctcct
cttggggacc ctgctgccgg ctgctgaagg gaaaaagaaa 240gggtcccaag gtgccatccc
cccgccagac aaggcccagc acaatgactc agagcagact 300cagtcgcccc agcagcctgg
ctccaggaac cgggggcggg gccaagggcg gggcactgcc 360atgcccgggg aggaggtgct
ggagtccagc caagaggccc tgcatgtgac ggagcgcaaa 420tacctgaagc gagactggtg
caaaacccag ccgcttaagc agaccatcca cgaggaaggc 480tgcaacagtc gcaccatcat
caaccgcttc tgttacggcc agtgcaactc tttctacatc 540cccaggcaca tccggaagga
ggaaggttcc tttcagtcct gctccttctg caagcccaag 600aaattcacta ccatgatggt
cacactcaac tgccctgaac tacagccacc taccaagaag 660aagagagtca cacgtgtgaa
gcagtgtcgt tgcatatcca tcgatttgga ttaagccaaa 720tccaggtgca cccagcatgt
cctaggaatg cagccccagg aagtcccaga cctaaaacaa 780ccagattctt acttggctta
aacctagagg ccagaagaac ccccagctgc ctcctggcag 840gagcctgctt gtgcgtagtt
cgtgtgcatg agtgtggatg ggtgcctgtg ggtgttttta 900gacaccagag aaaacacagt
ctctgctaga gagcactccc tattttgtaa acatatctgc 960tttaatgggg atgtaccaga
aacccacctc accccggctc acatctaaag gggcggggcc 1020gtggtctggt tctgactttg
tgtttttgtg ccctcctggg gaccagaatc tcctttcgga 1080atgaatgttc atggaagagg
ctcctctgag ggcaagagac ctgttttagt gctgcattcg 1140acatggaaaa gtccttttaa
cctgtgcttg catcctcctt tcctcctcct cctcacaatc 1200catctcttct taagttgata
gtgactatgt cagtctaatc tcttgtttgc caaggttcct 1260aaattaattc acttaaccat
gatgcaaatg tttttcattt tgtgaagacc ctccagactc 1320tgggagaggc tggtgtgggc
aaggacaagc aggatagtgg agtgagaaag ggagggtgga 1380gggtgaggcc aaatcaggtc
cagcaaaagt cagtagggac attgcagaag cttgaaaggc 1440caataccaga acacaggctg
atgcttctga gaaagtcttt tcctagtatt taacagaacc 1500caagtgaaca gaggagaaat
gagattgcca gaaagtgatt aactttggcc gttgcaatct 1560gctcaaacct aacaccaaac
tgaaaacata aatactgacc actcctatgt tcggacccaa 1620gcaagttagc taaaccaaac
caactcctct gctttgtccc tcaggtggaa aagagaggta 1680gtttagaact ctctgcatag
gggtgggaat taatcaaaaa cctcagaggc tgaaattcct 1740aatacctttc ctttatcgtg
gttatagtca gctcatttcc attccactat ttcccataat 1800gcttctgaga gccactaact
tgattgataa agatcctgcc tctgctgagt gtacctgaca 1860gtagtctaag atgagagagt
ttagggacta ctctgtttta gcaagagata ttttgggggt 1920ctttttgttt taactattgt
caggagattg ggctaaagag aagacgacga gagtaaggaa 1980ataaagggaa ttgcctctgg
ctagagagta gttaggtgtt aatacctggt agagatgtaa 2040gggatatgac ctccctttct
ttatgtgctc actgaggatc tgaggggacc ctgttaggag 2100agcatagcat catgatgtat
tagctgttca tctgctactg gttggatgga cataactatt 2160gtaactattc agtatttact
ggtaggcact gtcctctgat taaacttggc ctactggcaa 2220tggctactta ggattgatct
aagggccaaa gtgcagggtg ggtgaacttt attgtacttt 2280ggatttggtt aacctgtttt
cttcaagcct gaggttttat atacaaactc cctgaatact 2340ctttttgcct tgtatcttct
cagcctccta gccaagtcct atgtaatatg gaaaacaaac 2400actgcagact tgagattcag
ttgccgatca aggctctggc attcagagaa cccttgcaac 2460tcgagaagct gtttttattt
cgtttttgtt ttgatccagt gctctcccat ctaacaacta 2520aacaggagcc atttcaaggc
gggagatatt ttaaacaccc aaaatgttgg gtctgatttt 2580caaactttta aactcactac
tgatgattct cacgctaggc gaatttgtcc aaacacatag 2640tgtgtgtgtt ttgtatacac
tgtatgaccc caccccaaat ctttgtattg tccacattct 2700ccaacaataa agcacagagt
ggatttaatt aagcacacaa atgctaaggc agaattttga 2760gggtgggaga gaagaaaagg
gaaagaagct gaaaatgtaa aaccacacca gggaggaaaa 2820atgacattca gaaccagcaa
acactgaatt tctcttgttg ttttaactct gccacaagaa 2880tgcaatttcg ttaacggaga
tgacttaagt tggcagcagt aatcttcttt taggagcttg 2940taccacagtc ttgcacataa
gtgcagattt ggctcaagta aagagaattt cctcaacact 3000aacttcactg ggataatcag
cagcgtaact accctaaaag catatcacta gccaaagagg 3060gaaatatctg ttcttcttac
tgtgcctata ttaagactag tacaaatgtg gtgtgtcttc 3120caactttcat tgaaaatgcc
atatctatac catattttat tcgagtcact gatgatgtaa 3180tgatatattt tttcattatt
atagtagaat atttttatgg caagatattt gtggtcttga 3240tcatacctat taaaataatg
ccaaacacca aatatgaatt ttatgatgta cactttgtgc 3300ttggcattaa aagaaaaaaa
cacacatcct ggaagtctgt aagttgtttt ttgttactgt 3360aggtcttcaa agttaagagt
gtaagtgaaa aatctggagg agaggataat ttccactgtg 3420tggaatgtga atagttaaat
gaaaagttat ggttatttaa tgtaattatt acttcaaatc 3480ctttggtcac tgtgatttca
agcatgtttt ctttttctcc tttatatgac tttctctgag 3540ttgggcaaag aagaagctga
cacaccgtat gttgttagag tcttttatct ggtcagggga 3600aacaaaatct tgacccagct
gaacatgtct tcctgagtca gtgcctgaat ctttattttt 3660taaattgaat gttccttaaa
ggttaacatt tctaaagcaa tattaagaaa gactttaaat 3720gttattttgg aagacttacg
atgcatgtat acaaacgaat agcagataat gatgactagt 3780tcacacataa agtcctttta
aggagaaaat ctaaaatgaa aagtggataa acagaacatt 3840tataagtgat cagttaatgc
ctaagagtga aagtagttct attgacattc ctcaagatat 3900ttaatatcaa ctgcattatg
tattatgtct gcttaaatca tttaaaaacg gcaaagaatt 3960atatagacta tgaggtacct
tgctgtgtag gaggatgaaa ggggagttga tagtctcata 4020aaactaattt ggcttcaagt
ttcatgaatc tgtaactaga atttaatttt caccccaata 4080atgttctata tagcctttgc
taaagagcaa ctaataaatt aaacctattc tttctgtgaa 4140aaaaaaaaaa
4150543044DNAHomo sapiens
54gctttattgt ttgcttgttt tgttccggag tcggggccgg gagggagtgc aggaggaggg
60atccaagctt ccaagcctct gctccgctct ccttctatcc agttggtctt tagggcactg
120aaggaaactc ttcttcagaa ataacctttt aacttttctt ctgtcagctg cctgccaatc
180acggagccag aggctgaggg gaggctttga gccggtctgc gagtccggaa ggcaaagatc
240gcgaagcttg gcgctccaga acgctcaggg ggcaggtgac acagtcgtgg gttccccggc
300gggcgctggc ttgacagttt cctccccgcc cactggcagg ggagcgcccc gccgggctgc
360acgcgcgcgc gcgcaggggg gcataaaagc cgcggccgcg cggagacgcg gagctcgccc
420accgcccgcc ccagcagtgg ctgcaccatg cacgtgaacg gcaaagtggc gctggtgacc
480ggcgcggctc agggcatagg cagagccttt gcagaggcgc tgctgcttaa gggcgccaag
540gtagcgctgg tggattggaa tcttgaagca ggtgtacagt gtaaagctgc cctggatgag
600cagtttgaac ctcagaagac tctgttcatc cagtgcgatg tggctgacca gcaacaactg
660agagacactt ttagaaaagt tgtagaccac tttggaagac tggacatttt ggtcaataat
720gctggagtga ataatgagaa aaactgggaa aaaactctgc aaattaattt ggtttctgtt
780atcagtggaa cctatcttgg tttggattac atgagtaagc aaaatggagg tgaaggcggc
840atcattatca atatgtcatc tttagcagga ctcatgcccg ttgcacagca gccggtttat
900tgtgcttcaa agcatggcat agttggattc acacgctcag cagcgttggc tgctaatctt
960atgaacagtg gtgtgagact gaatgccatt tgtccaggct ttgttaacac agccatcctt
1020gaatcaattg aaaaagaaga aaacatggga caatatatag aatataagga tcatatcaag
1080gatatgatta aatactatgg aattttggac ccaccattga ttgccaatgg attgataaca
1140ctcattgaag atgatgcttt aaatggtgct attatgaaga tcacaacttc taagggaatt
1200cattttcaag actatgatac aactccattt caagcaaaaa cccaatgaac agcttatgtg
1260ttagccatag ctgaaaataa gcacaaatag cttatattca gatcctatct tcatttgaat
1320atagctttta aatgaaatgt tacagtttga agttttcctt catgcacttg gtgataaacg
1380ttttctaaat ttttagttaa gtatatggat aaaaagttat gaactattaa aaatgtgatg
1440tggaccaaag gctaggttgt aatcttgata gtctaaaaaa tgatcaaaac aaatgatttt
1500caaggaatat tcaatattct gcctttcaga aagtgtattt atatctgtgc ttcataaata
1560ttaatgttct tcagaacatc attttaaagg agatacttga attgttattt aaatcaaacc
1620agatgtaaaa cactcacata caagttcata ctttaaaaga ggaaagctac ttaacaatga
1680caaatatttc acaataataa tttttactta tataccatct ttcaactgaa catttcagtt
1740cttccaagag cttcttagag tagtatattt tgggggcagt caaggaataa actacagtgt
1800aaacatatcc cagatgaaaa ctgctgtatg gaaaaatgac agaaagtaac tgattgacac
1860tgttgattca cagttcagcc tcctatctgg gaaagacatt tctttcctct gctcacttta
1920agaactttta ccgactccaa aaatctcagg aattaaactt ttaacagtta cagcaataaa
1980gaatagttag tactccaaaa atattatatt taagatgctc aacaagaaaa aaatgcaaat
2040gtaatatttt tttcaaatta cttctttatt gacttgtcca aatttcaaaa gtgcctaccc
2100ttcaataaaa cttttttatt ctgatctcca taaattactt agtcttctat gtatagctat
2160caaggaaata aaaccaattt tgccacagcc acaactgtaa atgtttttgt acccatgctg
2220aaactcataa caacacagac ataaaaatag ctgtgaggtt ttgctttttt tgttgtcagc
2280tatcttaaga atcattaaat acacctgctt tgggtaaaac tctttgcaag cagtaattaa
2340cactagtaac agtgaaagca caagatttcc aaatcagtcg ttttctcaaa aaaatatcgt
2400ataagtgact catcctgtct gctaactcca gacctcccag cttgaagcca aatctttcca
2460tgtgagattg atatggattt cctagaagta ctggaatgtt gtcatatctt gccctatttt
2520aattctgcta tagaaaacaa ttgccttcac ttttaaggag taatttgaat attaataact
2580ctggtctaga ttttcatata atgtattaaa gacaaagtag tgaacatcaa tgaacatctg
2640atagagataa actgtaatca ggcataagct tgtttgtatg ttctggcagt gactaatcag
2700taaatgatgt cggtttgccc agtatcactt atcttctgta tttttcctct gtcgtgtaaa
2760tagtataacc ttttcattta tggacaattt tttggactag tagccttcaa tatacattct
2820gctttgaatt aattttttca aatcaataaa ttatgtagac atttaaaatc aaatatcaag
2880tagaattgaa aaatgtgagt tacataagtt aaaaacttac tttaaatctt accttctata
2940ggtagctcta aataaattca tatggttata tggcatctct ggtgtatact gattgagaaa
3000ataattaaac tgaagttagg ggaggggaaa aaaaaaaaaa aaaa
3044553891DNAHomo sapiens 55gagagcgtag tggaggaggc gcggttgtga gtagtaccgg
gagtggggtg atcccgggct 60aggggagcgc ggcggccgcg atcgggctta gtcggagctc
cgaagggagt gactaggaca 120cccgggtggg ctacttttct tccggtgctt ttgctttttt
tttcctttgg gctcgggctg 180agtgtcgccc actgagcaaa gattccctcg taaaacccag
agcgaccctc ccgtcaattg 240ttgggctcgg gagtgtcgcg gtgccccgag cgcgccgggc
gcggaggcaa agggagcgga 300gccggccgcg gacggggccc ggagcttgcc tgcctccctc
gctcgcccca gcgggttcgc 360tcgcgtagag cgcagggcgc gcgcgatgaa ggcggtgagc
ccggtgcgcc cctcgggccg 420caaggcgccg tcgggctgcg gcggcgggga gctggcgctg
cgctgcctgg ccgagcacgg 480ccacagcctg ggtggctccg cagccgcggc ggcggcggcg
gcggcagcgc gctgtaaggc 540ggccgaggcg gcggccgacg agccggcgct gtgcctgcag
tgcgatatga acgactgcta 600tagccgcctg cggaggctgg tgcccaccat cccgcccaac
aagaaagtca gcaaagtgga 660gatcctgcag cacgttatcg actacatcct ggacctgcag
ctggcgctgg agacgcaccc 720ggccctgctg aggcagccac caccgcccgc gccgccacac
cacccggccg ggacctgtcc 780agccgcgccg ccgcggaccc cgctcactgc gctcaacacc
gacccggccg gcgcggtgaa 840caagcagggc gacagcattc tgtgccgctg agccgcgctg
tccaggtgtg cggccgcctg 900agcccgagcc aggagcacta gagagggagg gggaagagca
gaagttagag aaaaaaagcc 960accggaggaa aggaaaaaac atcggccaac ctagaaacgt
tttcattcgt cattccaaga 1020gagagagagg aaagaaaaat acaactttca ttctttcttt
gcacgttcat aaacattcta 1080catacgtatt ctcttttgtc tcttcattta taactgctgt
gaattgtaca tttctgtgtt 1140ttttggaggt gcagttaaac ttttaagctt aagtgtgaca
ggactgataa atagaagatc 1200aagagtagat ccgactttag aagcctactt tgtgaccaag
gagctcaatt tttgttttga 1260agctttacta atctaccaga gcattgtaga tatttttttt
ttacatctat tgtttaaaat 1320agatgattat aacggggcag agaactttct tttctctgca
agaatgttac atattgtata 1380gataaatgag tgacatttca taccatgtat atatagagat
gttctataag tgtgagaaag 1440tatatgcttt aatagatact gtaattataa gatattttta
attaaatatt tttttgtaaa 1500tattatgtgt gtgttttttt ttaatctatg ggaatatttc
ttttggaaaa tcatttttca 1560gctcaattac agagctcttg atatcttgaa tgtcttttct
gtttggcctg gctcttaatt 1620tgcttttgtt ttgcccagta tagactcgga agtaacagtt
atagctagtg gtcttgcatg 1680attgcatgag atgtttaatc acaaattaaa cttgttctga
gtccattcaa atgtgttttt 1740ttaaatgtag attgaaatct ttgtatttga agcatacatg
ttgaaaatac accttatcag 1800tttttaagta cagggtttta tagtgtaata tatacagagt
aagtgtttgt ttttgttttt 1860caactgaggt caaaatggat tctgaatgat tttgcatatg
ggatgaggaa atgcttggat 1920ccttaaggag tttacgaaat ctgctgtttt atcaaagtga
aaaaaaattg cttattactc 1980ttcattttac actaaagctt aatgtcacta agtttcatgt
ctgtacagat tatttaaatc 2040atggaaatga aaaaaatgtt ctctgcttgc taccaaagga
caaactcttg gaaatgaaca 2100ctttctgctt tccttcctcc aaagaattaa taggcaacag
tgggagaaaa aaaaggcata 2160atggcaaatc cttcaagcag ggataaaagt cgatcttcaa
acattaactt aagcagacca 2220aaaattctga tgaccgcatc tagattattt ttttataaaa
atgattttca ctatagctat 2280gttacgctaa gctactgtcc aatctcttgt gatgtgtaac
ttttacatgt gaatattaaa 2340gtagatttct ctgtcttgta ctgtgatttc tggtctcatt
tctttaaaac cttactctta 2400tttttctttt aaggctcttt tttctcctta aggaaggtaa
tattttctag gttagatagg 2460actatcaggg tttgtgaaca ttatgcattt aatgttatgg
gtactttaca cacaagttag 2520atggaatttt tagagtgaaa gaattaagta ggatttaatt
gggtgctttg taaatagtca 2580actgtgtgta taacgtggtc tgtttgattt ttaaaaggaa
aggatttgtt tcagattata 2640caagaataaa agtattatag acccaaggga cttcttatga
ggtcaaattc agatatttat 2700atgaatatga aataccatgg tccctagtag tcagttgaag
tggcaatgtc taaacagaaa 2760tgaacaaaac taatgctagc aggttaaaat caatcaaaat
gtttaaaaat tgattctgtc 2820ctcagcatgt tatttcctca gctctgataa tttactggtc
ttgagtattt tgagaatttg 2880atgttgaacg ttataaagtc aaagaactgc ttgtttagat
gaggtttatt tttatttttg 2940atattattca ttcttgtcac acatcaagaa gaaaacacta
gagtgctgct ggaattccaa 3000atctgaagaa ttctaacgac tgcattcttt gttattaaaa
agggcacaat ccttcctttt 3060tatttggcag tttaatttca gtaggaagca tgtcacatgt
gcactgttgg ttagaattat 3120gcatctgtca tgcctgactg ctgaacccta cctaagcctt
ttggcgcagt ttaaaactta 3180tactggtgga ctgtgaacct caaaacaaat gggtattttt
gggttttgag gatagatgtt 3240actccttaaa gtttgtattt ggggcatgaa aaactactga
aagaagaaaa gtgctacaga 3300tactacattt caaagagttg gcattttccc tttggccact
caagcagcat ttgatgtatc 3360taaagaaaca aagtcattgt ttatttttta aaaaattata
tgcagttgta caagatacta 3420cattccattg aaatgttggc tatgtcctaa ccaggcaacc
agataacaaa aacattttga 3480gtcttttatc taggtagttc taattattca gctacttagt
ttaacaaagg aaaatatcct 3540gacttctctc atttcatttg tagacttttc attgtatagg
cacaaccaaa gagtcagact 3600ggtttaaaac tccagaagga aaaaaagtat cccacacagt
ggatgttgtt tctaagaatg 3660ctacaaaatc ctgacatctc agacatctca atgttaaagg
aagaaaaaaa ataccttttc 3720atttcaaaga actaatatac tttgatattg tgtaaacctt
actcaagttt attgtcaagc 3780tttaactgcc tttttagaac tttttaaaat ttcgagccca
caaatctatt gtattagttg 3840ccttctataa caataaatct tcactgagca aaaggcaaaa
aaaaaaaaaa a 3891567370DNAHomo sapiens 56ttttgtagat aaatgtgagg
attttctcta aatccctctt ctgtttgcta aatctcactg 60tcactgctaa attcagagca
gatagagcct gcgcaatgga ataaagtcct caaaattgaa 120atgtgacatt gctctcaaca
tctcccatct ctctggattt ctttttgctt cattattcct 180gctaaccaat tcattttcag
actttgtact tcagaagcaa tgggaaaaat cagcagtctt 240ccaacccaat tatttaagtg
ctgcttttgt gatttcttga aggtgaagat gcacaccatg 300tcctcctcgc atctcttcta
cctggcgctg tgcctgctca ccttcaccag ctctgccacg 360gctggaccgg agacgctctg
cggggctgag ctggtggatg ctcttcagtt cgtgtgtgga 420gacaggggct tttatttcaa
caagcccaca gggtatggct ccagcagtcg gagggcgcct 480cagacaggca tcgtggatga
gtgctgcttc cggagctgtg atctaaggag gctggagatg 540tattgcgcac ccctcaagcc
tgccaagtca gctcgctctg tccgtgccca gcgccacacc 600gacatgccca agacccagaa
gtatcagccc ccatctacca acaagaacac gaagtctcag 660agaaggaaag gaagtacatt
tgaagaacgc aagtagaggg agtgcaggaa acaagaacta 720caggatgtag gaagaccctc
ctgaggagtg aagagtgaca tgccaccgca ggatcctttg 780ctctgcacga gttacctgtt
aaactttgga acacctacca aaaaataagt ttgataacat 840ttaaaagatg ggcgtttccc
ccaatgaaat acacaagtaa acattccaac attgtcttta 900ggagtgattt gcaccttgca
aaaatggtcc tggagttggt agattgctgt tgatctttta 960tcaataatgt tctatagaaa
agaaaaaaaa aatatatata tatatatatc ttagtccctg 1020cctctcaaga gccacaaatg
catgggtgtt gtatagatcc agttgcacta aattcctctc 1080tgaatcttgg ctgctggagc
cattcattca gcaaccttgt ctaagtggtt tatgaattgt 1140ttccttattt gcacttcttt
ctacacaact cgggctgttt gttttacagt gtctgataat 1200cttgttagtc tatacccacc
acctcccttc ataaccttta tatttgccga atttggcctc 1260ctcaaaagca gcagcaagtc
gtcaagaagc acaccaattc taacccacaa gattccatct 1320gtggcatttg taccaaatat
aagttggatg cattttattt tagacacaaa gctttatttt 1380tccacatcat gcttacaaaa
aagaataatg caaatagttg caactttgag gccaatcatt 1440tttaggcata tgttttaaac
atagaaagtt tcttcaactc aaaagagttc cttcaaatga 1500tgagttaatg tgcaacctaa
ttagtaactt tcctcttttt attttttcca tatagagcac 1560tatgtaaatt tagcatatca
attatacagg atatatcaaa cagtatgtaa aactctgttt 1620tttagtataa tggtgctatt
ttgtagtttg ttatatgaaa gagtctggcc aaaacggtaa 1680tacgtgaaag caaaacaata
ggggaagcct ggagccaaag atgacacaag gggaagggta 1740ctgaaaacac catccatttg
ggaaagaagg caaagtcccc ccagttatgc cttccaagag 1800gaacttcaga cacaaaagtc
cactgatgca aattggactg gcgagtccag agaggaaact 1860gtggaatgga aaaagcagaa
ggctaggaat tttagcagtc ctggtttctt tttctcatgg 1920aagaaatgaa catctgccag
ctgtgtcatg gactcaccac tgtgtgacct tgggcaagtc 1980acttcacctc tctgtgcctc
agtttcctca tctgcaaaat gggggcaata tgtcatctac 2040ctacctcaaa ggggtggtat
aaggtttaaa aagataaaga ttcagatttt ttttaccctg 2100ggttgctgta agggtgcaac
atcagggcgc ttgagttgct gagatgcaag gaattctata 2160aataacccat tcatagcata
gctagagatt ggtgaattga atgctcctga catctcagtt 2220cttgtcagtg aagctatcca
aataactggc caactagttg ttaaaagcta acagctcaat 2280ctcttaaaac acttttcaaa
atatgtggga agcatttgat tttcaatttg attttgaatt 2340ctgcatttgg ttttatgaat
acaaagataa gtgaaaagag agaaaggaaa agaaaaagga 2400gaaaaacaaa gagatttcta
ccagtgaaag gggaattaat tactctttgt tagcactcac 2460tgactcttct atgcagttac
tacatatcta gtaaaacctc gtttaatact ataaataata 2520ttctattcat tttgaaaaac
acaatgattc cttcttttct aggcaatata aggaaagtga 2580tccaaaattt gaaatattaa
aataatatct aataaaaagt cacaaagtta tcttctttaa 2640caaactttac tcttattctt
agctgtatat acattttttt aaaagtttgt taaaatatgc 2700ttgactagag tttccagttg
aaaggcaaaa acttccatca caacaagaaa tttcccatgc 2760ctgctcagaa gggtagcccc
tagctctctg tgaatgtgtt ttatccattc aactgaaaat 2820tggtatcaag aaagtccact
ggttagtgta ctagtccatc atagcctaga aaatgatccc 2880tatctgcaga tcaagatttt
ctcattagaa caatgaatta tccagcattc agatctttct 2940agtcacctta gaactttttg
gttaaaagta cccaggcttg attatttcat gcaaattcta 3000tattttacat tcttggaaag
tctatatgaa aaacaaaaat aacatcttca gtttttctcc 3060cactgggtca cctcaaggat
cagaggccag gaaaaaaaaa aaaaagactc cctggatctc 3120tgaatatatg caaaaagaag
gccccattta gtggagccag caatcctgtt cagtcaacaa 3180gtattttaac tctcagtcca
acattatttg aattgagcac ctcaagcatg cttagcaatg 3240ttctaatcac tatggacaga
tgtaaaagaa actatacatc atttttgccc tctgcctgtt 3300ttccagacat acaggttctg
tggaataaga tactggactc ctcttcccaa gatggcactt 3360ctttttattt cttgtcccca
gtgtgtacct tttaaaatta ttccctctca acaaaacttt 3420ataggcagtc ttctgcagac
ttaacgtgtt ttctgtcata gttagatgtg ataattctaa 3480gagtgtctat gacttatttc
cttcacttaa ttctatccac agtcaaaaat cccccaagga 3540ggaaagctga aagatgcact
gccatattat ctttcttaac tttttccaac acataatcct 3600ctccaactgg attataaata
aattgaaaat aactcattat accaattcac tattttattt 3660tttaatgaat taaaactaga
aaacaaattg atgcaaaccc tggaagtcag ttgattacta 3720tatactacag cagaatgact
cagatttcat agaaaggagc aaccaaaatg tcacaaccca 3780aaactttaca agctttgctt
cagaattaga ttgctttata attcttgaat gaggcaattt 3840caagatattt gtaaaagaac
agtaaacatt ggtaagaatg agctttcaac tcataggctt 3900atttccaatt taattgacca
tactggatac ttaggtcaaa tttctgttct ctcttcccca 3960aataatatta aagtattatt
tgaacttttt aagatgaggc agttcccctg aaaaagttaa 4020tgcagctctc catcagaatc
cactcttcta gggatatgaa aatctcttaa cacccaccct 4080acatacacag acacacacac
acacacacac acacacacac acacacacat tcaccctaag 4140gatccaatgg aatactgaaa
agaaatcact tccttgaaaa ttttattaaa aaacaaacaa 4200acaaacaaaa agcctgtcca
cccttgagaa tccttcctct ccttggaacg tcaatgtttg 4260tgtagatgaa accatctcat
gctctgtggc tccagggttt ctgttactat tttatgcact 4320tgggagaagg cttagaataa
aagatgtagc acattttgct ttcccattta ttgtttggcc 4380agctatgcca atgtggtgct
attgtttctt taagaaagta cttgactaaa aaaaaaagaa 4440aaaaagaaaa aaaagaaagc
atagacatat ttttttaaag tataaaaaca acaattctat 4500agatagatgg cttaataaaa
tagcattagg tctatctagc caccaccacc tttcaacttt 4560ttatcactca caagtagtgt
actgttcacc aaattgtgaa tttgggggtg caggggcagg 4620agttggaaat tttttaaagt
tagaaggctc cattgttttg ttggctctca aacttagcaa 4680aattagcaat atattatcca
atcttctgaa cttgatcaag agcatggaga ataaacgcgg 4740gaaaaaagat cttataggca
aatagaagaa tttaaaagat aagtaagttc cttattgatt 4800tttgtgcact ctgctctaaa
acagatattc agcaagtgga gaaaataaga acaaagagaa 4860aaaatacata gatttacctg
caaaaaatag cttctgccaa atcccccttg ggtattcttt 4920ggcatttact ggtttataga
agacattctc ccttcaccca gacatctcaa agagcagtag 4980ctctcatgaa aagcaatcac
tgatctcatt tgggaaatgt tggaaagtat ttccttatga 5040gatgggggtt atctactgat
aaagaaagaa tttatgagaa attgttgaaa gagatggcta 5100acaatctgtg aagatttttt
gtttcttgtt tttgtttttt tttttttttt actttataca 5160gtctttatga atttcttaat
gttcaaaatg acttggttct tttcttcttt ttttatatca 5220gaatgaggaa taataagtta
aacccacata gactctttaa aactataggc tagatagaaa 5280tgtatgtttg acttgttgaa
gctataatca gactatttaa aatgttttgc tatttttaat 5340cttaaaagat tgtgctaatt
tattagagca gaacctgttt ggctctcctc agaagaaaga 5400atctttccat tcaaatcaca
tggctttcca ccaatatttt caaaagataa atctgattta 5460tgcaatggca tcatttattt
taaaacagaa gaattgtgaa agtttatgcc cctcccttgc 5520aaagaccata aagtccagat
ctggtagggg ggcaacaaca aaaggaaaat gttgttgatt 5580cttggttttg gattttgttt
tgttttcaat gctagtgttt aatcctgtag tacatatttg 5640cttattgcta ttttaatatt
ttataagacc ttcctgttag gtattagaaa gtgatacata 5700gatatctttt ttgtgtaatt
tctatttaaa aaagagagaa gactgtcaga agctttaagt 5760gcatatggta caggataaag
atatcaattt aaataaccaa ttcctatctg gaacaatgct 5820tttgtttttt aaagaaacct
ctcacagata agacagaggc ccaggggatt tttgaagctg 5880tctttattct gcccccatcc
caacccagcc cttattattt tagtatctgc ctcagaattt 5940tatagagggc tgaccaagct
gaaactctag aattaaagga acctcactga aaacatatat 6000ttcacgtgtt ccctcttttt
ttttttcctt tttgtgagat ggggtctcgc actgtccccc 6060aggctggagt gcagtggcat
gatctcggct cactgcaacc tccacctcct gggtttaagc 6120gattctcctg cctcagcctc
ctgagtagct gggattacag gcacccacca ctatgcccgg 6180ctaatttttt ggatttttaa
tagagacggg gttttaccat gttggccagg ttggtctcaa 6240actcctgacc ttgtgatttg
cccgcctcag cctcccaaat tgctgggatt acaggcatga 6300gccaccacac cctgcccatg
tgttccctct taatgtatga ttacatggat cttaaacatg 6360atccttctct cctcattctt
caactatctt tgatggggtc tttcaagggg aaaaaaatcc 6420aagctttttt aaagtaaaaa
aaaaaaaaga gaggacacaa aaccaaatgt tactgctcaa 6480ctgaaatatg agttaagatg
gagacagagt ttctcctaat aaccggagct gaattacctt 6540tcactttcaa aaacatgacc
ttccacaatc cttagaatct gccttttttt atattactga 6600ggcctaaaag taaacattac
tcattttatt ttgcccaaaa tgcactgatg taaagtagga 6660aaaataaaaa cagagctcta
aaatcccttt caagccaccc attgacccca ctcaccaact 6720catagcaaag tcacttctgt
taatccctta atctgatttt gtttggatat ttatcttgta 6780cccgctgcta aacacactgc
aggagggact ctgaaacctc aagctgtcta cttacatctt 6840ttatctgtgt ctgtgtatca
tgaaaatgtc tattcaaaat atcaaaacct ttcaaatatc 6900acgcagctta tattcagttt
acataaaggc cccaaatacc atgtcagatc tttttggtaa 6960aagagttaat gaactatgag
aattgggatt acatcatgta ttttgcctca tgtattttta 7020tcacacttat aggccaagtg
tgataaataa acttacagac actgaattaa tttcccctgc 7080tactttgaaa ccagaaaata
atgactggcc attcgttaca tctgtcttag ttgaaaagca 7140tattttttat taaattaatt
ctgattgtat ttgaaattat tattcaattc acttatggca 7200gaggaatatc aatcctaatg
acttctaaaa atgtaactaa ttgaatcatt atcttacatt 7260tactgtttaa taagcatatt
ttgaaaatgt atggctagag tgtcataata aaatggtata 7320tctttcttta gtaattacat
taaaattagt catgtttgat taattagttc 7370572171DNAHomo sapiens
57aaagttacat tttctctgga actctcctag gccactccct gctgatgcaa catctgggtt
60tgggcagaaa ggagggtgct tcggagcccg ccctttctga gcttcctggg ccggctctag
120aacaattcag gcttcgctgc gactcagacc tcagctccaa catatgcatt ctgaagaaag
180atggctgaga tggacagaat gctttatttt ggaaagaaac aatgttctag gtcaaactga
240gtctaccaaa tgcagacttt cacaatggtt ctagaagaaa tctggacaag tcttttcatg
300tggtttttct acgcattgat tccatgtttg ctcacagatg aagtggccat tctgcctgcc
360cctcagaacc tctctgtact ctcaaccaac atgaagcatc tcttgatgtg gagcccagtg
420atcgcgcctg gagaaacagt gtactattct gtcgaatacc agggggagta cgagagcctg
480tacacgagcc acatctggat ccccagcagc tggtgctcac tcactgaagg tcctgagtgt
540gatgtcactg atgacatcac ggccactgtg ccatacaacc ttcgtgtcag ggccacattg
600ggctcacaga cctcagcctg gagcatcctg aagcatccct ttaatagaaa ctcaaccatc
660cttacccgac ctgggatgga gatcaccaaa gatggcttcc acctggttat tgagctggag
720gacctggggc cccagtttga gttccttgtg gcctactgga ggagggagcc tggtgccgag
780gaacatgtca aaatggtgag gagtgggggt attccagtgc acctagaaac catggagcca
840ggggctgcat actgtgtgaa ggcccagaca ttcgtgaagg ccattgggag gtacagcgcc
900ttcagccaga cagaatgtgt ggaggtgcaa ggagaggcca ttcccctggt actggccctg
960tttgcctttg ttggcttcat gctgatcctt gtggtcgtgc cactgttcgt ctggaaaatg
1020ggccggctgc tccagtactc ctgttgcccc gtggtggtcc tcccagacac cttgaaaata
1080accaattcac cccagaagtt aatcagctgc agaagggagg aggtggatgc ctgtgccacg
1140gctgtgatgt ctcctgagga actcctcagg gcctggatct cataggtttg cggaagggcc
1200caggtgaagc cgagaacctg gtctgcatga catggaaacc atgaggggac aagttgtgtt
1260tctgttttcc gccacggaca agggatgaga gaagtaggaa gagcctgttg tctacaagtc
1320tagaagcaac catcagaggc agggtggttt gtctaacaga acactgactg aggcttaggg
1380gatgtgacct ctagactggg ggctgccact tgctggctga gcaaccctgg gaaaagtgac
1440ttcatccctt cggtcctaag ttttctcatc tgtaatgggg gaattaccta cacacctgct
1500aaacacacac acacagagtc tctctctata tatacacacg tacacataaa tacacccagc
1560acttgcaagg ctagagggaa actggtgaca ctctacagtc tgactgattc agtgtttctg
1620gagagcagga cataaatgta tgatgagaat gatcaaggac tctacacact gggtggcttg
1680gagagcccac tttcccagaa taatccttga gagaaaagga atcatgggag caatggtgtt
1740gagttcactt caagcccaat gccggtgcag aggggaatgg cttagcgagc tctacagtag
1800gtgacctgga ggaaggtcac agccacactg aaaatgggat gtgcatgaac acggaggatc
1860catgaactac tgtaaagtgt tgacagtgtg tgcacactgc agacagcagg tgaaatgtat
1920gtgtgcaatg cgacgagaat gcagaagtca gtaacatgtg catgtttgtt gtgctccttt
1980tttctgttgg taaagtacag aattcagcaa ataaaaaggg ccaccctggc caaaagcggt
2040ctttaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
2100aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
2160aaaaaaaaaa a
2171582175DNAHomo sapiens 58agtacagtat aaaacttcac agtgccaata ccatgaagag
gagctcagac agctcttacc 60acatgataca agagccggct ggtggaagag tggggaccag
aaagagaatt tgctgaagag 120gagaaggaaa aaaaaaacac caaaaaaaaa aataaaaaaa
tccacacaca caaaaaaacc 180tgcgcgtgag gggggaggaa aagcagggcc ttttaaaaag
gcaatcacaa caacttttgc 240tgccaggatg cccttgcttt ggctgagagg atttctgttg
gcaagttgct ggattatagt 300gaggagttcc cccaccccag gatccgaggg gcacagcgcg
gcccccgact gtccgtcctg 360tgcgctggcc gccctcccaa aggatgtacc caactctcag
ccagagatgg tggaggccgt 420caagaagcac attttaaaca tgctgcactt gaagaagaga
cccgatgtca cccagccggt 480acccaaggcg gcgcttctga acgcgatcag aaagcttcat
gtgggcaaag tcggggagaa 540cgggtatgtg gagatagagg atgacattgg aaggagggca
gaaatgaatg aacttatgga 600gcagacctcg gagatcatca cgtttgccga gtcaggaaca
gccaggaaga cgctgcactt 660cgagatttcc aaggaaggca gtgacctgtc agtggtggag
cgtgcagaag tctggctctt 720cctaaaagtc cccaaggcca acaggaccag gaccaaagtc
accatccgcc tcttccagca 780gcagaagcac ccgcagggca gcttggacac aggggaagag
gccgaggaag tgggcttaaa 840gggggagagg agtgaactgt tgctctctga aaaagtagta
gacgctcgga agagcacctg 900gcatgtcttc cctgtctcca gcagcatcca gcggttgctg
gaccagggca agagctccct 960ggacgttcgg attgcctgtg agcagtgcca ggagagtggc
gccagcttgg ttctcctggg 1020caagaagaag aagaaagaag aggaggggga agggaaaaag
aagggcggag gtgaaggtgg 1080ggcaggagca gatgaggaaa aggagcagtc gcacagacct
ttcctcatgc tgcaggcccg 1140gcagtctgaa gaccaccctc atcgccggcg tcggcggggc
ttggagtgtg atggcaaggt 1200caacatctgc tgtaagaaac agttctttgt cagtttcaag
gacatcggct ggaatgactg 1260gatcattgct ccctctggct atcatgccaa ctactgcgag
ggtgagtgcc cgagccatat 1320agcaggcacg tccgggtcct cactgtcctt ccactcaaca
gtcatcaacc actaccgcat 1380gcggggccat agcccctttg ccaacctcaa atcgtgctgt
gtgcccacca agctgagacc 1440catgtccatg ttgtactatg atgatggtca aaacatcatc
aaaaaggaca ttcagaacat 1500gatcgtggag gagtgtgggt gctcatagag ttgcccagcc
cagggggaaa gggagcaaga 1560gttgtccaga gaagacagtg gcaaaatgaa gaaattttta
aggtttctga gttaaccaga 1620aaaatagaaa ttaaaaacaa aacaaaaaaa aaaacaaaaa
aaaacaaaag taaattaaaa 1680acaaaacctg atgaaacaga tgaaggaaga tgtggaaaaa
atccttagcc agggctcaga 1740gatgaagcag tgaaagagac aggaattggg agggaaaggg
agaatggtgt accctttatt 1800tcttctgaaa tcacactgat gacatcagtt gtttaaacgg
ggtattgtcc tttcccccct 1860tgaggttccc ttgtgagcct tgaatcaacc aatctagtct
gcagtagtgt ggactagaac 1920aacccaaata gcatctagaa agccatgagt ttgaaagggc
ccatcacagg cactttccta 1980cccaattacc caggtcataa ggtatgtctg tgtgacactt
atctctgtgt atatcagcat 2040acacacacac acacacacac acacacacac acacaggcat
ttccacacat tacatatata 2100cacatactgg taaaagaaca atcgtgtgca ggtggtcaca
cttccttttt ctgtaccact 2160tttgcaacaa aacaa
2175595035DNAHomo sapiens 59gcaggcgccg cgccgaggag
gctgccgctc tggcttgccg ccccccgccg ccgctgcaca 60ccggacccag ccgccgtgcc
gcgggccatg gacctgccca ggggcctggt ggtggcctgg 120gcgctcagcc tgtggccagg
gttcacggac accttcaaca tggacaccag gaagccccgg 180gtcatccctg gctccaggac
cgccttcttt ggctacacag tgcagcagca cgacatcagt 240ggcaataagt ggctggtcgt
gggcgcccca ctggaaacca atggctacca gaagacggga 300gacgtgtaca agtgtccagt
gatccacggg aactgcacca aactcaacct gggaagggtc 360accctgtcca acgtgtccga
gcggaaagac aacatgcgcc tcggccttag tctcgccacc 420aaccccaagg acaacagctt
cctggcctgc agccccctct ggtctcatga gtgtgggagc 480tcctactaca ccacagggat
gtgttcaaga gtcaactcca acttcaggtt ctccaagacc 540gtggccccag ctctccaaag
gtgccagacc tacatggaca tcgtcattgt cctggatggc 600tccaacagca tctacccctg
ggtggaggtt cagcacttcc tcatcaacat cctgaaaaag 660ttttacattg gcccagggca
gatccaggtt ggagttgtgc agtatggcga agatgtggtg 720catgagtttc acctcaacga
ctacaggtct gtaaaagatg tggtggaagc tgccagccac 780attgagcaga gaggaggaac
agagacccgg acggcatttg gcattgaatt tgcacgctca 840gaggctttcc agaagggtgg
aaggaaagga gccaagaagg tgatgattgt catcacagat 900ggggagtccc acgacagccc
agacctggag aaggtgatcc agcaaagcga aagagacaac 960gtaacaagat atgcggtggc
cgtcctgggc tactacaacc gcagggggat caatccagaa 1020acttttctaa atgaaatcaa
atacatcgcc agtgaccctg atgacaagca cttcttcaat 1080gtcactgatg aggctgcctt
gaaggacatt gtcgatgccc tgggggacag aatcttcagc 1140ctggaaggca ccaacaagaa
cgagacctcc tttgggctgg agatgtcaca gacgggcttt 1200tcctcgcacg tggtggagga
tggggttctg ctgggagccg tcggtgccta tgactggaat 1260ggagctgtgc taaaggagac
gagtgccggg aaggtcattc ctctccgcga gtcctacctg 1320aaagagttcc ccgaggagct
caagaaccat ggtgcatacc tggggtacac agtcacatcg 1380gtcgtgtcct ccaggcaggg
gcgggtgtac gtggccggag ccccccggtt caaccacacg 1440ggcaaggtca tcctgttcac
catgcacaac aaccggagcc tcaccatcca ccaggctatg 1500cggggccagc agataggctc
ttactttggg agtgaaatca cctcggtgga catcgacggc 1560gacggcgtga ctgatgtcct
gctggtgggc gcacccatgt acttcaacga gggccgtgag 1620cgaggcaagg tgtacgtcta
tgagctgaga cagaacctgt ttgtttataa cggaacgcta 1680aaggattcac acagttacca
gaatgcccga tttgggtcct ccattgcctc agttcgagac 1740ctcaaccagg attcctacaa
tgacgtggtg gtgggagccc ccctggagga caaccacgca 1800ggagccatct acatcttcca
cggcttccga ggcagcatcc tgaagacacc taagcagaga 1860atcacagcct cagagctggc
taccggcctc cagtattttg gctgcagcat ccacgggcaa 1920ttggacctca atgaggatgg
gctcatcgac ctggcagtgg gagcccttgg caacgctgtg 1980attctgtggt cccgcccagt
ggttcagatc aatgccagcc tccactttga gccatccaag 2040atcaacatct tccacagaga
ctgcaagcgc agtggcaggg atgccacctg cctggccgcc 2100ttcctctgct tcacgcccat
cttcctggca ccccatttcc aaacaacaac tgttggcatc 2160agatacaacg ccaccatgga
tgagaggcgg tatacaccga gggcccacct ggacgagggc 2220ggggaccgat tcaccaacag
agccgtactg ctctcctccg gccaggagct ctgtgagcgg 2280atcaacttcc atgtcctgga
cactgctgac tacgtgaagc cagtgacctt ctcagtcgag 2340tattccctgg aggaccctga
ccatggcccc atgctggacg acggctggcc caccactctc 2400agagtctcgg tgcccttctg
gaacggctgc aatgaggatg agcactgtgt ccctgacctt 2460gtgttggatg cccggagtga
cctgcccacg gccatggagt actgccagag ggtgctgagg 2520aagcctgcgc aggactgctc
cgcatacacg ctgtccttcg acaccacagt cttcatcata 2580gagagcacac gccagcgagt
ggcggtggag gccacactgg agaacagggg cgagaacgcc 2640tacagcacgg tcctaaatat
ctcgcagtca gcaaacctgc agtttgccag cttgatccag 2700aaggaggact cagacggtag
cattgagtgt gtgaacgagg agaggaggct ccagaagcaa 2760gtctgcaacg tcagctatcc
cttcttccgg gccaaggcca aggtggcttt ccgtcttgat 2820tttgagttca gcaaatccat
cttcctacac cacctggaga tcgagctcgc tgcaggcagt 2880gacagtaatg agcgggacag
caccaaggaa gacaacgtgg cccccttacg cttccacctc 2940aaatacgagg ctgacgtcct
cttcaccagg agcagcagcc tgagccacta cgaggtcaag 3000cccaacagct cgctggagag
atacgatggt atcgggcctc ccttcagctg catcttcagg 3060atccagaact tgggcttgtt
ccccatccac gggatgatga tgaagatcac cattcccatc 3120gccaccagga gcggcaaccg
cctactgaag ctgagggact tcctcacgga cgaggcgaac 3180acgtcctgta acatctgggg
caatagcact gagtaccggc ccaccccagt ggaggaagac 3240ttgcgtcgtg ctccacagct
gaatcacagc aactctgatg tcgtctccat caactgcaat 3300atacggctgg tccccaacca
ggaaatcaat ttccatctac tggggaacct gtggttgagg 3360tccctaaaag cactcaagta
caaatccatg aaaatcatgg tcaacgcagc cttgcagagg 3420cagttccaca gccccttcat
cttccgtgag gaggatccca gccgccagat cgtgtttgag 3480atctccaagc aagaggactg
gcaggtcccc atctggatca ttgtaggcag caccctgggg 3540ggcctcctac tgctggccct
gctggtcctg gcactgtgga agctcggctt ctttagaagt 3600gccaggcgca ggagggagcc
tggtctggac cccaccccca aagtgctgga gtgaggctcc 3660agaggagact ttgagttgat
gggggccagg acaccagtcc aggtagtgtt gagacccagg 3720cctgtggccc caccgagctg
gagcggagag gaagccagct ggctttgcac ttgacctcat 3780ctcccgagca atggcgcctg
ctccctccag aatggaactc aagctggttt taagtggaac 3840tgccctactg ggagactggg
acacctttaa cacagacccc tagggattta aagggacacc 3900cctacacaca cccaggccca
tgccaaggcc tccctcaggc tctgtggagg gcatttgctg 3960ccccagctac taaggtgcta
ggaattcgta atcatcccca tcctccagag aaacccaggg 4020aggaagactg taaatacgaa
cccaatctgc acactccagg cctctagttc cagaaggatc 4080caagacaaaa cagatctgaa
ttctgccctt ttctctcacc catcccaccc ctccattggc 4140tcccaagtca cacccactcc
cttccccata gataggcccc tggggctccc gaagaatgaa 4200cccaagagca agggcttgat
ggtgacagct gcaagccagg gatgaagaaa gactctgaga 4260tgtggagact gatggccagg
caagtgggac caggatactg gacgctgtcc tgagatgaga 4320ggtagccggg ctctgcaccc
acgtgcattc acattgaccg caactcacac attcccccac 4380cagctgcagc cccttgctct
cagctgccaa ccctcccggg tcacttttgt tcccaggtac 4440ctcatgggaa gcatgtggat
gacacaatcc ctggggctgt gcattcccac gtcttcttgc 4500tgcagcctgc ccctagacat
ggacgcaccg gcctggctgc agctgggcag caggggtagg 4560ggtagggagc ctcccctccc
tgtatcaccc cctccctaca cacacacaca cacacacaca 4620cacacacaca cacacacaca
cacacacaca cacacactgc ctcccatcct tccctcatgc 4680ccgccagtgc acagggaagg
gcttggccag cgctgttgag gggtcccctc tggaatgcac 4740tgaataaagc acgtgcaagg
actcccggag cctgtgcagc cttggtggca aatatctcat 4800ctgccggccc ccaggacaag
tggtatgacc agtgataatg ccccaaggac aaggggcgtg 4860cctggcgccc agtggagtaa
tttatgcctt agtcttgttt tgaggtagaa atgcaagggg 4920gacacatgaa aggcatcagt
ccccctgtgc atagtacgac ctttactgtc gtatttttga 4980aaaattaaaa atacagtgtt
taaaaacaaa aaaaaaaaaa aaaaaaaaaa aaaaa 5035603070DNAHomo sapiens
60gagcccagag ccagagagcg cgctgggcgg tgctgggcac ccgcggagtg gaacggggct
60ggtggaatgc acagggtcgc agcgcttggg ccaccctcgg tcagagggcg ccgtgtccag
120cgagcaaacg ggcgccccgg agccttgctg agaggcagct ctgggctttc ccagctccga
180agtcaatact gagatcccag atgtgtccag agacatcctg aagaggctcg ggggtggagg
240agccttagtg tgtccacaaa gggactcctg aaactgactg agagccagtg gatttgccag
300cagtctgagc ttctaccgag tcttccccca cctcaatccc tgttgctatg gagactacca
360atggaacgga gacctggtat gagagcctgc atgccgtgct gaaggctcta aatgccactc
420ttcacagcaa tttgctctgc cggccagggc cagggctggg gccagacaac cagactgaag
480agaggcgggc cagcctacct ggccgtgatg acaactccta catgtacatt ctctttgtca
540tgtttctatt tgctgtaact gtgggcagcc tcatcctggg atacacccgc tcccgcaaag
600tggacaagcg tagtgacccc tatcatgtgt atatcaagaa ccgtgtgtct atgatctaac
660acgagagggc tgggacggtg gaagaccaag acacctgggg attgcgtctg gggcctccag
720aactctgctg tggactgcat caggtctcag tgtccctatc tgtaagatca acaagaaaca
780cggttaaggg aggtcgtcac tggggtggga gaagaggggc tggtagaccg aagccttgtg
840cataaggatt ttttcccagg aaaagataga ctttataaac agtgggagcc catgaacaaa
900catataaaag tagcaacaga taatgaccaa taactggttc agtggctgga gtattagggg
960cctggggatt ggagaacgga gaagaagttg tagcagaggg aaatgagaca ggaagatgct
1020ctggggacac attttttatg tgttatcttc agccatgaga agcagtgatg actatcccat
1080atcacagata tgatttacca ccaccaccct gcccccgctc ccgtgaagaa agcagggcaa
1140gtgctgtgct gcccatttgg gcctgcatag tgccatgatt ggaacccagg aactctggtc
1200tccttgccta gtgcttttca aaactctgtg ctacacagga gtggatccag gcctgaaggt
1260catacaattc tggggactct ctttaagaaa aagaattcta aaatatctta cttttgcaaa
1320cattatgaaa atatactgcc acattaatat gttgctaggg cccctgctag gaccttaaga
1380aggagctcat gtgagtcagg accctgaatg ttaggcctcg ttagctctat ggttcatatg
1440cttcttgaac caagtcacag ggcacttccc agccacattg ccaggcaaca ggactaaact
1500acctccaaag caagcagtct tttcagtttt gactgagtga tgtgagaaac ttcttttctt
1560ttcttttctt tttttttttt tgagacagtc tccctatgtc acccaggctg tggtgcagca
1620acccaatctt ggctcactgc aacccccacc tcccgggttc aagcaattat cctgcctcag
1680ccacctgagt agctgggatt acaggttcct gtcaccacac ccagttaatt tatatatata
1740tatatatata tatatttaag tagagacagg gtttcacatg ttgcccaggc tggtctcgaa
1800ctcctgtcct caagttatct gcccattttg gtctcccaaa gtgctgggat tacaagtgta
1860agccaccacg actatctgag agaagttttc tgatgtcatg ttgaatctgc ttctaaaaga
1920ctgatactgc caaggtgggc ggatcacctg aggtcaggag ttcgagacca gcctggccaa
1980catggtgaaa ccccatctac taaaaaaata caaaaattag ccagacctgg tggcgggtgc
2040ccgtattccc agctacttgg gaggctgagg caggagaatt gtttgaaccc gggaggtgga
2100ggttgcagta agccaagatc acgccactgc actccagcct gggtgacaga gcaaggctct
2160gtctcaaaaa aaaacaaaaa caaaaacaaa aaagactgat atcgcaccta aattattatt
2220atattaaaag aagcagagta tgagagacag gtacatggtc cagtaggaag agaagcagcc
2280ctgattctac cacttaaggt gatgtatgat cttaggctgg acacttctct ccctcatccg
2340ttttcctctt caacataatg aaatagactt gaaagtctct aaggctctat cagttctgac
2400attctaggct tcatatacat taagttgagc catatgtaat cactgtgttt gtaggttaga
2460aacagctgag tatcgtagtt tcatatatgg ttccagctaa tacatgcaat gtggctggtg
2520aacacttctg aattcagaaa ctatcccaga tctcagctag aaccatccac tgttctgttt
2580gtccagtttc aacttaaggg atctccatgc ggtccctgga agtacccatt gaaacatgcg
2640tatttgtgta tagcagaact ctgaaataat attctgacag cagttatctc tgaggaattg
2700ggttataggt gattttccct ttccgcatga taaatttatg taatatttga ctgacttgac
2760cgtaagtatg ttacttgtat aataaaagga aaaaaggtac ttctattttg aaaaaataaa
2820aataaaagcc tttgggttct tgaatggagg atcatggaac acatttgctg ccatatgcag
2880ttatgttgat gctctgcaaa cctgtgctga gccctgttgc tcaagccctt cctcatctct
2940tcttgaggga gaaggtggag acttccttaa ggagatgtga catatgggaa gacaacagat
3000tcagaaattt acgtggatag gactttagac accacccagc ccaaacttcc aaataaaata
3060tggaacgcaa
3070612450DNAHomo sapiens 61atatttcata cctttctaga aactgggtgt gatctcactg
ttggtaaagc ccagcccttc 60ccaacctgca agctcacctt ccaggactgg gcccagccca
tgctctccat atataagctg 120ctgccccgag cctgattcct agtcctgctt ctcttccctc
tctcctccag cctctcacac 180tctcctcagc tctctcatct cctggaacca tggccagcac
atccaccacc atcaggagcc 240acagcagcag ccgccggggt ttcagtgcca actcagccag
gctccctggg gtcagccgct 300ctggcttcag cagcgtctcc gtgtcccgct ccaggggcag
tggtggcctg ggtggtgcat 360gtggaggagc tggctttggc agccgcagtc tgtatggcct
ggggggctcc aagaggatct 420ccattggagg gggcagctgt gccatcagtg gcggctatgg
cagcagagcc ggaggcagct 480atggctttgg tggcgccggg agtggatttg gtttcggtgg
tggagccggc attggctttg 540gtctgggtgg tggagccggc cttgctggtg gctttggggg
ccctggcttc cctgtgtgcc 600cccctggagg catccaagag gtcaccgtca accagagtct
cctgactccc ctcaacctgc 660aaatcgatcc caccatccag cgggtgcggg ctgaggagcg
tgaacagatc aagaccctca 720acaacaagtt tgcctccttc atcgacaagg tgcggttcct
ggagcagcag aacaaggttc 780tggaaacaaa gtggaccctg ctgcaggagc agggcaccaa
gactgtgagg cagaacctgg 840agccgttgtt cgagcagtac atcaacaacc tcaggaggca
gctggacagc attgtcgggg 900aacggggccg cctggactca gagctcagag gcatgcagga
cctggtggag gacttcaaga 960acaaatatga ggatgaaatc aacaagcgca cagcagcaga
gaatgaattt gtgactctga 1020agaaggatgt ggatgctgcc tacatgaaca aggttgaact
gcaagccaag gcagacactc 1080tcacagacga gatcaacttc ctgagagcct tgtatgatgc
agagctgtcc cagatgcaga 1140cccacatctc agacacatct gtggtgctgt ccatggacaa
caaccgcaac ctggacctgg 1200acagcatcat cgctgaggtc aaggcccaat atgaggagat
tgctcagaga agccgggctg 1260aggctgagtc ctggtaccag accaagtacg aggagctgca
ggtcacagca ggcagacatg 1320gggacgacct gcgcaacacc aagcaggaga ttgctgagat
caaccgcatg atccagaggc 1380tgagatctga gatcgaccac gtcaagaagc agtgcgccaa
cctgcaggcc gccattgctg 1440atgctgagca gcgtggggag atggccctca aggatgccaa
gaacaagctg gaagggctgg 1500aggatgccct gcagaaggcc aagcaggacc tggcccggct
gctgaaggag taccaggagc 1560tgatgaatgt caagctggcc ctggacgtgg agatcgccac
ctaccgcaag ctgctggagg 1620gtgaggagtg caggctgaat ggcgaaggcg ttggacaagt
caacatctct gtggtgcagt 1680ccaccgtctc cagtggctat ggcggtgcca gtggtgtcgg
cagtggctta ggcctgggtg 1740gaggaagcag ctactcctat ggcagtggtc ttggcgttgg
aggtggcttc agttccagca 1800gtggcagagc cattgggggt ggcctcagct ctgttggagg
cggcagttcc accatcaagt 1860acaccaccac ctcctcctcc agcaggaaga gctataagca
ctaaagtgcg tctgctagct 1920ctcggtccca cagtcctcag gcccctctct ggctgcagag
ccctctcctc aggttgcctt 1980tcctctcctg gcctccagtc tcccctgctg tcccaggtag
agctgggtat ggatgcttag 2040tgccctcact tcttctctct ctctctatac catctgagca
cccattgctc accatcagat 2100caacctctga ttttacatca tgatgtaatc accactggag
cttcactgtt actaaattat 2160taatttcttg cctccagtgt tctatctctg aggctgagca
ttataagaaa atgacctctg 2220ctccttttca ttgcagaaaa ttgccagggg cttatttcag
aacaacttcc acttactttc 2280cactggctct caaactctct aacttataag tgttgtgaac
ccccacccag gcagtatcca 2340tgaaagcaca agtgactagt cctatgatgt acaaagcctg
tatctctgtg atgatttctg 2400tgctcttcgc tgtttgcaat tgctaaataa agcagattta
taatacaata 2450622345DNAHomo sapiens 62cgcctccagc ctccaacgct
cgccacagcc ctctcatctc ctggaaccat ggccagcaca 60tccaccacca tcaggagcca
cagcagcagc cgccggggtt tcagtgccaa ctcagccagg 120ctccctgggg tcagccgctc
tggcttcagc agcatctccg tgtcccgctc caggggcagt 180ggtggcctgg gtggtgcatg
tggaggagct ggctttggca gccgcagtct gtatggcctg 240gggggctcca agaggatctc
cattggaggg ggcagctgtg ccatcagtgg cggctatggc 300agcagagccg gaggcagcta
tggctttggt ggcgccggga gtggatttgg tttcggtggt 360ggagccggca ttggctttgg
tctgggtggt ggagccggcc ttgctggtgg ctttgggggc 420cctggcttcc ctgtgtgccc
ccctggaggc atccaagagg tcaccgtcaa ccagagtctc 480ctgactcccc tcaacctgca
aattgacccc gccatccagc gggtgcgggc cgaggagcgt 540gagcagatca agaccctcaa
caacaagttt gcctccttca tcgacaaggt gcggttccta 600gagcagcaga acaaggttct
ggacaccaag tggaccctgc tgcaggagca gggcaccaag 660actgtgaggc agaacctgga
gccgttgttc gagcagtaca tcaacaacct caggaggcag 720ctggacagca tcgtcgggga
acggggccgc ctggactcgg agctgagaaa catgcaggac 780ctggtggagg acctcaagaa
caaatatgag gatgaaatca acaagcgcac agcagcagag 840aatgaatttg tgactctgaa
gaaggatgtg gatgctgcct acatgaacaa ggttgaactg 900caagccaagg cagacactct
cacagatgag atcaacttcc tgagagcctt gtatgatgca 960gagctgtccc agatgcagac
ccacatctca gacacatccg tggtgctatc catggacaac 1020aaccgcaacc tggacctgga
cagcatcatc gctgaggtca aggcccaata cgaggagatt 1080gctcagagga gccgggctga
ggctgagtcc tggtaccaga ccaagtacga ggagctgcag 1140gtcacagcag gcagacatgg
ggacgacctg cgcaacacca agcaggagat tgctgagatc 1200aaccgcatga tccagaggct
gagatctgag atcgaccatg tcaagaagca gtgtgccagc 1260ctgcaggctg ccattgctga
tgctgagcag cgtggggaga tggcactcaa ggatgctaag 1320aacaagctgg aagggctgga
ggatgccctg cagaaggcca agcaggacct ggcccggctg 1380ctgaaggagt accaggagct
gatgaatgtc aagctggccc tggatgtgga gatcgccacc 1440taccgcaagc tgctggaggg
cgaggagtgc aggctgaatg gcgaaggcgt tggacaagtc 1500aacgtctctg tagtacagtc
caccatctcc agtggctatg gcggtgccag cggtgtcggc 1560agtggcttag gcctgggtgg
aggaagcagc tactcctatg gcagtggtct tggcattgga 1620ggtggcttca gttccagcag
tggcagagcc attgggggtg gcctcagctc tgttggaggc 1680ggcagttcca ccatcaagta
caccaccacc tcctcctcca gcaggaagag ctacaagcac 1740taaagtgctg cctccagctc
tcggtcccac agtcctcagg cccttctctg gctgcagagc 1800cgtctcctca ggttgcctat
cctctcctgg cctctagtct tccctgctct ccgaggtaga 1860gctgggtatg gatgcttagt
gccctcactt ctctctgtct atacctgccc catctgagca 1920cccattgctc accatcagat
caacctttga ttttacatca taatgtattc accaatggag 1980cttcactttg ttactaaatt
attaatttct tgcctccaaa attgttctct ctgaggctga 2040gcattataag aaaatgatct
ctgttccttt tcattactga aaatcgcctg gggcttattt 2100cagaacaact tccacttatt
ttccattggc ccccaaactc cctaagttaa aagtattgtg 2160aacccccgcc ccgcagtatg
catggaagca caagtgacta gtcgtatgat gtacacagtc 2220tttctccctg tgatgatttc
tctgctcttt gctctttgta atttctaaat aaagcaggtt 2280ttagaataaa aaaaaaaaaa
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2340aaaaa
2345631753DNAHomo sapiens
63cagccccgcc cctacctgtg gaagcccagc cgcccgctcc cgcggataaa aggcgcggag
60tgtccccgag gtcagcgagt gcgcgctcct cctcgcccgc cgctaggtcc atcccggccc
120agccaccatg tccatccact tcagctcccc ggtattcacc tcgcgctcag ccgccttctc
180gggccgcggc gcccaggtgc gcctgagctc cgctcgcccc ggcggccttg gcagcagcag
240cctctacggc ctcggcgcct cacggccgcg cgtggccgtg cgctctgcct atgggggccc
300ggtgggcgcc ggcatccgcg aggtcaccat taaccagagc ctgctggccc cgctgcggct
360ggacgccgac ccctccctcc agcgggtgcg ccaggaggag agcgagcaga tcaagaccct
420caacaacaag tttgcctcct tcatcgacaa ggtgcggttt ctggagcagc agaacaagct
480gctggagacc aagtggacgc tgctgcagga gcagaagtcg gccaagagca gccgcctccc
540agacatcttt gaggcccaga ttgctggcct tcggggtcag cttgaggcac tgcaggtgga
600tgggggccgc ctggaggcgg agctgcggag catgcaggat gtggtggagg acttcaagaa
660taagtacgaa gatgaaatta accaccgcac agctgctgag aatgagtttg tggtgctgaa
720gaaggatgtg gatgctgcct acatgagcaa ggtggagctg gaggccaagg tggatgccct
780gaatgatgag atcaacttcc tcaggaccct caatgagacg gagttgacag agctgcagtc
840ccagatctcc gacacatctg tggtgctgtc catggacaac agtcgctccc tggacctgga
900cggcatcatc gctgaggtca aggcgcagta tgaggagatg gccaaatgca gccgggctga
960ggctgaagcc tggtaccaga ccaagtttga gaccctccag gcccaggctg ggaagcatgg
1020ggacgacctc cggaataccc ggaatgagat ttcagagatg aaccgggcca tccagaggct
1080gcaggctgag atcgacaaca tcaagaacca gcgtgccaag ttggaggccg ccattgccga
1140ggctgaggag cgtggggagc tggcgctcaa ggatgctcgt gccaagcagg aggagctgga
1200agccgccctg cagcggggca agcaggatat ggcacggcag ctgcgtgagt accaggaact
1260catgagcgtg aagctggccc tggacatcga gatcgccacc taccgcaagc tgctggaggg
1320cgaggagagc cggttggctg gagatggagt gggagccgtg aatatctctg tgatgaattc
1380cactggtggc agtagcagtg gcggtggcat tgggctgacc ctcgggggaa ccatgggcag
1440caatgccctg agcttctcca gcagtgcggg tcctgggctc ctgaaggctt attccatccg
1500gaccgcatcc gccagtcgca ggagtgcccg cgactgagcc gcctcccacc actccactcc
1560tccagccacc acccacaatc acaagaagat tcccacccct gcctcccatg cctggtccca
1620agacagtgag acagtctgga aagtgatgtc agaatagctt ccaataaagc agcctcattc
1680tgaggcctga gtgatccacg tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
1740aaaaaaaaaa aaa
1753641861DNAHomo sapiens 64cactcaaggt gtgcaggcag ctgtgtttgt caggaaggca
gaaggagttg gctttgcttt 60aggggaggag acgaggtccc acaacaccct ctgaagggta
tataaggagc cccagcgtgc 120agcctggcct ggtacctcct gccagcatct cttgggtttg
ctgagaactc acgggctcca 180gctacctggc catgaccacc acatttctgc aaacttcttc
ctccaccttt gggggtggct 240caacccgagg gggttccctc ctggctgggg gaggtggctt
tggtgggggg agtctctctg 300ggggaggtgg aagccgaagt atctcagctt cttctgctag
gtttgtctct tcagggtcag 360gaggaggata tgggggtggc atgagggtct gtggctttgg
tggaggggct ggtagtgttt 420tcggtggagg ctttggaggg ggcgttggtg ggggttttgg
tggtggcttt ggtggtggcg 480atggtggtct cctctctggc aatgagaaaa ttaccatgca
gaacctcaat gaccgcctgg 540cctcctacct ggacaaggta cgtgccctgg aggaggccaa
tgctgacctg gaggtgaaga 600tccatgactg gtaccagaag cagaccccaa ccagcccaga
atgcgactac agccaatact 660tcaagaccat tgaagagctc cgggacaaga tcatggccac
caccatcgac aactcccggg 720tcatcctgga gatcgacaat gccaggctgg ctgcggacga
cttcaggctc aagtatgaga 780atgagctggc cctgcgccag ggcgttgagg ctgacatcaa
cggcttgcgc cgagtcctgg 840atgagctgac cctggccagg actgacctgg agatgcagat
cgagggcctg aatgaggagc 900tagcctacct gaagaagaac cacgaagagg agatgaagga
gttcagcagc cagctggccg 960gccaggtcaa tgtggagatg gacgcagcac cgggtgtgga
cctgacccgt gtgctggcag 1020agatgaggga gcagtacgag gccatggcgg agaagaaccg
ccgggatgtc gaggcctggt 1080tcttcagcaa gactgaggag ctgaacaaag aggtggcctc
caacacagaa atgatccaga 1140ccagcaagac ggagatcaca gacctgagac gcacgatgca
ggagctggag atcgagctgc 1200agtcccagct cagcatgaaa gctgggctgg agaactcact
ggccgagaca gagtgccgct 1260atgccacgca gctgcagcag atccaggggc tcattggtgg
cctggaggcc cagctgagtg 1320agctccgatg cgagatggag gctcagaacc aggagtacaa
gatgctgctt gacataaaga 1380cacggctgga gcaggagatc gctacttacc gcagcctgct
cgagggccag gatgccaaga 1440tggctggcat tggcatcagg gaagcctctt caggaggtgg
tggtagcagc agcaatttcc 1500acatcaatgt agaagagtca gtggatggac aggtggtttc
ttcccacaag agagaaatct 1560aagtgtctat tgcaggagaa acgtcccttg ccactcccca
ctctcatcag gccaagtgga 1620ggactggcca gagggcctgc acatgcaaac tccagtccct
gccttcagag agctgaaaag 1680ggtccctcgg tcttttattt cagggctttg catgcgctct
attccccctc tgcctctccc 1740caccttcttt ggagcaagga gatgcagctg tattgtgtaa
caagctcatt tgtacagtgt 1800ctgttcatgt aataaagaat tacttttcct tttgcaaata
aaaaaaaaaa aaaaaaaaaa 1860a
1861651720DNAHomo sapiens 65agttaggagg gccccgcctt
ccccagctgc atataaaggt ctctggggtt ggaggcagcc 60acagcacgct ctcagccttc
ctgagcacct ttccttcttt cagccaactg ctcactcgct 120cacctccctc cttggcacca
tgaccacctg cagccgccag ttcacctcct ccagctccat 180gaagggctcc tgcggcatcg
gaggcggcat cgggggcggc tccagccgca tctcctccgt 240cctggccgga gggtcctgcc
gtgcccccag cacctacggg ggcggcctgt ctgtctcctc 300tcgcttctcc tctgggggag
cctgcgggct ggggggcggc tatggcggtg gcttcagcag 360cagcagcagc tttggtagtg
gcttcggggg aggatatggt ggtggccttg gtgctggctt 420cggtggtggc ttgggtgctg
gctttggtgg tggttttgct ggtggtgatg ggcttctggt 480gggcagtgag aaggtgacca
tgcagaacct caatgaccgc ctggcctcct acctggacaa 540ggtgcgtgct ctggaggagg
ccaacgccga cctggaagtg aagatccgtg actggtacca 600gaggcagcgg cccagtgaga
tcaaagacta cagtccctac ttcaagacca tcgaggacct 660gaggaacaag atcattgcgg
ccaccattga gaatgcgcag cccattttgc agattgacaa 720tgccaggctg gcagccgatg
acttcaggac caagtatgag catgaactgg ccctgcggca 780gactgtggag gccgacgtca
atggcctgcg ccgggtgttg gatgagctga ccctggccag 840gactgacctg gagatgcaga
tcgaaggcct gaaggaggag ctggcctacc tgaggaagaa 900ccacgaggag gagatgcttg
ctctgagagg tcagaccggc ggagatgtga acgtggagat 960ggatgctgca cctggcgtgg
acctgagccg catcctgaat gagatgcgtg accagtacga 1020gcagatggca gagaaaaacc
gcagagacgc tgagacctgg ttcctgagca agaccgagga 1080gctgaacaaa gaagtggcct
ccaacagcga actggtacag agcagccgca gtgaggtgac 1140ggagctccgg agggtgctcc
agggcctgga gattgagctg cagtcccagc tcagcatgaa 1200agcatccctg gagaacagcc
tggaggagac caaaggccgc tactgcatgc agctgtccca 1260gatccaggga ctgattggca
gtgtggagga gcagctggcc cagctacgct gtgagatgga 1320gcagcagagc caggagtacc
agatcttgct ggatgtgaag acgcggctgg agcaggagat 1380tgccacctac cgccgcctgc
tggagggcga ggatgcccac ctttcctccc agcaagcatc 1440tggccaatcc tattcttccc
gcgaggtctt cacctcctcc tcgtcctctt cgagccgtca 1500gacccggccc atcctcaagg
agcagagctc atccagcttc agccagggcc agagctccta 1560gaactgagct gcctctacca
cagcctcctg cccaccagct ggcctcacct cctgaaggcc 1620cgggtcagga ccctgctctc
ctggcgcagt tcccagctat ctcccctgct cctctgctgg 1680tggtgggcta ataaagctga
ctttctggtt gatgcaaaaa 1720661574DNAHomo sapiens
66atcgctacgc ccacttggtg gcctataaag gaagcgggcg aaccccggca gccctacaca
60acttggggcc cctctcctct ccagcccttc tcctgtgtgc ctgcctcctg ccgccgccac
120catgaccacc tccatccgcc agttcacctc ctccagctcc atcaagggct cctccggcct
180ggggggcggc tcgtcccgca cctcctgccg gctgtctggc ggcctgggtg ccggctcctg
240caggctggga tctgctggcg gcctgggcag caccctcggg ggtagcagct actccagctg
300ctacagcttt ggctctggtg gtggctatgg cagcagcttt gggggtgttg atgggctgct
360ggctggaggt gagaaggcca ccatgcagaa cctcaatgac cgcctggcct cctacctgga
420caaggtgcgt gccctggagg aggccaacac tgagctggag gtgaagatcc gtgactggta
480ccagaggcag gccccggggc ccgcccgtga ctacagccag tactacagga caattgagga
540gctgcagaac aagatcctca cagccaccgt ggacaatgcc aacatcctgc tacagattga
600caatgcccgt ctggctgctg atgacttccg caccaagttt gagacagagc aggccctgcg
660cctgagtgtg gaggccgaca tcaatggcct gcgcagggtg ctggatgagc tgaccctggc
720cagagccgac ctggagatgc agattgagaa cctcaaggag gagctggcct acctgaagaa
780gaaccacgag gaggagatga acgccctgcg aggccaggtg ggtggtgaga tcaatgtgga
840gatggacgct gccccaggcg tggacctgag ccgcatcctc aacgagatgc gtgaccagta
900tgagaagatg gcagagaaga accgcaagga tgccgaggat tggttcttca gcaagacaga
960ggaactgaac cgcgaggtgg ccaccaacag tgagctggtg cagagtggca agagtgagat
1020ctcggagctc cggcgcacca tgcaggcctt ggagatagag ctgcagtccc agctcagcat
1080gaaagcatcc ctggagggca acctggcgga gacagagaac cgctactgcg tgcagctgtc
1140ccagatccag gggctgattg gcagcgtgga ggagcagctg gcccagcttc gctgcgagat
1200ggagcagcag aaccaggaat acaaaatcct gctggatgtg aagacgcggc tggagcagga
1260gattgccacc taccgccgcc tgctggaggg agaggatgcc cacctgactc agtacaagaa
1320agaaccggtg accacccgtc aggtgcgtac cattgtggaa gaggtccagg atggcaaggt
1380catctcctcc cgcgagcagg tccaccagac cacccgctga ggactcagct accccggccg
1440gccacccagg aggcagggag gcagccgccc catctgcccc acagtctccg gcctctccag
1500cctcagcccc ctgcttcagt cccttcccca tgcttccttg cctgatgaca ataaagcttg
1560ttgactcagc tatg
1574671805DNAHomo sapiens 67gagacacact ctgccccaac catcctgaag ctacaggtgc
tccctcctgg aatctccaat 60ggatttcagt cgcagaagct tccacagaag cctgagctcc
tccttgcagg cccctgtagt 120cagtacagtg ggcatgcagc gcctcgggac gacacccagc
gtttatgggg gtgctggagg 180ccggggcatc cgcatctcca actccagaca cacggtgaac
tatgggagcg atctcacagg 240cggcggggac ctgtttgttg gcaatgagaa aatggccatg
cagaacctaa atgaccgtct 300agcgagctac ctagaaaagg tgcggaccct ggagcagtcc
aactccaaac ttgaagtgca 360aatcaagcag tggtacgaaa ccaacgcccc gagggctggt
cgcgactaca gtgcatatta 420cagacaaatt gaagagctgc gaagtcagat taaggatgct
caactgcaaa atgctcggtg 480tgtcctgcaa attgataatg ctaaactggc tgctgaggac
ttcagactga agtatgagac 540tgagagagga atacgtctaa cagtggaagc tgatctccaa
ggcctgaata aggtctttga 600tgacctaacc ctacataaaa cagatttgga gattcaaatt
gaagaactga ataaagacct 660agctctcctc aaaaaggagc atcaggagga agtcgatggc
ctacacaagc atctgggcaa 720cactgtcaat gtggaggttg atgctgctcc aggcctgaac
cttggcgtca tcatgaatga 780aatgaggcag aagtatgaag tcatggccca gaagaacctt
caagaggcca aagaacagtt 840tgagagacag actgcagttc tgcagcaaca ggtcacagtg
aatactgaag aattaaaagg 900aactgaggtt caactaacgg agctgagacg cacctcccag
agccttgaga tagaactcca 960gtcccatctc agcatgaaag agtctttgga gcacactcta
gaggagacca aggcccgtta 1020cagcagccag ttagccaacc tccagtcgct gttgagctct
ctggaggccc aactgatgca 1080gattcggagt aacatggaac gccagaacaa cgaataccat
atccttcttg acataaagac 1140tcgacttgaa caggaaattg ctacttaccg ccgccttctg
gaaggagaag acgtaaaaac 1200tacagaatat cagttaagca ccctggaaga gagagatata
aagaaaacca ggaagattaa 1260gacagtcgtg caagaagtag tggatggcaa ggtcgtgtca
tctgaagtca aagaggtgga 1320agaaaatatc taaatagcta ccagaaggag atgctgctga
ggttttgaaa gaaatttggc 1380tataatctta tctttgctcc ctgcaagaaa tcagccataa
gaaagcacta ttaatactct 1440gcagtgatta gaaggggtgg ggtggcggga atcctattta
tcagactctg taattgaata 1500taaatgtttt actcagagga gctgcaaatt gcctgcaaaa
atgaaatcca gtgagcacta 1560gaatatttaa aacatcatta ctgccatctt tatcatgaag
cacatcaatt acaagctgta 1620gaccacctaa tatcaatttg taggtaatgt tcctgaaaat
tgcaatacat ttcaattata 1680ctaaacctca caaagtagag gaatccatgt aaattgcaaa
taaaccactt tctaattttt 1740tcctgtttct gaattgtaaa accccctttg ggagtccctg
gtttcttatt gagccaattt 1800ctggg
180568967DNAHomo sapiens 68aaaaagggac tggttgctag
tggaaacctc agagtgaaac tcacccagct ttagtaacca 60actcgattgc atagacttta
gataaccatg tgaaggggat tctaccatca gaaaagaggc 120caaacttcta tcatcatggt
ggatgtgaag tgtctgagtg actgtaaatt gcagaaccaa 180cttgagaagc ttggattttc
acctggccca atactacctt ccaccagaaa gttgtatgaa 240aaaaagttag tacagttgtt
ggtctcacct ccctgtgcac cacctgtgat gaatggaccc 300agagagctgg atggagcgca
ggacagtgat gacagcgaag agcttaatat cattttgcaa 360ggaaatatca tactctcaac
agaaaaaagc aagaaactca aaaaatggcc tgaggcttcc 420accactaaac gcaaagctgt
agatacctat tgcttggatt ataagccttc caagggaaga 480aggtgggctg caagagcacc
aagcaccaga atcacatatg ggactatcac caaagagaga 540gactactgcg cggaagacca
gactatcgag agctggagag aagaaggttt cccagtgggc 600ttgaagcttg ctgtgcttgg
tattttcatc attgtggtgt ttgtctacct gactgtggaa 660aataagtcgc tgtttggtta
agtaatttag gagcaaagca atgctccaag cgaggcctcc 720tgcttcagga aagaaccaaa
acactaccct gaagggccag cctagcctgc agccctccct 780tgcagggagc cttcccttgc
actgtgctgc tctcacagat cggtgtctgg gctcagccag 840gtggaaggaa cctgcctaac
caggcacctg tgttaagagc atgatggtta ggaaatcccc 900caagtcatgt caactctcat
taaaggtgct tccatatttg agcaggcgtc aaacaaggaa 960aaaaaaa
967691291DNAHomo sapiens
69tggtcaccag ggaagctggc aagggaaggg agactagggt gcgctctagg agaagccgac
60agcctgagag tcccagaaga ggagccctgt ggaccctccc ctgccagcca ctcccttacc
120ctgggtataa gagccaccac cgcctgccat ccgccaccat ctcccactcc tgcagctctt
180ctcacaggac cagccactag cgcagcctcg agcgatggcc tatgtccccg caccgggcta
240ccagcccacc tacaacccga cgctgcctta ctaccagccc atcccgggcg ggctcaacgt
300gggaatgtct gtttacatcc aaggagtggc cagcgagcac atgaagcggt tcttcgtgaa
360ctttgtggtt gggcaggatc cgggctcaga cgtcgccttc cacttcaatc cgcggtttga
420cggctgggac aaggtggtct tcaacacgtt gcagggcggg aagtggggca gcgaggagag
480gaagaggagc atgcccttca aaaagggtgc cgcctttgag ctggtcttca tagtcctggc
540tgagcactac aaggtggtgg taaatggaaa tcccttctat gagtacgggc accggcttcc
600cctacagatg gtcacccacc tgcaagtgga tggggatctg caacttcaat caatcaactt
660catcggaggc cagcccctcc ggccccaggg acccccgatg atgccacctt accctggtcc
720cggacattgc catcaacagc tgaacagcct gcccaccatg gaaggacccc caaccttcaa
780cccgcctgtg ccatatttcg ggaggctgca aggagggctc acagctcgaa gaaccatcat
840catcaagggc tatgtgcctc ccacaggcaa gagctttgct atcaacttca aggtgggctc
900ctcaggggac atagctctgc acattaatcc ccgcatgggc aacggtaccg tggtccggaa
960cagccttctg aatggctcgt ggggatccga ggagaagaag atcacccaca acccatttgg
1020tcccggacag ttctttgatc tgtccattcg ctgtggcttg gatcgcttca aggtttacgc
1080caatggccag cacctctttg actttgccca tcgcctctcg gccttccaga gggtggacac
1140attggaaatc cagggtgatg tcaccttgtc ctatgtccag atctaatcta ttcctggggc
1200cataactcat gggaaaacag aattatcccc taggactcct ttctaagccc ctaataaaat
1260gtctgagggt gtctcatgaa aaaaaaaaaa a
1291703967DNAHomo sapiens 70agttgtcggg tcttattggc cgctgattag accagccggg
gtccactagc cctgggctgc 60agggaggctg ctgcgtccag tgaacacttc agcacctgta
gcacagaagg gccaaggagc 120tgcagtcctc gaccagcagg aggtttgctc ctcagcccac
tcgctgcatc cagatcagct 180cacccctctc ccttccctgc ccaccaggac tctgatagcc
cctggcagcc acagcccatt 240ttgccaagat gtctagagta gccaaatatc gccggcaggt
gagtgaagac cccgacatcg 300acagcctgct ggagaccctg tctcccgagg agatggagga
gctggagaag gagctggacg 360tggtggaccc agacgggagt gttcccgtgg ggctgcggca
gagaaaccag acggagaaac 420agtccacggg tgtgtacaac cgggaggcca tgctcaactt
ctgtgaaaag gagaccaaga 480aacttatgca gagggagatg tccatggatg aaagcaagca
agtggagacc aagacagatg 540ccaagaatgg agaggaaagg ggcagagatg ccagcaaaaa
agccctgggc cccagacggg 600actcagatct ggggaaggag ccaaagaggg gtggtttaaa
gaaaagcttc tctagagaca 660gagatgaagc tggtggcaag agtggcgaga agcccaagga
ggagaagatc atccggggca 720ttgacaaggg ccgggtcagg gctgcagtgg ataagaagga
ggcagggaag gatgggagag 780gagaggagag ggcagtggcc accaagaagg aagaggagaa
gaaagggagt gacaggaaca 840caggcttgag cagggacaag gataaaaaga gagaggagat
gaaggaggtg gccaagaaag 900aggatgatga gaaggtaaaa ggggagcgta ggaacacaga
caccagaaaa gagggtgaga 960agatgaaaag agcaggtggg aacacagaca tgaaaaagga
ggatgagaag gtaaaaagag 1020gaactgggaa cacagacacc aaaaaggacg atgaaaaagt
caagaagaat gaacccttac 1080atgaaaagga agccaaggat gacagcaaga ccaaaacacc
cgagaaacag acgcccagtg 1140gccccaccaa gccctctgaa ggaccggcca aggtggagga
ggaggcagct cccagcatat 1200ttgatgagcc tctggagaga gtgaagaaca atgaccccga
gatgactgag gtgaacgtca 1260acaactcaga ctgcatcaca aatgagatct tggtccggtt
tactgaggct ctggagttca 1320acactgtggt taagctgttc gccttggcca acacgcgagc
cgatgaccac gtggcctttg 1380ccattgccat catgctcaag gccaacaaga ccatcaccag
cctcaacctg gactccaacc 1440acatcacagg caaaggcatc ctggccatct tccgggccct
cctccagaac aacacgctga 1500ccgagctccg cttccacaac cagcgacaca tctgtggagg
caagacggag atggagatcg 1560ccaagctgct gaaggagaat actaccctgc tcaagctggg
ctaccatttt gagctggccg 1620ggccccgaat gactgtcacc aatctgctca gccgcaacat
ggacaagcag agacaaaagc 1680ggctgcagga gcaaaggcag gcacaggaag ccaagggaga
gaagaaggat ctgctggagg 1740tacccaaggc cggggccgtg gctaagggct ccccaaaacc
ttcacctcaa ccatctccaa 1800agccctctcc aaagaactca cccaaaaaag ggggtgctcc
agctgcccca ccaccccctc 1860cccctccctt ggctccaccc cttatcatgg agaacctgaa
gaattcactc tcaccagcta 1920cccagaggaa gatgggagac aaagtcctcc ctgcccagga
gaagaactcc cgtgaccagc 1980tattggctgc catccgctcc agcaacctca agcagctcaa
gaaggtggaa gtgcccaaac 2040tgcttcagta ggaccaggct gccaggcacc atctgccaat
gccatgactg ctcaggcctc 2100acctcccagg gctacacaga ccctgcccac cccatccctg
gctgacctgc tgtggatgtc 2160cctattctgc catgggagag tccaggcctg ggtcacgctc
aaggaaggat gccttatctc 2220ttctcacttt ccttttcttg tctctgaggc tctccaaatt
ttgctttagt acatggagct 2280caggtttctg gacaagaaga gtccttttag cacatcactg
agaagatggc actgtccagg 2340gcccatgtag ctggcaagct gcaaaaggcc tgtgatccag
gaaagatgtc ccacagggac 2400cacatccacc ccagccccac tgccctccag ggccaggatt
caggcctctg aggagcccac 2460ggggcaaagc tgctgggcca gtggcactct gtgtgggaaa
atggcagaaa gatggagagg 2520catgggggcc caaaggggag cgtggggagg ggctgaggat
accccaaagt ccaggctaat 2580tagaggatgt ggcaggggca gtggcctgga tgcacagtgc
ctgatgggag taggctccag 2640acaggaggag tgggacagac agcagctgga cttgaaggtt
tgatgccaaa gcagacattt 2700tcctcacacc cacctgctgc tgtatgaata gctgtgtatc
tgtttttcca taagattttg 2760ataatatata caaaccttta gctgtgaatg gctgtgcccc
acctgttgtc ctgaactgtg 2820agtcctgatc ctaaccctgg gctccctgga ggactctaga
agctcaggtt ccctgccaca 2880ctatttgagt tggccaagaa ataaattcac atcctcagaa
agtgcagcat ggaggaaaat 2940ctgaactcta agcagaagac tctccactga cctggttgtc
caggtctaga aggccaggcc 3000tctactaggt ctgctcctga accagtcctg ctgcctggag
tcagtagcca gagttgttct 3060caggggtgct ggggcagagt ggagcccagg gtgctgggat
ggctatatta ggcatgttca 3120gggatgctca ttccatgact ctgcctaacc atgggctcag
ggccaggtcc tcacagcagt 3180cacaggccca ggaaggcggc aggcagagaa gtggagtgac
tatttggaga atagcaccca 3240tatctgtgtg ccctagggct cagaggggcc tcatcttccc
cagccctccc cacctgctca 3300ccaattccac ttcctgcccc aactgcagga atgctgacaa
tgctgccatg cccaccatcg 3360ggtgtaggtg aaaggcatct ttctgaattt cattctcttg
aaggtgctgc caccccttgg 3420cactgtggaa ctgccacctt gggtctgtgt cacttgtagg
tttctctgcc tccaggttgc 3480ctcaacagca ggaggcacag cagtttcacc atctttgagg
tgagggtggg gtgccccagc 3540taggaagcaa gatcgctgtg ctaggtctga ccaaaaccag
agggcagtct agtcctgggg 3600gtaaagccct cagatcccag ggtacactct tctccattcc
ctccacccac ttgcctgtca 3660ccccagtcac ctaagcaatc actgggccca gaggagagga
gacagacaca cactggctcc 3720tggacctaaa gggtatgagc tggagctaag gccagctaga
gcttccactg tcagccctca 3780ctgtcagtcc cactgcaccc ccctgtgcct gctgggcact
gggcactagc tagatgcttt 3840aggttgcttc agctgatcct tcaactctgt gaggtggata
ccaatattct attttgcaga 3900tagaatttgg cccagagagg ttaactaata tatccatgat
cacacagcta ataaaagtca 3960gagctca
3967711121DNAHomo sapiens 71tgtgaacctg gggctcttgg
gcagcaacac gttgcagctt ccacctagca agccacgccg 60ggaccaggtc ccatctgatg
gaggagaatc aattcaagga gatgcccttc ctttacagaa 120caccctttaa cagcatccag
gaggaacgag aggctgcaat actgaggctt tcaaagtact 180cacgaggatg tccgagaatg
gctgtgatgc caggcttctg gcaggttcca gactccatca 240caagcccagc atccctgcac
cagatctgac atcgctgctg ttgtgccagc tgtttatgaa 300gggcctgagt agctagcagg
tttttatcag gagccctgct gggggcttag acaccaaaag 360agaagtctca tcctctgtag
ttcttcttgt gaatgtcctt ttagaaaaac aattagaacc 420aaccacaagc accaaagtcc
taatgggatc tcctgcgagc acatatcaag caggattgtt 480gctatttctc ctcactggct
ctttggacag actgtgtgag ctcctggagg gttccactgt 540atctaccctt tgacactgac
cagttggcac atggtgactt attccaatgt gttgattgaa 600aatgtgaacg tacagccagt
gctgtgtgcg ggaggactct ctcctcctca gtggggccac 660accgtgcact attaatggag
ccccactcct ttgcacagcc tggccatgca gtggctcata 720ttgaggtttt agccaactga
aatctcccgt gcatttttct gacaagccag ctaggcctct 780gctatgctgt ccttgtgtct
ttcatttgat gaccttaagg gtgggactgt tttatcttaa 840gttacaggtg gtcaagtcca
gcccaaggac agcaactctg agggtcaagc ctcataggct 900aactggatag atgttctctg
ctttgccacc cactggagcc cgacctgccc cactaattta 960tatttcccct ggtctcattt
tgtacttttt atttataatt cacccttaaa gtgtatgtgt 1020ctcttataag ctgcctccga
tctttcatgg tatgaggtgg ttacctaaat aaagaaggag 1080atttggcctt tgtttttatg
taaaaaaaaa aaaaaaaaaa a 1121726141DNAHomo sapiens
72gtttcagatt tgggatattg gtgtttctgt tttggagaaa ttattctttt tctttttaat
60ttgaagaaaa atcatcagtc ttggaataca gaagagaaac tagaaatata cgtattttgt
120ttcacatttg aacagtcatt cttgaggaat actccatacc tgagtagaca gccatgtggc
180catcgcagct actaattttc atgatgctct tagctccaat aattcatgct ttcagccgtg
240ccccaattcc aatggctgtg gtccgcagag agctatcctg tgagagctat cctatagagc
300ttcgctgtcc aggaacagac gtcatcatga tagaaagtgc caactatggc aggactgatg
360acaaaatttg tgactctgac cctgctcaga tggagaatat ccgatgttat ctgccagatg
420cctataagat tatgtctcaa agatgcaata acagaaccca gtgtgcagtg gtggcaggtc
480ctgatgtttt tccagacccg tgtccaggaa cctataaata ccttgaagtg cagtatgaat
540gtgtccctta caaagtggaa caaaaagttt ttctttgtcc tggactacta aaaggagtat
600accagagtga acatttgttt gagtccgacc accaatctgg ggcgtggtgc aaagaccctc
660tgcaggcatc tgacaagatt tattatatgc cctggactcc ctacagaact gataccctga
720ctgagtattc atccaaggat gacttcattg ctggaagacc aactacaacc tacaagctcc
780ctcacagggt ggatggcaca ggatttgtag tgtatgatgg agctttgttc ttcaacaaag
840agcgcaccag gaacatagta aagtttgatt tgcggactag gataaagagt ggagaggcta
900tcatagcaaa tgccaattac catgatacct ccccttaccg atggggaggc aaatctgaca
960tagacctggc agtagatgag aatgggctat gggtaatcta tgcaacagaa caaaacaatg
1020gtaaaattgt cattagtcaa ttgaaccctt acaccctacg gatcgaagga acatgggata
1080ctgcatatga taaaaggtca gcttccaatg cctttatgat ttgtggaatt ctgtatgtgg
1140tcaaatctgt atatgaggat gatgacaatg aggctactgg aaataagatt gactacattt
1200acaacactga ccaaagcaag gatagtttgg tggatgtacc ctttcctaat tcataccagt
1260acattgcagc tgtggattac aaccccaggg acaacctact ttatgtatgg aataactatc
1320acgtcgtgaa atattctttg gattttggac ctctggatag tagatcaggg caggcacatc
1380atggacaagt ttcatacatt tctccgccaa ttcaccttga ctctgagcta gaaagaccct
1440ctgttaaaga tatctctacc acaggacctc ttggcatggg aagcactacc accagtacca
1500cccttcggac cacaactttg agcccaggaa ggagtaccac cccgtcagtg tcaggaagaa
1560gaaaccggag tactagtacc ccatctccag ctgtcgaggt acttgatgac atgaccacac
1620accttccatc agcatcgtcc caaatcccag ctctcgaaga gagctgtgag gctgtggaag
1680cccgagaaat catgtggttt aagactcgtc aaggacagat agcaaagcag ccatgccctg
1740caggaactat aggtgtatca acttatctat gccttgctcc tgatggaatt tgggatcccc
1800aaggtccaga tctcagcaac tgttcttctc cttgggtcaa tcatataaca cagaagttga
1860aatctggtga aacagctgcc aacattgcta gagagctggc tgaacagaca agaaatcact
1920tgaatgctgg ggacatcacc tactctgtcc gggccatgga ccagctggta ggcctcctag
1980atgtacagct tcggaacttg accccaggtg gaaaagatag tgctgcccgg agtttgaaca
2040agcttcagaa aagagagcgc tcttgcagag cctatgtcca ggcaatggtc gagacagtta
2100acaacctcct tcagccacaa gctttgaatg catggagaga cctgactacg agtgatcagc
2160tgcgtgcggc caccatgttg cttcatactg tggaggaaag tgcttttgtg ctggctgata
2220accttttgaa gactgacatt gtcagggaga atacagacaa tattaaattg gaagttgcaa
2280gactgagcac agaaggaaac ttagaagacc taaaatttcc agaaaacatg ggccatggaa
2340gcactatcca gctgtctgca aataccttaa agcaaaatgg ccgaaatgga gagatcagag
2400tggcctttgt cctgtataac aacttgggtc cttatttatc cacggagaat gccagtatga
2460agttgggaac ggaagctttg tccacaaatc attctgttat tgtcaattcc cctgttatta
2520cggcagcaat aaacaaagag ttcagtaaca aggtttattt ggctgatcct gtggtattta
2580ctgttaaaca tatcaagcag tcagaggaaa atttcaaccc taactgttca ttttggagct
2640actccaagcg tacaatgaca ggttattggt caacacaagg ctgtcggctc ctgacaacaa
2700ataagacaca tactacatgc tcttgtaacc acctaacaaa ttttgcagta ctgatggcac
2760atgtggaagt taagcacagt gatgcggtcc atgacctcct tctggatgtg atcacgtggg
2820ttggaatttt gctgtccctt gtttgtctcc tgatttgcat cttcacattt tgctttttcc
2880gggggctcca gagtgaccgt aacaccatcc acaagaacct ctgcatcagt ctctttgtag
2940cagagctgct cttcctgatt gggatcaacc gaactgacca accaattgcc tgtgctgttt
3000tcgctgccct gttacatttc ttcttcttgg ctgccttcac ctggatgttc ctggaggggg
3060tgcagcttta tatcatgctg gtggaggttt ttgagagtga acattcacgt aggaaatact
3120tttatctggt cggctatggg atgcctgcac tcattgtggc tgtgtcagct gcagtagact
3180acaggagtta tggaacagat aaagtatgtt ggctccgact tgacacctac ttcatttgga
3240gttttatagg accagcaact ttgataatta tgcttaatgt aatcttcctt gggattgctt
3300tatataaaat gtttcatcat actgctatac tgaaacctga atcaggctgt cttgataaca
3360tcaactatga ggataacaga cccttcatca agtcatgggt tataggtgca atagctcttc
3420tctgcctatt aggattgacc tgggcctttg gactcatgta tattaatgaa agcacagtca
3480tcatggccta tctcttcacc attttcaatt ctctacaggg aatgtttata tttattttcc
3540attgtgtcct acagaagaag gtacgaaaag agtatgggaa atgcctgcga acacattgct
3600gtagtggcaa aagtacagag agttccattg gttcagggaa aacatctggt tctcgaactc
3660ctggacgcta ctccacaggc tcacagagcc gaatccgtag aatgtggaat gacacggttc
3720gaaagcagtc agagtcttcc tttattactg gagacataaa cagttcagcg tcactcaaca
3780gagaggggct tctgaacaat gccagggata caagtgtcat ggatactcta ccactgaatg
3840gtaaccatgg caatagttac agcattgcca gcggcgaata cctgagcaac tgtgtgcaaa
3900tcatagaccg tggctataac cataacgaga ccgccctaga gaaaaagatt ctgaaggaac
3960tcacttccaa ctatatccct tcttacctga acaaccatga gcgctccagt gaacagaaca
4020ggaatctgat gaacaagctg gtgaataacc ttggcagtgg aagggaagat gatgccattg
4080tcctggatga tgccacctcg tttaaccacg aggagagttt gggcctggaa ctcattcatg
4140aggaatctga tgctcctttg ctgcccccaa gagtatactc caccgagaac caccagccac
4200accattatac cagaaggcgg atcccccaag accacagtga gagctttttc cctttgctaa
4260ccaacgagca cacagaagat ctccagtcac cccatagaga ctctctctat accagcatgc
4320cgacactggc tggtgtggcc gccacagaga gtgttaccac cagcacccag accgaacccc
4380caccggccaa atgtggtgat gccgaagatg tttactacaa aagcatgcca aacctaggct
4440ccagaaacca cgtccatcag ctgcatactt actaccagct aggtcgcggc agcagtgatg
4500gatttatagt tcctccaaac aaagatggga cccctcccga gggaagttca aaaggaccgg
4560ctcatttggt cactagtcta tagaagatga cacagaaatt ggaaccaaca aaactgctaa
4620caccttgttg actgttctga gttgatataa gcagtggtaa taatgtgtgt actcctaaat
4680ctttatgctg tcctctaaag acaaacacaa actctcagac tttttttttt ttaatgggat
4740ttttaggtca gcccagggga gaaagataac tgctaaaatt cccctgtacc ccatcctttc
4800ttgtcctttc cccttcagat ggagacttca ttatgttaat gaacaagata tgaagaaaat
4860ggcactcatt gtggccttgt tgaattatgt tgtgtatgtt ttaacatctc tgatgctgtg
4920ttactaaaat tacaaggacc tgctttttaa aaggccagaa caattgtctg aaattagtaa
4980caatgctgca tctagattgg agtgctgcac aaacaaacat aagagcaaag caaaactgta
5040tcacataggg tttttggtca ctcacaacct gaattcacca cagctggaat agctgtggaa
5100aacaaaataa aacaacaaaa ttaataatga aatggagggg aattctagaa ttatatgcta
5160aatgcatatt ttatgatttg ctgtattaac tgatgataaa actaatggca gaaaaagaag
5220ttgagcaatt tctatgtaat gtacagatac tagcattgca catatagtct gctttctgtt
5280cctccagaat ttgagtcctg ttaatgtagt agaaaaaaaa aaaagaaatt ttctttttct
5340tttgtgctgg tcttgcaagt ttgtctacca gtaagagagc aaagtttcct tcctttcttc
5400tctttcttca ttttcttttt ttcttttttg ccttttattc ctttaaaatt tcgcctggca
5460aaaaataaat aaatggaact atcactttat aagaatcatt ttctagtaat gcaaacaaat
5520tattttttac aaaaaaacaa aataaataaa attagacttc cttccctcac tatatatctt
5580tatgcagtca gaatatttcc aacagtgttt tttgcaaatt agagcaggac aaacttttat
5640gtttacaggg cacgtctgtt gtaatgcaaa gcatatttgg caagcagttc atcaccagga
5700cactagctat gattctagaa gtcaaaaggt gtctatagaa ctagtggggc ttctgcatgt
5760gaaaaacggt tttccatagg cattaaagtg ctgaatgctc agtctgatca acaagtgggc
5820acctgcacta ccacttttta gaggaaattc actccctcgt aagcattgga aggtcaaatt
5880attttgaagt gattttttta aaaaaaagtc ttctgtttat taacaggaaa atttatttat
5940ttgacaggat tttgagtaat gtaggaatac aaaaggtaaa ttagcagcac atataatttt
6000tttttaattt atgatccatt ttgtatggtc tcaaagttgg atgacctcat tactaatatt
6060tgttgtaaaa gtgaaacttg tttgccaacc aataaacaac tgattgagat ttagaagata
6120ttgtaaaaaa aaaaaaaaaa a
6141732116DNAHomo sapiens 73acagtgagct tccttatttg aagcaggact caattcttgg
ttaaaagcta tggtatttga 60gctagttaaa cacatatctc tctcccattc catagggaat
gagctgggct gtcctttctc 120cccacgttca cctgcacttc gttagagagc agtgttcaca
tgccacacca caagatcccc 180acaatgacat aactccattc agagactggc gtgactgggc
tgggtctccc cacccccctt 240cagctcttgt atcactcaga atctggcagc cagttccgtc
ctgacagagt tcacagcata 300tattggtgga ttcttgtcca tagtgcatct gctttaagaa
ttaacgaaag cagtgtcaag 360acagtaagga ttcaaaccat ttgccaaaaa tgagtctaag
tgcatttact ctcttcctgg 420cattgattgg tggtaccagt ggccagtact atgattatga
ttttccccta tcaatttatg 480ggcaatcatc accaaactgt gcaccagaat gtaactgccc
tgaaagctac ccaagtgcca 540tgtactgtga tgagctgaaa ttgaaaagtg taccaatggt
gcctcctgga atcaagtatc 600tttaccttag gaataaccag attgaccata ttgatgaaaa
ggcctttgag aatgtaactg 660atctgcagtg gctcattcta gatcacaacc ttctagaaaa
ctccaagata aaagggagag 720ttttctctaa attgaaacaa ctgaagaagc tgcatataaa
ccacaacaac ctgacagagt 780ctgtgggccc acttcccaaa tctctggagg atctgcagct
tactcataac aagatcacaa 840agctgggctc ttttgaagga ttggtaaacc tgaccttcat
ccatctccag cacaatcggc 900tgaaagagga tgctgtttca gctgctttta aaggtcttaa
atcactcgaa taccttgact 960tgagcttcaa tcagatagcc agactgcctt ctggtctccc
tgtctctctt ctaactctct 1020acttagacaa caataagatc agcaacatcc ctgatgagta
tttcaagcgt tttaatgcat 1080tgcagtatct gcgtttatct cacaacgaac tggctgatag
tggaatacct ggaaattctt 1140tcaatgtgtc atccctggtt gagctggatc tgtcctataa
caagcttaaa aacataccaa 1200ctgtcaatga aaaccttgaa aactattacc tggaggtcaa
tcaacttgag aagtttgaca 1260taaagagctt ctgcaagatc ctggggccat tatcctactc
caagatcaag catttgcgtt 1320tggatggcaa tcgcatctca gaaaccagtc ttccaccgga
tatgtatgaa tgtctacgtg 1380ttgctaacga agtcactctt aattaatatc tgtatcctgg
aacaatattt tatggttatg 1440tttttctgtg tgtcagtttt catagtatcc atattttatt
actgtttatt acttccatga 1500attttaaaat ctgagggaaa tgttttgtaa acatttattt
tttttaaaga aaagatgaaa 1560ggcaggccta tttcatcaca agaacacaca catatacacg
aatagacatc aaactcaatg 1620ctttatttgt aaatttagtg tttttttatt tctactgtca
aatgatgtgc aaaacctttt 1680actggttgca tggaaatcag ccaagtttta taatccttaa
atcttaatgt tcctcaaagc 1740ttggattaaa tacatatgga tgttactctc ttgcaccaaa
ttatcttgat acattcaaat 1800ttgtctggtt aaaaaatagg tggtagatat tgaggccaag
aatattgcaa aatacatgaa 1860gcttcatgca cttaaagaag tatttttaga ataagaattt
gcatacttac ctagtgaaac 1920ttttctagaa ttatttttca ctctaagtca tgtatgtttc
tctttgatta tttgcatgtt 1980atgtttaata agctactagc aaaataaaac atagcaaatg
gcatcactgt gtttgacttc 2040ttgtgaaatt tctgtacttt gtatataaaa tacataaaac
aatagattag aaatcaaaag 2100atatctctgg cctgca
211674806DNAHomo sapiens 74gcccaccccc gcccagcccg
tgcctataag gccttggcaa tgcaggggcc cgcactgctc 60ccagacgaca tcagagatga
ggacagcatt gctgctcctt gcagccctgg ctgtggctac 120agggccagcc cttaccctgc
gctgccacgt gtgcaccagc tccagcaact gcaagcattc 180tgtggtctgc ccggccagct
ctcgcttctg caagaccacg aacacagtgg agcctctgag 240ggggaatctg gtgaagaagg
actgtgcgga gtcgtgcaca cccagctaca ccctgcaagg 300ccaggtcagc agcggcacca
gctccaccca gtgctgccag gaggacctgt gcaatgagaa 360gctgcacaac gctgcaccca
cccgcaccgc cctcgcccac agtgccctca gcctggggct 420ggccctgagc ctcctggccg
tcatcttagc ccccagcctg tgaccttccc cccagggaag 480gcccctcatg cctttccttc
cctttctctg gggattccac acctctcttc cccagccgca 540acgggggtgc caggagcccc
aggctgaggg cttccccgaa agtctgggac caggtccagg 600tgggcatgga atgctgatga
cttggagcag gccccacaga ccccacagag gatgaagcca 660ccccacagag gatgcagccc
ccagctgcat ggaaggtgga ggacagaagc cctgtggatc 720cccggatttc acactccttc
tgttttgttg ccgtttattt ttgtactcaa atctctacat 780ggagataaat gatttaaacc
agaaaa 806751516DNAHomo sapiens
75aaatactggg gccagctcac cctggtcagc ctagcactct gacctagcag tcaacatgaa
60ggctctcatt gttctggggc ttgtcctcct ttctgttacg gtccagggca aggtctttga
120aaggtgtgag ttggccagaa ctctgaaaag attgggaatg gatggctaca ggggaatcag
180cctagcaaac tggatgtgtt tggccaaatg ggagagtggt tacaacacac gagctacaaa
240ctacaatgct ggagacagaa gcactgatta tgggatattt cagatcaata gccgctactg
300gtgtaatgat ggcaaaaccc caggagcagt taatgcctgt catttatcct gcagtgcttt
360gctgcaagat aacatcgctg atgctgtagc ttgtgcaaag agggttgtcc gtgatccaca
420aggcattaga gcatgggtgg catggagaaa tcgttgtcaa aacagagatg tccgtcagta
480tgttcaaggt tgtggagtgt aactccagaa ttttccttct tcagctcatt ttgtctctct
540cacattaagg gagtaggaat taagtgaaag gtcacactac cattatttcc ccttcaaaca
600aataatattt ttacagaagc aggagcaaaa tatggccttt cttctaagag atataatgtt
660cactaatgtg gttattttac attaagccta caacattttt cagtttgcaa atagaactaa
720tactggtgaa aatttaccta aaaccttggt tatcaaatac atctccagta cattccgttc
780tttttttttt tgagacagtc tcgctctgtc gcccaggctg gagtgcagtg gcgcaatctc
840ggctcactgc aacctccacc tcccgggttc acgccattct cctgcctcag cctcccgagt
900agctgggatt acgggcgccc gccaccacgc ccggctaatt ttttgtattt ttagtagaga
960cagggtttca ccgtgttagc caggatggtc tcgatctcct gaccttgtga tccacccacc
1020tcggcctccc aaagtgctgg gattacaggc gtgagccact gcgcccggcc acattcagtt
1080cttatcaaag aaataaccca gacttaatct tgaatgatac gattatgccc aatattaagt
1140aaaaaatata agaaaaggtt atcttaaata gatcttaggc aaaataccag ctgatgaagg
1200catctgatgc cttcatctgt tcagtcatct ccaaaaacag taaaaataac cactttttgt
1260tgggcaatat gaaattttta aaggagtaga ataccaaatg atagaaacag actgcctgaa
1320ttgagaattt tgatttctta aagtgtgttt ctttctaaat tgctgttcct taatttgatt
1380aatttaattc atgtattatg attaaatctg aggcagatga gcttacaagt attgaaataa
1440ttactaatta atcacaaatg tgaagttatg catgatgtaa aaaatacaaa cattctaatt
1500aaaggctttg caacac
1516762383DNAHomo sapiens 76gaaagcagtt ctctgggacc accttctttt ggcttcaacc
tctcccactc ttgacatctg 60agtagctcag ggaagctctt ccaggtccga ctgttcatat
gtaaaggaga ctggccgctg 120gggctcagga ccgggattat ccgagctctg cagaagtgca
ccgctattgc tttgggaggt 180taaaaaaaaa atcacacggt ttccagtgaa aaagtgacag
agggtggtgg cctttggaac 240cgccgtgaag tcttctgcct ggaacccgaa acttgcatgc
tatggaacac ccgctctttg 300gctgcctgcg cagccctcac gccacggcgc aaggcttgca
cccgttctcc caatcctctc 360tcgccctcca tggaagatct gaccatatgt cttaccccga
gctctctact tcttcctcat 420cttgcataat cgcgggatac cccaacgaag agggcatgtt
tgccagccag catcacaggg 480ggcaccacca ccaccaccac caccaccacc atcaccacca
tcagcagcag cagcaccagg 540ctctgcaaac caactggcac ctcccgcaga tgtcttcccc
accgagtgcg gctcggcaca 600gcctctgcct ccagcccgac tctggagggc ccccagagtt
ggggagcagc ccgcccgtcc 660tgtgctccaa ctcttccagc ttgggctcca gcaccccgac
tggggccgcg tgcgcgccgg 720gggactacgg ccgccaggca ctgtcacctg cggaggcgga
gaagcgaagc ggcggcaaga 780ggaaaagcga cagctcagac tcccaggaag gaaattacaa
gtcagaagtc aacagcaaac 840ccaggaaaga aaggacagca tttaccaaag agcaaatcag
agaacttgaa gcagaatttg 900cccatcataa ttatctcacc agactgaggc gatacgagat
agcagtgaat ctggatctca 960ctgaaagaca ggtgaaagtc tggttccaaa acaggcggat
gaagtggaag agggtaaagg 1020gtggacagca aggagctgcg gctcgggaaa aggaactggt
gaatgtgaaa aagggaacac 1080ttctcccatc agagctgtcg ggaattggtg cagccaccct
ccagcaaaca ggggactcta 1140tagcaaatga agacagtcac gacagtgacc acagctcaga
gcatgcgcac ttatgatata 1200aacagaggac cagctccatt ctcaggaaag aaatgttgtg
atggcaagcc ttacccaaat 1260atcgtttaca cagagagatg actatggcag tgatgtttaa
tattattaaa tccaggcatt 1320tcgaatctgt ttttcatgat ttatagaggg tttacacaaa
gtgccactta ttaaagagct 1380tccacagtga agatggagaa ggtgaacttg ctttgaatat
tccagatgtg tttggtcgtg 1440cgtatggcag tgagcaggta tgtgtttgct tttgcttgca
ctgaaaatta aattgctatc 1500aagagcaaac tatgaacggt tttttattca agatgtctcc
agagtgaaga tgccgaggat 1560gaacttgcat tgaacattcc agatgtgtga gatcatgtgt
attacagtgg gcaggtattt 1620gcttttgctt gcactgaaaa ttaaattgct atcaagaata
aaccatgaaa cattttatcc 1680tgaacagcca cagtgcctga attcactcaa gtggataaaa
agtgtatttt aactctgtat 1740atattaccct taagtcattt tcctgtcttc actaatttag
caatgcattc atattagctg 1800atgaaaatag gcactcacaa tgacaaccag agccagtttc
ttgtcttttt tatacatttt 1860gtcatcccag agacaatcag tatgtgctta cctgtgttca
agtagagaaa aatacagtag 1920agtctgatag gacatattct tgtaccacag acaaaacaaa
tcttatgttg catttactat 1980caactgctgc taatacgtta ttataaaact tacctagctc
ctgaattctt cctatcttat 2040agcttaaaac aattaggatc ataggcaaat cagttacctt
gcagaaagag ctttgtatga 2100cagacattgt cttattttat ttctgtaaaa tattagctgt
atgaatatga tttaattaac 2160aagaaaacat ttcttcctga ttgacaacag tgttagacaa
ggtgcaaagc gaaactggtt 2220gctcaagttg atagaaaaca aaattctgaa tatcttcaaa
ttaaattcgg taaaaacaca 2280ttattttttc atatgtgatg tattcatgca gaacaactat
ctttgtattt tgtttttaaa 2340atgtgtttaa taaatgatcc tttgtaaata aaaaaaaaaa
aaa 2383776695DNAHomo sapiens 77gccctcgccg cccgcggcgc
cccgagcgct ttgtgagcag atgcggagcc gagtggaggg 60cgcgagccag atgcggggcg
acagctgact tgctgagagg aggcggggag gcgcggagcg 120cgcgtgtggt ccttgcgccg
ctgacttctc cactggttcc tgggcaccga aagataaacc 180tctcataatg aaggcccccg
ctgtgcttgc acctggcatc ctcgtgctcc tgtttacctt 240ggtgcagagg agcaatgggg
agtgtaaaga ggcactagca aagtccgaga tgaatgtgaa 300tatgaagtat cagcttccca
acttcaccgc ggaaacaccc atccagaatg tcattctaca 360tgagcatcac attttccttg
gtgccactaa ctacatttat gttttaaatg aggaagacct 420tcagaaggtt gctgagtaca
agactgggcc tgtgctggaa cacccagatt gtttcccatg 480tcaggactgc agcagcaaag
ccaatttatc aggaggtgtt tggaaagata acatcaacat 540ggctctagtt gtcgacacct
actatgatga tcaactcatt agctgtggca gcgtcaacag 600agggacctgc cagcgacatg
tctttcccca caatcatact gctgacatac agtcggaggt 660tcactgcata ttctccccac
agatagaaga gcccagccag tgtcctgact gtgtggtgag 720cgccctggga gccaaagtcc
tttcatctgt aaaggaccgg ttcatcaact tctttgtagg 780caataccata aattcttctt
atttcccaga tcatccattg cattcgatat cagtgagaag 840gctaaaggaa acgaaagatg
gttttatgtt tttgacggac cagtcctaca ttgatgtttt 900acctgagttc agagattctt
accccattaa gtatgtccat gcctttgaaa gcaacaattt 960tatttacttc ttgacggtcc
aaagggaaac tctagatgct cagacttttc acacaagaat 1020aatcaggttc tgttccataa
actctggatt gcattcctac atggaaatgc ctctggagtg 1080tattctcaca gaaaagagaa
aaaagagatc cacaaagaag gaagtgttta atatacttca 1140ggctgcgtat gtcagcaagc
ctggggccca gcttgctaga caaataggag ccagcctgaa 1200tgatgacatt cttttcgggg
tgttcgcaca aagcaagcca gattctgccg aaccaatgga 1260tcgatctgcc atgtgtgcat
tccctatcaa atatgtcaac gacttcttca acaagatcgt 1320caacaaaaac aatgtgagat
gtctccagca tttttacgga cccaatcatg agcactgctt 1380taataggaca cttctgagaa
attcatcagg ctgtgaagcg cgccgtgatg aatatcgaac 1440agagtttacc acagctttgc
agcgcgttga cttattcatg ggtcaattca gcgaagtcct 1500cttaacatct atatccacct
tcattaaagg agacctcacc atagctaatc ttgggacatc 1560agagggtcgc ttcatgcagg
ttgtggtttc tcgatcagga ccatcaaccc ctcatgtgaa 1620ttttctcctg gactcccatc
cagtgtctcc agaagtgatt gtggagcata cattaaacca 1680aaatggctac acactggtta
tcactgggaa gaagatcacg aagatcccat tgaatggctt 1740gggctgcaga catttccagt
cctgcagtca atgcctctct gccccaccct ttgttcagtg 1800tggctggtgc cacgacaaat
gtgtgcgatc ggaggaatgc ctgagcggga catggactca 1860acagatctgt ctgcctgcaa
tctacaaggt tttcccaaat agtgcacccc ttgaaggagg 1920gacaaggctg accatatgtg
gctgggactt tggatttcgg aggaataata aatttgattt 1980aaagaaaact agagttctcc
ttggaaatga gagctgcacc ttgactttaa gtgagagcac 2040gatgaataca ttgaaatgca
cagttggtcc tgccatgaat aagcatttca atatgtccat 2100aattatttca aatggccacg
ggacaacaca atacagtaca ttctcctatg tggatcctgt 2160aataacaagt atttcgccga
aatacggtcc tatggctggt ggcactttac ttactttaac 2220tggaaattac ctaaacagtg
ggaattctag acacatttca attggtggaa aaacatgtac 2280tttaaaaagt gtgtcaaaca
gtattcttga atgttatacc ccagcccaaa ccatttcaac 2340tgagtttgct gttaaattga
aaattgactt agccaaccga gagacaagca tcttcagtta 2400ccgtgaagat cccattgtct
atgaaattca tccaaccaaa tcttttatta gtacttggtg 2460gaaagaacct ctcaacattg
tcagttttct attttgcttt gccagtggtg ggagcacaat 2520aacaggtgtt gggaaaaacc
tgaattcagt tagtgtcccg agaatggtca taaatgtgca 2580tgaagcagga aggaacttta
cagtggcatg tcaacatcgc tctaattcag agataatctg 2640ttgtaccact ccttccctgc
aacagctgaa tctgcaactc cccctgaaaa ccaaagcctt 2700tttcatgtta gatgggatcc
tttccaaata ctttgatctc atttatgtac ataatcctgt 2760gtttaagcct tttgaaaagc
cagtgatgat ctcaatgggc aatgaaaatg tactggaaat 2820taagggaaat gatattgacc
ctgaagcagt taaaggtgaa gtgttaaaag ttggaaataa 2880gagctgtgag aatatacact
tacattctga agccgtttta tgcacggtcc ccaatgacct 2940gctgaaattg aacagcgagc
taaatataga gtggaagcaa gcaatttctt caaccgtcct 3000tggaaaagta atagttcaac
cagatcagaa tttcacagga ttgattgctg gtgttgtctc 3060aatatcaaca gcactgttat
tactacttgg gtttttcctg tggctgaaaa agagaaagca 3120aattaaagat ctgggcagtg
aattagttcg ctacgatgca agagtacaca ctcctcattt 3180ggataggctt gtaagtgccc
gaagtgtaag cccaactaca gaaatggttt caaatgaatc 3240tgtagactac cgagctactt
ttccagaaga tcagtttcct aattcatctc agaacggttc 3300atgccgacaa gtgcagtatc
ctctgacaga catgtccccc atcctaacta gtggggactc 3360tgatatatcc agtccattac
tgcaaaatac tgtccacatt gacctcagtg ctctaaatcc 3420agagctggtc caggcagtgc
agcatgtagt gattgggccc agtagcctga ttgtgcattt 3480caatgaagtc ataggaagag
ggcattttgg ttgtgtatat catgggactt tgttggacaa 3540tgatggcaag aaaattcact
gtgctgtgaa atccttgaac agaatcactg acataggaga 3600agtttcccaa tttctgaccg
agggaatcat catgaaagat tttagtcatc ccaatgtcct 3660ctcgctcctg ggaatctgcc
tgcgaagtga agggtctccg ctggtggtcc taccatacat 3720gaaacatgga gatcttcgaa
atttcattcg aaatgagact cataatccaa ctgtaaaaga 3780tcttattggc tttggtcttc
aagtagccaa aggcatgaaa tatcttgcaa gcaaaaagtt 3840tgtccacaga gacttggctg
caagaaactg tatgctggat gaaaaattca cagtcaaggt 3900tgctgatttt ggtcttgcca
gagacatgta tgataaagaa tactatagtg tacacaacaa 3960aacaggtgca aagctgccag
tgaagtggat ggctttggaa agtctgcaaa ctcaaaagtt 4020taccaccaag tcagatgtgt
ggtcctttgg cgtgctcctc tgggagctga tgacaagagg 4080agccccacct tatcctgacg
taaacacctt tgatataact gtttacttgt tgcaagggag 4140aagactccta caacccgaat
actgcccaga ccccttatat gaagtaatgc taaaatgctg 4200gcaccctaaa gccgaaatgc
gcccatcctt ttctgaactg gtgtcccgga tatcagcgat 4260cttctctact ttcattgggg
agcactatgt ccatgtgaac gctacttatg tgaacgtaaa 4320atgtgtcgct ccgtatcctt
ctctgttgtc atcagaagat aacgctgatg atgaggtgga 4380cacacgacca gcctccttct
gggagacatc atagtgctag tactatgtca aagcaacagt 4440ccacactttg tccaatggtt
ttttcactgc ctgaccttta aaaggccatc gatattcttt 4500gctcttgcca aaattgcact
attataggac ttgtattgtt atttaaatta ctggattcta 4560aggaatttct tatctgacag
agcatcagaa ccagaggctt ggtcccacag gccacggacc 4620aatggcctgc agccgtgaca
acactcctgt catattggag tccaaaactt gaattctggg 4680ttgaattttt taaaaatcag
gtaccacttg atttcatatg ggaaattgaa gcaggaaata 4740ttgagggctt cttgatcaca
gaaaactcag aagagatagt aatgctcagg acaggagcgg 4800cagccccaga acaggccact
catttagaat tctagtgttt caaaacactt ttgtgtgttg 4860tatggtcaat aacatttttc
attactgatg gtgtcattca cccattaggt aaacattccc 4920ttttaaatgt ttgtttgttt
tttgagacag gatctcactc tgttgccagg gctgtagtgc 4980agtggtgtga tcatagctca
ctgcaacctc cacctcccag gctcaagcct cccgaatagc 5040tgggactaca ggcgcacacc
accatccccg gctaattttt gtattttttg tagagacggg 5100gttttgccat gttgccaagg
ctggtttcaa actcctggac tcaagaaatc cacccacctc 5160agcctcccaa agtgctagga
ttacaggcat gagccactgc gcccagccct tataaatttt 5220tgtatagaca ttcctttggt
tggaagaata tttataggca atacagtcaa agtttcaaaa 5280tagcatcaca caaaacatgt
ttataaatga acaggatgta atgtacatag atgacattaa 5340gaaaatttgt atgaaataat
ttagtcatca tgaaatattt agttgtcata taaaaaccca 5400ctgtttgaga atgatgctac
tctgatctaa tgaatgtgaa catgtagatg ttttgtgtgt 5460atttttttaa atgaaaactc
aaaataagac aagtaatttg ttgataaata tttttaaaga 5520taactcagca tgtttgtaaa
gcaggataca ttttactaaa aggttcattg gttccaatca 5580cagctcatag gtagagcaaa
gaaagggtgg atggattgaa aagattagcc tctgtctcgg 5640tggcaggttc ccacctcgca
agcaattgga aacaaaactt ttggggagtt ttattttgca 5700ttagggtgtg ttttatgtta
agcaaaacat actttagaaa caaatgaaaa aggcaattga 5760aaatcccagc tatttcacct
agatggaata gccaccctga gcagaacttt gtgatgcttc 5820attctgtgga attttgtgct
tgctactgta tagtgcatgt ggtgtaggtt actctaactg 5880gttttgtcga cgtaaacatt
taaagtgtta tattttttat aaaaatgttt atttttaatg 5940atatgagaaa aattttgtta
ggccacaaaa acactgcact gtgaacattt tagaaaaggt 6000atgtcagact gggattaatg
acagcatgat tttcaatgac tgtaaattgc gataaggaaa 6060tgtactgatt gccaatacac
cccaccctca ttacatcatc aggacttgaa gccaagggtt 6120aacccagcaa gctacaaaga
gggtgtgtca cactgaaact caatagttga gtttggctgt 6180tgttgcagga aaatgattat
aactaaaagc tctctgatag tgcagagact taccagaaga 6240cacaaggaat tgtactgaag
agctattaca atccaaatat tgccgtttca taaatgtaat 6300aagtaatact aattcacaga
gtattgtaaa tggtggatga caaaagaaaa tctgctctgt 6360ggaaagaaag aactgtctct
accagggtca agagcatgaa cgcatcaata gaaagaactc 6420ggggaaacat cccatcaaca
ggactacaca cttgtatata cattcttgag aacactgcaa 6480tgtgaaaatc acgtttgcta
tttataaact tgtccttaga ttaatgtgtc tggacagatt 6540gtgggagtaa gtgattcttc
taagaattag atacttgtca ctgcctatac ctgcagctga 6600actgaatggt acttcgtatg
ttaatagttg ttctgataaa tcatgcaatt aaagtaaagt 6660gatgcaacat cttgtaaaaa
aaaaaaaaaa aaaaa 6695782276DNAHomo sapiens
78aagcccagca gccccggggc ggatggctcc ggccgcctgg ctccgcagcg cggccgcgcg
60cgccctcctg cccccgatgc tgctgctgct gctccagccg ccgccgctgc tggcccgggc
120tctgccgccg gacgcccacc acctccatgc cgagaggagg gggccacagc cctggcatgc
180agccctgccc agtagcccgg cacctgcccc tgccacgcag gaagcccccc ggcctgccag
240cagcctcagg cctccccgct gtggcgtgcc cgacccatct gatgggctga gtgcccgcaa
300ccgacagaag aggttcgtgc tttctggcgg gcgctgggag aagacggacc tcacctacag
360gatccttcgg ttcccatggc agttggtgca ggagcaggtg cggcagacga tggcagaggc
420cctaaaggta tggagcgatg tgacgccact cacctttact gaggtgcacg agggccgtgc
480tgacatcatg atcgacttcg ccaggtactg gcatggggac gacctgccgt ttgatgggcc
540tgggggcatc ctggcccatg ccttcttccc caagactcac cgagaagggg atgtccactt
600cgactatgat gagacctgga ctatcgggga tgaccagggc acagacctgc tgcaggtggc
660agcccatgaa tttggccacg tgctggggct gcagcacaca acagcagcca aggccctgat
720gtccgccttc tacacctttc gctacccact gagtctcagc ccagatgact gcaggggcgt
780tcaacaccta tatggccagc cctggcccac tgtcacctcc aggaccccag ccctgggccc
840ccaggctggg atagacacca atgagattgc accgctggag ccagacgccc cgccagatgc
900ctgtgaggcc tcctttgacg cggtctccac catccgaggc gagctctttt tcttcaaagc
960gggctttgtg tggcgcctcc gtgggggcca gctgcagccc ggctacccag cattggcctc
1020tcgccactgg cagggactgc ccagccctgt ggacgctgcc ttcgaggatg cccagggcca
1080catttggttc ttccaaggtg ctcagtactg ggtgtacgac ggtgaaaagc cagtcctggg
1140ccccgcaccc ctcaccgagc tgggcctggt gaggttcccg gtccatgctg ccttggtctg
1200gggtcccgag aagaacaaga tctacttctt ccgaggcagg gactactggc gtttccaccc
1260cagcacccgg cgtgtagaca gtcccgtgcc ccgcagggcc actgactgga gaggggtgcc
1320ctctgagatc gacgctgcct tccaggatgc tgatggctat gcctacttcc tgcgcggccg
1380cctctactgg aagtttgacc ctgtgaaggt gaaggctctg gaaggcttcc cccgtctcgt
1440gggtcctgac ttctttggct gtgccgagcc tgccaacact ttcctctgac catggcttgg
1500atgccctcag gggtgctgac ccctgccagg ccacgaatat caggctagag acccatggcc
1560atctttgtgg ctgtgggcac caggcatggg actgagccca tgtctcctca gggggatggg
1620gtggggtaca accaccatga caactgccgg gagggccacg caggtcgtgg tcacctgcca
1680gcgactgtct cagactgggc agggaggctt tggcatgact taagaggaag ggcagtcttg
1740ggcccgctat gcaggtcctg gcaaacctgg ctgccctgtc tccatccctg tccctcaggg
1800tagcaccatg gcaggactgg gggaactgga gtgtccttgc tgtatccctg ttgtgaggtt
1860ccttccaggg gctggcactg aagcaagggt gctggggccc catggccttc agccctggct
1920gagcaactgg gctgtagggc agggccactt cctgaggtca ggtcttggta ggtgcctgca
1980tctgtctgcc ttctggctga caatcctgga aatctgttct ccagaatcca ggccaaaaag
2040ttcacagtca aatggggagg ggtattcttc atgcaggaga ccccaggccc tggaggctgc
2100aacatacctc aatcctgtcc caggccggat cctcctgaag cccttttcgc agcactgcta
2160tcctccaaag ccattgtaaa tgtgtgtaca gtgtgtataa accttcttct tctttttttt
2220tttttaaact gaggattgtc attaaacaca gttgttttct aaaaaaaaaa aaaaaa
2276791369DNAHomo sapiens 79aaacaggaaa taaatacgaa tgaaactgag ctctaagcag
catgtaacct ggcctgcatc 60caggaaatag aggacttcgg atccttctaa ccctaccacc
caactggccc cagtacattc 120attctctcag gaaaaaaaac aaggtcccca cagcaaagaa
aaggaatagg atcaagagat 180acgtggctgc tggcagagca agcatgaatt cgatgacttc
agcagttccg gtggccaatt 240ctgtgttggt ggtggcaccc cacaatggtt atcctgtgac
cccaggaatt atgtctcacg 300tgcccctgta tccaaacagc cagccgcaag tccacctagt
tcctgggaac ccacctagtt 360tggtgtcgaa tgtgaatggg cagcctgtgc agaaagctct
gaaagaaggc aaaaccttgg 420gggccatcca gatcatcatt ggcctggctc acatcggcct
cggctccatc atggcgacgg 480ttctcgtagg ggaatacctg tctatttcat tctacggagg
ctttcccttc tggggaggct 540tgtggtttat catttcagga tctctctccg tggcagcaga
aaatcagcca tattcttatt 600gcctgctgtc tggcagtttg ggcttgaaca tcgtcagtgc
aatctgctct gcagttggag 660tcatactctt catcacagat ctaagtattc cccacccata
tgcctacccc gactattatc 720cttacgcctg gggtgtgaac cctggaatgg cgatttctgg
cgtgctgctg gtcttctgcc 780tcctggagtt tggcatcgca tgcgcatctt cccactttgg
ctgccagttg gtctgctgtc 840aatcaagcaa tgtgagtgtc atctatccaa acatctatgc
agcaaaccca gtgatcaccc 900cagaaccggt gacctcacca ccaagttatt ccagtgagat
ccaagcaaat aagtaaggct 960acagattctg gaagcatctt tcactgggac caaaagaagt
cctcctccct ttctgggctt 1020ccataaccca ggtcgttcct gttctgacag ctgaggaaac
gtctctccca ctgtttgtac 1080tctcaccttc attcttcaat tcagtctagg aaaccatgct
gtttctctat caagaagaag 1140acagagattt taaacagatg ttaaccaaga gggactccct
agggcacatg catcagcaca 1200tatgtgggca tccagcctct ggggccttgg cacacacaca
ttcgtgtgct ctgctgcatg 1260tgagcttgtg ggttagagga acaaatatct agacattcaa
tcttcactct ttcaattgtg 1320cattcattta ataaatagat actgagcatt caaaaaaaaa
aaaaaaaaa 1369802187DNAHomo sapiens 80tgccaggctc tccaccccca
cttcccaatt gaggaaaccg aggcagagga ggctcagcgc 60cacgcactcc tctttctgcc
tggccggcca ctcccgtctg ctgtgacgcg cggacagaga 120gctaccggtg gacccacggt
gcctccctcc ctgggatcta cacagaccat ggccttgcca 180acggctcgac ccctgttggg
gtcctgtggg acccccgccc tcggcagcct cctgttcctg 240ctcttcagcc tcggatgggt
gcagccctcg aggaccctgg ctggagagac agggcaggag 300gctgcgcccc tggacggagt
cctggccaac ccacctaaca tttccagcct ctcccctcgc 360caactccttg gcttcccgtg
tgcggaggtg tccggcctga gcacggagcg tgtccgggag 420ctggctgtgg ccttggcaca
gaagaatgtc aagctctcaa cagagcagct gcgctgtctg 480gctcaccggc tctctgagcc
ccccgaggac ctggacgccc tcccattgga cctgctgcta 540ttcctcaacc cagatgcgtt
ctcggggccc caggcctgca cccgtttctt ctcccgcatc 600acgaaggcca atgtggacct
gctcccgagg ggggctcccg agcgacagcg gctgctgcct 660gcggctctgg cctgctgggg
tgtgcggggg tctctgctga gcgaggctga tgtgcgggct 720ctgggaggcc tggcttgcga
cctgcctggg cgctttgtgg ccgagtcggc cgaagtgctg 780ctaccccggc tggtgagctg
cccgggaccc ctggaccagg accagcagga ggcagccagg 840gcggctctgc agggcggggg
acccccctac ggccccccgt cgacatggtc tgtctccacg 900atggacgctc tgcggggcct
gctgcccgtg ctgggccagc ccatcatccg cagcatcccg 960cagggcatcg tggccgcgtg
gcggcaacgc tcctctcggg acccatcctg gcggcagcct 1020gaacggacca tcctccggcc
gcggttccgg cgggaagtgg agaagacagc ctgtccttca 1080ggcaagaagg cccgcgagat
agacgagagc ctcatcttct acaagaagtg ggagctggaa 1140gcctgcgtgg atgcggccct
gctggccacc cagatggacc gcgtgaacgc catccccttc 1200acctacgagc agctggacgt
cctaaagcat aaactggatg agctctaccc acaaggttac 1260cccgagtctg tgatccagca
cctgggctac ctcttcctca agatgagccc tgaggacatt 1320cgcaagtgga atgtgacgtc
cctggagacc ctgaaggctt tgcttgaagt caacaaaggg 1380cacgaaatga gtcctcaggt
ggccaccctg atcgaccgct ttgtgaaggg aaggggccag 1440ctagacaaag acaccctaga
caccctgacc gccttctacc ctgggtacct gtgctccctc 1500agccccgagg agctgagctc
cgtgcccccc agcagcatct gggcggtcag gccccaggac 1560ctggacacgt gtgacccaag
gcagctggac gtcctctatc ccaaggcccg ccttgctttc 1620cagaacatga acgggtccga
atacttcgtg aagatccagt ccttcctggg tggggccccc 1680acggaggatt tgaaggcgct
cagtcagcag aatgtgagca tggacttggc cacgttcatg 1740aagctgcgga cggatgcggt
gctgccgttg actgtggctg aggtgcagaa acttctggga 1800ccccacgtgg agggcctgaa
ggcggaggag cggcaccgcc cggtgcggga ctggatccta 1860cggcagcggc aggacgacct
ggacacgctg gggctggggc tacagggcgg catccccaac 1920ggctacctgg tcctagacct
cagcatgcaa gaggccctct cggggacgcc ctgcctccta 1980ggacctggac ctgttctcac
cgtcctggca ctgctcctag cctccaccct ggcctgaggg 2040ccccactccc ttgctggccc
cagccctgct ggggatcccc gcctggccag gagcaggcac 2100gggtggtccc cgttccaccc
caagagaact cgcgctcagt aaacgggaac atgccccctg 2160cagacacgta aaaaaaaaaa
aaaaaaa 2187816882DNAHomo sapiens
81gggagatttg gacgctccgg cctgggaggt gcgtcagatc cgagctcgcc atccagtttc
60ctctccacta gtccccccag ttggagatct gggaccaaca aggcaccatg gcgcagaagg
120gccaactcag tgacgatgag aagttcctct ttgtggacaa aaacttcatc aacagcccag
180tggcccaggc tgactgggcc gccaagagac tcgtctgggt cccctcggag aagcagggct
240tcgaggcagc cagcattaag gaggagaagg gggatgaggt ggttgtggag ctggtggaga
300atggcaagaa ggtcacggtt gggaaagatg acatccagaa gatgaaccca cccaagttct
360ccaaggtgga ggacatggcg gagctgacgt gcctcaacga agcctccgtg ctacacaacc
420tgagggagcg gtacttctca gggctaatat atacgtactc tggcctcttc tgcgtggtgg
480tcaaccccta taaacacctg cccatctact cggagaagat cgtcgacatg tacaagggca
540agaagaggca cgagatgccg cctcacatct acgccatcgc agacacggcc taccggagca
600tgcttcaaga tcgggaggac cagtccattc tatgcacagg cgagtctgga gccgggaaaa
660ccgaaaacac caagaaggtc attcagtacc tggccgtggt ggcctcctcc cacaagggca
720agaaagacac aagtatcacg ggagagctgg aaaagcagct tctacaagca aacccgattc
780tggaggcttt cggcaacgcc aaaacagtga agaacgacaa ctcctcacga ttcggcaaat
840tcatccgcat caacttcgac gtcacgggtt acatcgtggg agccaacatt gagacctatc
900tgctagaaaa atcacgggca attcgccaag ccagagacga gaggacattc cacatctttt
960actacatgat tgctggagcc aaggagaaga tgagaagtga cttgcttttg gagggcttca
1020acaactacac cttcctctcc aatggctttg tgcccatccc agcagcccag gatgatgaga
1080tgttccagga aaccgtggag gccatggcaa tcatgggttt cagcgaggag gagcagctat
1140ccatattgaa ggtggtatca tcggtcctgc agcttggaaa tatcgtcttc aagaaggaaa
1200gaaacacaga ccaggcgtcc atgccagata acacagctgc tcagaaagtt tgccacctca
1260tgggaattaa tgtgacagat ttcaccagat ccatcctcac tcctcgtatc aaggttgggc
1320gagatgtggt acagaaagct cagacaaaag aacaggctga ctttgctgta gaggctttgg
1380ccaaggcaac atatgagcgc cttttccgct ggatactcac ccgcgtgaac aaagccctgg
1440acaagaccca tcggcaaggg gcttccttcc tggggatcct ggatatagct ggatttgaga
1500tctttgaggt gaactccttc gagcagctgt gcatcaacta caccaacgag aagctgcagc
1560agctcttcaa ccacaccatg ttcatcctgg agcaggagga gtaccagcgc gagggcatcg
1620agtggaactt catcgacttt gggctggacc tacagccctg catcgagctc atcgagcgac
1680cgaacaaccc tccaggtgtg ctggccctgc tggacgagga atgctggttc cccaaagcca
1740cggacaagtc tttcgtggag aagctgtgca cggagcaggg cagccacccc aagttccaga
1800agcccaagca gctcaaggac aagactgagt tctccatcat ccattatgct gggaaggtgg
1860actataatgc gagtgcctgg ctgaccaaga atatggaccc gctgaatgac aacgtgactt
1920ccctgctcaa tgcctcctcc gacaagtttg tggccgacct gtggaaggac gtggaccgca
1980tcgtgggcct ggaccagatg gccaagatga cggagagctc gctgcccagc gcctccaaga
2040ccaagaaggg catgttccgc acagtggggc agctgtacaa ggagcagctg ggcaagctga
2100tgaccacgct acgcaacacc acgcccaact tcgtgcgctg catcatcccc aaccacgaga
2160agaggtccgg caagctggat gcgttcctgg tgctggagca gctgcggtgc aatggggtgc
2220tggaaggcat tcgcatctgc cggcagggct tccccaaccg gatcgtcttc caggagttcc
2280gccaacgcta cgagatcctg gcggcgaatg ccatccccaa aggcttcatg gacgggaagc
2340aggcctgcat tctcatgatc aaagccctgg aacttgaccc caacttatac aggatagggc
2400agagcaaaat cttcttccga actggcgtcc tggcccacct agaggaggag cgagatttga
2460agatcaccga tgtcatcatg gccttccagg cgatgtgtcg tggctacttg gccagaaagg
2520cttttgccaa gaggcagcag cagctgaccg ccatgaaggt gattcagagg aactgcgccg
2580cctacctcaa gctgcggaac tggcagtggt ggaggctttt caccaaagtg aagccactgc
2640tgcaggtgac acggcaggag gaggagatgc aggccaagga ggatgaactg cagaagacca
2700aggagcggca gcagaaggca gagaatgagc ttaaggagct ggaacagaag cactcgcagc
2760tgaccgagga gaagaacctg ctacaggaac agctgcaggc agagacagag ctgtatgcag
2820aggctgagga gatgcgggtg cggctggcgg ccaagaagca ggagctggag gagatactgc
2880atgagatgga ggcccgcctg gaggaggagg aagacagggg ccagcagcta caggctgaaa
2940ggaagaagat ggcccagcag atgctggacc ttgaagaaca gctggaggag gaggaagctg
3000ccaggcagaa gctgcaactt gagaaggtca cggctgaggc caagatcaag aaactggagg
3060atgagatcct ggtcatggat gatcagaaca ataaactatc aaaagaacga aaactccttg
3120aggagaggat tagtgactta acgacaaatc ttgcagaaga ggaagaaaag gccaagaatc
3180ttaccaagct gaaaaacaag catgaatcta tgatttcaga actggaagtg cggctaaaga
3240aggaagagaa gagccgacag gagctggaga agctgaaacg gaagctggag ggtgatgcca
3300gcgacttcca cgagcagatc gctgacctcc aggcgcagat cgcagagctc aagatgcagc
3360tggccaagaa ggaggaggag ctgcaggcgg ccctggccag gcttgacgat gaaatcgctc
3420agaagaacaa tgccctgaag aagatccggg agctggaggg ccacatctca gacctccagg
3480aggacctgga ctcagagcgg gccgccagga acaaggctga aaagcagaag cgagacctcg
3540gcgaggagct ggaggcccta aagacagagc tggaagacac actggacagc acagccactc
3600agcaggagct cagggccaag agggagcagg aggtgacggt gctgaagaag gccctggatg
3660aagagacgcg gtcccatgag gctcaggtcc aggagatgag gcagaaacac gcacaggcgg
3720tggaggagct cacagagcag cttgagcagt tcaagagggc caaggcgaac ctagacaaga
3780ataagcagac gctggagaaa gagaacgcag acctggccgg ggagctgcgg gtcctgggcc
3840aggccaagca ggaggtggaa cataagaaga agaagctgga ggcgcaggtg caggagctgc
3900agtccaagtg cagcgatggg gagcgggccc gggcggagct caatgacaaa gtccacaagc
3960tgcagaatga agttgagagc gtcacaggga tgcttaacga ggccgagggg aaggccatta
4020agctggccaa ggacgtggcg tccctcagtt cccagctcca ggacacccag gagctgcttc
4080aagaagaaac ccggcagaag ctcaacgtgt ctacgaagct gcgccagctg gaggaggagc
4140ggaacagcct gcaagaccag ctggacgagg agatggaggc caagcagaac ctggagcgcc
4200acatctccac tctcaacatc cagctctccg actcgaagaa gaagctgcag gactttgcca
4260gcaccgtgga agctctggaa gaggggaaga agaggttcca gaaggagatc gagaacctca
4320cccagcagta cgaggagaag gcggccgctt atgataaact ggaaaagacc aagaacaggc
4380ttcagcagga gctggacgac ctggttgttg atttggacaa ccagcggcaa ctcgtgtcca
4440acctggaaaa gaagcagagg aaatttgatc agttgttagc cgaggagaaa aacatctctt
4500ccaaatacgc ggatgagagg gacagagctg aggcagaagc cagggagaag gaaaccaagg
4560ccctgtccct ggctcgggcc cttgaagagg ccttggaagc caaagaggaa ctcgagcgga
4620ccaacaaaat gctcaaagcc gaaatggaag acctggtcag ctccaaggat gacgtgggca
4680agaacgtcca tgagctggag aagtccaagc gggccctgga gacccagatg gaggagatga
4740agacgcagct ggaagagctg gaggacgagc tgcaagccac ggaggacgcc aaactgcggc
4800tggaagtcaa catgcaggcg ctcaagggcc agttcgaaag ggatctccaa gcccgggacg
4860agcagaatga ggagaagagg aggcaactgc agagacagct tcacgagtat gagacggaac
4920tggaagacga gcgaaagcaa cgtgccctgg cagctgcagc aaagaagaag ctggaagggg
4980acctgaaaga cctggagctt caggccgact ctgccatcaa ggggagggag gaagccatca
5040agcagctacg caaactgcag gctcagatga aggactttca aagagagctg gaagatgccc
5100gtgcctccag agatgagatc tttgccacag ccaaagagaa tgagaagaaa gccaagagct
5160tggaagcaga cctcatgcag ctacaagagg acctcgccgc cgctgagagg gctcgcaaac
5220aagcggacct cgagaaggag gaactggcag aggagctggc cagtagcctg tcgggaagga
5280acgcactcca ggacgagaag cgccgcctgg aggcccggat cgcccagctg gaggaggagc
5340tggaggagga gcagggcaac atggaggcca tgagcgaccg ggtccgcaaa gccacacagc
5400aggccgagca gctcagcaac gagctggcca cagagcgcag cacggcccag aagaatgaga
5460gtgcccggca gcagctcgag cggcagaaca aggagctccg gagcaagctc cacgagatgg
5520agggggccgt caagtccaag ttcaagtcca ccatcgcggc gctggaggcc aagattgcac
5580agctggagga gcaggtcgag caggaggcca gagagaaaca ggcggccacc aagtcgctga
5640agcagaaaga caagaagctg aaggaaatct tgctgcaggt ggaggacgag cgcaagatgg
5700ccgagcagta caaggagcag gcagagaaag gcaatgccag ggtcaagcag ctcaagaggc
5760agctggagga ggcagaggag gagtcccagc gcatcaacgc caaccgcagg aagctgcagc
5820gggagctgga tgaggccacg gagagcaacg aggccatggg ccgcgaggtg aacgcactca
5880agagcaagct caggcgagga aacgagacct ctttcgttcc ttctagaagg tctggaggac
5940gtagagttat tgaaaatgca gatggttctg aggaggaaac ggacactcga gacgcagact
6000tcaatggaac caaggccagt gaataagcaa ctttctacag ttttgcacca cggcaagaaa
6060accaaaaacc aaaacaaaca aacaaaaaaa acccaacaac aacccagaac aaagcaaaac
6120ccagcagact gtacttagca ttgtctaaat ccattctcaa attccaaata tcacagacac
6180ccctcacaca aggaatataa aaaccaccac cctccagcct gggcaacgta gtaaaacctc
6240atctatacaa gaatttaaaa ataagctggg cgtggtggta cacacctgtg gtcccagcta
6300ctagggaggc tgagccagga agaacgctcc agcccaggac ttcgaggctg caatgagcta
6360taattgcatc attgcactcc agcctgggca acagagaccc tgtctcaacc accaccacca
6420ccaccacccc tactacccct gtattcaagg taaaaattga agtttgtatg atgtaagaga
6480tgagaaaaac ccaacaggaa acacagacac atcctccagt tctatcaatg gattgtgcag
6540acactgagtt tttagaaaaa catatccacg gtaaccggtc cctggcaatt ctgtttacat
6600gaaatgggga gaaagtcacc gaaatgggtg ccgccggccc ccactcccaa ttcattccct
6660aacctgcaaa cctttccaac ttctcacgtc aggcctttga gaattctttc cccctctcct
6720ggtttccaca cctcagacac gcacagttca ccaagtgcct tctgtagtca catgaattga
6780aaaggagacg ctgctcccac ggaggggagc aggaatgctg cactgtttac accctgactg
6840tgcttaaaaa cactttcact aataaatggt tataaatcac aa
6882823658DNAHomo sapiens 82aatgcaaggg ggagttcaat gaaactggga catctataca
catgtgaggg agcctgggct 60ggaagaggca gcaaaaggga aaatcagaag agtggacact
ggcaagagga gggcagcctt 120tttcccagct tccttgcacc atggacagct cccattaagc
cacctctcca tcctggggcc 180aggactctta tgccccattc ctgtcaaatt gagatttcat
ccaccattct ccaaggacag 240tgaagttata ccctagttcc agtgttggga tcagtggccc
ctctggacat gcctctcctg 300gaaggttctg tgggggtgga ggatcttgtc ctcctggaac
ccttggtgga ggagtcactg 360ctcaagaatc ttcagcttcg ctatgaaaac aaggagattt
atacctacat tgggaatgtg 420gtgatctcag tgaatcccta tcaacagctt cccatctatg
ggccagagtt cattgccaaa 480tatcaagact atactttcta tgagctgaag ccccatatct
acgcattggc aaatgtggcg 540taccagtcac tgagggacag ggaccgagac cagtgtatcc
tcatcacagg cgagagtgga 600tcagggaaga ctgaggccag caagctggtg atgtcttatg
tggctgccgt ctgtgggaaa 660ggagagcagg tgaactctgt gaaggagcag ctgctacagt
ctaacccagt gctggaggct 720tttggcaatg ccaagaccat tcgcaacaac aattcctccc
gatttggaaa atacatggat 780attgaatttg acttcaaggg atcccccctc ggtggtgtca
tcacaaacta tctgcttgag 840aaatcccgat tagtgaagca gctcaaagga gaaaggaact
tccacatctt ctatcagctg 900ctggctggag cagatgaaca gctgctgaag gccctgaagc
ttgagcggga tacaactggc 960tatgcctatc tgaatcatga agtatccaga gtggatggca
tggacgacgc ctccagcttc 1020agggctgtac agagtgcaat ggcagtgatt gggttctcgg
aggaggagat tcgacaagtg 1080ctagaggtga catccatggt gctaaagctg gggaacgtgt
tggtggctga tgagttccag 1140gccagtggga taccagcaag tggcatccgt gatgggagag
gtgttcggga gattggggag 1200atggtgggct tgaattcaga agaagtagag agagctttgt
gctcgaggac catggaaaca 1260gccaaggaaa aggtggtcac tgcactgaat gttatgcagg
ctcagtatgc tcgggacgcc 1320ctggctaaga acatctacag ccgcctcttt gactggatag
tgaatcgaat caatgagagc 1380atcaaggtgg gcatcgggga aaagaagaag gtaatgggag
tccttgatat ctacggtttt 1440gagatattag aggataatag ctttgagcaa tttgtgatca
actactgcaa tgagaagctg 1500cagcaggtgt tcatagagat gaccctgaaa gaagagcaag
aggaatataa gagagaaggc 1560ataccgtgga caaaggtgga ctactttgat aatggcatca
tttgtaagct cattgagcat 1620aatcagcgag gtatcctggc catgttggat gaggagtgcc
tgcggcctgg ggtggtcagt 1680gactccactt tcctagcaaa gctgaaccag ctcttctcca
agcatggcca ctacgagagc 1740aaagtcaccc agaatgccca gcgtcagtat gaccacacca
tgggcctcag ctgcttccgc 1800atctgccact atgcgggcaa ggtgacatac aacgtgacca
gctttattga caagaataat 1860gacctactct tccgagacct gttgcaggcc atgtggaagg
cccagcaccc cctccttcgg 1920tccttgtttc ctgagggcaa tcctaagcag gcatctctca
aacgcccccc gactgctggg 1980gcccagttca agagttctgt ggccatcctc atgaagaatc
tgtattccaa gagccccaac 2040tacatcaggt gcataaagcc caatgagcat cagcagcgag
gtcagttctc ttcagacctg 2100gtggcaaccc aggctcggta cctgggactg ctggagaacg
tacgggtgcg acgggcaggc 2160tatgcccacc gccagggtta tgggcccttc ctggaaaggt
accgattgct gagccggagc 2220acctggcctc actggaatgg gggagaccgg gaaggtgttg
agaaggtcct gggggagctg 2280agcatgtcct cgggggagct ggcctttggc aagacaaaga
tcttcattag aagccccaag 2340actcttttct acctcgaaga acagaggcgc ctgagactcc
agcagctggc cacactcata 2400cagaagattt accgaggctg gcgctgccgc acccactacc
aactgatgcg aaagagtcag 2460atcctcatct cctcttggtt tcggggaaac atgcaaaaga
aatgctatgg gaagataaag 2520gcatccgtgt tattgatcca ggcttttgtg agagggtgga
aggcccgaaa gaattatcgc 2580aaatatttcc ggtcagaggc tgccctcacc ttggcagatt
tcatctacaa gagcatggta 2640cagaaattcc tactggggct gaagaacaat ttgccatcca
caaacgtctt agacaagaca 2700tggccagccg ccccctacaa gtgcctcagc acagcaaatc
aggagctgca gcagctcttc 2760taccagtgga agtgcaagag gttccgggat cagctgtccc
cgaagcaggt agagatcctg 2820agggaaaagc tctgtgccag tgaactgttc aagggcaaga
aggcttcata tccccagagt 2880gtccccattc cattctgtgg tgactacatt gggctgcaag
ggaaccccaa gctgcagaag 2940ctgaaaggcg gggaggaggg gcctgttctg atggcagagg
ccgtgaagaa ggtcaatcgt 3000ggcaatggca agacttcttc tcggattctc ctcctgacca
agggccatgt gattctcaca 3060gacaccaaga agtcccaggc caaaattgtc attgggctag
acaatgtggc tggggtgtca 3120gtcaccagcc tcaaggatgg gctctttagc ttgcatctga
gtgagatgtc atcggtgggc 3180tccaaggggg acttcctgct ggtcagcgag catgtgattg
aactgctgac caaaatgtac 3240cgggctgtgc tggatgccac gcagaggcag cttacagtca
ccgtgactga gaagttctca 3300gtgaggttca aggagaacag tgtggctgtc aaggtcgtcc
agggccctgc aggtggtgac 3360aacagcaagc tacgctacaa aaaaaagggg agtcattgct
tggaggtgac tgtgcagtga 3420ggagggggca ccatgcagag atggcagttg cttcctcctg
aaccagcact aatccccctc 3480tgccctcctg tgtgggagga tctctaaccc ctctgatcgt
ggcgcatggc ttggggatta 3540aactaccctt gaagaggacc cttgtcccaa acccttcttg
ttctctcctc caaaagtagc 3600ttcctccaac ccgcagcctc tctgcacact aataaaacat
gtggcttgga aaggttca 3658834499DNAHomo sapiens 83gccgccgccg cggccaagcg
agcgccgtcg gggcgggtgg gcgggaagaa gcggcgggcc 60cgaggtgggg gggagcagag
agagcgcgcc caccaccttc ccttcccccc tcgatgggag 120cgggggcgtc ccggctcctg
cagccgccag aggagggaga gccgggggcc gtcgcttcgg 180agttggggct gagcagtcct
cggggagagc gcgccaagac cgctgcagcc gctggctgac 240ggaaggagag ttttacatgg
aagtggctta cagaaacttg gcgctgaggt gcagggaagc 300cagaaactct ttgtgtctct
aaggccgatg aggaatttgg aaacacatgt gggacataca 360agcgttggat atagaggact
gagcaggggg aggaacattt aagctgatgg aagtggaagt 420ggaagttgct gtacattggc
agcaaggcct ccgagttagc ttttgaatgc agttaactgg 480tttctcttaa ctgtggaatt
cattgaaaag tcagactccg agtggtcgtt ccaggatatc 540ttgaaaagcc caggttaaac
ccatccagag taatggctgc ggccttaccc aggaccctgg 600gggagttgca gctgtataga
atattacaaa aagccaatct actttcttat tttgatgcct 660ttatccaaca aggtggtgat
gatgtccagc aactctgtga agcaggagaa gaggagtttt 720tggaaatcat ggcactcgtg
ggcatggcta gcaagcccct tcatgttaga aggctgcaga 780aggctttgag agactgggtc
acaaaccctg ggcttttcaa tcagccactg acttcccttc 840ctgtcagtag catacccatc
tataaattac cagagggatc accaacatgg ctgggaatat 900cctgcagtag ttatgaaagg
agtagcaatg cccgggaacc tcatttaaaa atccccaaat 960gtgctgccac cacctgtgtg
cagagcttgg gacaggggaa gtcagatgtg gttgggagcc 1020tagcactgca gagtgttggt
gagtccagac tctggcaagg ccaccatgcc actgagagcg 1080agcacagcct ctccccagca
gacctgggct cccccgcgtc cccaaaggag agcagtgagg 1140cgctggatgc tgctgctgcg
ctctctgtgg ctgagtgtgt ggagcggatg gcccccacac 1200tgccaaaaag tgacttgaat
gaagtgaaag agctgctaaa aaccaacaag aagttggcca 1260aaatgattgg tcacatcttt
gagatgaacg atgatgatcc acacaaagag gaggaaattc 1320ggaaatacag tgcaatatat
ggcagatttg actcaaagag gaaggatggg aaacatctca 1380cacttcatga gctcactgtt
aatgaagcgg ctgctcaact ctgtgtgaag gataatgccc 1440tgctgacaag aagagatgag
ctttttgcct tggctcgaca gatttctcga gaagtcacct 1500ataaatatac ttacagaacc
accaagtcaa aatgtggaga aagagatgaa ttatccccaa 1560agagaattaa agtggaggat
gggtttccag atttccagga ttctgtgcaa acactcttcc 1620agcaggctag agctaagagt
gaagaacttg cagctcttag ttcacagcag cctgaaaagg 1680tgatggcaaa gcagatggag
ttcctttgca accaagctgg ctatgagaga ctgcagcatg 1740ccgagaggag gttgtctgca
gggctttaca ggcagagctc agaagagcac agtcctaacg 1800gcttgacttc cgataactca
gatggacaag gagaaagacc tttgaatctc cgaatgccta 1860atttacagaa cagacaaccc
catcattttg tggtggatgg ggagctgagc agactttacc 1920ccagtgaggc aaagtcccac
tcatcagaga gccttgggat tttaaaagac taccctcatt 1980cagcttttac cttagaaaag
aaagtcatca aaacagagcc tgaagattca agatagctgt 2040gatttctctc accgttctct
ggaaatggca tcagatttaa ggataatact ccatcataga 2100aataagcctt aataaccagt
gttgcctcat tcagctcaaa cagatttcat agccaaagca 2160aaaggactgg tacggtagtc
tgtggaaacc aggaagataa aacaacagcc acaaaagaga 2220aaatcaagag tgttgcaatc
tataacagta atattgattc attcacattc ctgtgttaag 2280tcattttata tggaaaggct
tacaaatcaa tattgtaagc attcattatt taagaatgta 2340caatgtattt gtgtaattta
tagaagtaaa atctagatgt tgagacctgt ttggtctaat 2400agatgtggat acagtttatt
ttacttgaaa ttttgttgtc tactttgtgt gtttaacgta 2460aatatatgtc agagtttaga
atctgcctgc agttgtgaaa aagaaagctt aagtgatgca 2520gttattggca agattgcaat
gattatggaa aaatagaaag cgaatactca gtttaagcca 2580aggaaaatat tgtggattta
atatttgata aaactgattt tgtttaacag gaaattttta 2640gcattcagtc atataacatc
tggttatcaa tgcacgttta cacaataaat acttgagtgg 2700aggaaagtta aaaagatgag
caatagagta gaaaatatat cttaaactag ttgacctaga 2760ttgtattaat agctacttaa
gatgtttcaa agataggaag ctattgcttg gacagagaac 2820ttgaaataag tggacccatg
tataaaagct ttgacttaaa cattgatatt tcagaatgtg 2880ttaaatagat taagacacag
taagttaacc ctacatgtta taaagatggc gactgttaac 2940aaaggctgta acagattaag
tactatttta tatccagaaa gtcttctcta tgtagagaag 3000tcagagagac tagatgcttt
cactagggaa tgtcttccca cccagccatc acaaatgtgg 3060acaatcactg catccacatc
tgtaggcata tttctatgga agtttaattg acagctatat 3120tcattattta ttttacaatt
tcatttttct acacctttga gatttatgaa tgcagttttt 3180tcttaaaatt tattttaact
tgacagtatg tttttagttc ccccaattta attaatggac 3240catgtgcata tatatgggag
tgtgcttaca tgttaataat ttacttgcat acttatgaga 3300atttcacatt ggaattcata
atggtaaaac aacatacatc tgccaatata cgttttttct 3360gttggtttaa gagaagataa
ctgacagctt tacctacttc ctacagatgc atctaaaccc 3420agatattact gagaagagtg
tattgactct gagtgtaaga gagtatgtgt ttttttgttt 3480ttagttctgc tctagatcat
aattgtaaaa aatattaagt cataatctgt tacactaaaa 3540tttgtcagcc aaatgttaga
tgaaatgtct gcactgtagt ctcagatcac tgtcacgtat 3600ataaattgct tcttcatttt
aatttgtaga agtactttac agtaggaaac gccagtaaac 3660aacttttata ctgttaaaag
gcttttttcc ccttcctaaa tgttttaatt gtaccatagt 3720gttttgctca ctgaagaagc
ttcttatgga ccttgcaact ttgttgctag cttgaggttg 3780attattgtgg ttgtattgtt
cactgtgtgt agaaatagta tgagtacgat ttcaatagac 3840tgttcagttt ttaatattag
ccatagcact ggttagtata tctcagtagt ttcatgaaac 3900gtttcctgta ttctaatcta
ttttgaaaca ttttgttttt ttttaattgt gtcttacagt 3960caagtttgta gattttcata
agccacaatt ttaaaagatg cagtaatctt ccaacttcca 4020atatttatcc attcgttgtg
gacccacaga ttgcatcttt aaattcataa taagtttcct 4080taactatctt atgtttctag
tctttcaagc ttagtgataa ggtggaagca caagaaaaat 4140ttcagtagaa tacagttttt
attttgtaaa cactaatgta ttaaacttgc tatacattaa 4200agcaaataat atatattttt
atttgaattg tatatgtgaa ttggaagtta taattagttg 4260attttttcat tttgttagag
gtattttcac tgaacaaggt caattggtta cctcagtatt 4320acagccaata tagtccaagg
gaccatttct ccccgagtct cttacacttt attgtgcgat 4380gtccacgttt ttgtgactct
tcaagctgtt ggtgaggtgg gacgaatgca cttgcttcct 4440gtggcaataa agattttctg
tgcctcacaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 4499843068DNAHomo sapiens
84cacatggaaa ctgttcttaa agctgctggg ccctgaaatt ttactcagca gtttgaaatc
60aagacatagc ttttctcatt caccctccca cttggggcta atgcacagac atgaacatct
120attgaggaaa accacaaaaa acttcaaaac agctacaacg gtatcctaag aatatttcaa
180ttaaatatta gtatgtctgc tgaaggcact taattattaa gaaacttaaa attatcaatc
240tttcttgaat ttctgataga gaagtaaaac tattttccaa aactattttt cagaatgttc
300actgatacat aaaaactgct agcatctaat taaagatcac taagggttaa atactgttct
360ctggccctta ctgcgcacac cctgccaaaa catcctctaa gcttttaaat attgcttcga
420tggtctgaat ttttatttcc agggaaaaag agagttttgt cccacagtca gcaggccact
480agtttattaa cttccagtca ccttgatttt tgctaaaatg aagactctgc agtctacact
540tctcctgtta ctgcttgtgc ctctgataaa gccagcacca ccaacccagc aggactcacg
600cattatctat gattatggaa cagataattt tgaagaatcc atatttagcc aagattatga
660ggataaatac ctggatggaa aaaatattaa ggaaaaagaa actgtgataa tacccaatga
720gaaaagtctt caattacaaa aagatgaggc aataacacca ttacctccca agaaagaaaa
780tgatgaaatg cccacgtgtc tgctgtgtgt ttgtttaagt ggctctgtat actgtgaaga
840agttgacatt gatgctgtac cacccttacc aaaggaatca gcctatcttt acgcacgatt
900caacaaaatt aaaaagctga ctgccaaaga ttttgcagac atacctaact taagaagact
960cgattttaca ggaaatttga tagaagatat agaagatggt actttttcaa aactttctct
1020gttagaagaa ctttcacttg ctgaaaatca actactaaaa cttccagttc ttcctcccaa
1080gctcacttta tttaatgcaa aatacaacaa aatcaagagt aggggaatca aagcaaatgc
1140attcaaaaaa ctgaataacc tcaccttcct ctacttggac cataatgccc tggaatccgt
1200gcctcttaat ttaccagaaa gtctacgtgt aattcatctt cagttcaaca acatagcttc
1260aattacagat gacacattct gcaaggctaa tgacaccagt tacatccggg accgcattga
1320agagatacgc ctggagggca atccaatcgt cctgggaaag catccaaaca gttttatttg
1380cttaaaaaga ttaccgatag ggtcatactt ttaacctcta ttggtacaac atataaatga
1440aagtacacct acactaatag tctgtctcaa caatgagtaa aggaacttaa gtattggttt
1500aatattaacc ttgtatctca ttttgaagga atttaatatt ttaagcaagg atgttcaaaa
1560tcttacatat aataagtaaa aagtaagact gaatgtctac gttcgaaaca aagtaatatg
1620aaaatattta aacagcatta caaaatccta gtttatacta gactaccatt taaaaatcat
1680gtttttatat aaatgcccaa atttgagatg cattattcct attactaatg atgtaagtac
1740gaggataaat ccaagaaact ttcaactctt tgcctttcct ggcctttact ggatcccaaa
1800agcatttaag gtacatgttc caaaaacttt gaaaagctaa atgtttccca tgatcgctca
1860ttcttctttt atgattcata cgttattcct tataaagtaa gaactttgtt ttcctcctat
1920caaggcagct attttattaa atttttcact tagtctgaga aatagcagat agtctcatat
1980ttaggaaaac tttccaaata aaataaatgt tattctctga taaagagcta atacagaaat
2040gttcaagtta ttttactttc tggtaatgtc ttcagtaaaa tattttcttt atctaaatat
2100taacattcta agtctaccaa aaaaagtttt aaactcaagc aggccaaaac caatatgctt
2160ataagaaata atgaaaagtt catccatttc tgataaagtt ctctatggca aagtctttca
2220aatacgagat aactgcaaaa tattttcctt ttatactaca gaaatgagaa tctcatcaat
2280aaattagttc aagcataaga tgaaaacaga atattctgtg gtgccagtgc acactacctt
2340cccacccata cacatccatg ttcactgtaa caaactgaat attcacaata aagcttctga
2400gtaacacttt ctgattactc atgataaact gacatggcta actgcaagaa ttaaatcttc
2460tatctgagag taataattta tgatgactca gtggtgccag agtaaagttt ctaaaataac
2520attcctctca cttgtacccc actaaaagta ttagactaca cattacattg aagttaaaca
2580caaaattatc agtgttttag aaacatgagt ccggactgtg taagtaaaag tacaaacatt
2640atttccacca taaagtatgt attgaaatca agttgtctct gtgtacagaa tacatactta
2700ttcccatttt taagcatttg cttctgtttt ccctacctag aatgtcagat gtttttcagt
2760tatctcccca tttgtcaaag ttgacctcaa gataacattt ttcattaaag catctgagat
2820ctaagaacac aattattatt ctaacaatga ttattagctc attcacttat tttgataact
2880aatgatcaca gctattatac tactttctcg ttattttgtg tgcatgcctc atttccctga
2940cttaaacctc actgagagcg caaaatgcag ctttatactt tttactttca attgcctagc
3000acaatagtga gtacatttga attgaatata taataaatat tgcaaaataa aatccatcta
3060aatagaaa
3068851020DNAHomo sapiens 85ggccttccaa agtgctggga ttacaggcgt gagtcaccgc
gcccggccaa ataaaataaa 60atgttaaagc aaattcagga ctacccctcc tccaagtctt
ctgttccctt tgggcgccca 120ggtgagcggg ggaggggctg ggggagtaat aacatcaaaa
gagcgccttt tcctccctta 180ttccgaggag acttccctgg gcctgactcc cggtcctgtc
cccagcgccc cgcggcctct 240ggagcccctt cagtgaccaa gatacagaga tcaggacgcc
tttgcgccgc cccaggtgcc 300cgcccctagc tggctctgct tgggccgcga gggaaggtga
ggtcgggggc ggagccgggg 360cgtgacagcc ggggtgtgtg tccgccgggc ttggtgcctc
cggtggccct gcagcaccgt 420cccacctctg ccaccctccg atggggccgc tacctgtgtg
cctgccaatc atgctgctcc 480tgctactgcc gtcgctgctg ctgctgctgc ttctacctgg
ccccgggtcc ggcgaggcct 540ccaggatatt acgtgtgcac cggcgtggga tcctggaact
ggcaggaact gtgggttgtg 600ttggtccccg aacccccatc gcctatatga aatatggttg
cttttgtggc ttgggaggcc 660atggccagcc ccgcgatgcc attgactggt gctgccatgg
ccacgactgt tgttacactc 720gagctgagga ggccggctgc agccccaaga cagagcgcta
ctcctggcag tgcgtcaatc 780agagcgtcct gtgcggaccg gcagagaaca aatgccaaga
actgttgtgc aagtgtgacc 840aggagattgc taactgctta gcccaaactg agtacaactt
aaagtacctc ttctaccccc 900agttcctatg tgagccggac tcgcccaagt gtgactgact
accttgactt gaaatgctct 960tttgcacaag gaaataaagc gtcctctcag taatgaaaaa
aaaaaaaaaa aaaaaaaaaa 1020867434DNAHomo sapiens 86gcctgggaaa gatgctggat
cctgcagtaa ccacaacagc atcctctccc tgcgccaggg 60acctgccagc cggagagatg
actgattaga tcagattaga tccggagccc cgctctgcag 120aagggggccc caggggcggg
ggaggaggac cccagctggc ctgagctggg gggaggggtg 180ccttggggct cgcagagtta
gagctttcca gcgcggggat cacacctcag aagccgccac 240aatgaaagac ggaacacatt
tctacaccca gtgactggcc aggtcccaga ggaaaacaaa 300aaatttgact tgaaaatatc
gaccttggac atgtccaata aaacaggtgg gaaacgcccg 360gctaccacca acagtgacat
acccaaccac aacatggtgt ccgaggtccc tccagagcgg 420cccagcgtcc gggcaactcg
cacagcccgc aaagccgtcg cctttggcaa gcgctcacac 480tccatgaagc ggaaccccaa
tgcacctgtc accaaggcgg gctggctctt caaacaggcc 540agctccgggg ttaagcagtg
gaacaagcgc tggttcgtcc tggtggatcg ctgcctcttc 600tactataaag atgagaagga
agagagtatc ctgggcagca tccccctcct gagcttccgg 660gtagccgcag tgcagccctc
agacaacatc agccggaaac acacgtttaa ggctgagcat 720gccggggtcc gcacctactt
cttcagtgcc gagagccccg aggagcaaga ggcctggatc 780caggccatgg gggaggctgc
tcgagtacag atccctccag cccagaagtc agtgccccaa 840gctgtgcggc acagccatga
gaagccagac tcggagaacg tcccacccag caagcaccac 900cagcagccac cccacaacag
cctccctaag cctgagccag aggccaagac tcgaggggag 960ggtgatggcc gaggctgtga
gaaggcagag agaaggcctg agaggccaga agtcaagaaa 1020gagcctccgg tgaaagccaa
tggcctccca gctggaccgg agccagcctc agagccgggc 1080agcccttacc ccgagggccc
aagagtgcca gggggtgggg aacagcctgc ccagcccaat 1140ggctggcagt accactcccc
aagccggcca gggagcacag ctttcccgtc tcaggatgga 1200gagactgggg gacaccggcg
gagtttccca ccacgcacca accctgacaa aattgcccag 1260cgcaagagct ccatgaacca
gcttcagcag tgggtgaatc tgcgccgggg ggtacccccg 1320cctgaagacc ttcggagtcc
ctctaggttc tatcctgtgt ctcgcagggt ccctgagtac 1380tatggcccct actcctccca
gtaccccgat gattatcagt actacccgcc aggagtgcgg 1440ccggagagca tctgttccat
gccggcctat gatcggatca gcccgccctg ggccctggag 1500gacaagcgcc atgccttccg
caatgggggt ggccctgcct accagctgcg agagtggaag 1560gagcccgcca gctacgggcg
gcaggatgcc accgtctgga tcccaagccc ctcccggcag 1620ccagtctatt atgatgagct
ggatgccgcc tctagctccc tgcgccgcct gtccctgcag 1680ccccgctccc actctgtgcc
ccgctcaccc agccagggct cctacagccg tgcccgcatt 1740tactcccctg tccgctcacc
cagtgcccgt tttgagcggc tgccacctcg cagtgaggac 1800atctatgctg accctgctgc
ctatgtgatg aggcgatcca tcagctcccc caaggtccct 1860ccatacccag aagtgttccg
ggacagcctc cacacctaca agttaaacga gcaagacaca 1920gataagctgc tgggaaaatt
gtgtgagcag aacaaggtgg tgagggagca ggaccggctg 1980gtgcagcagc tccgagctga
gaaggagagc ctggaaagtg ccttgatggg gacccaccag 2040gagctggaga tgtttggaag
ccagcccgcc tacccagaaa agctgcgaca caaaaaggat 2100tcactgcaga accagctcat
caacatccgc gtggagctgt ctcaggcgac cacggccctg 2160acaaacagca ccatagagta
tgagcacctc gagtctgagg tctctgccct gcacgatgac 2220ctctgggagc agctcaattt
ggacacccag aatgaggtgc tgaaccggca aatccaaaag 2280gagatctgga ggatccagga
cgtgatggag gggctgagga agaacaaccc ctcccggggc 2340acggacaccg ccaagcacag
aggaggactt ggcccctcag ccacctacag ctccaacagc 2400ccggccagcc ccctcagctc
tgccagcctc accagccccc tgagcccctt ttcactggtg 2460tcgggctctc aggggtcccc
caccaagcct ggctccaacg agcccaaggc aaactatgaa 2520caaagcaaga aagaccccca
ccagacattg cccctggaca cccccagaga catcagcctt 2580gtgcccacca ggcaagaggt
agaggcagag aagcaggcag ctctcaacaa agttggcgtt 2640gtgccccctc ggacaaaatc
gcccactgat gatgaggtga ccccatcagc agtggtaaga 2700aggaatgcca gtgggctcac
caatggactc tcctcccagg aacgccccaa gagtgctgtg 2760tttcctggcg aggggaaggt
caagatgagc gtggaggagc agattgaccg aatgcggcgg 2820caccagagtg gctccatgag
ggagaagcgg aggagcctgc agctcccggc cagcccggcc 2880cccgacccca gtccccggcc
agcctacaaa gtggtgcgcc gccaccgcag catccatgag 2940gtagacatct ccaacctgga
ggcagccctg cgggcagagg agcctggcgg gcatgcctac 3000gagacacccc gggaggaaat
tgcccggctt cgcaaaatgg agctagagcc ccagcattat 3060gacgtggaca tcaataagga
gctctccact ccagacaaag tcctcatccc tgaacggtac 3120attgacctgg agcctgacac
tcccctgagc cctgaggagt tgaaggagaa gcagaagaag 3180gtggagagga tcaagacact
cattgccaaa tccagtatgc agaacgtggt gcccatcggc 3240gagggggact ctgtggacgt
gccccaggac tcagagagcc agctgcagga gcaggagaag 3300cggattgaaa tctcctgcgc
cctggcgacc gaggcctccc gcaggggccg catgctgtct 3360gtgcaatgtg ccaccccaag
ccctcccacc tcccctgctt ccccggctcc tccagcaaac 3420cccctgtcgt ctgaatcccc
acggggcgcc gacagcagct ataccatgcg ggtctgagct 3480ctgactgcaa gccctggctg
aggccaatgc tgtgaagctc cacagagcca cattctgaag 3540ccgtcctctg cccacctgag
gtcctggctc cccaccctgg ccccctgccc ctgcactccc 3600atgggaatgc cgcagggagc
caggctgggg ccatgggctg ctgccagagg accgtggata 3660cctcagtgtc cacacaccca
ccatgcccag ccctggagcc atcactactc acaccgtggt 3720cctgggccag ggcctgagat
gacagtgggg agcaccatcc tcattaatgt ccaagtcaca 3780gggagcctca gccttgccct
ggctggggtt gtggtgactc cagtggaaca ttccctgatg 3840ggggacatgc cgtggtggag
aacacacctg tggctatctt atgtgaggac tagaggtgaa 3900gaggagatgg acactgcctc
tggagccagc ctgacaccaa ggacagcact tgtcatcatc 3960cctatcctcg tcagccccac
cctactgcct cagctggacc cagggctttg acacaaaccc 4020agtgctttgc ttatgggtgc
tcgctggggt ccggtggaga ctgaccaccc tgcttgagcc 4080aaagacaagg tgatgagaga
tggggagagg ccattggctc ccagagggaa cagtgctggc 4140tgtggctaga gaacagcagg
tctgtgcagt gtctgagggc aggttgggaa gggtagcaga 4200gagagagaga cagaaagaga
gagagagaga gagagagaga gagatcctca gagtggaagg 4260agggggaagc agcaggacac
attggcaagt caagcaggaa ggagggagat ggaaagggga 4320tatcagattg gtttcccccg
gtggagcctt aggttagtgc ccagtgcagt gccagactgt 4380ctcctctgct cctcccacct
catccctagg aggacccacc agtggagcac atgcagcctc 4440agtggagatg cttggtgtgg
ggatctgggt gaagggggtt gagtagcgac tgcctgggag 4500atggctgtta gtaggtctgt
gcctggtgtc tgcctcgcca tcctggggta aggggcagag 4560agaaggactt gtcttatgta
gggtgtggtc agccttgggg ccttacctac ccagttccat 4620gatatttctt gccctgttcc
ccctggaatg tgcagtgggc cagctgagag tacgccttga 4680ggagggggga tgaggcctta
atctgggagg cctatccccc tatcccaggc atcccagacg 4740aggactggct gaggctaggc
gctctcatga tccacctgcc ccgggagggc agcggggaag 4800acagagaaaa gcaaacgcat
tcctcctcag ctccacccac ctggagacga atgtagccag 4860agaggaggaa ggagggaaac
tgaagacacc gtggcccctc ggccttctct ctgctagagt 4920tgccgctcag aggcttcagc
ctgacttcca gcggtcccaa gaacacctac taattcctct 4980gcactccttc atggctggga
cagttactgg ttcatatgca agtaaagatg acaatttact 5040caacaaatat ttatcgagca
ccttttatgt accaggcact gttgtaggtg cttaggatat 5100tctcatgttt ctgagggatt
acagcctggg aggattccac cgatcttcac ttctagcagg 5160ttttttaaac gtgacccttg
gctgtatttc ccatcttcac agttcaagca ccccaaacct 5220gccctttctc ccctgcagac
tggcaggtgg gattggctcc caggtcattt cctctctctc 5280tctttctccc aagcctttct
ccctccacag gaaacagatt tttcagggcc ttccatgcct 5340gccactttgt ccgtctcttt
tttttttttt aaagtaatat tttttagaaa tacatgtaaa 5400ataccaagaa ataatgtctg
cgcctctgcc acctctctct catctcttat ttcataaagc 5460tggctcttta tgttgcttta
catgccttaa tatatgtttt atggagatta tacatattat 5520agatctatat atgtaatata
tttgtttgga tgtctgatct tttcctacag ttcctgccca 5580ggcctgtttt tctctccctt
attttccaca tcattcctag catatcaaga ccatacccct 5640tgcctttttg atccaaagaa
ggggagagga aagacttatt ttaagacatc tattttatgc 5700ttttgtttaa aaaaagcaaa
tctattttca aattgtgcaa tgtctgaatg gaattggcaa 5760atgaggtgag gagatggggt
aactgccttt aggtcccaac cctgtgcctg tccctctgcc 5820acagtggctt ccttccatcc
tattccccag tcacgaaagg gctggtcccc agagctcacc 5880cagaaagcat cttgccgggg
gcagggcatt ccctcggaag tcctggccta aggattttat 5940tacactgtct gctcctgatg
accctctgcg gttgtcaggc gtgaatcctc tggaaggaac 6000tgtttgcagt tgtatgcaca
aaggagaccc atgcaggcat gcaccccaga tgggctgcta 6060gactacccct cttcatcagc
ctgctggaaa gagcagacag cgtaacagca gcctatgcca 6120aacagcacta aggatgtcag
tgtacgagac ggtgcgaagc ccagaggagg cggcttggga 6180tcctctgggc catccctggc
cctgtagcac agacacatac aagttccttg gacttgtcac 6240actccttttc acttgatccc
tctctggcct gggtcacatt tatgacatcc cccctctgca 6300acctttctct ctgggggctg
taccaaaggc atggtacctc agtctggtat ccagactgtg 6360aaatcagcct ctgggccaag
gagggtccta cagacaaggt ggcatctcaa cctcacccaa 6420gaaacccccg ggactttacc
ccagcaggct gccactgggg caccgaacta agccagctca 6480gccttccaag ctgcttagcc
tttgtcaacc atcaaagtgg ggaaagggtc tctggccttg 6540caaggatttg ggcaggatgg
aagtctttgt ctaggtctga aatcaagggc ttgggaatct 6600agagcagagc tcctggccag
cgtttgaagc ctggcctccc acctggggca tttactgaca 6660ccctgtcgcc agcctagaac
atcctgctct gaatgaacag agcccagatg ggggtccaga 6720gctaagtgac ccacctgggg
aagctctgat tattcatcca gggccttctc aatgagcaca 6780ttacctccat gatcaataac
caagaggctg cagtcaaata ctgctactgt cactttgaag 6840attttctaag ggcacaagtg
gcacacttgc tctccggcct ttgggagagg catgttcttg 6900ggcagggctg gctacggcaa
gagggaacca attcctttcc cctttaggca ccctctacct 6960ggcagctctt aaaactaaga
aaaaatatga tatacacaca cattatatat atatatatat 7020atacacacac acacacacac
acacgtatat ctatttatgc acatattggg gccaatttga 7080tttgctagta ttagacaata
aactgtacaa ggaaatttat tatctcagtg tctttattta 7140gcataaaagt ggccttagaa
gaagggtgaa ggttagatag atggaatctc tgtcagggga 7200caaacagctg tagcatcccc
ttagcatctt tctagccaat gagccccctc cccaggagga 7260gggatggcct gactgaaaca
ggtgagagtt tgcttccccc tttcagcagc attcttttta 7320gatttaggta ctgaagaatc
ccagtgagtt ctgcatttgt atcttgcaag tttgtataaa 7380cccaatacag gaaataaaat
gaatggtttg tataaaaaaa aaaaaaaaaa aaaa 7434873038DNAHomo sapiens
87tcaatcagaa agcccttttc attgcaggag aagaggacaa agatactcag agagaaaaag
60taaaagaccg aagaaggagg ctggagagac caggatcctt ccagctgaac aaagtcagcc
120acaaagcaga ctagccagcc ggctacaatt ggagtcagag tcccaaagac atgggcttgt
180tagagtgctg tgcaagatgt ctggtagggg ccccctttgc ttccctggtg gccactggat
240tgtgtttctt tggggtggca ctgttctgtg gctgtggaca tgaagccctc actggcacag
300aaaagctaat tgagacctat ttctccaaaa actaccaaga ctatgagtat ctcatcaatg
360tgatccatgc cttccagtat gtcatctatg gaactgcctc tttcttcttc ctttatgggg
420ccctcctgct ggctgagggc ttctacacca ccggcgcagt caggcagatc tttggcgact
480acaagaccac catctgcggc aagggcctga gcgcaacggt aacagggggc cagaagggga
540ggggttccag aggccaacat caagctcatt ctttggagcg ggtgtgtcat tgtttgggaa
600aatggctagg acatcccgac aagtttgtgg gcatcaccta tgccctgacc gttgtgtggc
660tcctggtgtt tgcctgctct gctgtgcctg tgtacattta cttcaacacc tggaccacct
720gccagtctat tgccttcccc agcaagacct ctgccagtat aggcagtctc tgtgctgatg
780ccagaatgta tggtgttctc ccatggaatg ctttccctgg caaggtttgt ggctccaacc
840ttctgtccat ctgcaaaaca gctgagttcc aaatgacctt ccacctgttt attgctgcat
900ttgtgggggc tgcagctaca ctggtttccc tgctcacctt catgattgct gccacttaca
960actttgccgt ccttaaactc atgggccgag gcaccaagtt ctgatccccc gtagaaatcc
1020ccctttctct aatagcgagg ctctaaccac acagcctaca atgctgcgtc tcccatctta
1080actctttgcc tttgccacca actggccctc ttcttacttg atgagtgtaa caagaaagga
1140gagtcttgca gtgattaagg tctctctttg gactctcccc tcttatgtac ctcttttagt
1200cattttgctt catagctggt tcctgctaga aatgggaaat gcctaagaag atgacttccc
1260aactgcaagt cacaaaggaa tggaggctct aattgaattt tcaagcatct cctgaggatc
1320agaaagtaat ttcttctcaa agggtacttc cactgatgga aacaaagtgg aaggaaagat
1380gctcaggtac agagaaggaa tgtctttggt cctcttgcca tctatagggg ccaaatatat
1440tctctttggt gtacaaaatg gaattcattc tggtctctct attaccactg aagatagaag
1500aaaaaagaat gtcagaaaaa caataagagc gtttgcccaa atctgcctat tgcagctggg
1560agaagggggt caaagcaagg atctttcacc cacagaaaga gagcactgac cccgatggcg
1620atggactact gaagccctaa ctcagccaac cttacttaca gcataaggga gcgtagaatc
1680tgtgtagacg aagggggcat ctggccttac acctcgttag ggaagagaaa cagggtgttg
1740tcagcatctt ctcactccct tctccttgat aacagctacc atgacaaccc tgtggtttcc
1800aaggagctga gaatagaagg aaactagctt acatgagaac agactggcct gaggagcagc
1860agttgctggt ggctaatggt gtaacctgag atggccctct ggtagacaca ggatagataa
1920ctctttggat agcatgtctt tttttctgtt aattagttgt gtactctggc ctctgtcata
1980tcttcacaat ggtgctcatt tcatgggggt attatccatt cagtcatcgt aggtgatttg
2040aaggtcttga tttgttttag aatgatgcac atttcatgta ttccagtttg tttattactt
2100atttggggtt gcatcagaaa tgtctggaga ataattcttt gattatgact gttttttaaa
2160ctaggaaaat tggacattaa gcatcacaaa tgatattaaa aattggctag ttgaatctat
2220tgggattttc tacaagtatt ctgcctttgc agaaacagat ttggtgaatt tgaatctcaa
2280tttgagtaat ctgatcgttc tttctagcta atggaaaatg attttactta gcaatgttat
2340cttggtgtgt taagagttag gtttaacata aaggttattt tctcctgata tagatcacat
2400aacagaatgc accagtcatc agctattcag ttggtaagct tccaggaaaa aggacaggca
2460gaaagagttt gagacctgaa tagctcccag atttcagtct tttcctgttt ttgttaactt
2520tgggttaaaa aaaaaaaaag tctgattggt tttaattgaa ggaaagattt gtactacagt
2580tcttttgttg taaagagttg tgttgttctt ttcccccaaa gtggtttcag caatatttaa
2640ggagatgtaa gagctttaca aaaagacact tgatacttgt tttcaaacca gtatacaaga
2700taagcttcca ggctgcatag aaggaggaga gggaaaatgt tttgtaagaa accaatcaag
2760ataaaggaca gtgaagtaat ccgtaccttg tgttttgttt tgatttaata acataacaaa
2820taaccaaccc ttccctgaaa acctcacatg catacataca catatataca cacacaaaga
2880gagttaatca actgaaagtg tttccttcat ttctgatata gaattgcaat tttaacacac
2940ataaaggata aacttttaga aacttatctt acaaagtgta ttttataaaa ttaaagaaaa
3000taaaattaag aatgttctca atcaaaaaaa aaaaaaaa
3038883720DNAHomo sapiens 88atgcgcagtg gcgcgagcgc agcggctacg cgggcgcgga
gaggtagccg cagagtggac 60ctgcaggtac ttggatctcc agtgggagct gccctctcga
aggcaggaca gcggtggcgg 120cagatataaa gacctgaaga tagtcttttc tgtccaaaga
tggaaaacag tactactacc 180atttctcggg aggagcttga agaactacaa gaggcattta
ataaaataga tattgacaat 240agtgggtatg tcagtgacta tgaacttcaa gacctgttta
aggaagcaag ccttcctctg 300cctggctaca aggtgcgcga gattgtggag aaaattctat
cagttgctga cagcaacaaa 360gatggcaaaa tcagttttga agagtttgtg tcactaatgc
aagaattaaa aagcaaagat 420atcagcaaaa cattccgaaa aataattaac aagagggaag
ggattactgc tattggagga 480acttcaacta tttccagtga gggcacacag cattcttatt
cagaggaaga aaaagtggct 540tttgttaact ggataaacaa agccctggag aatgaccctg
actgtaagca tcttataccc 600atgaatccca atgatgatag tcttttcaag tcacttgcag
atggcatcct tctttgcaaa 660atgatcaact tatctgaacc agatacaatt gatgaaagag
ccatcaataa gaaaaagctc 720acgccattca ctatttctga aaatttaaac ctagctctga
attctgcctc agccattggt 780tgtacagtgg tcaacattgg tgcatcagat ctcaaagaag
gaaaacctca cttggtcttg 840ggacttctct ggcagatcat caaagttggc ctttttgctg
atattgagat ttccaggaat 900gaagctctga ttgcattgtt aaatgaaggt gaggaactag
aggagctgat gaagctttct 960cccgaggaat tactgctgcg atgggtgaac taccatctga
ccaatgcagg atggcatacc 1020atcagcaact tcagccaaga cattaaggac tcgagagcct
attttcatct gcttaatcag 1080attgccccta aaggtgggga agatggacct gccattgcca
ttgacctttc aggaattaat 1140gagacaaatg acctgaagcg tgctggactc atgcttcaag
aagcagataa actgggctgc 1200aaacagtttg ttactcctgc agatgtggtt tcaggcaatc
ctaaacttaa tttagctttt 1260gtagctaatt tgtttaacac atacccgtgc ctgcacaagc
cgaataataa tgacatcgat 1320atgaatttac tggaaggaga gagcaaggaa gagagaacat
ttcggaactg gatgaattcc 1380ttgggagtca acccatacat taatcatttg tacagtgacc
ttgcagatgc tttagtgatc 1440tttcagctct atgagatgat ccgagtgcca gtcaactgga
gccatgtcaa caaacctcct 1500tatcctgccc ttggagggaa catgaagaag attgaaaact
gtaactatgc agtggaactt 1560gggaagaaca aggccaaatt ctccttggtt ggcattgctg
ggcaggacct aaatgaaggg 1620aattcaacac ttaccctggc attggtatgg cagctgatga
gaaggtacac attgaatgtg 1680ttatcggatc ttggagaggg tgaaaaagta aatgatgaaa
ttataattaa atgggtcaat 1740cagactctta aaagtgcaaa caaaaagact tctatttcca
gcttcaagga taaatctata 1800agcacaagtt tacctgtcct agatttaata gatgccattg
caccaaatgc agttcgtcaa 1860gaaatgatca ggagagaaaa cttatctgat gaggacaagc
tgaacaatgc taaatacgcc 1920atttcagttg ctcgaaagat cggtgcccgg atatatgcat
tacctgatga cctcgtagaa 1980gtgaaaccaa agatggttat gacggtgttt gcatgcttaa
tgggaaaagg actgaacaga 2040ataaaataat catttcatat gattttctgc cacattaaac
atattgtatg cctcacagtt 2100tacaggattc tgaaatgtag tgggtgtaaa accagagatt
atttgtatgc tcaaaatagt 2160tatatattca ttaatgaatt caatatcctg ttcatactag
ttagagctgg tcagcctttt 2220tgggtaacac agttaattta ccaactgata cagataatag
aatatattca taatcaagct 2280gatacttcat gattaaatta tttttgttgc ttaaaagtcg
tattagacaa gactaaatca 2340ttctttttta tggttcaaaa aagatgaata caaacgtttt
tgcaggttct gctgtgaaat 2400gtggtttgat ttttttggtg tgttaatttt gatcataaat
gcattcatac tcataatcca 2460gtttaatcct tttatttgct tcctccaact atttaaagtg
gtccaaaaac acttttctgt 2520aagtttctat actgtctaaa accttatggt gaccagaatt
gtttattaat atcaaacttt 2580tttatatatg agaactaatt cttgaataaa ccccaaagtt
cactctcttg tttaagtagc 2640agcagctttt tacttaaaat ttaattttaa ctacattgat
actttacaca tcctagtttg 2700gtaacacagc tttaactatg tcatgcaaca tatatatgtt
ggtaggatgt tattagagag 2760atatgtgtgc atatatattt ttttgcacct gaatcaccca
gcttttcata agtggtatgt 2820ttaattggtc attcagccaa ccatcagtat tttcccccca
caacatgtgt aacacttttc 2880agtctgtgga tatctgatac attaagattt ctttttataa
gtattcattt tgaatgtgca 2940tatagttatt tgaccccttc caaatacttg tagccaaaca
ttggctagaa catcccaaga 3000tatgctgaca ctgtcctgtt agcttcatat tatacttgct
agtttaggtc tctatagaag 3060ccctatataa tttagaatat gcccactgaa tatctttaat
agaaagtaac ataaagctag 3120tattcaatgt agagtatttt catatgtttt tcacagcccg
ttacaaattg gcaatgtttg 3180gttaatgttt gtattacttg gaaatcgcta cagcttggac
tatttttttc taaattttta 3240gcattagtcc atttctgctg ctaacaattg aatccagaaa
tctactttct ccatcttcca 3300ctgttagtgc cagtgagcaa tactgttgtg caacaaaaat
gtcactttat ctcagtgtga 3360atgagtagtc taaattccct ttctaccatt gatttaaata
tatatattgg taagagagac 3420tgcccatgtg tttagaatag aattttttaa atgaaatgat
caacaggtgg aatttgaaat 3480atattcttct acaaaagaga tttctttccc ttttatattt
tgatgattgt tttcttaaga 3540ttaagatatg ttcttgctct tttataagat tatttaaatt
atgtttccct ctgatttttt 3600ttcaccattg tatttactaa gttattggat ttacatgaaa
tctggcactt tagggtgttc 3660tttttctcac agagtatatt taataaaaat gctgtgtata
tagaaaaaaa aaaaaaaaaa 3720893390DNAHomo sapiens 89agactctcag gttgatgcag
tgttccctcc cacaactctg acatgtatat aaattctgag 60ctctccaaag cccactgcca
gttctcttcg gggactaact gcaacggaga gactcaagat 120gattcccttt ttacccatgt
tttctctact attgctgctt attgttaacc ctataaacgc 180caacaatcat tatgacaaga
tcttggctca tagtcgtatc aggggtcggg accaaggccc 240aaatgtctgt gcccttcaac
agattttggg caccaaaaag aaatacttca gcacttgtaa 300gaactggtat aaaaagtcca
tctgtggaca gaaaacgact gtgttatatg aatgttgccc 360tggttatatg agaatggaag
gaatgaaagg ctgcccagca gttttgccca ttgaccatgt 420ttatggcact ctgggcatcg
tgggagccac cacaacgcag cgctattctg acgcctcaaa 480actgagggag gagatcgagg
gaaagggatc cttcacttac tttgcaccga gtaatgaggc 540ttgggacaac ttggattctg
atatccgtag aggtttggag agcaacgtga atgttgaatt 600actgaatgct ttacatagtc
acatgattaa taagagaatg ttgaccaagg acttaaaaaa 660tggcatgatt attccttcaa
tgtataacaa tttggggctt ttcattaacc attatcctaa 720tggggttgtc actgttaatt
gtgctcgaat catccatggg aaccagattg caacaaatgg 780tgttgtccat gtcattgacc
gtgtgcttac acaaattggt acctcaattc aagacttcat 840tgaagcagaa gatgaccttt
catcttttag agcagctgcc atcacatcgg acatattgga 900ggcccttgga agagacggtc
acttcacact ctttgctccc accaatgagg cttttgagaa 960acttccacga ggtgtcctag
aaaggatcat gggagacaaa gtggcttccg aagctcttat 1020gaagtaccac atcttaaata
ctctccagtg ttctgagtct attatgggag gagcagtctt 1080tgagacgctg gaaggaaata
caattgagat aggatgtgac ggtgacagta taacagtaaa 1140tggaatcaaa atggtgaaca
aaaaggatat tgtgacaaat aatggtgtga tccatttgat 1200tgatcaggtc ctaattcctg
attctgccaa acaagttatt gagctggctg gaaaacagca 1260aaccaccttc acggatcttg
tggcccaatt aggcttggca tctgctctga ggccagatgg 1320agaatacact ttgctggcac
ctgtgaataa tgcattttct gatgatactc tcagcatgga 1380tcagcgcctc cttaaattaa
ttctgcagaa tcacatattg aaagtaaaag ttggccttaa 1440tgagctttac aacgggcaaa
tactggaaac catcggaggc aaacagctca gagtcttcgt 1500atatcgtaca gctgtctgca
ttgaaaattc atgcatggag aaagggagta agcaagggag 1560aaacggtgcg attcacatat
tccgcgagat catcaagcca gcagagaaat ccctccatga 1620aaagttaaaa caagataagc
gctttagcac cttcctcagc ctacttgaag ctgcagactt 1680gaaagagctc ctgacacaac
ctggagactg gacattattt gtgccaacca atgatgcttt 1740taagggaatg actagtgaag
aaaaagaaat tctgatacgg gacaaaaatg ctcttcaaaa 1800catcattctt tatcacctga
caccaggagt tttcattgga aaaggatttg aacctggtgt 1860tactaacatt ttaaagacca
cacaaggaag caaaatcttt ctgaaagaag taaatgatac 1920acttctggtg aatgaattga
aatcaaaaga atctgacatc atgacaacaa atggtgtaat 1980tcatgttgta gataaactcc
tctatccagc agacacacct gttggaaatg atcaactgct 2040ggaaatactt aataaattaa
tcaaatacat ccaaattaag tttgttcgtg gtagcacctt 2100caaagaaatc cccgtgactg
tctatacaac taaaattata accaaagttg tggaaccaaa 2160aattaaagtg attgaaggca
gtcttcagcc tattatcaaa actgaaggac ccacactaac 2220aaaagtcaaa attgaaggtg
aacctgaatt cagactgatt aaagaaggtg aaacaataac 2280tgaagtgatc catggagagc
caattattaa aaaatacacc aaaatcattg atggagtgcc 2340tgtggaaata actgaaaaag
agacacgaga agaacgaatc attacaggtc ctgaaataaa 2400atacactagg atttctactg
gaggtggaga aacagaagaa actctgaaga aattgttaca 2460agaagaggtc accaaggtca
ccaaattcat tgaaggtggt gatggtcatt tatttgaaga 2520tgaagaaatt aaaagactgc
ttcagggaga cacacccgtg aggaagttgc aagccaacaa 2580aaaagttcaa ggatctagaa
gacgattaag ggaaggtcgt tctcagtgaa aatccaaaaa 2640ccagaaaaaa atgtttatac
aaccctaagt caataacctg accttagaaa attgtgagag 2700ccaagttgac ttcaggaact
gaaacatcag cacaaagaag caatcatcaa ataattctga 2760acacaaattt aatatttttt
tttctgaatg agaaacatga gggaaattgt ggagttagcc 2820tcctgtggta aaggaattga
agaaaatata acaccttaca ccctttttca tcttgacatt 2880aaaagttctg gctaactttg
gaatccatta gagaaaaatc cttgtcacca gattcattac 2940aattcaaatc gaagagttgt
gaactgttat cccattgaaa agaccgagcc ttgtatgtat 3000gttatggata cataaaatgc
acgcaagcca ttatctctcc atgggaagct aagttataaa 3060aataggtgct tggtgtacaa
aactttttat atcaaaaggc tttgcacatt tctatatgag 3120tgggtttact ggtaaattat
gttatttttt acaactaatt ttgtactctc agaatgtttg 3180tcatatgctt cttgcaatgc
atatttttta atctcaaacg tttcaataaa accatttttc 3240agatataaag agaattactt
caaattgagt aattcagaaa aactcaagat ttaagttaaa 3300aagtggtttg gacttgggaa
caggacttta tacctctttt actgtaacaa gtactcatta 3360aaggaaattg aatgaaatta
aaaaaaaaaa 3390902211DNAHomo sapiens
90cgggcggctg ggcgcgcgcg gcgcagagca ggtgccgggg agcccttcgc atgcggctgc
60cgggccggag gtggtagcgg cgccgggcgc gctccgcccg cccctcctcc gggccgcact
120gaggctcggg cgcgcgggga catgtcggtg gcgacgggca gcagcgagac ggccggcggg
180gccagcggcg gcggcgcacg ggttttcttc caaagccccc ggggtggcgc cggtggcagc
240cccggctcca gcagcggctc aggctcctcc cgggaggact cggcgcccgt ggccacggcg
300gccgctgcag ggcaggttca gcagcaacag cagcggcgac accagcaggg aaaagtgaca
360gtgaaatacg atcgtaagga gcttcggaag cggctggtgc tggaggaatg gatcgtggag
420cagctgggtc agctctacgg ctgcgaggaa gaagaaatgc cagaggtaga aattgacatt
480gatgatcttc ttgatgcaga cagtgatgaa gagagagctt caaaattaca ggaagctctt
540gtagactgct acaaaccaac agaggaattt atcaaagagc tgctttctcg gataagaggc
600atgaggaaac tgagccctcc gcagaagaag agtgtatgat tctggaacag ggtgaaactc
660tcccagagac gaagaaagag tcctgggatt tgtacttcat gaagactttt gtgaaagaat
720aggtgtcctt atgaacaacg tttttgtttt tttttttttc ttttttggtg tgaaggtggg
780ggggtctatt agacatttat tcaagagcgt tctttttttg gttttaaagg tttttgttaa
840tgtaatattt taatagcaaa gatatcatga ctctagccac agcctaacca aggattatca
900aaggaggtgg acactcaagg aagggccacg ccaggctgcg tttcctgcaa ggactcagat
960gttcagtacc ttatgataca gggaagatag ttttcttaca agtagtttgg taatattttt
1020tttcttaagt tgtacatttg actcagctgt caaatttctc acacttgtat atatctacac
1080acaactaagt taaaatgttg atgtgagttt tatttcacat ggatggaata aacttgtggt
1140tgtcctttaa ctggaggtcc cagcacatgt gttttcaaga ggccactagg cattcttcac
1200tgagtgctgc tgacttcaac gttcacttta tgcaccaaag tgaaagaatt cagtgtatcc
1260gttattttaa tgcactacac cacagaaatg ttaagttggt caagggctta atttatggaa
1320tttctattat ttcattggtt gggatgcttt gccaggtcat gtgcttattg tctcattttg
1380tagtctttta aagttgtatg aacccttaat ttgaagaact aacttgattt ctagagaaat
1440atccacacta tctcagtggt attttgcatt ggaaaaagga agcactgtgt agcagtgaat
1500tgtctgcttt ccaccgagta ctgtgtttat tctctctcca ggaaagcaga tcaaaagaaa
1560gttagcagat cgagtgtctt cttccttaga aataggttct ggtagcttct gtgcctgggt
1620agtatcagac cagtgggagt aaaccgagtg ttaagtgtca aggtgagaaa gcctcacatt
1680ctctcaagac agttgctcta ggagctgagt tgctggtttg gaagtgtgga gattgcattt
1740ctggcttctc tcaatggctt gtgttgagga ctctgggtgc tcctggccct aattgtgcac
1800cctgatcccc gtgcttggag ctaggcctgg tggcgtgctc tagccctctc agttaccagc
1860tctttggaga aggatcaaaa ttcagatgga atgtgggatg ggtaataggt gagagtagaa
1920acccttccct ccaggaggcc cccgctgact cccacagaaa cccacctacc ataagatgtc
1980ttcaggtggc cttgtccaag gatgggggtc aggcatttta tatcaagggt gctctgaaca
2040tattttattt tttaaaaaaa ctatgtttgt gaattttgcg tatactggca agcttttgaa
2100aatgtattta attttgtatt gtttaccaat gatttattta caagatattt actcaaataa
2160atggagctgc ttacaagcct gttgacatgt gtggcttgca caacacgtta a
2211911787DNAHomo sapiens 91gctgctcctc tgtcgagctg atcacaccca cagttgagct
gcgctggcca gagatgcctg 60cccacagcct ggtgatgagc agcccggccc tcccggcctt
cctgctctgc agcacgctgc 120tggtcatcaa gatgtacgtg gtggccatca tcacgggcca
agtgaggctg cggaagaagg 180cctttgccaa ccccgaggat gccctgagac acggaggccc
ccagtattgc aggagcgacc 240ccgacgtgga acgctgcctc agggcccacc ggaacgacat
ggagaccatc taccccttcc 300ttttcctggg cttcgtctac tcctttctgg gtcctaaccc
ttttgtcgcc tggatgcact 360tcctggtctt cctcgtgggc cgtgtggcac acaccgtggc
ctacctgggg aagctgcggg 420cacccatccg ctccgtgacc tacaccctgg cccagctccc
ctgcgcctcc atggctctgc 480agatcctctg ggaagcggcc cgccacctgt gaccagcagc
tgatgcctcc ttggccacca 540gaccatgggc caagagccgc cgtggctata cctggggact
tgatgttcct tccagattgt 600ggtgggccct gagtcctggt ttcctggcag cctgctgcgc
gtgtgggtct ctgggcacag 660tgggcctgtg tgtgtgcccg tgtgtgtgta tgtgtgtgtg
tatgtttctt agccccttgg 720attcctgcac gaagtggctg atgggaacca tttcaagaca
gattgtgaag attgatagaa 780aatccttcag ctaaagtaac agagcatcaa aaacatcact
ccctctccct ccctaacagt 840gaaaagagag aagggagact ctatttaaga ttcccaaacc
taatgatcat ctgaatcccg 900ggctaagaat gcagactttt cagactgacc ccagaaattc
tggcccagcc aatctagagg 960caagcctggc catctgtatt tttttttttc caagacagag
tcttgctctg ttgcccaagc 1020tggagtgaag tggtacaatc tggctcactg cagcctccgc
ctcccgggtt caagcgattc 1080tcccgcctca gcctcctgag tagctgggat tacaggcgcg
tatcaccata cccagctaat 1140ttttgtattt ttagtagaga cgggttcacc atgttgccca
ggagggtctc gaactcctgg 1200cctcaagtga tccaccggcc tcggcctccc aaagtgctgg
gatgacaggc atgaatcact 1260gtgctcagcc accatctgga gttttaaaag gctcccatgt
gagtccctgt gatggccagg 1320ccaggggacc cctgccagtt ctctgtggaa gcaaggctgg
ggtcttgggt tcctgtatgg 1380tggaagctgg gtgagccaag gacagggctg gctcctctgc
ccccgctgac gcttcccttg 1440ccgttggctt tggatgtctt tgctgcagtc ttctctctgg
ctcaggtgtg ggtgggaggg 1500gcccacagga agctcagcct tctcctccca aggtttgagt
ccctccaaag ggcagtgggt 1560ggaggaccgg gagctttggg tgaccagcca ctcaaaggaa
ctttctggtc ccttcagtat 1620cttcaaggtt tggaaactgc aaatgtcccc ttgatgggga
atccgtgtgt gtgtgtgtgt 1680gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gttttctcct
agacccgtga cctgagatgt 1740gtgattttta gtcattaaat ggaagtgtct gccagctggg
cccagca 1787921955DNAHomo sapiens 92attcatcccc attcaggctt
tcctcagcat ttattaagga ctctctgctc cagcctctca 60ctctcactct cctccgctca
aactcagctc acttgagagt ctcctcccgc cagctgtgga 120aagaactttg cgtctctcca
gcaatgcatc tccttgcgat tctgttttgt gctctctggt 180ctgcagtgtt ggccgagaac
tcggatgatt atgatctcat gtatgtgaat ttggacaacg 240aaatagacaa tggactccat
cccactgagg accccacgcc gtgcgcctgc ggtcaggagc 300actcggaatg ggacaagctc
ttcatcatgc tggagaactc gcagatgaga gagcgcatgc 360tgctgcaagc cacggacgac
gtcctgcggg gcgagctgca gaggctgcgg gaggagctgg 420gccggctcgc ggaaagcctg
gcgaggccgt gcgcgccggg ggctcccgca gaggccaggc 480tgaccagtgc tctggacgag
ctgctgcagg cgacccgcga cgcgggccgc aggctggcgc 540gtatggaggg cgcggaggcg
cagcgcccag aggaggcggg gcgcgccctg gccgcggtgc 600tagaggagct gcggcagacg
cgagccgacc tgcacgcggt gcagggctgg gctgcccgga 660gctggctgcc ggcaggttgt
gaaacagcta ttttattccc aatgcgttcc aagaagattt 720ttggaagcgt gcatccagtg
agaccaatga ggcttgagtc ttttagtgcc tgcatttggg 780tcaaagccac agatgtatta
aacaaaacca tcctgttttc ctatggcaca aagaggaatc 840catatgaaat ccagctgtat
ctcagctacc aatccatagt gtttgtggtg ggtggagagg 900agaacaaact ggttgctgaa
gccatggttt ccctgggaag gtggacccac ctgtgcggca 960cctggaattc agaggaaggg
ctcacatcct tgtgggtaaa tggtgaactg gcggctacca 1020ctgttgagat ggccacaggt
cacattgttc ctgagggagg aatcctgcag attggccaag 1080aaaagaatgg ctgctgtgtg
ggtggtggct ttgatgaaac attagccttc tctgggagac 1140tcacaggctt caatatctgg
gatagtgttc ttagcaatga agagataaga gagaccggag 1200gagcagagtc ttgtcacatc
cgggggaata ttgttgggtg gggagtcaca gagatccagc 1260cacatggagg agctcagtat
gtttcataaa tgttgtgaaa ctccacttga agccaaagaa 1320agaaactcac acttaaaaca
catgccagtt gggaaggtct gaaaactcag tgcataatag 1380gaacacttga gactaatgaa
agagagagtt gagaccaatc tttatttgta ctggccaaat 1440actgaataaa cagttgaagg
aaagacattg gaaaaagctt ttgaggataa tgttactaga 1500ctttatgcca tggtgctttc
agtttaatgc tgtgtctctg tcagataaac tctcaaataa 1560ttaaaaagga ctgtattgtt
gaacagaggg acaattgttt tacttttctt tggttaattt 1620tgttttggcc agagatgaat
tttacattgg aagaataaca aaataagatt tgttgtccat 1680tgttcattgt tattggtatg
taccttatta caaaaaaaag atgaaaacat atttatacta 1740caaggtgact taacaactat
aaatgtagtt tatgtgttat aatcgaatgt cacgtttttg 1800agaagatagt catataagtt
atattgcaaa agggatttgt attaatttaa gactattttt 1860gtaaagctct actgtaaata
aaatatttta taaaactagc tcacgtcatt taattataaa 1920tttaagagat gttttggaaa
aaaaaaaaaa aaaaa 1955932019DNAHomo sapiens
93gaaaaggcgg aggcggcggc ccctccggct cccactgcct cccccgccgc accccctccc
60caccttccgc acccgccaaa cttgatgtga ccctggcccg acgcggaggc tgcccctctc
120actgccccgt gggtcccccg ccacccgctc cgcacccgcg agcgcaccgc tccccgcgcc
180ccttcccact tcccgcgggg ccggcgccgc gctcgccctc gcgttccttc ccgccgcccc
240ctcccccgca ccatgagcaa cctgaagccg gacggcgagc acggcggcag caccggcacc
300ggctccggcg cgggctccgg cggcgccctg gaggaggagg tccggacact gtttgtcagc
360ggcctccctg tggacattaa acccagagaa ctctacttgc tcttccggcc gttcaagggg
420tatgaagggt ccctgatcaa gctcactgca agacagcctg ttggttttgt gatctttgac
480agccgtgcag gagcagaagc ggccaagaat gcgctgaacg gtattcgctt tgatcccgaa
540aatccacaga ctctgaggct agagtttgcc aaagccaaca ccaagatggc caagagcaag
600ctaatggcaa ctccaaatcc cagcaacgtg caccccgccc taggagcaca cttcatcgca
660cgggacccct atgacctgat gggggctgct ctgatccctg catccccaga ggcctgggcc
720ccctaccctt tgtacaccac agagctgacc ccagccatct cccatgctgc gttcacctac
780ccaactgcca ctgccgctgc cgccgccctc cacgctcagg tgcgctggta cccttcctct
840gacaccaccc agcaaggatg gaagtaccgt cagttctgtt agtttttcag tctggtcacc
900ggggaggtgg ttctggtaat ctgtggtggt gccgggacag gcgccccgag ttcccactgc
960ccccgggcgg cctgcacaga gctgctgccc tccagagact gtgaatccca agcctgactc
1020agtggactgc ttcctgttcc cctccctcct cttcctcacc ttgttctgca ccctcaagcc
1080tttctccaat gcctcccagg aggatttggg gactttctcc ctggggcgcc cagatccagc
1140tcggaggcct cactgggacc tggcaaggcc tgacctcccg cccaaacttg cttctgtagc
1200tccccctcga ggaagtgagg tgtttaattt tgcatgtttt ctggcatgaa ttaagacact
1260tatacttgta tatatgagtg tacagtttgt tctcacactg tcaccatagc gacaggtcct
1320ggctcccagt ggttcatcct gcctgcccct ctctcctcgc cccgcccctg cacccacccc
1380gcttcaggga ggcccaagtt ccgtggcccc acacgcttcc aggctcagct cccacctcca
1440cccaacagat agatggggtt tgctttttca tttcacatgg ggctcctccg ctcctgcctt
1500ctcggatggg ccaacagtcg taagaaagcc ctctctgccc gttctgttca cctctccaca
1560gcgcaccccg cccgccgctg ctcctcattc tttccaaacc tcgaaaccaa ccaaaacgtg
1620agaagtattt ttgtaccctg tgtaacaaaa tatttatgca tcataaagga tttttcatgt
1680gcgtaccatt aattattaaa gcgacctcgt tcgccctgtc agataagttt aatgtttagt
1740ttgaggcatg aagaagaaaa gggtttccat tcttcagcag tacgcctttg tgtctggcat
1800ttgtttaaga aaatgaaatg aaggaaacac tgtgcaatgt tttttgtttt gagcatatca
1860gtgctttact gtcagccgca gctgtgaccg tctggccatt tcagacttgg gagatgaggc
1920ggctgttgtc attgctgatc ctgtgagaat gtgaaactgg ataatatatg aaatgcaaaa
1980taaaacaaaa ccaaaaaaaa aaaaaaaaaa aaaaaaaaa
2019941517DNAHomo sapiens 94ataagacttt tatggatgga ttgtttttct caaataatat
tatcgctttg tgactaaagt 60aaagattatt aattcctgag gcaagaagat ataaaagctc
cagaaacgtt gactgggacc 120actggagaca ctgaagaagg caggggccct tagagtcttg
gttgccaaac agaatgccca 180tatccgtctt acctgtgagg aagcttgcct tgggcgccct
ctgctggccc tcctgaagct 240aacaggggcg agtgctcggt ggtttacaaa ttgcctccat
gcagactatg aaactgttca 300gcctgctata gttagatctc tggcactggc ccaggaggtc
ttgcagattt gcagatcaag 360gagaacccag gagtttcaaa gaagcgctag taaggtctct
gagatccttg cactagctac 420atcctcaggg taggaggaag atggcttcca gaagcatgcg
gctgctccta ttgctgagct 480gcctggccaa aacaggagtc ctgggtgata tcatcatgag
acccagctgt gctcctggat 540ggttttacca caagtccaat tgctatggtt acttcaggaa
gctgaggaac tggtctgatg 600ccgagctcga gtgtcagtct tacggaaacg gagcccacct
ggcatctatc ctgagtttaa 660aggaagccag caccatagca gagtacataa gtggctatca
gagaagccag ccgatatgga 720ttggcctgca cgacccacag aagaggcagc agtggcagtg
gattgatggg gccatgtatc 780tgtacagatc ctggtctggc aagtccatgg gtgggaacaa
gcactgtgct gagatgagct 840ccaataacaa ctttttaact tggagcagca acgaatgcaa
caagcgccaa cacttcctgt 900gcaagtaccg accatagagc aagaatcaag attctgctaa
ctcctgcaca gccccgtcct 960cttcctttct gctagcctgg ctaaatctgc tcattatttc
agaggggaaa cctagcaaac 1020taagagtgat aagggcccta ctacactggc ttttttaggc
ttagagacag aaactttagc 1080attggcccag tagtggcttc tagctctaaa tgtttgcccc
gccatccctt tccacagtat 1140ccttcttccc tcctcccctg tctctggctg tctcgagcag
tctagaagag tgcatctcca 1200gcctatgaaa cagctgggtc tttggccata agaagtaaag
atttgaagac agaaggaaga 1260aactcaggag taagcttcta gaccccttca gcttctacac
ccttctgccc tctctccatt 1320gcctgcaccc caccccagcc actcaactcc tgcttgtttt
tcctttggcc atgggaaggt 1380ttaccagtag aatccttgct aggttgatgt gggccataca
ttcctttaat aaaccattgt 1440gtacataaga ggttgctgtg ttccagttca gtaatggtga
atgtggaaaa gtgaaataag 1500accaagaaat acaccca
1517951162DNAHomo sapiens 95acagggcagt gtagttccag
aaaataggac tgaccaagaa gcagaaaagc aagatgaatg 60atgtgaagct tgctgtcttg
ggtggtgaag gaacaggcaa atctggtccc tacatcctta 120aataactgca aatacttggt
agtgtctttg tgactattac acatagtacc ttgaagccag 180actgagtttg acagagaaaa
taaacagatg tcaaacttct tgcatctcaa atataatgag 240aaatctgttt ctgttacaaa
agcccttaca gtgaggtttc ttactaagcg attcattgga 300gaatatgctt ctaattttga
atctatctat aagaagcact tgtgtttgga aaggaaacaa 360ctaaatctag aaatatatga
cccttgttct caaacacaga aagcaaaatt ctccctcaca 420agtgagcttc actgggcaga
tgggtttgtt attgtgtatg acatcagtga taggtcttca 480tttgcttttg caaaagcgct
gatctacaga atccgggagc cacaaactag tcattgtaaa 540agagctgtgg aatcagcagt
gtttttggtt ggcaacaaac gagatctttg tcatgtgcga 600gaggttggct gggaagaagg
gcaaaagctg gcactggaaa accgatgcca attctgtgaa 660ctgtctgcag cagagcagtc
tctggaggtg gaaatgatgt ttatcagaat tatcaaggac 720atcctgataa acttcaaact
caaagaaaag agacgtccca gtggatctaa atcaatggcc 780aaattgatca ataatgtatt
tggaaagaga aggaaatctg tttagtagac aggtaatcct 840gggagatttc ctatatcaga
gagtttcaaa cattcacatg ataattaaac taacctttgt 900atgcaatttt tttttggtaa
aaagaattct cttggagata tgaaatgatt gagtatgaac 960cacagctgtg ttttcaaata
tgtagtttgc ctttttggtt gttgtaccct gctcactctc 1020cttcacacag aacctttcat
ttattgtaca acatcacact caccctaacc tactggcgga 1080cagcgatccc agtttgcctt
gccaaataaa ctctgtttat gtgaatttat taaacgacca 1140tgccataaaa aaaaaaaaaa
aa 1162964583DNAHomo sapiens
96gcggccgccc cggcggctcc tggaaccccg gttcgcggcg atgccagcca ccccagcgaa
60gccgccgcag ttcagtgctt ggataatttg aaagtacaat agttggtttc cctgtccacc
120cgccccactt cgcttgccat cacagcacgc ctatcggatg tgagaggaga agtcccgctg
180ctcgggcact gtctatatac gcctaacacc tacatatatt ttaaaaacat taaatataat
240taacaatcaa aagaaagagg agaaaggaag ggaagcatta ctgggttact atgcacttgc
300gactgatttc ttggcttttt atcattttga actttatgga atacatcggc agccaaaacg
360cctcccgggg aaggcgccag cgaagaatgc atcctaacgt tagtcaaggc tgccaaggag
420gctgtgcaac atgctcagat tacaatggat gtttgtcatg taagcccaga ctattttttg
480ctctggaaag aattggcatg aagcagattg gagtatgtct ctcttcatgt ccaagtggat
540attatggaac tcgatatcca gatataaata agtgtacaaa atgcaaagct gactgtgata
600cctgtttcaa caaaaatttc tgcacaaaat gtaaaagtgg attttactta caccttggaa
660agtgccttga caattgccca gaagggttgg aagccaacaa ccatactatg gagtgtgtca
720gtattgtgca ctgtgaggtc agtgaatgga atccttggag tccatgcacg aagaagggaa
780aaacatgtgg cttcaaaaga gggactgaaa cacgggtccg agaaataata cagcatcctt
840cagcaaaggg taacctgtgt cccccaacaa atgagacaag aaagtgtaca gtgcaaagga
900agaagtgtca gaagggagaa cgaggaaaaa aaggaaggga gaggaaaaga aaaaaaccta
960ataaaggaga aagtaaagaa gcaatacctg acagcaaaag tctggaatcc agcaaagaaa
1020tcccagagca acgagaaaac aaacagcagc agaagaagcg aaaagtccaa gataaacaga
1080aatcggtatc agtcagcact gtacactaga gggttccatg agattattgt agactcatga
1140tgctgctatc tcaaccagat gcccaggaca ggtgctctag ccattaggac cacaaatgga
1200catgtcagtt attgctctgt ctaaacaaca ttcccagtag ttgctatatt cttcatacaa
1260gcatagttaa caacaaagag ccaaaagatc aaagaaggga tactttcaga tggttgtctt
1320gtgtgcttct ctgcattttt aaaagacaag acattcttgt acatattatc aataggctat
1380aagatgtaac aacgaaatga tgacatctgg agaagaaaca tcttttcctt ataaaaatgt
1440gttttcaagc tgttgtttta agaagcaaaa gatagttctg caaattcaaa gatacagtat
1500cccttcaaaa caaataggag ttcagggaag agaaacatcc ttcaaaggac agtgttgttt
1560tgaccgggag atctagagag tgctcagaat tagggcctgg catttggaat cacaggattt
1620atcatcacag aaacaactgt tttaagatta gttccatcac tctcatcctg tatttttata
1680agaaacacaa gagtgcatac cagaattgaa tataccatat gggattggag aaagacaaat
1740gtggaagaaa tcatagagct ggagactact tttgtgcttt acaaaactgt gaaggattgt
1800ggtcacctgg aacaggtctc caatctatgt tagcactatg tggctcagcc tctgttaccc
1860cttggattat atatcaacct gtaaacatgt gcctgtaact tacttccaaa aacaaaatca
1920tacttattag aagaaaattc tgattttata gaaaaaaaat agagcaagga gaatataaca
1980tgtttgcaaa gtcatgtgtt ttctttctca atgagggaaa aacaatttta ttacctgctt
2040aatggtccac ctggaactaa aagggatact attttctaac aaggtatatc tagtagggga
2100gaaagccacc acaataaata tatttgttaa tagtttttca agttttgttc actctgtttt
2160attgtttgtt ttattgagaa attcttactc ttagagactc atgaattaag aaagagaatt
2220ctgctaactc agagaacctg gttcctatgt aattcagaat atattacatt tctcagtaat
2280atttgttttt tgaatccacc tttatctgag ccaatggaga tttacttata gcgtattagg
2340agatatttat tccattttct tattttaatc aacattctaa ttatagacac atgggcctcc
2400ctagctgatt tcactgctcc cccttcattg cttagaaatg ggcatcattt cttgtatgtc
2460agatccccct gcatcttcaa catttagtct tttcttctcc atattttcta tctgtggatc
2520tctttagggg attgaagtca ccctagctga aggcctcacc agtgtttcac agaggacaca
2580gcccacccct tgcaggagga ggtatctctg agtgtgcagc acagaatcgc atgacccacc
2640ttaaccttcc tgttgtcatg gaaggatgca cggctgctct gtccactgtg attcctagcc
2700ctctcaagat cactgctttc tgaagaattt gcaatgactc tggcttctgg ctgcttatct
2760ctggacaccc gttctccacc agttgtacag ttcatgtaat ctacttggct taattgattt
2820tccacttctc tcttcctctt ctaagatata aacattttaa atgatttatt cctgtttctt
2880attctggtgt ttctttcctt gtccctatga gataagtgtc tcaactcact aaatctattc
2940ccaatgtata aaataattct aattccattt tcagctaaaa catatattac caagaagaaa
3000caaactttat cctacagaat gatgttaggt agaaatatgt ccccaggttt gagacctttc
3060ggatgatttc atataccatc tttcttctga gtgttaccca gtcaagtata agtagccaaa
3120ttatttttgc acatctttct gtttctcatg tcttcattta ttcaacaagc acttactggg
3180aaggtctaca cctgcatagg caatgctgga aaaagggtta agtaaaccag gacatgacaa
3240tggtggcaaa tgactatcag gtcttcccat gtgtttgact caaacttatt accctatggt
3300ccttctgaca atggcagaag gtctgaatcc ttgatgctaa acttatataa aagtagaatt
3360attacaaagg aaaaagaaat aaaaactaac attcattttc atatgttgga tgaaatataa
3420atgaagaaaa agataacatc aattttaact gtaattctcc atccaccagt aacagatcct
3480taagacaata gaatcataca gtattcaaac cagcagcctt ctcaaatttg agcaaaaact
3540ctatcaacct ctggtaaagt tcctacacta gtcacagaag gtgttaactt tctactctga
3600ttctgtctcc ataatggggt aaactgttga tagtttaccc catcaacaga tggtcggtaa
3660attattgatt cgaagaatcg agagagtgca gcaacataaa tctgttaatg tctgatcaag
3720ctcctgccct gttctccgaa ttcagcttca taattaaggg aaggcctgtt ttctatcctc
3780agatttaggt tctagtagca gttgtgtaac cactagtgag tcacttaact cctctgggtc
3840cccatttctc atgtgcaaca agaaagaggg gaactggaga tgatcactct agttccagac
3900aagggaacat ttcacacttt gtttacttca gggtgatgtc cctgagtcct cattagtgac
3960tgcgtccttt ggaagttatc ccaaccctgc ttttctcaaa agtgaaaatg tataggctct
4020cagaggagac agatttaact ctgcttctct aatgttattg aattaaaagc tgttcacatt
4080agtggttatt aaatattgaa ataacactgg gaagaaaaag catatataaa tacagctaaa
4140aacaagaata gatattcatt ctcacaaagg gagacagcaa agaaaatgga aagtgcactg
4200gtgctagcgt tagacagctt gtgttaatgt ctcaattctg ctactaactg gttgcagctt
4260gtgtgacctt gggcacattg tatgatctcg cagaatatca tcccaaatct gcaaaatgga
4320attggcatca tctcttttgc aagattgtta tgagaattaa aaggttcttc attcaatata
4380ataataaata ttttgtatat aaatgaatat caattaaaag ttatgactaa ttccacaagt
4440caaacatata aattttattt cttgattcat gatatgtgat agtattcata aaaatgtaca
4500tgcatgataa tttcaaggaa taagtatata tgtgagaatc atggaaatga aattaataat
4560attaactagt aattaaattg taa
458397970DNAHomo sapiens 97ctcccctcac cccggtccag gatgcccagt ccccacgaca
cctcccactt cccactgtgg 60cctgggtggg ctcaggggct gcccttgacc tggcctagag
ccctccccca gctggtggtg 120gagctggcac tctctgggag ggagggggct gggagggaat
gagtgggaat ggcaagaggc 180cagggtttgg tgggatcagg ttgaggcagg tttggtttcc
ttaaaatgcc aagttggggg 240ccagtggggc ccacatataa atcctcaccc tgggagcctg
gctgccttgc tctccttcct 300gggtctgtct ctgccacctg gtctgccaca gatccatgat
gtgcagttct ctggagcagg 360cgctggctgt gctggtcact accttccaca agtactcctg
ccaagagggc gacaagttca 420agctgagtaa gggggaaatg aaggaacttc tgcacaagga
gctgcccagc tttgtggggg 480agaaagtgga tgaggagggg ctgaagaagc tgatgggcag
cctggatgag aacagtgacc 540agcaggtgga cttccaggag tatgctgttt tcctggcact
catcactgtc atgtgcaatg 600acttcttcca gggctgccca gaccgaccct gaagcagaac
tcttgacttc ctgccatgga 660tctcttgggc ccaggactgt tgatgccttt gagttttgta
ttcaataaac tttttttgtc 720tgttgataat attttaattg ctcagtgatg ttccataacc
cggctggctc agctggagtg 780ctgggagatg agggcctcct ggatcctgct cccttctggg
ctctgactct cctggaaatc 840tctccaaggc cagagctatg ctttaggtct caattttgga
atttcaaaca ccagcaaaaa 900attggaaatc gagataggtt gctgactttt attttgtcaa
ataaagatat taaaaaaggc 960aaaaaaaaaa
970983240DNAHomo sapiens 98aagaaacctc tgaactgttc
actaatacag tcaggtagag gttgagactc cactgaataa 60actctaggtt cccatttctt
tcagccagat cctcccaggg aatcactaca ggctggttag 120ccaaaaagtc ctgattttct
gctcaataga ggtccttact ggaaggcagc atgtccaatg 180ttaccttgag aaaaatgtct
cccacaggaa atgagatgaa gagcaccact cagggaacca 240cacggaagca gcaggatttt
cacgaggtga acaaaagaag aactttctta caggataaca 300gttggataaa gaaacgccct
gaagaagaaa aagatgaaaa ttacggtagg gtggtgctca 360accgacataa ttcccatgat
gcattggaca ggaaagtaaa tgagagagat gtgccaaaag 420ctacaattag tcggtacagt
tctgatgaca ctttggacag gatctcagac agaaatgatg 480ctgctaaaac atataaggcc
aataccttgg ataaccaact aaccaatagg agcatgtcca 540tgtttagatc actggaagta
acaaagttgc aacctggcgg ttcattgaat gccaacacct 600ccaacaccat agcatccact
tctgctacta ctcctgtaaa gaagaagagg cagtcctggt 660ttccaccgcc ccctccaggt
tacaatgcct cctcgagcac aggaaccagg agacgggaac 720caggtgttca ccctccaata
cctccaaagc ccagttctcc tgtttcttct cctaaccagc 780tgagacagga taataggcag
atacatccac ctaaaccagg tgtatataca gaaaccaaca 840gatctgctga aagaaatata
aggagtcagg atcttgataa catcgtcaaa gtggccactt 900cacttcagag aagtgacaaa
ggtgaagaat tggataatct catcaaaatg aacaaaagct 960tgaataggaa tcaaggtctt
gatagtctct tcagagcaaa tccaaaggta gaagaaagag 1020agaaaagagc caaaagcctt
gaaagtctca tctatatgag tacccggaca gataaagatg 1080gcaaaggaat ccaaagcctt
ggaagtccga ttaaagttaa tcaaaggact gacaaaaatg 1140agaaaggaag acaaaatctc
gaatctgttg ctaaagtgaa tgccaggatg aataaaacga 1200gcagaagaag tgaagacctt
gataatgcta ctgaagtaaa tcccaaagga catgaaaata 1260ccactggaaa aaaagacctt
gatgggctta ttaaagtgga tcctgaaaca aataaaaata 1320ttacgagggg ccagagcctt
gataatctca tcaaagtgac ccctgaagta aagagaagta 1380accaaggttc caaagacctt
aataacttca tcaaagtgta tccaggaaca gaaaaaagta 1440ctgaaggggg ccaaagtctc
gacagcctca ttaaagtgac tcctgaaaga aacagaacta 1500accaagggaa ccaagacttg
gaaaatctta tcaaagtgat cccttcagca aacaaaagca 1560gtgaacaagg tcttgatgaa
catattaatg tcagccccaa agctgtcaaa aacactgatg 1620gaaaacaaga tcttgataaa
ctcatcaagg tgaatcctga aattttcaca aacaaccaaa 1680gaaaccaaga tcttgctaac
ctcatcaaag taaatcctgc agtaatcaga aacaatcaga 1740gccaagactt ggacaatctt
attaaagtga aaccttcagc tcttagaaac actaatcgag 1800accagaacct ggaaaattta
attgaagtaa attctcatgt gtctgaaaac aagaatggaa 1860gctctaacac tggagccaag
caggcaggac cacaggatac tgttgtgtac acaaggacat 1920atgtggagaa tagtaaatca
cccaaggatg gatatcagga gaatatctct ggaaaataca 1980tacaaactgt ttattcaact
tctgataggt ctgtcattga aagagatatg tgcacttact 2040gccgaaaacc cttgggtgta
gaaactaaaa tgattttaga tgaattacaa atttgctgcc 2100attctacttg ctttaagtgt
gaaatatgca agcagccttt ggaaaatcta caagcgggtg 2160atagtatttg gatttataga
cagacaatac actgtgaacc ttgctactct aaaattatgg 2220caaagtggat tccataactc
tggcacaagg aaatcaagat gaaaagcact cattaaggaa 2280ttaaagttac aagttttatc
ttaataatat gtaatctaga aaagctttca cattgaagat 2340caactcttgt acaaaattaa
caattctgtt attgcataag taatctaatt gtcttcaata 2400aggtcacaca cataaaaaga
gccatctggt ctctggctag agttagcaat aaaaagttca 2460aatggttcca gattccagtg
tcaaaggagt gatgcattac actccagcca ggtccatccc 2520tgctccgtat gttggctgtg
agtggtggtt tccatttaaa ccaagtttct catttcttca 2580cctttttttc tctaagaatt
tggattcgta gacattgaca tcccgaagaa ctgtcaagga 2640agcaagatat gctttcttca
tctgcaaaag aaatactaac aacaattttc ttatacagtt 2700tggcagaaag atgttaacat
aaaaagttta tatacctcaa aaatcactaa actttccaga 2760tctctgtcct attatttgta
acacaagggg cattggataa aatgatttct agggttcctt 2820ttgcttccca aattctctga
ttctaaagca gtttttagaa tcattagctc tttggaaaca 2880tatatgcata catgtttgtt
aagcctattg aactaggtag gacatataaa caatttaatt 2940ttagtgtcat tgtttaatca
cagacttagt gtttgaaaac tgtgttttaa aaacagaaac 3000agattgatgg gtaacaggta
aaatatgaca tgtatagctt acatgttatt atttgttaaa 3060ttttctttgt atacatttca
aaatctgggt atacttataa tccattagaa gtaatggtta 3120tggactaaaa agatatgttc
tttagtatgt tatatatact catattacat agcagtatgt 3180ttacaaaagg cttataaaaa
taaaatgaac tatcagttac atagaaaaaa aaaaaaaaaa 324099907DNAHomo sapiens
99cttttcttaa gggaaaaatc actctgtgtt cttttaaaat ccctcaggtt ttatgtttta
60ttgctaccag agtctgcctc cctgaggttc ttgtatagac tagttatttc cctctgtaaa
120gaagctgttc tattcgttct cgcctggttt ggaacaaact gaacacttcc aaaggaggca
180gtccttgcag ccttgtctcc ttccactccc ctcctcccca cagtcctggc tggagcagcg
240agtctgtcga tcccaggcca gagacaaggc agacaaaggt tcatttgtaa agaagctcct
300tccagcacct cctctcttct ccttttgccc aaactcaccc agtgagtgtg agcatttaag
360aagcatcctc tgccaagacc aaaaggaaag aagaaaaagg gccaaaagcc aaaatgaaac
420tgatggtact tgttttcacc attgggctaa ctttgctgct aggagttcaa gccatgcctg
480caaatcgcct ctcttgctac agaaagatac taaaagatca caactgtcac aaccttccgg
540aaggagtagc tgacctgaca cagattgatg tcaatgtcca ggatcatttc tgggatggga
600agggatgtga gatgatctgt tactgcaact tcagcgaatt gctctgctgc ccaaaagacg
660ttttctttgg accaaagatc tctttcgtga ttccttgcaa caatcaatga gaatcttcat
720gtattctgga gaacaccatt cctgatttcc cacaaactgc actacatcag tataactgca
780tttctagttt ctatatagtg caatagagca tagattctat aaattcttac ttgtctaaga
840caagtaaatc tgtgttaaac aagtagtaat aaaagttaat tcaatctaat ttttctctgt
900ggaaaaa
9071001793DNAHomo sapiens 100aaatactaac cacagaggga gaggcagcaa gaggagaggc
ataaattcag gatctcaccc 60ttcattccac agacacacat agcctctctg cccacctctg
cttcctctag gaacacagga 120gttccagatc acatcgagtt caccatgaat tcactcagtg
aagccaacac caagttcatg 180ttcgacctgt tccaacagtt cagaaaatca aaagagaaca
acatcttcta ttcccctatc 240agcatcacat cagcattagg gatggtcctc ttaggagcca
aagacaacac tgcacaacag 300attaagaagg ttcttcactt tgatcaagtc acagagaaca
ccacaggaaa agctgcaaca 360tatcatgttg ataggtcagg aaatgttcat caccagtttc
aaaagcttct gactgaattc 420aacaaatcca ctgatgcata tgagctgaag atcgccaaca
agctcttcgg agaaaaaacg 480tatctatttt tacaggaata tttagatgcc atcaagaaat
tttaccagac cagtgtggaa 540tctgttgatt ttgcaaatgc tccagaagaa agtcgaaaga
agattaactc ctgggtggaa 600agtcaaacga atgaaaaaat taaaaaccta attcctgaag
gtaatattgg cagcaatacc 660acattggttc ttgtgaacgc aatctatttc aaagggcagt
gggagaagaa atttaataaa 720gaagatacta aagaggaaaa attttggcca aacaagaata
catacaagtc catacagatg 780atgaggcaat acacatcttt tcattttgcc tcgctggagg
atgtacaggc caaggtcctg 840gaaataccat acaaaggcaa agatctaagc atgattgtgt
tgctgccaaa tgaaatcgat 900ggtctccaga agcttgaaga gaaactcact gctgagaaat
tgatggaatg gacaagtttg 960cagaatatga gagagacacg tgtcgattta cacttacctc
ggttcaaagt ggaagagagc 1020tatgacctca aggacacgtt gagaaccatg ggaatggtgg
atatcttcaa tggggatgca 1080gacctctcag gcatgaccgg gagccgcggt ctcgtgctat
ctggagtcct acacaaggcc 1140tttgtggagg ttacagagga gggagcagaa gctgcagctg
ccaccgctgt agtaggattc 1200ggatcatcac ctacttcaac taatgaagag ttccattgta
atcacccttt cctattcttc 1260ataaggcaaa ataagaccaa cagcatcctc ttctatggca
gattctcatc cccgtagatg 1320caattagtct gtcactccat ttggaaaatg ttcacctgca
gatgttctgg taaactgatt 1380gctggcaaca acagattctc ttggctcata tttcttttct
ttctcatctt gatgatgatc 1440gtcatcatca agaatttaat gattaaaata gcatgccttt
ctctctttct cttaataagc 1500ccacatataa atgtactttt tcttccagaa aaattctcct
tgaggaaaaa tgtccaaaat 1560aagatgaatc acttaatacc gtatcttcta aatttgaaat
ataattctgt ttgtgacctg 1620ttttaaatga accaaaccaa atcatacttt ttctttgaat
ttagcaacct agaaacacac 1680atttctttga atttaggtga tacctaaatc cttcttatgt
ttctaaattt tgtgattcta 1740taaaacacat catcaataaa atagtgacat aaaatcaaaa
aaaaaaaaaa aaa 17931011787DNAHomo sapiens 101aaccacagag
ggaaaggcag caagaggaga ggcataaatt taggatctca cccttcattc 60cacagacaca
cacagcctct ctgcccacct ctgcttcctc taggaacaca ggagttccag 120atcacatcga
gttcaccatg aattcactca gtgaagccaa caccaagttc atgttcgatc 180tgttccaaca
gttcagaaaa tcaaaagaga acaacatctt ctattcccct atcagcatca 240catcagcatt
agggatggtc ctcttaggag ccaaagacaa cactgcacaa caaattagca 300aggttcttca
ctttgatcaa gtcacagaga acaccacaga aaaagctgca acatatcatg 360ttgataggtc
aggaaatgtt catcaccagt ttcaaaagct tctgactgaa ttcaacaaat 420ccactgatgc
atatgagctg aagatcgcca acaagctctt cggagaaaag acgtatcaat 480ttttacagga
atatttagat gccatcaaga aattttacca gaccagtgtg gaatctactg 540attttgcaaa
tgctccagaa gaaagtcgaa agaagattaa ctcctgggtg gaaagtcaaa 600cgaatgaaaa
aattaaaaac ctatttcctg atgggactat tggcaatgat acgacactgg 660ttcttgtgaa
cgcaatctat ttcaaagggc agtgggagaa taaatttaaa aaagaaaaca 720ctaaagagga
aaaattttgg ccaaacaaga atacatacaa atctgtacag atgatgaggc 780aatacaattc
ctttaatttt gccttgctgg aggatgtaca ggccaaggtc ctggaaatac 840catacaaagg
caaagatcta agcatgattg tgctgctgcc aaatgaaatc gatggtctgc 900agaagcttga
agagaaactc actgctgaga aattgatgga atggacaagt ttgcagaata 960tgagagagac
atgtgtcgat ttacacttac ctcggttcaa aatggaagag agctatgacc 1020tcaaggacac
gttgagaacc atgggaatgg tgaatatctt caatggggat gcagacctct 1080caggcatgac
ctggagccac ggtctctcag tatctaaagt cctacacaag gcctttgtgg 1140aggtcactga
ggagggagtg gaagctgcag ctgccaccgc tgtagtagta gtcgaattat 1200catctccttc
aactaatgaa gagttctgtt gtaatcaccc tttcctattc ttcataaggc 1260aaaataagac
caacagcatc ctcttctatg gcagattctc atccccatag atgcaattag 1320tctgtcactc
catttagaaa atgttcacct agaggtgttc tggtaaactg attgctggca 1380acaacagatt
ctcttggctc atatttcttt tctatctcat cttgatgatg atagtcatca 1440tcaagaattt
aatgattaaa atagcatgcc tttctctctt tctcttaata agcccacata 1500taaatgtact
tttccttcca gaaaaatttc ccttgaggaa aaatgtccaa gataagatga 1560atcatttaat
accgtgtctt ctaaatttga aatataattc tgtttctgac ctgttttaaa 1620tgaaccaaac
caaatcatac tttctcttca aatttagcaa cctagaaaca cacatttctt 1680tgaatttagg
tgatacctaa atccttctta tgtttctaaa ttttgtgatt ctataaaaca 1740catcatcaat
aaaataatga cataaaatca aaaaaaaaaa aaaaaaa
17871022633DNAHomo sapiens 102agtgggcgtg gcggtgctgc ccaggtgagc caccgctgct
tctgcccaga cacggtcgcc 60tccacatcca ggtctttgtg ctcctcgctt gcctgttcct
tttccacgca ttttccagga 120taactgtgac tccaggcccg caatggatgc cctgcaacta
gcaaattcgg cttttgccgt 180tgatctgttc aaacaactat gtgaaaagga gccactgggc
aatgtcctct tctctccaat 240ctgtctctcc acctctctgt cacttgctca agtgggtgct
aaaggtgaca ctgcaaatga 300aattggacag gttcttcatt ttgaaaatgt caaagatgta
ccctttggat ttcaaacagt 360aacatcggat gtaaacaaac ttagttcctt ttactcactg
aaactaatca agcggctcta 420cgtagacaaa tctctgaatc tttctacaga gttcatcagc
tctacgaaga gaccgtatgc 480aaaggaattg gaaactgttg acttcaaaga taaattggaa
gaaacgaaag gtcagatcaa 540caactcaatt aaggatctca cagatggcca ctttgagaac
attttagctg acaacagtgt 600gaacgaccag accaaaatcc ttgtggttaa tgctgcctac
tttgttggca agtggatgaa 660gaaattttct gaatcagaaa caaaagaatg tcctttcaga
gtcaacaaga cagacaccaa 720accagtgcag atgatgaaca tggaggccac gttctgtatg
ggaaacattg acagtatcaa 780ttgtaagatc atagagcttc cttttcaaaa taagcatctc
agcatgttca tcctactacc 840caaggatgtg gaggatgagt ccacaggctt ggagaagatt
gaaaaacaac tcaactcaga 900gtcactgtca cagtggacta atcccagcac catggccaat
gccaaggtca aactctccat 960tccaaaattt aaggtggaaa agatgattga tcccaaggct
tgtctggaaa atctagggct 1020gaaacatatc ttcagtgaag acacatctga tttctctgga
atgtcagaga ccaagggagt 1080ggccctatca aatgttatcc acaaagtgtg cttagaaata
actgaagatg gtggggattc 1140catagaggtg ccaggagcac ggatcctgca gcacaaggat
gaattgaatg ctgaccatcc 1200ctttatttac atcatcaggc acaacaaaac tcgaaacatc
attttctttg gcaaattctg 1260ttctccttaa gtggcatagc ccatgttaag tcctccctga
cttttctgtg gatgccgatt 1320tctgtaaact ctgcatccag agattcattt tctagataca
ataaattgct aatgttgctg 1380gatcaggaag ccgccagtac ttgtcatatg tagccttcac
acagatagac cttttttttt 1440tttccaattc tatcttttgt ttcctttttt cccataagac
aatgacatac gcttttaatg 1500aaaaggaatc acgttagagg aaaaatattt attcattatt
tgtcaaattg tccggggtag 1560ttggcagaaa tacagtcttc cacaaagaaa attcctataa
ggaagatttg gaagctcttc 1620ttcccagcac tatgctttcc ttctttggga tagagaatgt
tccagacatt ctcgcttccc 1680tgaaagactg aagaaagtgt agtgcatggg acccacgaaa
ctgccctggc tccagtgaaa 1740cttgggcaca tgctcaggct actataggtc cagaagtcct
tatgttaagc cctggcaggc 1800aggtgtttat taaaattctg aattttgggg attttcaaaa
gataatattt tacatacact 1860gtatgttata gaacttcatg gatcagatct ggggcagcac
cctataaatc aacaccttaa 1920tatgctgcaa caaaatgtag aatattcaga caaaatggat
acataaagac taagtagccc 1980ataaggggtc aaaatttgct gccaaatgcg tatgccacca
acttacaaaa acacttcgtt 2040cgcagagctt ttcagattgt ggaatgttgg ataaggaatt
atagacctct agtagctgaa 2100atgcaagacc ccaagaggaa gttcagatct taatataaat
tcactttcat ttttgatagc 2160tgtcccatct ggtcatttgg ttggcactag actggtggca
ggggcttcta gctgacttgc 2220acagggattc tcacaatagc cgatatcaga atttgtgttg
aaggaacttg tctcttcatc 2280taatatgata gcgggaaaag gagaggaaac tactgccttt
agaaaatata agtaaagtga 2340ttaaagtgct cacgttacct tgacacatag tttttcagtc
tatgggttta gttactttag 2400atggcaagca tgtaacttat attaatagta atttgtaaag
ttggttggat aagctatccg 2460tgttgcaggt tcatggatta cttctctata aaaaatatgt
atttaccaaa aattttgtga 2520cattccttct cccatctctt ccttgacctg cattgtaaat
aggttcttct tgttctgaga 2580ttcaatattg aatttttcct atgctattga caataaaata
ttattgaact aca 26331032005DNAHomo sapiens 103caacggctca
ttctgctccc ccgggtcgga gccccccgga gctgcgcgcg ggcttgcagc 60gcctcgcccg
cgctgtcctc ccggtgtccc gcttctccgc gccccagccg ccggctgcca 120gcttttcggg
gccccgagtc gcacccagcg aagagagcgg gcccgggaca agctcgaact 180ccggccgcct
cgcccttccc cggctccgct ccctctgccc cctcggggtc gcgcgcccac 240gatgctgcag
ggccctggct cgctgctgct gctcttcctc gcctcgcact gctgcctggg 300ctcggcgcgc
gggctcttcc tctttggcca gcccgacttc tcctacaagc gcagcaattg 360caagcccatc
cctgccaacc tgcagctgtg ccacggcatc gaataccaga acatgcggct 420gcccaacctg
ctgggccacg agaccatgaa ggaggtgctg gagcaggccg gcgcttggat 480cccgctggtc
atgaagcagt gccacccgga caccaagaag ttcctgtgct cgctcttcgc 540ccccgtctgc
ctcgatgacc tagacgagac catccagcca tgccactcgc tctgcgtgca 600ggtgaaggac
cgctgcgccc cggtcatgtc cgccttcggc ttcccctggc ccgacatgct 660tgagtgcgac
cgtttccccc aggacaacga cctttgcatc cccctcgcta gcagcgacca 720cctcctgcca
gccaccgagg aagctccaaa ggtatgtgaa gcctgcaaaa ataaaaatga 780tgatgacaac
gacataatgg aaacgctttg taaaaatgat tttgcactga aaataaaagt 840gaaggagata
acctacatca accgagatac caaaatcatc ctggagacca agagcaagac 900catttacaag
ctgaacggtg tgtccgaaag ggacctgaag aaatcggtgc tgtggctcaa 960agacagcttg
cagtgcacct gtgaggagat gaacgacatc aacgcgccct atctggtcat 1020gggacagaaa
cagggtgggg agctggtgat cacctcggtg aagcggtggc agaaggggca 1080gagagagttc
aagcgcatct cccgcagcat ccgcaagctg cagtgctagt cccggcatcc 1140tgatggctcc
gacaggcctg ctccagagca cggctgacca tttctgctcc gggatctcag 1200ctcccgttcc
ccaagcacac tcctagctgc tccagtctca gcctgggcag cttccccctg 1260ccttttgcac
gtttgcatcc ccagcatttc ctgagttata aggccacagg agtggatagc 1320tgttttcacc
taaaggaaaa gcccacccga atcttgtaga aatattcaaa ctaataaaat 1380catgaatatt
tttatgaagt ttaaaaatag ctcactttaa agctagtttt gaataggtgc 1440aactgtgact
tgggtctggt tggttgttgt ttgttgtttt gagtcagctg attttcactt 1500cccactgagg
ttgtcataac atgcaaattg cttcaatttt ctctgtggcc caaacttgtg 1560ggtcacaaac
cctgttgaga taaagctggc tgttatctca acatcttcat cagctccaga 1620ctgagactca
gtgtctaagt cttacaacaa ttcatcattt tataccttca atgggaactt 1680aaactgttac
atgtatcaca ttccagctac aatacttcca tttattagaa gcacattaac 1740catttctata
gcatgatttc ttcaagtaaa aggcaaaaga tataaatttt ataattgact 1800tgagtacttt
aagccttgtt taaaacattt cttacttaac ttttgcaaat taaacccatt 1860gtagcttacc
tgtaatatac atagtagttt acctttaaaa gttgtaaaaa tattgcttta 1920accaacactg
taaatatttc agataaacat tatattcttg tatataaact ttacatcctg 1980ttttacctat
aaaaaaaaaa aaaaa
20051043687DNAHomo sapiens 104tccaccattt tgctagagaa ggccgcggag gctcagagag
gtgcgcacac ttgccctgag 60tcacacagcg aatgccctcc gcggtcccaa cgcagagaga
acgagccgat cggcagcctg 120agcgaggcag tggttagggg gggccccggc cccggccact
cccctcaccc cctccccgca 180gagcgccgcc caggacaggc tgggccccag gccccgcccc
gaggtcctgc ccacacaccc 240ctgacacacc ggcgtcgcca gccaatggcc ggggtcctat
aaacgctacg gtccgcgcgc 300tctctggcaa gaggcaagag gtagcaacag cgagcgtgcc
ggtcgctagt cgcgggtccc 360cgagtgagca cgccagggag caggagacca aacgacgggg
gtcggagtca gagtcgcagt 420gggagtcccc ggaccggagc acgagcctga gcgggagagc
gccgctcgca cgcccgtcgc 480cacccgcgta cccggcgcag ccagagccac cagcgcagcg
ctgccatgga gcccagcagc 540aagaagctga cgggtcgcct catgctggcc gtgggaggag
cagtgcttgg ctccctgcag 600tttggctaca acactggagt catcaatgcc ccccagaagg
tgatcgagga gttctacaac 660cagacatggg tccaccgcta tggggagagc atcctgccca
ccacgctcac cacgctctgg 720tccctctcag tggccatctt ttctgttggg ggcatgattg
gctccttctc tgtgggcctt 780ttcgttaacc gctttggccg gcggaattca atgctgatga
tgaacctgct ggccttcgtg 840tccgccgtgc tcatgggctt ctcgaaactg ggcaagtcct
ttgagatgct gatcctgggc 900cgcttcatca tcggtgtgta ctgcggcctg accacaggct
tcgtgcccat gtatgtgggt 960gaagtgtcac ccacagccct tcgtggggcc ctgggcaccc
tgcaccagct gggcatcgtc 1020gtcggcatcc tcatcgccca ggtgttcggc ctggactcca
tcatgggcaa caaggacctg 1080tggcccctgc tgctgagcat catcttcatc ccggccctgc
tgcagtgcat cgtgctgccc 1140ttctgccccg agagtccccg cttcctgctc atcaaccgca
acgaggagaa ccgggccaag 1200agtgtgctaa agaagctgcg cgggacagct gacgtgaccc
atgacctgca ggagatgaag 1260gaagagagtc ggcagatgat gcgggagaag aaggtcacca
tcctggagct gttccgctcc 1320cccgcctacc gccagcccat cctcatcgct gtggtgctgc
agctgtccca gcagctgtct 1380ggcatcaacg ctgtcttcta ttactccacg agcatcttcg
agaaggcggg ggtgcagcag 1440cctgtgtatg ccaccattgg ctccggtatc gtcaacacgg
ccttcactgt cgtgtcgctg 1500tttgtggtgg agcgagcagg ccggcggacc ctgcacctca
taggcctcgc tggcatggcg 1560ggttgtgcca tactcatgac catcgcgcta gcactgctgg
agcagctacc ctggatgtcc 1620tatctgagca tcgtggccat ctttggcttt gtggccttct
ttgaagtggg tcctggcccc 1680atcccatggt tcatcgtggc tgaactcttc agccagggtc
cacgtccagc tgccattgcc 1740gttgcaggct tctccaactg gacctcaaat ttcattgtgg
gcatgtgctt ccagtatgtg 1800gagcaactgt gtggtcccta cgtcttcatc atcttcactg
tgctcctggt tctgttcttc 1860atcttcacct acttcaaagt tcctgagact aaaggccgga
ccttcgatga gatcgcttcc 1920ggcttccggc aggggggagc cagccaaagt gacaagacac
ccgaggagct gttccatccc 1980ctgggggctg attcccaagt gtgagtcgcc ccagatcacc
agcccggcct gctcccagca 2040gccctaagga tctctcagga gcacaggcag ctggatgaga
cttccaaacc tgacagatgt 2100cagccgagcc gggcctgggg ctcctttctc cagccagcaa
tgatgtccag aagaatattc 2160aggacttaac ggctccagga ttttaacaaa agcaagactg
ttgctcaaat ctattcagac 2220aagcaacagg ttttataatt tttttattac tgattttgtt
atttttatat cagcctgagt 2280ctcctgtgcc cacatcccag gcttcaccct gaatggttcc
atgcctgagg gtggagacta 2340agccctgtcg agacacttgc cttcttcacc cagctaatct
gtagggctgg acctatgtcc 2400taaggacaca ctaatcgaac tatgaactac aaagcttcta
tcccaggagg tggctatggc 2460cacccgttct gctggcctgg atctccccac tctaggggtc
aggctccatt aggatttgcc 2520ccttcccatc tcttcctacc caaccactca aattaatctt
tctttacctg agaccagttg 2580ggagcactgg agtgcaggga ggagagggga agggccagtc
tgggctgccg ggttctagtc 2640tcctttgcac tgagggccac actattacca tgagaagagg
gcctgtggga gcctgcaaac 2700tcactgctca agaagacatg gagactcctg ccctgttgtg
tatagatgca agatatttat 2760atatattttt ggttgtcaat attaaataca gacactaagt
tatagtatat ctggacaagc 2820caacttgtaa atacaccacc tcactcctgt tacttaccta
aacagatata aatggctggt 2880ttttagaaac atggttttga aatgcttgtg gattgagggt
aggaggtttg gatgggagtg 2940agacagaagt aagtggggtt gcaaccactg caacggctta
gacttcgact caggatccag 3000tcccttacac gtacctctca tcagtgtcct cttgctcaaa
aatctgtttg atccctgtta 3060cccagagaat atatacattc tttatcttga cattcaaggc
atttctatca catatttgat 3120agttggtgtt caaaaaaaca ctagttttgt gccagccgtg
atgctcaggc ttgaaatgca 3180ttattttgaa tgtgaagtaa atactgtacc tttattggac
aggctcaaag aggttatgtg 3240cctgaagtcg cacagtgaat aagctaaaac acctgctttt
aacaatggta ccatacaacc 3300actactccat taactccacc cacctcctgc acccctcccc
acacacacaa aatgaaccac 3360gttctttgta tgggcccaat gagctgtcaa gctgccctgt
gttcatttca tttggaattg 3420ccccctctgg ttcctctgta tactactgct tcatctctaa
agacagctca tcctcctcct 3480tcacccctga atttccagag cacttcatct gctccttcat
cacaagtcca gttttctgcc 3540actagtctga atttcatgag aagatgccga tttggttcct
gtgggtcctc agcactattc 3600agtacagtgc ttgatgcaca gcaggcactc agaaaatact
ggaggaaata aaacaccaaa 3660gatatttgtc aaaaaaaaaa aaaaaaa
36871052643DNAHomo sapiens 105agggcggggc gggcagcagg
tgagacgcca ggtctccagg gctccaatca ctccggagac 60tgagccatgg ggggaaagca
gcgggacgag gatgacgagg cctacgggaa gccagtcaaa 120tacgacccct cctttcgagg
ccccatcaag aacagaagct gcacagatgt catctgctgc 180gtcctcttcc tgctcttcat
tctaggttac atcgtggtgg ggattgtggc ctggttgtat 240ggagaccccc ggcaagtcct
ctaccccagg aactctactg gggcctactg tggcatgggg 300gagaacaaag ataagccgta
tctcctgtac ttcaacatct tcagctgcat cctgtccagc 360aacatcatct cagttgctga
gaacggccta cagtgcccca caccccaggt gtgtgtgtcc 420tcctgcccgg aggacccatg
gactgtggga aaaaacgagt tctcacagac tgttggggaa 480gtcttctata caaaaaacag
gaacttttgt ctgccagggg taccctggaa tatgacggtg 540atcacaagcc tgcaacagga
actctgcccc agtttcctcc tcccctctgc tccagctctg 600gggcgctgct ttccatggac
caacgttact ccaccggcgc tcccagggat caccaatgac 660accaccatac agcaggggat
cagcggtctt attgacagcc tcaatgcccg agacatcagt 720gttaagatct ttgaagattt
tgcccagtcc tggtattgga ttcttgttgc cctgggggtg 780gctctggtct tgagcctact
gtttatcttg cttctgcgcc tggtggctgg gcccctggtg 840ctggtgctga tcctgggagt
gctgggcgtg ctggcatacg gcatctacta ctgctgggag 900gagtaccgag tgctgcggga
caagggcgcc tccatctccc agctgggttt caccaccaac 960ctcagtgcct accagagcgt
gcaggagacc tggctggccg ccctgatcgt gttggcggtg 1020cttgaagcca tcctgctgct
gatgctcatc ttcctgcggc agcggattcg tattgccatc 1080gccctcctga aggaggccag
caaggctgtg ggacagatga tgtctaccat gttctaccca 1140ctggtcacct ttgtcctcct
cctcatctgc attgcctact gggccatgac tgctctgtac 1200ctggctacat cggggcaacc
ccagtatgtg ctctgggcat ccaacatcag ctcccccggc 1260tgtgagaaag tgccaataaa
tacatcatgc aaccccacgg cccaccttgt gaactcctcg 1320tgcccagggc tgatgtgcgt
cttccagggc tactcatcca aaggcctaat ccaacgttct 1380gtcttcaatc tgcaaatcta
tggggtcctg gggctcttct ggacccttaa ctgggtactg 1440gccctgggcc aatgcgtcct
cgctggagcc tttgcctcct tctactgggc cttccacaag 1500ccccaggaca tccctacctt
ccccttaatc tctgccttca tccgcacact ccgttaccac 1560actgggtcat tggcatttgg
agccctcatc ctgacccttg tgcagatagc ccgggtcatc 1620ttggagtata ttgaccacaa
gctcagagga gtgcagaacc ctgtagcccg ctgcatcatg 1680tgctgtttca agtgctgcct
ctggtgtctg gaaaaattta tcaagttcct aaaccgcaat 1740gcatacatca tgatcgccat
ctacgggaag aatttctgtg tctcagccaa aaatgcgttc 1800atgctactca tgcgaaacat
tgtcagggtg gtcgtcctgg acaaagtcac agacctgctg 1860ctgttctttg ggaagctgct
ggtggtcgga ggcgtggggg tcctgtcctt cttttttttc 1920tccggtcgca tcccggggct
gggtaaagac tttaagagcc cccacctcaa ctattactgg 1980ctgcccatca tgacctccat
cctgggggcc tatgtcatcg ccagcggctt cttcagcgtt 2040ttcggcatgt gtgtggacac
gctcttcctc tgcttcctgg aagacctgga gcggaacaac 2100ggctccctgg accggcccta
ctacatgtcc aagagccttc taaagattct gggcaagaag 2160aacgaggcgc ccccggacaa
caagaagagg aagaagtgac agctccggcc ctgatccagg 2220actgcacccc acccccaccg
tccagccatc caacctcact tcgccttaca ggtctccatt 2280ttgtggtaaa aaaaggtttt
aggccaggcg ccgtggctca cgcctgtaat ccaacacttt 2340gagaggctga ggcgggcgga
tcacctgagt caggagttcg agaccagcct ggccaacatg 2400gtgaaacctc cgtctctatt
aaaaatacaa aaattagccg agagtggtgg catgcacctg 2460tcatcccagc tactcgggag
gctgaggcag gagaatcgct tgaacccggg aggcagaggt 2520tgcagtgagc cgagatcgcg
ccactgcact ccaacctggg tgacagactc tgtctccaaa 2580acaaaacaaa caaacaaaaa
gattttatta aagatatttt gttaactcag taaaaaaaaa 2640aaa
26431063604DNAHomo sapiens
106gggagaagga ggaggccggg ggaaggagga gacaggagga ggagggacca cggggtggag
60gggagataga cccagcccag agctctgagt ggtttcctgt tgcctgtctc taaacccctc
120cacattcccg cggtccttca gactgcccgg agagcgcgct ctgcctgccg cctgcctgcc
180tgccactgag ggttcccagc accatgaggg cctggatctt ctttctcctt tgcctggccg
240ggagggcctt ggcagcccct cagcaagaag ccctgcctga tgagacagag gtggtggaag
300aaactgtggc agaggtgact gaggtatctg tgggagctaa tcctgtccag gtggaagtag
360gagaatttga tgatggtgca gaggaaaccg aagaggaggt ggtggcggaa aatccctgcc
420agaaccacca ctgcaaacac ggcaaggtgt gcgagctgga tgagaacaac acccccatgt
480gcgtgtgcca ggaccccacc agctgcccag cccccattgg cgagtttgag aaggtgtgca
540gcaatgacaa caagaccttc gactcttcct gccacttctt tgccacaaag tgcaccctgg
600agggcaccaa gaagggccac aagctccacc tggactacat cgggccttgc aaatacatcc
660ccccttgcct ggactctgag ctgaccgaat tccccctgcg catgcgggac tggctcaaga
720acgtcctggt caccctgtat gagagggatg aggacaacaa ccttctgact gagaagcaga
780agctgcgggt gaagaagatc catgagaatg agaagcgcct ggaggcagga gaccaccccg
840tggagctgct ggcccgggac ttcgagaaga actataacat gtacatcttc cctgtacact
900ggcagttcgg ccagctggac cagcacccca ttgacgggta cctctcccac accgagctgg
960ctccactgcg tgctcccctc atccccatgg agcattgcac cacccgcttt ttcgagacct
1020gtgacctgga caatgacaag tacatcgccc tggatgagtg ggccggctgc ttcggcatca
1080agcagaagga tatcgacaag gatcttgtga tctaaatcca ctccttccac agtaccggat
1140tctctcttta accctcccct tcgtgtttcc cccaatgttt aaaatgtttg gatggtttgt
1200tgttctgcct ggagacaagg tgctaacata gatttaagtg aatacattaa cggtgctaaa
1260aatgaaaatt ctaacccaag acatgacatt cttagctgta acttaactat taaggccttt
1320tccacacgca ttaatagtcc catttttctc ttgccatttg tagctttgcc cattgtctta
1380ttggcacatg ggtggacacg gatctgctgg gctctgcctt aaacacacat tgcagcttca
1440acttttctct ttagtgttct gtttgaaact aatacttacc gagtcagact ttgtgttcat
1500ttcatttcag ggtcttggct gcctgtgggc ttccccaggt ggcctggagg tgggcaaagg
1560gaagtaacag acacacgatg ttgtcaagga tggttttggg actagaggct cagtggtggg
1620agagatccct gcagaaccca ccaaccagaa cgtggtttgc ctgaggctgt aactgagaga
1680aagattctgg ggctgtgtta tgaaaatata gacattctca cataagccca gttcatcacc
1740atttcctcct ttacctttca gtgcagtttc ttttcacatt aggctgttgg ttcaaacttt
1800tgggagcacg gactgtcagt tctctgggaa gtggtcagcg catcctgcag ggcttctcct
1860cctctgtctt ttggagaacc agggctcttc tcaggggctc tagggactgc caggctgttt
1920cagccaggaa ggccaaaatc aagagtgaga tgtagaaagt tgtaaaatag aaaaagtgga
1980gttggtgaat cggttgttct ttcctcacat ttggatgatt gtcataaggt ttttagcatg
2040ttcctccttt tcttcaccct cccctttttt cttctattaa tcaagagaaa cttcaaagtt
2100aatgggatgg tcggatctca caggctgaga actcgttcac ctccaagcat ttcatgaaaa
2160agctgcttct tattaatcat acaaactctc accatgatgt gaagagtttc acaaatcctt
2220caaaataaaa agtaatgact tagaaactgc cttcctgggt gatttgcatg tgtcttagtc
2280ttagtcacct tattatcctg acacaaaaac acatgagcat acatgtctac acatgactac
2340acaaatgcaa acctttgcaa acacattatg cttttgcaca cacacacctg tacacacaca
2400ccggcatgtt tatacacagg gagtgtatgg ttcctgtaag cactaagtta gctgttttca
2460tttaatgacc tgtggtttaa cccttttgat cactaccacc attatcagca ccagactgag
2520cagctatatc cttttattaa tcatggtcat tcattcattc attcattcac aaaatattta
2580tgatgtattt actctgcacc aggtcccatg ccaagcactg gggacacagt tatggcaaag
2640tagacaaagc atttgttcat ttggagctta gagtccagga ggaatacatt agataatgac
2700acaatcaaat ataaattgca agatgtcaca ggtgtgatga agggagagta ggagagacca
2760tgagtatgtg taacaggagg acacagcatt attctagtgc tgtactgttc cgtacggcag
2820ccactaccca catgtaactt tttaagattt aaatttaaat tagttaacat tcaaaacgca
2880gctccccaat cacactagca acatttcaag tgcttgagag ccatgcatga ttagtggtta
2940ccctattgaa taggtcagaa gtagaatctt ttcatcatca cagaaagttc tattggacag
3000tgctcttcta gatcatcata agactacaga gcacttttca aagctcatgc atgttcatca
3060tgttagtgtc gtattttgag ctggggtttt gagactcccc ttagagatag agaaacagac
3120ccaagaaatg tgctcaattg caatgggcca catacctaga tctccagatg tcatttcccc
3180tctcttattt taagttatgt taagattact aaaacaataa aagctcctaa aaaatcaaac
3240tgtattctgg tgttctcttc tacacagtgg gagggcgagc agtaggagag attggcccat
3300ttggtgctgg ccatttgagg aatgcaagcc cagcactagt ctcataatct ctaggaatct
3360gtagagagag gaattgaagt aaatttcagc attggctcat tcagtcattc ggcgacattc
3420atcaggtacc tgcaatgtgt taggggatct tatgagtagg cagcgtgcgt gatccttgct
3480cccctggagc tttctaacat tctagcaggc agaccacaca taaatttgca atactgtttc
3540tgataaaaac gtgctgtaaa ggaaataaag cagagaacta tcatggaaaa aaaaaaaaaa
3600aaaa
3604107386DNAHomo sapiens 107ccaggatcag catggccgtc cgccagtggg taatcgccct
ggccttggct gccctccttg 60ttgtggacag ggaagtgcca gtggcagcag gaaagctccc
tttctcaaga atgcccatct 120gtgaacacat ggtagagtct ccaacctgtt cccagatgtc
caacctggtc tgcggcactg 180atgggctcac atatacgaat gaatgccagc tctgcttggc
ccggataaaa accaaacagg 240acatccagat catgaaagat ggcaaatgct gatcccacag
gagcacctca agccatgaag 300tgtcagctgg agaacagtgg tgggcatgga gaggatatga
catgaaataa aagatccagc 360ccaactgaaa aaaaaaaaaa aaaaaa
386108641DNAHomo sapiens 108accagttcta agggaccata
cagagtattc ctctcttcac accaggacca gtcactgttg 60cagcatgagt tcccagcagc
agaagcagcc ttgcacccca ccccctcagc ttcagcagca 120gcaggtgaaa cagccttgcc
agcctccacc tcaggaacca tgcatcccca aaaccaagga 180gccctgccac cccaaggtgc
ctgagccctg ccaccccaaa gtgcccgagc cctgccagcc 240caaggttcca gagccatgcc
accccaaggt gcctgagccc tgcccttcaa tagtcactcc 300agcaccagcc cagcagaaga
ccaagcagaa gtaatgtggt ccacagccat gcccttgagg 360agccggccac cagatgctga
atcccctatc ccattctgcg tatgagtccc atttgccttg 420caattagcat tctgtctccc
ccaaaaaaga atgtgctatg aagctttctt tcctacacac 480tctgagtctc tgaatgaagc
tgaaggtctt agtaccagag ctagttttca gctgctcaga 540attcatctga agagagactt
aagatgaaag caaatgattc agctccctta tacccccatt 600aaattcactt tcaattccaa
aaaaaaaaaa aaaaaaaaaa a 6411091002DNAHomo sapiens
109tcaccagatc ccagaggctg aacacctcga ccttctctgc acagcagatg atccctgagc
60agctgaagac cagaaaagcc actaagactt tctgcttaat tcaggagctt agaggattct
120tcaaagagtg tgtccagcat cctttgaagc atgagttctt accagcagaa gcagaccttt
180accccaccac ctcagcttca acagcagcag gtgaaacaac ccagccagcc tccacctcag
240gaaatatttg ttcccacaac caaggagcca tgccactcaa aggttccaca acctggaaac
300acaaagattc cagagccagg ctgtaccaag gtccctgagc caggctgtac caaggtccct
360gagccaggct gtaccaaggt ccctgagcca ggttgtacca aggtccctga gccaggctgt
420accaaggtcc ctgagccagg ttgtaccaag gtccctgagc caggctacac caaggtccct
480gaaccaggca gcatcaaggt ccctgaccaa ggcttcatca agtttcctga gccaggtgcc
540atcaaagttc ctgagcaagg atacaccaaa gttcctgtgc caggctacac aaagctacca
600gagccatgtc cttcaacggt cactccaggc ccagctcagc agaagaccaa gcagaagtaa
660tttggtgcac agacaagccc ttgagaagcc aaccaccaga tgctggacac cctcttccca
720tctgtttctg tgtcttaatt gtctgtagac cttgtaatca gcacattgtc accccaagcc
780atagtctctc tcttatttgt atcctaaaaa tacgtactat aaagcttttg ttcacacaca
840ctctgaagaa tcctgtaagc ccctgaatta agcagaaagt cttcatggct tttctggtct
900tcggctgctc agggttcatc tgaagattcg aatgaaaaga aatgcatgtt tcctgctctt
960ccctcattaa attgctttta attccaaaaa aaaaaaaaaa aa
10021102593DNAHomo sapiens 110ataagccttc atacagcagt gaaggcggtt cctcccttcc
caggcagaga ctgataaact 60cagcacttgc cggagtggct cattgttaag acaaagggtg
tgcacttcct ggccaggaaa 120cctgagcggt gagactccca gctgcctaca tcaaggcccc
aggacatgca gaaccttcct 180ctagaacccg acccaccacc atgaggtcct gcctgtggag
atgcaggcac ctgagccaag 240gcgtccagtg gtccttgctt ctggctgtcc tggtcttctt
tctcttcgcc ttgccctctt 300ttattaagga gcctcaaaca aagccttcca ggcatcaacg
cacagagaac attaaagaaa 360ggtctctaca gtccctggca aagcctaagt cccaggcacc
cacaagggca aggaggacaa 420ccatctatgc agagccagtg ccagagaaca atgccctcaa
cacacaaacc cagcccaagg 480cccacaccac cggagacaga ggaaaggagg ccaaccaggc
accgccggag gagcaggaca 540aggtgcccca cacagcacag agggcagcat ggaagagccc
agaaaaagag aaaaccatgg 600tgaacacact gtcacccaga gggcaagatg cagggatggc
ctctggcagg acagaggcac 660aatcatggaa gagccaggac acaaagacga cccaaggaaa
tgggggccag accaggaagc 720tgacggcctc caggacggtg tcagagaagc accagggcaa
agcggcaacc acagccaaga 780cgctcattcc caaaagtcag cacagaatgc tggctcccac
aggagcagtg tcaacaagga 840cgagacagaa aggagtgacc acagcagtca tcccacctaa
ggagaagaaa cctcaggcca 900ccccaccccc tgcccctttc cagagcccca cgacgcagag
aaaccaaaga ctgaaggccg 960ccaacttcaa atctgagcct cggtgggatt ttgaggaaaa
atacagcttc gaaataggag 1020gccttcagac gacttgccct gactctgtga agatcaaagc
ctccaagtcg ctgtggctcc 1080agaaactctt tctgcccaac ctcactctct tcctggactc
cagacacttc aaccagagtg 1140agtgggaccg cctggaacac tttgcaccac cctttggctt
catggagctc aactactcct 1200tggtgcagaa ggtcgtgaca cgcttccctc cagtgcccca
gcagcagctg ctcctggcca 1260gcctccccgc tgggagcctc cggtgcatca cctgtgccgt
ggtgggcaac gggggcatcc 1320tgaacaactc ccacatgggc caggagatag acagtcacga
ctacgtgttc cgattgagcg 1380gagctctcat taaaggctac gaacaggatg tggggactcg
gacatccttc tacggcttta 1440ccgccttctc cctgacccag tcactcctta tattgggcaa
tcggggtttc aagaacgtgc 1500ctcttgggaa ggacgtccgc tacttgcact tcctggaagg
cacccgggac tatgagtggc 1560tggaagcact gcttatgaat cagacggtga tgtcaaaaaa
ccttttctgg ttcaggcaca 1620gaccccagga agcttttcgg gaagccctgc acatggacag
gtacctgttg ctgcacccag 1680actttctccg atacatgaag aacaggtttc tgaggtctaa
gaccctggat ggtgcccact 1740ggaggatata ccgccccacc actggggccc tcctgctgct
cactgccctt cagctctgtg 1800accaggtgag tgcttatggc ttcatcactg agggccatga
gcgcttttct gatcactact 1860atgatacatc atggaagcgg ctgatctttt acataaacca
tgacttcaag ctggagagag 1920aagtctggaa gcggctacac gatgaaggga taatccggct
gtaccagcgt cctggtcccg 1980gaactgccaa agccaagaac tgaccggggc cagggctgcc
atggtctcct tgcctgctcc 2040aaggcacagg atacagtggg aatcttgaga ctctttggcc
atttcccatg gctcagacta 2100agctccaagc ccttcaggag ttccaaggga acacttgaac
catggacaag actctctcaa 2160gatggcaaat ggctaattga ggttctgaag ttcttcagta
cattgctgta ggtcctgagg 2220ccagggattt ttaattaaat ggggtgatgg gtggccaata
ccacaattcc tgctgaaaaa 2280cactcttcca gtccaaaagc ttcttgatac agaaaaaaga
gcctggattt acagaaacat 2340atagatctgg tttgaattcc agatcgagtt tacagttgtg
aaatcttgaa ggtattactt 2400aacttcacta cagattgtct agaagacctt tctaggagtt
atctgattct agaagggtct 2460atacttgtcc ttgtctttaa gctatttgac aactctacgt
gttgtagaaa actgataata 2520atacaaatga ttgttgtcca tggaaaggca aataaatttt
ctacagtgaa gatgcaaaaa 2580aaaaaaaaaa aaa
25931115716DNAHomo sapiens 111gtgatgataa taatacgcgg
gcttatataa ccgtcttcat cttgcgagca cttcgcagac 60cgtcgctaat gaatcttggg
gccggtgtcg ggccggggcg gcttgatcgg caactaggaa 120accccaggcg cagaggccag
gagcgagggc agcgaggatc agaggccagg ccttcccggc 180tgccggcgct cctcggaggt
cagggcagat gaggaacatg actctccccc ttcggaggag 240gaaggaagtc ccgctgccac
cttatctctg ctcctctgcc tcctccctgt tcccagagct 300ttttctctag agaagatttt
gaaggcggct tttggattct tcacttctct tgaacaagga 360actcactcag agactaacac
aaaggaagta atttcttacc tggtcattat ttagtctaca 420ataagttcat ccttcttcag
tgtgaccagt aaattcttcc catactcttg aagagagcat 480aattggaatg gagaggtgct
gacggccacc caccatcatc taaagaagat aaacttggca 540aatgacatgc aggttcttca
aggcagaata attgcagaaa atcttcaaag gaccctatct 600gcagatgttc tgaatacctc
tgagaataga gattgattat tcaaccagga tacctaattc 660aagaactcca gaaatcagga
gacggagaca ttttgtcagt tttgcaacat tggaccaaat 720acaatgaagt attcttgctg
tgctctggtt ttggctgtcc tgggcacaga attgctggga 780agcctctgtt cgactgtcag
atccccgagg ttcagaggac ggatacagca ggaacgaaaa 840aacatccgac ccaacattat
tcttgtgctt accgatgatc aagatgtgga gctggggtcc 900ctgcaagtca tgaacaaaac
gagaaagatt atggaacatg ggggggccac cttcatcaat 960gcctttgtga ctacacccat
gtgctgcccg tcacggtcct ccatgctcac cgggaagtat 1020gtgcacaatc acaatgtcta
caccaacaac gagaactgct cttccccctc gtggcaggcc 1080atgcatgagc ctcggacttt
tgctgtatat cttaacaaca ctggctacag aacagccttt 1140tttggaaaat acctcaatga
atataatggc agctacatcc cccctgggtg gcgagaatgg 1200cttggattaa tcaagaattc
tcgcttctat aattacactg tttgtcgcaa tggcatcaaa 1260gaaaagcatg gatttgatta
tgcaaaggac tacttcacag acttaatcac taacgagagc 1320attaattact tcaaaatgtc
taagagaatg tatccccata ggcccgttat gatggtgatc 1380agccacgctg cgccccacgg
ccccgaggac tcagccccac agttttctaa actgtacccc 1440aatgcttccc aacacataac
tcctagttat aactatgcac caaatatgga taaacactgg 1500attatgcagt acacaggacc
aatgctgccc atccacatgg aatttacaaa cattctacag 1560cgcaaaaggc tccagacttt
gatgtcagtg gatgattctg tggagaggct gtataacatg 1620ctcgtggaga cgggggagct
ggagaatact tacatcattt acaccgccga ccatggttac 1680catattgggc agtttggact
ggtcaagggg aaatccatgc catatgactt tgatattcgt 1740gtgccttttt ttattcgtgg
tccaagtgta gaaccaggat caatagtccc acagatcgtt 1800ctcaacattg acttggcccc
cacgatcctg gatattgctg ggctcgacac acctcctgat 1860gtggacggca agtctgtcct
caaacttctg gacccagaaa agccaggtaa caggtttcga 1920acaaacaaga aggccaaaat
ttggcgtgat acattcctag tggaaagagg caaatttcta 1980cgtaagaagg aagaatccag
caagaatatc caacagtcaa atcacttgcc caaatatgaa 2040cgggtcaaag aactatgcca
gcaggccagg taccagacag cctgtgaaca accggggcag 2100aagtggcaat gcattgagga
tacatctggc aagcttcgaa ttcacaagtg taaaggaccc 2160agtgacctgc tcacagtccg
gcagagcacg cggaacctct acgctcgcgg cttccatgac 2220aaagacaaag agtgcagttg
tagggagtct ggttaccgtg ccagcagaag ccaaagaaag 2280agtcaacggc aattcttgag
aaaccagggg actccaaagt acaagcccag atttgtccat 2340actcggcaga cacgttcctt
gtccgtcgaa tttgaaggtg aaatatatga cataaatctg 2400gaagaagaag aagaattgca
agtgttgcaa ccaagaaaca ttgctaagcg tcatgatgaa 2460ggccacaagg ggccaagaga
tctccaggct tccagtggtg gcaacagggg caggatgctg 2520gcagatagca gcaacgccgt
gggcccacct accactgtcc gagtgacaca caagtgtttt 2580attcttccca atgactctat
ccattgtgag agagaactgt accaatcggc cagagcgtgg 2640aaggaccata aggcatacat
tgacaaagag attgaagctc tgcaagataa aattaagaat 2700ttaagagaag tgagaggaca
tctgaagaga aggaagcctg aggaatgtag ctgcagtaaa 2760caaagctatt acaataaaga
gaaaggtgta aaaaagcaag agaaattaaa gagccatctt 2820cacccattca aggaggctgc
tcaggaagta gatagcaaac tgcaactttt caaggagaac 2880aaccgtagga ggaagaagga
gaggaaggag aagagacggc agaggaaggg ggaagagtgc 2940agcctgcctg gcctcacttg
cttcacgcat gacaacaacc actggcagac agccccgttc 3000tggaacctgg gatctttctg
tgcttgcacg agttctaaca ataacaccta ctggtgtttg 3060cgtacagtta atgagacgca
taattttctt ttctgtgagt ttgctactgg ctttttggag 3120tattttgata tgaatacaga
tccttatcag ctcacaaata cagtgcacac ggtagaacga 3180ggcattttga atcagctaca
cgtacaacta atggagctca gaagctgtca aggatataag 3240cagtgcaacc caagacctaa
gaatcttgat gttggaaata aagatggagg aagctatgac 3300ctacacagag gacagttatg
ggatggatgg gaaggttaat cagccccgtc tcactgcaga 3360catcaactgg caaggcctag
aggagctaca cagtgtgaat gaaaacatct atgagtacag 3420acaaaactac agacttagtc
tggtggactg gactaattac ttgaaggatt tagatagagt 3480atttgcactg ctgaagagtc
actatgagca aaataaaaca aataagactc aaactgctca 3540aagtgacggg ttcttggttg
tctctgctga gcacgctgtg tcaatggaga tggcctctgc 3600tgactcagat gaagacccaa
ggcataaggt tgggaaaaca cctcatttga ccttgccagc 3660tgaccttcaa accctgcatt
tgaaccgacc aacattaagt ccagagagta aacttgaatg 3720gaataacgac attccagaag
ttaatcattt gaattctgaa cactggagaa aaaccgaaaa 3780atggacgggg catgaagaga
ctaatcatct ggaaaccgat ttcagtggcg atggcatgac 3840agagctagag ctcgggccca
gccccaggct gcagcccatt cgcaggcacc cgaaagaact 3900tccccagtat ggtggtcctg
gaaaggacat ttttgaagat caactatatc ttcctgtgca 3960ttccgatgga atttcagttc
atcagatgtt caccatggcc accgcagaac accgaagtaa 4020ttccagcata gcggggaaga
tgttgaccaa ggtggagaag aatcacgaaa aggagaagtc 4080acagcaccta gaaggcagcg
cctcctcttc actctcctct gattagatga aactgttacc 4140ttaccctaaa cacagtattt
ctttttaact tttttatttg taaactaata aaggtaatca 4200cagccaccaa cattccaagc
taccctgggt acctttgtgc agtagaagct agtgagcatg 4260tgagcaagcg gtgtgcacac
ggagactcat cgttataatt tactatctgc caagagtaga 4320aagaaaggct ggggatattt
gggttggctt ggttttgatt ttttgcttgt ttgtttgttt 4380tgtactaaaa cagtattatc
ttttgaatat cgtagggaca taagtatata catgttatcc 4440aatcaagatg gctagaatgg
tgcctttctg agtgtctaaa acttgacacc cctggtaaat 4500ctttcaacac acttccactg
cctgcgtaat gaagttttga ttcattttta accactggaa 4560tttttcaatg ccgtcatttt
cagttagatg attttgcact ttgagattaa aatgccatgt 4620ctatttgatt agtcttattt
ttttattttt acaggcttat cagtctcact gttggctgtc 4680attgtgacaa agtcaaataa
acccccaagg acgacacaca gtatggatca catattgttt 4740gacattaagc ttttgccaga
aaatgttgca tgtgttttac ctcgacttgc taaaatcgat 4800tagcagaaag gcatggctaa
taatgttggt ggtgaaaata aataaataag taaacaaaat 4860gaagattgcc tgctctctct
gtgcctagcc tcaaagcgtt catcatacat cataccttta 4920agattgctat attttgggtt
attttcttga caggagaaaa agatctaaag atcttttatt 4980ttcatctttt ttggttttct
tggcatgact aagaagctta aatgttgata aaatatgact 5040agttttgaat ttacaccaag
aacttctcaa taaaagaaaa tcatgaatgc tccacaattt 5100caacatacca caagagaagt
taatttctta acattgtgtt ctatgattat ttgtaagacc 5160ttcaccaagt tctgatatct
tttaaagaca tagttcaaaa ttgcttttga aaatctgtat 5220tcttgaaaat atccttgttg
tgtattaggt ttttaaatac cagctaaagg attacctcac 5280tgagtcatca gtaccctcct
attcagctcc ccaagatgat gtgtttttgc ttaccctaag 5340agaggttttc ttcttatttt
tagataattc aagtgcttag ataaattatg ttttctttaa 5400gtgtttatgg taaactcttt
taaagaaaat ttaatatgtt atagctgaat ctttttggta 5460actttaaatc tttatcatag
actctgtaca tatgttcaaa ttagctgctt gcctgatgtg 5520tgtatcatcg gtgggatgac
agaacaaaca tatttatgat catgaataat gtgctttgta 5580aaaagatttc aagttattag
gaagcatact ctgtttttta atcatgtata atattccatg 5640atacttttat agaacaattc
tggcttcagg aaagtctaga agcaatattt cttcaaataa 5700aaggtgttta aacttt
57161127355DNAHomo sapiens
112agtctgcggg cctccggggc agcggcgagg ccggagcgtc gcggcggaga ggacgagacc
60gggacaagac cagggcagga gggagccggc cagccgcgag aaccccgcac gcccggcaag
120atgctgtcct ggcggctgca gacgggcccc gagaaggccg agctccagga gctcaacgcc
180cggctctatg actacgtgtg tcgggtgcgg gagctggagc gcgaaaacct actcctggag
240gaggagctgc gcggccggcg cgggcgagag ggcctgtggg ccgaggggca ggcccgctgc
300gccgaggagg cgcgcagctt gcggcagcag ctggacgagc tgagctgggc cactgcgctg
360gcggagggcg agcgggacgc tctgcggcgc gagctgcggg agctgcagcg cctggatgcg
420gaggagcgcg ccgcccgcgg ccgcctggac gccgagctgg gtgcgcagca gcgcgagctg
480caggaggcgc tgggcgcgcg cgccgccctc gaggcgctgc tgggccggct gcaggccgag
540cgccgaggcc tcgacgcggc ccacgaacgc gacgtgaggg agctgcgcgc gcgcgccgcc
600agccttacca tgcatttccg cgcccgcgcc accggccccg ccgcgccgcc gccacgcctg
660cgggaggtgc acgacagcta cgcactgctg gtggccgagt cgtggcggga gacggtgcag
720ctgtacgagg acgaggtgcg cgagctggag gaggcgctgc ggcgcggcca ggagagcaga
780ctccaggcgg aggaagagac gcggctgtgc gcgcaggagg cagaggcgct gcggcgcgag
840gcgctcgggt tggagcagct gcgcgcgcgg ctggaggacg cgctgctgcg gatgcgcgag
900gagtacggga tacaggccga ggagcggcag agagtgattg actgcctgga ggatgagaag
960gcaaccctca ccttggccat ggctgactgg ctgcgggact atcaggacct cctgcaggtg
1020aagaccggcc tcagtctgga ggtggcgacc taccgggcct tattggaagg agaaagtaat
1080ccagagatag tgatctgggc tgagcacgtt gaaaacatgc cgtcagaatt cagaaacaaa
1140tcctatcact ataccgactc actactacag agggaaaatg aaaggaatct attttcaagg
1200cagaaagcac ctttggcaag tttcaatcac agctcggcac tgtattctaa cctgtcaggg
1260caccgtggat ctcagacggg cacatctatt ggaggtgatg ccagaagagg cttcttgggc
1320tcgggatatt cttcctcggc cactacccag caggaaaact catacggaaa agccgtcagc
1380agtcaaacca acgtcagaac tttctctcca acctatggcc ttttaagaaa tactgaggct
1440caagtgaaaa cattccctga cagaccaaaa gccggagata caagggaggt ccccgtttac
1500ataggtgaag attccacaat tgcccgcgag tcgtaccggg atcgccgaga caaggtggca
1560gcaggtgctt cggaaagcac acggtcaaat gagaggaccg tcattctggg aaagaaaaca
1620gaagtgaaag ccacgaggga gcaagaaaga aacagaccag aaaccatccg aacaaagcca
1680gaagagaaaa tgttcgattc taaagagaag gcttccgagg agagaaacct aagatgggaa
1740gaattgacaa agttagataa ggaagcgaga cagagagaaa gccagcagat gaaggagaag
1800gctaaggaga aggactcacc gaaggagaag agcgtgcgag agagagaggt gccgattagt
1860ctagaagtat cccaggacag aagagcagag gtgtccccga aaggtttgca gacgcctgtg
1920aaggatgctg gtggtgggac cggtagagag gcagaagcaa gagagctacg gttcaggttg
1980ggcaccagtg atgccactgg ttctctgcaa ggcgattcca tgacagaaac cgtagcagaa
2040aacatcgtta ccagtatcct gaagcagttc actcagtctc cagagacaga agcatctgct
2100gattcttttc cagacacaaa agtcacttac gtggacagga aagagcttcc tggggaaagg
2160aaaacaaaga ctgaaatagt tgtggagtct aaactgactg aggatgttga tgtttccgat
2220gaagctggcc tggactacct tttaagcaag gatattaagg aagtggggct gaaaggcaag
2280tcagccgagc agatgatagg agacatcatc aacctcggcc tgaaagggag ggaggggaga
2340gcaaaggtcg tcaacgtgga gatcgtggag gagcccgtga gttatgtcag cggggagaag
2400ccggaggagt tttccgtccc attcaaagtg gaggaggtcg aagatgtgtc gccaggcccc
2460tgggggttgg ttaaggagga ggaaggttat ggagaaagcg atgtcacatt ctcagttaat
2520cagcatcgaa ggaccaagca gcctcaggag aacacgactc acgtggaaga agtgacagag
2580gcaggtgatt cagagggcga gcagagttat tttgtgtcca ctccagatga acaccccggg
2640gggcacgaca gagatgacgg ctcggtgtac gggcagatcc acatcgagga ggaatccacc
2700atcaggtact cttggcagga tgaaatcgtg caggggactc gaaggaggac acagaaggac
2760ggtgcagtgg gcgagaaggt tgtgaagccc ttggatgtcc cagcgccctc tctggagggg
2820gacctgggtt ccactcactg gaaagaacaa gctagaagcg gtgaatttca tgccgaaccc
2880acagtcattg aaaaagaaat taaaataccc cacgaattcc acacctccat gaagggcatc
2940tcctccaagg agccccggca gcagctggtg gaggtcatcg ggcagctgga ggaaaccctt
3000cccgagcgca tgagggagga gctgtccgcc ctcaccagag aggggcaggg tgggccgggg
3060agcgtttccg tggatgtcaa gaaggtccag ggtgctggtg gcagttccgt gaccctggtt
3120gctgaagtca acgtctcaca aactgtggat gccgatcggt tagacctgga ggagctgagc
3180aaagatgagg ccagtgagat ggagaaggct gtggagtcgg tggttcggga gagcctgagc
3240aggcaacgca gcccagcgcc tggcagccca gatgaggaag gtggagcgga ggccccggct
3300gctggcattc gctttaggcg ttgggccacc cgggagctgt acatcccttc aggcgagagc
3360gaggttgctg gtggggcctc tcacagctcg ggacagcgca ctccccaggg cccagtgtcg
3420gccactgtgg aggtcagcag ccccacaggc tttgcccagt cacaggtgct ggaggatgtg
3480agccaggctg caaggcacat aaaactcggc ccctctgaag tctggaggac tgagcgaatg
3540tcatatgaag gacccactgc agaagtggtg gaggtaagtg cgggaggtga cctaagtcag
3600gcagcgagcc cgaccggagc cagccggtct gtgaggcatg tcacgctggg tcccggtcaa
3660agtccactgt ccagagaagt catcttccta ggccctgccc ctgcctgtcc agaggcatgg
3720ggctcgccag aacctggccc agcagagtct tctgcagata tggacggatc agggaggcac
3780agcacatttg gctgcagaca atttcatgct gaaaaggaga ttatttttca gggccccatt
3840tctgctgcag ggaaggttgg tgattatttt gcaacagaag agtcagtggg tacccagact
3900tctgtcaggc aactccagtt aggccctaaa gaagggttca gtgggcaaat ccagttcaca
3960gctccacttt cagacaaggt ggagttgggt gtcataggag attctgtaca catggaaggg
4020ttgccaggga gcagcacatc catcaggcac atcagcattg ggcctcagag gcatcagacc
4080acccagcaga tagtttacca tgggctggtt ccccaactgg gggaatctgg tgactcagag
4140agcactgtgc acggagaggg ctcagcagat gtgcaccagg ccactcacag tcatacctcg
4200ggtagacaaa ccgttatgac tgaaaagagc accttccaaa gtgtcgtttc tgaatctccc
4260caggaggata gtgcagagga cacatcaggg gcagaaatga catcgggtgt tagcagatcc
4320tttaggcaca ttcgactagg tcctacagaa acggaaacct ctgaacacat tgccatccgt
4380ggacccgtgt ccagaacatt tgtgcttgct ggttcagcgg actcccctga gctaggcaag
4440ttagcagaca gcagcagaac gctaaggcac attgcaccag ggcccaaaga aacttcgttt
4500acctttcaga tggatgtgag taacgtagag gcgatccgca gccggacaca ggaagcggga
4560gctctcggtg tgtctgaccg tggttcctgg agagacgcgg acagtaggaa tgaccaggca
4620gttggtgtga gctttaaggc ctctgctggg gaaggagacc aggcccacag agaacagggc
4680aaggagcagg ccatgtttga taagaaggtg cagctccaga gaatggtaga ccaaaggtcg
4740gtgatttcag atgaaaagaa agttgccctc ctctatctag acaatgagga ggaggagaat
4800gatgggcatt ggttttaata agcagaaaca ttttgtttta atggcagcct gttggcgacg
4860tgccaacatc caaaggcctt aacttatttt aagaggccga gggagtctat gaaaatctcc
4920ccttttttac ttttttaaag agtactcccg gcatggtcaa tttcctttat agttaatccg
4980taaaggtttc cagttaattc atgccttaaa aggcactgca attttatttt tgagttggga
5040cttttacaaa acactttttt ccctggagtc ttctctccac ttctggagat gaatttctat
5100gttttgcacc tggtcacaga catggcttgc atctgtttga aactacaatt aattatagat
5160gtcaaaacat taaccagatt aaagtaatat atttaagagt aaattttgct tgcatgtgct
5220aatatgaaat aacagactaa cattttaggg gaaaaataaa tacaatttag actctaaaaa
5280gtcttttcaa aaagaaatgg gaaataggca gactgtttat gttaaaaaaa ttcttgctaa
5340atgatttcat ctttaggaaa aaattacttg ccatatagag ctaaattcat cttaagactt
5400gaatgaattg ctttctatgt acagaacttt aaacaatata gtatttatgg cgaggacagc
5460tgtagtctgt tgtgatattt cacattctat ttgcacaggt tccctggcac tggtagggta
5520gatgattatt gggaatcgct tacagtacca tttcattttt tggcactagg tcattaagta
5580gcacacagtc tgaatgccct tttctggagt ggccagttcc tatcagactg tgcagacttg
5640cgcttctctg caccttatcc cttagcaccc aaacatttaa tttcactggt gggaggtaga
5700ccttgaagac aatgaagaga atgccgatac tcagactgca gctggaccgg caagctggct
5760gtgtacagga aaattggaag cacacagtgg actgtgcctc ttaaagatgc ctttcccaac
5820cctccattca tgggatgcag gtctttctga gctcaagggt gaaagatgaa tacaataaca
5880accatgaacc cacctcacgg aagctttttt tgcactttga acagaagtca ttgcagttgg
5940ggtgttttgt ccagggaaac agtttattaa atagaaggat gttttgggga aggaactgga
6000tatctctcct gcagcccagc accgagatac ccaggacggg cctggggggc gagaaaggcc
6060cccatgctca tgggccgcgg agtgtggacc tgtagatagg caccaccgag tttaagatac
6120tgggatgagc atgcttcatt ggattcattt tattttacac gtcagtattg ttttaaagtt
6180tctgtctgta aagtgtagca tcatatataa aaagagtttc gctagcagcg catttttttt
6240agttcaggct agcttctttc acataatgct gtctcagctg tatttccagt aacacagcat
6300catcgcactg actgtggcgc actggggaat aacagtctga gctagcacca ccctcagcca
6360ggctacaacg acagcactgg agggtcttcc ctctcagatt cacctggagg ccctcagacc
6420cccagggtgc acgtctcccc aggtcctggg agtggctacc gcaggtagtt tctggagagc
6480acgttttctt cattgataag tggaggagaa atgcagcaca gctttcaaga tactatttta
6540aaaacaccat gaatcagata gggaaagaaa gttgattgga atagcaagtt taaacctttg
6600ttgtccatct gccaaatgaa ctagtgattg tcagactggt atggaggtga ctgctttgta
6660aggttttgtc gtttctaata cagacagaga tgtgctgatt ttgttttagc tgtaacaggt
6720aatggttttt ggatagatga ttgactggtg agaatttggt caaggtgaca gcctcctgtc
6780tgatgacagg acagactggt ggtgaggagt ctaagtgggc tcagtttgat gtcagtgtct
6840gggctcatga cttgtaaatg gaagctgatg tgaacaggta attaatatta tgacccactt
6900ctatttactt tgggaaatat cttggatctt aattatcatc tgcaagtttc aagaagtatt
6960ctgccaaaag tatttacaag tatggactca tgagctattg ttggttgcta aatgtgaatc
7020acgcgggagt gagtgtgccc ttcacactgt gacattgtga cattgtgaca agctccatgt
7080cctttaaaat cagtcactct gcacacaaga gaaatcaact tcgtggttgg atggggccgg
7140aacacaacca gtctttttgt atttattgtt actgagacaa aacagtactc actgagtgtt
7200tttcagtttc ctactggtgg ttttgatatt gtttgtttaa gatgtatatt tagaatgaca
7260tcatctaaga agctgatttt gctaaactcc tgttccctac aatgggaaat gtcacaagaa
7320tgtgcaaaaa taaaaatctg aggaaaaaac ccaca
73551132356DNAHomo sapiens 113atcataccct ttacagaaga atagagaggc tgttgcagcc
ccgtgctttc tcctgctgct 60ggccgattgc ttgctctgaa ctaaccctct aacccttggg
agtctgtgtg cagcagtgat 120gtgagctgca tccgggctga aatgggagct cccgctatgg
cactggctca aggagctaat 180ctaaagagaa aatcaagcaa aagtagataa tcagccagaa
gaattagtgc gtagtgctga 240agatgatgag aaaccagatc agaagccagt tacaaatgaa
tgcgtaccaa gaatttccac 300agtgcctaca caacctgata atccattttc tcaccctgac
aaactcaaaa ggatgagcaa 360gtctgttcca gcatttctcc aagatgaggt gagtggcagt
gtgatgagtg tttatagtgg 420agactttggc aatctggaag ttaaaggaaa tattcagttt
gcaattgaat atgtggagtc 480actgaaggag ttgcatgttt ttgtggccca gtgtaaggac
ttagcagcag cggatgtaaa 540aaaacagcgt tcagacccat atgtaaaggc ctatttgcta
ccagacaaag gcaaaatggg 600caagaagaaa acactcgtag tgaagaaaac cttgaatcct
gtgtataacg aaatactgcg 660gtataaaatt gaaaaacaaa tcttaaagac acagaaattg
aacctgtcca tttggcatcg 720ggatacattt aagcgcaata gtttcctagg ggaggtggaa
cttgatttgg aaacatggga 780ctgggataac aaacagaata aacaattgag atggtaccct
ctgaagcgga agacagcacc 840agttgccctt gaagcagaaa acagaggtga aatgaaacta
gctctccagt atgtcccaga 900gccagtccct ggtaaaaagc ttcctacaac tggagaagtg
cacatctggg tgaaggaatg 960ccttgatcta ccactgctaa ggggaagtca tctaaattct
tttgttaaat gtaccatcct 1020tccagataca agtaggaaaa gtcgccagaa gacaagagct
gtagggaaaa ccaccaaccc 1080tatcttcaac cacactatgg tgtatgatgg gttcaggcct
gaagatctga tggaagcctg 1140tgtagagctt actgtctggg accattacaa attaaccaac
caatttttgg gaggtcttcg 1200tattggcttt ggaacaggta aaagttatgg gactgaagtg
gactggatgg actctacttc 1260agaggaagtt gctctctggg agaagatggt aaactccccc
aatacttgga ttgaagcaac 1320actgcctctc agaatgcttt tgattgccaa gatttccaaa
tgagcccaaa ttccactggc 1380tcctccactg aaaactacta aaccggtgga atctgatctt
gaaaatctga gtaggtggac 1440aaatatcctc actttctatc tattgcacct aaggaatact
acacagcatg taaaagtcaa 1500tctgcatgtg cttctttgat tacaaggccc aagggattta
aatataacaa aatgtgtaat 1560ttgtgactct aatattaaat aagatatttg aacaagctag
gaaaattgaa tttctgctgc 1620tgcttcaaag aaaaagctgc cccagagcat taaacatggg
gtattgttaa gaagcaaaat 1680gttcttgttt gccatcatgt gtttcacacc acaattctgt
gccacagtta agagggtctg 1740gtacccttgc aggacctttg taggttgtgg gaaaaagtcg
cagaaagata ctcaaagtgg 1800agcagggaat ggagacagac atcagtgatg ataaaaaaaa
aaatggacct taagaaacta 1860tttactctgt aatctctaat aaaatatgga attccatatt
agggcaatga gactgaaact 1920actggtgttt ttctgccttg agaaaacaaa cagttaaaac
aagcctcaaa tgtattttag 1980tgccacccac tggccatagg tacaattcag ttgttggctt
gttttgactt aattctaaaa 2040taggtctcaa gcctgtattt ttatgagttt atttttttaa
aaccctgcat atatatgatt 2100gtttttctta taactttact atatgaaagc agcataagag
tagtcacaaa catgttttgc 2160aacaaagttt taattagaat gtaagttgct cagttatact
gttcttctta tgtatgtaaa 2220attttcgtat tttgtaaaaa cccttagaat aaattatcat
ttgatttaaa ttgtattaga 2280aaattagcgt gacttctcat tttaaataaa atattttagg
aattctaaac atctaaaaaa 2340aaaaaaaaaa aaaaaa
2356114508DNAHomo sapiens 114atccctgact cggggtcgcc
tttggagcag agaggaggca atggccacca tggagaacaa 60ggtgatctgc gccctggtcc
tggtgtccat gctggccctc ggcaccctgg ccgaggccca 120gacagagacg tgtacagtgg
ccccccgtga aagacagaat tgtggttttc ctggtgtcac 180gccctcccag tgtgcaaata
agggctgctg tttcgacgac accgttcgtg gggtcccctg 240gtgcttctat cctaatacca
tcgacgtccc tccagaagag gagtgtgaat tttagacact 300tctgcaggga tctgcctgca
tcctgacgcg gtgccgtccc cagcacggtg attagtccca 360gagctcggct gccacctcca
ccggacacct cagacacgct tctgcagctg tgcctcggct 420cacaacacag attgactgct
ctgactttga ctactcaaaa ttggcctaaa aattaaaaga 480gatcgatatt aaaaaaaaaa
aaaaaaaa 508115717DNAHomo sapiens
115cacggtggaa gggctggggc cacggggcag agaagaaagg ttatctctgc ttgttggaca
60aacagagggg agattataaa acatacccgg cagtggacac catgcattct gcaagccacc
120ctggggtgca gctgagctag acatgggacg gcgagacgcc cagctcctgg cagcgctcct
180cgtcctgggg ctatgtgccc tggcggggag tgagaaaccc tccccctgcc agtgctccag
240gctgagcccc cataacagga cgaactgcgg cttccctgga atcaccagtg accagtgttt
300tgacaatgga tgctgtttcg actccagtgt cactggggtc ccctggtgtt tccaccccct
360cccaaagcaa gagtcggatc agtgcgtcat ggaggtctca gaccgaagaa actgtggcta
420cccgggcatc agccccgagg aatgcgcctc tcggaagtgc tgcttctcca acttcatctt
480tgaagtgccc tggtgcttct tcccgaagtc tgtggaagac tgccattact aagagaggct
540ggttccagag gatgcatctg gctcaccggg tgttccgaaa ccaaagaaga aacttcgcct
600tatcagcttc atacttcatg aaatcctggg ttttcttaac catcttttcc tcattttcaa
660tggtttaaca tataatttct ttaaataaaa cccttaaaat ctgctaaaaa aaaaaaa
7171161054DNAHomo sapiens 116gccaaaacag tgggggctga actgacctct cccctttggg
agagaaaaac tgtctgggag 60cttgacaaag gcatgcagga gagaacagga gcagccacag
ccaggaggga gagccttccc 120caagcaaaca atccagagca gctgtgcaaa caacggtgca
taaatgaggc ctcctggacc 180atgaagcgag tcctgagctg cgtcccggag cccacggtgg
tcatggctgc cagagcgctc 240tgcatgctgg ggctggtcct ggccttgctg tcctccagct
ctgctgagga gtacgtgggc 300ctgtctgcaa accagtgtgc cgtgccagcc aaggacaggg
tggactgcgg ctacccccat 360gtcaccccca aggagtgcaa caaccggggc tgctgctttg
actccaggat ccctggagtg 420ccttggtgtt tcaagcccct gcaggaagca gaatgcacct
tctgaggcac ctccagctgc 480ccccggccgg gggatgcgag gctcggagca cccttgcccg
gctgtgattg ctgccaggca 540ctgttcatct cagcttttct gtccctttgc tcccggcaag
cgcttctgct gaaagttcat 600atctggagcc tgatgtctta acgaataaag gtcccatgct
ccacccgagg acagttcttc 660gtgcctgaga ctttctgagg ttgtgcttta tttctgctgc
gtcgtgggag agggcgggag 720ggtgtcaggg gagagtctgc ccaggcctca agggcaggaa
aagactccct aaggagctgc 780agtgcatgca aggatatttt gaatccagac tggcacccac
gtcacaggaa agcctaggaa 840cactgtaagt gccgcttcct cgggaaagca gaaaaaatac
atttcaggta gaagttttca 900aaaatcacaa gtctttcttg gtgaagacag caagccaata
aaactgtctt ccaaagtggt 960cctttatttc acaaccactc tcgctactgt tcaatacttg
tactattcct gggttttgtt 1020tctttgtaca gtaaacatta tgaacaaaca ggca
10541175898DNAHomo sapiens 117aaaagtgagt ccctgccttc
cctctctccg tctggctcct cccaggcctg tctggcaggg 60gccggggtgc aggaggagga
gacggcatcc agtacagagg ggctggactt ggacccctgc 120agcagccctg cacaggagaa
gcggcatata aagccgcgct gcccgggagc cgctcggcca 180cgtccaccgg agcatcctgc
actgcagggc cggtctctcg ctccagcaga gcctgcgcct 240ttctgactcg gtccggaaca
ctgaaaccag tcatcactgc atctttttgg caaaccagga 300gctcagctgc aggaggcagg
atggtctgga ggctggtcct gctggctctg tgggtgtggc 360ccagcacgca agctggtcac
caggacaaag acacgacctt cgaccttttc agtatcagca 420acatcaaccg caagaccatt
ggcgccaagc agttccgcgg gcccgacccc ggcgtgccgg 480cttaccgctt cgtgcgcttt
gactacatcc caccggtgaa cgcagatgac ctcagcaaga 540tcaccaagat catgcggcag
aaggagggct tcttcctcac ggcccagctc aagcaggacg 600gcaagtccag gggcacgctg
ttggctctgg agggccccgg tctctcccag aggcagttcg 660agatcgtctc caacggcccc
gcggacacgc tggatctcac ctactggatt gacggcaccc 720ggcatgtggt ctccctggag
gacgtcggcc tggctgactc gcagtggaag aacgtcaccg 780tgcaggtggc tggcgagacc
tacagcttgc acgtgggctg cgacctcata gacagcttcg 840ctctggacga gcccttctac
gagcacctgc aggcggaaaa gagccggatg tacgtggcca 900aaggctctgc cagagagagt
cacttcaggg gtttgcttca gaacgtccac ctagtgtttg 960aaaactctgt ggaagatatt
ctaagcaaga agggttgcca gcaaggccag ggagctgaga 1020tcaacgccat cagtgagaac
acagagacgc tgcgcctggg tccgcatgtc accaccgagt 1080acgtgggccc cagctcggag
aggaggcccg aggtgtgcga acgctcgtgc gaggagctgg 1140gaaacatggt ccaggagctc
tcggggctcc acgtcctcgt gaaccagctc agcgagaacc 1200tcaagagagt gtcgaatgat
aaccagtttc tctgggagct cattggtggc cctcctaaga 1260caaggaacat gtcagcttgc
tggcaggatg gccggttctt tgcggaaaat gaaacgtggg 1320tggtggacag ctgcaccacg
tgtacctgca agaaatttaa aaccatttgc caccaaatca 1380cctgcccgcc tgcaacctgc
gccagtccat cctttgtgga aggcgaatgc tgcccttcct 1440gcctccactc ggtggacggt
gaggagggct ggtctccgtg ggcagagtgg acccagtgct 1500ccgtgacgtg tggctctggg
acccagcaga gaggccggtc ctgtgacgtc accagcaaca 1560cctgcttggg gccctccatc
cagacacggg cttgcagtct gagcaagtgt gacacccgca 1620tccggcagga cggcggctgg
agccactggt caccttggtc ttcatgctct gtgacctgtg 1680gagttggcaa tatcacacgc
atccgtctct gcaactcccc agtgccccag atggggggca 1740agaattgcaa agggagtggc
cgggagacca aagcctgcca gggcgcccca tgcccaatcg 1800atggccgctg gagcccctgg
tccccgtggt cggcctgcac tgtcacctgt gccggtggga 1860tccgggagcg cacccgggtc
tgcaacagcc ctgagcctca gtacggaggg aaggcctgcg 1920tgggggatgt gcaggagcgt
cagatgtgca acaagaggag ctgccccgtg gatggctgtt 1980tatccaaccc ctgcttcccg
ggagcccagt gcagcagctt ccccgatggg tcctggtcat 2040gcggctcctg ccctgtgggc
ttcttgggca atggcaccca ctgtgaggac ctggacgagt 2100gtgccctggt ccccgacatc
tgcttctcca ccagcaaggt gcctcgctgt gtcaacactc 2160agcctggctt ccactgcctg
ccctgcccgc cccgatacag agggaaccag cccgtcgggg 2220tcggcctgga agcagccaag
acggaaaagc aagtgtgtga gcccgaaaac ccatgcaagg 2280acaagacaca caactgccac
aagcacgcgg agtgcatcta cctgggccac ttcagcgacc 2340ccatgtacaa gtgcgagtgc
cagacaggct acgcgggcga cgggctcatc tgcggggagg 2400actcggacct ggacggctgg
cccaacctca atctggtctg cgccaccaac gccacctacc 2460actgcatcaa ggataactgc
ccccatctgc caaattctgg gcaggaagac tttgacaagg 2520acgggattgg cgatgcctgt
gatgatgacg atgacaatga cggtgtgacc gatgagaagg 2580acaactgcca gctcctcttc
aatccccgcc aggctgacta tgacaaggat gaggttgggg 2640accgctgtga caactgccct
tacgtgcaca accctgccca gatcgacaca gacaacaatg 2700gagagggtga cgcctgctcc
gtggacattg atggggacga tgtcttcaat gaacgagaca 2760attgtcccta cgtctacaac
actgaccaga gggacacgga tggtgacggt gtgggggatc 2820actgtgacaa ctgccccctg
gtgcacaacc ctgaccagac cgacgtggac aatgaccttg 2880ttggggacca gtgtgacaac
aacgaggaca tagatgacga cggccaccag aacaaccagg 2940acaactgccc ctacatctcc
aacgccaacc aggctgacca tgacagagac ggccagggcg 3000acgcctgtga ccctgatgat
gacaacgatg gcgtccccga tgacagggac aactgccggc 3060ttgtgttcaa cccagaccag
gaggacttgg acggtgatgg acggggtgat atttgtaaag 3120atgattttga caatgacaac
atcccagata ttgatgatgt gtgtcctgaa aacaatgcca 3180tcagtgagac agacttcagg
aacttccaga tggtcccctt ggatcccaaa gggaccaccc 3240aaattgatcc caactgggtc
attcgccatc aaggcaagga gctggttcag acagccaact 3300cggaccccgg catcgctgta
ggttttgacg agtttgggtc tgtggacttc agtggcacat 3360tctacgtaaa cactgaccgg
gacgacgact atgccggctt cgtctttggt taccagtcaa 3420gcagccgctt ctatgtggtg
atgtggaagc aggtgacgca gacctactgg gaggaccagc 3480ccacgcgggc ctatggctac
tccggcgtgt ccctcaaggt ggtgaactcc accacgggga 3540cgggcgagca cctgaggaac
gcgctgtggc acacggggaa cacgccgggg caggtgcgaa 3600ccttatggca cgaccccagg
aacattggct ggaaggacta cacggcctat aggtggcacc 3660tgactcacag gcccaagact
ggctacatca gagtcttagt gcatgaagga aaacaggtca 3720tggcagactc aggacctatc
tatgaccaaa cctacgctgg cgggcggctg ggtctatttg 3780tcttctctca agaaatggtc
tatttctcag acctcaagta cgaatgcaga gatatttaaa 3840caagatttgc tgcatttccg
gcaatgccct gtgcatgcca tggtccctag acacctcagt 3900tcattgtggt ccttgtggct
tctctctcta gcagcacctc ctgtcccttg accttaactc 3960tgatggttct tcacctcctg
ccagcaaccc caaacccaag tgccttcaga ggataaatat 4020caatggaact cagagatgaa
catctaaccc actagaggaa accagtttgg tgatatatga 4080gactttatgt ggagtgaaaa
ttgggcatgc cattacattg ctttttcttg tttgtttaaa 4140aagaatgacg tttacatata
aaatgtaatt acttattgta tttatgtgta tatggagttg 4200aagggaatac tgtgcataag
ccattatgat aaattaagca tgaaaaatat tgctgaacta 4260cttttggtgc ttaaagttgt
cactattctt gaattagagt tgctctacaa tgacacacaa 4320atcccattaa ataaattata
aacaagggtc aattcaaatt tgaagtaatg ttttagtaag 4380gagagattag aagacaacag
gcatagcaaa tgacataagc taccgattaa ctaatcggaa 4440catgtaaaac agttacaaaa
ataaacgaac tctcctcttg tcctacaatg aaagccctca 4500tgtgcagtag agatgcagtt
tcatcaaaga acaaacatcc ttgcaaatgg gtgtgacgcg 4560gttccagatg tggatttggc
aaaacctcat ttaagtaaaa ggttagcaga gcaaagtgcg 4620gtgctttagc tgctgcttgt
gccgctgtgg cgtcggggag gctcctgcct gagcttcctt 4680ccccagcttt gctgcctgag
aggaaccaga gcagacgcac aggccggaaa aggcgcatct 4740aacgcgtatc taggctttgg
taactgcgga caagttgctt ttacctgatt tgatgataca 4800tttcattaag gttccagtta
taaatatttt gttaatattt attaagtgac tatagaatgc 4860aactccattt accagtaact
tattttaaat atgcctagta acacatatgt agtataattt 4920ctagaaacaa acatctaata
agtatataat cctgtgaaaa tatgaggctt gataatatta 4980ggttgtcacg atgaagcatg
ctagaagctg taacagaata catagagaat aatgaggagt 5040ttatgatgga accttaaata
tataatgttg ccagcgattt tagttcaata tttgttactg 5100ttatctatct gctgtatatg
gaattctttt aattcaaacg ctgaaaagaa tcagcattta 5160gtcttgccag gcacacccaa
taatcagtca tgtgtaatat gcacaagttt gtttttgttt 5220ttgttttttt tgttggttgg
tttgtttttt tgctttaagt tgcatgatct ttctgcagga 5280aatagtcact catcccactc
cacataaggg gtttagtaag agaagtctgt ctgtctgatg 5340atggataggg ggcaaatctt
tttccccttt ctgttaatag tcatcacatt tctatgccaa 5400acaggaacaa tccataactt
tagtcttaat gtacacattg cattttgata aaattaattt 5460tgttgtttcc tttgaggttg
atcgttgtgt tgttgttttg ctgcactttt tacttttttg 5520cgtgtggagc tgtattcccg
agaccaacga agcgttggga tacttcatta aatgtagcga 5580ctgtcaacag cgtgcaggtt
ttctgtttct gtgttgtggg gtcaaccgta caatggtgtg 5640ggagtgacga tgatgtgaat
atttagaatg taccatattt tttgtaaatt atttatgttt 5700ttctaaacaa atttatcgta
taggttgatg aaacgtcatg tgttttgcca aagactgtaa 5760atatttattt atgtgttcac
atggtcaaaa tttcaccact gaaaccctgc acttagctag 5820aacctcattt ttaaagatta
acaacaggaa ataaattgta aaaaaggttt tctatacatg 5880aaaaaaaaaa aaaaaaaa
58981182207DNAHomo sapiens
118acgcacttgg cgcgcggcgc gggctgcaga cggctgcgag gcgctgggca caggtgtcct
60gatggcaaat ttcaagggcc acgcgcttcc agggagtttc ttcctgatca ttgggctgtg
120ttggtcagtg aagtacccgc tgaagtactt tagccacacg cggaagaaca gcccactaca
180ttactatcag cgtctcgaga tcgtcgaagc cgcaattagg actttgtttt ccgtcactgg
240gatcctggca gagcagtttg ttccggatgg gccccacctg cacctctacc atgagaacca
300ctggataaag ttaatgaatt ggcagcacag caccatgtac ctattctttg cagtctcagg
360aattgttgac atgctcacct atctggtcag ccacgttccc ttgggggtgg acagactggt
420tatggctgtg gcagtattca tggaaggttt cctcttctac taccacgtcc acaaccggcc
480tccgctggac cagcacatcc actcactcct gctgtatgct ctgttcggag ggtgtgttag
540tatctcccta gaggtgatct tccgggacca cattgtgctg gaacttttcc gaaccagtct
600catcattctt cagggaacct ggttctggca gattgggttt gtgctgttcc caccttttgg
660aacacccgaa tgggaccaga aggatgatgc caacctcatg ttcatcacca tgtgcttctg
720ctggcactac ctggctgccc tcagcattgt ggccgtcaac tattctcttg tttactgcct
780tttgactcgg atgaagagac acggaagggg agaaatcatt ggaattcaga agctgaattc
840agatgacact taccagaccg ccctcttgag tggctcagat gaggaatgag ccgagatgcg
900gagggcgcag atgtcccact gcacagctgg aatgaatgga gttcatcccc tccacctgaa
960tgcctgctgt ggtctgatct taagggtcta tatatttgca cctcctcatt caacacaggg
1020ctggaggttc tacaacagga aatcaggcct acagcatcct gtgtatcttg cagttgggat
1080ttttaaacat actataaagt ctgtgttggt atagtaccct tcataaggaa aaatgaagta
1140atgcctataa gtagcaggcc tttgtgcctc agtgtcaaga gaaatcaaga gatgctaaaa
1200gctttacaat ggaagtggcc tcatggatga atccggggta tgagcccagg agaacgtgct
1260gcttttggta acttatccct ttttctctta agaaagcagg tactttctta ttagaaatat
1320gttagaatgt gtaagcaaac gacagtgcct ttagaattac aattctaact tacatatttt
1380ttgaaagtaa aataattcac aagctttggt attttaaaat tattgttaaa catatcataa
1440ctaatcatac cagggtactg caataccact gtttataagt gacaaaatta ggccaaaggt
1500gatttttttt taaatcagga agctggttac tggctctact gagagttgga gccctgatgt
1560tctgattctt caaagtcacc ctaaaagaag atctgacagg aaagctgtat aatgagatag
1620aaaaacgtca ggtatggaag gctttcagtt ttaatatggc tgaaagcaaa ggataacgaa
1680ttcagaatta gtaatgtaaa atcttgatac cctaatcttg cttctggatc tgttcttttt
1740ttaaaaaaac ttccttcacc gcgcctataa tcctagcact ttgggaggcc gaggcaggca
1800gatcacgggg tcaggagatc aagaccatcc tggctaacat ggtgaaaccc cgtctctact
1860gaaaatacaa aaaattagcc gggtgtggtg gcgggcgcct gtagttccag ctactcggga
1920ggctgaggca agagaatggc atgaacccgg taggggagct tgcagtgagc ccagatcatg
1980ccactgtact ccagcctagg tgacagagca agactctgtc tcaaaaacaa gcaaacagac
2040ttccttcaac aaatatttat taaatatcca ctttgcaaca gcactgaaat ggctgtaagg
2100actcctgaga tatgtgtcca gcaaggagtt tacagtcaaa caggagagac atgcctgtag
2160ttacatccag tgtgatgggt gctgagaggc aagtacaaac cacgatg
22071194090DNAHomo sapiens 119ggactctgag tcgtcttggt cccaggagcc agtagtgaag
gcaacagtct gcccacctgt 60ggacaccaga tcctgggagc tcctggttag caagtgagat
ctctgggatg tcagtgaggc 120tggttgaaga ccagaggtaa actgcagagg tcaccacccc
caccatgtcc caggtgatgt 180ccagcccact gctggcagga ggccatgctg tcagcttggc
gccttgtgat gagcccagga 240ggaccctgca cccagcaccc agccccagcc tgccacccca
gtgttcttac tacaccacgg 300aaggctgggg agcccaggcc ctgatggccc ccgtgccctg
catggggccc cctggccgac 360tccagcaagc cccacaggtg gaggccaaag ccacctgctt
cctgccgtcc cctggtgaga 420aggccttggg gaccccagag gaccttgact cctacattga
cttctcactg gagagcctca 480atcagatgat cctggaactg gaccccacct tccagctgct
tcccccaggg actgggggct 540cccaggctga gctggcccag agcaccatgt caatgagaaa
gaaggaggaa tctgaagcct 600tggacataaa gtacatcgag gtgacctccg ccagatcaag
gtgccacgat ggcccccagc 660actgctccag cccctctgtc accccgccct tcggctccct
tcgcagtggt ggcctcctcc 720tttccagaga cgtcccccga gagacacgaa gcagcagtga
gagcctcatc ttctctggga 780accagggcag ggggcaccag cgccctctgc ccccctcaga
gggtctctcc cctcgacccc 840caaattcccc cagcatctca atcccttgca tggggagcaa
ggcctcgagc ccccatggtt 900tgggctcccc gctggtggct tctccaagac tggagaagcg
gctgggaggc ctggccccac 960agcggggcag caggatctct gtgctgtcag ccagcccagt
gtctgatgtc agctatatgt 1020ttggaagcag ccagtccctc ctgcactcca gcaactccag
ccatcagtca tcttccagat 1080ccttggaaag tccagccaac tcttcctcca gcctccacag
ccttggctca gtgtccctgt 1140gtacaagacc cagtgacttc caggctccca gaaaccccac
cctaaccatg ggccaaccca 1200gaacacccca ctctccacca ctggccaaag aacatgccag
cagctgcccc ccatccatca 1260ccaactccat ggtggacata cccattgtgc tgatcaacgg
ctgcccagaa ccagggtctt 1320ctccacccca gcggacccca ggacaccaga actccgttca
acctggagct gcttctccca 1380gcaacccctg tccagccacc aggagcaaca gccagaccct
gtcagatgcc ccctttacca 1440catgcccaga gggtcccgcc agggacatgc agcccaccat
gaagttcgtg atggacacat 1500ctaaatactg gtttaagcca aacatcaccc gagagcaagc
aatcgagctg ctgaggaagg 1560aggagccagg ggcttttgtc ataagggaca gctcttcata
ccgaggctcc ttcggcctgg 1620ccctgaaggt gcaggaggtt cccgcgtctg ctcagagtcg
accaggtgag gacagcaatg 1680acctcatccg acacttcctc atcgagtcgt ctgccaaagg
agtgcatctc aaaggagcag 1740atgaggagcc ctactttggg agcctctctg ccttcgtgtg
ccagcattcc atcatggccc 1800tggccctgcc ctgcaaactc accatcccac agagagaact
gggaggtgca gatggggcct 1860cggactctac agacagccca gcctcctgcc agaagaaatc
tgcgggctgc cacaccctgt 1920acctgagctc agtgagcgtg gagaccctga ctggagccct
ggccgtgcag aaagccatct 1980ccaccacctt tgagagggac atcctcccca cgcccaccgt
ggtccacttc aaagtcacag 2040agcagggcat cactctgact gatgtccaga ggaaggtgtt
tttccggcgc cattacccac 2100tcaccaccct ccgcttctgt ggtatggacc ctgagcaacg
gaagtggcag aagtactgca 2160aaccctcctg gatctttggg tttgtggcca agagccagac
agagcctcag gagaacgtat 2220gccacctctt tgcggagtat gacatggtcc agccagcctc
gcaggtcatc ggcctggtga 2280ctgctctgct gcaggacgca gaaaggatgt aggggagaga
ctgcctgtgc acctaaccaa 2340cacctccagg ggctcgctaa ggagcccccc tccaccccct
gaatgggtgt ggcttgtggc 2400catattgaca gaccaatcta tgggactagg gggattggca
tcaagttgac acccttgaac 2460ctgctatggc cttcagcagt caccatcatc cagacccccc
gggcctcagt ttcctcaatc 2520atagaagaag accaatagac aagatcagct gttcttagat
gctggtgggc atttgaacat 2580gctcctccat gattctgaag catgcacacc tctgaagacc
cctgcatgaa aataacctcc 2640aaggaccctc tgaccccatc gacctgggcc ctgcccacac
aacagtctga gcaagagacc 2700tgcagcccct gtttcgtggc agacagcagg tgcctggcgg
tgacccacgg ggctcctggc 2760ttgcagctgg tgatggtcaa gaactgacta caaaacagga
atggatagac tctatttcct 2820tccatatctg ttcctctgtt ccttttccca ctttctgggt
ggctttttgg gtccacccag 2880ccaggatgct gcaggccaag ctgggtgtgg tatttagggc
agctcagcag ggggaacttg 2940tccccatggt cagaggagac ccagctgtcc tgcaccccct
tgcagatgag tatcacccca 3000tcttttcttt ccacttggtt tttattttta tttttttttg
agacagagtc tcactgtcac 3060ccaggctgaa ctgcagtggt gtgatctagg ctcactgcaa
cctccacctc ccaggttcaa 3120gcaattatcc tgcctcaggc tcccaagtag ctgggattac
aggcatgtgc aactcaccca 3180gctaattttg tatttttagt agagacaggg tttcaccatg
ttggccaggc tggtcttgaa 3240ctcctgaccg caggtaatcc acctgcttcg gcctcccaaa
gtgctgggat tacaggcgca 3300agccacccag cccagcttct ttccattcct tgataggcga
gtattccaaa gctggtatcg 3360tagctgccct aatgttgcat attaggcggc gggggcagag
ataagggcca tctctctgtg 3420attctgcctc agctcctgtc ttgctgagcc ctcccccaac
ccacgctcca acacacacac 3480acacacacac acacacacac acacacacac acacacacac
acacacacac gcccctctac 3540tgctatgtgg cttcaaccag cctcacagcc acacggggga
agcagagagt caagaatgca 3600aagaggccgc ttccctaaga ggcttggagg agctgggctc
tatcccacac ccacccccac 3660cccaccccca cccagcctcc agaagctgga accatttctc
ccgcaggcct gagttcctaa 3720ggaaaccacc ctaccggggt ggaagggagg gtcagggaag
aaacccactc ttgctctacg 3780aggagcaagt gcctgccccc tcccagcagc cagccctgcc
aaagttgcat tatctttggc 3840caaggctggg cctgacggtt atgatttcag ccctgggcct
gcaggagagg ctgagaccag 3900cccacccagc cagtggtcga gcactgcccc gccgccaaag
tctgcagaat gtgagatgag 3960gttctcaagg tcacaggccc cagtcccagc ctgggggctg
gcagaggccc ccatatactc 4020tgctacagct cctatcatga aaaataaaat gtttgtcttt
gcaaaacagt aaaaaaaaaa 4080aaaaaaaaaa
40901203679DNAHomo sapiens 120ctgcagcctc cgcagaggcg
attggctgaa gctaccggcc gcgtggggcg ggactcggtt 60gccagggagc ggcgcgggag
ccctgagggg actgcggcgg ctgcgcggag gagcgaggca 120cttgctgggg tcggggctgc
gcgacggcgc aggggctgcg gggagcgccg cgcaggccgt 180gcagttccta gcgaggaggc
gccgccgcca ttgccgctct ctcggtgagc gcagccccgc 240tctccgggcc gggccttcgc
gggccaccgg cgccatgggc cagtgcggca tcacctcctc 300caagaccgtg ctggtctttc
tcaacctcat cttctggttt gtcatcatcc tgctcttggt 360ttttgtcaca gaagttgttg
tagtggtttt gggatatgtt tacagagcaa aggtggaaaa 420tgaggttgat cgcagcattc
agaaagtgta taagacctac aatggaacca accctgatgc 480tgctagccgg gctattgatt
atgtacagag acagctgcat tgttgtggaa ttcacaacta 540ctcagactgg gaaaatacag
attggttcaa agaaaccaaa aaccagagtg tccctcttag 600ctgctgcaga gagactgcca
gcaattgtaa tggcagcctg gcccaccctt ccgacctcta 660tgctgagggg tgtgaggctc
tagtagtgaa gaagctacaa gaaatcatga tgcatgtgat 720ctgggccgca ctggcatttg
cagctattca gctgctgggc atgctgtgtg cttgcatcgt 780gttgtgcaga aggagtagag
atcctgctta cgagctcctc atcactggcg gaacctatgc 840atagttgaca actcaagcct
gagctttttg gtcttgttct gatttggaag gtgaattgag 900caggtctgct gctgttggcc
tctggagttc atttagttaa agcacatgta cactggtgtt 960ggacagagca gcttggcttt
tcatgtgccc acctacttac ctactacctg cgactttctt 1020tttccttgtt ctagctgact
cttcatgccc ctaagatttt aagtacgatg gtgaacgttc 1080taatttcaga accaattgcg
agtcatgtag tgtggtagaa ttaaaggagg acacgagcct 1140gcttctgtta cctccaagtg
gtaacaggac tgatgccgaa atgtcaccag gtcctttcag 1200tcttcacagt ggagaactct
tggccaaagg tttttgcggg gaggaggagg aaaccagctt 1260tctggttaag gttaacacca
gatggtgccc ctcattggtg tccttttaaa aaatatttac 1320tgtagtccaa taagatagca
gctgtacaaa atgactaaaa tagattgtag gatcatatgg 1380cgtatatctt ggttcatctt
caaaatcaga gactgagctt tgaaactagt ggtttttaat 1440caaagttggc tttataggag
gagtataatg tatgcactac tgttttaaaa gaattagtgt 1500gagtgtgttt ttgtatgaat
gagcccattc atggtaagtc ttaagcttgt tggaaataat 1560gtacccatgt agactagcaa
aatagtatgt agatgtgatc tcagttgtaa atagaaaaat 1620ctaattcaat aaactctgta
tcagccccca acatattatt tttcattatt tgggggatat 1680ttcagttcca gagcagcagt
atcatgtttt ctttgttggt gctgtctata gttcatcatg 1740gtttacgtgt gttttcgtta
tagctgttgc cagattctaa agggcttgat attcaaaaaa 1800ccacagatgc tttcagtcca
gtatatccta gaaatataga gctctacttt gtgcaatgca 1860ctggggatac agtggcgata
ctgtccttgt cttcaaggag ttcggagtcc tagtatagga 1920gacatacata ggagaagata
attttcacac tgcagtggtt gtagtaatag aatgggagtc 1980caaaggggag ttccggagag
gtcaggggtg acttcctgga ggagatgccc aagcttggag 2040gctggatagg ctttgttgaa
agatggtaca caagagtgtg aaacaaaatt gtgtgtgcag 2100ggagcttaaa atacaaggct
ggggaaagaa gttggagaag caggaaagcc caggccctct 2160agtgtcttac ggaacatcct
gtggaggtga gagctgactt gtaggtggaa gcagctcttt 2220ggaggtttga tttggaaggt
gaactgagaa gaaggtggtg atgcaagccg gccgtgctga 2280agccaggatg aattggtgtg
actgggcttc agttaggctg gcatagcagt tgagagagct 2340tagcagtggg cagcagggca
ctgttggggg cggtggtgag cggtggattc tggctgtttg 2400gaggcaggac agtggtggaa
ttcgatcatt gattggacgt gggacagaga agggagaaga 2460gaagactcat ctaggatgag
tcccaggttt ctggctcagt caactgggga aacaaagtca 2520cagagctagg gagtagttag
agaacacatc tgggggtgtg actcatggtc agttttgggc 2580tcgtcagttt tgagatgccc
aaatatcatg cagatctgtc ccacctgaaa atggagaaga 2640cacgggaaag gagaggagta
aaaactaact ccttttcaca aagtggaagt taccagaatg 2700tgatttcaga ggccccgggg
gatttatcat gtgactactg acccatccca cctcttgccc 2760ctgcctgttg cacagtgggc
aagaatgttt gtgacctttc actaccacca cctcccccga 2820gcatggtccc cccagttttc
aatatgaacc atcctgtggg taccctgtca caggctggcc 2880ctgaggtgag caatatttgg
actgtgatgt tggttgttct ccactctttc tacaggacag 2940aacagggcct ctagagtggg
aaatggcttt gggaaatatg ccaagcagta gccttgttct 3000tcaacttgcc caagaggata
attctccaca cccttcctgt actcagtcct cagtttgcct 3060ggtgagagag cagcctcctc
ccgtgtgctc tgccagctgg acccagactg gccatattac 3120cagtgagacc aaaaagatgg
aggtggggag gtagctctga ggtctgggaa accattccag 3180ctcctgccag ttttaacttg
tgtttaattc ctggcacagt tgtcctggaa atgccttttt 3240ctcttgcctg ggaaccacta
gaaggggatg ttgtctgtgt tggccagggc catgcaaatt 3300caacatcttg tttctgccct
tcccccgtgt agctgaggct aggtgttggc attacccagt 3360gcttgttctt cagagagcaa
aagcactgct cgtcatgtct gaaatttagt gagtgagctc 3420acccactagg ctggtgtttc
ctgcccgtgg ctgcacattg gaagcaccgg ggcactttga 3480gaactacaga tgcctgggtc
ccagagcatc taaggtgctc tagggtgtgt ccaggacaca 3540gccctggttg aggaccactg
ctatattgta tggcctcttt taaaaaagtt aattttactt 3600ggaaatgatt tcaaagctac
agaaaagttg caagaataaa aactgtacaa atgaggctca 3660aaaaaaaaaa aaaaaaaaa
36791211442DNAHomo sapiens
121ttgacattct tctggacaat gagtcccatc atctctccac catgcacctt gtgactccct
60cctctgctga caacagataa ccacctttaa ctgtaacttt ccacagccta ccccagccct
120ataaagctgc ctctctccta tctcccttcg ctgactctct tttcagactc agcccacttg
180cacccaagtg aattaacagc cttgttgctc acacaaagcc tgtttaggtg gtcttctata
240tggacatgcg tgacacttgg tgccaaaatc tgggccaggg ggactccttt gtgagaccgg
300ccccctgtcc tggccctcac tccgtgaaga gatccacctg cgacctcggg tcctcagacc
360agcccaagga acatctcacc aatttcaaat cggatctcct cggcttagtg gctgaagact
420gatgctgccc gatcgcctca gaagccccct ggaccatcac agatgccgag cttcgggtaa
480ctcttacggt ggaggattcc cagccatatg aagacaccct agctggacga tcagtccttg
540tcaaaagtct gacccctcaa actctacagc ctcaatggac cagaccctac ccggtcattt
600atagcacacc aactgccgtc catctgcagg accctctcca ttgggttcac cattccagaa
660taaagccatg cccatcagac agccagcttg atctctcctc ttcctcctgg aagccacaag
720attaggccga gagccgatca gacaaacaac ctacaaccct taagctcctg gcagcgccca
780gccaaggcca tgcttccatg caacactcct tccaaatggc catcccagca tgcttccaag
840caggcttcat ccgttcctct ggaccctcat ctcttaagac ctgccgccta taaaaaggat
900tatatcttga gaccctatcc tctaaaattt tttccacacc caaaacaaaa aatctctggg
960tcaaaagtct aaaacgctta ggctggcaac catcagatcc ttgcccatgg tgtcctcaag
1020cctactctca tgaaatggac aacagtacac gcatatgggg ccagttccac atatttggca
1080accagaccag catccaggac aacacaaagt atgttgtttg ttgttagagg gcttgggaca
1140tttcactctt tgccagcctc agcttaatcc aggagacaaa gattattttc cttattatct
1200cttctgcata ggatctgcaa tcagaactat tgaacttctc cattcagacc gccactcaca
1260cctatgggaa aagggtaatg tatcatcggc ttagcaacag ggaatactat tcgtatgatg
1320gaaaatgggg acaaaaggct ttggtacata aaacattatt ccttccttgg cctaaaaact
1380catcgccacc tacattaaag ctaatatgcc tgataaaaaa aaaaaaaaaa aaaaaaaaaa
1440aa
144212212416DNAHomo sapiens 122cttcttctcg ctgagtctcc tcctcggctc
tgacggtaca gtgatataat gatgatgggt 60gtcacaaccc gcatttgaac ttgcaggcga
gctgccccga gcctttctgg ggaagaactc 120caggcgtgcg gacgcaacag ccgagaacat
taggtgttgt ggacaggagc tgggaccaag 180atcttcggcc agccccgcat cctcccgcat
cttccagcac cgtcccgcac cctccgcatc 240cttccccggg ccaccacgct tcctatgtga
cccgcctggg caacgccgaa cccagtcgcg 300cagcgctgca gtgaattttc cccccaaact
gcaataagcc gccttccaag gccaagatgt 360tcataaatat aaagagcatc ttatggatgt
gttcaacctt aatagtaacc catgcgctac 420ataaagtcaa agtgggaaaa agcccaccgg
tgaggggctc cctctctgga aaagtcagcc 480taccttgtca tttttcaacg atgcctactt
tgccacccag ttacaacacc agtgaatttc 540tccgcatcaa atggtctaag attgaagtgg
acaaaaatgg aaaagatttg aaagagacta 600ctgtccttgt ggcccaaaat ggaaatatca
agattggtca ggactacaaa gggagagtgt 660ctgtgcccac acatcccgag gctgtgggcg
atgcctccct cactgtggtc aagctgctgg 720caagtgatgc gggtctttac cgctgtgacg
tcatgtacgg gattgaagac acacaagaca 780cggtgtcact gactgtggat ggggttgtgt
ttcactacag ggcggcaacc agcaggtaca 840cactgaattt tgaggctgct cagaaggctt
gtttggacgt tggggcagtc atagcaactc 900cagagcagct ctttgctgcc tatgaagatg
gatttgagca gtgtgacgca ggctggctgg 960ctgatcagac tgtcagatat cccatccggg
ctcccagagt aggctgttat ggagataaga 1020tgggaaaggc aggagtcagg acttatggat
tccgttctcc ccaggaaact tacgatgtgt 1080attgttatgt ggatcatctg gatggtgatg
tgttccacct cactgtcccc agtaaattca 1140ccttcgagga ggctgcaaaa gagtgtgaaa
accaggatgc caggctggca acagtggggg 1200aactccaggc ggcatggagg aacggctttg
accagtgcga ttacgggtgg ctgtcggatg 1260ccagcgtgcg ccaccctgtg actgtggcca
gggcccagtg tggaggtggt ctacttgggg 1320tgagaaccct gtatcgtttt gagaaccaga
caggcttccc tccccctgat agcagatttg 1380atgcctactg ctttaaacct aaagaggcta
caaccatcga tttgagtatc ctcgcagaaa 1440ctgcatcacc cagtttatcc aaagaaccac
aaatggtttc tgatagaact acaccaatca 1500tccctttagt tgatgaatta cctgtcattc
caacagagtt ccctcccgtg ggaaatattg 1560tcagttttga acagaaagcc acagtccaac
ctcaggctat cacagatagt ttagccacca 1620aattacccac acctactggc agtaccaaga
agccctggga tatggatgac tactcacctt 1680ctgcttcagg acctcttgga aagctagaca
tatcagaaat taaggaagaa gtgctccaga 1740gtacaactgg cgtctctcat tatgctacgg
attcatggga tggtgtcgtg gaagataaac 1800aaacacaaga atcggttaca cagattgaac
aaatagaagt gggtcctttg gtaacatcta 1860tggaaatctt aaagcacatt ccttccaagg
aattccctgt aactgaaaca ccattggtaa 1920ctgcaagaat gatcctggaa tccaaaactg
aaaagaaaat ggtaagcact gtttctgaat 1980tggtaaccac aggtcactat ggattcacct
tgggagaaga ggatgatgaa gacagaacac 2040ttacagttgg atctgatgag agcaccttga
tctttgacca aattcctgaa gtcattacgg 2100tgtcaaagac ttcagaagac accatccaca
ctcatttaga agacttggag tcagtctcag 2160catccacaac tgtttcccct ttaattatgc
ctgataataa tggatcatcc atggatgact 2220gggaagagag acaaactagt ggtaggataa
cggaagagtt tcttggcaaa tatctgtcta 2280ctacaccttt tccatcacag catcgtacag
aaatagaatt gtttccttat tctggtgata 2340aaatattagt agagggaatt tccacagtta
tttatccttc tctacaaaca gaaatgacac 2400atagaagaga aagaacagaa acactaatac
cagagatgag aacagatact tatacagatg 2460aaatacaaga agagatcact aaaagtccat
ttatgggaaa aacagaagaa gaagtcttct 2520ctgggatgaa actctctaca tctctctcag
agccaattca tgttacagag tcttctgtgg 2580aaatgaccaa gtcttttgat ttcccaacat
tgataacaaa gttaagtgca gagccaacag 2640aagtaagaga tatggaggaa gactttacag
caactccagg tactacaaaa tatgatgaaa 2700atattacaac agtgcttttg gcccatggta
ctttaagtgt tgaagcagcc actgtatcaa 2760aatggtcatg ggatgaagat aatacaacat
ccaagccttt agagtctaca gaaccttcag 2820cctcttcaaa attgccccct gccttactca
caactgtggg gatgaatgga aaggataaag 2880acatcccaag tttcactgaa gatggagcag
atgaatttac tcttattcca gatagtactc 2940aaaagcagtt agaggaggtt actgatgaag
acatagcagc ccatggaaaa ttcacaatta 3000gatttcagcc aactacatca actggtattg
cagaaaagtc aactttgaga gattctacaa 3060ctgaagaaaa agttccacct atcacaagca
ctgaaggcca agtttatgca accatggaag 3120gaagtgcttt gggtgaagta gaagatgtgg
acctctctaa gccagtatct actgttcccc 3180aatttgcaca cacttcagag gtggaaggat
tagcatttgt tagttatagt agcacccaag 3240agcctactac ttatgtagac tcttcccata
ccattcctct ttctgtaatt cccaagacag 3300actggggagt gttagtacct tctgttccat
cagaagatga agttctaggt gaaccctctc 3360aagacatact tgtcattgat cagactcgcc
ttgaagcgac tatttctcca gaaactatga 3420gaacaacaaa aatcacagag ggaacaactc
aggaagaatt cccttggaaa gaacagactg 3480cagagaaacc agttcctgct ctcagttcta
cagcttggac tcccaaggag gcagtaacac 3540cactggatga acaagagggc gatggatcag
catatacagt ctctgaagat gaattgttga 3600caggttctga gagggtccca gttttagaaa
caactccagt tggaaaaatt gatcacagtg 3660tgtcttatcc accaggtgct gtaactgagc
acaaagtgaa aacagatgaa gtggtaacac 3720taacaccacg cattgggcca aaagtatctt
taagtccagg gcctgaacaa aaatatgaaa 3780cagaaggtag tagtacaaca ggatttacat
catctttgag tccttttagt acccacatta 3840cccagcttat ggaagaaacc actactgaga
aaacatccct agaggatatt gatttaggct 3900caggattatt tgaaaagccc aaagccacag
aactcataga attttcaaca atcaaagtca 3960cagttccaag tgatattacc actgccttca
gttcagtaga cagacttcac acaacttcag 4020cattcaagcc atcttccgcg atcactaaga
aaccacctct catcgacagg gaacctggtg 4080aagaaacaac cagtgacatg gtaatcattg
gagaatcaac atctcatgtt cctcccacta 4140cccttgaaga tattgtagcc aaggaaacag
aaaccgatat tgatagagag tatttcacga 4200cttcaagtcc tcctgctaca cagccaacaa
gaccacccac tgtggaagac aaagaggcct 4260ttggacctca ggcgctttct acgccacagc
ccccagcaag cacaaaattt caccctgaca 4320ttaatgttta tattattgag gtcagagaaa
ataagacagg tcgaatgagt gatttgagtg 4380taattggtca tccaatagat tcagaatcta
aagaagatga accttgtagt gaagaaacag 4440atccagtgca tgatctaatg gctgaaattt
tacctgaatt ccctgacata attgaaatag 4500acctatacca cagtgaagaa aatgaagaag
aagaagaaga gtgtgcaaat gctactgatg 4560tgacaaccac cccatctgtg cagtacataa
atgggaagca tctcgttacc actgtgccca 4620aggacccaga agctgcagaa gctaggcgtg
gccagtttga aagtgttgca ccttctcaga 4680atttctcgga cagctctgaa agtgatactc
atccatttgt aatagccaaa acggaattgt 4740ctactgctgt gcaacctaat gaatctacag
aaacaactga gtctcttgaa gttacatgga 4800agcctgagac ttaccctgaa acatcagaac
atttttcagg tggtgagcct gatgttttcc 4860ccacagtccc attccatgag gaatttgaaa
gtggaacagc caaaaaaggg gcagaatcag 4920tcacagagag agatactgaa gttggtcatc
aggcacatga acatactgaa cctgtatctc 4980tgtttcctga agagtcttca ggagagattg
ccattgacca agaatctcag aaaatagcct 5040ttgcaagggc tacagaagta acatttggtg
aagaggtaga aaaaagtact tctgtcacat 5100acactcccac tatagttcca agttctgcat
cagcatatgt ttcagaggaa gaagcagtta 5160ccctaatagg aaatccttgg ccagatgacc
tgttgtctac caaagaaagc tgggtagaag 5220caactcctag acaagttgta gagctctcag
ggagttcttc gattccaatt acagaaggct 5280ctggagaagc agaagaagat gaagatacaa
tgttcaccat ggtaactgat ttatcacaga 5340gaaatactac tgatacactc attactttag
acactagcag gataatcaca gaaagctttt 5400ttgaggttcc tgcaaccacc atttatccag
tttctgaaca accttctgca aaagtggtgc 5460ctaccaagtt tgtaagtgaa acagacactt
ctgagtggat ttccagtacc actgttgagg 5520aaaagaaaag gaaggaggag gagggaacta
caggtacggc ttctacattt gaggtatatt 5580catctacaca gagatcggat caattaattt
taccctttga attagaaagt ccaaatgtag 5640ctacatctag tgattcaggt accaggaaaa
gttttatgtc cttgacaaca ccaacacagt 5700ctgaaaggga aatgacagat tctactcctg
tctttacaga aacaaataca ttagaaaatt 5760tgggggcaca gaccactgag cacagcagta
tccatcaacc tggggttcag gaagggctga 5820ccactctccc acgtagtcct gcctctgtct
ttatggagca gggctctgga gaagctgctg 5880ccgacccaga aaccaccact gtttcttcat
tttcattaaa cgtagagtat gcaattcaag 5940ccgaaaagga agtagctggc actttgtctc
cgcatgtgga aactacattc tccactgagc 6000caacaggact ggttttgagt acagtaatgg
acagagtagt tgctgaaaat ataacccaaa 6060catccaggga aatagtgatt tcagagcgat
taggagaacc aaattatggg gcagaaataa 6120ggggcttttc cacaggtttt cctttggagg
aagatttcag tggtgacttt agagaatact 6180caacagtgtc tcatcccata gcaaaagaag
aaacggtaat gatggaaggc tctggagatg 6240cagcatttag ggacacccag acttcaccat
ctacagtacc tacttcagtt cacatcagtc 6300acatatctga ctcagaagga cccagtagca
ccatggtcag cacttcagcc ttcccctggg 6360aagagtttac atcctcagct gagggctcag
gtgagcaact ggtcacagtc agcagctctg 6420ttgttccagt gcttcccagt gctgtgcaaa
agttttctgg tacagcttcc tccattatcg 6480acgaaggatt gggagaagtg ggtactgtca
atgaaattga tagaagatcc accattttac 6540caacagcaga agtggaaggt acgaaagctc
cagtagagaa ggaggaagta aaggtcagtg 6600gcacagtttc aacaaacttt ccccaaacta
tagagccagc caaattatgg tctaggcaag 6660aagtcaaccc tgtaagacaa gaaattgaaa
gtgaaacaac atcagaggaa caaattcaag 6720aagaaaagtc atttgaatcc cctcaaaact
ctcctgcaac agaacaaaca atctttgatt 6780cacagacatt tactgaaact gaactcaaaa
ccacagatta ttctgtacta acaacaaaga 6840aaacttacag tgatgataaa gaaatgaagg
aggaagacac ttctttagtt aacatgtcta 6900ctccagatcc agatgcaaat ggcttggaat
cttacacaac tctccctgaa gctactgaaa 6960agtcacattt tttcttagct actgcattag
taactgaatc tataccagct gaacatgtag 7020tcacagattc accaatcaaa aaggaagaaa
gtacaaaaca ttttccgaaa ggcatgagac 7080caacaattca agagtcagat actgagctct
tattctctgg actgggatca ggagaagaag 7140ttttacctac tctaccaaca gagtcagtga
attttactga agtggaacaa atcaataaca 7200cattatatcc ccacacttct caagtggaaa
gtacctcaag tgacaaaatt gaagacttta 7260acagaatgga aaatgtggca aaagaagttg
gaccactcgt atctcaaaca gacatctttg 7320aaggtagtgg gtcagtaacc agcacaacat
taatagaaat tttaagtgac actggagcag 7380aaggacccac ggtggcacct ctccctttct
ccacggacat cggacatcct caaaatcaga 7440ctgtcaggtg ggcagaagaa atccagacta
gtagaccaca aaccataact gaacaagact 7500ctaacaagaa ttcttcaaca gcagaaatta
acgaaacaac aacctcatct actgattttc 7560tggctagagc ttatggtttt gaaatggcca
aagaatttgt tacatcagca ccaaaaccat 7620ctgacttgta ttatgaacct tctggagaag
gatctggaga agtggatatt gttgattcat 7680ttcacacttc tgcaactact caggcaacca
gacaagaaag cagcaccaca tttgtttctg 7740atgggtccct ggaaaaacat cctgaggtgc
caagcgctaa agctgttact gctgatggat 7800tcccaacagt ttcagtgatg ctgcctcttc
attcagagca gaacaaaagc tcccctgatc 7860caactagcac actgtcaaat acagtgtcat
atgagaggtc cacagacggt agtttccaag 7920accgtttcag ggaattcgag gattccacct
taaaacctaa cagaaaaaaa cccactgaaa 7980atattatcat agacctggac aaagaggaca
aggatttaat attgacaatt acagagagta 8040ccatccttga aattctacct gagctgacat
cggataaaaa tactatcata gatattgatc 8100atactaaacc tgtgtatgaa gacattcttg
gaatgcaaac agatatagat acagaggtac 8160catcagaacc acatgacagt aatgatgaaa
gtaatgatga cagcactcaa gttcaagaga 8220tctatgaggc agctgtcaac ctttctttaa
ctgaggaaac atttgagggc tctgctgatg 8280ttctggctag ctacactcag gcaacacatg
atgaatcaat gacttatgaa gatagaagcc 8340aactagatca catgggcttt cacttcacaa
ctgggatccc tgctcctagc acagaaacag 8400aattagacgt tttacttccc acggcaacat
ccctgccaat tcctcgtaag tctgccacag 8460ttattccaga gattgaagga ataaaagctg
aagcaaaagc cctggatgac atgtttgaat 8520caagcacttt gtctgatggt caagctattg
cagaccaaag tgaaataata ccaacattgg 8580gccaatttga aaggactcag gaggagtatg
aagacaaaaa acatgctggt ccttcttttc 8640agccagaatt ctcttcagga gctgaggagg
cattagtaga ccatactccc tatctaagta 8700ttgctactac ccaccttatg gatcagagtg
taacagaggt gcctgatgtg atggaaggat 8760ccaatccccc atattacact gatacaacat
tagcagtttc aacatttgcg aagttgtctt 8820ctcagacacc atcatctccc ctcactatct
actcaggcag tgaagcctct ggacacacag 8880agatccccca gcccagtgct ctgccaggaa
tagacgtcgg ctcatctgta atgtccccac 8940aggattcttt taaggaaatt catgtaaata
ttgaagcgac tttcaaacca tcaagtgagg 9000aataccttca cataactgag cctccctctt
tatctcctga cacaaaatta gaaccttcag 9060aagatgatgg taaacctgag ttattagaag
aaatggaagc ttctcccaca gaacttattg 9120ctgtggaagg aactgagatt ctccaagatt
tccaaaacaa aaccgatggt caagtttctg 9180gagaagcaat caagatgttt cccaccatta
aaacacctga ggctggaact gttattacaa 9240ctgccgatga aattgaatta gaaggtgcta
cacagtggcc acactctact tctgcttctg 9300ccacctatgg ggtcgaggca ggtgtggtgc
cttggctaag tccacagact tctgagaggc 9360ccacgctttc ttcttctcca gaaataaacc
ctgaaactca agcagcttta atcagagggc 9420aggattccac gatagcagca tcagaacagc
aagtggcagc gagaattctt gattccaatg 9480atcaggcaac agtaaaccct gtggaattta
atactgaggt tgcaacacca ccattttccc 9540ttctggagac ttctaatgaa acagatttcc
tgattggcat taatgaagag tcagtggaag 9600gcacggcaat ctatttacca ggacctgatc
gctgcaaaat gaacccgtgc cttaacggag 9660gcacctgtta tcctactgaa acttcctacg
tatgcacctg tgtgccagga tacagcggag 9720accagtgtga acttgatttt gatgaatgtc
actctaatcc ctgtcgtaat ggagccactt 9780gtgttgatgg ttttaacaca ttcaggtgcc
tctgccttcc aagttatgtt ggtgcacttt 9840gtgagcaaga taccgagaca tgtgactatg
gctggcacaa attccaaggg cagtgctaca 9900aatactttgc ccatcgacgc acatgggatg
cagctgaacg ggaatgccgt ctgcagggtg 9960cccatctcac aagcatcctg tctcacgaag
aacaaatgtt tgttaatcgt gtgggccatg 10020attatcagtg gataggcctc aatgacaaga
tgtttgagca tgacttccgt tggactgatg 10080gcagcacact gcaatacgag aattggagac
ccaaccagcc agacagcttc ttttctgctg 10140gagaagactg tgttgtaatc atttggcatg
agaatggcca gtggaatgat gttccctgca 10200attaccatct cacctatacg tgcaagaaag
gaacagtcgc ttgcggccag ccccctgttg 10260tagaaaatgc caagaccttt ggaaagatga
aacctcgtta tgaaatcaac tccctgatta 10320gataccactg caaagatggt ttcattcaac
gtcaccttcc aactatccgg tgcttaggaa 10380atggaagatg ggctatacct aaaattacct
gcatgaaccc atctgcatac caaaggactt 10440attctatgaa atactttaaa aattcctcat
cagcaaagga caattcaata aatacatcca 10500aacatgatca tcgttggagc cggaggtggc
aggagtcgag gcgctgatcc ctaaaatggc 10560gaacatgtgt tttcatcatt tcagccaaag
tcctaacttc ctgtgccttt cctatcacct 10620cgagaagtaa ttatcagttg gtttggattt
ttggaccacc gttcagtcat tttgggttgc 10680cgtgctccca aaacatttta aatgaaagta
ttggcattca aaaagacagc agacaaaatg 10740aaagaaaatg agagcagaaa gtaagcattt
ccagcctatc taatttcttt agttttctat 10800ttgcctccag tgcagtccat ttcctaatgt
ataccagcct actgtactat ttaaaatgct 10860caatttcagc accgatggcc atgtaaataa
gatgatttaa tgttgatttt aatcctgtat 10920ataaaataaa aagtcacaat gagtttgggc
atatttaatg atgattatgg agccttagag 10980gtctttaatc attggttcgg ctgcttttat
gtagtttagg ctggaaatgg tttcacttgc 11040tctttgactg tcagcaagac tgaagatggc
ttttcctgga cagctagaaa acacaaaatc 11100ttgtaggtca ttgcacctat ctcagccata
ggtgcagttt gcttctacat gatgctaaag 11160gctgcgaatg ggatcctgat ggaactaagg
actccaatgt cgaactcttc tttgctgcat 11220tcctttttct tcacttacaa gaaaggcctg
aatggaggac ttttctgtaa ccaggaacat 11280tttttagggg tcaaagtgct aataattaac
tcaaccaggt ctacttttta atggctttca 11340taacactaac tcataaggtt accgatcaat
gcatttcata cggatataga cctagggctc 11400tggagggtgg gggattgtta aaacacatgc
aaaaaaaaaa aaaaaaaaaa aaaaagaaat 11460tttgtatata taaccatttt aatcttttat
aaagttttga atgttcatgt atgaatgctg 11520cagctgtgaa gcatacataa ataaatgaag
taagccatac tgatttaatt tattggatgt 11580tattttccct aagacctgaa aatgaacata
gtatgctagt tatttttcag tgttagcctt 11640ttactttcct cacacaattt ggaatcatat
aatataggta ctttgtccct gattaaataa 11700tgtgacggat agaatgcatc aagtgtttat
tatgaaaaga gtggaaaagt atatagcttt 11760tagcaaaagg tgtttgccca ttctaagaaa
tgagcgaata tatagaaata gtgtgggcat 11820ttcttcctgt taggtggagt gtatgtgttg
acatttctcc ccatctcttc ccactctgtt 11880ttctccccat tatttgaata aagtgactgc
tgaagatgac tttgaatcct tatccactta 11940atttaatgtt taaagaaaaa cctgtaatgg
aaagtaagac tccttcccta atttcagttt 12000agagcaactt gaagaagagt agacaaaaaa
taaaatgcac atagaaaaag agaaaaaggg 12060cacaaaggga ttggcccaat attgattctt
tttttataaa acctcctttg gcttagaagg 12120aatgactcta gctacaataa tacacagtat
gtttaagcag gttcccttgg ttgttgcatt 12180aaatgtaatc cacctttagg tattttagag
cacagaacaa cactgtgttg atctagtagg 12240tttctatttt tcctttctct ttacaatgca
cataatactt tcctgtattt atatcataac 12300gtgtatagtg taaaatgtga atgacttttt
ttgtgaatga aaatctaaaa tctttgtaac 12360tttttatatc tgcttttgtt tcaccaaaga
aacctaaaat ccttctttta ctacac 124161231242DNAHomo sapiens
123ctagaggggc ggaaagtaac aaggaggtgg gggtacaaat cctcagctcc tgcttccgca
60agcactaacc tgctctgaag tgagccaggc agctctggcc atcttttccc agccacagaa
120tcaggtgatg gtccagaatt aagagctgtc acctgtgtca ttcactcaca atggaagaaa
180tgaagaagac tgccatccgg ctgcccaaag gcaaacagaa gcctataaag acggaatgga
240attcccggtg tgtccttttc acctacttcc aaggggacat cagcagcgta gtggatgaac
300acttctccag agctctgagc aatatcaaga gcccccagga attgaccccc tcgagtcaga
360gtgaaggtgt gatgctgaaa aacgatgata gcatgtctcc aaatcagtgg cgttactcgt
420ctccatggac aaagccacaa ccagaagtac ctgtcacaaa ccgtgccgcc aactgcaact
480tgcatgtgcc tggtcccatg gctgtgaatc agttctcacc gtccctggct aggagggcct
540ctgttcggcc tggggagctg tggcatttct cctccctggc gggcaccagc tccttagagc
600ctggctactc tcatcccttc cccgctcggc acctggttcc agagccccag cctgatggga
660aacgtgagcc tctcctaagt ctcctccagc aagacagatg cctagcccgt cctcaggaat
720ctgccgccag ggagaatggc aaccctggcc agatagctgg aagcacaggg ttgctcttca
780acctgcctcc cggctcagtt cactataaga aactatatgt atctcgtgga tctgccagta
840ccagccttcc aaatgaaact ctttcagagt tagagacacc tgggaaatac tcacttacac
900caccaaacca ctggggccac ccacatcgat acctgcagca tctttagtca agttggagga
960gaaagacaac acttggtcta agacacggca gcaagacatc cctgcatatt gttccagata
1020aaaatgaaag ctgctcacac ccacttgcct ccccaatctg ttaaacagct tcgtgtctag
1080tatgagctca gtacttgccc tgtgaaaatc ccagaagccc ccgctgtcaa tgttccccat
1140ccacaccctg cttgctcctg tgtaacagct cagatgatga ataataataa aactgtactt
1200ttttggatgg tgctaaaaaa aaaaaaaaaa aaaaaaaaaa aa
12421242832DNAHomo sapiens 124aattcagcac taaaaccagt gtgagaagca gtatgtgtga
gcatgtgtgt gtgtgtatgt 60gtgtgggggg gggtgtaaaa tgtgtttttc aaagtactga
gaggttgatg ggactgttcg 120attagctcct ctgagaagaa gagaaaaggt tcttggacct
ctccctgttt cttccttaga 180ataatttgga tgggatttgt gatgcaggaa agcctaaggg
aaaaagaata ttcattctgt 240gtggtgaaaa ttttttgaaa aaaaaattgc cttcttcaaa
caagggtgtc attctgatat 300ttatgaggac tgttgttctc actatgaagg catctgttat
tgaaatgttc cttgttttgc 360tggtgactgg agtacattca aacaaagaaa cggcaaagaa
gattaaaagg cccaagttca 420ctgtgcctca gatcaactgc gatgtcaaag ccggaaagat
catcgatcct gagttcattg 480tgaaatgtcc agcaggatgc caagacccca aataccatgt
ttatggcact gacgtgtatg 540catcctactc cagtgtgtgt ggcgctgccg tacacagtgg
tgtgcttgat aattcaggag 600ggaaaatact tgttcggaag gttgctggac agtctggtta
caaagggagt tattccaacg 660gtgtccaatc gttatcccta ccacgatgga gagaatcctt
tatcgtctta gaaagtaaac 720ccaaaaaggg tgtaacctac ccatcagctc ttacatactc
atcatcgaaa agtccagctg 780cccaagcagg tgagaccaca aaagcctatc agaggccacc
tattccaggg acaactgcac 840agccggtcac tctgatgcag cttctggctg tcactgtagc
tgtggccacc cccaccacct 900tgccaaggcc atccccttct gctgcttcta ccaccagcat
ccccagacca caatcagtgg 960gccacaggag ccaggagatg gatctctggt ccactgccac
ctacacaagc agccaaaaca 1020ggcccagagc tgatccaggt atccaaaggc aagatccttc
aggagctgcc ttccagaaac 1080ctgttggagc ggatgtcagc ctgggagaga tggactcatg
gaaacctgga tcggtccttt 1140tagatgaagg acttgttcca aaagaagaat tgagcacaca
gtctttggag ccagtatccc 1200tgggagatcc aaactgcaaa attgacttgt cgtttttaat
tgatgggagc accagcattg 1260gcaaacggcg attccgaatc cagaagcagc tcctggctga
tgttgcccaa gctcttgaca 1320ttggccctgc cggtccactg atgggtgttg tccagtatgg
agacaaccct gctactcact 1380ttaacctcaa gacacacacg aattctcgag atctgaagac
agccatagag aaaattactc 1440agagaggagg actttctaat gtaggtcggg ccatctcctt
tgtgaccaag aacttctttt 1500ccaaagccaa tggaaacaga agcggggctc ccaatgtggt
ggtggtgatg gtggatggct 1560ggcccacgga caaagtggag gaggcttcaa gacttgcgag
agagtcagga atcaacattt 1620tcttcatcac cattgaaggt gctgctgaaa atgagaagca
gtatgtggtg gagcccaact 1680ttgcaaacaa ggccgtgtgc agaacaaacg gcttctactc
gctccacgtg cagagctggt 1740ttggcctcca caagaccctg cagcctctgg tgaagcgggt
ctgcgacact gaccgcctgg 1800cctgcagcaa gacctgcttg aactcggctg acattggctt
cgtcatcgac ggctccagca 1860gtgtggggac gggcaacttc cgcaccgtcc tccagtttgt
gaccaacctc accaaagagt 1920ttgagatttc cgacacggac acgcgcatcg gggccgtgca
gtacacctac gaacagcggc 1980tggagtttgg gttcgacaag tacagcagca agcctgacat
cctcaacgcc atcaagaggg 2040tgggctactg gagtggtggc accagcacgg gggctgccat
caacttcgcc ctggagcagc 2100tcttcaagaa gtccaagccc aacaagagga agttaatgat
cctcatcacc gacgggaggt 2160cctacgacga cgtccggatc ccagccatgg ctgcccatct
gaagggagtg atcacctatg 2220cgataggcgt tgcctgggct gcccaagagg agctagaagt
cattgccact caccccgcca 2280gagaccactc cttctttgtg gacgagtttg acaacctcca
tcagtatgtc cccaggatca 2340tccagaacat ttgtacagag ttcaactcac agcctcggaa
ctgaattcag agcaggcaga 2400gcaccagcaa gtgctgcttt actaactgac gtgttggacc
accccaccgc ttaatggggc 2460acgcacggtg catcaagtct tgggcagggc atggagaaac
aaatgtcttg ttattattct 2520ttgccatcat gctttttcat attccaaaac ttggagttac
aaagatgatc acaaacgtat 2580agaatgagcc aaaaggctac atcatgttga gggtgctgga
gattttacat tttgacaatt 2640gttttcaaaa taaatgttcg gaatacagtg cagcccttac
gacaggctta cgtagagctt 2700ttgtgagatt tttaagttgt tatttctgat tagaactctg
taaccctcag caagtttcat 2760ttttgtcatg acaatgtagg aattgctgaa ttaaatgttt
agaaggatga catgcaaaaa 2820aaaaaaaaaa aa
28321251138DNAHomo sapiens 125cccttccctg cccgacaccc
agaccgacct tgaccgccca cctggcagga gcaggacagg 60acggccggac gcggccatgg
ccgagctccc ggggcccttt ctctgcgggg ccctgctagg 120cttcctgtgc ctgagtgggc
tggccgtgga ggtgaaggta cccacagagc cgctgagcac 180gcccctgggg aagacagccg
agctgacctg cacctacagc acgtcggtgg gagacagctt 240cgccctggag tggagctttg
tgcagcctgg gaaacccatc tctgagtccc atccaatcct 300gtacttcacc aatggccatc
tgtatccaac tggttctaag tcaaagcggg tcagcctgct 360tcagaacccc cccacagtgg
gggtggccac actgaaactg actgacgtcc acccctcaga 420tactggaacc tacctctgcc
aagtcaacaa cccaccagat ttctacacca atgggttggg 480gctaatcaac cttactgtgc
tggttccccc cagtaatccc ttatgcagtc agagtggaca 540aacctctgtg ggaggctcta
ctgcactgag atgcagctct tccgaggggg ctcctaagcc 600agtgtacaac tgggtgcgtc
ttggaacttt tcctacacct tctcctggca gcatggttca 660agatgaggtg tctggccagc
tcattctcac caacctctcc ctgacctcct cgggcaccta 720ccgctgtgtg gccaccaacc
agatgggcag tgcatcctgt gagctgaccc tctctgtgac 780cgaaccctcc caaggccgag
tggccggagc tctgattggg gtgctcctgg gcgtgctgtt 840gctgtcagtt gctgcgttct
gcctggtcag gttccagaaa gagaggggga agaagcccaa 900ggagacatat gggggtagtg
accttcggga ggatgccatc gctcctggga tctctgagca 960cacttgtatg agggctgatt
ctagcaaggg gttcctggaa agaccctcgt ctgccagcac 1020cgtgacgacc accaagtcca
agctccctat ggtcgtgtga cttctcccga tccctgaggg 1080cggtgagggg gaatatcaat
aattaaagtc tgtgggtacc aaaaaaaaaa aaaaaaaa 113812613203DNAHomo sapiens
126atgcctgggg agcgcccccg aggagcgccg ccccccacca tgactggaga cctgcagccc
60cgccaagttg ccagcagccc ggggcacccc tcccagccgc cactggagga caacacccca
120gctaccagga ccaccaaggg tgccagggag gctggcggcc aggcccaggc catggagctc
180cccgaggccc agccaaggca ggccagggac ggggagctca agcccccatc cctgagaggc
240caggccccga gcagcacccc tgggaagagg ggcagccccc agaccccacc ggggagaagc
300cccttgcagg ctccctcaag gctggcgggc agggcagagg gcagcccccc acagcgctac
360attctgggca tcgccagctc gaggaccaag cccaccctgg acgagacacc agagaaccca
420cagctggagg ctgcccagct ccctgaggtg gacacccccc agggccctgg gactggagct
480ccactcaggc cgggcctccc aaggactgag gcccaacccg ccgccgaaga gcttggcttc
540cacaggtgct tccaggagcc accctccagc tttacctcca ccaactatac ctcaccaagc
600gccaccccca ggcccccagc cccggggccc ccccagagca ggggcaccag ccccctccag
660cccggttcct atcccgaata ccaggccagt ggggccgact cctggcctcc cgctgctgag
720aatagcttcc caggtgctaa tttcggggtt ccccccgccg agccggaacc tattcccaaa
780ggcagcaggc ccggcggcag ccccagggga gtttccttcc agttcccctt cccggcactg
840catggggcca gcacaaaacc cttccctgcg gatgtggctg ggcacgcatt caccaatggg
900ccactggtgt ttgccttcca tcagccccag ggagcgtggc cggaggaggc cgtgggcacg
960ggccctgcct acccgctgcc cacccagcct gcgccctcac ccctgccctg ctaccagggc
1020cagccaggtg gcctgaaccg ccacagcgac ctcagtggtg ccctctcttc ccctggagct
1080gctcactcgg ccccgagacc cttctctgac agtttacaca agagcctgac caaaatcctt
1140cccgaaagac caccttcagc ccaggatggg ctggggagca cgagagggcc ccctagctcc
1200ctaccccaga ggcactttcc agggcaggcg tacagagcca gtggggtgga caccagcccg
1260gggcctccgg acaccgagct ggccgcccca gggcccccac ccgccaggct gccccagctg
1320tgggacccca cagcagcccc ttaccccaca cctcctgggg gccccctggc tgccaccagg
1380agtatgttct ttaacggcca gcccagccca ggccagcggc tctgcctccc ccagagtgcc
1440cccctgcctt ggccccaagt gctcccgacc gcccggccaa gtccccacgg aatggagatg
1500ctgagccggc tgcctttccc cgcggggggc cccgagtggc aggggggcag ccaaggagcc
1560ctgggcactg ctggcaagac accgggaccc agagagaagc tgccagccgt gagaagcagc
1620cagggcggct ccccagcact gttcacctac aacggaatga cagaccctgg ggctcagccc
1680ctgttcttcg gggtggccca gccccaggtt tcaccccacg ggacacccag cctgccccca
1740ccgagggtag tgggagcctc ccccagcgag tccccactgc cgtcaccggc caccaacacg
1800gccggcagca cctgctcttc cctgtcgccg atgtccagca gcccagccaa ccccagctca
1860gaggaaagcc agctccccgg ccccctcggg ccctcggcct tcttccaccc acccactcac
1920ccccaggaga cgggcagccc cttcccgtcc ccggagcccc cccactccct ccccacccac
1980taccagccag agccagccaa ggccttccct tttcccgcag atgggctggg agccgagggt
2040gccttccagt gcctggagga gaccccattc ccccacgagg gccccgaggt gggtcgggga
2100gggctgcagg gcttcccccg tgcgccgcct ccgtacccca cacaccactt ctccctcagc
2160agcgccagcc tggaccagct ggacgtgctg ctgacctgca ggcagtgtga ccgcaactac
2220agcagcctgg cggccttcct ggcccaccgg cagttctgtg gcctgctcct ggccagggcc
2280aaggatggcc accagcggtc tccaggcccc cctgggctcc cctcgccccc cgctgccccc
2340agagtccctg ccgacgcaca cgcgggcttg ctcagccacg cgaagacctt cctgttagct
2400ggggacgccc aggccgaggg caaagacgac cccctgagga caggcttcct gcccagcctg
2460gccgccaccc ccttcccgct ccctgcctcg gacctggaca tggaggatga cgccaagctg
2520gacagcctca tcacagaggc gctcaacggc atggagtacc agtcggacaa cccggagatc
2580gacagcagct tcatcgacgt cttcgcggac gaggagcctt ccggccccag aggtcccagc
2640tccggacacc cccttaagag caaggcgggg gtgactccag agagcaaagc tccgcccccg
2700ctcccagcag ccacgccgga cccccaaacc ccccgccctg gggacagggg ctgcccagcc
2760cgaggcaggc ccaaaacgcg ttccctgggt ctggccccca ccgaggcgga tgcgcccagc
2820cagggcaggc agcagaggag ggggaagcag ttgaagctgt tccggaagga tctggactcg
2880ggcggcgcag cagaggggtc ggggtcgggc ggcggcggca gagcctccgg cctgaggccc
2940cggaggaacg acggtctcgg ggagcggccc ccaccccgtc cccggcgccc tagaacgcag
3000gcccccggga gccgcgcaga ccccgcgccc cgggtcccga gagccgccgc cctccccgag
3060gagacccgca gctcccggcg ccgccggctg ccccccagga aggaccccag gaagaggaag
3120gctcggggcg gcgcctgggg caaggagctc attctgaaga tcgtgcagca gaagaacagg
3180ctccgcgagt acgacttcgc ctcggagtcc gaggaggacg agcagcctcc gccgcggggc
3240cccggcttca gaggccggcg gggccgaggc gagaagagga aggaagtgga gctgacccag
3300ggtcccagag aggatgagcc acagaaaccc cggaaggcgg cgaggcagga agccggcggg
3360gacggagccc ccgcgaaccc cgaggagccg ggcgggtctc gcccgggccc cggcaggagc
3420cctcaggccc gtggcccgtc tcgaagcctg gagacgggag cggccgccag ggagggaggc
3480cccaagtgtg ctgatcgccc ctcagtggcc cccaaggatc ccctgcaggt ccccaccaac
3540accgagacct cagaggaaac ccgcccgtcg ctggactttc cccaggaggc caaggagcct
3600gaaactgccg aagagtcagc cccggacagc acagaattca cagaggcttt gcgttctcct
3660ccagccgcct gtgcgggaga aatgggagca agccccggtc tcctgatacc agagcagccg
3720ccgcccagca gacatgacac cggcaccccc aagccgtcgg gaagcctcgc caacacggcg
3780ccccacggaa gctcgccaac gccaggtgtg ggcagcctgc tgggtggtcc tgggggcaca
3840caggccccag tctcccacaa cagcaaggac ccccctgccc gccagcctgg agaatttctg
3900gcacccgtgg ctaacccctc aagtaccgcc tgccccaaac ccagtgttct gtcttcaaag
3960atctccagtt ttggctgtga ccctgctggt tttaacagag accccttggg ggttccagtt
4020gccaaaaagg ggcctcagcc ctacagcagc ccccacagtg agttgttcct cggacccaaa
4080gacctggctg gctgtttcct ggaagaactg caccccaagc cctcagccag ggatgccccg
4140ccggccagca gctcctgcct ttgccaggac ggcgaggatg ccggttccct cgagccacag
4200ctgccaagga gcccacctgg caccgctgag acggagccag gcagggctgc atcgccaccg
4260accttggagt cctcatccct cttcccagac ctgccggtgg acagattcga cccacccctc
4320tatggcagcc tgtctgcgaa cagggactcc ggtctgccgt tcgcatgtgc cgaccctccc
4380cagaagacgg tgccgtcaga tccaccgtac ccctcttttt tgctgcttga ggaagtatcc
4440ccgatgctgc ctagccattt tcctgatctc tcggggggaa aggtgctcag taagacgtgt
4500ccccctgaac ggacagtggt tcccggcgcc gccccatctt tgcctgggaa ggggagtgga
4560tgtagcgttg ctcttatgag tcacctgtcc gaggatgaac tggagatcca gaaattggtc
4620accgaattag aaagtcagct gcaaaggagc aaagacacac gtggggcccc gagagagctt
4680gcagaagctg agtcggtggg cagggtggag ctcggcacag gcacagagcc accctcccaa
4740cggcgcacct gccaggccac cgtgccccac gaggacacgt tctcggcagc tgacctcacg
4800cgcgttggag aatccactgc acatcgggag ggtgcggaat cggctgtggc caccgtggaa
4860gcggttcagg ggaggcctgg ggggacgtgg ccctgcccag cctccttcca tccgggacat
4920gcagcccttc tcccctgtgc ccaggaagac ctggtttctg gggctccttt cagccccagg
4980ggagccaact tccattttca gccagtgcag aaagccggag cctccaagac tggactttgc
5040caggcagaag gagacagcag gcccccccaa gatgtctgcc tgcctgagcc cagcaagcag
5100cctggcccac agctggatgc cgggagttta gcaaagtgca gccccgacca ggaactttca
5160tttcctaaga ataaggaggc cgccagctca caagaaagtg aagactccct gcggctgctt
5220ccctgtgaac agagaggagg gttcctccca gagcccggca cagcagacca gccccaccga
5280ggggcccctg ctccagaagc ttttggcagc cctgctgtcc atctggcccc tgacttggca
5340tttcagggtg acggggctcc acctctggat gccacctggc cttttggtgc cagtcccagc
5400catgctgccc agggacattc tgcaggcaga gcaggtgggc acctccaccc cacggcaggg
5460aggcctggct ttgagggtaa tgagtttgca ccggcggggg cctcctcact gactgccccc
5520cggggcaggg aggcttggtt ggtccctgtg ccaagtcccg cctgtgtatc caacacccac
5580cctagcagga ggtcccagga cccagctttg agccccccca tacgtcagct ccagctccca
5640gggcctggag tggctaagag taaagatggc atcctgggct tgcaggagct gacacctgct
5700gcccagagcc ctccacgagt gaacccctca ggtctggaag ggggcactgt ggaaggaggg
5760aaggtggcct gtggccccgc ccagggctcc ccagggggtg tgcaggtgac aactctccct
5820gcagtggccg gacatcagct ggggctggag gcagatggac attggggctt gcttggccaa
5880gccgagaaaa cccagggcca aggcacagcc aaccagcttc agccagagaa cggggtgagc
5940ccagggggca cggacaacca cgcctcagtc aatgccagtc ccaaaacagc gctgaccggc
6000cccaccgagg gtgcagtcct gctagagaaa tgcaagggaa gcagggcagc catgagcctt
6060caggaggagg ccgagcccac cccaagcccc ccgtccccta atagggagtc cctggcgctg
6120gccttgacag cagcccacag ccgaagtgga tctgagggcc ggactccaga gagggcgtcc
6180agccccggcc tgaacaagcc actgctggcc acaggggata gcccagcacc ctctgtcggg
6240gacctggccg cctgcgcccc ctcacccact tcagccgccc acatgccctg cagccttggg
6300cccctgcccc gtgaagaccc acttacctcg ccttccaggg cccaaggtgg gctggggggg
6360cagctgccag catctccgtc ctgcagggac cctcccggcc cccagcagct gctggcctgt
6420tctcctgcct gggcacctct ggaagaggca gatggcgtcc aagccacgac agatactggg
6480gctgaggatt ccccggtggc tcccccgtct ttgacaacaa gcccctgcga tcccaaggaa
6540gccctggctg gttgccttct ccagggggag ggcagccccc tggaagaccc ttcctcctgg
6600cctcctggct ccgtcagtgc tgtaacctgc actcacagtg gggacacccc caaagacagc
6660actttaagaa ttccagagga ttccagaaaa gagaagctgt gggagtctcc tggccgagcc
6720acctctcctc ctctggcagg ggccgtctcc cccagcgtgg ccgtcagggc tactggcctg
6780tccagcactc ccaccggaga tgaggcacag gcaggcaggg gactcccagg gccagacccc
6840cagagcaggg gagccccgcc ccacaccaac cctgacagga tgcccagggg ccactcctcg
6900tattctccaa gcaatactgc ccgcctcggc cacagggagg gccaggctgt cacagctgtg
6960cccactgagc ctcccacgct acagggtgca gggccggact cccccgcctg cctggaaggt
7020gagatgggga ccagcagcaa ggagccggag gacccaggga cccctgagac cgggcgctct
7080ggtgctacca agatgcccag ggtcacctgc ccttccacag gactgggctt gggaagaacc
7140acagccccaa gcagcacagc cagtgacttc cagtctgact ccccccaaag ccacagaaat
7200gcctcccacc agactcccca gggggacccc ctcggccccc aagacctcaa acagaggtcc
7260cgtggctata aaaagaagcc tgcatctaca gagaacggcc agtggaaggg ccaagctcca
7320catgggcctg tgacctgtga ggtctgcgca gcctccttcc gctccgggcc gggcctgagc
7380cggcacaagg ccaggaagca ccggccacac ccgggagccc ccgcggagcc gagcccagcg
7440gccttgcctg ctcagcagcc tctagagccc ctagcccaaa agtgccagcc gcccaggaag
7500aaaagccaca gggtgtctgg gaaggagaga ccaaatcact cacggggaga ccccagccac
7560gtcacccagc caccgcctgc ccagggctca aaggaggttc tcagagcacc ggggtcccca
7620cacagccagc agctgcaccc tccaagccct actgagcatg aggtagatgt gaagactccg
7680gcctccaagc ccagaccaga ccaggccagg gaagatgagc tgcatcccaa acaggcagaa
7740aaaagagaag gccggaggtg gcgccgagag cccaccgtgg actctcctag ccactcagag
7800gggaagtcaa ataagaaaag gggaaagctg agagggagaa ggctccggga ggagagcatt
7860cttccagtct ctgctgatgt gatttcagat gggcgcggct ccagaccatc ccctgcaatg
7920gccagttacg cagcctctcc gagccactgc ctctctgtgg aaggagggcc tgaggctgac
7980ggggagcagc cgcctcgctt ggccactctg ggacctgggg tgatggaggg tgcagcggag
8040actgaccagg aggctctgtg tgcaggggag actggggccc agaagccacc tggagatcgg
8100atgctgtgtc cagggaggat ggatggtgca gctctggggg aacagccaac tgggcagaag
8160ggagcctcgg caagggggtt ctggggacca agagagacca aggcgttggg tgtgtgcaaa
8220gagtctggga gcgagcctgc ggaggacagc agcagggccc acagccgatc agaggaaggt
8280gtctgggagg agaacacgcc ccccttgggc cccctgggtt ttcccgagac ttccagctct
8340ccggcggaca gcaccaccag cagctgcctc cagggcctcc cggacaaccc agacacccag
8400ggtggagtcc aggggcctga aggccccact cctgatgcct ctggctccag tgccaaggat
8460cctccaagct tgtttgatga tgaggtctct ttctcccagc tcttccctcc aggcggtcgc
8520ttgactagaa agaggaaccc gcatgtctac gggaagcgct gtgagaagcc ggtgctcccg
8580ctgccaaccc agcccagctt tgaggagggc ggtgacccca cgctgggccc agcccgcctg
8640cccacggacc tcagcgactc cagctccctc tgcctctgcc atgaggaccc gtgggaggac
8700gaggatcccg caggtctgcc cgagtccttc ctcctggatg ggttcctcaa tagcagggtg
8760cctggcattg acccctgggc ccccggcctc agcctgtggg ccctggagcc cagcagggaa
8820gctggtgcag agaagctgcc ctcccactgc cccgaggacg atcggccgga ggccattcct
8880gagctgcaca tggtcccagc ggcttggcga ggcctggaga tgccggcccc tgccgatgac
8940tcctcctctt ctctcggaga tgtgagcccc gagcccccca gcctggagag agaacgctgt
9000gacggtgggc ttcccgggaa cacccacctg ctgccgctcc gtgccacgga ctttgaggtg
9060ctcagcacca agtttgagat gcaagacctg tgctttctgg gaccctttga agaccccgtg
9120ggtctccccg gccccagctt cttagacttc gagggcacgg cgagctcaca ggggccacag
9180agccgaagga cagaggaggc tgcaggggca gggagggccc aaggcagagg ccggccggcc
9240aagggcaggc gggcctccta caagtgcaaa gtgtgcttcc agcgcttccg cagcctgggc
9300gagctggacc tgcacaagct ggcccacacg cccgcgccgc cgcccacctg ctacatgtgc
9360gtggagcgca ggtttggctc gcgggagctg ctgcgggggc acctgcagga gaggcacgcg
9420cagagcaagg ccgggccctg ggcgtgcggc atgtgcctga aggaggtggc cgacgtctgg
9480atgtacaacg agcacctgcg tgagcacgcg gtccgcttcg cccgcagggg gcaggcgcgg
9540aggtccttgg gggacctgcc cggaggcctg gagggcagca gcgctgtcgc ccaccttctg
9600aacagcatca cggaacccgc gcccaaacac cacaggggca agcgctccgc cggcaaggcc
9660gccgggagcc cgggagaccc gtgggggcaa gagggagaag ccaagaaaga cagcccgggc
9720gagagggcga aaccccgggc acgcagcacc cccagcaacc cagacggggc cgcgacccca
9780gacagcgcct ctgccaccgc cctggctgac gccggcagcc cgggcccccc caggacgacc
9840cccagcccgt cccccgaccc ctgggccggc ggggagcccc tcctgcaagc caccccggtg
9900cacgaggcct gcaaggaccc ctcccgcgac tgccaccact gcgggaagcg cttccccaag
9960cccttcaagc tgcagcgcca cctggcggtg cacagcccgc agcgcgtcta cctgtgcccc
10020cggtgccccc gggtctaccc cgagcacggg gagctgctgg cacacctggg cggggcgcac
10080gggctgctgg agcggccgga gctgcagcac acgccgctgt atgcctgcga gctctgcgcc
10140acggttatgc gcatcatcaa gaagtccttc gcctgcagct cctgcaacta caccttcgcc
10200aagaaggagc agttcgaccg ccacatgaac aagcacctca ggggggggcg gcagcccttc
10260gcgttccgcg gcgtgcggag gccgggagcg ccgggacaga aggcccgggc cctcgagggc
10320acactgccca gcaaacggcg cagggtggcc atgcccggca gtgcccctgg gcccggcgag
10380gacaggcctc ctccccgggg aagcagcccc atcctgagtg agggctctct cccggccctg
10440ctccacctgt gttcggaggt ggctcccagc accaccaagg gatggcccga gaccctagag
10500aggcctgtag accccgtgac ccacccgatc agaggttgtg agctgccatc caaccaccag
10560gagtgtcccc cgccgtctct gtctcccttc ccagctgcct tggctgatgg cagaggagac
10620tgcgcgctgg acggagccct ggagaggcca gagaacgagg cttccccagg cagccccggg
10680cctcttctcc agcaagctct ccctctgggg gcatctctgc cgcggccggg agccagaggc
10740caagatgcgg agggaaagag ggctcctctc gtgttctcag ggaaacgcag ggccccgggt
10800gcccgtggca ggtgtgcccc tgaccatttc caggaagacc acctacttca gaaagagaag
10860gaggtgtcct caagccacat ggtgtctgag ggggggcccc gaggcacctt ccacaagggc
10920agcgccacca agcctgcggg ctgccagagc tcatcaaagg acaggtcggc agcatccacc
10980cccagcaaag cactcaagtt cccagtgcac ccaaggaagg cggtggggag cctggcaccc
11040ggggagctgg cccgtggcac agagaatggg atgaagcccg ccacccccaa agccaaaccc
11100ggccccagct cccagggcag tggaagccct cgccccggca ccaagacagg aggtggcagc
11160cagccccagc cagccagcgg gcagctccag agcgagacag ccaccacccc agccaagccc
11220agcttcccca gccggagccc tgcaccagag aggctccccg ctcgagccca agccaagagc
11280tgcaccaagg ggccaaggga agctggtgag caggggcccc acgggagcct aggtcccaag
11340gagaagggag agagcagtac gaagaggaaa aagggccagg tcccagggcc agccaggagt
11400gaaagtgtgg ggagcttcgg gagagccccc tcagcccctg acaagccccc ccggacccct
11460cggaagcagg caactcccag ccgcgtgctc ccgaccaagc ccaagcccaa cagccagaac
11520aaacccaggc cgccaccatc agagcagcgg aaggcagagc cgggccacac acagaggaag
11580gacagactgg gcaaggcctt cccccagggg agacccctgc tcaggccccc caagaggggc
11640acagctgtcc acggtgctga acctgccgag ccacacaccc accggacggc cgaggcccag
11700agtgacctcc tcagccagct cttcgggcag agactaactg gcttcaaaat ccctttaaag
11760aaagatgctt ccgagtaatt tctaggagca agagcctggg accggagctg ggcgttcctg
11820tctcggcctg cctccttggc cagctccggc tccctgagat ggtccactct gtggccactt
11880gacttcttgt gcaactgctc aggccttgat gtcagagctg aggtggtgat gctttgaaca
11940gggcccaggt gggcagcatt ccctttcttg ctggaaggct gggggtgaaa gacggggcca
12000ctgcagccct tttgagacca cacagctgtt ttcttggtac caagtacttg aagagacagc
12060agcccatccc ctcagcccac acccctgcgc cctgtgggca ccgacaccac agaagccaat
12120gtttggagat ttgcacaacc tcccgttccc ccacatggag aagggaagta agttgaggca
12180gccgtgggat ggtggtaggt tccctcttag tcttgctgct gttgctggaa ttccaaagtg
12240accttagaaa ccacgtgggg gaaggcagtg ctcactactt agaagggttg cttctgagcc
12300gcctggtccc ccaagagcac aacaggcctc ctccctctga ccacagggtc atgcctcctc
12360cctctgacca cagggtcatg ccagcctcca tttgctctgc gggagaaaag cccatctcta
12420gcacaccttg accccaggaa ccgggttccc gtatggaact gggaagaaac cgcccctgtg
12480ccagctcccg cgggccctcc tcgttccctc ccagcctcca tggccgccct ctagagcctc
12540cctgctgtac ggagctctgg gctccgccta tttgcaatgt tactctgaag tttctggtgc
12600tatttttgtg ttgtaatgtg aatacaggct tccttgattt ttttttttaa tgggggtatt
12660gggtgggaca gacggggtca gggaggcccc accatggctt gtcgagggca cgggcacctg
12720catggcggcg ctctccctgc ctcccctgcc gggctgcaag cctgaggtct gtgctgccag
12780acggggatgc tcagggctgg ggctgcagag ccgctgccct ggccagggca ccctcatgca
12840ccgacccaac ccaggcctgg gacgcacgtg tcctctcaca gcgtcgtgcc tgtgaaggtg
12900ggtcaaaggg tgagagggct tccttctcac ccttctctcc ataagtatct tgaagatcca
12960tggtttgttt tgctctattg tttagttttt acttgggtgc aatgtgtacg tcaaaagttt
13020ttattttgat atttgaaaga gaccaaatca ggcccagacc gcctctctgg aaggtgttgt
13080aggccattca aaacgcctcc ggagtgtcgc aaaccaagtg cggaggggcc ctgaggttgt
13140actgtaaaca tcatagtgac ttgtcttttc aaatatattc ccactatttt cgcagaaaac
13200ctc
13203
User Contributions:
Comment about this patent or add new information about this topic: