HUMHBB

LA home
Computing
Algorithms
 glossary
 Dynamic Prog.
  Edit dist.
  Hirschberg's
  e.g.

Also see
Bioinformatics
 Alignment

Example

HUMHBB, an example entry from `Genbank'. The DNA sequence proper starts about half way through the file after the bibliographic entries and annotations. As of 1998 this is no longer considered a particularly long sequence, with some complete chromosomes, ~1 million base-pairs, having been sequenced for other organisms. As of June 2000, a preliminary map of the entire human genome had been produced.




LOCUS       HUMHBB      73326 bp ds-DNA             PRI       10-OCT-1991
DEFINITION  Human beta globin region on chromosome 11.
ACCESSION   J00179 J00093 J00094 J00096 J00158 J00159 J00160 J00161 J00162
            J00163 J00164 J00165 J00166 J00167 J00168 J00169 J00170 J00171
            J00172 J00173 J00174 J00175 J00177 J00178 K01239 K01890 K02544
            M18047 M19067 M24868 M24886 X00423 X00424 X00672
KEYWORDS    Alu repetitive element; HPFH; KpnI repetitive sequence;
            RNA polymerase III; allelic variation; alternate cap site;
            beta-1 pseudogene; beta-globin; delta-globin; epsilon-globin;
            gamma-globin; gene duplication; globin; polymorphism;
            promoter mutation; pseudogene; repetitive sequence; thalassemia.
SOURCE      Human mRNA, cDNA and DNA.
  ORGANISM  Homo sapiens
            Eukaryota; Animalia; Metazoa; Chordata; Vertebrata; Mammalia;
            Theria; Eutheria; Primates; Haplorhini; Catarrhini; Hominidae.
REFERENCE   1  (bases 62427 to 62649; 63500 to 63628)
  AUTHORS   Marotta,C.A., Forget,B.G., Weissman,S.M., Verma,I.M.,
            McCaffrey,R.P. and Baltimore,D.
  TITLE     Nucleotide sequences of human globin messenger RNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 71, 2300-2304 (1974)
  STANDARD  full staff_review
REFERENCE   2  (bases 63620 to 63664)
  AUTHORS   Forget,B.G., Marotta,C.A., Weissman,S.M. and Cohen-Solal,M.
  TITLE     Nucleotide sequences of the 3'-terminal untranslated region of
            messenger RNA for human beta globin chain
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 72, 3614-3618 (1975)
  STANDARD  full staff_review
REFERENCE   3  (bases 63611 to 63644)
  AUTHORS   Proudfoot,N.J. and Brownlee,G.G.
  TITLE     Nucleotide sequences of globin messenger RNA
  JOURNAL   Br. Med. Bull. 32, 251-256 (1976)
  STANDARD  simple staff_entry
REFERENCE   4  (bases 63691 to 63761)
  AUTHORS   Proudfoot,N.J. and Longley,J.I.
  TITLE     The 3' terminal sequences of human alpha and beta globin messenger
            RNAs: Comparison with rabbit globin messenger RNA
  JOURNAL   Cell 9, 733-746 (1976)
  STANDARD  full staff_review
REFERENCE   5  (sites)
  AUTHORS   Proudfoot,N.J. and Brownlee,G.G.
  TITLE     Non-coding region sequences in eukaryotic messenger RNA
  JOURNAL   Nature 263, 211-214 (1976)
  STANDARD  full staff_review
REFERENCE   6  (bases 63614 to 63761)
  AUTHORS   Proudfoot,N.J.
  TITLE     Complete 3' noncoding region sequences of rabbit and human
            beta-globin messenger RNA's
  JOURNAL   Cell 10, 559-570 (1977)
  STANDARD  full staff_review
REFERENCE   7  (bases 62155 to 62211)
  AUTHORS   Baralle,F.E.
  TITLE     Complete nucleotide sequence of the 5' noncoding region of human
            alpha- and beta-globin mRNA
  JOURNAL   Cell 12, 1085-1095 (1977)
  STANDARD  full staff_review
REFERENCE   8  (sites)
  AUTHORS   Marotta,C.A., Forget,B.G., Cohen-Solal,M., Wilson,J.T. and
            Weissman,S.M.
  TITLE     Human beta-globin messenger RNA: I. Nucleotide sequences derived
            from complementary RNA
  JOURNAL   J. Biol. Chem. 252, 5019-5031 (1977)
  STANDARD  full staff_review
REFERENCE   9  (bases 62205 to 63628)
  AUTHORS   Cohen-Solal,M., Forget,B.G., Prensky,W., Marotta,C.A. and
            Weissman,S.M.
  TITLE     Human beta-globin messenger RNA: II. Nucleotide sequences derived
            from 125-I-labeled globin messenger RNA
  JOURNAL   J. Biol. Chem. 252, 5032-5039 (1977)
  STANDARD  full staff_review
REFERENCE   10 (bases 54808 to 54899; 62427 to 62649; 63500 to 63733)
  AUTHORS   Marotta,C.A., Wilson,J.T., Forget,B.G. and Weissman,S.M.
  TITLE     Human beta-globin messenger RNA. III. Nucleotide sequences derived
            from complementary DNA
  JOURNAL   J. Biol. Chem. 252, 5040-5053 (1977)
  STANDARD  full staff_review
REFERENCE   11 (bases 62155 to 62207)
  AUTHORS   Chang,J.C., Temple,G.F., Poon,R., Neumann,K.H. and Kan,Y.W.
  TITLE     The nucleotide sequences of the untranslated 5' regions of human
            alpha- and beta-globin mRNAs
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 74, 5145-5149 (1977)
  STANDARD  full staff_review
REFERENCE   12 (bases 56121 to 56202)
  AUTHORS   Lawn,R.M., Fritsch,E.F., Parker,R.C., Blake,G. and Maniatis,T.
  TITLE     The isolation and characterization of linked delta- and beta-
            globin genes from a library of human DNA
  JOURNAL   Cell 15, 1157-1174 (1978)
  STANDARD  full staff_review
REFERENCE   13 (bases 35872 to 35964; 63500 to 63549)
  AUTHORS   Little,P., Curtis,P., Coutelle,C., Van Den Berg,J., Dalgleish,R.,
            Malcolm,S., Courtney,M., Westaway,D. and Williamson,R.
  TITLE     Isolation and partial sequence of recombinant plasmids containing
            human alpha-, beta- and gamma-globin cDNA fragments
  JOURNAL   Nature 273, 640-643 (1978)
  STANDARD  full staff_review
REFERENCE   14 (bases 62205 to 63760)
  AUTHORS   Wilson,J.T., Wilson,L.B., DeRiel,J.K., Villa-Komaroff,L.,
            Efstratiadis,A., Forget,B.G. and Weissman,S.M.
  TITLE     Insertion of synthetic copies of human globin genes into bacterial
            plasmids
  JOURNAL   Nucleic Acids Res. 5, 563-581 (1978)
  STANDARD  full staff_review
REFERENCE   15 (bases 34496 to 34581; 39432 to 39517)
  AUTHORS   Chang,J.C., Poon,R., Neumann,K.H. and Kan,Y.W.
  TITLE     The nucleotide sequence of the 5' untranslated region of human
            gamma-globin mRNA
  JOURNAL   Nucleic Acids Res. 5, 3515-3522 (1978)
  STANDARD  full staff_review
REFERENCE   16 (bases 40857 to 41003)
  AUTHORS   Poon,R., Wai,K. and Boyer,H.W.
  TITLE     Sequence of the 3' noncoding and adjacent regions of human
            gamma-globin mRNA
  JOURNAL   Nucleic Acids Res. 5, 4625-4630 (1978)
  STANDARD  full staff_review
REFERENCE   17 (bases 35844 to 35925; 40760 to 40841)
  AUTHORS   Smithies,O., Blechl,A.E., Denniston-Thompson,K., Newell,N.,
            Richards,J.E., Slightom,J.L., Tucker,P.W. and Blattner,F.R.
  TITLE     Cloning human fetal gamma globin and mouse alpha-type globin DNA:
            Characterization and partial sequencing
  JOURNAL   Science 202, 1284-1289 (1978)
  STANDARD  full staff_review
REFERENCE   18 (bases 34496 to 34554; 62155 to 62210; 63626 to 63760)
  AUTHORS   Kan,Y.W., Chang,C.S. and Poon,R.
  TITLE     Nucleotide sequences of the untranslated 5' and 3' regions of human
            alpha-, beta-, and gamma-globin mRNAS
  JOURNAL   (in) Stamatoyannopoulos,G. and Newhouse,A.W. (Eds.);
            Cellular and Molecular regulation of hemoglobin switching:  0-0,
            Grune and Stratton, New York (1979)
  STANDARD  simple staff_entry
REFERENCE   19 (sites)
  AUTHORS   Chang,J.C. and Kan,Y.W.
  TITLE     Beta-0 thalassemia, a nonsense mutation in man
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 76, 2886-2889 (1979)
  STANDARD  full staff_review
REFERENCE   20 (bases 19864 to 19983)
  AUTHORS   Proudfoot,N.J. and Baralle,F.E.
  TITLE     Molecular cloning of human epsilon-globin gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 76, 5435-5439 (1979)
  STANDARD  full staff_review
REFERENCE   21 (sites)
  AUTHORS   Chang,J.C., Kan,Y.W., Trecartin,R.F. and Temple,G.F.
  TITLE     Nonsense mutation as a cause of beta-0 thalassemia
  JOURNAL   Ann. N.Y. Acad. Sci. 344, 113-119 (1980)
  STANDARD  full staff_review
REFERENCE   22 (sites)
  AUTHORS   Fritsch,E.F., Lawn,R.M. and Maniatis,T.
  TITLE     Molecular cloning and characterization of the human beta-like
            globin gene cluster
  JOURNAL   Cell 19, 959-972 (1980)
  STANDARD  full staff_review
REFERENCE   23 (bases 19480 to 21759)
  AUTHORS   Baralle,F.E., Shoulders,C.C. and Proudfoot,N.J.
  TITLE     The primary structure of the human epsilon-globin gene
  JOURNAL   Cell 21, 621-626 (1980)
  STANDARD  full staff_review
REFERENCE   24 (bases 34440 to 36087; 39376 to 41004)
  AUTHORS   Slightom,J.L., Blechl,A.E. and Smithies,O.
  TITLE     Human fetal G Gamma- And A gamma-globin genes: complete nucleotide
            sequences suggest that DNA can be exchanged between these
            duplicated genes
  JOURNAL   Cell 21, 627-638 (1980)
  STANDARD  full staff_review
REFERENCE   25 (bases 54636 to 56620)
  AUTHORS   Spritz,R.A., DeRiel,J.K., Forget,B.G. and Weissman,S.M.
  TITLE     Complete nucleotide sequence of the human delta-globin gene
  JOURNAL   Cell 21, 639-646 (1980)
  STANDARD  full staff_review
REFERENCE   26 (bases 62052 to 64101)
  AUTHORS   Lawn,R.M., Efstratiadis,A., O'Connell,C. and Maniatis,T.
  TITLE     The nucleotide sequence of the human beta-globin gene
  JOURNAL   Cell 21, 647-651 (1980)
  STANDARD  full staff_review
REFERENCE   27 (sites)
  AUTHORS   Efstratiadis,A., Posakony,J.W., Maniatis,T., Lawn,R.M.,
            O'Connell,C., Spritz,R.A., DeRiel,J.K., Forget,B.G., Weissman,S.M.,
            Slightom,J.L., Blechl,A.E., Smithies,O., Baralle,F.E.,
            Shoulders,C.C. and Proudfoot,N.J.
  TITLE     The structure and evolution of the human beta-globin gene family
  JOURNAL   Cell 21, 653-668 (1980)
  STANDARD  full staff_review
REFERENCE   28 (bases 17841 to 20001)
  AUTHORS   Baralle,F.E., Shoulders,C.C., Goodbourn,S., Jeffreys,A. and
            Proudfoot,N.J.
  TITLE     The 5' flanking region of human epsilon-globin gene
  JOURNAL   Nucleic Acids Res. 8, 4393-4404 (1980)
  STANDARD  full staff_review
REFERENCE   29 (bases 62472 to 62631)
  AUTHORS   Orkin,S.H., Goff,S.C. and Nathan,D.G.
  TITLE     Heterogeneity of DNA deletion in gamma-delta-beta-thalassemia
  JOURNAL   J. Clin. Invest. 67, 878-884 (1981)
  STANDARD  full staff_review
REFERENCE   30 (sites)
  AUTHORS   Trecartin,R.F., Liebhaber,S.A., Chang,J.C., Lee,K.Y., Kan,Y.W.,
            Furbetta,M., Angius,A. and Cao,A.
  TITLE     Beta-0 thalassemia in Sardinia is caused by a nonsense mutation
  JOURNAL   J. Clin. Invest. 68, 1012-1017 (1981)
  STANDARD  full staff_review
REFERENCE   31 (bases 32371 to 43746)
  AUTHORS   Shen,S.H., Slightom,J.L. and Smithies,O.
  TITLE     A history of the human fetal globin gene duplication
  JOURNAL   Cell 26, 191-203 (1981)
  STANDARD  full staff_review
REFERENCE   32 (sites)
  AUTHORS   Busslinger,M., Moschonas,N. and Flavell,R.A.
  TITLE     +beta thalassemia: Aberrant splicing results from a single point
            mutation in an intron
  JOURNAL   Cell 27, 289-298 (1981)
  STANDARD  full staff_review
REFERENCE   33 (bases 32371 to 33236; 51996 to 52490)
  AUTHORS   Duncan,C.H., Jagadeeswaran,P., Wang,R.R. and Weissman,S.M.
  TITLE     Structural analysis of templates and RNA polymerase III transcripts
            of Alu family sequences interspersed among the human beta-like
            globin genes
  JOURNAL   Gene 13, 185-196 (1981)
  STANDARD  full staff_review
REFERENCE   34 (sites)
  AUTHORS   Orkin,S.H. and Goff,S.C.
  TITLE     Nonsense and frameshift mutations in beta-0-thalassemia detected in
            cloned beta-globin genes
  JOURNAL   J. Biol. Chem. 256, 9782-9784 (1981)
  STANDARD  full staff_review
REFERENCE   35 (sites)
  AUTHORS   Westaway,D. and Williamson,R.
  TITLE     An intron nucleotide sequence variant in a cloned
            beta-plus-thalassemia globin gene
  JOURNAL   Nucleic Acids Res. 9, 1777-1787 (1981)
  STANDARD  full staff_review
REFERENCE   36 (sites)
  AUTHORS   Moschonas,N., de Boer,E., Grosveld,F.G., Dahl,H.-H.M., Wright,S.,
            Shewmaker,C.K. and Flavell,R.A.
  TITLE     Structure and expression of a cloned beta-0 thalassemic globin gene
  JOURNAL   Nucleic Acids Res. 9, 4391-4401 (1981)
  STANDARD  full staff_review
REFERENCE   37 (bases 60507 to 60966)
  AUTHORS   Spritz,R.A.
  TITLE     Duplication/deletion polymorphism 5'- to the human beta globin gene
  JOURNAL   Nucleic Acids Res. 9, 5037-5047 (1981)
  STANDARD  full staff_review
REFERENCE   38 (bases 59363 to 59611)
  AUTHORS   Miesfeld,R., Krystal,M. and Arnheim,N.
  TITLE     A member of a new repeated sequence family which is conserved
            throughout eukaryotic evolution is found between the human delta
            and beta globin genes
  JOURNAL   Nucleic Acids Res. 9, 5931-5947 (1981)
  STANDARD  full staff_review
REFERENCE   39 (bases 16595 to 17840)
  AUTHORS   Di Segni,G., Carrara,G., Tocchini-Valentini,G.R., Shoulders,C.C.
            and Baralle,F.E.
  TITLE     Selective in vitro transcription of one of the two Alu family
            repeats present in the 5' flanking region of the human
            epsilon-globin gene
  JOURNAL   Nucleic Acids Res. 9, 6709-6722 (1981)
  STANDARD  full staff_review
REFERENCE   40 (sites)
  AUTHORS   Adams,J.G.III., Steinberg,M.H., Newman,M.V., Morrison,W.T.,
            Benz,E.J.Jr. and Iyer,R.
  TITLE     Beta-thalassemia present in cis to a new beta-chain structural
            variant, Hb Vicksburg [beta75(e19)leu->0]
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 78, 469-473 (1981)
  STANDARD  full staff_review
REFERENCE   41 (bases 61939 to 63842)
  AUTHORS   Spritz,R.A., Jagadeeswaran,P., Choudary,P.V., Biro,P.A.,
            Elder,J.T., DeRiel,J.K., Manley,J.L., Gefter,M.L., Forget,B.G. and
            Weissman,S.M.
  TITLE     Base substitution in an intervening sequence of a
            beta-plus-thalassemic human globin gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 78, 2455-2459 (1981)
  STANDARD  full staff_review
REFERENCE   42 (sites)
  AUTHORS   Baird,M., Driscoll,C., Schreiner,H., Sciarratta,G.V., Sansone,G.,
            Niazi,G., Ramirez,F. and Bank,A.
  TITLE     A nucleotide change at a splice junction in the human beta-globin
            gene is associated with beta-0-thalassemia
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 78, 4218-4221 (1981)
  STANDARD  full staff_review
REFERENCE   43 (bases 62391 to 62439)
  AUTHORS   Flavell,R.A., Bud,H., Bullman,H., Busslinger,M., de Boer,E.,
            de Kleine,A., Golden,L., Groffen,J., Grosveld,F.G., Mellor,A.L.,
            Moschonas,N. and Weiss,E.
  TITLE     The structure and expression of mammalian gene clusters
  JOURNAL   Prog. Clin. Biol. Res. 103, 37-55 (1982)
  STANDARD  full staff_entry
REFERENCE   44 (sites)
  AUTHORS   Grosveld,F., Busslinger,M., Grosveld,G., Groffen,J., DeKleine,A.
            and Flavell,R.A.
  TITLE     The structure and expression of the haemoglobin genes
  JOURNAL   Adv. Exp. Med. Biol. 158, 65-80 (1982)
  STANDARD  full staff_review
REFERENCE   45 (sites)
  AUTHORS   Treisman,R., Proudfoot,N.J., Shander,M. and Maniatis,T.
  TITLE     A single-base change at a splice site in a 0-beta-thalassemic gene
            causes abnormal RNA splicing
  JOURNAL   Cell 29, 903-911 (1982)
  STANDARD  full staff_review
REFERENCE   46 (bases 63841 to 64101)
  AUTHORS   Poncz,M., Ballantine,M., Solowiejczyk,D., Barak,I., Schwartz,E. and
            Surrey,S.
  TITLE     Beta-thalassemia in a Kurdish Jew: Single base changes in the
            T-A-T-A box
  JOURNAL   J. Biol. Chem. 257, 5994-5996 (1982)
  STANDARD  full staff_entry
REFERENCE   47 (bases 61891 to 62216; 62393 to 62466)
  AUTHORS   Gorski,J., Fiori,M. and Mach,B.
  TITLE     A new nonsense mutation as the molecular basis for beta-0
            thalassemia
  JOURNAL   J. Mol. Biol. 154, 537-540 (1982)
  STANDARD  full staff_review
REFERENCE   48 (bases 50734 to 51233)
  AUTHORS   Jagadeeswaran,P., Tuan,D., Forget,B.G. and Weissman,S.M.
  TITLE     A gene deletion ending at the midpoint of a repetitive DNA sequence
            in one form of hereditary persistence of fetal haemoglobin
  JOURNAL   Nature 296, 469-470 (1982)
  STANDARD  full staff_review
REFERENCE   49 (sites)
  AUTHORS   Orkin,S.H., Kazazian,H.H.Jr., Antonarakis,S.E., Goff,S.C.,
            Boehm,C.D., Sexton,J.P., Waber,P.G. and Giardina,P.J.
  TITLE     Linkage of beta-thalassemia mutations and beta-globin gene
            polymorphisms with DNA polymorphisms in human beta-globin gene
            cluster
  JOURNAL   Nature 296, 627-631 (1982)
  STANDARD  full staff_review
REFERENCE   50 (sites)
  AUTHORS   Ottolenghi,S. and Giglioni,B.
  TITLE     The deletion in a type of delta-0-beta-0-thalassemia begins in an
            inverted AluI repeat
  JOURNAL   Nature 300, 770-771 (1982)
  STANDARD  full staff_review
REFERENCE   51 (bases 62669 to 62733)
  AUTHORS   Spence,S.E., Pergolizzi,R.G., Donovan-Peluso,M., Kosche,K.A.,
            Dobkin,C.S. and Bank,A.
  TITLE     Five nucleotide changes in the large intervening sequence of a beta
            globin gene in a beta-plus thalassemia patient
  JOURNAL   Nucleic Acids Res. 10, 1283-1294 (1982)
  STANDARD  full staff_review
REFERENCE   52 (bases 60694 to 62155)
  AUTHORS   Moschonas,N., de Boer,E. and Flavell,R.A.
  TITLE     The DNA sequence of the 5' flanking region of the human beta-globin
            gene: Evolutionary conservation and polymorphic differences
  JOURNAL   Nucleic Acids Res. 10, 2109-2120 (1982)
  STANDARD  full staff_review
REFERENCE   53 (sites)
  AUTHORS   Allan,M., Grindlay,G.J., Stefani,L. and Paul,J.
  TITLE     Epsilon globin gene transcripts originating upstream of the mRNA
            cap site in K562 cells and normal human embryos
  JOURNAL   Nucleic Acids Res. 10, 5133-5147 (1982)
  STANDARD  full staff_review
REFERENCE   54 (sites)
  AUTHORS   Kinniburgh,A.J., Maquat,L.E., Schedl,T., Rachmilewitz,E. and
            Ross,J.
  TITLE     mRNA-deficient beta-0-thalassemia results from a single nucleotide
            deletion
  JOURNAL   Nucleic Acids Res. 10, 5421-5427 (1982)
  STANDARD  full staff_review
REFERENCE   55 (bases 54456 to 54758)
  AUTHORS   Kimura,A., Matsunaga,E., Ohta,Y., Fujiyoshi,T., Matsuo,T.,
            Nakamura,T., Imamura,T., Yanase,T. and Takagi,Y.
  TITLE     Structure of cloned delta-globin genes from a normal subject and a
            patient with delta-thalassemia; sequence polymorphisms found in the
            delta-globin gene region of Japanese individuals
  JOURNAL   Nucleic Acids Res. 10, 5725-5732 (1982)
  STANDARD  full staff_review
REFERENCE   56 (bases 10410 to 13774)
  AUTHORS   Shen,S.-H. and Smithies,O.
  TITLE     Human globin pseudo-beta-2 is not a globin-related sequence
  JOURNAL   Nucleic Acids Res. 10, 7809-7818 (1982)
  STANDARD  full staff_review
REFERENCE   57 (sites)
  AUTHORS   Spritz,R.A. and Orkin,S.H.
  TITLE     Duplication followed by deletion accounts for the structure of an
            Indian deletion beta-0-thalassemia gene
  JOURNAL   Nucleic Acids Res. 10, 8025-8029 (1982)
  STANDARD  full staff_review
REFERENCE   58 (sites)
  AUTHORS   Ley,T.J. and Nienhuis,A.W.
  TITLE     A weak upstream promoter gives rise to long human beta-globin RNA
            molecules
  JOURNAL   Biochem. Biophys. Res. Commun. 112, 1041-1048 (1983)
  STANDARD  full staff_review
REFERENCE   59 (sites)
  AUTHORS   Carlson,D.P. and Ross,J.
  TITLE     Human beta-globin promoter and coding sequences transcribed by RNA
            polymerase III
  JOURNAL   Cell 34, 857-864 (1983)
  STANDARD  full staff_review
REFERENCE   60 (sites)
  AUTHORS   Allan,M., Lanyon,W.G. and Paul,J.
  TITLE     Multiple origins of transcription in the 4.5 kb upstream of the
            epsilon-globin gene
  JOURNAL   Cell 35, 187-197 (1983)
  STANDARD  full staff_review
REFERENCE   61 (sites)
  AUTHORS   Vanin,E.F., Henthorn,P.S., Kioussis,D., Grosveld,F. and Smithies,O.
  TITLE     Unexpected relationships between four large deletions in the human
            beta-globin gene cluster
  JOURNAL   Cell 35, 701-709 (1983)
  STANDARD  full staff_review
REFERENCE   62 (bases 50762 to 67222)
  AUTHORS   Poncz,M., Schwartz,E., Ballantine,M. and Surrey,S.
  TITLE     Nucleotide sequence analysis of the delta-beta-globin gene region
            in humans
  JOURNAL   J. Biol. Chem. 258, 11599-11609 (1983)
  STANDARD  full staff_review
REFERENCE   63 (bases 62068 to 62068; 62297 to 62297; 62301 to 62301)
  AUTHORS   Treisman,R., Orkin,S.H. and Maniatis,T.
  TITLE     Specific transcription and RNA splicing defects in five cloned
            beta-thalassaemia genes
  JOURNAL   Nature 302, 591-596 (1983)
  STANDARD  full staff_entry
REFERENCE   64 (bases 62208 to 62262)
  AUTHORS   Chang,J.C., Alberti,A. and Kan,Y.W.
  TITLE     A beta-thalassemia lesion abolishes the same MstII site as the
            sickle mutation
  JOURNAL   Nucleic Acids Res. 11, 7789-7794 (1983)
  STANDARD  full staff_entry
REFERENCE   65 (bases 50734 to 53994)
  AUTHORS   Maeda,N., Bliska,J.B. and Smithies,O.
  TITLE     Recombination and balanced chromosome polymorphism suggested by DNA
            sequences 5' to the human delta-globin gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80, 5012-5016 (1983)
  STANDARD  full staff_review
REFERENCE   66 (sites)
  AUTHORS   Kazazian,H.H.Jr., Orkin,S.H., Antonarakis,S.E., Sexton,J.P.,
            Boehm,C.D. and Waber,P.G.
  TITLE     Molecular characterization of seven beta-thalassemia mutations in
            Asian Indians
  JOURNAL   EMBO J. 3, 593-596 (1984)
  STANDARD  full staff_review
REFERENCE   67 (sites)
  AUTHORS   Guida,S., Giglioni,B., Ottolenghi,S., Camaschella,C. and Saglio,G.
  TITLE     The beta-globin gene in Sardinian delta-beta-0-thalassemia carries
            a C -> T nonsense mutation at codon 39
  JOURNAL   EMBO J. 3, 785-787 (1984)
  STANDARD  full staff_review
REFERENCE   68 (sites)
  AUTHORS   Giglioni,B., Casini,C., Mantovani,R., Merli,S., Comi,P.,
            Ottolenghi,S., Saglio,G., Camaschella,C. and Mazza,U.
  TITLE     A molecular study of a family with Greek hereditary persistence of
            fetal hemoglobin and beta-thalassemia
  JOURNAL   EMBO J. 3, 2641-2645 (1984)
  STANDARD  full staff_review
REFERENCE   69 (sites)
  AUTHORS   Kimura,A., Ohta,Y., Fukumaki,Y. and Takagi,Y.
  TITLE     A fusion gene in man: DNA sequence analysis of the abnormal globin
            gene of hemoglobin Miyada
  JOURNAL   Biochem. Biophys. Res. Commun. 119, 968-974 (1984)
  STANDARD  full staff_review
REFERENCE   70 (bases 61575 to 61641)
  AUTHORS   Semenza,G.L., Malladi,P., Surrey,S., Delgrosso,K., Poncz,M. and
            Schwartz,E.
  TITLE     Detection of a novel DNA polymorphism in the beta-globin gene
            cluster
  JOURNAL   J. Biol. Chem. 259, 6045-6048 (1984)
  STANDARD  full staff_entry
REFERENCE   71 (sites)
  AUTHORS   Orkin,S.H., Antonarakis,S.E. and Kazazian,H.H.Jr.
  TITLE     Base substitution at position -88 in a beta-thalassemic globin gene
  JOURNAL   J. Biol. Chem. 259, 8679-8681 (1984)
  STANDARD  full staff_review
REFERENCE   72 (bases 45354 to 47481)
  AUTHORS   Chang,L.Y. and Slightom,J.L.
  TITLE     Isolation and nucleotide sequence analysis of the beta-type globin
            pseudogene from human, gorilla and chimpanzee
  JOURNAL   J. Mol. Biol. 180, 767-784 (1984)
  STANDARD  full staff_review
REFERENCE   73 (sites)
  AUTHORS   Grindlay,G.J., Lanyon,W.G., Allan,M. and Paul,J.
  TITLE     Alternative sites of transcription initiation upstream of the
            canonical cap site in human gamma-globin and beta-globin genes
  JOURNAL   Nucleic Acids Res. 12, 1811-1821 (1984)
  STANDARD  full staff_review
REFERENCE   74 (sites)
  AUTHORS   Stoeckert,C.J., Collins,F.S. and Weissman,S.M.
  TITLE     Human fetal globin DNA sequences suggest novel conversion event
  JOURNAL   Nucleic Acids Res. 12, 4469-4479 (1984)
  STANDARD  full staff_review
REFERENCE   75 (sites)
  AUTHORS   Mager,D.L. and Henthorn,P.S.
  TITLE     Identification of a retrovirus-like repetitive element in human DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81, 7510-7514 (1984)
  STANDARD  full staff_review
REFERENCE   76 (bases 19120 to 61794; 19120 to 61794)
  AUTHORS   Collins,F.S. and Weissman,S.M.
  TITLE     The molecular genetics of human hemoglobin
  JOURNAL   Prog. Nucleic Acid Res. Mol. Biol. 31, 315-462 (1984)
  STANDARD  full staff_review
REFERENCE   77 (bases 34294 to 34294; 34300 to 34300; 34339 to 34339)
  AUTHORS   Gilman,J.G. and Huisman,T.H.
  TITLE     DNA sequence variation associated with elevated fetal gamma-G
            globin production
  JOURNAL   Blood 66, 783-787 (1985)
  STANDARD  simple staff_entry
REFERENCE   78 (sites)
  AUTHORS   Ruskin,B., Greene,J.M. and Green,M.R.
  TITLE     Cryptic branch point activation allows accurate in vitro splicing
            of human beta-globin intron mutants
  JOURNAL   Cell 41, 833-844 (1985)
  STANDARD  full staff_review
REFERENCE   79 (sites)
  AUTHORS   Lang,K.M. and Spritz,R.A.
  TITLE     Cloning specific complete polyadenylated 3'-terminal cDNA segments
  JOURNAL   Gene 33, 191-196 (1985)
  STANDARD  full staff_review
REFERENCE   80 (bases 1 to 19312)
  AUTHORS   Li,Q., Powers,P.A. and Smithies,O.
  TITLE     Nucleotide sequence of 16 kilobase pairs of DNA 5' to the human
            epsilon-globin gene
  JOURNAL   J. Biol. Chem. 260, 14901-14910 (1985)
  STANDARD  full staff_review
REFERENCE   81 (sites)
  AUTHORS   Gelinas,R., Endlich,B., Pfeiffer,C., Yagi,M. and
            Stamatoyannopoulos,G.
  TITLE     G to A substitution in the distal CCAAT box of the
            alpha-gamma-globin gene in Greek hereditary persistence of fetal
            haemoglobin
  JOURNAL   Nature 313, 323-325 (1985)
  STANDARD  full staff_review
REFERENCE   82 (sites)
  AUTHORS   Collins,F.S., Metherall,J.E., Yamakawa,M., Pan,J., Weissman,S.M.
            and Forget,B.G.
  TITLE     A point mutation in the alpha-gamma-globin gene promoter in Greek
            hereditary persistence of fetal haemoglobin
  JOURNAL   Nature 313, 325-326 (1985)
  STANDARD  full staff_review
REFERENCE   83 (bases 67089 to 73326)
  AUTHORS   Hattori,M., Hidaka,S. and Sakaki,Y.
  TITLE     Sequence analysis of a KpnI family member near the 3'end of human
            beta-globin gene
  JOURNAL   Nucleic Acids Res. 13, 7813-7827 (1985)
  STANDARD  full staff_review
REFERENCE   84 (sites)
  AUTHORS   van Santen,V.L. and Spritz,R.A.
  TITLE     mRNA precursor splicing in vivo: Sequence requirements determined
            by deletion analysis of an intervening sequence
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 2885-2889 (1985)
  STANDARD  full staff_review
REFERENCE   85 (sites)
  AUTHORS   Tuan,D., Solomon,W., Li,Q. and London,I.M.
  TITLE     The 'beta-like-globin' gene domain in human erythroid cells
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82, 6384-6388 (1985)
  STANDARD  full staff_review
REFERENCE   86 (sites)
  AUTHORS   Chabot,B., Black,D.L., LeMaster,D.M. and Steitz,J.A.
  TITLE     The 3' splice site of pre-messenger RNA is recognized by a small
            nuclear ribonucleoprotein
  JOURNAL   Science 230, 1344-1349 (1985)
  STANDARD  full staff_review
REFERENCE   87 (bases 21372 to 21378)
  AUTHORS   Collins,F.S.
  JOURNAL   Unpublished (1986)
  STANDARD  full staff_review
REFERENCE   88 (bases 58817 to 58976; 63054 to 63313)
  AUTHORS   Popovich,B.W., Rosenblatt,D.S., Kendall,A.G. and Nishioka,Y.
  TITLE     Molecular characterization of an atypical beta-thalassemia caused
            by a large deletion in the 5' beta-globin gene region
  JOURNAL   Am. J. Hum. Genet. 39, 797-810 (1986)
  STANDARD  full staff_entry
REFERENCE   89 (bases 62391 to 62437)
  AUTHORS   Metherall,J.E., Collins,F.S., Pan,J., Weissman,S.M. and Forget,B.G.
  TITLE     Beta-0 thalassemia caused by a base substitution that creates an
            alternative splice acceptor site in an intron
  JOURNAL   EMBO J. 5, 2551-2557 (1986)
  STANDARD  full staff_entry
REFERENCE   90 (bases 54892 to 54910)
  AUTHORS   Lapoumeroulie,C., Pagnier,J., Bank,A., Labie,D. and
            Krishnmoorthy,R.
  TITLE     Beta thalassemia due to a novel mutation in IVS 1 sequence donor
            site consensus sequence creating a restriction site
  JOURNAL   Biochem. Biophys. Res. Commun. 139, 709-713 (1986)
  STANDARD  full staff_entry
REFERENCE   91 (bases 37658 to 37695; 40180 to 40217)
  AUTHORS   Tate,V.E., Hill,A.V., Bowden,D.K., Sadler,J.R., Weatherall,D.J. and
            Clegg,J.B.
  TITLE     A silent deletion in the beta-globin gene cluster
  JOURNAL   Nucleic Acids Res. 14, 4743-4750 (1986)
  STANDARD  full staff_entry
REFERENCE   92 (sites)
  AUTHORS   Prchal,J.T., Cashman,D.P. and Kan,Y.W.
  TITLE     Hemoglobin Long Island is caused by a single mutation (adenine to
            cytosine) resulting in a failure to cleave amino-terminal
            methionine
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 83, 24-27 (1986)
  STANDARD  full staff_review
REFERENCE   93 (bases 59607 to 59736; 72229 to 72358)
  AUTHORS   Gilman,J.G. and Abraham,J.
  TITLE     DNA sequence analysis of the Dutch beta-0-thalassemia deletion
  JOURNAL   Biomed. Biochim. Acta 46, 131-135 (1987)
  STANDARD  full staff_entry
REFERENCE   94 (bases 43741 to 50739)
  AUTHORS   Miyamoto,M.M., Slightom,J.L. and Goodman,M.
  TITLE     Phylogenetic relations of humans and African apes from DNA
            sequences in the pseudo-eta-globin region
  JOURNAL   Science 238, 369-373 (1987)
  STANDARD  full staff_entry
REFERENCE   95 (bases 55056 to 55070)
  AUTHORS   Atweh,G.F., Brickner,H.E., Zhu,X.-X., Kazazian,H.H.Jr. and
            Forget,B.G.
  TITLE     New amber mutation in a beta-thalassemic gene with nonmeasurable
            levels of mutant messenger RNA in vivo
  JOURNAL   J. Clin. Invest. 82, 557-561 (1988)
  STANDARD  full staff_entry
REFERENCE   96 (sites)
  AUTHORS   Fei,Y.J., Stoming,T.A., Efremov,G.D., Efremov,D.G., Battacharia,R.,
            Gonzalez-Redondo,J.M., Altay,C., Gurgey,A. and Huisman,T.H.
  TITLE     Beta-thalassemia due to a T->A mutation within the ATA box
  JOURNAL   Biochem. Biophys. Res. Commun. 153, 741-747 (1988)
  STANDARD  full staff_review
REFERENCE   97 (sites)
  AUTHORS   Engelke,D.R., Hoener,P.A. and Collins,F.S.
  TITLE     Direct sequencing of enzymatically amplified human genomic DNA
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 85, 544-548 (1988)
  STANDARD  full staff_review
COMMENT
            [1]  cRNA fragments.
            [2]  mRNA.
            [5]  sites; polyadenylation signal and site for the beta gene.
            [4]  cDNA from normal and thalassemic mRNAs.
            [6]  cDNA.
            [9]  mRNA and cDNA fragments.
            [10]  mRNA.
            [7]  cDNA.
            [13]  see comment below for 35872 to 35964 - may be 40788 to 40880.
            [19]  sites; amber mutation at codon 17 of the beta chain.
            [22]  sites; gene order for the beta-like globin cluster.
            [27]  sites; consensus sequences in the promoter regions.
            [40]  sites; mutation associated with Hb Vicksburg.
            [41]  beta thalassemia DNA.
            [42]  sites; mutation associated with beta-0 thalassemia.
            [35]  sites; mutation associated with beta-plus thalassemia.
            [36]  sites; mutation associated with beta-0 thalassemia.
            [37]  six alleles over this span.
            [34]  sites; mutations associated with beta-0 thalassemia.
            [30]  sites; mutation associated with beta-0 thalassemia.
            [53]  sites; alternative cap sites for mRNA.
            [54]  sites; mutation associated with beta-plus thalassemia.
            [57]  sites; deletion mutation associated with beta-0 thalassemia;.
            [49]  sites; mutations associated with beta thalassemias.
            [50]  sites; deletion mutation; for sequence, see separate entry.
            [62]  DNA from Kurdish Jew with thalassemia.
            [58]  sites; promoter region for beta gene.
            [59]  sites; termini for RNA polymerase III transcripts.
            [60]  sites; alternative cap sites for mRNA.
            [61]  sites; deletion mutations associated with thalassemias;.
            [65]  for the R allele;.
            [73]  sites; beta and gamma gene cap sites.
            [71]  sites; mutation associated with beta thalassemia.
            [69]  sites; Miyada Hb lesion; for sequence, see separate entry.
            [75]  sites; hsRTVL-H element; for sequence, see separate entry.
            [66]  sites; mutations associated with thalassemia.
            [67]  sites; mutation associated with thalassemia.
            [68]  sites; mutation associated with thalassemia.
            [76]  see comment; review.
            [74]  sites; mutations in the A-gamma gene.
            [81]  sites; mutation in promoter region leading to HPFH;.
            [82]  sites; mutation in promoter region leading to HPFH.
            [85]  sites; DNAse I hypersensitivity sites in the region.
            [78]  sites; cryptic branch points in beta IVS1.
            [79]  sites; 3' segments of beta and G-gamma cDNAs.
            [84]  sites; mutational analysis of G-gamma IVS-2.
            [86]  sites; small nuclear ribonucleoprotein binding site to mRNA.
            [92]  sites; hemoglobin Long Island mutation.
            [94]  revises [72].
            [96]  sites; mutations resulting in beta-thalassemia.
            [97]  sites; sickle cell anemia mutation site.
            [32]  sites; thalassemia mutations.
            [44]  sites; thalassemia mutations.
            [45]  sites; thalassemia mutation.
            [21]  sites; thalassemia mutation.
            [89]  beta-0 thalassemia mutations.
            This 73 kb sequence, which includes all of the known beta genes in
            the cluster on chromosome 11, was compiled from the following
            sources primarily:
                bases             references
                ------           ------------
               1 to 10409           [80]
               10410 to 13774       [56]
               13775 to 16594       [80]
               16595 to 21399       [23], [28], [39]
               21400 to 32370       [63; see acknowledgments therein; bases
            31906-32038 sequenced on one strand only]
               32371 to 43746       [31]
               43747 to 50733       [94]
               50734 to 67222       [48], [62]
               67223 to 73326       [83]
            Other sequence work is referenced and annotated below. Oliver
            Smithies provided the sequence in [80] via Arpanet and Francis
            Collins supplied a diskette with the sequence in [76].
            Computer-readable sequence for [94] kindly provided by
            M.M.Miyamoto, 15-FEB-1988.
            The five beta-like globin genes are found within a 45 kb cluster on
            chromosome 11 in the following order:
                  5'-epsilon -G-gamma -A-gamma -delta -beta-3'   [22]
            Additionally, the pseudogene beta-1 is located between the A-gamma
            and delta genes [72]. A region 5' to the epsilon gene was thought
            to be another pseudogene; however [56] shows this not to be so.
            These embryonic, fetal and adult beta-like genes have the same
            overall exonic structure, leading to the conclusion that they are
            derived from one ancestral gene. In particular, they have many
            consensus sequences and repetitive sequences in common which have
            been analyzed by [27] and [76].
                   Epsilon gene
                   ------------
            The epsilon globin gene (hbe below) is normally expressed in the
            embryonic yolk sac: two epsilon chains together with two zeta
            chains (an alpha-like globin; see separate entry) constitute the
            embryonic hemoglobin Hb Gower I; two epsilon chains together with
            two alpha chains form the embryonic Hb Gower II.  Both of these
            embryonic hemoglobins are normally supplanted by fetal, and later,
            adult hemoglobin.
            The promoter region sequences 'ccaat', 'ata' and 'cttccg' found at
            19421, 19476 and 19513 are characteristic of all human beta-like
            genes, as well as of some other mammalian genes, and are thought to
            influence initiation of transcription and translation [27],[76].
            However, at least nine alternative cap sites which do not possess
            these conserved sequences have been found upstream from the
            so-called canonical cap sites at 19504 and 19506 [53],[60].
            The Alu family sequences found at 16910-17176 and 17945-18208 are
            typical of the 5' flanking regions of the beta-like globin genes
            [23],[39]. The first of these bipolar repeats has been shown to be
            active as a template for RNA polymerase III [39].
                   G-gamma and A-gamma genes
                   -------------------------
            The gamma globin genes (hbgg and hbga below) are normally expressed
            in the fetal liver, spleen and bone marrow.  Two gamma chains
            together with two alpha chains constitute fetal hemoglobin (HbF)
            which is normally replaced by adult hemoglobin (HbA) at birth.  In
            some beta-thalassemias and related conditions (HPFH or 'hereditary
            persistence of fetal hemoglobin'), gamma chain production continues
            into adulthood.  The mapping of deletions in these pathologies is
            therefore of special interest with regard to developmental control
            mechanisms.
            The two types of gamma chains differ at residue 136 where glycine
            is found in the G-gamma product and alanine is found in the A-gamma
            product.  The former is predominant at birth.  Because of the
            sequence identity of the two genes over large stretches, it was not
            always possible in the early work to know which sequence was being
            investigated [13],[15],[17].  Moreover, because allelic variation
            has been reported for each of these non-allelic genes, further
            sequence work is required to determine the consensus sequence for
            each.  Thus far the sequences of two A-gamma alleles have been
            reported ([24]; see separate entry which annotates the allelic
            variation).  The second introns for the hbgg and hbga genes shown
            below contain 886 and 866 bases respectively, while their alleles
            on the opposing chromosome have 904 and 876 bases for the
            corresponding introns.  [24] and [31] present an analysis of this
            phenomenon and conclude that intergenic exchange can occur in human
            germ line cells with significant frequency.
            Given the above-mentioned uncertainties with regard to polymorphism
            and material, differences, where annotated, are treated as
            variations rather than as conflicts.
            The promoter region sequences 'ccaat', 'ata' and 'cttctg' found at
            bases 34408, 34466 and 34503 in the hbgg gene, and bases 39344,
            39402 and 39439 in the hbga gene, are characteristic of all
            beta-like genes, as well as of some other mammalian genes, and are
            thought to influence transcription and translation [27],[76].  The
            gamma genes each manifest duplicate 'ccaat' boxes at positions
            34381 (hbgg) and 39317 (hbga).  Alternative cap sites are active in
            vitro upstream from the canonical cap sites at 34496 and 39432: at
            bases 34416, 34426, 34436 and 34446 for hbgg, and at bases 39352,
            39362, 39372 and 39382 for hbga [73].  Alternative cap sites have
            also been reported for epsilon and beta mRNAs when there is no
            duplication of the 'ccaat' sequence.  [81] and [82] show that the
            distal 'ccaat' box for the hbga gene (for the B allele -- see
            separate sequence) has some vital function: a g -> a mutation at
            base 39315 is apparently responsible for one form of Greek HPFH.
            The Alu family sequences found at bases 32408-32741 (approx.) and
            37343-37580 (approx.) are typical of the beta-like globin Alu's
            [27],[76],[33],[31].  A study of the hbgg repeat has revealed RNA
            transcription by polymerase III [33]( also reported for Alu regions
            in the 5' flanks of the epsilon and beta genes).
                   Pseudo-beta-1
                   -------------
            Human, gorilla and chimpanzee beta-like pseudogenes were sequenced
            and compared (see separate entries for the other primate sequences)
            by [72] and revised by [94]. The pseudogene structure was deduced
            through comparison with the A-gamma globin gene. Base substitutions
            in the initiation codon and in codons downstream, that create
            internal termination signals in exons 2 and 3, make this sequence a
            pseudogene.
                   Delta and beta genes
                   --------------------
            The delta and beta genes (denoted hbd and hbb below) are normally
            expressed in the adult: two alpha chains plus two beta chains
            constitute HbA, which in normal adult life comprises about 97% of
            the total hemoglobin.  Two alpha chains plus two delta chains
            constitute HbA-2, which with HbF comprises the remaining 3% of
            adult hemoglobin.
            The sequence given below has been reconstructed from the sequences
            reported by [48] and [62]: the mutation at base 62161 is apparently
            sufficient to distinguish the Kurdish Jew thalassemic sequence from
            a normal (consensus) beta globin DNA; [62] has resolved all
            sequence differences to date with exception of the differences
            reported herein as variations.
            The promoter region sequences 'ccaat', 'ata' and 'cttctg' found at
            bases 54690, 54727 and 54765 in the delta gene, and at bases 62079,
            62124 and 62162 in the beta gene, are characteristic of all
            beta-like genes, as well as of some other mammalian genes, and are
            thought to influence transcription and translation [27],[76].
            However, alternative cap sites have been found for these genes as
            well as other beta-like genes [58],[73].
            [96] describes the mutation (substitution of 'a' for 't' at
            position 62125, thereby destroying the promoter) found in an Hb
            Lepore-beta+ -thalassemia patient, who was homozygous for this
            mutation.  The father who had the simple beta-thalassemia trait was
            found to be heterozygous.
            A form of thalassemia exists, where 19 bp are added after the first
            exon, starting at position 62408 and ending at a stop codon at
            position 62424-62426 [32],[44],[43]. This aberrant splicing is
            caused by a mutation of 'g' to 'a' in the first intron of
            beta-hemoglobin at position 62406.  The substitution of a 'g' for a
            't' at position 62412 also causes premature splicing and results in
            an abnormal and abbreviated beta hemoglobin [89].
            Another form of beta-thalassemia is produced by the substitution of
            an 'a' for a 'g' at position 62650.  This causes a readthrough at
            the exon/intron boundary of exon 2/intron 2 [45].
            The Alu family sequences found at bases 50933, 51994, 65531 and
            66794 are again typical of the beta-like globin Alu's.  These
            sequences are of considerable interest in relation to regulation,
            recombination and transcription by RNA polymerase III.
            There are non-Alu repetitive elements in the gene cluster that have
            been partially characterized: the EC-1 repeat [38] and a
            retrovirus-like element in the 3' flank about 300 kb downstream of
            this sequence [75] (see separate entry); there are numerous repeats
            in the 5' flank which will be included in a future update [80].
            Reference [83] has characterized a novel KpnI family sequence in
            the 3' flank of the cluster.
            Potential polyadenylation signals were identified for the following
            genes:
               hbe       positions 21073-21078
               hbgg      positions 36061-36066
               hbga      positions 40977-40982
               hbd       positions 56383-56388
               hbb       positions 63736-63741
FEATURES             Location/Qualifiers
     exon            <19559..19650
                     /note="epsilon-globin, exon 1"
                     /gene="HBE1"
     exon            <34549..34640
                     /note="G-gamma globin, exon 1"
                     /gene="HBG2"
     exon            <39485..39576
                     /note="A-gamma globin, exon 1"
                     /gene="HBG1"
     exon            <54808..54899
                     /note="delta-globin, exon 1"
                     /gene="HBD"
     exon            <62205..62296
                     /note="beta-globin, exon 1"
                     /gene="HBB"
     exon            <62205..62296
                     /note="beta-globin thalassemia, exon 1 [32],[44]"
     exon            <45728..45818
                     /note="pseudo-hbp, exon 1 [72]"
     CDS             join(19559..19650,19773..19995,20851..20979)
                     /note="epsilon-globin"
                     /codon_start=1
                     /translation="MVHFTAEEKAAVTSLWSKMNVEEAGGEALGRLLVVYPWTQRFFD
                     SFGNLSSPSAILGNPKVKAHGKKVLTSFGDAIKNMDNLKPAFAKLSELHCDKLHVDPE
                     NFKLLGNVMVIILATHFGKEFTPEVQAAWQKLVSAVAIALAHKYH"
     CDS             join(34549..34640,34763..34985,35872..36000)
                     /note="G-gamma globin"
                     /codon_start=1
                     /translation="MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFD
                     SFGNLSSASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPE
                     NFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTGVASALSSRYH"
     CDS             join(39485..39576,39699..39921,40788..40916)
                     /note="A-gamma globin"
                     /codon_start=1
                     /translation="MGHFTEEDKATITSLWGKVNVEDAGGETLGRLLVVYPWTQRFFD
                     SFGNLSSASAIMGNPKVKAHGKKVLTSLGDAIKHLDDLKGTFAQLSELHCDKLHVDPE
                     NFKLLGNVLVTVLAIHFGKEFTPEVQASWQKMVTAVASALSSRYH"
     CDS             join(54808..54899,55028..55250,56149..56277)
                     /note="delta-globin"
                     /codon_start=1
                     /translation="MVHLTPEEKTAVNALWGKVNVDAVGGEALGRLLVVYPWTQRFFE
                     SFGDLSSPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFSQLSELHCDKLHVDPE
                     NFRLLGNVLVCVLARNFGKEFTPQMQAAYQKVVAGVANALAHKYH"
     CDS             join(62205..62296,62427..62649,63500..63628)
                     /note="beta-globin"
                     /codon_start=1
                     /translation="MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFE
                     SFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPE
                     NFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH"
     CDS             join(62205..62296,62408..62426)
                     /note="beta-globin thalassemia"
                     /codon_start=1
                     /translation="MVHLTPEEKSAVTALWGKVNVDEVGGEALGSLFSHP"
     CDS             join(45728..45818,45940..46163,47015..47142)
                     /pseudo
                     /note="pseudo-hbp"
                     /codon_start=1
     repeat_region   10597..10611
                     /note="Alu flank repeat 5' copy"
     repeat_region   10612..10924
                     /note="Alu family repeat"
     repeat_region   10925..10939
                     /note="Alu flank repeat 3' copy"
     repeat_region   complement(16910..17176)
                     /note="Alu family repeat [28],[39]"
     variation       17864..17866
                     /note="cag in clone lambda-epsilon; g in ph 1.8 [28]"
     repeat_region   17945..18208
                     /note="Alu family repeat [28],[39]"
     prim_transcript 19289..21098
                     /note="hbe mRNA (alt.) [23],[53],[60]"
     prim_transcript 19504..21098
                     /note="hbe mRNA (alt.) [23],[53],[60]"
     prim_transcript 19506..21098
                     /note="hbe mRNA (alt.) [23],[53],[60]"
     intron          19651..19772
                     /note="hbe intron 1 [23]"
     exon            19773..19995
                     /note="epsilon-globin, exon 2"
     intron          19996..20850
                     /note="hbe intron 2 [23]"
     exon            20851..>20979
                     /note="epsilon-globin, exon 3"
     variation       21001
                     /note="g in [76]; c in [23]"
     old_sequence    21372..21378
     unsure          31906..32038
                     /note="sequenced on one strand only [76]"
     repeat_region   32408..32424
                     /note="Alu flank repeat 5' copy [33]"
     repeat_region   32425..32729
                     /note="Alu family repeat [33]"
     variation       32626
                     /note="g in [31]; a in [33]"
     repeat_region   32730..32746
                     /note="Alu flank repeat 3' copy [33]"
     variation       32761..32762
                     /note="ag in [31]; ga in [33]"
     variation       33204
                     /note="a in [31]; g in [33]"
     variation       33216
                     /note="a in [31]; g in [33]"
     mutation        34294
                     /note="c in wt; g in persons with elevated gamma chain
                     [77]"
     mutation        34300
                     /note="c in wt; t in persons with elevated gamma chain
                     [77]"
     mutation        34339
                     /note="c in wt; t in high G-gamma SS; beta thalassemia;
                     and in low Hb F G-gamma-beta+-HPHF and HPFH [77]"
     mutation        34379
                     /note="g in wt; a in persons with elevated gamma chain
                     [77]"
     prim_transcript 34496..36087
                     /note="hbgg mRNA [15],[24],[27]"
     intron          34641..34762
                     /note="hbgg intron 1 [24]"
     exon            34763..34985
                     /note="G-gamma globin, exon 2"
     intron          34986..35871
                     /note="hbgg intron 2 [24]"
     exon            35872..>36000
                     /note="G-gamma globin, exon 3"
     mutation        37675..40197
                     /note="g-25 kb-c in wt; gc in silent deletion [91]"
     mutation        39315
                     /note="g in wt (b allele, see separate sequence); a in
                     HPFH [81],[82]"
     prim_transcript 39432..41003
                     /note="hbga mRNA [5],[15],[24],[27]"
     variation       39456
                     /note="a in [31]; g in [15]"
     intron          39577..39698
                     /note="hbga intron 1 [24]"
     exon            39699..39921
                     /note="A-gamma globin, exon 2"
     intron          39922..40787
                     /note="hbga intron 2 [17],[24]"
     exon            40788..>40916
                     /note="A-gamma globin, exon 3"
     variation       43800..43802
                     /note="gat in [94]; gt in [76]"
     variation       43848..43850
                     /note="ttg in [94]; tg in [76]"
     variation       43861
                     /note="t in [94]; c in [76]"
     variation       44140
                     /note="t in [94]; c in [76]"
     variation       45087..45095
                     /note="aaaaaaaaa in [94]; aa in [76]"
     variation       45245..45246
                     /note="ca in [94]; ctca in [76]"
     variation       45298
                     /note="t in [94]; c in [76]"
     variation       45315
                     /note="c in [94]; t in [76]"
     variation       45317
                     /note="t in [94]; c in [76]"
     variation       45327..45329
                     /note="cat in [94]; ct in [76]"
     variation       45335..45337
                     /note="tta in [94]; ta in [76]"
     variation       45339..45341
                     /note="tag in [94]; tg in [76]"
     mRNA            45675..47388
                     /pseudo
                     /note="pseudo-hbp mRNA [72]"
     intron          45819..45941
                     /note="pseudo-hbp intron 1 (no splice consensus at 45941
                     [17],[24],[72]"
     exon            45940..46163
                     /pseudo
                     /note="pseudo-hbp, exon 2 [72]"
     intron          46164..47014
                     /note="pseudo-hbp intron 2 (no splice consensus at 46164
                     [72]"
     exon            47015..>47142
                     /note="pseudo-hbp, exon 3 [72]"
     repeat_region   complement(50907..50912)
                     /note="Alu flank repeat 3' copy [48],[62]"
     repeat_region   complement(50933..51216)
                     /note="Alu family repeat [48],[62]"
     repeat_region   complement(51217..51222)
                     /note="Alu flank repeat 5' copy [48],[62]"
     repeat_region   51984..51993
                     /note="Alu flank repeat 5' copy [62]"
     repeat_region   51994..52277
                     /note="Alu family repeat [33],[62]"
     repeat_region   52304..52313
                     /note="Alu flank repeat 3' copy [62]"
     prim_transcript 54758..56407
                     /note="hbd mRNA [25]"
     intron          54900..55027
                     /note="hbd intron 1 [25]"
     mutation        54904
                     /note="g in wt; a in beta thalassemia [90]"
     exon            55028..55250
                     /note="delta-globin, exon 2"
     mutation        55065
                     /note="g in wt; t in beta-thalassemia Glu->stop"
     intron          55251..56148
                     /note="hbd intron 2 [25]"
     variation       55541
                     /note="ct in [55],[62],[76]; tc in [25]"
     variation       55589..55590
                     /note="ct in [62],[76]; c in [25]; cc in [55]"
     exon            56149..>56277
                     /note="delta-globin, exon 3"
     mutation        58873..63111
                     /note="a-4239 bp-c in wt; ac in atypical beta-thalassemia
                     [88]"
     mutation        59681..72304
                     /note="t-12622 bp-t in wt; tt in Dutch beta-0- thalassemia
                     [93]"
     variation       60763..60768
                     /note="tatttt in [62],[76]; t in [37]"
     variation       60850
                     /note="a in [37],[62],[76]; g in [52]"
     variation       60860
                     /note="c in [62],[76]; ca in [37],[52]"
     variation       61166
                     /note="c in [52],[62],[76]; g in [52]"
     variation       61312
                     /note="c in [62],[76]; t in [52]"
     variation       61326
                     /note="t in [62],[76]; tt in [52]"
     allele          61604
                     /note="t in wt; c in Albanian allele (produces RsaI site)
                     [70]"
     allele          61626..61627
                     /note="tt in wt; tatat in Albanian allele [70]"
     variation       61681
                     /note="a in [62],[76]; g in [52]"
     variation       61771
                     /note="gc in [62],[76]; g in [52]"
     variation       61791
                     /note="g in [62]; a in [76]"
     variation       61815
                     /note="t in [55],[62],[76]; c in [52]"
     variation       61842
                     /note="g in [62],[76]; c in [52]"
     variation       61940
                     /note="g in [52],[62],[76]; c in [41]"
     variation       61950
                     /note="a in [52],[62],[76]; t in [41]"
     mutation        62068
                     /note="c in wt; g in beta-thalassemia [63]"
     mutation        62125
                     /note="t in normal promoter; a in thalassemia patient (Hb
                     Lepore-beta+ -thal)"
     mutation        62127
                     /note="a in [26],[41],[52]; c in [62],[76],[J. Biol. C"
     prim_transcript 62155..63760
                     /note="hbb mRNA [5],[11],[10],[7],[26]"
     variation       62174
                     /note="c in [62],[76]; t in [49]"
     variation       62213
                     /note="c in [26],[62],[76]; t in [49]"
     mutation        62223..62225
                     /note="gag in wt; gg in beta-thalassemia [64]"
     mutation        62224
                     /note="a in normal hbb; t in sickle cell anemia 78]"
     mutation        62256
                     /note="a in wt; t in thalassemia [21]"
     intron          62297..62426
                     /note="hbb intron 1 [10],[26],[55]"
     mutation        62297
                     /note="g in wt; a in thalassemia [63]"
     intron          62297..62407
                     /gene="HBB thalassemia"
     mutation        62301
                     /note="g in wt; c in thalassemia [63]"
     mutation        62302
                     /note="t in wt; c in thalassemia [63]"
     mutation        62406
                     /note="g in wt; a in thalassemia [32],[44],[43],[89]"
     exon            62408..>62426
                     /note="beta-globin thalassemia, exon 2 [32],[44]"
     mutation        62412
                     /note="t in wt; g in thalassemia [89]"
     exon            62427..62649
                     /note="beta-globin, exon 2"
     mutation        62448
                     /note="g in wt; a in a form of thalassemia [32]"
     intron          62650..63499
                     /note="hbb intron 2 [10],[12],[26]"
     mutation        62650
                     /note="g in wt; a in a form of thalassemia [45]"
     variation       62665
                     /note="c in [26],[49],[62],[76]; g in [49]"
     variation       62723
                     /note="g in [26],[62],[76]; t in [49]"
     variation       62730
                     /note="c in [26],[62],[76]; t in [49]"
     variation       63315
                     /note="t in [26],[49],[62],[76]; c in [49]"
     mutation        63394
                     /note="c in wt; g thalassemia [63] (causes an extra exon)"
     exon            63500..>63628
                     /note="beta-globin, exon 3"
     variation       63848..63849
                     /note="ta in [62],[76]; t in [26]"
     mutation        63936..63937
                     /note="ct in wt; cct in a form of thalassemia [46]"
     variation       63936
                     /note="c in [62],[76]; cc in [26]"
     mutation        63975..63976
                     /note="tt in wt; aa in a form of thalassemia [46]"
     variation       63975..63976
                     /note="aa in [37],[62],[76]; tt in [26]"
     variation       63992
                     /note="g in [37],[62],[76]; c in [26]"
     repeat_region   65508..65520
                     /note="Alu flank repeat 5' copy [62],[76]"
     repeat_region   65531..65785
                     /note="Alu family repeat [62],[76]"
     repeat_region   65786..65798
                     /note="Alu flank repeat 3' copy [62],[76]"
     repeat_region   66783..66793
                     /note="Alu flank repeat 5' copy [62],[76]"
     repeat_region   66794..67060
                     /note="Alu family repeat [62],[76]"
     repeat_region   67087..67097
                     /note="Alu flank repeat 3' copy [62],[76]"
     repeat_region   67089..73213
                     /note="KpnI family repeat [83]"
BASE COUNT    22072 a  14169 c  14789 g  22293 t      3 others
ORIGIN      1 bp upstream of EcoRI site; chromosome 11p15 [J. Biol. Chem. 260,
        1 gaattctaat ctccctctca accctacagt cacccatttg gtatattaaa gatgtgttgt
       61 ctactgtcta gtatccctca agtagtgtca ggaattagtc atttaaatag tctgcaagcc
      121 aggagtggtg gctcatgtct gtaattccag cactggagag gtagaagtgg gaggactgct
      181 tgagctcaag agtttgatat tatcctggac aacatagcaa gacctcgtct ctacttaaaa
      241 aaaaaaaaat tagccaggca tgtgatgtac acctgtagtc ccagctactc aggaggccga
      301 aatgggagga tcccttgagc tcaggaggtc aaggctgcag tgagacatga tcttgccact
      361 gcactccagc ctggacagca gagtgaaacc ttgcctcacg aaacagaata caaaaacaaa
      421 caaacaaaaa actgctccgc aatgcgcttc cttgatgctc taccacatag gtctgggtac
      481 tttgtacaca ttatctcatt gctgttcgta attgttagat taattttgta atattgatat
      541 tattcctaga aagctgaggc ctcaagatga taacttttat tttctggact tgtaatagct
      601 ttctcttgta ttcaccatgt tgtaactttc ttagagtagt aacaatataa agttattgtg
      661 agtttttgca aacacagcaa acacaacgac ccatatagac attgatgtga aattgtctat
      721 tgtcaattta tgggaaaaca agtatgtact ttttctacta agccattgaa acaggaataa
      781 cagaacaaga ttgaaagaat acattttccg aaattacttg agtattatac aaagacaagc
      841 acgtggacct gggaggaggg ttattgtcca tgactggtgt gtggagacaa atgcaggttt
      901 ataatagatg ggatggcatc tagcgcaatg actttgccat cacttttaga gagctcttgg
      961 ggaccccagt acacaagagg ggacgcaggg tatatgtaga catctcattc tttttcttag
     1021 tgtgagaata agaatagcca tgacctgagt ttatagacaa tgagcccttt tctctctccc
     1081 actcagcagc tatgagatgg cttgccctgc ctctctacta ggctgactca ctccaaggcc
     1141 cagcaatggg cagggctctg tcagggcttt gatagcacta tctgcagagc cagggccgag
     1201 aaggggtgga ctccagagac tctccctccc attcccgagc agggtttgct tatttatgca
     1261 tttaaatgat atatttattt taaaagaaat aacaggagac tgcccagccc tggctgtgac
     1321 atggaaacta tgtagaatat tttgggttcc attttttttt ccttctttca gttagaggaa
     1381 aaggggctca ctgcacatac actagacaga aagtcaggag ctttgaatcc aagcctgatc
     1441 atttccatgt catactgaga aagtccccac ccttctctga gcctcagttt ctctttttat
     1501 aagtaggagt ctggagtaaa tgatttccaa tggctctcat ttcaatacaa aatttccgtt
     1561 tattaaatgc atgagcttct gttactccaa gactgagaag gaaattgaac ctgagactca
     1621 ttgactggca agatgtcccc agaggctctc attcagcaat aaaattctca ccttcaccca
     1681 ggcccactga gtgtcagatt tgcatgcact agttcacgtg tgtaaaaagg aggatgcttc
     1741 tttcctttgt attctcacat acctttagga aagaacttag cacccttccc acacagccat
     1801 cccaataact catttcagtg actcaaccct tgactttata aaagtcttgg gcagtataga
     1861 gcagagatta agagtacaga tgctggagcc agaccacctg agtgattagt gactcagttt
     1921 ctcttagtaa ttgtatgact cagtttcttc atctgtaaaa tggagggttt tttaattagt
     1981 ttgtttttga gaaagggtct cactctgtca cccaaatggg agtgtagtgg caaaatctcg
     2041 gctcactgca acttgcactt cccaggctca agcggtcctc ccacctcaac atcctgagta
     2101 gctggaacca caggtacaca ccaccatacc tcgctaattt tttgtatttt tggtagagat
     2161 ggggtttcac atgttacaca ggatggtctc agactccgga gctcaagcaa tctgcccacc
     2221 tcagccttcc aaagtgctgg gattataagc atgattacag gagttttaac aggctcataa
     2281 gattgttctg cagcccgagt gagttaatac atgcaaagag tttaaagcag tgacttataa
     2341 atgctaacta ctctagaaat gtttgctagt attttttgtt taactgcaat cattcttgct
     2401 gcaggtgaaa actagtgttc tgtactttat gcccattcat ctttaactgt aataataaaa
     2461 ataactgaca tttattgaag gctatcagag actgtaatta gtgctttgca taattaatca
     2521 tatttaatac tcttggattc tttcaggtag atactattat tatccccatt ttactacagt
     2581 taaaaaaact acctctcaac ttgctcaagc atacactctc acacacacaa acataaacta
     2641 ctagcaaata gtagaattga gatttggtcc taattatgtc tttgctcact atccaataaa
     2701 tatttattga catgtacttc ttggcagtct gtatgctgga tgctggggat acaaagatgt
     2761 ttaaatttaa gctccagtct ctgcttccaa aggcctccca ggccaagtta tccattcaga
     2821 aagcattttt tactctttgc attccactgt ttttcctaag tgactaaaaa attacacttt
     2881 attcgtctgt gtcctgctct gggatgatag tctgactttc ctaacctgag cctaacatcc
     2941 ctgacatcag gaaagactac accatgtgga gaaggggtgg tggttttgat tgctgctgtc
     3001 ttcagttaga tggttaactt tgtgaagttg aaaactgtgg ctctctggtt gactgttaga
     3061 gttctggcac ttgtcactat gcctattatt taacaaatgc atgaatgctt cagaatatgg
     3121 gaatattatc ttctggaata gggaatcaag ttatattatg taacccagga ttagaagatt
     3181 cttctgtgtg taagaatttc ataaacatta agctgtctag caaaagcaag ggcttggaaa
     3241 atctgtgagc tcctcaccat atagaaagct tttaacccat cattgaataa atccctatag
     3301 gggatttcta ccctgagcaa aaggctggtc ttgattaatt cccaaactca tatagctctg
     3361 agaaagtcta tgctgttaac gttttcttgt ctgctacccc atcatatgca caacaataaa
     3421 tgcaggccta ggcatgactg aaggctctct cataattctt ggttgcatga atcagattat
     3481 caacagaaat gttgagacaa actatgggga agcagggtat gaaagagctc tgaatgaaat
     3541 ggaaaccgca atgcttcctg cccattcagg gctccagcat gtagaaatct ggggctttgt
     3601 gaagactggc ttaaaatcag aagccccatt ggataagagt agggaagaac ctagagccta
     3661 cgctgagcag gtttccttca tgtgacaggg agcctcctgc cccgaacttc cagggatcct
     3721 ctcttaagtg tttcctgctg gaatctcctc acttctatct ggaaatggtt tctccacagt
     3781 ccagcccctg gctagttgaa agagttaccc atgcagaggc cctcctagca tccagagact
     3841 agtgcttaga ttcctacttt cagcgttgga caacctggat ccacttgccc agtgttcttc
     3901 cttagttcct accttcgacc ttgatcctcc tttatcttcc tgaaccctgc tgagatgatc
     3961 tatgtgggga gaatggcttc tttgagaaac atcttcttcg ttagtggcct gcccctcatt
     4021 cccactttaa tatccagaat cactataaga agaatataat aagaggaata actcttatta
     4081 taggtaaggg aaaattaaga ggcatacgtg atgggatgag taagagagga gagggaagga
     4141 ttaatggatg ataaaatcta ctactatttg ttgagacctt ttatagtcta atcaattttg
     4201 ctattgtttt ccatcctcac gctaactcca taaaaaaaca ctattattat ctttattttg
     4261 ccatgacaag actgagctca gaagagtcaa gcatttgcct aaggtcggac atgtcagagg
     4321 cagtgccaga cctatgtgag actctgcagc tactgctcat gggccctgtg ctgcactgat
     4381 gaggaggatc agatggatgg ggcaatgaag caaaggaatc attctgtgga taaaggagac
     4441 agccatgaag aagtctatga ctgtaaattt gggagcagga gtctctaagg acttggattt
     4501 caaggaattt tgactcagca aacacaagac cctcacggtg actttgcgag ctggtgtgcc
     4561 agatgtgtct atcagaggtt ccagggaggg tggggtgggg tcagggctgg ccaccagcta
     4621 tcagggccca gatgggttat aggctggcag gctcagatag gtggttaggt caggttggtg
     4681 gtgctgggtg gagtccatga ctcccaggag ccaggagaga tagaccatga gtagagggca
     4741 gacatgggaa aggtggggga ggcacagcat agcagcattt ttcattctac tactacatgg
     4801 gactgctccc ctataccccc agctaggggc aagtgccttg actcctatgt tttcaggatc
     4861 atcatctata aagtaagagt aataattgtg tctatctcat agggttatta tgaggatcaa
     4921 aggagatgca cactctctgg accagtggcc taacagttca ggacagagct atgggcttcc
     4981 tatgtatggg tcagtggtct caatgtagca ggcaagttcc agaagatagc atcaaccact
     5041 gttagagata tactgccagt ctcagagcct gatgttaatt tagcaatggg ctgggaccct
     5101 cctccagtag aaccttctaa ccagctgctg cagtcaaagt cgaatgcagc tggttagact
     5161 ttttttaatg aaagcttagc tttcattaaa gattaagctc ctaagcaggg cacagatgaa
     5221 attgtctaac agcaactttg ccatctaaaa aaatctgact tcactggaaa catggaagcc
     5281 caaggttctg aacatgagaa atttttagga atctgcacag gagttgagag ggaaacaaga
     5341 tggtgaaggg actagaaacc acatgagaga cacgaggaaa tagtgtagat ttaggctgga
     5401 ggtaaatgaa agagaagtgg gaattaatac ttactgaaat ctttctatat gtcaggtgcc
     5461 attttatgat atttaataat ctcattacat atggtaattc tgtgagatat gtattattga
     5521 acatactata attaatacta atgataagta acacctcttg agtacttagt atatgctaga
     5581 atcaaattta agtttatcat atgaggccgg gcacggtggc tcatatatgg gattacatgc
     5641 ctgtaatccc agcactttgg gaggccaagg caattggatc acctgaggtc aggagttcca
     5701 gaccagcctg gccaacatgg tgaaacccct tctctactaa aaaatacaaa aaatcagcca
     5761 ggtgtggtgg cacgcgtcta taatcccagc tactcaggag gctgaggcag gagaatcact
     5821 tgaacccagg aggtggaggt tgcagtgagc taagattgca ccactgcact ccagcctagg
     5881 cgacagagtg agactccatc tcaaaaaaaa aaaaagaagt ttattatatg aattaactta
     5941 gttttactca caccaatact cagaagtaga ttattacctc atttattgat gaggagccca
     6001 atgtacttgt agtgtagatc aacttattga aagcacaagc taataagtag acaattagta
     6061 attagaagtc agatggtctg agctctccta ctgtctacat tacatgagct cttattaact
     6121 ggggactcga aaatcaaaga catgaaataa tttgtccaag cttacagaac caccaagtag
     6181 taaggctagg atgtagaccc agttctgcta cctctgaaga cagtgttttt tccacagcaa
     6241 aacacaaact cagatattgt ggatgcgaga aattagaagt agatattcct gccctgtggc
     6301 ccttgcttct tacttttact tcttggcgat tggaagttgt ggtccaagcc acagttgcag
     6361 accatacttc ctcaaccata attgcatttc ttcaggaaag tttgagggag aaaaaggtaa
     6421 agaaaaattt agaaacaact tcagaataaa gagattttct cttgggttac agagattgtc
     6481 atatgacaaa ttataagcag acacttgaga aaactgaagg cccatgcctg cccaaattac
     6541 cctttgaccc cttggtcaag ctgcaacttt ggttaaaggg agtgtttatg tgttatagtg
     6601 ttcatttact cttctggtct aacccattgg ctccgtcttc atcctgcagt gacctcagtg
     6661 cctcagaaac atacatatgt ttgtctagtt taagtttgtg tgaaattcta actagcgtca
     6721 agaactgagg gccctaaact atgctaggaa tagtgctgtg gtgctgtgat aggtacacaa
     6781 gaaatgagaa gaaactgcag attctctgca tctccctttg ccgggtctga caacaaagtt
     6841 tccccaaatt ttaccaatgc aagccatttc tccatatgct aactacttta aaatcatttg
     6901 gggcttcaca ttgtctttct catctgtaaa aagaatggaa gaactcattc ctacagaact
     6961 ccctatgtct tccctgatgg gctagagttc ctctttctca aaaattagcc attattgtat
     7021 ttccttctaa gccaaagctc agaggtcttg tattgcccag tgacatgcac actggtcaaa
     7081 agtaggctaa gtagaagggt actttcacag gaacagagag caaaagaggt gggtgaatga
     7141 gagggtaagt gagaaaagac aaatgagaag ttacaacatg atggcttgtt gtctaaatat
     7201 ctcctaggga attattgtga gaggtctgaa tagtgttgta aaataagctg aatctgctgc
     7261 ctaacattaa cagtcaagaa atacctccga ataactgtac ctccaattat tctttaaggt
     7321 agcatgcaac tgtaatagtt gcatgtatat atttatcata atactgtaac agaaaacact
     7381 tactgaatat atactgtgtc cctagttctt tacacaataa actaatctca tcctcataat
     7441 tctattagct aatacatatt atcatcctat atttcagaga cttcaagaag ttaagcaact
     7501 tgctcaagat catctaagaa gtaggtggta tttctgggct catttggccc ctcctaatct
     7561 ctcatggcaa catggctgcc taaagtgttg attgccttaa ttcatcaggg atgggctcat
     7621 actcactgca gaccttaact ggcatcctct tttcttatgt gatctgcctg accctagtag
     7681 aacttatgaa atttctgatg agaaaggaga gaggagaaag gcagagctga ctgtgatgag
     7741 tgatgaaggt gccttctcat ctgggtacca gtggggcctc taagactaag tcactctgtc
     7801 tcactgtgtc ttagccagtt ccttacagct tgccctgatg ggagatagag aatgggtatc
     7861 ctccaacaaa aaaataaatt ttcatttctc aaggtccaac ttatgttttc ttaattttta
     7921 aaaaaatctt gaccattctc cactctctaa aataatccac agtgagagaa acattctttt
     7981 cccccatccc ataaatacct ctattaaata tggaaaatct gggcatggtg tctcacacct
     8041 gtaatcccag cactttggga ggctgaggtg ggtggactgc ttggagctca ggagttcaag
     8101 accatcttgg acaacatggt gataccctgc ctctacaaaa agtacaaaaa ttagcctggc
     8161 atggtggtgt gcacctgtaa tcccagctat tagggtggct gaggcaggag aattgcttga
     8221 acccgggagg cggaggttgc agtgagctga gatcgtgcca ctgcactcca gcctggggga
     8281 cagagcacat tataattaac tgttattttt tacttggact cttgtgggga ataagataca
     8341 tgttttattc ttatttatga ttcaagcact gaaaatagtg tttagcatcc agcaggtgct
     8401 tcaaaaccat ttgctgaatg attactatac tttttacaag ctcagctccc tctatccctt
     8461 ccagcatcct catctctgat taaataagct tcagtttttc cttagttcct gttacatttc
     8521 tgtgtgtctc cattagtgac ctcccatagt ccaagcatga gcagttctgg ccaggcccct
     8581 gtcggggtca gtgccccacc cccgccttct ggttctgtgt aaccttctaa gcaaaccttc
     8641 tggctcaagc acagcaatgc tgagtcatga tgagtcatgc tgaggcttag ggtgtgtgcc
     8701 cagatgttct cagcctagag tgatgactcc tatctgggtc cccagcagga tgcttacagg
     8761 gcagatggca aaaaaaagga gaagctgacc acctgactaa aactccacct caaacggcat
     8821 cataaagaaa atggatgcct gagacagaat gtgacatatt ctagaatata ttatttcctg
     8881 aatatatata tatatatata tacacatata cgtatatata tatatatata tatatttgtt
     8941 gttatcaatt gccatagaat gattagttat tgtgaatcaa atatttatct tgcaggtggc
     9001 ctctatacct agaagcggca gaatcaggct ttattaatac atgtgtatag atttttagga
     9061 tctatacaca tgtattaata tgaaacaagg atatggaaga ggaaggcatg aaaacaggaa
     9121 aagaaaacaa accttgtttg ccattttaag gcacccctgg acagctaggt ggcaaaaggc
     9181 ctgtgctgtt agaggacaca tgctcacata cggggtcaga tctgacttgg ggtgctactg
     9241 ggaagctctc atcttaagga tacatctcag gccagtcttg gtgcattagg aagatgtagg
     9301 caactctgat cctgagagga aagaaacatt cctccaggag agctaaaagg gttcacctgt
     9361 gtgggtaact gtgaaggact acaagaggat gaaaaacaat gacagacaga cataatgctt
     9421 gtgggagaaa aaacaggagg tcaaggggat agagaaggct tccagaagaa tggctttgaa
     9481 gctggcttct gtaggagttc acagtggcaa agatgtttca gaaatgtgac atgacttaag
     9541 gaactataca aaaaggaaca aatttaagga gaggcagata aattagttca acagacatgc
     9601 aaggaatttt cagatgaatg ttatgtctcc actgagcttc ttgaggttag cagctgtgag
     9661 ggttttgcag gcccaggacc cattacagga cctcacgtat acttgacact gttttttgta
     9721 ttcatttgtg aatgaatgac ctcttgtcag tctactcggt ttcgctgtga atgaatgatg
     9781 tcttgtcagc ctacttggtt tcgctaagag cacagagaga agatttagtg atgctatgta
     9841 aaaacttcct ttttggttca agtgtatgtt tgtgatagaa atgaagacag gctacatgat
     9901 gcatatctaa cataaacaca aacattaaga aaggaaatca acctgaagag tatttataca
     9961 gataacaaaa tacagagagt gagttaaatg tgtaataact gtggcacagg ctggaatatg
    10021 agccatttaa atcacaaatt aattagaaaa aaaacagtgg ggaaaaaatt ccatggatgg
    10081 gtctagaaag actagcattg ttttaggttg agtggcagtg tttaaagggt gatatcagac
    10141 taaacttgaa atatgtggct aaataactag aatactcttt attttttcgt atcatgaata
    10201 gcagatatag cttgatggcc ccatgcttgg tttaacatcc ttgctgttcc tgacatgaaa
    10261 tccttaattt ttgacaaagg ggctattcat tttcatttta tattgggcct agaaattatg
    10321 tagatggtcc tgaggaaaag tttatagctt gtctatttct ctctctaaca tagttgtcag
    10381 cacaatgcct aggctatagg aagtactcaa agcttgttaa attgaattct atccttctta
    10441 ttcaattcta cacatggagg aaaaactcat cagggatgga ggcacgcctc taaggaaggc
    10501 aggtgtggct ctgcagtgtg attgggtact tgcaggacga agggtggggt gggagtggct
    10561 aaccttccat tcctagtgca gaggtcacag cctaaacatc aaattccttg aggtgcggtg
    10621 gctcactcct gtaatcacag cagtttggga cgccaaggtg ggcagatcac ttgaggtcag
    10681 gagttggaca ccagcccagc caacatagtg aaacctggtc tctgcttaaa aatataaaaa
    10741 ttagctggac gtggtgacgg gagcctgtaa tccaactact tgggaggctg aggcaggaga
    10801 atcgcttgaa ccggggaggt ggagtttgca ctgagcagag atcatgccat tgcactccag
    10861 cctccagagc gagactctgt ctaaagaaaa acgaaaacaa acaaacaaac aaacaaacaa
    10921 aacccatcaa attccctgac cgaacagaat tctgtctgat tgttctctga cttatctacc
    10981 attttccctc cttaaagaaa ctgtggaact tccttcagct agaggggcct ggctcagaag
    11041 cctctggtca gcatccaaga aatacttgat gtcactttgg ctaaaggtat gatgtgtaga
    11101 caagctccag agatggtttc tcatttccat atccacccac ccagctttcc aattttaaag
    11161 ccaattctga ggtagagact gtgatgaaca aacaccttga caaaattcaa cccaaagact
    11221 cactttgcct agcttcaaaa tccttactct gacatatact cacagccaga aattagcatg
    11281 cactagagtg tgcatgagtg caacacacac acacaccaat tccatattct ctgtcagaaa
    11341 atcctgttgg tttttcgtga aaggatgttt tcagaggctg accccttgcc ttcacctcca
    11401 atgctaccac tctggtctaa gtcactgtca ccaccaccta aattatagct gttgactcat
    11461 aacaatcttc ctgcttctac cactgcccca ctacaatttc ttcccaatat actatccaaa
    11521 ttagtctttt caaaatgtaa gtcatatatg gtcacctctt tgttcaaagt cttctgatag
    11581 tttcctatat catttataat aaaaccaaat ccttacaatt ctctacaata gttgttcatg
    11641 catatattat gtttattaca gatacgcata tatatagctc tcatataaat aaatatatat
    11701 atttatgtgt atgtgtgtag agtgtttttt cttacaactc tatgatgtag gtattattag
    11761 tgtcccaaat tttataattt aggacttcta tgatctcatc ttttattctc cccttcaccg
    11821 aatctcatcc tacattggcc ttattgatat tccttgaaaa ttctaagcat cttacatctt
    11881 tagggtattt acatttgcca ttccctatgc cctaaatatt taatcatagt ttcatataaa
    11941 tgggttcctc atcatctatg ggtactctct caggtgttaa ctttatagtg aggactttcc
    12001 tgccatacta cttaaagtag cgataccctt tcaccctgtc ctaatcacac tctggccttc
    12061 atttcagttt tttttttttc tccatagcac ctaatctcat tggtatataa catgtttcat
    12121 ttgcttattt aatgtcaagc tctttccact atcaagtcca tgaaaacagg aactttattc
    12181 ctctattctg tttttgtgct gtattcttag caattttaca attttgaatg aaatgaatga
    12241 gcagtcaaac acatatacaa ctataattaa aaggatgtat gctgacacat ccactgctat
    12301 gcacacacaa agaaatcagt ggagtagagc tggaagcgct aagcctgcat agagctagtt
    12361 agccctccgc aggcagagcc ttgatgggat tactgagttc tagaattgga ctcatttgtt
    12421 ttgtaggctg agatttgctc ttgaaaactt gttctgacca aaataaaagg ctcaaaagat
    12481 gaatatcgaa accagggtgt tttttacact ggaatttata actagagcac tcatgtttat
    12541 gtaagcaatt aattgtttca tcagtcaggt aaaagtaaag aaaaactgtg ccaaggcagg
    12601 tagcctaatg caatatgcca ctaaagtaaa cattattcca taggtgtcag atatggctta
    12661 ttcatccatc ttcatgggaa ggatggcctt ggcctggaca tcagtgttat gtgaggttca
    12721 aaacacctct aggctataag gcaacagagc tccttttttt tttttctgtg ctttcctggc
    12781 tgtccaaatc tctaatgata agcatacttc tattcaatga gaatattctg taagattata
    12841 gttaagaatt gtgggagcca ttccgtctct tatagttaaa tttgagcttc ttttatgatc
    12901 actgtttttt taatatgctt taagttctgg ggtacatgtg ccatggtggt ttgctgcacc
    12961 catcaacccg tcatctacat taggtatttc tcctaatgct atccttcccc tagcccccca
    13021 cccccaacag gccccagtgt gtgatgttcc cctccctgtg tccatggatc actggttttt
    13081 tttttttttt tttttttttt tttaaagtct cagttaaatt tttggaatgt aatttatttt
    13141 cctggtatcc taggacctgc aagttatctg gtcactttag ccctcacgtt ttgatgataa
    13201 tcacatattt gtaaacacaa cacacacaca cacacacaca cacatatata tatataaaac
    13261 atatatatac ataaacacac ataacatatt tatcgggcat ttctgagcaa ctaactcatg
    13321 caggactctc aaacactaac ctatagcctt ttctatgtat ctacttgtgt agaaaccaag
    13381 cgtggggact gagaaggcaa tagcaggagc attctgactc tcactgcctt tggctaggtc
    13441 cctccctcat cacagctcag catagtccga gctcttatct atatccacac acagtttctg
    13501 acgctgccca gctatcacca tcccaagtct aaagaaaaaa ataatgggtt tgcccatctc
    13561 tgttgattag aaaacaaaac aaaataaaat aagcccctaa gctcccagaa aacatgacta
    13621 aaccagcaag aagaagaaaa tacaataggt atatgaggag actggtgaca ctagtgtctg
    13681 aatgaggctt gagtacagaa aagaggctct agcagcatag tggtttagag gagatgtttc
    13741 tttccttcac agatgcctta gcctcaataa gcttgcggtt gtggaagttt actttcagaa
    13801 caaactcctg tggggctaga attattgatg gctaaaagaa gcccggggga gggaaaaatc
    13861 attcagcatc ctcaccctta gtgacacaaa acagaggggg cctggttttc catatttcct
    13921 catgatggat gatctcgtta atgaaggtgg tctgacgaga tcattgcttc ttccatttaa
    13981 gccttgctca cttgccaatc ctcagtttta accttctcca gagaaataca cattttttat
    14041 tcaggaaaca tactatgtta tagtttcaat actaaataat caaagtactg aagatagcat
    14101 gcataggcaa gaaaaagtcc ttagctttat gttgctgttg tttcagaatt taaaaaagat
    14161 caccaagtca aggacttctc agttctagca ctagaggtgg aatcttagca tataatcaga
    14221 ggtttttcaa aatttctaga catgagattc aaagccctgc acttaaaata gtctcatttg
    14281 aattaactct ttatataaat tgaaagcaca ttctgaacta cttcagagta ttgttttatt
    14341 tctatgttct tagttcataa atacattagg caatgcaatt taattaaaaa aacccaagaa
    14401 tttcttagaa ttttaatcat gaaaataaat gaaggcatct ttacttactc aaggtcccaa
    14461 aaggtcaaag aaaccaggaa agtaaagcta tatttcagcg gaaaatggga tatttatgag
    14521 ttttctaagt tgacagactc aagttttaac cttcagtgcc catgatgtag gaaagtgtgg
    14581 cataactggc tgattctggc tttctactcc tttttcccat taaagatccc tcctgcttaa
    14641 ttaacattca caagtaactc tggttgtact ttaggcacag tggctcccga ggtcagtcac
    14701 acaataggat gtctgtgctc caagttgcca gagagagaga ttactcttga gaatgagcct
    14761 cagccctggc tcaaactcac ctgcaaactt cgtgagagat gaggcagagg tacactacga
    14821 aagcaacagt tagaagctaa atgatgagaa cacatggact catagaggga aacaacgcat
    14881 actggggcct atcagagggt ggagggtgag agaaggagag gatcaggaaa aatcactaat
    14941 ggatgctaag cgtaatacct gagtgatgag atcatctata caacaaaccc ccttgacatt
    15001 catttatcta tgtaacaaac ctgcacatcc tgtacacgta cccctgaact taaaataaaa
    15061 gttgaaaaca agaaagcaac agtttgaaca cttgttatgg tctattctct cattctttac
    15121 aattacacta gaaaatagcc acaggctcct gcaaggcagc cacagaattt atgacttgtg
    15181 atatccaagt cattcctgga taatgcaaaa tctaacacaa aatctagtag aatcatttgc
    15241 ttacatctat ttttgttctg agaatataga tttagataca taatggaagc agaataattt
    15301 aaaatctggc taatttagaa tcctaagcag ctcttttcct atcagtggtt tacaagcctt
    15361 gtttatattt ttcctatttt aaaaataaaa ataaagtaag ttatttgtgg taaagaatat
    15421 tcattaaagt atttatttct tagataatac catgaaaaac attcagtgaa gtgaagggcc
    15481 tactttaccc aacaagaatc taatttatat aatttttcat actaatagca tctaagaaca
    15541 gtacaatatt tgactcttca ggttaaacat atgtcataaa ttagccagaa agatttaaga
    15601 aaatattgga tgtttccttg tttaaattag gcatcttaca gtttttagaa tcctgcatag
    15661 aacttaagaa attacaaatg ctaaagcaaa cccaaacagg caggaattaa tcttcatcga
    15721 atttgggtgt ttctttctaa aagtccttta tacttaaatg tcttaagaca tacatagatt
    15781 ttattttact aattttaatt atacagacaa taaatgaata ttcttactga ttactttttc
    15841 tgactgtcta atctttctga tctatcctgg atggccataa cacttatctc tctgaacttt
    15901 gggcttttaa tataggaaag aaaagcaata atccattttt catggtatct catatgataa
    15961 acaaataaaa tgcttaaaaa tgagcaggtg aagcaattta tcttgaacca acaagcatcg
    16021 aagcaataat gagactgccc gcagcctacc tgacttctga gtcaggattt ataagccttg
    16081 ttactgagac acaaacctgg gcctttcaat gctataacct ttcttgaagc tcctccctac
    16141 cacctttagc cataaggaaa catggaatgg gtcagatccc tggatgcaag ccaggtctgg
    16201 aaccataggc agtaaggaga gaagaaaatg tgggctctgc aactggctcc gagggagcag
    16261 gagagaatca accccatact ctgaatctaa gagaagactg gtgtccatac tctgaatggg
    16321 aagaatgatg ggattaccca tagggcttgt tttagggaga aacctgttct ccaaactctt
    16381 ggccttgaga tacctggtcc ttattccttg gactttggca atgtctgacc ctcacattca
    16441 agttctgagg aagggccact gccttcatac tgtggatctg tagcaaattc cccctgaaaa
    16501 cccagagctg tatcttaatt gtttaaaaaa attatattat ctcaaggact gttcttctct
    16561 gagtagccaa gctcagcttg gttcaagcta caagcagctg cgctgctttt tgtctagtca
    16621 ttgttctttt atttcagtgg atcaaatacg ttctttccaa acctaggatc ttgtcttcct
    16681 ggactatata ttttatccac gaagtcttaa tctggggtcc acagaacact agggggctgg
    16741 tgaagtttat agaaaaaaaa tctgtatttt tacttacatg taactgaaat ttagcatttt
    16801 cttctacttt gaatgcaaag gacaaactag aatgacatca tcagtaccta ttgcatagtt
    16861 ataaagagaa accacagata ttttcatact acaccatagg tattgcagat ctttttgttt
    16921 ttgtttttgt ttgagatgga gtttcgctct tattgcccag gctggagtgc agtggcatga
    16981 tttcggctca ctgcaacctc cccttcctgc attcaagcaa ttctcctgcc ttggcctcca
    17041 gagtagctgg ggattacagg cacctgccac catgccagtc taatttttgt atttttagta
    17101 gagaatgggt ttcgccatgt tggccaggct ggtcttgaac tcctgacctc agatgatctg
    17161 cccgccttgg cctcctgaag tgctgggatt ataggtgtga gccaccacgc ctggcccatt
    17221 gcagatattt ttaattcaca tttatctgca tcactacttg gatcttaagg tagctgcaga
    17281 cccaatccca gatctaatgc tttcataaag aagcaaatat aataaatact ataccacaaa
    17341 tgtaatgttt gatgtctgat aatgatattt cagtgtaatt aaacttagca ctccatgtat
    17401 attatttgat gcaataaaaa catatttttt tagcacttac agtctgccaa actggcctgt
    17461 gacacaaaaa aagtttaggg gaattcccct agttttgtct gtgttagcca atggttagaa
    17521 tatatgctca gaaagatacc attggttaat agctaaaaga aaatggagta gaaattcagt
    17581 ggcctggaat aataacaatt tgggcagtca ttaagtcagg tgaagacttc tggaatcatg
    17641 ggagaaaagc aagggagaca ttcttacttg ccacaagtgt tttttttttt tttttttttt
    17701 atcacaaaca taagaaaata taataaataa caaagtcagg ttatagaaga gagaaacgct
    17761 cttagtaaac ttggaatatg gaatccccaa aggcacttga cttgggagac aggagccata
    17821 ctgctaagtg aaaaagacga agaacctcta gggcctgaac atacaggaaa ttgtaggaac
    17881 agaaattcct agatctggtg gggcaagggg agccatagga gaaagaaatg gtagaaatgg
    17941 atggagacgg aggcagaggt gggcagatca tgaggtcaag agatcgagac catcctggca
    18001 aacatggtga aatcccgtct ctactaaaaa taaaaaaatt agctgggcat ggtggcatgc
    18061 gcctgtagtc ccagctgctc gggaggctga ggcaggagaa tcgtttgaac ccaggaggcg
    18121 aaggttgcag tgagctgaga tagtgccatt gcactccagt ctggcaacag agtgagactc
    18181 cgtctcaaaa aaaaaaaaaa gaaagaaaga aaagaaaaag aaaaaagaaa aaataaatgg
    18241 atgtagaaca agccagaagg aggaactggg ctggggcaat gagattatgg tgatgtaagg
    18301 gacttttata gaattaacaa tgctggaatt tgtggaactc tgcttctatt attcccccaa
    18361 tcattacttc tgtcacattg atagttaaat aatttctgtg aatttattcc ttgantccca
    18421 aaatattgag gtaaataaca atggtattat aaaagggcag attaagtgat atagcataag
    18481 caatattctt caggcacatg gatcgaattg aatacactgt aaatcccaac ttccagtttc
    18541 agctctacca agtaaagagc tagcaagtca tcaaaatggg gacatacaga aaaaaaaaag
    18601 gacactagag gaataatata ccctgactcc tagcctgatt aatatatcga ttcactttta
    18661 ctctgtttgg tgacaaattc tggctttaaa taattttagg attttaggct tctcagctcc
    18721 cttcccagtg agaagtataa gcaggacagc aggcaagcaa gaagagagcc caaggcaata
    18781 ctcacaaagt agccagtgtc ccctgtggtc atagagaaat ggaaagagag aggantcccc
    18841 ccttggagcc actgggtggt aatcctttcc gtccgttcct ctctagggaa tcaccccaag
    18901 gtactgtact ttgggattaa ggctttagtc ccactgtgga ctacttgcta ttctgttcag
    18961 tttctgaagg aactatgtac ggtttttgtc tccctagaga aactaaggta cagaagtttt
    19021 gtttacaatg cactccttaa gagagctaga actgggtgaa gantcctggt ttaaccagcc
    19081 ttaatttcct ttccctgggc cccggtttgg tcacgtcact gtcaccacct ttaaggcaaa
    19141 tgttaaatgc gctttggctg aactttttcc tattttgaga tttgctcctt tatatgaggc
    19201 tttcttggaa aaggagaatg ggagagatgg atatcatttt ggaagatgat gaagagggta
    19261 aaaaagggta caaatggaaa tttgtgttgc agatagtatg aggagccaac aaaaaagagc
    19321 ctcaggatcc agcacacatt atcacaaact tagtgtccat ccatcactgc tgaccctctc
    19381 cggacctgac tccacccctg aggacacagg tcagccttga ccaatgactt ttaagtacca
    19441 tggagaacag ggggccagaa cttcggcagt aaagaataaa aggccagaca gagaggcagc
    19501 agcacatatc tgcttccgac acagctgcaa tcactagcaa gctctcaggc ctggcatcat
    19561 ggtgcatttt actgctgagg agaaggctgc cgtcactagc ctgtggagca agatgaatgt
    19621 ggaagaggct ggaggtgaag ccttgggcag gtaagcattg gttctcaatg catgggaatg
    19681 aagggtgaat attaccctag caagttgatt gggaaagtcc tcaagatttt ttgcatctct
    19741 aattttgtat ctgatatggt gtcatttcat agactcctcg ttgtttaccc ctggacccag
    19801 agattttttg acagctttgg aaacctgtcg tctccctctg ccatcctggg caaccccaag
    19861 gtcaaggccc atggcaagaa ggtgctgact tcctttggag atgctattaa aaacatggac
    19921 aacctcaagc ccgcctttgc taagctgagt gagctgcact gtgacaagct gcatgtggat
    19981 cctgagaact tcaaggtgag ttcaggtgct ggtgatgtga ttttttggct ttatattttg
    20041 acattaattg aagctcataa tcttattgga aagaccaaca aagatctcag aaatcatggg
    20101 tcgagcttga tgttagaaca gcagacttct agtgagcata accaaaactt acatgattca
    20161 gaactagtga cagtaaagga ctactaacag cctgaattgg cttaactttt caggaaatct
    20221 tgccagaact tgatgtgttt atcccagaga attgtattat agaattgtag acttgtgaaa
    20281 gaagaatgaa atttggcttt tggtagatga aagtccattt caaggaaata gaaatgcctt
    20341 attttatgtg ggtcatgata attgaggttt agaagagatt tttgcaaaaa aaataaaaga
    20401 tttgctcaaa gaaaaataag acacattttc taaaatatgt taaatttccc atcagtattg
    20461 tgaccaagtg aaggcttgtt tccgaatttg ttggggattt taaactcccg ctgagaactc
    20521 ttgcagcact cacattctac atttacaaaa attagacaat tgcttaaaga aaaacaggga
    20581 gagagggaac ccaataatac tggtaaaatg gggaaggggg tgagggtgta ggtaggtaga
    20641 atgttgaatg tagggctcat agaataaaat tgaacctaag ctcatctgaa ttttttgggt
    20701 gggcacaaac cttggaacag tttgaggtca gggttgtcta ggaatgtagg tataaagccg
    20761 tttttgtttg tttgtttgtt ttttcatcaa gttgttttcg gaaacttcta ctcaacatgc
    20821 ctgtgtgtta ttttgtcttt tgcctaacag ctcctgggta acgtgatggt gattattctg
    20881 gctactcact ttggcaagga gttcacccct gaagtgcagg ctgcctggca gaagctggtg
    20941 tctgctgtcg ccattgccct ggcccataag taccactgag ttctcttcca gtttgcaggt
    21001 cttcctgtga ccctgacacc ctccttctgc acatggggac tgggcttggc cttgagagaa
    21061 agccttctgt ttaataaagt acattttctt cagtaatcaa aaattgcaat tttatcttct
    21121 ccatctttta ctcttgtgtt aaaaggaaaa agtgttcatg ggctgaggga tggagagaaa
    21181 cataggaaga accaagagct tccttaagaa atgtatgggg gcttgtaaaa ttaatgtgga
    21241 tgttatggga gaattcccaa gattcccaag gaggatgata tgatggagaa aaatctttat
    21301 cggggtggga aaatggttaa ttaagtggca gagactccta ggcagttttt actgcaccgg
    21361 ggaaagaagg agctgttgtg gtacctgaga aagcagattt gtggtacatg tcacttttca
    21421 ttaaaaacaa aaacaaaaca aaacaaaact tcatagatat ccaagatata ggctgagaat
    21481 tactatttta atttactctt atttacattt tgaagtagct agcttgtcac atgttttatg
    21541 aaattgattt ggagataaga tgagtgtgta tcaacaatag cctgctcttt ccatgaagga
    21601 ttccattatt tcatgggtta gctgaagcta agacacatga tatcattgtg cattatcttc
    21661 tgatacaatg taacatgcac taaaataaag ttagagttag gacctgagtg ggaaagtttt
    21721 tggagagtgt gatgaagact ttccgtggga gatagaatac taataaaggc ttaaattcta
    21781 aaaccagcaa gctagggctt cgtgacttgc atgaaactgg ctctctggaa gtagaaggga
    21841 gagtaagaca tacgtagagg actaggaaag accagatagt acagggcctg gctacaaaaa
    21901 tacaagcttt tactatgcta ttgcaatact aaacgataag cattaggatg ttaagtgact
    21961 caggaaataa gattttggga aaaagtaatc tgcttatgtg cacaaaatgg attcaagttt
    22021 gcagataaaa taaaatatgg atgatgattc aaggggacag atacaatggt tcaaacccaa
    22081 gaggagcagt gagtctgtgg aattttgaag gatggacaaa ggtggggtga gaaagacata
    22141 gtattcgacc tgactgtggg agatgagaag gaagaaggag gtgataaatg actgaaagct
    22201 cccagactgg tgaagataac aggaggaaac catgcacttg accctggtga ctctcatgtg
    22261 tgaagggtag agggatatta acagatttac tttttaggaa gtgctagatt ggtcagggag
    22321 ttttgacctt caggtcttgt gtctttcata tcaaggaacc tttgcatttt ccaagttaga
    22381 gtgccatatt ttggcaaata taactttatt agtaatttta tagtgctctc acattgatca
    22441 gactttttcc tgtgaattac ttttgaattt ggctgtatat atccagaata tgggagagag
    22501 acaaataatt attgtagttg caggctatca acaatactgg tctctctgag ccttataacc
    22561 tttcaatatg ccccataaac agagtaaaca gggattattc atggcactaa atattttcac
    22621 ctaggtcagt caacaaatgg aggcaatgtg cattttttga tacatatttt tatatattta
    22681 tggggcatgt gatacttaca tgcctagaac atgtgactga ttaagtctag atatttagga
    22741 tatccattac tttgagcatt tatcatttct atgtattgag aaaatttcaa atcctcattt
    22801 ctgaccattt tgaaatatat aataaatagt aattaactat agtcacccta ctcaaatatc
    22861 aacattataa actaactaat ccttctttcc acttttttac caaccaacat ctcttaaatc
    22921 ccctgccata cacatcacac atttttcagc tctgataact atcattctac tctcatacca
    22981 ccatgagacc acttttttag ctccacagat gaataaaaac atgtgatatt tgactttctg
    23041 tatctggctt attttattat ctatctcttt ggcataccaa gagtttgttt ttgttctgct
    23101 tcagggcttt caattaacat aatgacctct ggttccatcc atgttgctac aaatgacaag
    23161 atttcattct ttttcatggc aaaatagtac tgtgcaaaaa atacaatttt ttaatccgtt
    23221 catctgttga tagacactta ggttgatccc aaaccttaac tattgtgaat aggtgcttca
    23281 ataaacatga gtgtaatgtg tccattggat atactgattt cctttctttt ggataaataa
    23341 ccactagtga gattgctgga ttgtatgata gttctgtttt tagtttattg agaaatcttc
    23401 atactgtttt ccataatggt tgtactattt tacattccca ccaacagtgt gtaagaaaga
    23461 gttccctttt ctccatatcc tcacaaggat ctgttatttt ttgtcttttt tgttaatagc
    23521 attttaacta gagtaagtag atatctcatt gtagttttga tttgcatttc cctgatcatt
    23581 agtgatgttg agattttttc atatgtttgt tggtcatttg tatatctttt tctgagattg
    23641 tctgttcatg tccttatcct acttttattg ggattgttgt tattttcttg ataatcattg
    23701 tgtcatttta gagcctggat attattcttt tgtcagatgt atagattgtg aagattttct
    23761 cctctgtggg ttgtctgttt attctgcaga ctcttccttt tgccatgcaa aagctcttta
    23821 gtttaattta gtcccagata ttttctttgt ttttatgtgt ttgcatttgt gttcttgtca
    23881 tgaaatcctt tcctaagcca atgtgtagaa gggtttttcc gatgttattt tctagaattg
    23941 ttacagtttc aggcttagat ttaagtcctt gatccatctt aagttgattt ttgtataagg
    24001 tgagagatga agatccagtt tcattctcct acatgtagct tgccagctat cccgactcat
    24061 ttgttgaata gggtgccctt tcccatttat gtttttgttt gctttgtcaa agatcagttc
    24121 ggatgtaagt atttgagttt atttctgggt tctctattct gttccattgg tccgatgtgc
    24181 ctatttgtac accagcatca tgctgtgttt ttggtgacta tggccttatt gtatagtttg
    24241 aaatgaggta atgtaatgcc attcagattt gttctttttt ttagacttgc ttgtttattg
    24301 ggctcttttt tggttccata agaattttag gattgttttt tctagttctg tgaaggctaa
    24361 tggtggtatt tatgggaatt gcaatgcaat ttgtaggttg cttctggcat tatggccatt
    24421 ttcacaatat tgattctacc catctatgag aatggcatgt gtttccattt gtttgtgtct
    24481 tatatgatta ctatcagccg tgttttgtag ttttccttgt agatgtcttt cacctccttg
    24541 gttaggtata tattcctaag tttttgtttt gttttgtttt gttttttgca gctattgtaa
    24601 aaggggttga gttattgatt ttattctcat cttggtcatt gctggtatgt aagaaagcaa
    24661 ctcattggtg tacgttaatt ttgtatccag aaactttgct gaattatttt atcagttcta
    24721 gggggttttg gaggagtctt tagagttttc tacatacaca atcatatcat cagcaaacag
    24781 tgacagtttg actttctctt taacaatttg gatgtgcttt acttgtttct cttgtctgat
    24841 tgctcttgct aggacttcca gtaatatgtt aaagagaagt ggtgagagtg ggtatccttg
    24901 tctcattcca gttttcagac agaatgcttt taactttttc ccattcaata taatgttggc
    24961 tgtgtgttta ccatagctgg cttttattac attgaggtat gtcctttgta aaccgatttt
    25021 gctgagtttt agtcataaag tgatgttgaa ttttgttgaa tgcagtttct gtggctattg
    25081 agataatcac atgatttttg tttccaattc tctttatgtt gtgtatcaca cttattgact
    25141 tgcgtatgtt aaaccatccg tgcatccctc gcatgaaacc acttgatcat gggttttgat
    25201 atgccgtgtg ggatgctatt agctatattt tgtcaaggat gttggcatct atgttcatca
    25261 gggatattga tctgtagtgt tttttttttt tggttatgtt ctttcccagt tttggtatta
    25321 aggtgatact ggcttcatag aatgatttag ggaggattct ctctttctct atcttgtaga
    25381 atactgtcaa taggattggt atcaattctt ctttgaatgt ctggtagaat tcgaacgtct
    25441 cctttaggtt ttctagttta ttcatgtaaa ggtgttcata gtaaccttga ataatctttt
    25501 gtatttctgt ggtatcagta atagtatctc ctgttttgtt tctaactgag tttatttgca
    25561 cttctctcct cttttcttgg ttaatcttgc taatggtcta tcagttttat ttatcttttc
    25621 aaagaaccag ctttttattt catttagctt ttgtattttt ttgcagttgt tttaatttca
    25681 tttagttctc ctcttatctt agttattccc tttcttttgc tgggttttgg ttctgtttgt
    25741 ttttgtttct ctagtttctt gtggtgtgac cttatattgt ctgtcctctt tcagactctt
    25801 tgacatcgac atttagggct gtgaactttc cttttagcac catctttgct gtatcctaga
    25861 ggttttgata ggtgtgtcac tattgtcggt cagttcaagt aattttgttg ttcttattat
    25921 actttaagtt ctgggataca tgtgcagaat gtgcaggttt gttacatagg tatagatgtg
    25981 ccatggtggt ttgctgctcc catcaacctg tcatctacat taggtatttc ttttaatgtt
    26041 atccctctcc taaccccctc accccccgac aggccctggt gtgtgatgtt cccctccctg
    26101 tgtccatgtg ttctcattgt tcaactccca cttatgagtg agaacgtgtg gtgtttggtt
    26161 tctctgttcc tgtgttagtt tgctcagaat gatgtttcca ccttcaccat gtccctgcaa
    26221 agacatgaac tcatcatttt atggctgcat atattccatg gtgtatatgt gccacatttt
    26281 ctttatccat tatatcgctg atggccattt gggttggttc caagtctttg gtattgtgaa
    26341 tagtgccgca ataaacatac gtgtgcacat gtctttatag tagaatgatt tctaattctt
    26401 tgggtatata cccagtaatg ggattgctgg gtcaaacagt atttctggtt ctagatcctt
    26461 gaggaattgc cacactgtct tccacaatgg ttgaactaat ttacacaccc atcaacagtg
    26521 taaaattttt cctattcttc cacatcctct ccagcacctt ttgtttcctg actttttaat
    26581 aattgccatt ctaactggca tgagatggta tctcattgtg gttttgattt gcatttctct
    26641 aatgaccagt gatgatgagc ttcttttcat gtgtttcttg gccacataaa tgacttcttt
    26701 agagaagcat ctgttcatat cctttgtcca ctttttgatg gggtcgttag gttttttctt
    26761 gtaaatttgt tgaagttctt tgtagatttt ggatgttagc cctttgtcag atggatagat
    26821 tggcaaaaat tttctcccat tctgtaggtt gcctgttcac tctgatgata gtcttttgct
    26881 gtgcagaagc tctttagttt aattagatcc catatgtcaa ttttggcctt tgttgtcatt
    26941 gcttttgatg tttagtcgtg gaattttgcc catgcctatg tcctgaatgg tattgcctag
    27001 gttatcttct aggattttta tggttttagg ttgcacattt aagtctttaa tccaccttga
    27061 gttaattttt gtataaggtg taaggaaggg gtacagtttc agttttatgc atattgctag
    27121 ccagtttttc cagcaccatt tattaaatag ggaattcttt ctccattgct tttgtgatgt
    27181 ttgtcaaaga tcagatggtc gtagatgtgt ggcattattt ctgaggcttc tgttctgttc
    27241 cactggtcta tatatctgtt ttggtaccag taccatgctg tttttgttac tgtagccttg
    27301 tagtatagct tgaagtcagg tagcatcatg cctccagctt tgttcttttt gtttaggatt
    27361 gtcttggcta tatgggctct tttttgattc catatgacat ttaaagtagt tttttctaat
    27421 tctttgaaaa aagtcagtgg tagcttgatg gggatagcat tgaatctata aattactttg
    27481 ggcagtatgg ccattttaaa gatattgatt ctttctatct atgagcatgg aatgtttttc
    27541 catttgtttg tgtcctctct tatttccttg agcagtgagt ggtttgtagc tctccttgaa
    27601 gaggttcttc acatccctta taagttgtat ttctaggtat tttattttat tctctttgca
    27661 gcaattgtga atgggagttc acccatgatt tggctctctg cttgtctatt attggtgtat
    27721 aggaatgctt gtgatttttg cacactgatt ttgtatcttg agactttgct gaagctgttt
    27781 atcagcttaa gattttgggc tgagatgaca gggtcttcta aatatacaat catgtcatct
    27841 gcaaacagag acaatttgac ttcctctctt cctatttgaa tatgctttat ttctttctct
    27901 tgcctgattg tcctggcgag aacttccaat actatgttga gtaagagtgg cgagagggca
    27961 tccttgtctt gtgccggttt tcaaagcaaa tgatttttaa atttccgtct tgatttcatt
    28021 gttgacccaa tgatcattca ggagcaggtt atttaatttc cctgtatttg catggttttg
    28081 aaggttcctt ttgtagttga tttccaattt tattctactg tggtctgaga gagtgcttga
    28141 tataatttca atttttaaaa atttattgag gcttgttttg tggcatatca tatggcctat
    28201 cttggagaaa gttccatgtg ctgatgaata gaatgtgtat tctgcagttg ttgggtagaa
    28261 tgtcctgtaa atatctgtta agtccatttg ttctttaaat ccattgtttc tttgtagact
    28321 gtcttgatga cctgcctagt gcagtcagtg gagtattgaa gtcccccact attattatgt
    28381 tgctgtctag tagtaattgt tttataaatt tgggatctcc agtattagat gcatatatat
    28441 taagaattgt aatattctcc cattggacaa gggcttttat cattatatga tgtccctctt
    28501 tgtctttttt aactgctgtt tctttaaagt ttgttttgtc tgacataaga atagctgctt
    28561 tggctcgctt ttggtgtcca tttgtgtgga atgtcatttt ccaccccttt accttaagtt
    28621 tatgtgagtc cttatgtgtt aggtgagtct cctgaaggcg gcagataact ggttggtgaa
    28681 ttctattcat tctgcaattc tgtatctttt aagtggagca tttagtccat ttacattcaa
    28741 catcagtatt gaggtgtgag gtgactattc cattcttcgt ggtatttgtt gcctgtgtat
    28801 ctttttatct gtatttttgt tgtatatgtc ctatgggatt tatgctttaa agaggttctg
    28861 ttttgatgtg cttccagggt ttatttcaag atttagagct ccttttatca ttcttgtagt
    28921 gttggcttgg tagtgccgaa ttctctcagc atttgttttt ctgaaaaaca ctgtgtattt
    28981 tcttcatttg tgaagcttag tttcactgga tataaaattc ttggctgata attgttttgt
    29041 ttaagaaggc tgaagatagg gccatattca cttctagctt ttacggtttc tgctgagaaa
    29101 tctgctgtta atctgatagg ttttctttca taggttacct ggtagtttca cctcacagct
    29161 cttaagattc tctttgtctt tagataactt tggatactct gatgacaatg tacctaggca
    29221 atgatatttt tgcaatgaat ttcccaggtg tttattgagc ttctttgtat ttggatatct
    29281 aggtctctag caaggagggg gaagttttcc ttgattattt ccatggacaa gttttccaaa
    29341 cttttagatt tctcttcttt ctcaggaatg ctgattattc ttaggtttga ttgtttaaca
    29401 taatcccaga tttcttggag gctttgttca tattttctta ttcttttttc tttgtctttg
    29461 ttggattggg taattcaaaa actttgtctt caagctctga atttcttctg cttggattct
    29521 attgctgaga ctttctagag cattttgcat ttctataagt gcatccattc atccattgtt
    29581 tcctgaagtt ttgaatgttt tttatttatg ctatctcttt aactgaagat ttctcccctc
    29641 atttcttgta tcatattttt ggttttttta aaattggact tcaccttcct cggatgcctc
    29701 cttgattagc ttaataactg accttctgaa ttatttttca ggtaaatcag ggatttcttc
    29761 ttggtttgga tgcattgctg gtgagctagt atgatttttt ggggggtgtt aaagaacctt
    29821 gtttttcata ttaccagagt tagttttctg gttccttctc acttgggtag gctctgtcag
    29881 agggaaagtc taggcctcaa ggctgagact tttgtcccag caggtgttcc cttgatgtag
    29941 cacagtcccc cttttcctag gacgtggggc ttcctgagag ccgaactgta gtgattgtta
    30001 tctctcttct ggatctagcc acccatcagg tctaccagac tccaggctgg tactggggtt
    30061 tgtctgcaca gagtcttgtg acgtgaacca tctgtgggtc tctcagccat agatacaacc
    30121 acctgctcca atggaggtgg tagaggatga aatgaactct gtgagggtcc ttacttttgg
    30181 ttgttcaatg cactatcttt ttgtgctggt tggcctcctg ccaggaggtg gcactttcta
    30241 gaaagcatca gcagaggcag tcaggtggtg gtggctgggg gggctggggc actagaactc
    30301 ccaagaatat atgccctttg tcttcagcta ctagggtgag taaggaagga ccatcaggtg
    30361 ggggcaggac tagtcgtgtc tgagctcaga gtctccttgg gcaggtcttt ctgtggctac
    30421 tgtgggagga tgggggtgta gtttccaggt caatggattt atgttcctag gacaattatg
    30481 gctgcctctg ctgtgtcatg caggtcatca ggaaagtggg ggaaagcaag cagtcacgtg
    30541 acttgcccag ctcccatgca actcaaaagg ttggtctcac ttccagcgtg caccctcccc
    30601 cgcaacagct ccgaatctgt ttccatgcag tcagtgagca aggctgagaa cttgcccagg
    30661 ctaccagctg cgaaaccaag tagggctgtc ctacttccct gccagtggag tctgcacacc
    30721 aaattcatgt ccccccacca acccccccac tgcccagccc ctagatctgg ccaggtggag
    30781 attttctttt tcctgtctct tttcccagtt cctctggcag ccctcccaaa tgacccctgt
    30841 gaggcaaggc agaaatggct tcctagggga cccagagagc ccacagggct tttcccgctg
    30901 cttcctctac ccctgtattt tgcttggccc tctaaattga ctcagctcca ggtaaggtca
    30961 gaatcttctc ctgtggtcta gatcttcagg ttcccagtga ggatgtgtgt ttgggggtag
    31021 acggtccccc ttttccactt ccacagtttg ggcactcaca atatttgggg tgtttcccgg
    31081 gtcctacatg agcaatctgc ttctttcaga gggtgtgtgc gttctctcag ctttcttgaa
    31141 tttatttctg caggtggttc tgcaaaaaaa attcctgatg ggagacttca catgctgctc
    31201 tgtgcatccg agtgggagct gcaatgtact tctgctgcca cccatctgcc atcaccctct
    31261 aatttgtcgg taatatgcat ttttaatcaa tctttttttc tctctctctc ttttcttctc
    31321 ccccaaaact atactgccct ttgatatcaa ggaatcaagg ccgtgatgtt gaggggtggg
    31381 cagtggatac actctttacc ccttagggag catatctaga tttagatatt gccaattcaa
    31441 gataacttaa ttgaaagcaa attcataatg aatacacaca cacacacaca catctgcatg
    31501 acaagatttt taatagttga aagaataact aataattgtc cacaggcaat aagggctttt
    31561 taagcaaaac agttgtgata aaacaggtca ttcttagaat agtaatccag ccaatagtac
    31621 aggttgctta gagattatga cattaccaga gttaaaattc aataatggct tctcactccc
    31681 taccactgag gacaagttta tgtccttagg tttatgcttc cctgaaacaa taccacctgc
    31741 tattctccac tttacatatc aacggcactg gttctttatc taactctctg gcacagcagg
    31801 agtttgtttt cttctgcttc agagctttga atttactatt tcagcttcta aactttattt
    31861 gcaatgcctt cccatggcag actccttctg tcattttgcc tctgttcgaa aactttttcc
    31921 ttaatttcat tcttagttaa taatatctga aattattttg ttgtttaact taattattaa
    31981 ttttatgtat gttctaccta gatataatct tctagaggat tgttttattc tctgacttat
    32041 ttaacttaaa tgcccactac ctttaaaaat tatgacattt atttaacaga tatttgctga
    32101 acaaatgttt gaaaatacat gggaaagaat gcttgaaaac acttgaaatt gcttgtgtaa
    32161 agaaacagtt ttatcagtta ggatttaatc aatgtcagaa gcaatgatat aggaaaaatc
    32221 gaggaataag acagttatgg ataaggagaa atcaacaaac tcttaaaaga tattgcctca
    32281 aaagcataag aggaaataag ggtttataca tgacttttag aacactgcct gggtttttgg
    32341 ataaatgggg aagttgttgg aaaacaggag ggatcctaga tattccttag tctgaggagg
    32401 agcaattaag attcacttgt ttagaggctg ggagtggtgg ctcacgcctg taatcccaga
    32461 attttgggag gccaaggcag gcagatcacc tgaggtcaag agttcaagac caacctggcc
    32521 aacatggtga aatcccatct ctacaaaaat acaaaaatta gacaggcatg atggcaagtg
    32581 cctgtaatcc cagctacttg ggaggctgag gaaggagaat tgcttgaacc tggaaggcag
    32641 gagttgcagt gagccgagat cataccactg cactccagcc tgggtgacag aacaagactc
    32701 tgtctcaaaa aaaaaaaaga gagattcaaa agattcactt gtttaggcct tagcgggctt
    32761 agacaccagt ctctgacaca ttcttaaagg tcaggctcta caaatggaac ccaaccagac
    32821 tctcagatat ggccaaagat ctatacacac ccatctcaca gatcccctat cttaaagaga
    32881 ccctaatttg ggttcacctc agtctctata atctgtacca gcataccaat aaaaatcttt
    32941 ctcacccatc cttagattga gagaagtcac ttattattat gtgagtaact ggaagatact
    33001 gataagttga caaatctttt tctttccttt cttattcaac ttttatttta acttccaaag
    33061 aacaagtgca atatgtgcag ctttgttgcg caggtcaaca tgtatctttc tggtctttta
    33121 gccgcctaac actttgagca gatataagcc ttacacagga ttatgaagtc tgaaaggatt
    33181 ccaccaatat tattataatt cctatcaacc tgataagtta ggggaaggta gagctctcct
    33241 ccaataagcc agatttccag agtttctgac gtcataatct accaaggtca tggatcgagt
    33301 tcagagaaaa aacaaaagca aaaccaaacc taccaaaaaa taaaaatccc aaagaaaaaa
    33361 taaagaaaaa aacagcatga atacttcctg ccatgttaag tggccaatat gtcagaaaca
    33421 gcactgagtt acagataaag atgtctaaac tacagtgaca tcccagctgt cacagtgtgt
    33481 ggactattag tcaataaaac agtccctgcc tcttaagagt tgttttccat gcaaatacat
    33541 gtcttatgtc ttagaataag attccctaag aagtgaacct agcatttata caagataatt
    33601 aattctaatc catagtatct ggtaaagagc attctaccat catctttacc gagcatagaa
    33661 gagctacacc aaaaccctgg gtcatcagcc agcacataca cttatccagt gataaataca
    33721 catcatcggg tgcctacata catacctgaa tataaaaaaa atacttttgc tgagatgaaa
    33781 caggcgtgat ttatttcaaa taggtacgga taagtagata ttgaagtaag gattcagtct
    33841 tatattatat tacataacat taatctattc ctgcactgaa actgttgctt tataggattt
    33901 ttcactacac taatgagaac ttaagagata atggcctaaa accacagaga gtatattcaa
    33961 gaataagtat agcacttctt atttggaaac caatgcttac taaatgagac taagacgtgt
    34021 cccatcaaaa atcctggacc tatgcctaaa acacatttca caatccctga acttttcaaa
    34081 aattggtaca tgctttaact ttaaactaca ggcctcactg gagctacaga caagaaggtg
    34141 aaaaacggct gacaaaagaa gtcctggtat cttctatggt gggagaagaa aactagctaa
    34201 agggaagaat aaattagaga aaaattggaa tgactgaatc ggaacaaggc aaaggctata
    34261 aaaaaaatta agcagcagta tcctcttggg ggccccttcc ccacactatc tcaatgcaaa
    34321 tatctgtctg aaacggttcc tggctaaact ccacccatgg gttggccagc cttgccttga
    34381 ccaatagcct tgacaaggca aacttgacca atagtcttag agtatccagt gaggccaggg
    34441 gccggcggct ggctagggat gaagaataaa aggaagcacc cttcagcagt tccacacact
    34501 cgcttctgga acgtctgagg ttatcaataa gctcctagtc cagacgccat gggtcatttc
    34561 acagaggagg acaaggctac tatcacaagc ctgtggggca aggtgaatgt ggaagatgct
    34621 ggaggagaaa ccctgggaag gtaggctctg gtgaccagga caagggaggg aaggaaggac
    34681 cctgtgcctg gcaaaagtcc aggtcgcttc tcaggatttg tggcaccttc tgactgtcaa
    34741 actgttcttg tcaatctcac aggctcctgg ttgtctaccc atggacccag aggttctttg
    34801 acagctttgg caacctgtcc tctgcctctg ccatcatggg caaccccaaa gtcaaggcac
    34861 atggcaagaa ggtgctgact tccttgggag atgccataaa gcacctggat gatctcaagg
    34921 gcacctttgc ccagctgagt gaactgcact gtgacaagct gcatgtggat cctgagaact
    34981 tcaaggtgag tccaggagat gtttcagcac tgttgccttt agtctcgagg caacttagac
    35041 aactgagtat tgatctgagc acagcagggt gtgagctgtt tgaagatact ggggttggga
    35101 gtgaagaaac tgcagaggac taactgggct gagacccagt ggcaatgttt tagggcctaa
    35161 ggagtgcctc tgaaaatcta gatggacaac tttgactttg agaaaagaga ggtggaaatg
    35221 aggaaaatga cttttcttta ttagatttcg gtagaaagaa ctttcacctt tcccctattt
    35281 ttgttattcg ttttaaaaca tctatctgga ggcaggacaa gtatggtcgt taaaaagatg
    35341 caggcagaag gcatatattg gctcagtcaa agtggggaac tttggtggcc aaacatacat
    35401 tgctaaggct attcctatat cagctggaca catataaaat gctgctaatg cttcattaca
    35461 aacttatatc ctttaattcc agatgggggc aaagtatgtc caggggtgag gaacaattga
    35521 aacatttggg ctggagtaga ttttgaaagt cagctctgtg tgtgtgtgtg tgtgtgtgcg
    35581 cgcgtgtgtt tgtgtgtgtg tgagagcgtg tgtttctttt aacgttttca gcctacagca
    35641 tacagggttc atggtggcaa gaagataaca agatttaaat tatggccagt gactagtgct
    35701 gcaagaagaa caactacctg catttaatgg gaaagcaaaa tctcaggctt tgagggaagt
    35761 taacataggc ttgattctgg gtggaagctt ggtgtgtagt tatctggagg ccaggctgga
    35821 gctctcagct cactatgggt tcatctttat tgtctccttt catctcaaca gctcctggga
    35881 aatgtgctgg tgaccgtttt ggcaatccat ttcggcaaag aattcacccc tgaggtgcag
    35941 gcttcctggc agaagatggt gactggagtg gccagtgccc tgtcctccag ataccactga
    36001 gctcactgcc catgatgcag agctttcaag gataggcttt attctgcaag caatacaaat
    36061 aataaatcta ttctgctaag agatcacaca tggttgtctt cagttctttt ttttatgtct
    36121 ttttaaatat atgagccaca aagggtttta tgttgaggga tgtgtttatg tgtatttata
    36181 catggctatg tgtgtttgtg tcatgtgcac actccacact tttttgttta cgttagatgt
    36241 gggttttgat gagcaaataa aagaactagg caataaagaa acttatacat gggagcgtct
    36301 gcaagtggga gtaaaaggtg caggagaaat ctggttggaa gaaagacctc tataggacag
    36361 gactcctcag aaacagatgt tttggaagag atggggaaag gttcagtgaa gggggctgaa
    36421 cccccttccc tggattgcag cacagcagcg aggaaggggc tcaacgaaga aaaagtgttc
    36481 caagctttag gaagtcaagg tttaggcagg gatagccatt ctattttatt aggggcaata
    36541 ctatttccaa cggcatctgg cttttctcag cccttgtgag gctctacggg gaggttgagg
    36601 tgttagagat cagagcagga aacaggtttt tctttccacg gtaactacaa tgaagtgatc
    36661 cttactttac taaggaactt tttcatttta agtgttgacg catgcctaaa gaggtgaaat
    36721 taatcccata cccttaagtc tacagactgg tcacagcatt tcaaggagga gacctcattg
    36781 taagcttcta gggaggtggg gacctaggtg aaggaaatga gccagcagaa gctcacaagt
    36841 cagcatcagc gtgtcatgtc tcagcagcag aacagcacgg tcagatgaaa atatagtgtg
    36901 aagaatttgt ataacattaa ttgagaaggc agattcactg gagttcttat ataattgaaa
    36961 gttaatgcac gttaataagc aagagtttag tttaatgtga tggtgttatg aacttaacgc
    37021 ttgtgtctcc agaaaattca catgctgaat ccccaactcc caattggctc catttgtggg
    37081 ggaggctttg gaaaagtaat caggtttaga ggagctcatg agagcagatc cccatcatag
    37141 aattattttc ctcatcagaa gcagagagat tagccatttc tcttccttct ggtgaggaca
    37201 cagtgggaag tcagccacct gcaacccagg aagagagccc tgaccaggaa ccagcagaaa
    37261 agtgagaaaa aatcctgttg ttgaagtcac ccagtctatg ctattttgtt atagcacctt
    37321 gcactaagta aggcagatga agaaagagaa aaaaataagc ttcggtgttc agtggattag
    37381 aaaccatgtt tatctcaggt ttacaaatct ccacttgtcc tctgtgtttc agaataaaat
    37441 accaactcta ctactctcat ctgtaagatg caaatagtaa gcctgatccc ttctgtctaa
    37501 cttcgaattc tattttttct tcaacgtact ttaggcttgt aatgtgttta tatacagtga
    37561 aatgtcaagt tctttcttta tatttctttc tttctttttt ttcctcagcc tcagagtttt
    37621 ccacatgccc ttcctacctt caggaacttc tttctccaaa cgtcttctgc ctggcctcca
    37681 ttcaaatcat aaaggaccca cttcaaatgc catcactcac taccatttca caattcgcac
    37741 tttctttctt tgtccttttt ttttttagta aaacaagttt ataaaaaatt gaaggaataa
    37801 atgaatggct acttcatagg cagagtagac acaagggcta ctggttgccg atttttattg
    37861 ttatttttca atagtatgct aaacaagggg tagattattt atgctgccca tttttagacc
    37921 ataaaagata acttcctgat gttgccatgg catttttttt ccttttaatt ttatttcatt
    37981 tcattttaat ttcgaaggta catgtgcagg atgtgcaggc ttgttacatg ggtaaatgtg
    38041 tgtctttctg gccttttagc catctgtatc aatgagcaga tataagcttt acacaggatc
    38101 atgaaggatg aaagaatttc accaatatta taataatttc aatcaacctg atagcttagg
    38161 ggataaacta atttgaagat acagcttgcc tccgataagc cagaattcca gagcttctgg
    38221 cattataatc tagcaaggtt agagatcatg gatcactttc agagaaaaac aaaaacaaac
    38281 taaccaaaag caaaacagaa ccaaaaaacc tccataaata cttcctaccc agttaatggt
    38341 ccaatatgtc agaaacagca ctgtgttaga aataaagctg tctaaagtac actaatattc
    38401 gagttataat agtgtgtgga ctattagtca ataaaaacaa cccttgcctc tttagagttg
    38461 ttttccatgt acacgcacat cttatgtctt agagtaagat tccctgagaa gtgaacctag
    38521 catttataca agataattaa ttctaatcca cagtacctgc caaagaacat tctaccatca
    38581 tctttactga gcatagaaga gctacgccaa aaccctgggt catcagccag cacacacact
    38641 tatccagtgg taaatacaca tcatctggtg tatacataca tacctgaata tggaatcaaa
    38701 tatttttcta agatgaaaca gtcatgattt atttcaaata ggtacggata agtagatatt
    38761 gaggtaagca ttaggtctta tattatgtaa cactaatcta ttactgcgct gaaactgtgg
    38821 tctttatgaa aattgttttc actacactat tgagaaatta agagataatg gcaaaagtca
    38881 caaagagtat attcaaaaag aagtatagca ctttttcctt agaaaccact gctaactgaa
    38941 agagactaag atttgtcccg tcaaaaatcc tggacctatg cctaaaacac atttcacaat
    39001 ccctgaactt ttcaaaaatt ggtacatgct ttagctttaa actacaggcc tcactggagc
    39061 tacagacaag aaggtaaaaa acggctgaca aaagaagtcc tggtatcctc tatgatggga
    39121 gaaggaaact agctaaaggg aagaataaat tagagaaaaa ctggaatgac tgaatcggaa
    39181 caaggcaaag gctataaaaa aaattaagca gcagtatcct cttgggggcc ccttccccac
    39241 actatctcaa tgcaaatatc tgtctgaaac ggtccctggc taaactccac ccatgggttg
    39301 gccagccttg ccttgaccaa tagccttgac aaggcaaact tgaccaatag tcttagagta
    39361 tccagtgagg ccaggggccg gcggctggct agggatgaag aataaaagga agcacccttc
    39421 agcagttcca cacactcgct tctggaacgt ctgagattat caataagctc ctagtccaga
    39481 cgccatgggt catttcacag aggaggacaa ggctactatc acaagcctgt ggggcaaggt
    39541 gaatgtggaa gatgctggag gagaaaccct gggaaggtag gctctggtga ccaggacaag
    39601 ggagggaagg aaggaccctg tgcctggcaa aagtccaggt cgcttctcag gatttgtggc
    39661 accttctgac tgtcaaactg ttcttgtcaa tctcacaggc tcctggttgt ctacccatgg
    39721 acccagaggt tctttgacag ctttggcaac ctgtcctctg cctctgccat catgggcaac
    39781 cccaaagtca aggcacatgg caagaaggtg ctgacttcct tgggagatgc cataaagcac
    39841 ctggatgatc tcaagggcac ctttgcccag ctgagtgaac tgcactgtga caagctgcat
    39901 gtggatcctg agaacttcaa ggtgagtcca ggagatgttt cagcactgtt gcctttagtc
    39961 tcgaggcaac ttagacaact gagtattgat ctgagcacag cagggtgtga gctgtttgaa
    40021 gatactgggg ttgggagtga agaaactgca gaggactaac tgggctgaga cccagtggca
    40081 atgttttagg gcctaaggag tgcctctgaa aatctagatg gacaactttg actttgagaa
    40141 aagagaggtg gaaatgagga aaatgacttt tctttattag atttcggtag aaagaacttt
    40201 cacctttccc ctatttttgt tattcgtttt aaaacatcta tctggaggca ggacaagtat
    40261 ggtcgttaaa aagatgcagg cagaaggcat atattggctc agtcaaagtg gggaactttg
    40321 gtggccaaac atacattgct aaggctattc ctatatcagc tggacacata taaaatgctg
    40381 ctaatgcttc attacaaact tatatccttt aattccagat gggggcaaag tatgtccagg
    40441 ggtgaggaac aattgaaaca tttgggctgg agtagatttt gaaagtcagc tctgtgtgtg
    40501 tgtgtgtgtg tgtgtgtgtc agcgtgtgtt tcttttaacg tcttcagcct acaacataca
    40561 gggttcatgg tgggaagaag atagcaagat ttaaattatg gccagtgact agtgcttgaa
    40621 ggggaacaac tacctgcatt taatgggaag gcaaaatctc aggctttgag ggaagttaac
    40681 ataggcttga ttctgggtgg aagctgggtg tgtagttatc tggaggccag gctggagctc
    40741 tcagctcact atgggttcat ctttattgtc tcctttcatc tcaacagctc ctgggaaatg
    40801 tgctggtgac cgttttggca atccatttcg gcaaagaatt cacccctgag gtgcaggctt
    40861 cctggcagaa gatggtgact gcagtggcca gtgccctgtc ctccagatac cactgagcct
    40921 cttgcccatg attcagagct ttcaaggata ggctttattc tgcaagcaat acaaataata
    40981 aatctattct gctgagagat cacacatgat tttcttcagc tctttttttt acatcttttt
    41041 aaatatatga gccacaaagg gtttatattg agggaagtgt gtatgtgtat ttctgcatgc
    41101 ctgtttgtgt ttgtggtgtg tgcatgctcc tcatttattt ttatatgaga tgtgcatttt
    41161 gatgagcaaa taaaagcagt aaagacactt gtacacggga gttctgcaag tgggagtaaa
    41221 tggtgttgga gaaatccggt gggaagaaag acctctatag gacaggactt ctcagaaaca
    41281 gatgttttgg aagagatggg aaaaggttca gtgaagacct gggggctgga ttgattgcag
    41341 ctgagtagca aggatggttc ttaatgaagg gaaagtgttc caagctttag gaattcaagg
    41401 tttagtcagg tgtagcaatt ctattttatt aggaggaata ctatttctaa tggcacttag
    41461 cttttcacag cccttgtgga tgcctaagaa agtgaaatta atcccatgcc ctcaagtgtg
    41521 cagattggtc acagcatttc aagggagaga cctcattgta agactctggg ggaggtgggg
    41581 acttaggtgt aagaaatgaa tcagcagagg ctcacaagtc agcatgagca tgttatgtct
    41641 gagaaacaga ccagcactgt gagatcaaaa tgtagtggga agaatttgta caacattaat
    41701 tggaaggttt acttaatgga atttttgtat agttggatgt tagtgcatct ctataagtaa
    41761 gagtttaata tgatggtgtt acggacctgg tgtttgtgtc tcctcaaaat tcacatgctg
    41821 aatccccaac tcccaactga ccttatctgt gggggaggct tttgaaaagt aattaggttt
    41881 agctgagctc ataagagcag atccccatca taaaattatt ttccttatca gaagcagaga
    41941 gacaagccat ttctctttcc tcccggtgag gacacagtga gaagtccgcc atctgcaatc
    42001 caggaagaga accctgacca cgagtcagcc ttcagaaatg tgagaaaaaa ctctgttgtt
    42061 gaagccaccc agtcttttgt attttgttat agcaccttac actgagtaag gcagatgaag
    42121 aaggagaaaa aaataagctt gggttttgag tgaactacag accatgttat ctcaggtttg
    42181 caaagctccc ctcgtcccct atgtttcagc ataaaatacc tactctacta ctctcatcta
    42241 taagacccaa ataataagcc tgcgcccttc tctctaactt tgatttctcc tatttttact
    42301 tcaacatgct ttactctagc cttgtaatgt ctttacatac agtgaaatgt aaagttcttt
    42361 attctttttt tctttctttc ttttttctcc tcagcctcag aatttggcac atgcccttcc
    42421 ttctttcagg aacttctcca acatctctgc ctggctccat catatcataa aggtcccact
    42481 tcaaatgcag tcactaccgt ttcaggatat gcactttctt tcttttttgt tttttgtttt
    42541 ttttaagtca aagcaaattt cttgagagag taaagaaata aacgaatgac tactgcatag
    42601 gcagagcagc cccgagggcc gctggttgtt ccttttatgg ttatttcttg atgatatgtt
    42661 aaacaagttt tggattattt atgccttctc tttttaggcc atatagggta actttctgac
    42721 attgccatgg catgtttctt ttaatttaat ttactgttac cttaaattca ggggtacacg
    42781 tacaggatat gcaggtttgt tttataggta aaagtgtgcc atggttttaa tgggtttttt
    42841 ttttcttgta aagttgttta agtttcttgt ttactctgga tattggcctt tgtcagaaga
    42901 atagattgga aaatcttttt cccattctgt agattgtctt tcgctctgat ggtagtttct
    42961 tttgctgagc aggagctctt tagtttaatt agattccatt ggtcaatttt tgcttttgct
    43021 gcaattgctt ttcacgcttt catcatgaaa tctgtgcccg tgtttatatc atgaatagta
    43081 ttgccttgat ttttttctag gctttttata gtttggggtt tttcatttaa gtctctaatc
    43141 catccggagt taattttgga taaggtataa ggaaggagtc cagtttcatt tttcagcata
    43201 tggctagcca gttctccccc atcatttatt aaattgaaaa tcctttcccc attgcttgct
    43261 tttgtcaggt ttctaaaaga cagatggttg taggtacaat atgcagtttc ttcaagtcat
    43321 ataataccat ctgaaatctc ttattaattc atttctttta gtatgtatgc tggtctcctc
    43381 tgctcactat agtgagggca ccattagcca gagaatctgt ctgtctagtt catgtaagat
    43441 tctcagaatt aagaaaaatg gatggcatat gaatgaaact tcatggatga catatggaat
    43501 ctaatgtgta tttgttgaat taatgcataa gatgcaacaa gggaaaggtt gacaactgca
    43561 gtgataacct ggtattgatg atataagagt ctatagatca cagtagaagc aataatcatg
    43621 gaaaacaatt ggaaatgggg aacagccaca aacaagaaag aatcaatact accaggaaag
    43681 tgactgcagg tcacttttcc tggagcgggt gagagaaaag tggaagttgc agtaactgcc
    43741 gaattcctgg ttggctgatg gaaagatggg gcaactgttc actggtacgc agggttttag
    43801 atgtatgtac ctaaggatat gaggtatggc aatgaacaga aattcttttg ggaatgagtt
    43861 ttagggccat taaaggacat gacctgaagt ttcctctgag gccagtcccc acaactcaat
    43921 ataaatgtgt ttcctgcata tagtcaaagt tgccacttct ttttcttcat atcatcgatc
    43981 tctgctctta aagataatct tggttttgcc tcaaactgtt tgtcactaca aactttcccc
    44041 atgttcctaa gtaaaacagg taactgcctc tcaactatat caagtagact aaaatattgt
    44101 gtctctaata tcagaaattc agctttaata tattgggttt aactctttga aatttagagt
    44161 ctccttgaaa tacacatggg ggtgatttcc taaactttat ttcttgtaag gatttatctc
    44221 aggggtaaca cacaaaccag catcctgaac ctctaagtat gaggacagta agccttaaga
    44281 atataaaata aactgttctt ctctctgccg gtggaagtgt gccctgtcta ttcctgaaat
    44341 tgcttgtttg agacgcatga gacgtgcagc acatgagaca cgtgcagcag cctgtggaat
    44401 attgtcagtg aagaatgtct ttgcctgatt agatataaag acaagttaaa cacagcatta
    44461 gactatagat caagcctgtg ccagacacaa atgacctaat gcccagcacg ggccacggaa
    44521 tctcctatcc tcttgcttga acagagcagc acacttctcc cccaacacta ttagatgttc
    44581 tggcataatt ttgtagatat gtaggatttg acatggacta ttgttcaatg attcagagga
    44641 aatctccttt gttcagataa gtacactgac tactaaatgg attaaaaaac acagtaataa
    44701 aacccagttt tccccttact tccctagttt gtttcttatt ctgctttctt ccaagttgat
    44761 gctggataga ggtgtttatt tctattctaa aaagtgatga aattggccgg gcgcggtggc
    44821 tcacacctgt aatcccagca ctttgggagg ctgaggtggg cggatcacga ggtcaggaga
    44881 tcaagaccat cctggctaac atggtgaaac cccatctcta ctaaaaatac aaaaaattag
    44941 ccagagacgg tggcgggtgc ctgtagtccc agctactcgg gaggctgagg caggagaatg
    45001 gcgtgaacct gggaggcaga gctgcagtga gcagagatcg cgccactgca cactccagcc
    45061 tgggtgacaa agcgagactc catctcaaaa aaaaaaaaaa aaaaaaaaag aaagaaagaa
    45121 agaaaaaaaa agtgatgaaa ttgtgtattc aatgtagtct caagagaatt gaaaaccaag
    45181 aaaggctgtg gcttcttcca cataaagcct ggatgaataa caggataaca cgttgttaca
    45241 ttgtcacaac tcctgatcca ggaattgatg gctaagatat tcgtaattct tatccttttc
    45301 agttgtaact tattcctatt tgtcagcatt caggttatta gcggctgctg gcgaagtcct
    45361 tgagaaataa actgcacact ggatggtggg ggtagtgtag gaaaatggag gggaaggaag
    45421 taaagtttca aattaagcct gaacagcaaa gttcccctga gaaggccacc tggattctat
    45481 cagaaactcg aatgtccatc ttgcaaaact tccttgccca aaccccaccc ctggagtcac
    45541 aacccaccct tgaccaatag attcatttca ctgagggagg caaagggctg gtcaatagat
    45601 tcatttcact gggagaggca aagggctggg ggccagagag gagaagtaaa aagccacaca
    45661 tgaagcagca atgcaggcat gcttctggct catctgtgat caccaggaaa ctcccagatc
    45721 tgacactgta gtgcatttca ctgctgacaa gaaggctgct gccaccagcc tgtgaagcaa
    45781 ggttaaggtg agaaggctgg aggtgagatt ctgggcaggt aggtactgga agccgggaca
    45841 aggtgcagaa aggcagaaag tgtttctgaa agagggatta gcccgttgtc ttacatagtc
    45901 tgactttgca cctgctctgt gattatgact atcccacagt ctcctggttg tctacccatg
    45961 gacctagagg tactttgaaa gttttggata tctgggctct gactgtgcaa taatgggcaa
    46021 ccccaaagtc aaggcacatg gcaagaaggt gctgatctcc ttcggaaaag ctgttatgct
    46081 cacggatgac ctcaaaggca cctttgctac actgagtgac ctgcactgta acaagctgca
    46141 cgtggaccct gagaacttcc tggtgagtag taagtacact cacgctttct tctttaccct
    46201 tagatatttg cactatgggt acttttgaaa gcagaggtgg ctttctcttg tgttatgagt
    46261 cagctatggg atatgatatt tcagcagtgg gattttgaga gttatgttgc tgtaaataac
    46321 ataactaaaa tttggtagag caaggactat gaataatgga aggccactta ccatttgata
    46381 gctctgaaaa acacatctta taaaaaattc tggccaaaat caaactgagt gttttggatg
    46441 agggaacaga agttgagata gagaaaataa catctttcct ttggtcagcg aaattttcta
    46501 taaaaattaa tagtcacttt tctgcatagt cctggaggtt agaaaaagat caactgaaca
    46561 aagtagtggg aagctgttaa aagaggattg tttccctccg aatgatgatg gtatactttt
    46621 gtacgcatgg tacaggattc tttgttatga gtgtttggga aaattgtatg tatgtatgta
    46681 tgtatgtgat gactggggac ttatcctatc cattactgtt ccttgaagta ctattatcct
    46741 actttttaaa aggacgaagt ctctaaaaaa aaaatgaaac aatcacaata tgttggggta
    46801 gtgagttggc atagcaagta agagaaggat aggacacaat gggaggtgca gggctgccag
    46861 tcatattgaa gctgatatct agcccataat ggtgagagtt gctcaaactc tggtcaaaaa
    46921 ggatgtaagt gttatatcta tttactgcaa gtccagcttg aggccttcta ttcactatgt
    46981 accattttct tttttatctt cactccctcc ccagctctta ggcaacgtga tattgattgt
    47041 tttggcaacc cacttcagcg aggattttac cctacagata caggcttctt ggcagtaact
    47101 aacaaatgct gtggttaatg ctgtagccca caagaccact gagttccctg tccactatgt
    47161 ttgtacctat gtcccaaaat ctcatctcct ttagatgggg gaggttgggg agaagagcag
    47221 tatcctgcct gctgattcag ttcctgcatg ataaaaatag aataaagaaa tatgctctct
    47281 aagaaatatc attgtactct ttttctgtct ttatatttta ccctgattca gccaaaagga
    47341 cgcactattt ctgatggaaa tgagaatgtt ggagaatggg agtttaagga cagagaagat
    47401 actttcttgc aatcctgcaa gaaaagagag aactcgtggg tggatttagt ggggtagtta
    47461 ctcctaggaa ggggaaatcg tctctagaat aagacaatgt ttttacagaa agggaggtca
    47521 atggaggtac tctttggagg tgtaagagga ttgttggtag tgtgtagagg tatgttagga
    47581 ctcaaattag aagttctgta taggctatta tttgtatgaa actcaggata tagctcattt
    47641 ggtgactgca gttcacttct acttatttta aacaacatat tttttatgat ttataatgaa
    47701 gtggggatgg ggcttcctag agaccaatca agggccaaac cttgaacttt ctcttaacgt
    47761 cttcaatggt attaatagag aattatctct aaggcatgtg aactggctgt cttggttttc
    47821 atctgtactt catctgctac ctctgtgacc tgaaacatat ttataattcc attaagctgt
    47881 gcatatgata gatttatcat atgtattttc cttaaaggat ttttgtaaga actaattgaa
    47941 ttgatacctg taaagtcttt atcacactac ccaataaata ataaatctct ttgttcagct
    48001 ctctgtttct ataaatatgt acaagtttta ttgtttttag tggtagtgat tttattctct
    48061 ttctatatat atacacacac atgtgtgcat tcataaatat atacaatttt tatgaataaa
    48121 aaattattag caatcaatat tgaaaaccac tgatttttgt ttatgtgagc aaacagcaga
    48181 ttaaaaggct gagatttagg aaacagcacg ttaagtcaag ttgatagagg agaatatgga
    48241 catttaaaag aggcaggatg atataaaatt agggaaactg gatgcagaga ccagatgaag
    48301 taagaaaaat agctatcgtt ttgagcaaaa atcactgaag tttcttgcat atgagagtga
    48361 cataataaat agggaaacgt agaaaattga ttcacatgta tatatatata tagaactgat
    48421 tagacaaagt ctaacttggg tatagtcaga ggagcttgct gtaattatat tgaggtgatg
    48481 gataaagaac tgaagttgat ggaaacaatg aagttaagaa aaaaaatcga gtaagagacc
    48541 attgtggcag tgattgcaca gaactggaaa acattgtgaa acagagagtc agagatgaca
    48601 gctaaaatcc ctgtctgtga atgaaaagaa ggaaatttat tgacagaaca gcaaatgcct
    48661 acaagccccc tgtttggatc tggcaatgaa cgtagccatt ctgtggcaat cacttcaaac
    48721 tcctgtaccc aagaccctta ggaagtatgt agcaccctca aacctaaaac ctcaaagaaa
    48781 gaggttttag aagatataat accctttctt ctccagtttc attaatccca aaacctcttt
    48841 ctcaaagtat ttcctctatg tgtccacccc aaagagctca cctcaccata tctcttgagt
    48901 gggagcacat agataggcgg tgctaccatc taacagcttc tgaaattcct ttgtcatatt
    48961 tttgagtccc cactaataac ccacaaagca gaataaatac cagttgctca tgtacaataa
    49021 tcactcaact gctgtcttgt agcatacatt aattaagcac attctttgaa taattactgt
    49081 gtccaaacaa tcacacttta aaatctcaca cttgtgctat cccttgccct tctgaatgtc
    49141 actctgtatt ttaaatgaag agatgagggt tgaatttcct gtgttactta ttgttcattt
    49201 ctcgatgagg agttttcaca ttcaccttta ctggaaaaca cataagtaca catcttacag
    49261 gaaaaatata ccaaactgac atgtagcatg aatgcttgtg catgtagtca tataaaatct
    49321 tgtagcaatg taaacattct ctgatataca catacagatg tgtctatatg tctacacaat
    49381 ttcttatgct ccatgaacaa acattccatg cacacataag aacacacact gttacagatg
    49441 catacttgag tgcattgaca aaattacccc agtcaatcta gagaatttgg atttctgcat
    49501 ttgactctgt tagctttgta catgctgttc atttactctg ggtgatgtct ttccctcatt
    49561 ttgccttgtc tatcttgtac tcatacttta agtcctaact tatatgttat ctcaactaag
    49621 aagctatttt tttttaattt taactgggct taaagccctg tctataaact ctgctacaat
    49681 tatgggctct ttcttataat atttagtgtt tttcctacta atgtacttaa tctgctcatt
    49741 gtatattcct accactaaat tttaacctct tttatggtag agacattgtc ttgtaaactc
    49801 ttatttccct agtatttgga gatgaaaaaa aagattaaat tatccaaaat tagatctctc
    49861 ttttctacat tatgagtatt acactatcca tagggaagtt tgtttgagac ctaaactgag
    49921 gaacctttgg ttctaaaatg actatgtgat atcttagtat ttataggtca tgaggttcct
    49981 tcctctgcct ctgctatagt ttgattagtc agcaagcatg tgtcatgcat ttattcacat
    50041 cagaatttca tacactaata agacatagta tcagaagtca gtttattagt tatatcagtt
    50101 agggtccatc aaggaaagga caaaccatta tcagttactc aacctagaat taaatacagc
    50161 tcttaatagt taattatcct tgtattggaa gagctaaaat atcaaataaa ggacagtgca
    50221 gaaatctaga tgttagtaac atcagaaaac ctcttccgcc attaggccta gaagggcaga
    50281 aggagaaaat gtttatacca ccagagtcca gaaccagagc ccataaccag aggtccactg
    50341 gattcagtga gctagtgggt gctccttgga gagagccaga actgtctaat gggggcatca
    50401 aagtatcagc cataaaaaac cataaaaaag actgtctgct gtaggagatc cgttcagaga
    50461 gagagagaga ccagaaataa tcttgcttat gctttccctc agccagtgtt taccattgca
    50521 gaatgtacat gcgactgaaa gggtgaggaa acctgggaaa tgtcagttcc tcaaatacag
    50581 agaacactga gggaaggatg agaaataaat gtgaaagcag acatgaatgg taattgacag
    50641 aaggaaacta ggatgtgtcc agtaaatgaa taattacagt gtgcagtgat tattgcaatg
    50701 attaatgtat tgataagata atatgaaaac acagaattca aacagcagtg aactgagatt
    50761 agaattgtgg agagcactgg catttaagaa tgtcacactt agaatgtgtc tctaggcatt
    50821 gttctgtgca tatatcatct caatattcat tatctgaaaa ttatgaatta ggtacaaagc
    50881 tcaaataatt tattttttca ggttagcaag aacttttttt tttttttttt ctgagatgga
    50941 gcattgctat ggttgcccag gctggagtgc aatggcatga tccaggctca ctgcaacatc
    51001 tgcctcccag gttcaagcga ttctcctgcc tcagcctccc aagtagctgg cattacaggc
    51061 atgtgccacc accatgcctg gctaattttc tatttttagt agataggggg tttcaccatg
    51121 ttggtcaggc tgatctcgaa ctcctaacat caggtgatcc accctcctcg gcctctgaat
    51181 gtactgggat cacaggcgtg agccaccaca cccagccaag aatgtgaatt ttgtagaagg
    51241 atataaccca tatttctctg accctagagt ccttagtata cctcccatac catgtggctc
    51301 atcctcctta catacatttc ccatctttca ccctaccttt tcctttttgt ttcagctttt
    51361 cactgtgtgt caaaatctag aaccttatct cctacctgct ctgaaaccaa cagcaagttg
    51421 acttccattc taacccacat tggcattaca ctaattaaaa tcgatactga gttctaaaat
    51481 catctgggat tttggggact atgtcttact tcatacttcc ttgagatttc acattaaatg
    51541 ttggtgttca ttaaaggtcc ttcatttaac tttgtattca tcacactctt ggattcacag
    51601 ttatatctaa actcttatat atagcctgta taatcccaat tcccaagtct gatttctaac
    51661 ctctgacctc caacctcagt gccaaaccca tatatcaaac aatgtactgg gcttatttat
    51721 atagatgtcc tataggcacc tcagactcag catgggtatt tcacttgtta tactaaaact
    51781 gtttctcttc cagtgttttc cattttagtc attagatagc tacttgccca ttcaccaagg
    51841 tcacagatta aaatcatttc cctacctcta atcaacagtt caattctgct tcaatttgtc
    51901 cctatctatt aatcaccact cttactgccc agtcaggtcc tcattgtttc ctgaacaaga
    51961 gtagatgcta ttctttccac tttaagacct tatcctggct ggatgcggtg gctcaggctt
    52021 gtaaacccag cactttggga ggccgaggca ggcagatcac ttgaggtcag gagttcaaga
    52081 ccagcctgac caacatggtg aaaccccatc tctactaaaa atacaaaatc agccgggcgt
    52141 gtggtgcatg cctgcagtcc cagctattca ggtggctgag gcaggagaat tgcttgaacc
    52201 caggaggcgg aggttgcggt gagcctagat tgcaccattg cactctagct tgggcaatag
    52261 ggatgaaact ccatctcaga agagaaaaga aaaaaagacc ttattctgtt acacaaatcc
    52321 tctcaatgca atccatatag aataaacatg taaccagatc tcccaatgtg taaaatcatt
    52381 tcaggtagaa cagaattaaa gtgaaaagcc aagtctttgg aattaacaga caaagttcaa
    52441 ataacagtcc tcatggcctt aagaatttac ctaacatttt ttttagaatc aattttctta
    52501 tatatgaatt ggaaacataa ttcctccctc acaaacacat tctaagattt taaggagata
    52561 ttgatgaagt acatcatctg tcatttttaa cagttagtgg tagtgattca cacagcacat
    52621 tatgatctgt tcttgtatgt tctgttccat tctgtattct tgacctggtt gtattctttc
    52681 tgagctccag atccacatat ctaagtacat ctttttgcat tttacaagag tgcatacaat
    52741 acaatgtatc caagactgta tttctgattt tatcgtacca ctaaactcac aaatgtggcc
    52801 ctattcttgt gttcacgact gacatcaccg tcatggtcca agtctgataa tagaaatggc
    52861 attgtcactt tcttccctac tgcaacagaa gcccagctat ttgtctccca ttttctctac
    52921 ttctaaaata catttcttca ctaagtgaga ataatctttt aaagacacaa atcaaaccat
    52981 gccaccacct ttcttgaatt attcaatatc tttcgttggc ttccaggtta cagaaaaata
    53041 acttgtaaca aagtttaaag gtcattcatg gctcctctct accctatttt ataacatttc
    53101 cccttgtgat cagaatctca ggcacatcat ccatctttct atatacaaat aaagtcatat
    53161 agtttgaact cacctctggt tacttttaat caaccaaatg ctgtaaaatg catttgtatc
    53221 gctacgtgtt aagcagtagt tgattctttt catttcttgt taatattcta ttctttgact
    53281 ataccgtaat ttatcaattc tactgttggt aagcatttaa gtggctaccg gtttgaggtt
    53341 tttatgatta ttgctgtcat aagcatttct atacatgtct ttggatacac acatgcatgt
    53401 gtttctgaat atctaaaaat gtaattgcta ggtaatagac ttatcaagca tccagcattt
    53461 gtggatacta ttaaaggttt tccaaagggg ttatactatt gtacagtgtc accaacagag
    53521 tttgagtttc tattgatcca tatcaccacc aaaatttgaa ctgtcagtct tatctcttct
    53581 cttgtctctt ttttcctctt ttttttcctt cccttcccct ctcttcgttt cttttctctc
    53641 ctcttctctt ctttcctctc ttcccttccc tttctctttc tcttccctat cccttctcct
    53701 ctcctctccc ctcctttttt ctcctctcct ctccattatt tatttttcct tcttctcctc
    53761 catcccttcc atcctctctc ttcccctctt ccttccttcc tttctccatt tcttcctcct
    53821 ctttccctca atccttcctt ttggatatgc tcatgggtgt gtatttgtct gccattgtgg
    53881 cattatttga attcagaaaa gagtgaaaaa ctactgggat cttcattctg ggtctaattc
    53941 cacatttttt tttaagaaca cactctgtaa aaatgttctg tactagcata ttcccaggaa
    54001 cttcgttaaa tttaatctgg ctgaatatgg taaatctact ttgcactttg cattctttct
    54061 ttagtcatac cataatttta aacattcaaa atatttgtat ataatatttg attttatctg
    54121 tcattaaaat gttaacctta aaattcatgt ttccagaacc tatttcaata actggtaaat
    54181 aaacactatt cattttttaa atattctttt aatggatatt tatttcaata taataaaaaa
    54241 ttagagtttt attataggaa gaatttacca aaagaaggag gaagcaagca agtttaaact
    54301 gcagcaatag ttgtccattc caacctctca aaattccctt ggagacaaaa tctctagagg
    54361 caaagaagaa ctttatattg agtcaacttg ttaaaacatc tgcttttaga taagttttct
    54421 tagtataaag tgacagaaac aaataagtta aactctaaga tacattccac tatattagcc
    54481 taaaacactt ctgcaaaaat gaaactagga ggatattttt agaaacaact gctgaaagag
    54541 atgcggtggg gagatatgca gaggagaaca gggtttctga gtcaagacac acatgacaga
    54601 acagccaatc tcagggcaag ttaagggaat agtggaatga aggttcattt ttcattctca
    54661 caaactaatg aaaccctgct tatcttaaac caacctgctc actggagcag ggaggacagg
    54721 accagcataa aaggcagggc agagtcgact gttgcttaca ctttcttctg acataacagt
    54781 gttcactagc aacctcaaac agacaccatg gtgcatctga ctcctgagga gaagactgct
    54841 gtcaatgccc tgtggggcaa agtgaacgtg gatgcagttg gtggtgaggc cctgggcagg
    54901 ttggtatcaa ggttataaga gaggctcaag gaggcaaatg gaaactgggc atgtgtagac
    54961 agagaagact cttgggtttc tgataggcac tgactctctg tcccttgggc tgttttccta
    55021 ccctcagatt actggtggtc tacccttgga cccagaggtt ctttgagtcc tttggggatc
    55081 tgtcctctcc tgatgctgtt atgggcaacc ctaaggtgaa ggctcatggc aagaaggtgc
    55141 taggtgcctt tagtgatggc ctggctcacc tggacaacct caagggcact ttttctcagc
    55201 tgagtgagct gcactgtgac aagctgcacg tggatcctga gaacttcagg gtgagtccag
    55261 gagatgcttc acttttctct ttttactttc taatcttaca ttttggttct tttacctacc
    55321 tgctcttctc ccacattttt gtcattttac tatattttat catttaatgc ttctaaaatt
    55381 ttgttaattt tttatttaaa tattctgcat tttttccttc ctcacaatct tgctatttta
    55441 aattatttaa tatcctgtct ttctctccca accccctccc ttcatttttc cttctctaac
    55501 aacaactcaa attatgcata ccagctctca cctgctaatt ctgcacttag aataatcctt
    55561 ttgtctctcc acatgggtat gggagaggct ccaactcaaa gatgagaggc atagaatact
    55621 gttttagagg ctataaatca ttttacaata aggaataatt ggaattttat aaattctgta
    55681 gtaaatggaa tggaaaggaa agtgaatatt tgattatgaa agactaggca gttacactgg
    55741 aggtggggca gaagtcgttg ctaggagaca gcccatcatc acactgatta atcaattaat
    55801 ttgtatctat taatctgttt atagtaatta atttgtatat gctatataca catacaaaat
    55861 taaaactaat ttggaattaa tttgtatata gtattataca gcatatatag catatatgta
    55921 catatataga ctacatgcta gttaagtaca tagaggatgt gtgtgtatag atatatgtta
    55981 tatgtatgca ttcatatatg tacttattta tgctgatggg aataacctgg ggatcagttt
    56041 tgtctaagat ttgggcagaa aaaaatgggt gttggctcag tttctcagaa gccagtcttt
    56101 atttctctgt taaccatatg catgtatctg cctacctctt ctccgcagct cttgggcaat
    56161 gtgctggtgt gtgtgctggc ccgcaacttt ggcaaggaat tcaccccaca aatgcaggct
    56221 gcctatcaga aggtggtggc tggtgtggct aatgccctgg ctcacaagta ccattgagat
    56281 cctggactgt ttcctgataa ccataagaag accctatttc cctagattct attttctgaa
    56341 cttgggaaca caatgcctac ttcaagggta tggcttctgc ctaataaaga atgttcagct
    56401 caacttcctg attaatttca cttatttcat ttttttgtcc aggtgtgtaa gaaggttcct
    56461 gaggctctac agatagggag cacttgttta ttttacaaag agtacatggg aaaagagaaa
    56521 agcaagggaa ccgtacaagg cattaatggg tgacacttct acctccaaag agcagaaatt
    56581 atcaagaact cttgatacaa agataatact ggcactgcag aggttctagg gaagacctca
    56641 accctaagac atagcctcaa gggtaatagc tacgattaaa ctccaacaat tactgagaaa
    56701 ataatgtgct caattaaagg cataatgatt actcaagaca atgttatgtt gtctttcttc
    56761 ctccttcctt tgcctgcaca ttgtagccca taatactata ccccatcaag tgttcctgct
    56821 ccaagaaata gcttcctcct cttacttgcc ccagaacatc tctgtaaaga atttcctctt
    56881 atcttcccat atttcagtca agattcattg ctcacgtatt acttgtgacc tctcttgacc
    56941 ccagccacaa taaacttctc tatactaccc aaaaaatctt tccaaaccct ccccgacacc
    57001 atatttttat atttttctta tttatttcat gcacacacac acactccgtg ctttataagc
    57061 aattctgcct attctctacc ttcttacaat gcctactgtg cctcatatta aattcatcaa
    57121 tgggcagaaa gaaaatattt attcaagaaa acagtgaatg aatgaacgaa tgagtaaatg
    57181 agtaaatgaa ggaatgatta ttccttgctt tagaacttct ggaattagag gacaatatta
    57241 ataataccat cgcacagtgt ttctttgttg ttaatgctac aacatacaaa gaggaagcat
    57301 gcagtaaaca accgaacagt tatttccttt ctgatcatag gagtaatatt tttttccttg
    57361 agcacatttt tgccataggt aaaattagaa ggatttttag aactttctca gttgtataca
    57421 tttttaaaaa tctgtattat atgcatgttg attaatttta aacttacttg aatacctaaa
    57481 cagaatctgt tgtttccttg tgtttgaaag tgctttcaca gtaactctgt ctgtactgcc
    57541 agaatatact gacaatgtgt tatagttaac tgttttgatc acaacatttt gaattgactg
    57601 gcagcagaag ctctttttat atccatgtgt tttccttaag tcattataca tagtaggcat
    57661 gagactcttt atactgaata agatatttag gaaccactgg tttacatatc agaagcagag
    57721 ctactcaggg cattttgggg aagatcactt tcacattcct gagcataggg aagttctcat
    57781 aagagtaaga tattaaaagg agatacttgt gtggtattcg aaagacagta agagagattg
    57841 tagaccttat gatcttgata gggaaaacaa actacattcc tttctccaaa agtcaaaaaa
    57901 aaagagcaaa tatagcttac tataccttct attcctacac cattagaagt agtcagtgag
    57961 tctaggcaag atgttggccc taaaaatcca aataccagag aattcatgag aacatcacct
    58021 ggatgggaca tgtgccgagc aacacaatta ctatatgcta ggcattgcta tcttcatatt
    58081 gaagatgagg aggtcaagag atgaaaaaag acttggcacc ttgttgttat attaaaatta
    58141 tttgttagag tagagctttt gtaagagtct aggagtgtgg gagctaaatg atgatacaca
    58201 tggacacaaa gaatagatca acagacaccc aggcctactt gagggttgag ggtgggaaga
    58261 gggagacgat gaaaaagaac ctattgggta ttaagttcat cactgagtga tgaaataatc
    58321 tgtacatcaa gacccagtga tatgcaattt acctatataa cttgtacatg tacccccaaa
    58381 tttaaaataa agttaaaaca aagtatagga atggaattaa ttcctcaaga tttggcttta
    58441 attttatttg ataatttatc aaatggttgt ttttcttttc tcactatggc gttgctttat
    58501 aaactatgtt cagtatgtct gaatgaaagg gtgtgtgtgt gtgtgaaaga gagggagaga
    58561 ggaagggaag agaggacgta ataatgtgaa tttgagttca tgaaaatttt tcaataaaat
    58621 aatttaatgt caggagaatt aagcctaata gtctcctaaa tcatccatct cttgagcttc
    58681 agagcagtcc tctgaattaa tgcctacatg tttgtaaagg gtgttcagac tgaagccaag
    58741 attctacctc taaagagatg caatctcaaa tttatctgaa gactgtacct ctgctctcca
    58801 taaattgaca ccatggccca cttaatgagg ttaaaaaaaa gctaattctg aatgaaaatc
    58861 tgagcccagt ggaggaaata ttaatgaaca aggtgcagac tgaaatataa attttctgta
    58921 ataattatgc atatacttta gcaaagttct gtctatgttg actttattgc ttttggtaag
    58981 aaatacaact ttttaaagtg aactaaacta tcctatttcc aaactatttt gtgtgtgtgc
    59041 ggtttgtttc tatgggttct ggttttcttg gagcattttt atttcatttt aattaattaa
    59101 ttctgagagc tgctgagttg tgtttactga gagattgtgt atctgcgaga gaagtctgta
    59161 gcaagtagct agactgtgct tgacctagga acatatacag tagattgcta aaatgtctca
    59221 cttggggaat tttagactaa acagtagagc atgtataaaa atactctagt caagtgctgc
    59281 ttttgaaaca aatgataaaa ccacactccc atagatgagt gtcatgattt tcatggagga
    59341 agttaatatt catcctctaa gtatacccag actagggcca ttctgatata aaacattagg
    59401 acttaagaaa gattaataga ctggagtaaa ggaaatggac ctctgtctct ctcgctgtct
    59461 cttttttgag gacttgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgttgt ggtcagtggg
    59521 gctggaataa aagtagaata gacctgcacc tgctgtggca tccattcaca gagtagaagc
    59581 aagctcacaa tagtgaagat gtcagtaagc ttgaatagtt tttcaggaac tttgaatgct
    59641 gatttagatt tgaaactgag gctctgacca taaccaaatt tgcactattt attgcttctt
    59701 gaaacttatt tgcctggtat gcctgggctt ttgatggtct tagtatagct tgcagccttg
    59761 tccctgcagg gtattatggg taatagaaag aaaagtctgc gttacactct agtcacacta
    59821 agtaactacc attggaaaag caacccctgc cttgaagcca ggatgatggt atctgcagca
    59881 gttgccaaca caagagaagg atccatagtt catcatttaa aaaagaaaac aaaatagaaa
    59941 aaggaaaact atttctgagc ataagaagtt gtagggtaag tctttaagaa ggtgacaatt
    60001 tctgccaatc aggatttcaa agctcttgct ttgacaattt tggtctttca gaatactata
    60061 aatataacct atattataat ttcataaagt ctgtgcattt tctttgaccc aggatatttg
    60121 caaaagacat attcaaactt ccgcagaaca ctttatttca catatacatg cctcttatat
    60181 cagggatgtg aaacagggtc ttgaaaactg tctaaatcta aaacaatgct aatgcaggtt
    60241 taaatttaat aaaataaaat ccaaaatcta acagccaagt caaatctgta tgttttaaca
    60301 tttaaaatat tttaaagacg tcttttccca ggattcaaca tgtgaaatct tttctcaggg
    60361 atacacgtgt gcctagatcc tcattgcttt agttttttac agaggaatga atataaaaag
    60421 aaaatactta aattttatcc ctcttacctc tataatcata cataggcata attttttaac
    60481 ctaggctcca gatagccata gaagaaccaa acactttctg cgtgtgtgag aataatcaga
    60541 gtgagatttt ttcacaagta cctgatgagg gttgagacag gtagaaaaag tgagagatct
    60601 ctatttattt agcaataata gagaaagcat ttaagagaat aaagcaatgg aaataagaaa
    60661 tttgtaaatt tccttctgat aactagaaat agaggatcca gtttcttttg gttaacctaa
    60721 attttatttc attttattgt tttattttat tttattttat tttattttgt gtaatcgtag
    60781 tttcagagtg ttagagctga aaggaagaag taggagaaac atgcaaagta aaagtataac
    60841 actttcctta ctaaaccgac tgggtttcca ggtaggggca ggattcagga tgactgacag
    60901 ggcccttagg gaacactgag accctacgct gacctcataa atgcttgcta cctttgctgt
    60961 tttaattaca tcttttaata gcaggaagca gaactctgca cttcaaaagt ttttcctcac
    61021 ctgaggagtt aatttagtac aaggggaaaa agtacagggg gatgggagaa aggcgatcac
    61081 gttgggaagc tatagagaaa gaagagtaaa ttttagtaaa ggaggtttaa acaaacaaaa
    61141 tataaagaga aataggaact tgaatcaagg aaatgatttt aaaacgcagt attcttagtg
    61201 gactagagga aaaaaataat ctgagccaag tagaagacct tttcccctcc tacccctact
    61261 ttctaagtca cagaggcttt ttgttccccc agacactctt gcagattagt ccaggcagaa
    61321 acagttagat gtccccagtt aacctcctat ttgacaccac tgattacccc attgatagtc
    61381 acactttggg ttgtaagtga ctttttattt atttgtattt ttgactgcat taagaggtct
    61441 ctagtttttt atctcttgtt tcccaaaacc taataagtaa ctaatgcaca gagcacattg
    61501 atttgtattt attctatttt tagacataat ttattagcat gcatgagcaa attaagaaaa
    61561 acaacaacaa atgaatgcat atatatgtat atgtatgtgt gtatatatac acatatatat
    61621 atatattttt tttcttttct taccagaagg ttttaatcca aataaggaga agatatgctt
    61681 agaactgagg tagagttttc atccattctg tcctgtaagt attttgcata ttctggagac
    61741 gcaggaagag atccatctac atatcccaaa gctgaattat ggtagacaaa gctcttccac
    61801 ttttagtgca tcaatttctt atttgtgtaa taagaaaatt gggaaaacga tcttcaatat
    61861 gcttaccaag ctgtgattcc aaatattacg taaatacact tgcaaaggag gatgttttta
    61921 gtagcaattt gtactgatgg tatggggcca agagatatat cttagaggga gggctgaggg
    61981 tttgaagtcc aactcctaag ccagtgccag aagagccaag gacaggtacg gctgtcatca
    62041 cttagacctc accctgtgga gccacaccct agggttggcc aatctactcc caggagcagg
    62101 gagggcagga gccagggctg ggcataaaag tcagggcaga gccatctatt gcttacattt
    62161 gcttctgaca caactgtgtt cactagcaac ctcaaacaga caccatggtg cacctgactc
    62221 ctgaggagaa gtctgccgtt actgccctgt ggggcaaggt gaacgtggat gaagttggtg
    62281 gtgaggccct gggcaggttg gtatcaaggt tacaagacag gtttaaggag accaatagaa
    62341 actgggcatg tggagacaga gaagactctt gggtttctga taggcactga ctctctctgc
    62401 ctattggtct attttcccac ccttaggctg ctggtggtct acccttggac ccagaggttc
    62461 tttgagtcct ttggggatct gtccactcct gatgctgtta tgggcaaccc taaggtgaag
    62521 gctcatggca agaaagtgct cggtgccttt agtgatggcc tggctcacct ggacaacctc
    62581 aagggcacct ttgccacact gagtgagctg cactgtgaca agctgcacgt ggatcctgag
    62641 aacttcaggg tgagtctatg ggacccttga tgttttcttt ccccttcttt tctatggtta
    62701 agttcatgtc ataggaaggg gagaagtaac agggtacagt ttagaatggg aaacagacga
    62761 atgattgcat cagtgtggaa gtctcaggat cgttttagtt tcttttattt gctgttcata
    62821 acaattgttt tcttttgttt aattcttgct ttcttttttt ttcttctccg caatttttac
    62881 tattatactt aatgccttaa cattgtgtat aacaaaagga aatatctctg agatacatta
    62941 agtaacttaa aaaaaaactt tacacagtct gcctagtaca ttactatttg gaatatatgt
    63001 gtgcttattt gcatattcat aatctcccta ctttattttc ttttattttt aattgataca
    63061 taatcattat acatatttat gggttaaagt gtaatgtttt aatatgtgta cacatattga
    63121 ccaaatcagg gtaattttgc atttgtaatt ttaaaaaatg ctttcttctt ttaatatact
    63181 tttttgttta tcttatttct aatactttcc ctaatctctt tctttcaggg caataatgat
    63241 acaatgtatc atgcctcttt gcaccattct aaagaataac agtgataatt tctgggttaa
    63301 ggcaatagca atatttctgc atataaatat ttctgcatat aaattgtaac tgatgtaaga
    63361 ggtttcatat tgctaatagc agctacaatc cagctaccat tctgctttta ttttatggtt
    63421 gggataaggc tggattattc tgagtccaag ctaggccctt ttgctaatca tgttcatacc
    63481 tcttatcttc ctcccacagc tcctgggcaa cgtgctggtc tgtgtgctgg cccatcactt
    63541 tggcaaagaa ttcaccccac cagtgcaggc tgcctatcag aaagtggtgg ctggtgtggc
    63601 taatgccctg gcccacaagt atcactaagc tcgctttctt gctgtccaat ttctattaaa
    63661 ggttcctttg ttccctaagt ccaactacta aactggggga tattatgaag ggccttgagc
    63721 atctggattc tgcctaataa aaaacattta ttttcattgc aatgatgtat ttaaattatt
    63781 tctgaatatt ttactaaaaa gggaatgtgg gaggtcagtg catttaaaac ataaagaaat
    63841 gaagagctag ttcaaacctt gggaaaatac actatatctt aaactccatg aaagaaggtg
    63901 aggctgcaaa cagctaatgc acattggcaa cagccctgat gcctatgcct tattcatccc
    63961 tcagaaaagg attcaagtag aggcttgatt tggaggttaa agttttgcta tgctgtattt
    64021 tacattactt attgttttag ctgtcctcat gaatgtcttt tcactaccca tttgcttatc
    64081 ctgcatctct cagccttgac tccactcagt tctcttgctt agagatacca cctttcccct
    64141 gaagtgttcc ttccatgttt tacggcgaga tggtttctcc tcgcctggcc actcagcctt
    64201 agttgtctct gttgtcttat agaggtctac ttgaagaagg aaaaacaggg ggcatggttt
    64261 gactgtcctg tgagcccttc ttccctgcct cccccactca cagtgacccg gaatctgcag
    64321 tgctagtctc ccggaactat cactctttca cagtctgctt tggaaggact gggcttagta
    64381 tgaaaagtta ggactgagaa gaatttgaaa gggggctttt tgtagcttga tattcactac
    64441 tgtcttatta ccctatcata ggcccacccc aaatggaagt cccattcttc ctcaggatgt
    64501 ttaagattag cattcaggaa gagatcagag gtctgctggc tcccttatca tgtcccttat
    64561 ggtgcttctg gctctgcagt tattagcata gtgttaccat caaccacctt aacttcattt
    64621 ttcttattca atacctaggt aggtagatgc tagattctgg aaataaaata tgagtctcaa
    64681 gtggtccttg tcctctctcc cagtcaaatt ctgaatctag ttggcaagat tctgaaatca
    64741 aggcatataa tcagtaataa gtgatgatag aagggtatat agaagaattt tattatatga
    64801 gagggtgaaa cctaaaatga aatgaaatca gacccttgtc ttacaccata aacaaaaata
    64861 aatttgaatg ggttaaagaa ttaaactaag acctaaaacc ataaaaattt ttaaagaaat
    64921 caaaagaaga aaattctaat attcatgttg cagccgtttt ttgaatttga tatgagaagc
    64981 aaaggcaaca aaaggaaaaa taaagaagtg aggctacatc aaactaaaaa atttccacac
    65041 aaaaaagaaa acaatgaaca aatgaaaggt gaaccatgaa atggcatatt tgcaaaccaa
    65101 atatttctta aatattttgg ttaatatcca aaatatataa gaaacacaga tgattcaata
    65161 acaaacaaaa aattaaaaat aggaaaataa aaaaattaaa aagaagaaaa tcctgccatt
    65221 tatgcgagaa ttgatgaacc tggaggatgt aaaactaaga aaaataagcc tgacacaaaa
    65281 agacaaatac tacacaacct tgctcatatg tgaaacataa aaaagtcact ctcatggaaa
    65341 cagacagtag aggtatggtt tccaggggtt gggggtggga gaatcaggaa actattactc
    65401 aaagggtata aaatttcagt tatgtgggat gaataaattc tagatatcta atgtacagca
    65461 tcgtgactgt agttaattgt actgtaagta tatttaaaat ttgcaaagag agtagatttt
    65521 tttgtttttt tagatggagt tttgctcttg ttgtccaggc tggagtgcaa tggcaagatc
    65581 ttggctcact gcaacctccg cctcctgggt tcaagcaaat ctcctgcctc agcctcccga
    65641 gtagctggga ttacaggcat gcgacaccat gcccagctaa ttttgtattt ttagtagaga
    65701 cggggtttct ccatgttggt caggctgatc cgcctcctcg gccaccaaag ggctgggatt
    65761 acaggcgtga ccaccgggcc tggccgagag tagatcttaa aagcatttac cacaagaaaa
    65821 aggtaactat gtgagataat gggtatgtta attagcttga ttgtggtaat catttcacaa
    65881 ggtatacata tattaaaaca tcatgttgta caccttaaat atatacaatt tttatttgtg
    65941 aatgatacct caataaagtt gaagaataat aaaaaagaat agacatcaca tgaattaaaa
    66001 aactaaaaaa taaaaaaatg catcttgatg attagaattg cattcttgat ttttcagata
    66061 caaatatcca tttgactgtt tactcttttc caaaacaata caataaattt tagcacttta
    66121 tcttcatttt ccccttccca atctataatt ttatatatat atattttaga tattttgtat
    66181 agttttactc cctagatttt ctagtgttat tattaaatag tgaagaaatg tttacactta
    66241 tgtacaaaat gttttgcatg cttttcttca tttctaacat tctctctaag tttattctat
    66301 tttttcctga ttatccttaa tattatctct ttctgctgga aatatattgt tacttttggt
    66361 ttatctaaaa atggcttcat tttcttcatt ctaaaatcat gttaaattaa taccactcat
    66421 gtgtaagtaa gatagtggaa taaatagaaa tccaaaaact aaatctcaca aaatataata
    66481 atgtgatata taaaaatata gcttttaaat ttagcttgga aataaaaaac aaacagtaat
    66541 tgaacaacta tactttttga aaagagtaaa gtgaaatgct taactgcata taccacaatc
    66601 gattacacaa ttaggtgtga aggtaaaatt cagtcacgaa aaaactagaa taaaaatatg
    66661 ggaagacatg tatataatct tagagataac agtgttattt aattatcaac ccaaagtaga
    66721 aactatcaag ggagaaataa attcagtcaa caataaaagc atttaagaag ttattctagg
    66781 ctgggagcgg tggctcacac ctgcaattgc agcactttgg gaggcctaga caggcggatc
    66841 acgacgtcag gagttcaaga tcagcctggc caacatagtg aaacctcatc gctactaaaa
    66901 atataaaaac ttagcctggc gtggtggcag gcatgtgtaa tcccagcaat ttgggaggct
    66961 gaggcaggag aatcgcttga tcctgggagg cagaggttgc agtgagccaa gattgtgcca
    67021 ctgcattcca gcccaggtga cagcatgaga ctccgtcaca aaaaaaaaag aaaaaaaagg
    67081 gggggggggg cggtggagcc aagatgaccg aataggaaca gctccagtct atagctccca
    67141 tcgtgagtga cgcagaagac gggtgatttc tgcatttcca actgaggtac caggttcatc
    67201 tcacagggaa gtgccaggca gtgggtgcag gacagtagtg cagtgcactg tgcatgagcc
    67261 gaagcagggc gaggcatcac ctcacccggg aagcacaagg ggtcagggaa ttccctttcc
    67321 tagtcaaaga aaagggtgac agatggcacc tggaaaatcg ggtcactccc gccctaatac
    67381 tgcgctcttc caacaagctt aacaaatggc acaccaggag attatatccc atgcctggct
    67441 cagagggtcc tacgcccatg gagcctcgct cattgctagc acagcagtct gaggtcaaac
    67501 tgcaaggtgg cagtgaggct gggggagggg tgcccaccat tgtccaggct tgagcaggta
    67561 aacaaagccg cctggaagct cgaactgggt ggagcccacc acagctcaag gaggcctgcc
    67621 tgcctctgta ggctccacct ctaggggcag ggcacagaca aacaaaagac aacaagaacc
    67681 tctgcagact taaatgtccc tgtctgacag ctttgaagag agtagtggtt ctcccagcac
    67741 atagcttcag atctgagaac aggcagactg cctcctcaag tgggtccctg acccccgagt
    67801 agcctaactg ggaggcatcc cccagtaggg cggactgaca cctcacatgg ctggtactcc
    67861 tctaagacaa aacttccaga ggaatgatca ggcagcagca tttgcggttc accaatatcc
    67921 actgttctgc agccaccgct gctgataccc aggaaaacag catctggagt ggacctccag
    67981 taaactccaa cagacctgca gctgagggtc ctgactgtta gaaggaaaac taacaaacag
    68041 aaaggacatc cacaccaaaa acccatctgt acatcaccat catcaaagac caaaggtaga
    68101 taaaaccata aagatgggga aaaagcagag cagaaaaact ggacactcta aaaatgagag
    68161 tgcctctcct tctccaaagt aacgcagctc ctcaccagca atggaacaaa gctgggcaga
    68221 gaatgacttt gacgagttga gagaggaagg cttcagaaga tcaaactact ccaagctaaa
    68281 ggaggaagtt cgaacaaacg gcaaagaagt aaaaaacttt gaaaaaaaat tagatgaatg
    68341 gataactaga ataaccaatg cacagaagtc cttaaaggac ctgatggagc tgaaaaccaa
    68401 ggcaggagaa ctacgtgaca aatacacaag cctcagtaac cgatgagatc aactggaaga
    68461 aagggtatca atgacggaag atgaaatgaa tgaaatgaag catgaagaga agtttagaga
    68521 aaaaagaata aaaagaaacg aacaaagcct ccaagaaata tgggactatg tgaaaagacc
    68581 aaatctacat ctaattggtg tagctgaaag tgatggggag aatggaacca agttggaaaa
    68641 cactctgcag gatattatcc aggagaactt ccccaatcta gcaaggcagc ccaaattcac
    68701 attcaggaaa tacagagaac gccacaaaga tactcctaga gaaaagcaac tccaagacac
    68761 ataactgaca gattcaccaa agttgaaatg aaggaaaaaa tgttaagggc agccagagag
    68821 aaaggtcggg ttacccacaa agggaagccc atcagactaa cagctgatct atcggcagaa
    68881 actctacaag ccagaagaaa gtgggggcca atattcaaca ttgttaaaga aaagaatttt
    68941 cggcccagaa tttcatatcc agccaaacta agcttcataa gcattggaga aataaaatcc
    69001 tttacagaca agcaaatgct gagagatttt gtcaccacca ggcctgccct acaagagctc
    69061 ctgaaggaag cactaaacat ggaaaggaac aactagtatc agccactgca aaaacatgcc
    69121 aaattgtaaa cgaccatcaa ggctaggaag aaactgcatc aaggagcaaa ataaccagct
    69181 aacatcataa tgacaggatc aaattcatac ataacaatac tcaccttaaa tgtaaatagg
    69241 ctaaatgctc caattaaaag acacagactg gcaaattgga taaggagtca agacccatct
    69301 gtcgttatgt attcaggaaa cccatctcac gtgcagagac acacataggc tcgaaataaa
    69361 aggatggagg aatatctacc aagcaaatgg aaaacaaaaa aaggcagggg ttgcaatcct
    69421 agtctctgat aaaacagatt ttaaaccaac aaagatcaaa agagacaaag aaggccatta
    69481 cataatggca aagggatcta ttcaagaaga agaactaact atactaaata tatatgcacc
    69541 caatacagga gcacccagat tcataaaaca agtcctgagt gacctacaaa gagacttaga
    69601 tgcccacaca ataataatgg gagactttaa caccccactg tcaacattag acagatcaac
    69661 gagacagaaa gttaacaagg atatccagga attggactca gctctgcacc aagcagacct
    69721 aatagacatc tacagaactc tccaccccaa atcaacagaa tatacattct tttcagcacc
    69781 acaccacacc tattccaaaa ctgaccacat agttggaagt aaagctctcc tcagcaaatg
    69841 taaaagaaca gaaactataa caaactgtct ctcagaccac agtgcaatca aactagaact
    69901 caggattaag aaactcactc aaaaccactc agctacatgg aaactgaaca gcctgctcct
    69961 gaatgactac tgggtacata acaaaatgaa ggcagaaata aagatgttct ttgaaacaac
    70021 gagaacaaag acacaacaca ccagaatctc tgagacacat tcaaagcagt gtgtagaggg
    70081 aaatttatag cactaaatgc ccacaaggga aagcaggaaa gatctaaaat tgacacccta
    70141 acatcacaat taaaaaacta gagaagcagg agcaaacaca ttcaaaagct aacagaagac
    70201 aagaaataac taagatcaga gcagaagtga agaagataga gacacaaaaa acccttcaaa
    70261 aaaatcaatg aatccagaag ctgttttttt gaaaagatca acaaaattga tagactgcta
    70321 gcaagactaa taaagaagaa aggggagaag aatcaaatag acgcaataaa aaatgacacg
    70381 gggtatcacc actgatccca cagaaataca aactaccgtc agagaatact ataaacacct
    70441 ctacgcaaat aaactagaaa atctagaaga aatggataaa ttcctcgaca catacactct
    70501 gccaagacta aaccaggaag aagttgtatc tctgaataga ccaataacag gctctgaaat
    70561 tgaggcaata attaatagct tatcaaccaa aaaaagtccg ggaccagtag gattcatagc
    70621 cgaattctac cagaggtaca aggaggagct ggtaccattc cttctgaaac tattccaatc
    70681 aatagaaaaa gagggaatcc tccctaactc attttatgag gccagcatca tcctgatacc
    70741 aaagcctgac agagacacaa caaaaaaaga gaatgttaca ccaatatcct tgatgaacat
    70801 cgatgcaaaa atcctcaata aaatactggc aaactgaatc cagcagcaca tcaaaaagct
    70861 tatcctccat gatcaagtgg gcttcatccc tgccatgcaa ggctggttca acatacgaaa
    70921 tcaataaaca taatccagca tataaacaga accaaagaca caaaccatat gattatctca
    70981 atagatgcag aaaaggcctt tgacaaaatt caacaatgct tcatgctaaa aactctcaat
    71041 aaattaggta ttgatgggac atatctcaaa ataataagag ctatctatga caaacccaca
    71101 gccaatatca tactgagtgg acaaaaactg gaagcattcc ctttgaaaac tggcacaagg
    71161 cagggatgcc ctctctcacc actcctattc aacatagtgt tggaagttct ggccagggca
    71221 atcaggcagg agaaggaaat aaagggcatt caattaggaa aagaggaagg tgaaattgtc
    71281 cctgtttgca gatgacatga ttgtatatct agaaaacccc attgtctcag cccaaaatct
    71341 ccttaagctg ataagcaact tcagcaaagt ctcaggatat aaaatcagtg tgcaaaaatc
    71401 acaagtattc ctatgcacca ataacagaca aacagagagc caaatcatga gtgaactccc
    71461 attcacaatt gcttcaaaga gaataaaata cctaggaatc caacttacaa gggatgtgaa
    71521 ggacctcttc aaggagaact acaaaccact gctcaatgaa ataaaagagg atacaaacaa
    71581 atggaagaac attccatgct tatgggtagg aagaatcata tcgtgaaaat ggtcatactg
    71641 cccaaggtaa tttatagatt caatgccatc cccatcaagc taccaatgac tttcttcaca
    71701 gaactggaaa aaactacttt aaagttcata tggaatcaaa aaagagccca catcaccaag
    71761 gcaatcctaa gccaaaagaa caaagctgga ggcatcacgc tacctgactt caaactatac
    71821 tacaatgcta cggtaaccaa aacagcatgg tactggtacc aaaacagaga tctagaccaa
    71881 tggaacagaa cagagccctc agaaataatg ccgcatatct acaactatcc gatctttgac
    71941 aaacctgaga gaaacaagca atggggaaag gattccctat ttaataaatg gtgctgggaa
    72001 aactggctag ccatatgtag aaagctgaaa ctggatcctt ccttacacct tatacaaaaa
    72061 ttaattcaag atggattaaa gacttaaaca ttagacctaa aaccataaaa accctagaaa
    72121 aaaacctagg caataccatt caggacatag gcatgggcaa ggacttcatg tctaaaacac
    72181 caaaacgaat ggcaacaaaa gacaaaatgg acaaacggga tctaattaaa ctaaagagct
    72241 tctgcacagc taaagaaact accatcagag tgaacaggca acctacaaaa tgggagaaaa
    72301 tttttgcaat ctactcatct gacaaagggc taatatccag aatctacaat gaactcaaac
    72361 aaatttacaa gaaaaaacaa acaaccccat caaaaagtgg gcaaaggata tgaacagaca
    72421 cttctcaaaa gaagacattt atgtaatcaa aaaacacatg aaaaaatgct catcatcact
    72481 agccatcaga gaaatgcaaa tcaaaaccac aatgagatac catctcacac cagttagaat
    72541 ggcgatcatt aaaaagtcag gaaacaacag gtgctggaga ggatgtggag aaacaggaac
    72601 aacttttaca ctgttggtgg gactgtaaac tagttcaacc attgcggaag tcagtgtggc
    72661 aattcctcag gaatctagaa ctagaaatac catttgaccc agccatccca ttactgggta
    72721 gatacccaaa ggattataaa tcatgctgct ataaagacac atgcacacgt atgtttattg
    72781 cagcactatt cacaatagca aagacttgga accaacccaa atgtccaaca acgatagatt
    72841 ggattaagaa aatgtggcac atatacacca tggaatacta tgcagccata aaaaatgatg
    72901 agttcatgtc ctttgtaggg acatggatga agctggaaac tatcattctc agcaaactat
    72961 cacaaggaca ataaaccaaa caccgcatgt tctcactcat aggtgggaat tgaacaatga
    73021 gaacacatgg acacatgaag aggaacatca cactctgggg actgttatgg ggtggggggc
    73081 aggggcaggg atagcactag gagatatacc taatgctaaa tgacgagtta atgggtgcag
    73141 cacaccaaca tggcacatgt atacatatat aacaaacctg ccgttgtgca catgtaccct
    73201 aaaacttgaa gtataataat aaaaaaaagt tatcctatta aaactgatct cacacatccg
    73261 tagagccatt atcaagtctt tctctttgaa acagacagaa atttagtgtt ttctcagtca
    73321 gttaac
//
© Genebank 1991
www #ad:

↑ © L. Allison, www.allisons.org/ll/   (or as otherwise indicated).
Created with "vi (Linux)",  charset=iso-8859-1,   fetched Friday, 29-Mar-2024 10:55:39 UTC.

Free: Linux, Ubuntu operating-sys, OpenOffice office-suite, The GIMP ~photoshop, Firefox web-browser, FlashBlock flash on/off.