US20030113817A1 - Hemogen-EDAG: novel nuclear factors expressed in hematopoietic development - Google Patents
- Publication number
- US20030113817A1 (application US10/103,140)
- Authority
- US
- United States
- Prior art keywords
- glu
- pro
- edag
- gln
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
- 230000003394 haemopoietic effect Effects 0.000 title abstract description 25
- 238000011161 development Methods 0.000 title abstract description 7
- 102100022191 Hemogen Human genes 0.000 claims abstract description 155
- 101001045553 Homo sapiens Hemogen Proteins 0.000 claims abstract description 84
- 101710186177 Hemogen Proteins 0.000 claims abstract description 81
- 210000004027 cell Anatomy 0.000 claims abstract description 53
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 44
- 230000014509 gene expression Effects 0.000 claims abstract description 38
- 150000007523 nucleic acids Chemical class 0.000 claims description 32
- 102000039446 nucleic acids Human genes 0.000 claims description 30
- 108020004707 nucleic acids Proteins 0.000 claims description 30
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 24
- 229920001184 polypeptide Polymers 0.000 claims description 23
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 23
- 102000004169 proteins and genes Human genes 0.000 claims description 23
- 230000011132 hemopoiesis Effects 0.000 claims description 22
- 239000012634 fragment Substances 0.000 claims description 20
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 17
- 210000003958 hematopoietic stem cell Anatomy 0.000 claims description 17
- 239000000523 sample Substances 0.000 claims description 16
- 239000002773 nucleotide Substances 0.000 claims description 14
- 125000003729 nucleotide group Chemical group 0.000 claims description 14
- 238000000034 method Methods 0.000 claims description 12
- 239000013604 expression vector Substances 0.000 claims description 8
- 238000009396 hybridization Methods 0.000 claims description 8
- 230000001105 regulatory effect Effects 0.000 claims description 8
- 239000013598 vector Substances 0.000 claims description 8
- 230000005856 abnormality Effects 0.000 claims description 4
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 4
- 230000002159 abnormal effect Effects 0.000 claims description 3
- 239000013060 biological fluid Substances 0.000 claims description 3
- 239000013068 control sample Substances 0.000 claims 1
- 241000282414 Homo sapiens Species 0.000 abstract description 38
- 210000001519 tissue Anatomy 0.000 abstract description 27
- 210000000601 blood cell Anatomy 0.000 abstract description 23
- 210000000349 chromosome Anatomy 0.000 abstract description 10
- 241001529936 Murinae Species 0.000 abstract description 8
- 102000007999 Nuclear Proteins Human genes 0.000 abstract description 6
- 108010089610 Nuclear Proteins Proteins 0.000 abstract description 6
- 230000018109 developmental process Effects 0.000 abstract description 6
- 208000032839 leukemia Diseases 0.000 abstract description 6
- 230000024245 cell differentiation Effects 0.000 abstract description 5
- 206010028980 Neoplasm Diseases 0.000 abstract description 4
- 230000002607 hemopoietic effect Effects 0.000 abstract description 3
- 235000018102 proteins Nutrition 0.000 description 20
- 241000282326 Felis catus Species 0.000 description 19
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 18
- 210000004185 liver Anatomy 0.000 description 18
- 241000699666 Mus <mouse, genus> Species 0.000 description 17
- 210000001185 bone marrow Anatomy 0.000 description 17
- 235000001014 amino acid Nutrition 0.000 description 16
- 239000002299 complementary DNA Substances 0.000 description 16
- 230000001605 fetal effect Effects 0.000 description 16
- 108020004414 DNA Proteins 0.000 description 14
- 150000001413 amino acids Chemical class 0.000 description 14
- 210000000952 spleen Anatomy 0.000 description 14
- 210000004369 blood Anatomy 0.000 description 12
- 239000008280 blood Substances 0.000 description 12
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 11
- 210000002996 primitive erythroblast Anatomy 0.000 description 11
- 108010062796 arginyllysine Proteins 0.000 description 9
- 238000003757 reverse transcription PCR Methods 0.000 description 9
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 8
- 108010029485 Protein Isoforms Proteins 0.000 description 8
- 102000001708 Protein Isoforms Human genes 0.000 description 8
- 108010005233 alanylglutamic acid Proteins 0.000 description 8
- 238000007901 in situ hybridization Methods 0.000 description 8
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 7
- TYEJPFJNAHIKRT-DCAQKATOSA-N Lys-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N TYEJPFJNAHIKRT-DCAQKATOSA-N 0.000 description 7
- 108010078144 glutaminyl-glycine Proteins 0.000 description 7
- 108010070643 prolylglutamic acid Proteins 0.000 description 7
- SWSVTNGMKBDTBM-DCAQKATOSA-N His-Gln-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SWSVTNGMKBDTBM-DCAQKATOSA-N 0.000 description 6
- 101000928713 Homo sapiens Dehydrodolichyl diphosphate synthase complex subunit DHDDS Proteins 0.000 description 6
- 101000863415 Kluyveromyces lactis (strain ATCC 8585 / CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37) 40S ribosomal protein S14 Proteins 0.000 description 6
- 241000699670 Mus sp. Species 0.000 description 6
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 6
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 6
- 230000013020 embryo development Effects 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- 210000001541 thymus gland Anatomy 0.000 description 6
- 210000001325 yolk sac Anatomy 0.000 description 6
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 5
- 102100036511 Dehydrodolichyl diphosphate synthase complex subunit DHDDS Human genes 0.000 description 5
- 108091060211 Expressed sequence tag Proteins 0.000 description 5
- 102100037042 Forkhead box protein E1 Human genes 0.000 description 5
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 5
- 101001029304 Homo sapiens Forkhead box protein E1 Proteins 0.000 description 5
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 5
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 5
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 210000003743 erythrocyte Anatomy 0.000 description 5
- 210000005259 peripheral blood Anatomy 0.000 description 5
- 239000011886 peripheral blood Substances 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 210000000130 stem cell Anatomy 0.000 description 5
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 4
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 4
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 4
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 4
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 4
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 4
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 4
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 4
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 4
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 4
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 4
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 4
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 4
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 4
- WZUVPPKBWHMQCE-UHFFFAOYSA-N Haematoxylin Chemical compound C12=CC(O)=C(O)C=C2CC2(O)C1C1=CC=C(O)C(O)=C1OC2 WZUVPPKBWHMQCE-UHFFFAOYSA-N 0.000 description 4
- 102100031573 Hematopoietic progenitor cell antigen CD34 Human genes 0.000 description 4
- 108010033040 Histones Proteins 0.000 description 4
- 101000777663 Homo sapiens Hematopoietic progenitor cell antigen CD34 Proteins 0.000 description 4
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 238000000636 Northern blotting Methods 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 4
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 4
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 4
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 4
- 210000001744 T-lymphocyte Anatomy 0.000 description 4
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 108010060199 cysteinylproline Proteins 0.000 description 4
- 230000004069 differentiation Effects 0.000 description 4
- 230000000925 erythroid effect Effects 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 108010079547 glutamylmethionine Proteins 0.000 description 4
- 102000057348 human HEMGN Human genes 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 210000004940 nucleus Anatomy 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 230000005945 translocation Effects 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 3
- 108020005544 Antisense RNA Proteins 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 3
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 3
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 3
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 3
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 3
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 3
- 208000002250 Hematologic Neoplasms Diseases 0.000 description 3
- 102100033636 Histone H3.2 Human genes 0.000 description 3
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 3
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 3
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 3
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 3
- NCFZHKMKRCYQBJ-CIUDSAMLSA-N Met-Cys-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NCFZHKMKRCYQBJ-CIUDSAMLSA-N 0.000 description 3
- NBEFNGUZUOUGFG-KKUMJFAQSA-N Met-Tyr-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NBEFNGUZUOUGFG-KKUMJFAQSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- 102000003992 Peroxidases Human genes 0.000 description 3
- 238000012300 Sequence Analysis Methods 0.000 description 3
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 3
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- UTCFSBBXPWKLTG-XKBZYTNZSA-N Thr-Cys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O UTCFSBBXPWKLTG-XKBZYTNZSA-N 0.000 description 3
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 210000003855 cell nucleus Anatomy 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 239000003184 complementary RNA Substances 0.000 description 3
- 210000004748 cultured cell Anatomy 0.000 description 3
- 210000000805 cytoplasm Anatomy 0.000 description 3
- 210000002257 embryonic structure Anatomy 0.000 description 3
- 238000010195 expression analysis Methods 0.000 description 3
- 102000018146 globin Human genes 0.000 description 3
- 108060003196 globin Proteins 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 210000002216 heart Anatomy 0.000 description 3
- 210000005003 heart tissue Anatomy 0.000 description 3
- 208000014951 hematologic disease Diseases 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 238000012744 immunostaining Methods 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 108010057821 leucylproline Proteins 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 108040007629 peroxidase activity proteins Proteins 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 238000010186 staining Methods 0.000 description 3
- IAOXXKYIZHCAQJ-ACZMJKKPSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2,4-diamino-4-oxobutanoyl]amino]propanoyl]amino]acetyl]amino]propanoic acid Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O IAOXXKYIZHCAQJ-ACZMJKKPSA-N 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- UMCMPZBLKLEWAF-BCTGSCMUSA-N 3-[(3-cholamidopropyl)dimethylammonio]propane-1-sulfonate Chemical compound C([C@H]1C[C@H]2O)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(=O)NCCC[N+](C)(C)CCCS([O-])(=O)=O)C)[C@@]2(C)[C@@H](O)C1 UMCMPZBLKLEWAF-BCTGSCMUSA-N 0.000 description 2
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 2
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 2
- RUQBGIMJQUWXPP-CYDGBPFRSA-N Ala-Leu-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O RUQBGIMJQUWXPP-CYDGBPFRSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- NYZGVTGOMPHSJW-CIUDSAMLSA-N Arg-Glu-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N NYZGVTGOMPHSJW-CIUDSAMLSA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 2
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 2
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- SMZCLQGDQMGESY-ACZMJKKPSA-N Asp-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N SMZCLQGDQMGESY-ACZMJKKPSA-N 0.000 description 2
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 2
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 2
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 2
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 2
- 101100480530 Danio rerio tal1 gene Proteins 0.000 description 2
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 2
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 2
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 2
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 2
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 2
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 2
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 2
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 2
- XWIBVSAEUCAAKF-GVXVVHGQSA-N Gln-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N XWIBVSAEUCAAKF-GVXVVHGQSA-N 0.000 description 2
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 2
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 2
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 2
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 2
- SYTFJIQPBRJSOK-NKIYYHGXSA-N Gln-Thr-His Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 SYTFJIQPBRJSOK-NKIYYHGXSA-N 0.000 description 2
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 2
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 2
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 2
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 2
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 2
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- YVYVMJNUENBOOL-KBIXCLLPSA-N Glu-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YVYVMJNUENBOOL-KBIXCLLPSA-N 0.000 description 2
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 2
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 2
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 2
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 2
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 2
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 2
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 2
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 2
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 2
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 2
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 2
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 2
- 208000036066 Hemophagocytic Lymphohistiocytosis Diseases 0.000 description 2
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 2
- JWLWNCVBBSBCEM-NKIYYHGXSA-N His-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O JWLWNCVBBSBCEM-NKIYYHGXSA-N 0.000 description 2
- BKOVCRUIXDIWFV-IXOXFDKPSA-N His-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 BKOVCRUIXDIWFV-IXOXFDKPSA-N 0.000 description 2
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 2
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 2
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 2
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 2
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 2
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 2
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 2
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 2
- 102100034343 Integrase Human genes 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 2
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 2
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- CRNNMTHBMRFQNG-GUBZILKMSA-N Lys-Glu-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N CRNNMTHBMRFQNG-GUBZILKMSA-N 0.000 description 2
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 2
- XDPLZVNMYQOFQZ-BJDJZHNGSA-N Lys-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N XDPLZVNMYQOFQZ-BJDJZHNGSA-N 0.000 description 2
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 2
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 2
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 2
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 2
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 2
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 2
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 2
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 2
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 2
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 2
- 101001045554 Mus musculus Hemogen Proteins 0.000 description 2
- 101100480538 Mus musculus Tal1 gene Proteins 0.000 description 2
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 229930040373 Paraformaldehyde Natural products 0.000 description 2
- 101100312945 Pasteurella multocida (strain Pm70) talA gene Proteins 0.000 description 2
- YVXPUUOTMVBKDO-IHRRRGAJSA-N Phe-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CS)C(=O)O YVXPUUOTMVBKDO-IHRRRGAJSA-N 0.000 description 2
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 2
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 2
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 2
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 2
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 2
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 2
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 2
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 2
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 2
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 2
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 2
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 2
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 2
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 2
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 2
- SWRNSCMUXRLHCR-ULQDDVLXSA-N Pro-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 SWRNSCMUXRLHCR-ULQDDVLXSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- 108020004518 RNA Probes Proteins 0.000 description 2
- 239000003391 RNA probe Substances 0.000 description 2
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 2
- 102100022513 Selenocysteine lyase Human genes 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 2
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 2
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 2
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 2
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 2
- CYVQBKQYQGEELV-NKIYYHGXSA-N Thr-His-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CYVQBKQYQGEELV-NKIYYHGXSA-N 0.000 description 2
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 2
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 2
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 2
- WLQRIHCMPFHGKP-PMVMPFDFSA-N Trp-Leu-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=CC=C1 WLQRIHCMPFHGKP-PMVMPFDFSA-N 0.000 description 2
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 2
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 2
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 2
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 2
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 2
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 2
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 2
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 2
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 108010066988 asparaginyl-alanyl-glycyl-alanine Proteins 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 230000005742 definitive hemopoiesis Effects 0.000 description 2
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 2
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 230000003828 downregulation Effects 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 230000010437 erythropoiesis Effects 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 208000018706 hematopoietic system disease Diseases 0.000 description 2
- 230000002440 hepatic effect Effects 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 210000002540 macrophage Anatomy 0.000 description 2
- 210000001161 mammalian embryo Anatomy 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 210000001616 monocyte Anatomy 0.000 description 2
- 210000000822 natural killer cell Anatomy 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 230000005868 ontogenesis Effects 0.000 description 2
- 229920002866 paraformaldehyde Polymers 0.000 description 2
- PHEDXBVPIONUQT-RGYGYFBISA-N phorbol 13-acetate 12-myristate Chemical compound C([C@]1(O)C(=O)C(C)=C[C@H]1[C@@]1(O)[C@H](C)[C@H]2OC(=O)CCCCCCCCCCCCC)C(CO)=C[C@H]1[C@H]1[C@]2(OC(C)=O)C1(C)C PHEDXBVPIONUQT-RGYGYFBISA-N 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 230000003999 primitive hemopoiesis Effects 0.000 description 2
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- 210000004881 tumor cell Anatomy 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- QRXMUCSWCMTJGU-UHFFFAOYSA-N 5-bromo-4-chloro-3-indolyl phosphate Chemical compound C1=C(Br)C(Cl)=C2C(OP(O)(=O)O)=CNC2=C1 QRXMUCSWCMTJGU-UHFFFAOYSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- BPTFNDRZKBFMTH-DCAQKATOSA-N Asp-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N BPTFNDRZKBFMTH-DCAQKATOSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- 208000019838 Blood disease Diseases 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 101710184216 Cardioactive peptide Proteins 0.000 description 1
- 101100341294 Chlorobium chlorochromatii (strain CaD3) ispE gene Proteins 0.000 description 1
- 206010010144 Completed suicide Diseases 0.000 description 1
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 1
- PORWNQWEEIOIRH-XHNCKOQMSA-N Cys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)C(=O)O PORWNQWEEIOIRH-XHNCKOQMSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 241000450599 DNA viruses Species 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 238000009007 Diagnostic Kit Methods 0.000 description 1
- 239000006144 Dulbecco's modified Eagle's medium Substances 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 101710100588 Erythroid transcription factor Proteins 0.000 description 1
- 102100031690 Erythroid transcription factor Human genes 0.000 description 1
- 208000035366 Familial hemophagocytic lymphohistiocytosis Diseases 0.000 description 1
- 101710082961 GATA-binding factor 2 Proteins 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- SOIAHPSKKUYREP-CIUDSAMLSA-N Gln-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SOIAHPSKKUYREP-CIUDSAMLSA-N 0.000 description 1
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- LVRKAFPPFJRIOF-GARJFASQSA-N Gln-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N LVRKAFPPFJRIOF-GARJFASQSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 1
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 1
- IMRNSEPSPFQNHF-STQMWFEESA-N Gly-Ser-Trp Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O IMRNSEPSPFQNHF-STQMWFEESA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 1
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 1
- ZYDYEPDFFVCUBI-SRVKXCTJSA-N His-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZYDYEPDFFVCUBI-SRVKXCTJSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- 208000032672 Histiocytosis haematophagic Diseases 0.000 description 1
- 101100273831 Homo sapiens CDS1 gene Proteins 0.000 description 1
- 101001002657 Homo sapiens Interleukin-2 Proteins 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- OONBGFHNQVSUBF-KBIXCLLPSA-N Ile-Gln-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(O)=O OONBGFHNQVSUBF-KBIXCLLPSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- 102100020873 Interleukin-2 Human genes 0.000 description 1
- 208000031671 Large B-Cell Diffuse Lymphoma Diseases 0.000 description 1
- 241000272168 Laridae Species 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 101100273832 Mus musculus Cds1 gene Proteins 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- 108091005461 Nucleic proteins Chemical group 0.000 description 1
- 108091036407 Polyadenylation Proteins 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- DRIJZWBRGMJCDD-DCAQKATOSA-N Pro-Gln-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O DRIJZWBRGMJCDD-DCAQKATOSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- WCNVGGZRTNHOOS-ULQDDVLXSA-N Pro-Lys-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O WCNVGGZRTNHOOS-ULQDDVLXSA-N 0.000 description 1
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- KDCGOANMDULRCW-UHFFFAOYSA-N Purine Natural products N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- 210000005221 acidic domain Anatomy 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- -1 and similarly Proteins 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 210000000709 aorta Anatomy 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000000069 breast epithelial cell Anatomy 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 238000007675 cardiac surgery Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000014107 chromosome localization Effects 0.000 description 1
- 238000013270 controlled release Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000002559 cytogenic effect Effects 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 230000009274 differential gene expression Effects 0.000 description 1
- 230000003292 diminished effect Effects 0.000 description 1
- 238000007877 drug screening Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 108010089558 erythroid Kruppel-like factor Proteins 0.000 description 1
- 210000003013 erythroid precursor cell Anatomy 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000012091 fetal bovine serum Substances 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 210000003714 granulocyte Anatomy 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 230000009067 heart development Effects 0.000 description 1
- 230000009033 hematopoietic malignancy Effects 0.000 description 1
- 210000000777 hematopoietic system Anatomy 0.000 description 1
- 208000014752 hemophagocytic syndrome Diseases 0.000 description 1
- 229960002897 heparin Drugs 0.000 description 1
- 229920000669 heparin Polymers 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 210000003917 human chromosome Anatomy 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 239000000252 konjac Substances 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 210000003563 lymphoid tissue Anatomy 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 210000003519 mature b lymphocyte Anatomy 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 210000003716 mesoderm Anatomy 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 208000025113 myeloid leukemia Diseases 0.000 description 1
- 210000003643 myeloid progenitor cell Anatomy 0.000 description 1
- 210000004967 non-hematopoietic stem cell Anatomy 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000005305 organ development Effects 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 239000012188 paraffin wax Substances 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 210000004976 peripheral blood cell Anatomy 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 210000001778 pluripotent stem cell Anatomy 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 239000001120 potassium sulphate Substances 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 238000001273 protein sequence alignment Methods 0.000 description 1
- 125000000561 purinyl group Chemical group N1=C(N=C2N=CNC2=C1)* 0.000 description 1
- 239000000376 reactant Substances 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 201000006845 reticulosarcoma Diseases 0.000 description 1
- 208000029922 reticulum cell sarcoma Diseases 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000003161 ribonuclease inhibitor Substances 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 210000002027 skeletal muscle Anatomy 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 230000004960 subcellular localization Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 210000000115 thoracic cavity Anatomy 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6875—Nucleoproteins
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
Definitions
- The present invention, in the field of medicine and molecular biology, is directed to a new nucleic acid encoding a protein that is involved in early stages of hematopoiesis.
- the gene maps to a chromosomal region rich in translocation breaks associated with human hematological disease, primarily leukemias.
- Hematopoiesis is a dynamic process with sequential shifting of primary hematopoietic sites from yolk sac to fetal liver and finally bone marrow (BM).
- hematopoietic tissues are derived from the ventral mesoderm.
- E7.5 embryonic day 7.5
- fetal liver becomes the predominant site of blood cell formation in the embryo (Dzierzak and Medvinsky, 1995).
- hematopoiesis in fetal liver gradually decreases, and BM becomes the major hematopoietic site.
- In contrast to the human spleen, hematopoiesis, particularly erythropoiesis, is very active in the red pulp of the mouse spleen in adulthood (Seifert and Marks, 1985). Hematopoiesis in the yolk sac is termed “primitive” (embryonic) hematopoiesis, and that in the fetal liver and BM is termed “definitive” (adult) hematopoiesis. Mature blood cells of the three main hematopoietic lineages, erythroid, myeloid and lymphoid, are derived from common pluripotent hematopoietic stem cells (Spangrude et al., 1988).
- Hemogen hemopoietic gene
- Hemogen transcripts were specifically detected in blood islands, primitive blood cells and fetal liver during embryogenesis, and then remained in bone marrow and spleen in adult mice. Immunostaining demonstrated that Hemogen is a nuclear protein.
- EDAG human homologue of Hemogen
- chromosome 9q22 a leukemia breakpoint.
- EDAG exhibited specific expression in hematopoietic tissues and cells.
- Hemogen and EDAG play an important role in hematopoietic development and neoplasms.
- FIG. 1 presents the nucleotide sequence of mouse Hemogen (SEQ ID NO:1) (upper) and the deduced amino acid sequence (SEQ ID NO:2) (lower). Numbers on the left refer to the nucleotide sequence.
- The first ATG (nucleotides 191-193) defines a 1512 bp ORF with multiple upstream stop codons (upper case and bold) in all three reading frames.
- the Kozak consensus sequence around ATG (191-193) is underlined.
- A polyadenylation signal (double-underscored) is 16 nucleotides upstream of the poly(A) tail.
- the amino acid sequence contains 503 residues with an N-terminal basic domain (underlined) and an acidic domain at the C-terminus (residues 450-480, double underline).
- the region of residues 34-50 (in brackets) is a predicted coiled-coil domain.
- This protein also contains a bipartite nuclear localization signal (residues 61-78, underlined italic) that is a basic amino acid cluster.
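The coordinates quoted for FIG. 1 can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes 1-based, inclusive nucleotide numbering and assumes that the stated 1512 bp ORF length includes the stop codon (which is what makes 1512 bp consistent with 503 residues). The function names are hypothetical and do not come from the specification.

```python
# Values quoted in the FIG. 1 description (1-based, inclusive numbering assumed).
ATG_START = 191          # first nucleotide of the initiating ATG
ORF_LENGTH_BP = 1512     # ORF length; assumed here to include the stop codon
PROTEIN_RESIDUES = 503   # residues in the deduced amino acid sequence


def orf_end(start: int, length_bp: int) -> int:
    """Return the last nucleotide position of an ORF (1-based, inclusive)."""
    return start + length_bp - 1


def residues_encoded(length_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by an ORF of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = length_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    print(orf_end(ATG_START, ORF_LENGTH_BP))   # last nucleotide of the ORF
    print(residues_encoded(ORF_LENGTH_BP))     # residues, excluding the stop codon
```

Under these assumptions the ORF spans nucleotides 191 to 1702, and 504 codons minus one stop codon gives the 503 residues stated above.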
- FIG. 2 is a photomicrograph showing Hemogen protein localized in cell nuclei.
- the pcDNA3.1 plasmid with Hemogen-FLAG fusion gene was transfected into COS-7 cells and detected with anti-FLAG antibody. The signal was localized in the cell nuclei (nu) but not in the nucleoli (no).
- FIGS. 3A-3Q are a series of photomicrographs showing expression of Hemogen during mouse embryogenesis by in situ hybridization.
- Digoxigenin-labeled antisense RNA probes were hybridized with a mouse frontal section at E8.5 (panel A) and sagittal sections at E9.5, E10.5, E11.5, E12.5 and E14.5 (panels D, F, I, L, and O, respectively).
- the Hemogen transcripts are shown as purple staining.
- Panels B, E, G, J, M, and P show circulating blood cells in panels A, D, F, I, L and O at high magnification (1000×).
- The blood island is shown at high magnification (1000×) in panel C.
- ao aorta
- bc blood cell
- bi blood island
- lv liver.
- FIGS. 4A-4B are a set of photomicrographs showing E10.5 and E12.5 blood cells. Embryo sections were stained with hematoxylin to show the distinct morphology of circulating blood cells at these two stages. (Panel A) The E10.5 primitive erythrocytes have a higher nucleus/cytoplasm ratio. (Panel B) The E12.5 primitive erythrocytes have more condensed nuclei.
- FIG. 5A-C shows a Northern blot analysis of Hemogen.
- Total RNA (~15 μg) from each indicated tissue was hybridized with a 32P-labeled antisense RNA probe derived from Hemogen.
- Panel A A ~2.4 kb message was detected in spleen and BM by 5 h exposure.
- Panel B The same filter was overexposed for 5 days. Besides the signals in BM and spleen, three very weak bands (arrows) were detected in the peripheral blood but not in other tissues.
- Panel C The same agarose gel was stained with ethidium bromide before transfer to monitor RNA loading.
- FIGS. 6A-6C are a series of photomicrographs showing expression of Hemogen in adult mouse spleen by in situ hybridization.
- Panel A A spleen section was hybridized with the Hemogen sense RNA probes as a negative control.
- Panel B By hybridization with antisense RNA probe, Hemogen transcripts were localized in the red pulp (rp) but not in the white pulp (wp).
- FIG. 7 shows expression analysis of Hemogen by RT-PCR.
- The PCR products were amplified from the templates as indicated. Lanes containing RT-PCR reactions without reverse transcriptase are labeled RT(−). Histone H3 was used as the internal control.
- FIG. 8 shows the amino acid sequence alignment of Hemogen, EDAG and RP59.
- The sequences were aligned with the ClustalW program.
- the mouse gene Hemogen (accession # AF269248) shares 70% and 43% identity with a rat gene RP59 (accession # AJ302650) and a human gene EDAG (accession # AF322875) respectively at the amino acid level.
- the nuclear localization signal (61-78 residues) and coiled-coil domain (34-50 residues) are highly conserved.
- FIGS. 9A-9B show an expression analysis of EDAG in human tissues, cultured cells and cell lines.
- Panel A Northern analysis of human tissue RNA blots with 32P-labeled EDAG cDNA probes. Two isoforms, a 2.4 kb major isoform and a 1.8 kb minor isoform, were detected (arrow).
- Panel B shows expression of EDAG in hematopoietic tissues, cultured cells, cell lines and non-hematopoietic cell lines by RT-PCR. Histone H3 was used as the internal control.
- the present invention provides a new nuclear protein, Hemogen in mice, EDAG in humans (the names are used interchangeably herein), which shows a spatial-temporal expression pattern corresponding to the ontogeny of hematopoiesis. Its expression is strictly localized to hematopoietic tissues from embryonic stages through adulthood. Hemogen is differentially expressed in immature hematopoietic progenitor cells but downregulated in mature nucleated blood cells. A close correlation exists between Hemogen expression and hematopoiesis.
- The nucleotide sequence (SEQ ID NO:3) and amino acid sequence (SEQ ID NO:4) of human EDAG are shown below.
- The nucleotide sequence includes both 5′ and 3′ flanking non-coding sequence as well. Only the nucleotide sequence is numbered.
  1 gttatgaagataggtactg
  20 tgggtgttagaaagattcacggcaaaacagggaagcatctaggct
  65 gcttgtggaagtcagaccaaaatagcaggaaggtattgcagcaag
  110 atggatttgggaaaggaccaatctcatttgaagcaccatcagaca
  M D L G K D Q S H L K H H Q T
  155 cctgaccctcatcaagaagagaaccattctccagaagtcattgga
  P D P H Q E E N H S P E V I G
  200 acctggagtttgggt
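As a consistency check on the listing above, the visible portion of the EDAG coding sequence (nucleotides 110-199) can be translated with the standard genetic code and compared with the amino acids printed beneath it. This is only a sketch: the `translate` helper and the table construction are illustrative and not part of the specification, and only the fragment shown above is used.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = dna.lower()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Nucleotides 110-199 of the listing, copied from the two rows above.
EDAG_FRAGMENT = (
    "atggatttgggaaaggaccaatctcatttgaagcaccatcagaca"
    "cctgaccctcatcaagaagagaaccattctccagaagtcattgga"
)

if __name__ == "__main__":
    print(translate(EDAG_FRAGMENT))
```

The translation reproduces the residues shown in the listing (M D L G K D Q S H L K H H Q T, then P D P H Q E E N H S P E V I G), and the three preceding rows (19 + 45 + 45 nucleotides) account for the ATG starting at position 110.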
- EDAG, which is also specifically expressed in hematopoietic cells, maps to chromosome 9q22, a region containing breakpoints (for translocation) present in several hematopoietic neoplasms.
- the invention is also directed to an isolated nucleic acid molecule that hybridizes with any of the above nucleic acid molecules under stringent hybridization conditions.
- Preferred stringent conditions include incubation in 6× sodium chloride/sodium citrate (SSC) at about 45° C., followed by a wash in about 0.2× SSC at a temperature of about 50° C.
- SSC 6× sodium chloride/sodium citrate
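For readers who want the hybridization and wash buffers in molar terms, the short sketch below converts an SSC strength into salt concentrations. It relies on the conventional laboratory definition of 1× SSC as 0.15 M sodium chloride and 0.015 M sodium citrate, and on a 20× stock. Both are common conventions assumed here, not values stated in the specification, and the function names are hypothetical.

```python
# Conventional composition of 1x SSC (assumed; not stated in the specification).
SSC_1X = {"NaCl_M": 0.15, "sodium_citrate_M": 0.015}
STOCK_FOLD = 20.0  # a 20x stock is assumed for the dilution helper


def ssc_molarity(fold: float) -> dict:
    """Return molar concentrations of the two salts at a given SSC strength."""
    return {name: round(conc * fold, 6) for name, conc in SSC_1X.items()}


def stock_volume_ml(final_volume_ml: float, fold: float) -> float:
    """Volume of 20x stock needed to prepare final_volume_ml at the given strength."""
    return final_volume_ml * fold / STOCK_FOLD


if __name__ == "__main__":
    print(ssc_molarity(6))           # hybridization buffer
    print(ssc_molarity(0.2))         # wash buffer
    print(stock_volume_ml(100, 6))   # mL of 20x stock for 100 mL of 6x SSC
```

The lower salt concentration of the 0.2× wash, combined with the higher wash temperature, is what makes the wash step the more stringent of the two.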
- a preferred nucleic acid molecule as above encodes a protein having an amino acid sequence selected from SEQ ID NO:2 and SEQ ID NO:4 or encodes a biologically active fragment, homologue or other functional derivative of the protein.
- the nucleic acid molecule encodes the protein having the sequence SEQ ID NO:4 (EDAG of human origin) or encodes the biologically active fragment, homologue or other functional derivative of SEQ ID NO:4.
- the present invention includes an “isolated” Hemogen or EDAG polypeptide having the sequence SEQ ID NO:2 or SEQ ID NO:4. While the present disclosure exemplifies the full length human and murine proteins (and DNA), it is to be understood that homologues of EDAG from other mammalian species and mutants thereof that possess the characteristics disclosed herein are intended within the scope of this invention.
- a functional derivative retains measurable EDAG activity, preferably that of binding to an anti-EDAG antibody, or expression in hematopoietic cells, preferably progenitors, which permits its utility in accordance with the present invention.
- “Functional derivatives” encompass “variants” and “fragments” regardless of whether the terms are used in the conjunctive or the alternative herein.
- a functional homologue must possess the above biochemical and biological activity.
- sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes).
- Cys residues are aligned.
- the length of a sequence being compared is at least 30%, preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, and even more preferably at least 70%, 80%, or 90% of the length of the reference sequence.
- The amino acid residues (or nucleotides) at corresponding amino acid (or nucleotide) positions are then compared.
- When a position in the first sequence is occupied by the same amino acid residue (or nucleotide) as the corresponding position in the second sequence, then the molecules are identical at that position (as used herein amino acid or nucleic acid “identity” is equivalent to amino acid or nucleic acid “homology”).
- the percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences.
- the comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm.
- the percent identity between two amino acid sequences is determined using the Needleman and Wunsch (J. Mol. Biol. 48:444-453 (1970)) algorithm, which has been incorporated into the GAP program in the GCG software package (available at http://www.gcg.com), using either a BLOSUM62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6.
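As an illustration of the global alignment step named above, the following is a minimal Needleman-Wunsch sketch. It is not the GCG GAP program: it assumes a toy match/mismatch score and a single linear gap penalty in place of a substitution matrix with separate gap and length weights, and the function name and default scores are hypothetical choices made for this example.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Globally align strings a and b; return (score, aligned_a, aligned_b).

    Assumption: linear gap penalty and a flat match/mismatch score,
    not a substitution matrix with gap-opening and gap-extension weights.
    """
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if (i > 0 and j > 0 and a[i - 1] == b[j - 1]) else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))
```

With these toy scores, aligning "ACGT" against "AGT" places a single gap opposite the C. A production comparison would substitute the matrix and gap weights recited above.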
- the percent identity between two nucleotide sequences is determined using the GAP program in the GCG software package (available at http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6.
- the percent identity between two amino acid or nucleotide sequences is determined using the algorithm of E. Meyers and W. Miller (CABIOS, 4:11-17 (1989)) which has been incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4.
- nucleic acid and protein sequences of the present invention can further be used as a “query sequence” to perform a search against public databases, for example, to identify other family members or related sequences.
- search can be performed using the NBLAST and XBLAST programs (version 2.0) of Altschul et al. (1990) J. Mol. Biol. 215:403-10.
- Gapped BLAST can be utilized as described in Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402.
- the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See http://www.ncbi.nlm.nih.gov.
- a homologue of the EDAG protein described above is characterized as having (a) functional activity of native EDAG, and (b) sequence similarity to a native EDAG protein (such as SEQ ID NO:2 or SEQ ID NO:4), when determined as above, of at least about 30% (at the amino acid level), preferably at least about 50%, more preferably at least about 70%, even more preferably at least about 90%.
- an expression vector comprising any of the above nucleic acid molecules operatively linked to
- the above expression vector may be a plasmid or a viral vector.
- These vectors include self-replicating RNA replicons (DNA-launched or RNA), suicide RNA vectors, DNA viruses (such as adenovirus, vaccinia virus, etc.) and RNA virions grown on packaging cell lines.
- the vector DNA or RNA may be complexed to gold particles for gene gun-mediated introduction to a host or complexed with other polymers, for example, in controlled release formulations, that enhance delivery to the desired target cells and tissues.
- This invention includes a cell transformed or transfected with any of the above nucleic acid molecules or expression vectors.
- the cell is preferably a eukaryotic cell, more preferably a mammalian cell, most preferably a human cell.
- the cell may be a hematopoietic cell, preferably a progenitor cell.
- the cell is a tumor cell.
- a preferred embodiment is an isolated mammalian tumor cell transfected with an exogenous nucleic acid molecule encoding a mammalian EDAG (preferably SEQ ID NO:2 or SEQ ID NO:4) or a biologically active fragment, homologue or other functional derivative thereof, wherein the EDAG is expressed in the cells.
- EDAG fusion polypeptide having a first fusion partner comprising all or a part of an EDAG protein fused
- an EDAG fusion protein may also be fused to a second polypeptide, preferably one or more domains of an Ig heavy chain constant region, preferably having an amino acid sequence corresponding to the hinge, CH2 and CH3 regions of a human immunoglobulin Cγ1 chain.
- the present invention includes antibodies specific to Hemogen and to EDAG, produced by conventional methods. These include polyclonal antisera and monoclonal antibodies. Also included are antigen-binding fragments of these antibodies.
- the antibodies may be used to detect the presence or measure the amount of Hemogen or EDAG protein in a tissue sample or biological fluid in any conventional immunoassay. The antibodies may therefore be used in diagnostic assays for abnormalities associated with altered expression of these proteins in humans, mice or in the homologues of these proteins in other species.
- an antibody that is specific for an epitope of a EDAG protein.
- the epitope may be a linear or conformational epitope of a polypeptide of SEQ ID NO:2 or SEQ ID NO:4.
- the antibody is preferably a monoclonal antibody, more preferably a human or humanized (via engineering) monoclonal antibody.
- Another method is provided for isolating cells expressing a EDAG polypeptide from a cell population, comprising
- the nucleic acid encoding EDAG and the EDAG protein are used in diagnostic methods and kits for evaluating abnormalities in early hematopoiesis such as those that lead to cancer. These molecules are also useful in drug screening for potential agents that stimulate or inhibit early hematopoietic differentiation and may contribute to the inhibition of leukemogenesis or lymphomagenesis. Finally, the protein, biologically active fragments thereof, or antibodies to the protein, are useful as therapeutic agents to treat diseases associated with abnormal early hematopoietic differentiation such as certain forms of leukemia.
- Hemogen also shared homology with several human ESTs.
- Two human EST clones (accession # T52254 and AA393302) were sequenced and the ORF was deduced.
- three human homologous sequences including two draft human genomic sequences (accession # AC015928 and AL354726) and a human cDNA with hypothetical protein EDAG-1 (accession # AF228713), were deposited to GenBank.
- the putative ORF that we defined and named “EDAG” shows an additional 175 amino acids at the N-terminus of the previous deposited EDAG-1 sequence.
- a rat homologue, RP59 (GenBank # AJ302650) was deposited.
- the amino acid sequences of Hemogen, EDAG and RP59 were aligned (FIG. 8) using ClustalW program (http://www2.ebi.ac.uk/clustalw/).
- Mammalian expression vector pcDNA3.1 (Invitrogen) was modified by inserting a FLAG tag into the multiple cloning sites to express the FLAG-fusion protein. This plasmid was re-named pcDNA3.1-FLAG. To generate a mammalian expression vector of Hemogen, the ORF (191-1699 bp) was cloned into pcDNA 3.1-FLAG. The resulting construct, named pcDNA3.1-Hemogen, was sequenced to confirm that the Hemogen ORF was fused in-frame with the FLAG tag at the C-terminus.
- COS-7 cells were cultured in Dulbecco's modified Eagle medium with 10% fetal bovine serum (Gibco BRL). Using Lipofectamine Plus™ reagent (Gibco BRL), 1 μg plasmid DNA of pcDNA3.1-Hemogen was transfected into COS-7 cells to express Hemogen-FLAG fusion protein. In parallel, the same amount of pcDNA3.1-FLAG vector was transfected as a negative control. For immunostaining, 24 hours after transfection cells were fixed in 4% paraformaldehyde for 4 minutes. The endogenous peroxidase was quenched with 0.3% H2O2 in methanol.
- the cells were incubated with the mouse anti-FLAG M2 monoclonal antibody (Sigma), and then, after washing, with the anti-mouse IgG antibody conjugated with peroxidase (Vector). The signals were detected through the reaction of peroxidase with the substrate DAB (Roche).
- RNAs synthesized by in vitro transcription were used as the riboprobes for Northern and in situ hybridization.
- a 224 bp EcoRI-ApaI fragment of Hemogen cDNA was cloned into pBluescript-SKII vector to produce riboprobe.
- the conditions for in vitro transcription were the following: 1 μg linearized DNA, 1× transcription buffer (Roche), 10 mM DTT, 2 μl DIG RNA labeling mix (Roche), 20 units RNase inhibitor, 10 units RNA polymerase in 20 μl volume. The reaction was incubated at 37° C. for 2 hours and then digested with 2 units RNase-free DNaseI at 37° C. for 15 minutes to remove the template DNA.
- the prehybridization was performed in the hybridization buffer (50% formamide, 5× SSC, pH 4.5, 2% blocking reagent (Roche), 0.1% Tween 20, 0.5% CHAPS, 50 μg/ml yeast RNA, 5 mM EDTA, 50 μg/ml heparin) at 60-65° C. for 1 hour.
- the hybridization was done with 1 μg/ml digoxigenin-labeled RNA probes at 60-65° C. overnight. Non-specific reactants were removed by three 15-minute washes in 50% formamide, 2× SSC and 0.1% CHAPS at 60-65° C.
- RNAs were extracted from tissues or cells with TRIzol reagent (Gibco BRL), and fractionated in 1.2% agarose-formaldehyde gel.
- the Northern blots were performed by standard methods (Sambrook et al., 1989). Human tissue blot was purchased from Clontech.
- RNA was used to synthesize first-strand cDNA using SuperScript™ reverse transcriptase (Gibco BRL) in a 20 μl reaction.
- the PCR reaction was performed with Taq DNA polymerase (Gibco BRL) for 30 cycles.
- Primer pairs were:
- Hemogen: 5′-AAACACACCTCTCTCCTACCAC-3′ (SEQ ID NO:6) and 5′-CCTACTTTCTGGGCTCCTTCTG-3′ (SEQ ID NO:7).
- EDAG: 5′-AAGCACCATCAGACACCTGACC-3′ (SEQ ID NO:8) and 5′-TGCTTGAAGAGAGCATCCTGCC-3′ (SEQ ID NO:9).
- Histone H3: 5′-CCACTGAACTTCTGATTCGC-3′ (SEQ ID NO:10) and 5′-GGGTGCTAGCTGGATGTCTT-3′ (SEQ ID NO:11).
- PCR products of Hemogen, EDAG and Histone are 881 bp, 751 bp and 214 bp respectively.
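The stated product sizes follow from where the two primers of each pair anneal on the template. The sketch below shows one way such a size could be checked in silico. It assumes exact primer matches on a single-stranded template string, the function names are hypothetical, and the short template in the usage note is made up for illustration rather than taken from the disclosed sequences.

```python
# Translation table for complementing DNA bases (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def amplicon_length(template, forward, reverse):
    """Length in bp of the product bounded by the two primers, or None.

    Assumption: both primers match the template exactly; the forward
    primer is searched as written and the reverse primer as its
    reverse complement, downstream of the forward site.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    site = reverse_complement(reverse.upper())
    end = template.find(site, start + 1)
    if end == -1:
        return None
    return end + len(site) - start
```

For example, with the EDAG primer pair and a made-up template consisting of the forward primer, ten spacer bases and the reverse-primer site, `amplicon_length` returns 54. Running the same function against the full EDAG cDNA sequence would be one way to compare with the 751 bp figure given above.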
- BM cells were isolated from fragments of ribs that were removed from patients undergoing thoracic surgery. Young thymus tissue was obtained from children undergoing cardiac surgery. CD34+ cells and monocytes were obtained from the blood of cancer patients undergoing peripheral mobilization for autologous transplant. T cells were obtained from blood, cultured with IL-2 for several weeks and were >99% CD3+. Hematopoietic cell lines K562, U937 and non-hematopoietic cell lines 24SV48, HUVEC and SKBR3 were cultured in RPMI-1640 medium with 10% calf serum.
- One of the clones, 6B2, showed differential expression at E10.5. Based on a search in GenBank, this clone showed sequence similarity and homology with several mouse ESTs, e.g., GenBank accession numbers AA051237, AI121196 and AI006512. These three independent clones were sequenced to obtain the full-length cDNA sequences from which the amino acid sequence was deduced (FIG. 1).
- This novel gene, now designated Hemogen (hemopoietic gene), encodes 503 amino acids with a calculated molecular weight of 55,043 Da and a pI of 4.84.
- ATG is the translation initiation codon and the surrounding sequence AAG ATG G is consistent with the Kozak consensus sequence (purine at position −3 and G at position +4) (Kozak, 1997). Stop codons exist in all three reading frames upstream of the presumed initiation codon.
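The Kozak criterion quoted above (a purine at position −3 and a G at position +4, counting the A of ATG as +1) can be expressed as a short check. This is a minimal sketch with a hypothetical function name; it tests only those two positions and not the full consensus.

```python
def kozak_context_ok(seq, atg_index):
    """Check the two key Kozak positions around an ATG.

    atg_index is the 0-based index of the A of ATG in seq.
    Position -3 is three bases upstream of that A; position +4 is the
    base immediately after the G of ATG.
    """
    seq = seq.upper()
    if seq[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")
    # Not enough flanking sequence to evaluate both positions.
    if atg_index < 3 or atg_index + 3 >= len(seq):
        return False
    return seq[atg_index - 3] in "AG" and seq[atg_index + 3] == "G"
```

Applied to the context reported for Hemogen, `kozak_context_ok("AAGATGG", 3)` evaluates to `True`, because position −3 is an A and position +4 is a G.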
- the polyadenylation signal sequence is ATTAAA (2288-2293 nt), a most common variant of the canonical sequence AATAAA (Graber et al., 1999).
- the putative Hemogen protein has a basic N-terminal domain (residues 34-78, with a net charge of +15) and an acidic C-terminal domain (residues 450-480, with a net charge of −11).
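A net charge of this kind can be approximated by counting basic and acidic residues within a residue range. The sketch below assumes the simplest convention (Lys and Arg count +1, Asp and Glu count −1, histidine and the chain termini are ignored), which may differ from the method actually used for the figures above. The function name is hypothetical.

```python
def net_charge(protein, start, end):
    """Approximate net charge of residues start..end (1-based, inclusive).

    Assumption: K and R contribute +1, D and E contribute -1;
    histidine and terminal groups are ignored.
    """
    region = protein[start - 1:end].upper()
    positive = sum(region.count(aa) for aa in "KR")
    negative = sum(region.count(aa) for aa in "DE")
    return positive - negative
```

For instance, the ten-residue stretch KKRKQQRTGK scores +6 under this convention.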
- the region from residues 34-50 is predicted (at a window size of 14) to be a coiled-coil domain (Lupas, 1996), which is implicated in protein polymerization.
- Residues 61-78 contain a bipartite nuclear localization signal suggesting that Hemogen is a nuclear protein (Dingwall and Laskey, 1991).
- Analysis of amino acid composition shows that usages of proline (10.3%), glutamate (8.7%) and glutamic acid (11.9%) are higher than average (Brendel et al., 1992).
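Composition figures such as those above are the percentage of each residue type in the full-length sequence. A minimal sketch, assuming a one-letter amino acid string as input (the function name is hypothetical):

```python
from collections import Counter


def composition_percent(protein):
    """Return {residue: percent of total length} for a one-letter sequence."""
    counts = Counter(protein.upper())
    total = sum(counts.values())
    return {aa: 100.0 * n / total for aa, n in counts.items()}
```

The resulting percentages can then be compared against average usage values from a reference protein set, which this sketch does not supply.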
- Hemogen was expressed during early embryogenesis at E8.5 in the blood islands of the yolk sac and in the circulating primitive blood cells (FIG. 3A,B,C). The blood islands are the first sites to produce primitive blood cells. Expression in circulating blood cells was highly detectable at E9.5 and E10.5 (FIG. 3D,E,F,G). As liver organogenesis emerged at E10.5, Hemogen expression became detectable in the developing hepatic primordia (FIG. 3F,H). Fetal liver is the primary site of definitive blood cells generation during fetal stages.
- Hemogen is expressed in both primitive and definitive hematopoiesis. It is sequentially expressed in the active hematopoietic sites, such as yolk sac, fetal liver, BM and spleen. Hemogen expression is highly specific to the hematopoietic system throughout development since no transcripts were detected in any non-hematopoietic tissues.
- Hemogen was primarily expressed in Lineage− blast cells, Lin(lo) cKit+ Sca-1+ pluripotent stem cells and CD34+ stem cells. Previous studies have shown that these three cell types (Spangrude et al., 1988, Li and Johnson, 1995, Krause et al., 1996) are enriched in early multipotential stem cells. Low levels of expression were found in cultured macrophages and natural killer cells.
- EDAG was found to contain the STS marker SHGC-33415.
- a search of GeneMap'99 http://www.ncbi.nlm.nih.gov/genemap/
- GDB database http://gdbwww.gdb.org/gdb/advancedSearch.html
- the human gene EDAG homologous with Hemogen maps to chromosome 9q22, which correlates with breakpoints detected in several human hematological neoplasms, such as acute myeloid leukemia (Mitelman, 1991; Mitelman et al., 1997).
- EDAG is Also Specifically Expressed in Human Hematopoietic Tissues and Cells
- RT-PCR permitted detection of high levels of transcripts in the myelogenous leukemia cell line K562, K562 stimulated with phorbol myristate acetate (PMA), adult BM and CD34+ progenitor cells.
- A low level of expression appeared in the thymus of a child and in cells of the histiocytic lymphoma (macrophage-like) cell line U-937.
- No expression was detected in cultured blood T cells, monocytes or in other non-hematopoietic cell lines including SV-40 transformed thymus epithelial cell line 24SV48, endothelial cell line HUVEC and breast epithelial cell line SKBR3 (FIG. 9B).
- EDAG is specifically expressed in hematopoietic cells, and expression is developmentally regulated.
- Described herein are a novel murine gene, Hemogen, and its human homologue, EDAG, that are specifically expressed in hematopoietic tissues.
- Hemogen exhibited highly specific expression in hematopoiesis throughout development.
- in primitive hematopoiesis, Hemogen is detectable in blood islands and in circulating primitive erythrocytes.
- in definitive hematopoiesis, Hemogen is expressed in the fetal liver as early as the time of formation of hepatic primordia at E10.5. After E11.5, expression was limited to the fetal liver.
- Hemogen was expressed in BM, spleen and very weakly in peripheral blood but not in other tissues. Very few genes exhibit such tissue- and stage-specific expression patterns in the blood system throughout all developmental stages. Given its presence as a nuclear factor, Hemogen is expected to play a regulatory role in hematopoiesis.
- EDAG is the human homologue of Hemogen, and similarly, EDAG is closely tied in with hematopoiesis. Based on sequences in GenBank (accession # AF228713), EDAG cDNA was cloned from the fetal liver, an active site of hematopoiesis. Moreover, EDAG was specifically expressed in a variety of hematopoietic cells and tissues but not in non-hematopoietic cells.
- Hemogen a novel murine gene expressed in cells and tissues that coincide with active hematopoiesis.
- human homologue that is specifically expressed in hematopoietic tissues and maps to leukemia breakpoints of a human chromosome.
- FKHL15 a new human member of the forkhead gene family located on chromosome 9q22. Genomics 41, 390-396.
- M-CSF macrophage colony-stimulating factor
- GM-CSF granulocyte-macrophage colony-stimulating factor
Abstract
A novel murine gene, designated Hemogen (hemopoietic gene), is sequentially expressed in active hematopoietic sites and downregulated during blood cell differentiation. Hemogen is a nuclear protein. A human homologue of Hemogen, named EDAG, which maps to chromosome 9q22, a leukemia breakpoint, also exhibits specific expression in hematopoietic tissues and cells. Hemogen and EDAG play an important role in hematopoietic development and neoplasms.
Description
- [0001] This invention was funded in part by a grant from the National Heart Lung and Blood Institute (HL 58916-01A1), National Institutes of Health which provides to the United States government certain rights in this invention.
- 1. Field of the Invention
- The present invention, in the field of medicine and molecular biology, is directed to a new nucleic acid encoding a protein that is involved in early stages of hematopoiesis. The gene maps to a chromosomal region rich in translocation breaks associated with human hematological disease, primarily leukemias.
- 2. Description of the Background Art
- References cited herein are listed before the claims.
- Hematopoiesis is a dynamic process with sequential shifting of primary hematopoietic sites from yolk sac to fetal liver and finally bone marrow (BM). During embryonic development, hematopoietic tissues are derived from the ventral mesoderm. The first blood cells, primitive erythrocytes, appear in the blood islands in extraembryonic yolk sac at about embryonic day 7.5 (E7.5) in mice. By E12, fetal liver becomes the predominant site of blood cell formation in the embryo (Dzierzak and Medvinsky, 1995). Just prior to birth and thereafter, hematopoiesis in fetal liver gradually decreases, and BM becomes the major hematopoietic site. In contrast to the human spleen, in mouse spleen, hematopoiesis, particularly erythropoiesis is very active in the red pulp in adulthood (Seifert and Marks, 1985). Hematopoiesis in the yolk sac is termed “primitive” (embryonic) hematopoiesis, and that in the fetal liver and BM is termed “definitive” (adult) hematopoiesis. Mature blood cells of the three main hematopoietic lineages, erythroid, myeloid and lymphoid, are derived from common pluripotent hematopoietic stem cells (Spangrude et al., 1988). The study of hematopoiesis has been facilitated by the identification of a variety of regulatory genes important for hematopoietic induction, lineage selection and blood cell differentiation (Engel and Murre, 1999; Orkin, 1995; Sieweke and Graf, 1998). Mutations or translocations in many of these genes are important in hematopoietic malignancies (Rowley, 1998; Sawyers, 1998).
- Citation of the above documents is not intended as an admission that any of the foregoing is pertinent prior art. All statements as to the date or representation as to the contents of these documents are based on the information available to the applicant and do not constitute any admission as to the correctness of the dates or contents of these documents.
- The present inventors cloned a novel murine gene, designated Hemogen (hemopoietic gene), which was sequentially expressed in active hematopoietic sites and downregulated in the process of blood cell differentiation. Hemogen transcripts were specifically detected in blood islands, primitive blood cells and fetal liver during embryogenesis, and then remained in bone marrow and spleen in adult mice. Immunostaining demonstrated that Hemogen is a nuclear protein.
- The present inventors also discovered a human homologue of Hemogen, named EDAG, which was mapped to chromosome 9q22, a leukemia breakpoint. Like Hemogen, EDAG exhibited specific expression in hematopoietic tissues and cells. Hemogen and EDAG play an important role in hematopoietic development and neoplasms.
- FIG. 1 presents the nucleotide sequence of mouse Hemogen. Numbers on the left refer to the nucleotide sequence (SEQ ID NO:1) (upper) and the deduced amino acid sequence (SEQ ID NO:2) (lower). The first ATG (191-193 nucleotides) defines a 1512 bp ORF with multiple upstream stop codons (upper case and bold) in all three reading frames. The Kozak consensus sequence around ATG (191-193) is underlined. A polyadenylation signal (double-underscored) is 16 nucleotides upstream of the poly(A) tail. The amino acid sequence contains 503 residues with an N-terminal basic domain (underlined) and an acidic domain at the C-terminus (residues 450-480, double underline). The region of residues 34-50 (in brackets) is a predicted coiled-coil domain. This protein also contains a bipartite nuclear localization signal (residues 61-78, underlined italic) that is a basic amino acid cluster.
- FIG. 2 is a photomicrograph showing Hemogen protein localized in cell nuclei. The pcDNA3.1 plasmid with Hemogen-FLAG fusion gene was transfected into COS-7 cells and detected with anti-FLAG antibody. The signal was localized in the cell nuclei (nu) but not in the nucleoli (no).
- FIGS. 3A-3Q are a series of photomicrographs showing expression of Hemogen during mouse embryogenesis by in situ hybridization. Digoxigenin-labeled antisense RNA probes were hybridized with a mouse frontal section at E8.5 (panel A) and sagittal sections at E9.5, E10.5, E11.5, E12.5 and E14.5 (respectively in panels D, F, I, L, and O). The Hemogen transcripts are shown as purple staining. Panels B, E, G, J, M, and P show circulating blood cells in panels A, D, F, I, L and O at high magnification (1000×). The high magnification (1000×) of the blood island is shown in panel C. The high magnification (1000×) of fetal livers is shown in panels H, K, N and Q. Scale bars=1 mm. ao: aorta, bc: blood cell, bi: blood island, lv: liver.
- FIGS. 4A-4B are a set of photomicrographs showing E10.5 and E12.5 blood cells. Embryo sections were stained with hematoxylin to show the distinct morphology of circulating blood cells at these two stages. Panel A: The E10.5 primitive erythrocytes have a higher nucleus/cytoplasm ratio. Panel B: The E12.5 primitive erythrocytes have more condensed nuclei.
- FIGS. 5A-5C show a Northern blot analysis of Hemogen. Total RNA (~15 μg) from each indicated tissue was hybridized with a 32P-labeled antisense RNA probe derived from Hemogen. Panel A: A ~2.4 kb message was detected in spleen and BM by 5 h exposure. Panel B: The same filter was overexposed for 5 days. Besides the signals in BM and spleen, three very weak bands (arrows) were detected in the peripheral blood but not in other tissues. Panel C: The same agarose gel was stained with ethidium bromide before transfer to monitor RNA loading.
- FIGS. 6A-6C are a series of photomicrographs showing expression of Hemogen in adult mouse spleen by in situ hybridization. Panel A: A spleen section was hybridized with the Hemogen sense RNA probes as a negative control. Panel B: By hybridization with antisense RNA probe, Hemogen transcripts were localized in the red pulp (rp) but not in the white pulp (wp). Panel C is a higher magnification (1000×) of the positive-staining cells in the red pulp. Scale bar=1 mm.
- FIG. 7 shows expression analysis of Hemogen by RT-PCR. The PCR products were amplified from the templates as indicated. Lanes containing RT-PCR reactions without reverse transcriptase are labeled RT(−). Histone H3 was used as the internal control.
- FIG. 8 shows the amino acid sequence alignment of Hemogen, EDAG and RP59. The sequences were aligned by ClustalW program. The mouse gene Hemogen (accession # AF269248) shares 70% and 43% identity with a rat gene RP59 (accession # AJ302650) and a human gene EDAG (accession # AF322875) respectively at the amino acid level. The nuclear localization signal (61-78 residues) and coiled-coil domain (34-50 residues) are highly conserved.
- FIGS. 9A-9B show an expression analysis of EDAG in human tissues, cultured cells and cell lines. Panel A: Northern analysis of human tissue RNA blots with 32P-labeled EDAG cDNA probes. Two isoforms, a 2.4 kb major isoform and a 1.8 kb minor isoform, were detected (arrow). Panel B shows expression of EDAG in hematopoietic tissues, cultured cells, cell lines and non-hematopoietic cell lines by RT-PCR. Histone H3 was used as the internal control.
- The present invention provides a new nuclear protein, Hemogen in mice, EDAG in humans (the names are used interchangeably herein), which shows a spatial-temporal expression pattern corresponding to the ontogeny of hematopoiesis. Its expression is strictly localized to hematopoietic tissues from embryonic stages through adulthood. Hemogen is differentially expressed in immature hematopoietic progenitor cells but downregulated in mature nucleated blood cells. A close correlation exists between Hemogen expression and hematopoiesis.
- The nucleic acid sequence (SEQ ID NO:1) and deduced amino acid sequence (SEQ ID NO:2) of Hemogen are shown in FIG. 1.
- The nucleotide sequence (SEQ ID NO:3) and amino acid sequence (SEQ ID NO:4) of human EDAG are shown below. The nucleotide sequence also includes 5′ and 3′ flanking non-coding sequence. Only the nucleotide sequence is numbered.
1 gttatgaagataggtactg
20 tgggtgttagaaagattcacggcaaaacagggaagcatctaggct
65 gcttgtggaagtcagaccaaaatagcaggaaggtattgcagcaag
110 atggatttgggaaaggaccaatctcatttgaagcaccatcagaca
M D L G K D Q S H L K H H Q T
155 cctgaccctcatcaagaagagaaccattctccagaagtcattgga
P D P H Q E E N H S P E V I G
200 acctggagtttgagaaacagagaactacttagaaaaagaaaagct
T W S L R N R E L L R K R K A
245 gaagtgcatgaaaaggaaacatcacaatggctatttggagaacag
E V H E K E T S Q W L F G E Q
290 aaaaaacgcaagcagcagagaacaggaaaaggaaatcgaagaggc
K K R K Q Q R T G K G N R R G
335 agaaagagacaacaaaacacagaattgaaggtggagcctcagcca
R K R Q Q N T E L K V E P Q P
380 cagatagaaaaggaaatagtggagaaagcactggcacctatagag
Q I E K E I V E K A L A P I E
425 aaaaaaactgagccacctgggagcataaccaaagtatttccttca
K K T E P P G S I T K V F P S
470 gtagcctccccgcaaaaagttgtgcctgaggaacacttttctgaa
V A S P Q K V V P E E H F S E
515 atatgtcaagaaagtaacatatatcaggagaatttttctgagtac
I C Q E S N I Y Q E N F S E Y
560 caagaaatagcagtacaaaaccattcttctgaaacatgccaacat
Q E I A V Q N H S S E T C Q H
605 gtgtctgaacctgaagacctctctcctaaaatgtaccaagaaata
V S E P E D L S P K M Y Q E I
650 tctgtacttcaagacaattcttccaaaatatgccaagacatgaag
S V L Q D N S S K I C Q D M K
695 gaacctgaagacaactctcctaacacatgccaagtaatatctgta
E P E D N S P N T C Q V I S V
740 attcaagaccatcctttcaaaatgtaccaagatatggctaaacga
I Q D H P F K M Y Q D M A K R
785 gaagatctggctcctaaaatgtgccaagaagctgctgtacccaaa
E D L A P K M C Q E A A V P K
830 atccttccttgtccaacatctgaagacacagctgatctggcagga
I L P C P T S E D T A D L A G
875 tgctctcttcaagcatatccaaaaccagatgtgcctaaaggctat
C S L Q A Y P K P D V P K G Y
920 attcttgacacagaccaaaatccagcagaaccagaggaatacaat
I L D T D Q N P A E P E E Y N
965 gaaacagatcaaggaatagctgagacagaaggcctttttcctaaa
E T D Q G I A E T E G L F P K
1010 atacaagaaatagctgagcctaaagacctttctacaaaaacacac
I Q E I A E P K D L S T K T H
1055 caagaatcagctgaacctaaataccttcctcataaaacatgtaac
Q E S A E P K Y L P H K T C N
1100 gaaattattgtgcctaaagccccctctcataaaacaatccaagaa
E I I V P K A P S H K T I Q E
1145 acacctcattctgaagactattcaattgaaataaaccaagaaact
T P H S E D Y S I E I N Q E T
1190 cctgggtctgaaaaatattcacctgaaacgtatcaagaaatacct
P G S E K Y S P E T Y Q E I P
1235 gggcttgaagaatattcacctgaaatataccaagaaacatcccag
G L E E Y S P E I Y Q E T S Q
1280 cttgaagaatattcacctgaaatataccaagaaacaccggggcct
L E E Y S P E I Y Q E T P G P
1325 gaagacctctctactgagacatataaaaataaggatgtgcctaaa
E D L S T E T Y K N K D V P K
1370 gaatgctttccagaaccacaccaagaaacaggtgggccccaaggc
E C F P E P H Q E T G G P Q G
1415 caggatcctaaagcacaccaggaagatgctaaagatgcttatact
Q D P K A H Q E D A K D A Y T
1460 tttcctcaagaaatgaaagaaaaacccaaagaagagccaggaata
F P Q E M K E K P K E E P G I
1505 ccagcaattctgaatgagagtcatccagaaaatgatgtctatagt
P A I L N E S H P E N D V Y S
1550 tatgttttgttttaacaatgctcaaccataaagttgtggtccaat
Y V L F *
1595 ggaaaaaaaaaaaaaaaaaaaaaa
- The coding sequence of human EDAG (SEQ ID NO:5) is shown below. It is a fragment of SEQ ID NO:3 and includes 1452 nucleotides; the stop codon is shown in brackets at the end.
atggatttgg gaaaggacca atctcatttg aagcaccatc 40
agacacctga ccctcatcaa gaagagaacc attctccaga agtcattgga 90
acctggagtt tgagaaacag agaactactt agaaaaagaa aagctgaagt 140
gcatgaaaag gaaacatcac aatggctatt tggagaacag aaaaaacgca 190
agcagcagag aacaggaaaa ggaaatcgaa gaggcagaaa gagacaacaa 240
aacacagaat tgaaggtgga gcctcagcca cagatagaaa aggaaatagt 290
ggagaaagca ctggcaccta tagagaaaaa aactgagcca cctgggagca 340
taaccaaagt atttccttca gtagcctccc cgcaaaaagt tgtgcctgag 390
gaacactttt ctgaaatatg tcaagaaagt aacatatatc aggagaattt 440
ttctgagtac caagaaatag cagtacaaaa ccattcttct gaaacatgcc 490
aacatgtgtc tgaacctgaa gacctctctc ctaaaatgta ccaagaaata 540
tctgtacttc aagacaattc ttccaaaata tgccaagaca tgaaggaacc 590
tgaagacaac tctcctaaca catgccaagt aatatctgta attcaagacc 640
atcctttcaa aatgtaccaa gatatggcta aacgagaaga tctggctcct 690
aaaatgtgcc aagaagctgc tgtacccaaa atccttcctt gtccaacatc 740
tgaagacaca gctgatctgg caggatgctc tcttcaagca tatccaaaac 790
cagatgtgcc taaaggctat attcttgaca cagaccaaaa tccagcagaa 840
ccagaggaat acaatgaaac agatcaagga atagctgaga cagaaggcct 890
ttttcctaaa atacaagaaa tagctgagcc taaagacctt tctacaaaaa 940
cacaccaaga atcagctgaa cctaaatacc ttcctcataa aacatgtaac 990
gaaattattg tgcctaaagc cccctctcat aaaacaatcc aagaaacacc 1040
tcattctgaa gactattcaa ttgaaataaa ccaagaaact cctgggtctg 1090
aaaaatattc acctgaaacg tatcaagaaa tacctgggct tgaagaatat 1140
tcacctgaaa tataccaaga aacatcccag cttgaagaat attcacctga 1190
aatataccaa gaaacaccgg ggcctgaaga cctctctact gagacatata 1240
aaaataagga tgtgcctaaa gaatgctttc cagaaccaca ccaagaaaca 1290
ggtgggcccc aaggccagga tcctaaagca caccaggaag atgctaaaga 1340
tgcttatact tttcctcaag aaatgaaaga aaaacccaaa gaagagccag 1390
gaataccagc aattctgaat gagagtcatc cagaaaatga tgtctatagt 1440
tatgttttgt tt [taa] 1452
- The amino acid sequence of human EDAG (SEQ ID NO:4), 484 residues, is shown separately below.
MDLGKDQSHL KHHQTPDPHQ EENHSPEVIG TWSLRNRELL 40
RKRKAEVHEK ETSQWLFGEQ KKRKQQRTGK GNRRGRKRQQ NTELKVEPQP 90
QIEKEIVEKA LAPIEKKTEP PGSITKVFPS VASPQKVVPE EHFSEICQES 140
NIYQENFSEY QEIAVQNHSS ETCQHVSEPE DLSPKMYQEI SVLQDNSSKI 190
CQDMKEPEDN SPNTCQVISV IQDHPFKMYQ DMAKREDLAP KMCQEAAVPK 240
ILPCPTSEDT ADLAGCSLQA YPKPDVPKGY ILDTDQNPAE PEEYNETDQG 290
IAETEGLFPK IQEIAEPKDL STKTHQESAE PKYLPHKTCN EIIVPKAPSH 340
KTIQETPHSE DYSIEINQET PGSEKYSPET YQEIPGLEEY SPEIYQETSQ 390
LEEYSPEIYQ ETPGPEDLST ETYKNKDVPK ECFPEPHQET GGPQGQDPKA 440
HQEDAKDAYT FPQEMKEKPK EEPGIPAILN ESHPENDVYS YVLF 484
- EDAG, also specifically expressed in hematopoietic cells, maps to chromosome 9q22, a region containing breakpoints (for translocation) present in several hematopoietic neoplasms.
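The relationship between the EDAG coding sequence (SEQ ID NO:5) and the EDAG protein (SEQ ID NO:4) given above can be checked by conceptual translation. The following is a minimal sketch using the standard genetic code; the function and variable names are hypothetical, and it assumes the input starts in frame at the ATG and contains only A, C, G and T.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace(" ", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

For example, the first thirty nucleotides of the coding sequence, `atggatttgggaaaggaccaatctcatttg`, translate to `MDLGKDQSHL`, the first ten residues of the protein.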
- The invention is also directed to an isolated nucleic acid molecule that hybridizes with any of the above nucleic acid molecules under stringent hybridization conditions. Preferred stringent conditions include incubation in 6×sodium chloride/sodium citrate (SSC) at about 45° C., followed by a wash in about 0.2×SSC at a temperature of about 50° C.
- A preferred nucleic acid molecule as above encodes a protein having an amino acid sequence selected from SEQ ID NO:2 and SEQ ID NO:4 or encodes a biologically active fragment, homologue or other functional derivative of the protein. Preferably, the nucleic acid molecule encodes the protein having the sequence SEQ ID NO:4 (EDAG of human origin) or encodes the biologically active fragment, homologue or other functional derivative of SEQ ID NO:4.
- The present invention includes an “isolated” Hemogen or EDAG polypeptide having the sequence SEQ ID NO:2 or SEQ ID NO:4. While the present disclosure exemplifies the full length human and murine proteins (and DNA), it is to be understood that homologues of EDAG from other mammalian species and mutants thereof that possess the characteristics disclosed herein are intended within the scope of this invention.
- Also included is a “functional derivative” of EDAG, which means an amino acid substitution variant, a “fragment,” or a “chemical derivative” of EDAG, which terms are defined below. A functional derivative retains measurable EDAG activity, preferably that of binding to an anti-EDAG antibody, or expression in hematopoietic cells, preferably progenitors, which permits its utility in accordance with the present invention. “Functional derivatives” encompass “variants” and “fragments” regardless of whether the terms are used in the conjunctive or the alternative herein.
- A functional homologue must possess the above biochemical and biological activity. In view of this functional characterization, use of homologous EDAG proteins from other species, including proteins not yet discovered, falls within the scope of the invention if these proteins have sequence similarity and the recited biochemical and biological activity.
- To determine the percent identity of two amino acid sequences or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). In a preferred method of alignment, Cys residues are aligned.
- In a preferred embodiment, the length of a sequence being compared is at least 30%, preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, and even more preferably at least 70%, 80%, or 90% of the length of the reference sequence. The amino acid residues (or nucleotides) at corresponding amino acid (or nucleotide) positions are then compared. When a position in the first sequence is occupied by the same amino acid residue (or nucleotide) as the corresponding position in the second sequence, then the molecules are identical at that position (as used herein amino acid or nucleic acid “identity” is equivalent to amino acid or nucleic acid “homology”). The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences.
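- As an illustration only, the position-by-position comparison described above can be sketched in Python for two sequences that have already been aligned. The function name `percent_identity`, the use of `-` as the gap character, and the choice to count gap columns in the denominator are assumptions made for this sketch; the text does not prescribe a particular formula.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the aligned columns of two pre-aligned sequences.

    Assumption for this sketch: a column containing a gap in one sequence
    counts as a non-identical position, so gaps lower the percent identity.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")

    # Ignore columns where both rows are gaps; they carry no information.
    columns = [
        (a, b)
        for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        return 0.0

    # A position is identical when both rows hold the same residue.
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Toy alignment (not taken from the sequence listing).
    print(percent_identity("MDLGK-DQ", "MDLGRADQ"))
```

Other conventions, such as dividing by the length of the shorter sequence, give different values for the same alignment, so the convention should be stated whenever a threshold is applied.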
- The comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm. In a preferred embodiment, the percent identity between two amino acid sequences is determined using the Needleman and Wunsch (J. Mol. Biol. 48:444-453 (1970)) algorithm which has been incorporated into the GAP program in the GCG software package (available at http://www.gcg.com), using either a BLOSUM62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent identity between two nucleotide sequences is determined using the GAP program in the GCG software package (available at http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. In another embodiment, the percent identity between two amino acid or nucleotide sequences is determined using the algorithm of E. Meyers and W. Miller (CABIOS, 4:11-17 (1989)) which has been incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4.
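- For readers unfamiliar with global alignment, the following is a minimal Python sketch of the Needleman and Wunsch dynamic-programming approach. It is deliberately simplified and is not the GAP program: it assumes a flat match/mismatch score instead of a substitution matrix and a single linear gap penalty instead of separate gap and length weights. The function name and default scores are hypothetical.

```python
def needleman_wunsch(a: str, b: str, match: int = 1,
                     mismatch: int = -1, gap: int = -2):
    """Global alignment of a and b; returns (score, aligned_a, aligned_b)."""
    n, m = len(a), len(b)

    # score[i][j] = best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    def sub(i: int, j: int) -> int:
        # Substitution score for pairing a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # pair two residues
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1

    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    print(needleman_wunsch("MDLGK", "MDGK"))
```

With a substitution matrix and affine gap weights the recurrence has the same shape, but the resulting alignments and percent identities will generally differ from those of this toy scoring scheme.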
- The nucleic acid and protein sequences of the present invention can further be used as a “query sequence” to perform a search against public databases, for example, to identify other family members or related sequences. Such searches can be performed using the NBLAST and XBLAST programs (version 2.0) of Altschul et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be performed with the NBLAST program, score=100, wordlength=12 to obtain nucleotide sequences homologous to human or murine EDAG nucleic acid molecules. BLAST protein searches can be performed with the XBLAST program, score=50, wordlength=3 to obtain amino acid sequences homologous to human or murine EDAG protein molecules of the invention. To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See http://www.ncbi.nlm.nih.gov.
- Thus, a homologue of the EDAG protein described above is characterized as having (a) functional activity of native EDAG, and (b) sequence similarity to a native EDAG protein (such as SEQ ID NO:2 or SEQ ID NO:4), when determined as above, of at least about 30% (at the amino acid level), preferably at least about 50%, more preferably at least about 70%, even more preferably at least about 90%.
- It is within the skill in the art to obtain and express such a protein using DNA probes based on the disclosed sequences of EDAG. Then, the protein's biochemical and biological activity can be tested readily using art-recognized methods such as those described herein.
- Also provided is an expression vector comprising any of the above nucleic acid molecules operatively linked to
- (a) a promoter and
- (b) optionally, additional regulatory sequences that regulate expression of the nucleic acid in a eukaryotic cell.
- The above expression vector may be a plasmid or a viral vector. These vectors include self-replicating RNA replicons (DNA-launched or RNA), suicide RNA vectors, DNA viruses (such as adenovirus, vaccinia virus, etc.) and RNA virions grown on packaging cell lines.
- The vector DNA or RNA may be complexed to gold particles for gene gun-mediated introduction to a host or complexed with other polymers, for example, in controlled release formulations, that enhance delivery to the desired target cells and tissues.
- This invention includes a cell transformed or transfected with any of the above nucleic acid molecules or expression vectors. The cell is preferably a eukaryotic cell, more preferably a mammalian cell, most preferably a human cell. The cell may be a hematopoietic cell, preferably a progenitor cell. In another embodiment, the cell is a tumor cell.
- A preferred embodiment is an isolated mammalian tumor cell transfected with an exogenous nucleic acid molecule encoding a mammalian EDAG (preferably SEQ ID NO:2 or SEQ ID NO:4) or a biologically active fragment, homologue or other functional derivative thereof, wherein the EDAG is expressed in the cells.
- Also provided is an EDAG fusion polypeptide having a first fusion partner comprising all or a part of an EDAG protein fused
- (i) directly to a second polypeptide or,
- (ii) optionally, fused to a linker peptide sequence that is fused to the second polypeptide.
- The above EDAG fusion protein may also be fused to a second polypeptide, preferably one or more domains of an Ig heavy chain constant region, preferably having an amino acid sequence corresponding to the hinge, CH2 and CH3 regions of a human immunoglobulin Cγ1 chain.
- The present invention includes antibodies specific to Hemogen and to EDAG, produced by conventional methods. These include polyclonal antisera and monoclonal antibodies. Also included are antigen-binding fragments of these antibodies. The antibodies may be used to detect the presence or measure the amount of Hemogen or EDAG protein in a tissue sample or biological fluid in any conventional immunoassay. The antibodies may therefore be used in diagnostic assays for abnormalities associated with altered expression of these proteins in humans, mice or in the homologues of these proteins in other species.
- Thus, one embodiment includes an antibody that is specific for an epitope of an EDAG protein. The epitope may be a linear or conformational epitope of a polypeptide of SEQ ID NO:2 or SEQ ID NO:4. The antibody is preferably a monoclonal antibody, more preferably a human or humanized (via engineering) monoclonal antibody.
- Also provided is a method of using the above antibody to identify or quantitate cells expressing an EDAG polypeptide in a cell population, comprising
- (a) contacting cells of the population with the above antibody so that the antibody binds to cells expressing the epitope;
- (b) assessing the presence of or quantitating the number of cells to which the antibody is bound.
- Another method is provided for isolating cells expressing an EDAG polypeptide from a cell population, comprising
- (a) contacting the population with the above antibody so that the antibody binds to cells expressing the epitope;
- (b) positively selecting cells to which the antibody has bound or negatively selecting cells to which the antibody has not bound.
- Also provided is a method of detecting the presence of, or quantitating, an EDAG polypeptide, fragment or homologue in a sample, comprising the steps of:
- (a) contacting the sample with the above antibody such that the antibody binds to any polypeptides or fragments bearing the epitope;
- (b) detecting the presence of, or quantitating the polypeptides or fragments bound to the antibody.
- The nucleic acid encoding EDAG and the EDAG protein are used in diagnostic methods and kits for evaluating abnormalities in early hematopoiesis such as those that lead to cancer. These molecules are also useful in drug screening for potential agents that stimulate or inhibit early hematopoietic differentiation and may contribute to the inhibition of leukemogenesis or lymphomagenesis. Finally, the protein, biologically active fragments thereof, or antibodies to the protein, are useful as therapeutic agents to treat diseases associated with abnormal early hematopoietic differentiation such as certain forms of leukemia.
- Cloning and Sequencing of the Mouse Hemogen and the Human EDAG
- We have used PCR-based cDNA subtraction to identify differentially regulated genes during mouse cardiac development. Heart tissue, which contains some blood cells, was dissected from mouse embryos at E10.5 and E16.5, and the RNA was extracted. With E10.5 RNA as the tester and E16.5 RNA as the driver, subtraction hybridization was performed using the PCR-Select cDNA subtraction kit (Clontech). The subtracted clones were then sequenced in an automated DNA sequencer. The sequences were searched against GenBank. One clone, 6B2, shared homology with several mouse expressed sequence tags (ESTs). There was no match in the non-redundant database. Three EST clones (accession # AA051237, AI121196 and AI006512) were purchased from Research Genetics and sequenced in both directions to obtain the full-length cDNA, now designated Hemogen, as shown in FIG. 1. The open reading frame (ORF) was deduced. Additional cDNA clones were also identified through screening the E10 heart cDNA library (Stratagene).
- Hemogen also shared homology with several human ESTs. Two human EST clones (accession # T52254 and AA393302) were sequenced and the ORF was deduced. During the process of our study, three human homologous sequences, including two draft human genomic sequences (accession # AC015928 and AL354726) and a human cDNA with hypothetical protein EDAG-1 (accession # AF228713), were deposited in GenBank. The putative ORF that we defined and named “EDAG” shows an additional 175 amino acids at the N-terminus of the previously deposited EDAG-1 sequence. Recently, a rat homologue, RP59 (GenBank # AJ302650), was deposited. The amino acid sequences of Hemogen, EDAG and RP59 were aligned (FIG. 8) using the ClustalW program (http://www2.ebi.ac.uk/clustalw/).
- The Hemogen and EDAG sequences have been deposited in GenBank and assigned accession numbers AF269248 and AF322875 respectively.
- Plasmid Constructs
- Mammalian expression vector pcDNA3.1 (Invitrogen) was modified by inserting a FLAG tag into the multiple cloning sites to express the FLAG-fusion protein. This plasmid was re-named pcDNA3.1-FLAG. To generate a mammalian expression vector of Hemogen, the ORF (191-1699 bp) was cloned into pcDNA 3.1-FLAG. The resulting construct, named pcDNA3.1-Hemogen, was sequenced to confirm that the Hemogen ORF was fused in-frame with the FLAG tag at the C-terminus.
- Immunostaining
- COS-7 cells were cultured in Dulbecco's modified Eagle medium with 10% fetal bovine serum (Gibco BRL). Using Lipofectamine Plus™ reagent (Gibco BRL), 1 μg plasmid DNA of pcDNA3.1-Hemogen was transfected into COS-7 cells to express Hemogen-FLAG fusion protein. In parallel, the same amount of pcDNA3.1-FLAG vector was transfected as a negative control. For immunostaining, 24 hours after transfection cells were fixed in 4% paraformaldehyde for 4 minutes. The endogenous peroxidase was quenched with 0.3% H2O2 in methanol. After the treatment with 2% blocking serum, the cells were incubated with the mouse anti-FLAG M2 monoclonal antibody (Sigma), and then, after washing, with the anti-mouse IgG antibody conjugated with peroxidase (Vector). The signals were detected through the reaction of peroxidase with the substrate DAB (Roche).
- In vitro Transcription
- RNAs synthesized by in vitro transcription were used as the riboprobes for Northern and in situ hybridization. A 224 bp EcoRI-ApaI fragment of Hemogen cDNA was cloned into pBluescript-SKII vector to produce riboprobe. The conditions for in vitro transcription were the following: 1 μg linearized DNA, 1× transcription buffer (Roche), 10 mM DTT, 2 μl DIG RNA labeling mix (Roche), 20 units RNase inhibitor, 10 units RNA polymerase in 20 μl volume. The reaction was incubated at 37° C. for 2 hours and then digested with 2 units RNase-free DNaseI at 37° C. for 15 minutes to remove the template DNA.
- In situ Hybridization
- Mouse embryos and tissues were fixed in 4% paraformaldehyde overnight at 4° C., embedded in paraffin and sectioned. In situ hybridization was carried out using a modification of a previously described method (Wilkinson, 1992). Tissue sections were pretreated with 0.2N HCl for 20 minutes, 10 μg/ml proteinase K for 15 minutes at 37° C., 4% formaldehyde for 20 minutes, and 0.5% acetic anhydride in 0.1M TEA (pH 8.0) for 10 minutes. The prehybridization was performed in the hybridization buffer (50% formamide, 5×SSC, pH 4.5, 2% blocking reagent (Roche), 0.1% Tween 20, 0.5% CHAPS, 50 μg/ml yeast RNA, 5 mM EDTA, 50 μg/ml heparin) at 60-65° C. for 1 hour. The hybridization was done with 1 μg/ml digoxigenin-labeled RNA probes at 60-65° C. overnight. Non-specific reactants were removed by three 15-minute washes in 50% formamide, 2×SSC and 0.1% CHAPS at 60-65° C. The samples were incubated with alkaline phosphatase-conjugated anti-digoxigenin antibody (Roche) overnight at 4° C., and then washed to remove the non-specific binding. Signals were developed with the substrate NBT/BCIP (Roche).
- RNA Isolation and Northern Analysis
- RNAs were extracted from tissues or cells with TRIzol reagent (Gibco BRL), and fractionated in 1.2% agarose-formaldehyde gel. The Northern blots were performed by standard methods (Sambrook et al., 1989). Human tissue blot was purchased from Clontech.
- Reverse Transcription Polymerase Chain Reaction (RT-PCR)
- First-strand cDNA was synthesized from 1-5 μg of total RNA using SuperScript™ reverse transcriptase (Gibco BRL) in a 20 μl reaction. The PCR reaction was performed with Taq DNA polymerase (Gibco BRL) for 30 cycles.
- Primer pairs were:
Hemogen: 5′-AAACACACCTCTCTCCTACCAC-3′ (SEQ ID NO:6) and 5′-CCTACTTTCTGGGCTCCTTCTG-3′ (SEQ ID NO:7).
EDAG: 5′-AAGCACCATCAGACACCTGACC-3′ (SEQ ID NO:8) and 5′-TGCTTGAAGAGAGCATCCTGCC-3′ (SEQ ID NO:9).
Histone H3: 5′-CCACTGAACTTCTGATTCGC-3′ (SEQ ID NO:10) and 5′-GGGTGCTAGCTGGATGTCTT-3′ (SEQ ID NO:11).
- The PCR products of Hemogen, EDAG and Histone H3 are 881 bp, 751 bp and 214 bp, respectively.
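- The relationship between a primer pair and the expected product size can be illustrated with a small in-silico PCR sketch in Python. The forward primer is searched on the given strand and the reverse primer is searched as its reverse complement; the product length spans both primer sites. The function names and the toy template below are hypothetical and are not the Hemogen or EDAG cDNA; exact matching is assumed, with no mismatches tolerated.

```python
from typing import Optional

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def amplicon_length(template: str, forward: str, reverse: str) -> Optional[int]:
    """Length of the product amplified from template, or None if a primer is absent.

    Assumptions for this sketch: primers match the template exactly, and the
    first forward site and the first downstream reverse site are used.
    """
    t = template.upper()
    start = t.find(forward.upper())
    if start < 0:
        return None

    # The reverse primer anneals to the given strand as its reverse complement.
    site = reverse_complement(reverse.upper())
    end = t.find(site, start + 1)
    if end < 0:
        return None

    return end + len(site) - start


if __name__ == "__main__":
    # Toy template: 2 nt + forward site (10) + spacer (5) + reverse site (10) + 2 nt.
    toy_template = "TT" + "AAGCACCATC" + "GGGGG" + "GATTACAGGA" + "CC"
    toy_reverse = reverse_complement("GATTACAGGA")
    print(amplicon_length(toy_template, "AAGCACCATC", toy_reverse))
```

Applied to a full-length cDNA, the same calculation gives a predicted band size that can be compared with the product observed on a gel.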
- Expression Analysis of Hemogen and EDAG by RT-PCR
- Total RNA was extracted from a variety of freshly isolated tissues, flow-sorted BM cells, cultured cells and transformed cell lines. Gene expression was analyzed by RT-PCR. The cells were isolated as previously described (Nicholson et al., 2000). In brief, adult mouse BM cells were sorted using different mAbs in a fluorescence activated cell sorter (FACS Vantage, Becton Dickinson). Natural killer cells were generated as previously described (Hirayama et al., 1998) by culturing mouse newborn liver cells for 21 days with 500 units/ml recombinant human IL-2. BM-derived macrophages were obtained as previously described (Li and Chen, 1995). All human tissues were obtained with approval from the Institutional Review Board of Wayne State University.
- Adult BM cells were isolated from fragments of ribs that were removed from patients undergoing thoracic surgery. Young thymus tissue was obtained from children undergoing cardiac surgery. CD34+ cells and monocytes were obtained from the blood of cancer patients undergoing peripheral mobilization for autologous transplant. T cells were obtained from blood, cultured with IL-2 for several weeks and were >99% CD3+. Hematopoietic cell lines K562, U937 and non-hematopoietic cell lines 24SV48, HUVEC and SKBR3 were cultured in RPMI-1640 medium with 10% calf serum.
- We initially performed a PCR-based cDNA subtraction, aiming to identify developmentally regulated genes in mouse E10.5 and E16.5 heart tissues. A number of differentially expressed genes were identified in this screening. Possibly because blood cells were trapped in heart tissue, some hematopoietic genes, such as embryonic ε and βH1 globins, were also cloned.
- One of the clones, 6B2, showed differential expression at E10.5. Based on a search in GenBank, this clone showed sequence similarity and homology with several mouse ESTs, e.g., GenBank accession numbers AA051237, AI121196 and AI006512. These three independent clones were sequenced to obtain the full-length cDNA sequences from which the amino acid sequence was deduced (FIG. 1). This novel gene, now designated Hemogen (hemopoietic gene), encodes 503 amino acids with a calculated molecular weight of 55,043 Da and a pI of 4.84.
- Sequence analysis showed that the first ATG is the translation initiation codon and that the surrounding sequence AAGATGG is consistent with the Kozak consensus sequence (purine at position −3 and G at position +4) (Kozak, 1997). Stop codons exist in all three reading frames upstream of the presumed initiation codon. The polyadenylation signal sequence is ATTAAA (2288-2293 nt), the most common variant of the canonical sequence AATAAA (Graber et al., 1999).
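- The two sequence checks described above can be sketched in Python. The first function inspects positions −3 and +4 relative to an ATG (with the A of ATG numbered +1 and no position 0); the second lists occurrences of the AATAAA and ATTAAA hexamers. The function names are hypothetical, and the sketch tests only the two positions named in the text, not the full Kozak consensus.

```python
PURINES = {"A", "G"}
POLYA_SIGNALS = ("AATAAA", "ATTAAA")


def kozak_context(seq: str, atg_index: int) -> dict:
    """Report the bases at -3 and +4 around the ATG starting at atg_index (0-based)."""
    s = seq.upper()
    if s[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")
    if atg_index < 3 or atg_index + 3 >= len(s):
        raise ValueError("need at least 3 nt upstream and 1 nt downstream")

    minus3 = s[atg_index - 3]  # A of ATG is +1, so -3 is three bases upstream
    plus4 = s[atg_index + 3]   # the base immediately after ATG
    return {
        "minus3": minus3,
        "plus4": plus4,
        "purine_at_minus3": minus3 in PURINES,
        "g_at_plus4": plus4 == "G",
    }


def find_polya_signals(seq: str) -> list:
    """Return (1-based position, hexamer) for each AATAAA or ATTAAA found."""
    s = seq.upper()
    hits = []
    for i in range(len(s) - 5):
        hexamer = s[i:i + 6]
        if hexamer in POLYA_SIGNALS:
            hits.append((i + 1, hexamer))
    return hits


if __name__ == "__main__":
    # The heptamer quoted in the text; the ATG starts at 0-based index 3.
    print(kozak_context("AAGATGG", 3))
    # Toy 3' fragment, not the Hemogen 3' UTR.
    print(find_polya_signals("CCCATTAAAGG"))
```

For the quoted heptamer AAGATGG, position −3 is A (a purine) and position +4 is G, which is the pattern the text describes.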
- The putative Hemogen protein has a basic N-terminal domain (residues 34-78, with a net charge of +15) and an acidic C-terminal domain (residues 450-480, with a net charge of −11). In the basic domain, the region from residues 34-50 is predicted (at a window size of 14) to be a coiled-coil domain (Lupas, 1996), which is implicated in protein polymerization. Residues 61-78 contain a bipartite nuclear localization signal, suggesting that Hemogen is a nuclear protein (Dingwall and Laskey, 1991). Analysis of amino acid composition shows that usages of proline (10.3%), glutamine (8.7%) and glutamic acid (11.9%) are higher than average (Brendel et al., 1992).
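- Domain net charge and amino acid usage of the kind reported above can be estimated with a few lines of Python. This sketch assumes the simplest convention, counting Lys and Arg as +1 and Asp and Glu as −1 and ignoring histidine and the chain termini; the text does not state which convention was used. The function names and the toy peptides are hypothetical.

```python
from collections import Counter


def net_charge(seq: str) -> int:
    """Net charge counting K/R as +1 and D/E as -1 (His and termini ignored)."""
    s = seq.upper()
    positive = s.count("K") + s.count("R")
    negative = s.count("D") + s.count("E")
    return positive - negative


def composition_percent(seq: str) -> dict:
    """Percentage usage of each residue in the sequence."""
    s = seq.upper()
    if not s:
        return {}
    counts = Counter(s)
    return {aa: 100.0 * n / len(s) for aa, n in counts.items()}


if __name__ == "__main__":
    # Toy peptides, not segments of SEQ ID NO:2.
    print(net_charge("KKRDE"))
    print(composition_percent("PPEQ"))
```

To analyze a domain, slice the protein by residue number first, for example `protein[33:78]` for residues 34-78 under 1-based numbering.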
- Hemogen Encodes a Nuclear Protein
- To confirm that the nuclear localization signal in Hemogen protein was functional, we transfected the mammalian expression vector containing the Hemogen-FLAG fusion gene into COS-7 cells, and used anti-FLAG antibody to determine the subcellular localization of this protein. Hemogen was found in cell nuclei but not in nucleoli or cytoplasm (FIG. 2).
- To study the expression pattern of Hemogen during mouse development, we used in situ hybridization and Northern blotting to detect mRNA transcripts. Hemogen was expressed during early embryogenesis at E8.5 in the blood islands of the yolk sac and in the circulating primitive blood cells (FIG. 3A,B,C). The blood islands are the first sites to produce primitive blood cells. Expression in circulating blood cells was highly detectable at E9.5 and E10.5 (FIG. 3D,E,F,G). As liver organogenesis emerged at E10.5, Hemogen expression became detectable in the developing hepatic primordia (FIG. 3F,H). Fetal liver is the primary site of definitive blood cell generation during fetal stages. From E11.5, Hemogen was exclusively expressed in the fetal liver (FIG. 3I, K), while expression in circulating blood cells was downregulated dramatically to undetectable levels (FIG. 3J). The same expression patterns were observed in E12.5 and E14.5 embryos (FIG. 3L,M,N,O,P,Q). Primitive erythrocytes at E10.5 and E12.5 are easily distinguished morphologically by hematoxylin staining. The primitive erythrocytes at E10.5 showed higher nucleus/cytoplasm ratios than those at E12.5 (FIG. 4A,B).
- We examined the tissue distribution of Hemogen in adult mice by Northern blots. A 2.4 kb transcript was specifically expressed in the BM and spleen (FIG. 5A). When the same filter was overexposed, a weak signal was detected in the peripheral blood, and two additional transcripts at 1.1 and 3.7 kb were also identified, with the 1.1 kb band giving the strongest signal (FIG. 5B). This suggested multiple isoforms of Hemogen in peripheral blood cells. No expression was detected in the thymus and various non-hematopoietic tissues including brain, heart, kidney, liver, lung, skeletal muscle and stomach.
- In adult spleen, the red pulp is active in erythropoiesis (while the white pulp is a lymphoid tissue that contains mature B and T lymphocytes (van Ewijk and Nieuwenhuis, 1985)). To determine the localization of Hemogen expression in adult spleen, we performed in situ hybridization. As shown in FIG. 6B, Hemogen was expressed in the red pulp but not in the white pulp. Higher magnification (FIG. 6C) revealed that the positively stained cells were presumably erythroid precursor cells. Hemogen was detected in about 70% of the cells in the red pulp.
- These results demonstrate that Hemogen is expressed in both primitive and definitive hematopoiesis. It is sequentially expressed in the active hematopoietic sites, such as yolk sac, fetal liver, BM and spleen. Hemogen expression is highly specific to the hematopoietic system throughout development since no transcripts were detected in any non-hematopoietic tissues.
- To further investigate which hematopoietic cells express Hemogen, we purified adult mouse BM cells by flow cytometry using monoclonal antibodies and assessed Hemogen expression by RT-PCR. As shown in FIG. 7, Hemogen was primarily expressed in Lineage− blast cells, Lin(lo) c-Kit+ Sca-1+ pluripotent stem cells and CD34+ stem cells. Previous studies have shown that these three cell types (Spangrude et al., 1988, Li and Johnson, 1995, Krause et al., 1996) are enriched in early multipotential stem cells. Low levels of expression were found in cultured macrophages and natural killer cells. However, no expression was detected in freshly isolated CD3+ T cells, B220+ B cells, TER-119+ erythrocytes and GR-1+ granulocytes, which are all differentiated blood cells. Hence, Hemogen is differentially expressed in the hematopoietic progenitor cells and downregulated in differentiated blood cells. This notion is consistent with the observation that Hemogen is expressed in the active hematopoietic sites known to harbor hematopoietic progenitor cells (FIGS. 3,5), whereas expression is diminished in the peripheral blood that contains mainly mature blood cells (FIG. 5).
- To search for human homologues of Hemogen, we identified two human ESTs (accession # T52254 and AA393302) in GenBank. At the time the present study was underway, three human homologous sequences, including a hypothetical protein EDAG-1 (accession # AF228713) and two draft human genomic sequences (accession # AC015928 and AL354726), were deposited in GenBank. According to the GenBank entry, EDAG-1 encodes a 309 amino acid protein with previously unknown function. A BLAST search (Altschul et al., 1997) showed that residues 216-503 of the Hemogen protein shared 42% identity and 55% similarity with the EDAG-1 protein. By sequence analysis, the two draft genomic sequences AC015928 and AL354726 were found to contain the EDAG-1 gene.
- Moreover, from the cDNA sequences of human EST clones T52254 and AA393302, we concluded that this cDNA is an isoform of EDAG-1 cDNA. Another upstream ATG translation initiation codon defined a longer ORF encoding a 484 amino acid polypeptide (FIG. 8). This was designated EDAG to distinguish it from “EDAG-1”, which lacks the N-terminal 175 amino acids. The same ORF of 484 amino acids was also deduced from the genomic sequence.
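- Deducing an ORF from a cDNA, as described above, amounts to locating an initiation codon and translating in frame until a stop codon. The Python sketch below assumes the standard genetic code and simply starts at the first ATG it finds; it does not evaluate Kozak context or compare alternative start sites. The function name is hypothetical. The example input is the first 30 nucleotides of the human EDAG coding sequence as printed in the sequence listing of this document.

```python
from itertools import product

# Standard genetic code, built in TCAG order (first, second, third base).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG up to (not including) the first in-frame stop."""
    s = dna.upper().replace(" ", "")
    start = s.find("ATG")
    if start < 0:
        return ""

    protein = []
    for i in range(start, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nt of the human EDAG coding sequence as listed in this document.
    print(translate_from_first_atg("atggatttgg gaaaggacca atctcatttg"))
```

The translation of these 30 nucleotides, MDLGKDQSHL, matches the first ten residues of SEQ ID NO:4. The length difference stated in the text also follows directly: 484 − 309 = 175 residues.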
- Protein sequence alignment showed overall 43% identity between Hemogen and EDAG (FIG. 8). However, the nuclear localization signal and coiled-coil domain are highly conserved with 94% and 76% similarity respectively, suggesting that EDAG may also be a nuclear protein. When we were preparing this manuscript, a rat homologue, RP59 (accession # AJ302650) was deposited in GenBank. Hemogen has 70% identity with RP59. The nuclear localization signal and coiled-coil domain are almost identical (FIG. 8).
- We searched the human gene-mapping databases to determine the chromosome localization of EDAG. By searching the Map Viewer at NCBI (http://www.ncbi.nlm.nih.gov/genome/guide/), the genomic sequence AC015928 was located in the region 85.1-85.3 Mb of chromosome 9 in the GenBank map. This ~160 kb BAC clone was also found to contain the forkhead box E1 (FOXE1/FKHL15) gene that has been mapped to chromosome 9q22 (Chadwick et al., 1997). Therefore, the genomic clone AC015928, containing FOXE1/FKHL15 and EDAG, is located at the same position, 9q22. Furthermore, by electronic PCR at NCBI (http://www.ncbi.nlm.nih.gov/genome/sts/epcr.cgi), EDAG was found to contain the STS marker SHGC-33415. A search of GeneMap'99 (http://www.ncbi.nlm.nih.gov/genemap/) and GDB database (http://gdbwww.gdb.org/gdb/advancedSearch.html) revealed that the marker SHGC-33415 is on the long arm of chromosome 9 between the markers D9S287 and D9S176, which correspond to the 9q22 region on the cytogenetic ideogram. Therefore the human gene EDAG, homologous with Hemogen, maps to chromosome 9q22, which correlates with breakpoints detected in several human hematological neoplasms, such as acute myeloid leukemia (Mitelman, 1991; Mitelman et al., 1997).
- The hybridization of an EDAG cDNA probe with human tissue RNA blots revealed that EDAG was expressed in the active hematopoietic organs, BM and fetal liver (FIG. 9A). Two isoforms, a 2.4 kb major isoform and a 1.8 kb minor isoform, were detected. No expression was found in the spleen, lymph node, thymus or peripheral blood leukocytes (FIG. 9A). It is noteworthy that in human adults, the spleen and thymus are inactive in hematopoiesis under normal conditions. RT-PCR permitted detection of high levels of transcripts in the myelogenous leukemia cell line K562, K562 stimulated with phorbol myristate acetate (PMA), adult BM and CD34+ progenitor cells. Low levels of expression appeared in the thymus of a child and in cells of the histiocytic lymphoma (macrophage-like) cell line U-937. No expression was detected in cultured blood T cells, monocytes or in other non-hematopoietic cell lines including SV-40 transformed thymus epithelial cell line 24SV48, endothelial cell line HUVEC and breast epithelial cell line SKBR3 (FIG. 9B). Thus, EDAG is specifically expressed in hematopoietic cells, and expression is developmentally regulated.
- Described herein are a novel murine gene Hemogen and its human homologue EDAG that are specifically expressed in hematopoietic tissues. Hemogen exhibited highly specific expression in hematopoiesis throughout development. During primitive hematopoiesis, Hemogen is detectable in blood islands and in circulating primitive erythrocytes. During definitive hematopoiesis, Hemogen is expressed in the fetal liver as early as the time of formation of hepatic primordia at E10.5. After E11.5, expression was limited to the fetal liver. In adult mice, Hemogen was expressed in BM, spleen and very weakly in peripheral blood but not in other tissues. Very few genes exhibit such tissue- and stage-specific expression patterns in the blood system throughout all developmental stages. Given its presence as a nuclear factor, Hemogen is expected to play a regulatory role in hematopoiesis.
- The expression of Hemogen in primitive erythroid cells is of particular interest. During mouse embryogenesis there are two distinct populations of erythrocytes, the primitive (nucleated, yolk sac derived) and definitive (enucleated, fetal liver derived) erythrocytes. Before E11, all erythrocytes in the blood stream are primitive. After E11, the fetal liver begins to generate definitive erythrocytes that gradually replace primitive erythrocytes in the circulation. Primitive erythrocytes usually produce embryonic globins, but begin to synthesize adult globins after E11 (Brotherton et al., 1979). The nuclei of primitive erythrocytes gradually condense during embryonic development. By in situ hybridization, expression of Hemogen in primitive erythrocytes was abundant before E11.5 but was dramatically downregulated after E11.5. This downregulation correlates with primitive erythroid differentiation in terms of morphological and molecular changes. Similar downregulation has been observed in transcription factors EKLF (Southwood et al., 1996) and SCL/tal-1 (Elefanty et al., 1999) that are crucial for erythroid development (Robb et al., 1995; Shivdasani et al., 1995; Nuez et al., 1995; Perkins et al., 1995).
- Evidently, differential gene expression is important in controlling developmental events in hematopoiesis. A variety of genes are down- or up-regulated during cell differentiation. For example, expression of the transcription factors SCL/tal-1, GATA-2 and GATA-1 is extinguished during erythroid differentiation from primitive (CD34+, CD38−) progenitors (Cheng et al., 1996). Like these factors, Hemogen was differentially expressed in immature progenitor cells and downregulated in mature blood cells. The results indicate a role for Hemogen/EDAG in blood cell differentiation.
- EDAG is the human homologue of Hemogen, and similarly, EDAG is closely tied in with hematopoiesis. Based on sequences in GenBank (accession # AF228713), EDAG cDNA was cloned from the fetal liver, an active site of hematopoiesis. Moreover, EDAG was specifically expressed in a variety of hematopoietic cells and tissues but not in non-hematopoietic cells.
- By data mining, we mapped EDAG to chromosome 9q22. This locus is of particular interest since a number of breakpoints associated with human blood diseases have been mapped to this region (see the website http://www.ncbi.nlm.nih.gov/CCAP/mitelsum.cgi), for example, acute myeloid leukemia with deletions del(9)(q22), del(9)(q12q22), del(9)(q13q22) or translocation t(9;10)(q22; q22) (Kao et al., 1986; Mitelman, 1991; Mitelman et al., 1997; Sreekantaiah et al., 1989; Yunis et al., 1984). A genetic disease, familial hemophagocytic lymphohistiocytosis (HPLH1), also maps to 9q21.3-q22 (Ohadi et al., 1999).
- It was suggested that the chromosome region 9q21-q22 contains a cluster of leukemia breakpoints, and genes important for leukemogenesis appear to reside in this region (Sreekantaiah et al., 1989). Now in light of the evidence presented here that EDAG is involved in hematopoiesis, EDAG appears to be a candidate gene for involvement in these diseases.
- In summary, we cloned Hemogen, a novel murine gene expressed in cells and tissues that coincide with active hematopoiesis. We have also discovered its human homologue that is specifically expressed in hematopoietic tissues and maps to leukemia breakpoints of a human chromosome.
- Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W. and Lipman, D. J., 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25, 3389-3402.
- Brendel, V., Bucher, P., Nourbakhsh, I. R., Blaisdell, B. E. and Karlin, S., 1992. Methods and algorithms for statistical analysis of protein sequences. Proc Natl Acad Sci U S A 89, 2002-2006.
- Brotherton, T. W., Chui, D. H., Gauldie, J. and Patterson, M., 1979. Hemoglobin ontogeny during normal mouse fetal development. Proc Natl Acad Sci U S A 76, 2853-2857.
- Chadwick, B. P., Obermayr, F. and Frischauf, A. M., 1997. FKHL15, a new human member of the forkhead gene family located on chromosome 9q22. Genomics 41, 390-396.
- Cheng, T., Shen, H., Giokas, D., Gere, J., Tenen, D. G. and Scadden, D. T., 1996. Temporal mapping of gene expression levels during the differentiation of individual primary hematopoietic cells. Proc Natl Acad Sci U S A 93, 13158-13163.
- Dingwall, C. and Laskey, R. A., 1991. Nuclear targeting sequences—a consensus? Trends Biochem Sci 16, 478-481.
- Dzierzak, E. and Medvinsky, A., 1995. Mouse embryonic hematopoiesis. Trends Genet 11, 359-366.
- Elefanty, A. G., Begley, C. G., Hartley, L., Papaevangeliou, B. and Robb, L., 1999. SCL expression in the mouse embryo detected with a targeted lacZ reporter gene demonstrates its localization to hematopoietic, vascular, and neural tissues. Blood 94, 3754-3763.
- Engel, I. and Murre, C., 1999. Transcription factors in hematopoiesis. Curr Opin Genet Dev 9, 575-579.
- Graber, J. H., Cantor, C. R., Mohr, S. C. and Smith, T. F., 1999. In silico detection of control signals: mRNA 3′-end-processing sequences in diverse species. Proc Natl Acad Sci U S A 96, 14055-14060.
- Hirayama, M., Genyea, C., Brownell, A. and Kaplan, J., 1998. IL-2-activated murine newborn liver NK cells enhance engraftment of hematopoietic stem cells in MHC-mismatched recipients. Bone Marrow Transplant 21, 1245-1252.
- Kao, Y. S., Sartin, B. W., Van Brunt, J. and Hew, A. Y., Jr., 1986. Interstitial 9q deletion (q12q22) in two cases of acute myeloblastic leukemia. Cancer Genet Cytogenet 19, 365-366.
- Kozak, M., 1997. Recognition of AUG and alternative initiator codons is augmented by G in position +4 but is not generally affected by the nucleotides in positions +5 and +6. EMBO J 16, 2482-2492.
- Krause, D. S., Fackler, M. J., Civin, C. I. and May, W. S., 1996. CD34: structure, biology, and clinical utility. Blood 87, 1-13.
- Li, C. L. and Johnson, G. R., 1995. Murine hematopoietic stem and progenitor cells: I. Enrichment and biologic characterization. Blood 85, 1472-1479.
- Li, Y. and Chen, B., 1995. Differential regulation of fyn-associated protein tyrosine kinase activity by macrophage colony-stimulating factor (M-CSF) and granulocyte-macrophage colony-stimulating factor (GM-CSF). J Leukoc Biol 57, 484-490.
- Lupas, A., 1996. Prediction and analysis of coiled-coil structures. Methods Enzymol 266, 513-525.
- Mitelman, F., 1991. Catalog of chromosome aberrations in cancer. Wiley-Liss, New York.
- Mitelman, F., Mertens, F. and Johansson, B., 1997. A breakpoint map of recurrent chromosomal rearrangements in human neoplasia. Nat Genet 15 Spec No, 417-474.
- Nicholson, R. H., Pantano, S., Eliason, J. F., Galy, A., Weiler, S., Kaplan, J., Hughes, M. R. and Ko, M. S., 2000. Phemx, a novel mouse gene expressed in hematopoietic cells maps to the imprinted cluster on distal chromosome 7. Genomics 68, 13-21.
- Nuez, B., Michalovich, D., Bygrave, A., Ploemacher, R. and Grosveld, F., 1995. Defective haematopoiesis in fetal liver resulting from inactivation of the EKLF gene. Nature 375, 316-318.
- Ohadi, M., Lalloz, M. R., Sham, P., Zhao, J., Dearlove, A. M., Shiach, C., Kinsey, S., Rhodes, M. and Layton, D. M., 1999. Localization of a gene for familial hemophagocytic lymphohistiocytosis at chromosome 9q21.3-22 by homozygosity mapping. Am J Hum Genet 64, 165-171.
- Orkin, S. H., 1995. Transcription factors and hematopoietic development. J Biol Chem 270, 4955-4958.
- Perkins, A. C., Sharpe, A. H. and Orkin, S. H., 1995. Lethal beta-thalassaemia in mice lacking the erythroid CACCC-transcription factor EKLF. Nature 375, 318-322.
- Robb, L., Lyons, I., Li, R., Hartley, L., Kontgen, F., Harvey, R. P., Metcalf, D. and Begley, C. G., 1995. Absence of yolk sac hematopoiesis from mice with a targeted disruption of the scl gene. Proc Natl Acad Sci U S A 92, 7075-7079.
- Rowley, J. D., 1998. The critical role of chromosome translocations in human leukemias. Annu Rev Genet 32, 495-519.
- Sambrook, J., Fritsch, E. F. and Maniatis, T., 1989. Molecular cloning: a laboratory manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.
- Sawyers, C. L., 1998. Molecular abnormalities in myeloid leukemias and myelodysplastic syndromes. Leuk Res 22, 1113-1122.
- Seifert, M. F. and Marks, S. C., Jr., 1985. The regulation of hemopoiesis in the spleen. Experientia 41, 192-199.
- Shivdasani, R. A., Mayer, E. L. and Orkin, S. H., 1995. Absence of blood formation in mice lacking the T-cell leukaemia oncoprotein tal-1/SCL. Nature 373, 432-434.
- Sieweke, M. H. and Graf, T., 1998. A transcription factor party during blood cell differentiation. Curr Opin Genet Dev 8, 545-551.
- Southwood, C. M., Downs, K. M. and Bieker, J. J., 1996. Erythroid Kruppel-like factor exhibits an early and sequentially localized pattern of expression during mammalian erythroid ontogeny. Dev Dyn 206, 248-259.
- Spangrude, G. J., Heimfeld, S. and Weissman, I. L., 1988. Purification and characterization of mouse hematopoietic stem cells. Science 241, 58-62.
- Sreekantaiah, C., Baer, M. R., Preisler, H. D. and Sandberg, A. A., 1989. Involvement of bands 9q21-q22 in five cases of acute nonlymphocytic leukemia. Cancer Genet Cytogenet 39, 55-64.
- van Ewijk, W. and Nieuwenhuis, P., 1985. Compartments, domains and migration pathways of lymphoid cells in the splenic pulp. Experientia 41, 199-208.
- Wilkinson, D. G., 1992. In situ hybridization: a practical approach. IRL Press at Oxford University Press, Oxford; New York, The Practical approach series.
- Yunis, J. J., Brunning, R. D., Howe, R. B. and Lobell, M., 1984. High-resolution chromosomes as an independent prognostic indicator in adult acute nonlymphocytic leukemia. N Engl J Med 311, 812-818.
- The references cited above are all incorporated by reference herein, whether specifically incorporated or not.
- Having now fully described this invention, it will be appreciated by those skilled in the art that the same can be performed within a wide range of equivalent parameters, concentrations, and conditions without departing from the spirit and scope of the invention and without undue experimentation.
1 11 1 2331 DNA Mus musculus CDS (191)..(1699) 1 ggaatttgcc cacagctgtg gtctaaccag ataaaacttt taggcgggaa gtgacaagca 60 aacttgaatg tgtgtgtctg tggctttttt ttttcttcat cttcatttga agtgttggtc 120 ttgatatttt caaaaagctt ttaggctgcc tgtgaagtca aagccaatac caagaaggca 180 tcgtggcaag atg gac atg ggg aag ggc cga cct cgt ctg aag ctc ccc 229 Met Asp Met Gly Lys Gly Arg Pro Arg Leu Lys Leu Pro 1 5 10 cag atg cct gaa gct cac cca cag aag tcc tgt gct cca gac atc att 277 Gln Met Pro Glu Ala His Pro Gln Lys Ser Cys Ala Pro Asp Ile Ile 15 20 25 gga tct tgg agt ctg aga aac aga gaa caa ctg agg aag aga aaa gct 325 Gly Ser Trp Ser Leu Arg Asn Arg Glu Gln Leu Arg Lys Arg Lys Ala 30 35 40 45 gag gcc cag ggg agg cag aca tca caa tgg ctc ctt gga gaa cag aaa 373 Glu Ala Gln Gly Arg Gln Thr Ser Gln Trp Leu Leu Gly Glu Gln Lys 50 55 60 aaa cgc aag tat cag aga aca gga aaa gga aat aaa aga ggc cga aag 421 Lys Arg Lys Tyr Gln Arg Thr Gly Lys Gly Asn Lys Arg Gly Arg Lys 65 70 75 aga caa ggg aac gtg gag caa aag gca gag cct tgg tca caa aca gaa 469 Arg Gln Gly Asn Val Glu Gln Lys Ala Glu Pro Trp Ser Gln Thr Glu 80 85 90 agg gaa agg gtg caa gag gta ttg gta tct gct gag gaa gaa acc gag 517 Arg Glu Arg Val Gln Glu Val Leu Val Ser Ala Glu Glu Glu Thr Glu 95 100 105 cac cct ggg aac tct gca act gaa gcc ctc ccc ttg gtc cca tcc ccc 565 His Pro Gly Asn Ser Ala Thr Glu Ala Leu Pro Leu Val Pro Ser Pro 110 115 120 125 aca aaa gct gtg cct gca gat cag tgt tct gaa gca cac caa gaa agc 613 Thr Lys Ala Val Pro Ala Asp Gln Cys Ser Glu Ala His Gln Glu Ser 130 135 140 att caa tgt caa gaa aga gca ata cag aac cat tct caa aca cac ctc 661 Ile Gln Cys Gln Glu Arg Ala Ile Gln Asn His Ser Gln Thr His Leu 145 150 155 tct cct acc aca tgc caa gga ata gca gta ctt caa cat tct cct aaa 709 Ser Pro Thr Thr Cys Gln Gly Ile Ala Val Leu Gln His Ser Pro Lys 160 165 170 atg tgc caa gat atg gcc gaa cct gag gta ttc tct cct aac atg tgc 757 Met Cys Gln Asp Met Ala Glu Pro Glu Val Phe Ser Pro Asn Met Cys 175 180 185 cag gag aca gct gtg ccc caa acc tat cct 
ccc aaa gca ctt gaa gaa 805 Gln Glu Thr Ala Val Pro Gln Thr Tyr Pro Pro Lys Ala Leu Glu Glu 190 195 200 205 atg gct gca gcc gag cca ctc tct cct aaa atg tgc cag gaa aca act 853 Met Ala Ala Ala Glu Pro Leu Ser Pro Lys Met Cys Gln Glu Thr Thr 210 215 220 gtg tcc cca aac cat tct tcc aaa gtg ccc caa gat atg gct gga cct 901 Val Ser Pro Asn His Ser Ser Lys Val Pro Gln Asp Met Ala Gly Pro 225 230 235 gag gct ctc tct cct aac atg tgc cag gaa cca act gtg cct caa gaa 949 Glu Ala Leu Ser Pro Asn Met Cys Gln Glu Pro Thr Val Pro Gln Glu 240 245 250 cat act ttg aaa atg tgc cat gat gtg gcc aga cct gaa gtc ctc tct 997 His Thr Leu Lys Met Cys His Asp Val Ala Arg Pro Glu Val Leu Ser 255 260 265 cct aaa aca cat caa gag atg gct gtt cca aaa gcc ttt ccc tgt gta 1045 Pro Lys Thr His Gln Glu Met Ala Val Pro Lys Ala Phe Pro Cys Val 270 275 280 285 aca cct gga gat gct gct ggc ctg gaa gga tgc gcc cca aaa gcc ctc 1093 Thr Pro Gly Asp Ala Ala Gly Leu Glu Gly Cys Ala Pro Lys Ala Leu 290 295 300 ccc caa tca gat gtc gct gaa ggc tgt cca ctt gac aca acc ccc acg 1141 Pro Gln Ser Asp Val Ala Glu Gly Cys Pro Leu Asp Thr Thr Pro Thr 305 310 315 tca gtc aca cca gaa caa acc act tcc gac cca gat ctg gga atg gct 1189 Ser Val Thr Pro Glu Gln Thr Thr Ser Asp Pro Asp Leu Gly Met Ala 320 325 330 gtg act gaa ggc ttc ttt tct gaa gcc aga gaa tgc act gtt tct gaa 1237 Val Thr Glu Gly Phe Phe Ser Glu Ala Arg Glu Cys Thr Val Ser Glu 335 340 345 ggc gtt tct aca aag aca cac caa gaa gca gtt gaa cct gaa ttc att 1285 Gly Val Ser Thr Lys Thr His Gln Glu Ala Val Glu Pro Glu Phe Ile 350 355 360 365 tct cac gag act tat aaa gaa ttc act gtg cct ata gtt tct tct cag 1333 Ser His Glu Thr Tyr Lys Glu Phe Thr Val Pro Ile Val Ser Ser Gln 370 375 380 aaa aca atc caa gaa tca cct gag cct gaa caa tat tca cct gaa aca 1381 Lys Thr Ile Gln Glu Ser Pro Glu Pro Glu Gln Tyr Ser Pro Glu Thr 385 390 395 tgt caa cca ata cct ggg cct gag aac tat tca ctg gaa acc tgc cat 1429 Cys Gln Pro Ile Pro Gly Pro Glu Asn Tyr Ser Leu Glu Thr Cys His 400 405 410 
gaa atg tcg ggg cct gaa gac ctc tct atc aag acc tgt cag gac agg 1477 Glu Met Ser Gly Pro Glu Asp Leu Ser Ile Lys Thr Cys Gln Asp Arg 415 420 425 gag gag cct aaa cac agc ctt cca gaa gga gcc cag aaa gta ggt ggg 1525 Glu Glu Pro Lys His Ser Leu Pro Glu Gly Ala Gln Lys Val Gly Gly 430 435 440 445 gcc caa ggg cag gac gct gat gca cag gac agc gag aac gct ggt gct 1573 Ala Gln Gly Gln Asp Ala Asp Ala Gln Asp Ser Glu Asn Ala Gly Ala 450 455 460 ttc tct caa gat ttt aca gaa atg gag gaa gaa aac aaa gca gat caa 1621 Phe Ser Gln Asp Phe Thr Glu Met Glu Glu Glu Asn Lys Ala Asp Gln 465 470 475 gat ccg gaa gct cca gca agc cca caa ggt tct caa gag acc tgc cca 1669 Asp Pro Glu Ala Pro Ala Ser Pro Gln Gly Ser Gln Glu Thr Cys Pro 480 485 490 gaa aat ggc atc tac agc tct gct cta ttt taacagtgct cagtgatgga 1719 Glu Asn Gly Ile Tyr Ser Ser Ala Leu Phe 495 500 gctgcagtcc agctcaatac agcatacata tctcttgtgg tttcactgaa acactgcagc 1779 aatcactaaa atttgcattg ctattttaac ttatgctttt ttttctattt gtagctctta 1839 tctaaaagag agaactaaca tttttaaggc tctaacacat agacaatagt gtgtgtgtgt 1899 gtgtgtgtgt gtgtgtgtgt gtgtgccgtg tgagcacctg agggtgtgga tttgtatatg 1959 ggggaagaca gaacagggga aaggttgagt agttgatttt cccctctaag aggaaacata 2019 tatttggtag ttctgaggag aagatagcaa ttcaatatga acacttagtg tttttgaaag 2079 tatacagatt cttgtaagtc ttgtcaacta ttgatgttgt aacaacatca gaattttatt 2139 cgagctttac acgtctctga gttgatctga acaattctta ttctaaaagt tcttgcaaat 2199 tattttggaa ttgataattg tcacttattt ctgtgtgaac ctgaaccttc tatttctatt 2259 ttttaaactg tgtttgtaaa aaatgtacat taaatcatta ctatggtctt aaaaaaaaaa 2319 aaaaaaaaaa aa 2331 2 503 PRT Mus musculus 2 Met Asp Met Gly Lys Gly Arg Pro Arg Leu Lys Leu Pro Gln Met Pro 1 5 10 15 Glu Ala His Pro Gln Lys Ser Cys Ala Pro Asp Ile Ile Gly Ser Trp 20 25 30 Ser Leu Arg Asn Arg Glu Gln Leu Arg Lys Arg Lys Ala Glu Ala Gln 35 40 45 Gly Arg Gln Thr Ser Gln Trp Leu Leu Gly Glu Gln Lys Lys Arg Lys 50 55 60 Tyr Gln Arg Thr Gly Lys Gly Asn Lys Arg Gly Arg Lys Arg Gln Gly 65 70 75 80 Asn Val Glu Gln Lys Ala Glu Pro 
Trp Ser Gln Thr Glu Arg Glu Arg 85 90 95 Val Gln Glu Val Leu Val Ser Ala Glu Glu Glu Thr Glu His Pro Gly 100 105 110 Asn Ser Ala Thr Glu Ala Leu Pro Leu Val Pro Ser Pro Thr Lys Ala 115 120 125 Val Pro Ala Asp Gln Cys Ser Glu Ala His Gln Glu Ser Ile Gln Cys 130 135 140 Gln Glu Arg Ala Ile Gln Asn His Ser Gln Thr His Leu Ser Pro Thr 145 150 155 160 Thr Cys Gln Gly Ile Ala Val Leu Gln His Ser Pro Lys Met Cys Gln 165 170 175 Asp Met Ala Glu Pro Glu Val Phe Ser Pro Asn Met Cys Gln Glu Thr 180 185 190 Ala Val Pro Gln Thr Tyr Pro Pro Lys Ala Leu Glu Glu Met Ala Ala 195 200 205 Ala Glu Pro Leu Ser Pro Lys Met Cys Gln Glu Thr Thr Val Ser Pro 210 215 220 Asn His Ser Ser Lys Val Pro Gln Asp Met Ala Gly Pro Glu Ala Leu 225 230 235 240 Ser Pro Asn Met Cys Gln Glu Pro Thr Val Pro Gln Glu His Thr Leu 245 250 255 Lys Met Cys His Asp Val Ala Arg Pro Glu Val Leu Ser Pro Lys Thr 260 265 270 His Gln Glu Met Ala Val Pro Lys Ala Phe Pro Cys Val Thr Pro Gly 275 280 285 Asp Ala Ala Gly Leu Glu Gly Cys Ala Pro Lys Ala Leu Pro Gln Ser 290 295 300 Asp Val Ala Glu Gly Cys Pro Leu Asp Thr Thr Pro Thr Ser Val Thr 305 310 315 320 Pro Glu Gln Thr Thr Ser Asp Pro Asp Leu Gly Met Ala Val Thr Glu 325 330 335 Gly Phe Phe Ser Glu Ala Arg Glu Cys Thr Val Ser Glu Gly Val Ser 340 345 350 Thr Lys Thr His Gln Glu Ala Val Glu Pro Glu Phe Ile Ser His Glu 355 360 365 Thr Tyr Lys Glu Phe Thr Val Pro Ile Val Ser Ser Gln Lys Thr Ile 370 375 380 Gln Glu Ser Pro Glu Pro Glu Gln Tyr Ser Pro Glu Thr Cys Gln Pro 385 390 395 400 Ile Pro Gly Pro Glu Asn Tyr Ser Leu Glu Thr Cys His Glu Met Ser 405 410 415 Gly Pro Glu Asp Leu Ser Ile Lys Thr Cys Gln Asp Arg Glu Glu Pro 420 425 430 Lys His Ser Leu Pro Glu Gly Ala Gln Lys Val Gly Gly Ala Gln Gly 435 440 445 Gln Asp Ala Asp Ala Gln Asp Ser Glu Asn Ala Gly Ala Phe Ser Gln 450 455 460 Asp Phe Thr Glu Met Glu Glu Glu Asn Lys Ala Asp Gln Asp Pro Glu 465 470 475 480 Ala Pro Ala Ser Pro Gln Gly Ser Gln Glu Thr Cys Pro Glu Asn Gly 485 490 495 Ile Tyr Ser Ser Ala Leu Phe 500 3 1618 
DNA Homo sapiens CDS (110)..(1561) 3 gttatgaaga taggtactgt gggtgttaga aagattcacg gcaaaacagg gaagcatcta 60 ggctgcttgt ggaagtcaga ccaaaatagc aggaaggtat tgcagcaag atg gat ttg 118 Met Asp Leu 1 gga aag gac caa tct cat ttg aag cac cat cag aca cct gac cct cat 166 Gly Lys Asp Gln Ser His Leu Lys His His Gln Thr Pro Asp Pro His 5 10 15 caa gaa gag aac cat tct cca gaa gtc att gga acc tgg agt ttg aga 214 Gln Glu Glu Asn His Ser Pro Glu Val Ile Gly Thr Trp Ser Leu Arg 20 25 30 35 aac aga gaa cta ctt aga aaa aga aaa gct gaa gtg cat gaa aag gaa 262 Asn Arg Glu Leu Leu Arg Lys Arg Lys Ala Glu Val His Glu Lys Glu 40 45 50 aca tca caa tgg cta ttt gga gaa cag aaa aaa cgc aag cag cag aga 310 Thr Ser Gln Trp Leu Phe Gly Glu Gln Lys Lys Arg Lys Gln Gln Arg 55 60 65 aca gga aaa gga aat cga aga ggc aga aag aga caa caa aac aca gaa 358 Thr Gly Lys Gly Asn Arg Arg Gly Arg Lys Arg Gln Gln Asn Thr Glu 70 75 80 ttg aag gtg gag cct cag cca cag ata gaa aag gaa ata gtg gag aaa 406 Leu Lys Val Glu Pro Gln Pro Gln Ile Glu Lys Glu Ile Val Glu Lys 85 90 95 gca ctg gca cct ata gag aaa aaa act gag cca cct ggg agc ata acc 454 Ala Leu Ala Pro Ile Glu Lys Lys Thr Glu Pro Pro Gly Ser Ile Thr 100 105 110 115 aaa gta ttt cct tca gta gcc tcc ccg caa aaa gtt gtg cct gag gaa 502 Lys Val Phe Pro Ser Val Ala Ser Pro Gln Lys Val Val Pro Glu Glu 120 125 130 cac ttt tct gaa ata tgt caa gaa agt aac ata tat cag gag aat ttt 550 His Phe Ser Glu Ile Cys Gln Glu Ser Asn Ile Tyr Gln Glu Asn Phe 135 140 145 tct gag tac caa gaa ata gca gta caa aac cat tct tct gaa aca tgc 598 Ser Glu Tyr Gln Glu Ile Ala Val Gln Asn His Ser Ser Glu Thr Cys 150 155 160 caa cat gtg tct gaa cct gaa gac ctc tct cct aaa atg tac caa gaa 646 Gln His Val Ser Glu Pro Glu Asp Leu Ser Pro Lys Met Tyr Gln Glu 165 170 175 ata tct gta ctt caa gac aat tct tcc aaa ata tgc caa gac atg aag 694 Ile Ser Val Leu Gln Asp Asn Ser Ser Lys Ile Cys Gln Asp Met Lys 180 185 190 195 gaa cct gaa gac aac tct cct aac aca tgc caa gta ata tct gta att 742 Glu Pro Glu Asp Asn 
Ser Pro Asn Thr Cys Gln Val Ile Ser Val Ile 200 205 210 caa gac cat cct ttc aaa atg tac caa gat atg gct aaa cga gaa gat 790 Gln Asp His Pro Phe Lys Met Tyr Gln Asp Met Ala Lys Arg Glu Asp 215 220 225 ctg gct cct aaa atg tgc caa gaa gct gct gta ccc aaa atc ctt cct 838 Leu Ala Pro Lys Met Cys Gln Glu Ala Ala Val Pro Lys Ile Leu Pro 230 235 240 tgt cca aca tct gaa gac aca gct gat ctg gca gga tgc tct ctt caa 886 Cys Pro Thr Ser Glu Asp Thr Ala Asp Leu Ala Gly Cys Ser Leu Gln 245 250 255 gca tat cca aaa cca gat gtg cct aaa ggc tat att ctt gac aca gac 934 Ala Tyr Pro Lys Pro Asp Val Pro Lys Gly Tyr Ile Leu Asp Thr Asp 260 265 270 275 caa aat cca gca gaa cca gag gaa tac aat gaa aca gat caa gga ata 982 Gln Asn Pro Ala Glu Pro Glu Glu Tyr Asn Glu Thr Asp Gln Gly Ile 280 285 290 gct gag aca gaa ggc ctt ttt cct aaa ata caa gaa ata gct gag cct 1030 Ala Glu Thr Glu Gly Leu Phe Pro Lys Ile Gln Glu Ile Ala Glu Pro 295 300 305 aaa gac ctt tct aca aaa aca cac caa gaa tca gct gaa cct aaa tac 1078 Lys Asp Leu Ser Thr Lys Thr His Gln Glu Ser Ala Glu Pro Lys Tyr 310 315 320 ctt cct cat aaa aca tgt aac gaa att att gtg cct aaa gcc ccc tct 1126 Leu Pro His Lys Thr Cys Asn Glu Ile Ile Val Pro Lys Ala Pro Ser 325 330 335 cat aaa aca atc caa gaa aca cct cat tct gaa gac tat tca att gaa 1174 His Lys Thr Ile Gln Glu Thr Pro His Ser Glu Asp Tyr Ser Ile Glu 340 345 350 355 ata aac caa gaa act cct ggg tct gaa aaa tat tca cct gaa acg tat 1222 Ile Asn Gln Glu Thr Pro Gly Ser Glu Lys Tyr Ser Pro Glu Thr Tyr 360 365 370 caa gaa ata cct ggg ctt gaa gaa tat tca cct gaa ata tac caa gaa 1270 Gln Glu Ile Pro Gly Leu Glu Glu Tyr Ser Pro Glu Ile Tyr Gln Glu 375 380 385 aca tcc cag ctt gaa gaa tat tca cct gaa ata tac caa gaa aca ccg 1318 Thr Ser Gln Leu Glu Glu Tyr Ser Pro Glu Ile Tyr Gln Glu Thr Pro 390 395 400 ggg cct gaa gac ctc tct act gag aca tat aaa aat aag gat gtg cct 1366 Gly Pro Glu Asp Leu Ser Thr Glu Thr Tyr Lys Asn Lys Asp Val Pro 405 410 415 aaa gaa tgc ttt cca gaa cca cac caa gaa aca ggt ggg ccc 
caa ggc 1414 Lys Glu Cys Phe Pro Glu Pro His Gln Glu Thr Gly Gly Pro Gln Gly 420 425 430 435 cag gat cct aaa gca cac cag gaa gat gct aaa gat gct tat act ttt 1462 Gln Asp Pro Lys Ala His Gln Glu Asp Ala Lys Asp Ala Tyr Thr Phe 440 445 450 cct caa gaa atg aaa gaa aaa ccc aaa gaa gag cca gga ata cca gca 1510 Pro Gln Glu Met Lys Glu Lys Pro Lys Glu Glu Pro Gly Ile Pro Ala 455 460 465 att ctg aat gag agt cat cca gaa aat gat gtc tat agt tat gtt ttg 1558 Ile Leu Asn Glu Ser His Pro Glu Asn Asp Val Tyr Ser Tyr Val Leu 470 475 480 ttt taacaatgct caaccataaa gttgtggtcc aatggaaaaa aaaaaaaaaa 1611 Phe aaaaaaa 1618 4 484 PRT Homo sapiens 4 Met Asp Leu Gly Lys Asp Gln Ser His Leu Lys His His Gln Thr Pro 1 5 10 15 Asp Pro His Gln Glu Glu Asn His Ser Pro Glu Val Ile Gly Thr Trp 20 25 30 Ser Leu Arg Asn Arg Glu Leu Leu Arg Lys Arg Lys Ala Glu Val His 35 40 45 Glu Lys Glu Thr Ser Gln Trp Leu Phe Gly Glu Gln Lys Lys Arg Lys 50 55 60 Gln Gln Arg Thr Gly Lys Gly Asn Arg Arg Gly Arg Lys Arg Gln Gln 65 70 75 80 Asn Thr Glu Leu Lys Val Glu Pro Gln Pro Gln Ile Glu Lys Glu Ile 85 90 95 Val Glu Lys Ala Leu Ala Pro Ile Glu Lys Lys Thr Glu Pro Pro Gly 100 105 110 Ser Ile Thr Lys Val Phe Pro Ser Val Ala Ser Pro Gln Lys Val Val 115 120 125 Pro Glu Glu His Phe Ser Glu Ile Cys Gln Glu Ser Asn Ile Tyr Gln 130 135 140 Glu Asn Phe Ser Glu Tyr Gln Glu Ile Ala Val Gln Asn His Ser Ser 145 150 155 160 Glu Thr Cys Gln His Val Ser Glu Pro Glu Asp Leu Ser Pro Lys Met 165 170 175 Tyr Gln Glu Ile Ser Val Leu Gln Asp Asn Ser Ser Lys Ile Cys Gln 180 185 190 Asp Met Lys Glu Pro Glu Asp Asn Ser Pro Asn Thr Cys Gln Val Ile 195 200 205 Ser Val Ile Gln Asp His Pro Phe Lys Met Tyr Gln Asp Met Ala Lys 210 215 220 Arg Glu Asp Leu Ala Pro Lys Met Cys Gln Glu Ala Ala Val Pro Lys 225 230 235 240 Ile Leu Pro Cys Pro Thr Ser Glu Asp Thr Ala Asp Leu Ala Gly Cys 245 250 255 Ser Leu Gln Ala Tyr Pro Lys Pro Asp Val Pro Lys Gly Tyr Ile Leu 260 265 270 Asp Thr Asp Gln Asn Pro Ala Glu Pro Glu Glu Tyr Asn Glu Thr Asp 275 280 285 Gln 
Gly Ile Ala Glu Thr Glu Gly Leu Phe Pro Lys Ile Gln Glu Ile 290 295 300 Ala Glu Pro Lys Asp Leu Ser Thr Lys Thr His Gln Glu Ser Ala Glu 305 310 315 320 Pro Lys Tyr Leu Pro His Lys Thr Cys Asn Glu Ile Ile Val Pro Lys 325 330 335 Ala Pro Ser His Lys Thr Ile Gln Glu Thr Pro His Ser Glu Asp Tyr 340 345 350 Ser Ile Glu Ile Asn Gln Glu Thr Pro Gly Ser Glu Lys Tyr Ser Pro 355 360 365 Glu Thr Tyr Gln Glu Ile Pro Gly Leu Glu Glu Tyr Ser Pro Glu Ile 370 375 380 Tyr Gln Glu Thr Ser Gln Leu Glu Glu Tyr Ser Pro Glu Ile Tyr Gln 385 390 395 400 Glu Thr Pro Gly Pro Glu Asp Leu Ser Thr Glu Thr Tyr Lys Asn Lys 405 410 415 Asp Val Pro Lys Glu Cys Phe Pro Glu Pro His Gln Glu Thr Gly Gly 420 425 430 Pro Gln Gly Gln Asp Pro Lys Ala His Gln Glu Asp Ala Lys Asp Ala 435 440 445 Tyr Thr Phe Pro Gln Glu Met Lys Glu Lys Pro Lys Glu Glu Pro Gly 450 455 460 Ile Pro Ala Ile Leu Asn Glu Ser His Pro Glu Asn Asp Val Tyr Ser 465 470 475 480 Tyr Val Leu Phe 5 1455 DNA Homo sapiens 5 atggatttgg gaaaggacca atctcatttg aagcaccatc agacacctga ccctcatcaa 60 gaagagaacc attctccaga agtcattgga acctggagtt tgagaaacag agaactactt 120 agaaaaagaa aagctgaagt gcatgaaaag gaaacatcac aatggctatt tggagaacag 180 aaaaaacgca agcagcagag aacaggaaaa ggaaatcgaa gaggcagaaa gagacaacaa 240 aacacagaat tgaaggtgga gcctcagcca cagatagaaa aggaaatagt ggagaaagca 300 ctggcaccta tagagaaaaa aactgagcca cctgggagca taaccaaagt atttccttca 360 gtagcctccc cgcaaaaagt tgtgcctgag gaacactttt ctgaaatatg tcaagaaagt 420 aacatatatc aggagaattt ttctgagtac caagaaatag cagtacaaaa ccattcttct 480 gaaacatgcc aacatgtgtc tgaacctgaa gacctctctc ctaaaatgta ccaagaaata 540 tctgtacttc aagacaattc ttccaaaata tgccaagaca tgaaggaacc tgaagacaac 600 tctcctaaca catgccaagt aatatctgta attcaagacc atcctttcaa aatgtaccaa 660 gatatggcta aacgagaaga tctggctcct aaaatgtgcc aagaagctgc tgtacccaaa 720 atccttcctt gtccaacatc tgaagacaca gctgatctgg caggatgctc tcttcaagca 780 tatccaaaac cagatgtgcc taaaggctat attcttgaca cagaccaaaa tccagcagaa 840 ccagaggaat acaatgaaac agatcaagga atagctgaga cagaaggcct 
ttttcctaaa 900 atacaagaaa tagctgagcc taaagacctt tctacaaaaa cacaccaaga atcagctgaa 960 cctaaatacc ttcctcataa aacatgtaac gaaattattg tgcctaaagc cccctctcat 1020 aaaacaatcc aagaaacacc tcattctgaa gactattcaa ttgaaataaa ccaagaaact 1080 cctgggtctg aaaaatattc acctgaaacg tatcaagaaa tacctgggct tgaagaatat 1140 tcacctgaaa tataccaaga aacatcccag cttgaagaat attcacctga aatataccaa 1200 gaaacaccgg ggcctgaaga cctctctact gagacatata aaaataagga tgtgcctaaa 1260 gaatgctttc cagaaccaca ccaagaaaca ggtgggcccc aaggccagga tcctaaagca 1320 caccaggaag atgctaaaga tgcttatact tttcctcaag aaatgaaaga aaaacccaaa 1380 gaagagccag gaataccagc aattctgaat gagagtcatc cagaaaatga tgtctatagt 1440 tatgttttgt tttaa 1455 6 22 DNA Artificial Sequence Primer 6 aaacacacct ctctcctacc ac 22 7 22 DNA Artificial Sequence Primer 7 cctactttct gggctccttc tg 22 8 22 DNA Artificial Sequence Primer 8 aagcaccatc agacacctga cc 22 9 22 DNA Artificial Sequence Primer 9 tgcttgaaga gagcatcctg cc 22 10 20 DNA Artificial Sequence Primer 10 ccactgaact tctgattcgc 20 11 20 DNA Artificial Sequence Primer 11 gggtgctagc tggatgtctt 20
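The coordinates given in the sequence listing can be cross-checked with a little arithmetic. The Python sketch below is not part of the application. It assumes the standard genetic code and uses only figures copied from the listing: the CDS spans of SEQ ID NO:1 and SEQ ID NO:3, nucleotides 1-60 and 751-780 of SEQ ID NO:5, and the primers SEQ ID NO:8 and SEQ ID NO:9. The helper names (`translate`, `reverse_complement`, `cds_codons`) are illustrative, and treating SEQ ID NOS:8 and 9 as a primer pair on a SEQ ID NO:5 template is an assumption made here for the amplicon calculation.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a lowercase DNA string codon by codon (one-letter amino acids)."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def reverse_complement(dna):
    """Return the reverse complement of a lowercase DNA string."""
    return dna.translate(str.maketrans("acgt", "tgca"))[::-1]


def cds_codons(start, end):
    """Number of codons in a CDS given 1-based inclusive coordinates."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


# CDS spans as stated in the listing.
mouse_codons = cds_codons(191, 1699)  # SEQ ID NO:1, protein SEQ ID NO:2 has 503 residues
human_codons = cds_codons(110, 1561)  # SEQ ID NO:3, protein SEQ ID NO:4 has 484 residues

# Fragments copied from SEQ ID NO:5 (1-based positions 1-60 and 751-780).
seq5_1_60 = "atggatttgggaaaggaccaatctcatttgaagcaccatcagacacctgaccctcatcaa"
seq5_751_780 = "gctgatctggcaggatgctctcttcaagca"

primer8 = "aagcaccatcagacacctgacc"  # SEQ ID NO:8
primer9 = "tgcttgaagagagcatcctgcc"  # SEQ ID NO:9

# The first 13 codons should reproduce the start of SEQ ID NO:4.
n_terminus = translate(seq5_1_60[:39])

# Assumption: primer 8 is the forward primer and primer 9 the reverse primer.
primer9_site = reverse_complement(primer9)
forward_start = seq5_1_60.find(primer8) + 1
reverse_end = 751 + seq5_751_780.find(primer9_site) + len(primer9) - 1
amplicon_bp = reverse_end - forward_start + 1

print(mouse_codons, human_codons, n_terminus, forward_start, reverse_end, amplicon_bp)
```

Under these assumptions, the codon counts agree with the stated protein lengths (503 and 484 residues), which also implies that the CDS spans exclude the stop codon. The 1455-nucleotide SEQ ID NO:5 corresponds to the 484 codons plus a terminal stop codon.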
Claims (19)
1. An isolated nucleic acid molecule that encodes a mammalian protein Hemogen/EDAG that is selectively expressed in developing or immature hematopoietic cells.
2. The nucleic acid molecule of claim 1 that comprises a nucleotide sequence selected from SEQ ID NO:1 or SEQ ID NO:5.
3. An isolated nucleic acid molecule that hybridizes with the nucleic acid molecule of claim 1 under stringent hybridization conditions.
4. An isolated nucleic acid molecule that hybridizes with the nucleic acid molecule of claim 2 under stringent hybridization conditions.
5. The nucleic acid molecule of claim 2 that comprises the nucleotide sequence SEQ ID NO:5.
6. The nucleic acid molecule of claim 1 that encodes a protein having an amino acid sequence selected from SEQ ID NO:2 and SEQ ID NO:4 or encodes a biologically active fragment, homologue or other functional derivative of said protein.
7. The nucleic acid molecule of claim 6 that encodes said protein having the sequence SEQ ID NO:4 or encodes said biologically active fragment, homologue or other functional derivative of SEQ ID NO:4.
8. An expression vector comprising the nucleic acid of claim 1 operatively linked to
(a) a promoter and
(b) optionally, additional regulatory sequences that regulate expression of said nucleic acid in a eukaryotic cell.
9. An expression vector comprising the nucleic acid of any of claims 2-7, operatively linked to
(a) a promoter and
(b) optionally, additional regulatory sequences that regulate expression of said nucleic acid in a eukaryotic cell.
10. A cell transformed or transfected with the vector of claim 8.
11. A cell transformed or transfected with the vector of claim 9.
12. A polypeptide that is selectively expressed in developing or immature hematopoietic cells, encoded by a nucleic acid molecule having the sequence SEQ ID NO:1 or SEQ ID NO:5, or a fragment, homologue or functional derivative of said polypeptide.
13. The polypeptide of claim 12 having the amino acid sequence SEQ ID NO:2 or SEQ ID NO:4, or a fragment, homologue or equivalent of said polypeptide.
14. An antibody that is specific for an epitope of the polypeptide of claim 12.
15. An antibody that is specific for an epitope of the polypeptide of claim 13.
16. The antibody of claim 14 or 15 that is a monoclonal antibody.
17. A method of identifying or quantitating cells expressing an EDAG polypeptide in a cell or tissue sample, comprising
(a) contacting the sample with the antibody of claim 16, so that said antibody binds to cells expressing said epitope;
(b) assessing the presence of, or quantitating the number of, cells to which said antibody is bound.
18. A method of detecting the presence of, or quantitating, an EDAG polypeptide, fragment or homologue in a sample, comprising the steps of:
(a) contacting the sample with the antibody of claim 16 such that the antibody binds to any polypeptides or fragments bearing said epitope;
(b) detecting the presence of, or quantitating the polypeptides or fragments bound to said antibody.
19. A method for detecting an abnormality in early hematopoiesis associated with an abnormal amount of EDAG protein in a biological fluid sample, a cell sample or a tissue sample suspected of said abnormality, comprising:
(a) determining the amount of EDAG in said sample in accordance with claim 18,
(b) comparing the amount determined in step (a) with the amount of EDAG polypeptide in a normal or control sample of said biological fluid, cells or tissue, or with a predetermined normal value;
wherein if the amount of EDAG determined in step (a) is significantly lower or higher than said control or value, said sample is detected as abnormal.
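Claim 19 does not specify how "significantly lower or higher" is to be decided. As one hypothetical reading, the Python sketch below flags a sample whose measured EDAG amount lies more than a chosen number of standard deviations from the mean of a set of control measurements. The function name, the z-score criterion, the default cutoff of 2.0 and the numbers in the usage lines are all illustrative assumptions, not values taken from the application.

```python
from statistics import mean, stdev


def classify_edag_amount(sample_amount, control_amounts, z_cutoff=2.0):
    """Label a sample 'abnormal' if it deviates strongly from the controls.

    Hypothetical criterion: |z| > z_cutoff, where z is the sample's distance
    from the control mean in units of the control standard deviation.
    """
    if len(control_amounts) < 2:
        raise ValueError("need at least two control measurements")
    mu = mean(control_amounts)
    sigma = stdev(control_amounts)
    if sigma == 0:
        # All controls identical: any difference counts as a deviation.
        return "normal" if sample_amount == mu else "abnormal"
    z = (sample_amount - mu) / sigma
    return "abnormal" if abs(z) > z_cutoff else "normal"


# Made-up control values in arbitrary units, for illustration only.
controls = [10.0, 11.0, 9.0, 10.5, 9.5]
print(classify_edag_amount(10.2, controls))  # close to the control mean
print(classify_edag_amount(15.0, controls))  # far above the control mean
print(classify_edag_amount(3.0, controls))   # far below the control mean
```

A predetermined normal value, the alternative named in step (b), could be handled the same way by supplying a fixed reference mean and spread instead of computing them from control samples.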
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/103,140 US20030113817A1 (en) | 2001-03-22 | 2002-03-22 | Hemogen-EDAG: novel nuclear factors expressed in hematopoietic development |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US27762401P | 2001-03-22 | 2001-03-22 | |
US10/103,140 US20030113817A1 (en) | 2001-03-22 | 2002-03-22 | Hemogen-EDAG: novel nuclear factors expressed in hematopoietic development |
Publications (1)
Publication Number | Publication Date |
---|---|
US20030113817A1 (en) | 2003-06-19 |
Family
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/103,140 Abandoned US20030113817A1 (en) | 2001-03-22 | 2002-03-22 | Hemogen-EDAG: novel nuclear factors expressed in hematopoietic development |
Country Status (1)
Country | Link |
---|---|
US (1) | US20030113817A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102860282A (en) * | 2011-07-06 | 2013-01-09 | 中国人民解放军军事医学科学院放射与辐射医学研究所 | Preparation method of transgenic mouse of specificity expression Cre recombinase of hematopoietic system |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030165852A1 (en) * | 2000-11-15 | 2003-09-04 | Schueler Paula A. | Methods and reagents for identifying rare fetal cells in the maternal circulation |
US20040053396A1 (en) * | 2001-12-04 | 2004-03-18 | Jackson Jennifer L. | Molecules for disease detection and treatment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | STCB | Information on status: application discontinuation | Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |
 | AS | Assignment | Owner name: NATIONAL INSTITUTES OF HEALTH (NIH), U.S. DEPT. OF; Free format text: CONFIRMATORY LICENSE; ASSIGNOR: WAYNE STATE UNIVERSITY; REEL/FRAME: 028641/0539; Effective date: 20120719 |
 | AS | Assignment | Owner name: NATIONAL INSTITUTES OF HEALTH - DIRECTOR DEITR, MA; Free format text: CONFIRMATORY LICENSE; ASSIGNOR: WAYNE STATE UNIVERSITY; REEL/FRAME: 049258/0553; Effective date: 20190308 |