WO2018101950A1 - Methods and compositions for the diagnosis of acute myeloid leukemia - Google Patents
Methods and compositions for the diagnosis of acute myeloid leukemia Download PDFInfo
- Publication number
- WO2018101950A1 WO2018101950A1 PCT/US2016/064442 US2016064442W WO2018101950A1 WO 2018101950 A1 WO2018101950 A1 WO 2018101950A1 US 2016064442 W US2016064442 W US 2016064442W WO 2018101950 A1 WO2018101950 A1 WO 2018101950A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- son
- polypeptide
- nucleic acid
- level
- acid encoding
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 131
- 208000031261 Acute myeloid leukaemia Diseases 0.000 title claims abstract description 45
- 238000003745 diagnosis Methods 0.000 title claims abstract description 16
- 239000000203 mixture Substances 0.000 title claims description 24
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 title abstract description 30
- 101150014221 Son gene Proteins 0.000 claims abstract description 93
- 208000002250 Hematologic Neoplasms Diseases 0.000 claims abstract description 79
- 230000009033 hematopoietic malignancy Effects 0.000 claims abstract description 79
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 312
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 312
- 229920001184 polypeptide Polymers 0.000 claims description 311
- 102000039446 nucleic acids Human genes 0.000 claims description 168
- 108020004707 nucleic acids Proteins 0.000 claims description 168
- 150000007523 nucleic acids Chemical class 0.000 claims description 168
- 108090000623 proteins and genes Proteins 0.000 claims description 158
- 238000012360 testing method Methods 0.000 claims description 112
- 210000004027 cell Anatomy 0.000 claims description 109
- 239000000523 sample Substances 0.000 claims description 99
- 239000003795 chemical substances by application Substances 0.000 claims description 57
- 101710169972 Menin Proteins 0.000 claims description 55
- 102100030550 Menin Human genes 0.000 claims description 55
- 230000027455 binding Effects 0.000 claims description 52
- 230000014509 gene expression Effects 0.000 claims description 49
- 230000001965 increasing effect Effects 0.000 claims description 46
- 108700009124 Transcription Initiation Site Proteins 0.000 claims description 43
- 101000652433 Homo sapiens Protein SON Proteins 0.000 claims description 42
- 108010051779 histone H3 trimethyl Lys4 Proteins 0.000 claims description 42
- 102000004169 proteins and genes Human genes 0.000 claims description 35
- 108010016788 Cyclin-Dependent Kinase Inhibitor p21 Proteins 0.000 claims description 28
- 102100033270 Cyclin-dependent kinase inhibitor 1 Human genes 0.000 claims description 28
- 101001059220 Homo sapiens Zinc finger protein Gfi-1 Proteins 0.000 claims description 26
- 102100022103 Histone-lysine N-methyltransferase 2A Human genes 0.000 claims description 25
- 101001045846 Homo sapiens Histone-lysine N-methyltransferase 2A Proteins 0.000 claims description 25
- 102100029004 Zinc finger protein Gfi-1 Human genes 0.000 claims description 25
- 210000001185 bone marrow Anatomy 0.000 claims description 24
- 201000003793 Myelodysplastic syndrome Diseases 0.000 claims description 22
- 230000000694 effects Effects 0.000 claims description 22
- 108091034117 Oligonucleotide Proteins 0.000 claims description 21
- 101000771599 Homo sapiens WD repeat-containing protein 5 Proteins 0.000 claims description 20
- 108060006706 SRC Proteins 0.000 claims description 20
- 102100029445 WD repeat-containing protein 5 Human genes 0.000 claims description 20
- 102000001332 SRC Human genes 0.000 claims description 19
- 101000865038 Homo sapiens Histone-lysine N-methyltransferase SETD1A Proteins 0.000 claims description 17
- 102100029768 Histone-lysine N-methyltransferase SETD1A Human genes 0.000 claims description 16
- 238000011536 re-plating Methods 0.000 claims description 16
- 102100031150 Growth arrest and DNA damage-inducible protein GADD45 alpha Human genes 0.000 claims description 15
- 102100027768 Histone-lysine N-methyltransferase 2D Human genes 0.000 claims description 15
- 101001066158 Homo sapiens Growth arrest and DNA damage-inducible protein GADD45 alpha Proteins 0.000 claims description 15
- 101001045848 Homo sapiens Histone-lysine N-methyltransferase 2B Proteins 0.000 claims description 15
- 101001008894 Homo sapiens Histone-lysine N-methyltransferase 2D Proteins 0.000 claims description 15
- 101000703089 Homo sapiens Set1/Ash2 histone methyltransferase complex subunit ASH2 Proteins 0.000 claims description 15
- 102100030733 Set1/Ash2 histone methyltransferase complex subunit ASH2 Human genes 0.000 claims description 15
- 238000009396 hybridization Methods 0.000 claims description 15
- 102100023226 Early growth response protein 1 Human genes 0.000 claims description 14
- 101001049697 Homo sapiens Early growth response protein 1 Proteins 0.000 claims description 14
- 101000591187 Homo sapiens Notch homolog 2 N-terminal-like protein A Proteins 0.000 claims description 14
- 102100034093 Notch homolog 2 N-terminal-like protein A Human genes 0.000 claims description 14
- 210000002798 bone marrow cell Anatomy 0.000 claims description 14
- 239000012634 fragment Substances 0.000 claims description 14
- 210000003958 hematopoietic stem cell Anatomy 0.000 claims description 13
- 102100036220 PC4 and SFRS1-interacting protein Human genes 0.000 claims description 12
- 108010093345 lens epithelium-derived growth factor Proteins 0.000 claims description 12
- 101000974349 Homo sapiens Nuclear receptor coactivator 6 Proteins 0.000 claims description 11
- 102100022929 Nuclear receptor coactivator 6 Human genes 0.000 claims description 11
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 claims description 11
- 101000608194 Homo sapiens Pyrin domain-containing protein 1 Proteins 0.000 claims description 10
- 239000007787 solid Substances 0.000 claims description 10
- -1 SET IB Proteins 0.000 claims description 9
- 101000584499 Homo sapiens Polycomb protein SUZ12 Proteins 0.000 claims description 8
- 101000856554 Homo sapiens Zinc finger protein Gfi-1b Proteins 0.000 claims description 8
- 102100030702 Polycomb protein SUZ12 Human genes 0.000 claims description 8
- 102100025531 Zinc finger protein Gfi-1b Human genes 0.000 claims description 8
- 102000050914 human SON Human genes 0.000 claims description 8
- 210000005087 mononuclear cell Anatomy 0.000 claims description 6
- 102100030232 Protein SON Human genes 0.000 claims 27
- 102000001708 Protein Isoforms Human genes 0.000 abstract description 43
- 108010029485 Protein Isoforms Proteins 0.000 abstract description 43
- 230000003993 interaction Effects 0.000 description 49
- 108020004459 Small interfering RNA Proteins 0.000 description 39
- 238000011529 RT qPCR Methods 0.000 description 33
- 238000002487 chromatin immunoprecipitation Methods 0.000 description 29
- 238000004458 analytical method Methods 0.000 description 27
- 230000002018 overexpression Effects 0.000 description 27
- 208000032839 leukemia Diseases 0.000 description 26
- 238000002474 experimental method Methods 0.000 description 25
- 239000013598 vector Substances 0.000 description 25
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 24
- 108010085371 Activating Transcription Factor 3 Proteins 0.000 description 23
- 102000007476 Activating Transcription Factor 3 Human genes 0.000 description 22
- 230000006870 function Effects 0.000 description 22
- 238000001114 immunoprecipitation Methods 0.000 description 20
- 230000001404 mediated effect Effects 0.000 description 18
- 238000001262 western blot Methods 0.000 description 18
- 108020004414 DNA Proteins 0.000 description 15
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 15
- 108010033040 Histones Proteins 0.000 description 14
- 239000013612 plasmid Substances 0.000 description 14
- 241000699666 Mus <mouse, genus> Species 0.000 description 13
- 230000002401 inhibitory effect Effects 0.000 description 12
- 108010077544 Chromatin Proteins 0.000 description 11
- 238000003556 assay Methods 0.000 description 11
- 210000003483 chromatin Anatomy 0.000 description 11
- 230000002441 reversible effect Effects 0.000 description 11
- 230000004568 DNA-binding Effects 0.000 description 10
- 101100220044 Homo sapiens CD34 gene Proteins 0.000 description 10
- 239000000306 component Substances 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 10
- 230000004048 modification Effects 0.000 description 10
- 238000012986 modification Methods 0.000 description 10
- 238000001890 transfection Methods 0.000 description 10
- 239000002299 complementary DNA Substances 0.000 description 9
- 239000000284 extract Substances 0.000 description 9
- 238000013518 transcription Methods 0.000 description 9
- 230000035897 transcription Effects 0.000 description 9
- 241000713666 Lentivirus Species 0.000 description 8
- 239000011324 bead Substances 0.000 description 8
- 230000007115 recruitment Effects 0.000 description 8
- 230000002103 transcriptional effect Effects 0.000 description 8
- 102100030095 Histone-lysine N-methyltransferase SETD1B Human genes 0.000 description 7
- 101000864672 Homo sapiens Histone-lysine N-methyltransferase SETD1B Proteins 0.000 description 7
- 101000838301 Homo sapiens Tubulin gamma-1 chain Proteins 0.000 description 7
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 7
- 102100028979 Tubulin gamma-1 chain Human genes 0.000 description 7
- 238000000749 co-immunoprecipitation Methods 0.000 description 7
- 210000000130 stem cell Anatomy 0.000 description 7
- 230000008685 targeting Effects 0.000 description 7
- 230000037426 transcriptional repression Effects 0.000 description 7
- 230000003827 upregulation Effects 0.000 description 7
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 230000001332 colony forming effect Effects 0.000 description 6
- 239000013604 expression vector Substances 0.000 description 6
- 230000000977 initiatory effect Effects 0.000 description 6
- 239000013610 patient sample Substances 0.000 description 6
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- 108700024394 Exon Proteins 0.000 description 5
- 101000652434 Mus musculus Protein SON Proteins 0.000 description 5
- 241000699670 Mus sp. Species 0.000 description 5
- 108010047956 Nucleosomes Proteins 0.000 description 5
- 230000009918 complex formation Effects 0.000 description 5
- 238000001415 gene therapy Methods 0.000 description 5
- 230000003394 haemopoietic effect Effects 0.000 description 5
- 229940027941 immunoglobulin g Drugs 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 238000007069 methylation reaction Methods 0.000 description 5
- 238000002493 microarray Methods 0.000 description 5
- 210000001623 nucleosome Anatomy 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 4
- 108091029523 CpG island Proteins 0.000 description 4
- 239000007995 HEPES buffer Substances 0.000 description 4
- 102000010029 Homer Scaffolding Proteins Human genes 0.000 description 4
- 108010077223 Homer Scaffolding Proteins Proteins 0.000 description 4
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 4
- 206010028980 Neoplasm Diseases 0.000 description 4
- 108010014608 Proto-Oncogene Proteins c-kit Proteins 0.000 description 4
- 102000016971 Proto-Oncogene Proteins c-kit Human genes 0.000 description 4
- 230000004570 RNA-binding Effects 0.000 description 4
- 239000000427 antigen Substances 0.000 description 4
- 102000036639 antigens Human genes 0.000 description 4
- 108091007433 antigens Proteins 0.000 description 4
- 230000033228 biological regulation Effects 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 230000003247 decreasing effect Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 230000008482 dysregulation Effects 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 238000001638 lipofection Methods 0.000 description 4
- 239000002502 liposome Substances 0.000 description 4
- 210000004940 nucleus Anatomy 0.000 description 4
- 239000008194 pharmaceutical composition Substances 0.000 description 4
- 238000003757 reverse transcription PCR Methods 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 4
- 230000003612 virological effect Effects 0.000 description 4
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- 241000702421 Dependoparvovirus Species 0.000 description 3
- 101001084710 Drosophila melanogaster Histone H2A.v Proteins 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 3
- 102100031181 Glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 3
- 102100023919 Histone H2A.Z Human genes 0.000 description 3
- 102000006947 Histones Human genes 0.000 description 3
- 101000905054 Homo sapiens Histone H2A.Z Proteins 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 241001411185 Sison Species 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000000903 blocking effect Effects 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 238000010828 elution Methods 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 229920000609 methyl cellulose Polymers 0.000 description 3
- 230000011987 methylation Effects 0.000 description 3
- 239000001923 methylcellulose Substances 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 238000007747 plating Methods 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 238000003753 real-time PCR Methods 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 238000010361 transduction Methods 0.000 description 3
- 230000026683 transduction Effects 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- 108020005345 3' Untranslated Regions Proteins 0.000 description 2
- 108020003589 5' Untranslated Regions Proteins 0.000 description 2
- 101100504389 Arabidopsis thaliana GFT1 gene Proteins 0.000 description 2
- 101100477411 Dictyostelium discoideum set1 gene Proteins 0.000 description 2
- 108091060211 Expressed sequence tag Proteins 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 102100022537 Histone deacetylase 6 Human genes 0.000 description 2
- 102100027755 Histone-lysine N-methyltransferase 2C Human genes 0.000 description 2
- 102100021090 Homeobox protein Hox-A9 Human genes 0.000 description 2
- 101000899330 Homo sapiens Histone deacetylase 6 Proteins 0.000 description 2
- 101001008892 Homo sapiens Histone-lysine N-methyltransferase 2C Proteins 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- 108700041619 Myeloid Ecotropic Viral Integration Site 1 Proteins 0.000 description 2
- 102000047831 Myeloid Ecotropic Viral Integration Site 1 Human genes 0.000 description 2
- WWGBHDIHIVGYLZ-UHFFFAOYSA-N N-[4-[3-[[[7-(hydroxyamino)-7-oxoheptyl]amino]-oxomethyl]-5-isoxazolyl]phenyl]carbamic acid tert-butyl ester Chemical compound C1=CC(NC(=O)OC(C)(C)C)=CC=C1C1=CC(C(=O)NCCCCCCC(=O)NO)=NO1 WWGBHDIHIVGYLZ-UHFFFAOYSA-N 0.000 description 2
- 108091061960 Naked DNA Proteins 0.000 description 2
- 229930182555 Penicillin Natural products 0.000 description 2
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 2
- RVGRUAULSDPKGF-UHFFFAOYSA-N Poloxamer Chemical compound C1CO1.CC1CO1 RVGRUAULSDPKGF-UHFFFAOYSA-N 0.000 description 2
- 238000003559 RNA-seq method Methods 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- 230000001594 aberrant effect Effects 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 150000001413 amino acids Chemical class 0.000 description 2
- 230000003042 antagnostic effect Effects 0.000 description 2
- 238000001574 biopsy Methods 0.000 description 2
- 238000009835 boiling Methods 0.000 description 2
- 108010006025 bovine growth hormone Proteins 0.000 description 2
- 210000004900 c-terminal fragment Anatomy 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 230000003021 clonogenic effect Effects 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 239000000412 dendrimer Substances 0.000 description 2
- 229920000736 dendritic polymer Polymers 0.000 description 2
- 238000000432 density-gradient centrifugation Methods 0.000 description 2
- 229960003964 deoxycholic acid Drugs 0.000 description 2
- KXGVEGMKQFWNSR-LLQZFEROSA-N deoxycholic acid Chemical compound C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 KXGVEGMKQFWNSR-LLQZFEROSA-N 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 230000003828 downregulation Effects 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 239000012091 fetal bovine serum Substances 0.000 description 2
- 239000012997 ficoll-paque Substances 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 108010027263 homeobox protein HOXA9 Proteins 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000001802 infusion Methods 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- KWGKDLIKAYFUFQ-UHFFFAOYSA-M lithium chloride Chemical compound [Li+].[Cl-] KWGKDLIKAYFUFQ-UHFFFAOYSA-M 0.000 description 2
- 210000004698 lymphocyte Anatomy 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- 229910001629 magnesium chloride Inorganic materials 0.000 description 2
- 238000002826 magnetic-activated cell sorting Methods 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 230000002246 oncogenic effect Effects 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 229940049954 penicillin Drugs 0.000 description 2
- 239000000546 pharmaceutical excipient Substances 0.000 description 2
- 230000009038 pharmacological inhibition Effects 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 229960000502 poloxamer Drugs 0.000 description 2
- 229920001983 poloxamer Polymers 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 230000006916 protein interaction Effects 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000001177 retroviral effect Effects 0.000 description 2
- 229960005322 streptomycin Drugs 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 230000000699 topical effect Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 239000003981 vehicle Substances 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- IHPYMWDTONKSCO-UHFFFAOYSA-N 2,2'-piperazine-1,4-diylbisethanesulfonic acid Chemical compound OS(=O)(=O)CCN1CCN(CCS(O)(=O)=O)CC1 IHPYMWDTONKSCO-UHFFFAOYSA-N 0.000 description 1
- 239000013607 AAV vector Substances 0.000 description 1
- 102100032187 Androgen receptor Human genes 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 238000011740 C57BL/6 mouse Methods 0.000 description 1
- 101710167800 Capsid assembly scaffolding protein Proteins 0.000 description 1
- 208000005623 Carcinogenesis Diseases 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 208000028952 Chronic enteropathy associated with SLCO2A1 gene Diseases 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 102100023580 Cyclic AMP-dependent transcription factor ATF-4 Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 241000450599 DNA viruses Species 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 102100038595 Estrogen receptor Human genes 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 241000700721 Hepatitis B virus Species 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- 101000905743 Homo sapiens Cyclic AMP-dependent transcription factor ATF-4 Proteins 0.000 description 1
- 101000932480 Homo sapiens Fms-related tyrosine kinase 3 ligand Proteins 0.000 description 1
- 101000716729 Homo sapiens Kit ligand Proteins 0.000 description 1
- 101000582631 Homo sapiens Menin Proteins 0.000 description 1
- 101001123298 Homo sapiens PR domain zinc finger protein 14 Proteins 0.000 description 1
- 101000779418 Homo sapiens RAC-alpha serine/threonine-protein kinase Proteins 0.000 description 1
- 101001077298 Homo sapiens Retinoblastoma-binding protein 5 Proteins 0.000 description 1
- 101000799461 Homo sapiens Thrombopoietin Proteins 0.000 description 1
- 101000694103 Homo sapiens Thyroid peroxidase Proteins 0.000 description 1
- 101001050297 Homo sapiens Transcription factor JunD Proteins 0.000 description 1
- 206010062904 Hormone-refractory prostate cancer Diseases 0.000 description 1
- 108091029795 Intergenic region Proteins 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- 206010025323 Lymphomas Diseases 0.000 description 1
- 101150029107 MEIS1 gene Proteins 0.000 description 1
- 241000407429 Maja Species 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 101000932479 Mus musculus Fms-related tyrosine kinase 3 ligand Proteins 0.000 description 1
- 101001033276 Mus musculus Interleukin-3 Proteins 0.000 description 1
- 101000716728 Mus musculus Kit ligand Proteins 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 102000007999 Nuclear Proteins Human genes 0.000 description 1
- 108010089610 Nuclear Proteins Proteins 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 239000007990 PIPES buffer Substances 0.000 description 1
- 102100028974 PR domain zinc finger protein 14 Human genes 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- 101710130420 Probable capsid assembly scaffolding protein Proteins 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 108700040121 Protein Methyltransferases Proteins 0.000 description 1
- 102000055027 Protein Methyltransferases Human genes 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 102100033810 RAC-alpha serine/threonine-protein kinase Human genes 0.000 description 1
- 239000012083 RIPA buffer Substances 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 102000015097 RNA Splicing Factors Human genes 0.000 description 1
- 108010039259 RNA Splicing Factors Proteins 0.000 description 1
- 102000044126 RNA-Binding Proteins Human genes 0.000 description 1
- 108700020471 RNA-Binding Proteins Proteins 0.000 description 1
- 238000011530 RNeasy Mini Kit Methods 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 102100025192 Retinoblastoma-binding protein 5 Human genes 0.000 description 1
- 235000013290 Sagittaria latifolia Nutrition 0.000 description 1
- 101710204410 Scaffold protein Proteins 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108010034546 Serratia marcescens nuclease Proteins 0.000 description 1
- 238000000692 Student's t-test Methods 0.000 description 1
- 101710120037 Toxin CcdB Proteins 0.000 description 1
- 102100023118 Transcription factor JunD Human genes 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- 102100038458 Ubinuclein-1 Human genes 0.000 description 1
- 101710094188 Ubinuclein-1 Proteins 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 101150063416 add gene Proteins 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 108010080146 androgen receptors Proteins 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- PXXJHWLDUBFPOL-UHFFFAOYSA-N benzamidine Chemical compound NC(=N)C1=CC=CC=C1 PXXJHWLDUBFPOL-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 239000000090 biomarker Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 230000005907 cancer growth Effects 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- 125000002091 cationic group Chemical group 0.000 description 1
- 238000000525 cavity enhanced absorption spectroscopy Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 230000006369 cell cycle progression Effects 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 238000002659 cell therapy Methods 0.000 description 1
- 230000005754 cellular signaling Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000011490 co-immunoprecipitation assay Methods 0.000 description 1
- 235000015246 common arrowhead Nutrition 0.000 description 1
- 230000008094 contradictory effect Effects 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 239000003431 cross linking reagent Substances 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 231100000673 dose–response relationship Toxicity 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000003937 drug carrier Substances 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 239000012149 elution buffer Substances 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 238000010201 enrichment analysis Methods 0.000 description 1
- 230000004049 epigenetic modification Effects 0.000 description 1
- 230000010856 establishment of protein localization Effects 0.000 description 1
- 108010038795 estrogen receptors Proteins 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 238000013401 experimental design Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000011152 fibreglass Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 230000007849 functional defect Effects 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 238000001641 gel filtration chromatography Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 229930195712 glutamate Natural products 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 230000011132 hemopoiesis Effects 0.000 description 1
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 1
- 231100000844 hepatocellular carcinoma Toxicity 0.000 description 1
- 108091008039 hormone receptors Proteins 0.000 description 1
- 102000055151 human KITLG Human genes 0.000 description 1
- 102000047681 human MEN1 Human genes 0.000 description 1
- 102000053400 human TPO Human genes 0.000 description 1
- 238000002513 implantation Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000007917 intracranial administration Methods 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 239000004816 latex Substances 0.000 description 1
- 229920000126 latex Polymers 0.000 description 1
- 208000021601 lentivirus infection Diseases 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 239000006249 magnetic particle Substances 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 108091035591 miR-23a stem-loop Proteins 0.000 description 1
- 238000010208 microarray analysis Methods 0.000 description 1
- 239000011325 microbead Substances 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 231100000590 oncogenic Toxicity 0.000 description 1
- 230000006548 oncogenic transformation Effects 0.000 description 1
- 238000011275 oncology therapy Methods 0.000 description 1
- 239000006179 pH buffering agent Substances 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 210000005259 peripheral blood Anatomy 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 239000004800 polyvinyl chloride Substances 0.000 description 1
- 229920000915 polyvinyl chloride Polymers 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000022983 regulation of cell cycle Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000000754 repressing effect Effects 0.000 description 1
- 230000001718 repressive effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 230000003584 silencer Effects 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000011476 stem cell transplantation Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000000153 supplemental effect Effects 0.000 description 1
- 230000008093 supporting effect Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 238000007910 systemic administration Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 238000003146 transient transfection Methods 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 239000000277 virosome Substances 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 238000009736 wetting Methods 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 238000002689 xenotransplantation Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/574—Immunoassay; Biospecific binding assay; Materials therefor for cancer
- G01N33/57407—Specifically defined cancers
- G01N33/57426—Specifically defined cancers leukemia
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Definitions
- Embodiments of the present invention relate to the diagnosis of hematopoietic malignancies. Some embodiments include methods and kits for the diagnosis of hematopoietic malignancies, such as acute myeloid leukemia. Some embodiments relate to the presence of alternatively-spliced isoforms of a SON gene.
- Methyl ation of lysine residues of histone H3 is an event dictating active or repressed status of chromatin.
- Tri-methylation of histone 3 lysine 4 (H3K4me3) near transcription start sites is associated with active transcription (Barski et al., 2007; Guenther et al., 2007), and in mammals, this modification is mediated by the SET1 and mixed lineage leukemia (MLL) family methyltransferases, SET1A, SET1B, and MLLl ⁇ t (Miller et al., 2001 ; Shilatifard, 2012).
- MLL mixed lineage leukemia
- the SET1/MLL proteins are associated with multiple subunit proteins, such as WDR5, ASH2L and RBBP5, to acquire a maximum activity in methylation of H3K4 (Cao et al., 2010; Dou et al., 2006; Ernst and Vakoc, 2012).
- the N-terminal portion of the MLL1/2 protein interacts with the scaffold protein menin, facilitating LEDGF interaction and chromatin binding of the MLL complex (Yokoyama and Cleary, 2008).
- the MLL-menin interaction is required for leukemia-associated target gene expression (Yokoyama et al., 2005), suggesting that this interaction is involved in activating oncogenic MLL-target genes.
- MLL-menin interaction has been shown to effectively block leukemia progression (Borkin et al., 2015) and prostate cancer growth (Malik et al., 2015), indicating that MLL-menin interaction could be a promising target for cancer therapy.
- cellular factors that regulate MLL-menin interaction and MLL complex assembly are largely unknown.
- Some embodiments of the methods and compositions provided herein include a method of diagnosing a hematopoietic malignancy or a likelihood of developing a hematopoietic malignancy in a test subject comprising: determining a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene, and the long polypeptide is encoded by a full length transcript of the SON gene.
- Some embodiments of the methods and compositions provided herein include a method of detecting a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject comprising: contacting a nucleic acid or polypeptide sample from the test subject with an agent which indicates the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject; contacting the nucleic acid or polypeptide sample with an agent which indicates the level of the long polypeptide or a nucleic acid encoding the long polypeptide in the test subject; and determining the ratio for the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of the long polypeptide or the level of a nucleic acid encoding the long polypeptide
- Some embodiments of the methods and compositions provided herein include a method of determining the level of a short polypeptide or a nucleic acid encoding the polypeptide in a nucleic acid or polypeptide sample from a test subject comprising: contacting the nucleic acid or polypeptide sample from the test subject with an agent which indicates the level of the short polypeptide or which indicates the level of a nucleic acid encoding the short polypeptide in the sample; and determining the level of the short polypeptide or a nucleic acid encoding the short polypeptide in the sample, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
- Some embodiments of the methods and compositions provided herein include a method of diagnosing a hematopoietic malignancy or a likelihood of developing a hematopoietic malignancy in a test subject comprising: determining the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a sample obtained from a test subject, wherein the short polypeptide is encoded by a truncated alternatively- spliced transcript of a SON gene.
- Some embodiments also include comparing the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a sample obtained from the test subject with the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a subject not having a hematopoietic malignancy.
- Some embodiments also include determining the level of a long polypeptide or a nucleic acid encoding the long polypeptide in the test subject, wherein the long polypeptide is encoded by a full length transcript of the SON gene.
- Some embodiments also include determining a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the sample from the test subject. In some embodiments, the ratio is greater than about 30%, 40%, or 50%.
- Some embodiments also include determining the level of an additional polypeptide in addition to the level of the short form of the SON polypeptide or the level of a nucleic acid encoding the additional polypeptide in addition to the level of a nucleic acid encoding the short form of the SON polypeptide in a test subject, wherein the additional polypeptide is encoded by a gene selected from CDKN1A, GFI1 and ATF3.
- Some embodiments also include comparing the level of an additional polypeptide or the level of a nucleic acid encoding the additional polypeptide in the test subject with the level of an additional polypeptide or the level of a nucleic acid encoding the additional polypeptide in a subject not having a hematopoietic malignancy.
- the truncated alternatively-spliced transcript of a SON gene includes a SON E variant of a SON gene, and/or a SON B variant of a SON gene.
- the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10.
- the short polypeptide comprises SEQ ID NOs:88 or 90.
- the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
- the full length transcript of the SON gene comprises a sequence which specifically binds to an oligonucleotide comprising a sequence selected from SEQ ID NOs: 13-18.
- the long polypeptide comprises SEQ ID NO:86.
- the nucleic acid encoding the long polypeptide comprises SEQ ID NO:85.
- the SON gene is a human SON gene.
- Some embodiments also include obtaining a sample from the test subject.
- Some embodiments also include contacting the sample with an agent which specifically binds to the short polypeptide or a nucleic acid encoding the short polypeptide.
- Some embodiments also include contacting the sample with an agent which specifically binds to the long polypeptide or a nucleic acid encoding the long polypeptide.
- the agent is a primer or a hybridization probe.
- the primer or the hybridization probe comprises an oligonucleotide comprising SEQ ID NOs:06, 10 or 13-18
- the agent is an antibody or antigen-binding fragment thereof which specifically binds to the short polypeptide. In some embodiments, the agent is an antibody or antigen-binding fragment thereof which specifically binds to the long polypeptide.
- the agent is attached to a solid support.
- the sample comprises nucleic acids or polypeptides.
- Some embodiments also include determining an increased level of a polypeptide encoded by a SON target gene or a nucleic acid encoded by a SON target gene in the sample from the test subject, wherein the SON target gene has a SON binding site.
- the SON target gene is selected from the group consisting of FOX03, NOTCH2NL, SRC, GADD45A, CDKN1A, ATF3, GFI1, EGR1, SRC, and GFI1B.
- Some embodiments also include determining an increased level of a MLL multi-protein complex in the sample from the test subject, wherein the MLL is selected from the group consisting of MLL1, MLL-N, MLL-C, and MLL2.
- the multiprotein complex comprises a protein selected from menin, ASH2L, LEDGF, and WDR5.
- Some embodiments also include determining an increased replating activity of a hematopoietic progenitor cell of the sample from the test subject.
- the hematopoietic progenitor cell is a bone marrow cell.
- Some embodiments also include determining an increased binding of a protein at the or adjacent to the transcription start site of a SON target gene in the sample from the test subject.
- the protein is selected from the group consisting of MLL1, MLL-N, MLL-C, MLL2, WDR5, ASH2L, menin, SET1A, SET1B, ASC2, and SUZ12.
- the SON target gene is selected from the group consisting of CDKN1A, GADD45A, NOTCH2NL, ATF3, GFI1, EGR1, and SRC.
- Some embodiments also include determining an increased level in H3K4me3 in the sample from the test subject. In some embodiments, the level in H3K4me3 is increased at the or adjacent to a SON target gene selected from the group consisting of CDKN1A, GADD45A, NOTCH2NL, ATF3, GFI1, EGR1, and SRC.
- the sample comprises bone marrow mononuclear cells (BM-MNCs), or peripheral blood mononuclear cells (PB-MNCs).
- BM-MNCs bone marrow mononuclear cells
- PB-MNCs peripheral blood mononuclear cells
- the hematopoietic malignancy is selected from acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS).
- AML acute myeloid leukemia
- MDS myelodysplastic syndrome
- the hematopoietic malignancy is an acute myeloid leukemia (AML) subtype selected from M2 without t(8;21), and t(8;21)(q22;q22).
- test subject is mammalian. In some embodiments, the test subject is human.
- Some embodiments of the methods and compositions provided herein include a method of diagnosing a hematopoietic malignancy or a likelihood of developing a hematopoietic malignancy in a test subject comprising: detecting increased levels of binding of a short polypeptide with a nucleic acid having a SON binding site in a sample from a test subject, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
- the truncated alternatively-spliced transcript of a SON gene includes a SON E variant of a SON gene, and/or a SON B variant of a SON gene.
- the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10.
- the short polypeptide comprises SEQ ID NOs:88 or 90.
- the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
- the hematopoietic malignancy is selected from acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS).
- AML acute myeloid leukemia
- MDS myelodysplastic syndrome
- the hematopoietic malignancy is an acute myeloid leukemia (AML) subtype selected from M2 without t(8;21), and t(8;21)(q22;q22).
- the subject is mammalian. In some embodiments, subject is human.
- Some embodiments of the methods and compositions provided herein include a method of ameliorating a hematopoietic malignancy comprising: increasing the level of a full length SON polypeptide or a nucleic acid encoding a full length SON polypeptide in a cell of a subject.
- the expression level of a nucleic acid encoding said full length SON polypeptide or the expression level of said full length SON polypeptide is increased by administering a composition comprising a nucleic acid to the subject.
- the nucleic acid comprises a sequence encoding a full length SON polypeptide.
- the SON polypeptide is a human SON polypeptide.
- the nucleic acid comprises SEQ ID NO.:85.
- the polypeptide comprises SEQ ID NO.:86.
- the hematopoietic malignancy is selected from acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS).
- AML acute myeloid leukemia
- MDS myelodysplastic syndrome
- the hematopoietic malignancy is an acute myeloid leukemia (AML) subtype selected from M2 without t(8;21), and t(8;21)(q22;q22).
- the subject is mammalian. In some embodiments, the subject is human.
- kits for the diagnosis of a hematopoietic malignancy in a test subject comprising: an agent that specifically binds to a short polypeptide or a nucleic acid encoding the short polypeptide, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
- Some embodiments also include an agent that specifically binds to a long polypeptide or a nucleic acid encoding the long polypeptide, wherein the long polypeptide is encoded by a full length transcript of the SON gene.
- Some embodiments also include an agent that specifically binds to an additional polypeptide in addition to the short polypeptide or a nucleic acid encoding the additional polypeptide in addition to the nucleic acid encoding the short polypeptide in the test subject, wherein the additional polypeptide is encoded by a gene selected from CDKN1A, GFI1 and ATF3.
- the truncated alternatively-spliced transcript of a SON gene includes a SON E variant of a SON gene, and/or a SON B variant of a SON gene.
- the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10. In some embodiments, the short polypeptide comprises SEQ ID NOs:88 or 90. In some embodiments, the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
- the full length transcript of the SON gene comprises a sequence which specifically binds to an oligonucleotide comprising a sequence selected from SEQ ID NOs: 13-18.
- the long polypeptide comprises SEQ ID NO:86.
- the nucleic acid encoding the long polypeptide comprises SEQ ID NO:85.
- the SON gene is a human SON gene.
- Some embodiments also include an agent that specifically binds to a SON target gene, wherein the SON target gene has a SON binding site.
- the SON target gene is selected from the group consisting of FOX03, NOTCH2NL, SRC, GADD45A, CDKN1A, ATF3, GFI1, EGR1, SRC, and GFI1B.
- Some embodiments also include an agent that specifically binds to a protein selected from MLL1, MLL-N, MLL-C, MLL2, menin, ASH2L, LEDGF, WDR5, SET1A, SET1B, ASC2, SUZ12, and H3K4me3.
- the agent is a primer or a hybridization probe.
- the primer or the hybridization probe comprises an oligonucleotide comprising SEQ ID NOs:06, 10, or 13-18.
- the agent is an antibody or antigen-bind fragment thereof which specifically binds to the short polypeptide.
- the agent is attached to a solid support.
- FIG. 1A depicts a schematic summarizing chromatin-immunoprecipitation (ChIP) using two different SON antibodies (Abs) and DN A- sequencing to determine overlap peaks.
- SON-N and SON-C Abs specifically bind to the N- and the C-terminus of SON, respectively.
- FIG. IB depicts genomic distribution of SON-binding sites determined by SON-N and SON-C ChIP overlap peaks.
- the pie graphs show the percentage of peaks located at specific genomic regions indicated (transcription start site, transcription start site).
- FIG. 1C depicts a Venn diagram showing the number of SON ChIP peaks (SON-N, SON-C and overlap) within ⁇ 5 kb from the transcription start site (Top). Pie graph illustrating genomic locations of overlap peaks within ⁇ 5kb from the transcription start site (Bottom).
- FIG. ID depicts a heat map showing the overlap peak signal of SON ChIP around the transcription start site of genes.
- FIG. IE depicts the top five DNA sequence motifs identified in the SON- N and SON-C overlap peaks within ⁇ 5 kb of the transcription start site.
- FIG. IF depicts gene ontology (GO) term enrichment analysis using DAVID for the genes in which SON-N and SON-C overlap peaks were found within ⁇ 5 kb from the transcription start site.
- FIG. 1G depicts the result of a pilot ChlP-seq study to verify specificity of two SON antibodies, SON-N Ab and SON-C Ab.
- the numbers of ChlP-seq peaks are summarized.
- FIG. 1H depicts a Venn diagram showing number of SON peaks determined by ChlP-seq using SON-N Ab and SON-C Ab.
- FIG. II depicts top five motifs enriched in SON-binding sites analyzed by HOMER.
- FIG. 2A depicts qPCR analyses of SON ChlP-seq target genes in K562 cells transfected with control siRNA and two different SON siRNAs.
- GFI1B served as a negative control which does not have SON binding sites near the transcription start site. Values represent mean ⁇ SD of four independent experiments. *p ⁇ 0.01.
- FIG. 2B depicts average signal profiles of indicated histone modifications and CpG islands around the SON-binding sites.
- FIG. 2C depicts Integrative Genomics Viewer (IGV) images representing SON (SON-N), H3K4me3, and H3K27ac ChlP-seq read counts at the target gene locus in K562 cells.
- IGF Integrative Genomics Viewer
- FIG. 2D depicts close-up images of ChlP-seq peaks of SON-N, SON-C, H3K4me3, and H3K27ac in representative SON target genes.
- FIG. 2E depicts SON ChlP-qPCR analysis (with SON-N Ab) confirming SON enrichment near the transcription start site of indicated genes in K562 cells.
- the number in the parenthesis indicates the base-pair counts from the transcription start site of each gene, and the primers for qPCR (TABLE 3) were designed to detect SON enrichment around these positions. *p-values ⁇ 0.01.
- FIG. 2F depicts HA ChlP-qPCR analysis measuring DNA-binding ability of FLA-tagged full-length SON and several deletion mutants in K562 cells; Full-length SON (SON F), potential DNA-binding region-deleted SON (SON ⁇ , amino acids 1 ,263 - 1,818 deleted), G-patch-deleted SON (SON AG-patch), and double-stranded RNA binding motif- deleted SON (SON ADSRM).
- Potential DNA-binding region that has been shown to interact with human hepatitis B virus genome (Sun et al., 201 1) is indicated. Empty vector transfected sample was used as a control. *p-values ⁇ 0.01.
- FIG. 2G depicts a Western blot that confirmed knockdown efficiency of two different siRNAs (#1 and #2) against SON in K562 cells.
- FIG. 2H depicts average signal profiles of H3K4mel and H3K27ac around the intergenic SON-binding sites.
- FIG. 21 depicts an IGV browser image of density profiles of SON (SON-N Ab ChIP), CpG islands, H3K4me3 and H3K27ac at several examples of SON target genes. Scale bar lOkb.
- FIG. 2J depicts close-up images of ChlP-seq peaks of SON-N, SON-C, H3K4me3, H3K27ac, H2A.Z and MNase-seq (nucleosome) peaks in representative SON target genes. Scale bar,lkb.
- FIG. 3A depicts ChlP-qPCR analyses of various histone modification levels at the SON-binding regions near the transcription start sites of the indicated genes upon SON knockdown in K562 cells. Histone H3 ChlP-qPCR was used as a control.
- FIG. 3B depicts ChlP-qPCR analyses MLL and SET1 complex protein recruitment (MLL-N; MLL1 N-terminus region, MLL-C; MLL1 C-terminus region, MLL2, WDR5, ASH2L, menin, SET1A, SET1B and ASC2) to the regions near the transcription start sites of the indicated genes upon SON knockdown.
- FIG. 3C depicts SON Depletion Leads to H3K4me3 Modification and MLL Complex Recruitment.
- ChlP-qPCR analysis of two SON target genes (CDKN1A and GADD45A), were conducted using indicated antibodies in K562 cells transfected with control or SON siRNA-#2. ChlP-qPCR results are plotted as percentage of input DNA. *p- values ⁇ 0.05.
- FIG. 3D depicts SON Depletion Leads to H3K4me3 Modification and MLL Complex Recruitment.
- ChlP-qPCR analysis of two non-targets (GFI1B and ATF4; B), were conducted using indicated antibodies in K562 cells transfected with control or SON siRNA-#l .
- ChlP-qPCR results are plotted as percentage of input DNA. *p-values ⁇ 0.05.
- FIG. 4A depicts a Western blot verified SON knockdown by SON siRNA transfection in K562 cells.
- FIG. 4B depicts a co-immunoprecipitation experiment with MLL-N antibodies.
- FIG. 4C depicts a co-immunoprecipitation experiment with WDR5 antibodies.
- FIG. 4D depicts a co-immunoprecipitation experiment with LEGF antibodies.
- FIG. 4E depicts a co-immunoprecipitation experiment with SET1A antibodies.
- FIG. 4F depicts interaction of SON with menin.
- K562 nuclear extracts were immunoprecipitated with control IgG or SON antibody (SON-N Ab) and several components of the MLL complex were examined by Western blot.
- FIG. 4G depicts verification of the SON-menin interaction.
- HEK 293 cells transfected with HA-SON, Flag-menin or pcDNA3-control as indicated were used for co- immunoprecipitation with HA antibody followed by Western blotting with HA or Flag antibodies.
- FIG. 4H depicts an immunoprecipitation experiment in K562 cells transfected with V5-tagged SON indicates that SON outcompetes MLL (MLL-N) for menin interaction in a dose-dependent manner.
- MLL-N MLL
- plasm id transfection two different amounts of SON-V5 construct, 5 ⁇ g (+) or 10 ⁇ g (++), were used.
- FIG. 41 depicts an experiment in which nuclear extracts from control and SON knockdown-K562 cells were size- fractionated on FPLC and analyzed for MLL1 N- terminus (MLL-N) and menin distribution by Western blotting using indicated antibodies (bottom panels).
- Top panels are elution profiles: fraction numbers on elution profile (top) indicate MLL-N-enriched fractions (left panel of WB, fractions 12-19) and trace on elution profile (top) indicates further downstream fractions (right panel of WB, fractions 20-41).
- FIG. 4J depicts a Western blot confirmed knockdown efficiency of SON siRNA in MV4; 11 cells.
- FIG. 4K depicts nuclear extracts from MV4; 11 transfected by control and SON siRNA were used for immunoprecipitation with MLL-N antibody and Western blotted with indicated antibodies.
- the N-terminus of wild-type MLL1 is marked by an asterisk and the MLL-AF4 fusion protein is marked by a red arrow head.
- FIG. 4L depicts 293T cells transiently co-expressing Flag-MLL-ENL and Myc-menin were transfected with control vector or SON-V5 construct and subjected to co- immunoprecipitation assays using a Flag antibody. Immunoprecipitated Flag-MLL-ENL and Myc-menin were analyzed by Western blotting.
- FIG. 4M depicts expression of target genes of MLL fusion proteins (HOXA9 and MEIS1) or SON protein (CDKN1A, ATF3, and GFI1) measured by real-time qPCR in the control and SON siRNA-transfected MLL-rearranged leukemic cell lines, MV4;11 and ML-2.
- the genes identified by our SON ChlP-seq in K562 e.g. CDKN1A, ATF3 and GFI1 were also upregulated upon SON knockdown in MV4;1 1 cells (heterozygous for MLL-AF4) which retain one copy of wild-type MLL.
- FIG. 5A depicts a schematic representation of the SON gene and the SON proteins.
- Full-length SON (SON F) is generated by alignment of 12 constitutive exons. Inclusion of alternative exons (exon 7a, and exon 5a) produces two different alternatively spliced isoforms, SON B and E.
- Horizontal arrows indicate the specific position of the primers used in qPCR. Arrowheads, polyadenylation signal sequences; Hatched boxes, untranslated regions.
- FIG. 5D depicts relative ratio of SON F, SON B and SON E in the BM- MNCs from AML patients (PI - P10) and healthy normal donors (Nl - N4).
- FIG. 5E depicts relative ratio of SON F, SON B and SON E in the PB- MNCs from AML patients (PI, P5, P9-14), MDS patients (P15 - 16) and healthy normal donors (N5 - Ni l).
- FIG. 5F depicts analyses of SON isoform expression in normal mouse Lin- /c-Kit+ bone marrow (BM) cells and leukemic blasts from mice with AMLl-ET09a- and MLL-AF9-induced leukemia. Specific exon regions indicated in each graph were determined by qPCR with the primer sets indicated in FIG. 5H. Data are represented as mean ⁇ SD of three independent experiments. *p ⁇ 0.01. [0094] FIG. 5G depicts relative ratios of three forms of Son in leukemic blasts (freshly isolated from the animals and also collected after methylcellulose culture) and normal Lin-/c-Kit+ mouse BM cells were determined.
- FIG. 5H depicts a schematic representation of the mouse Son genes and the Son proteins. Similar to human SON, full-length mouse Son (Son f; mouse Son isoform 1) is generated by alignment of 12 constitutive exons. Inclusion of alternative exons (exon 7a; exons 5a) produces two different alternatively spliced isoforms, Son b (a predicted isoform) and Son e (mouse Son isoform 2). The primer sets used for mouse Son qPCR (presented in FIG. 5F) are indicated with horizontal arrows.
- FIG. 51 depicts a structure of the SON E mRNA and position of PCR primers for 3' RACE PCR to confirm the 3' UTR sequence of SON E.
- Arrows indicate primers for 1st RACE PCR and arrows indicate primers for 2nd RACE PCR.
- EtBr-stained agarose gel showed 3' RACE-amplified PCR products (asterisk) that were used for sequencing. Decreased amount of the 3' RACE PCR product in total SON siRNA-transfected K562 cells, compared to control K562 cells, indicates the specificity of 3' RACE PCR.
- FIG. 5J depicts sequencing result of the 3' RACE PCR product from human SON E transcripts.
- FIG. 5K depicts a schematic strategy for measurement of relative ratio of SON isoforms presented in FIG. 5D, FIG. 5E and FIG. 5G.
- each qPCR analysis was performed using same forward primer and two different reverse primers.
- the ratio of exon 5a-included transcripts (SON E) and exon 5-included transcripts (total of SON F and SON B) was determined using a common forward primer (F3) and exon-specific reverse primers (R5 or R5a).
- the ratio of exon 7a-included transcripts (SON B) and exon 7-included transcript (SON F) was determined using a common forward primer (F5) and exon-specific reverse primers (R7 or R7a). Based on two qPCR analyses, relative levels of SON F, SON B and SON E were determined.
- FIG. 5L depicts a schematic of genomic structure of full-length SON and SON B. Each bar with the probe set number indicates the specific position of DNA probes used in microarray analysis (Affymetrix U133A microarrays).
- FIG. 5M depicts relative expression levels of SON detected by three different probe sets. The microarray data were from Stegmaier Leukemia Dataset (Stegmaier et al., 2004) from Oncomine database. The values of log2 median-centered intensity detected by indicated probe sets were displayed as a boxplot according to Oncomine output.
- FIG. 5N depicts relative expression levels of SON detected by three different probe sets.
- the microarray data were from Maia Leukemia Dataset (Maia et al., 2005) from Oncomine database.
- the values of log2 median-centered intensity detected by indicated probe sets were displayed as a boxplot according to Oncomine output.
- FIG. 6A depicts a qPCR analysis of leukemia-associated SON target genes, CDKN1A, ATF3 and GFI1, in BM-MNCs from healthy normal donors (Nl - N4) and AML patients (P2 - P10). Black Broken lines indicate the average level of each gene in normal donor samples.
- FIG. 6B depicts the effects of total SON knockdown (with SON siRNA-1) and SON E-specific knockdown (with SON E siRNA) on expression of leukemia-associated, ChlP-seq target genes in K562 cells.
- SON target genes regulated by RNA splicing function of SON TUBG1, HDAC6 and AKT1 were also examined, revealing the effect of SON E on regulation of ChlP-seq target genes, but not RNA splicing target genes.
- FIG. 6C depicts the effects of total SON knockdown (with SON siRNA-1) and SON E-specific knockdown (with SON E siRNA) on expression of leukemia-associated, ChlP-seq target genes in primary human CD34+ BM cells.
- FIG. 6D depicts effects of SON F and SON E overexpression on SON target gene expression (ChlP-seq target genes and RNA splicing target genes) in human CD34+ BM cells. Error bars represent SD from three independent experiments. *p ⁇ 0.05, **p ⁇ 0.01.
- FIG. 6E depicts a ChlP-qPCR assay to analyze the H3K4me3 levels at SON-binding sites near the promoter of the CDKN1A and GFI1 genes upon SON F and SON E expression.
- K562 cells were transfected with indicated siRNA and SON constructs (siRNA-resistant form).
- siRNA and SON constructs siRNA-resistant form
- 3 ⁇ g (+) of SON F constructs plus increasing amounts of SON E construct, 0 (-), 3 (+) or 6 ⁇ g (++), or 3 ⁇ g of SON E alone were used as indicated.
- H3K4me3 antibody or H3 antibody were used for ChlP.
- Asterisks indicate statistical significances of H3K4me3 reduction, as compared to the sample transfected with SON siRNA alone (the second lane).
- Inter-peak asterisks indicate statistical significance of the difference between two indicated samples. *p ⁇ 0.01.
- FIG. 6F depicts TUBG1 exon 7-8 minigene splicing assay demonstrating that SON E does not interfere with SON F-mediated RNA splicing.
- FIG. 6G depicts the location of the target regions of total SON siRNAs (siRNAs #1 and #2, targeting exon 3) and SON E siRNA (targeting exon 5a). SON siRNA sequences are described in herein.
- FIG. 6H depicts verification of SON F and SON E overexpression in primary human CD34+ bone marrow cells after nucleofection of the expression constructs.
- Exogenous expression of SON was determined by qPCR using the forward primers (F- hSON-Exonl2 for SON F, F-hSON-Exon4 for SON E) together with a reverse primer targeting the V5 sequence (R-V5-exo) in the plasmid. **p-values ⁇ 0.01.
- FIG. 61 depicts the expression level of SON F-V5 and SON E-Flag in the K562 cells used for V5-CMP and Flag-ChIP presented in FIG. 6E.
- FIG. 6 J depicts strategies of minigene assay with the TUBG1 exon7- intron 7 - exon8 minigene model (containing SON-dependent splicing sites; Ahn et al., 201 1; Lu et al., 2013).
- Splicing efficiencies of transfected SON F and SON E (expressed by the siRNA-resistant form of cDNA; Ahn et al., 201 1) were accessed by RT-PCR using a forward primer (F) and a reverse primer (R) to detect unspliced and spliced RNA.
- FIG. 6K depicts RT-PCR results demonstrating minigene splicing efficiencies.
- the numbers above the each lane indicates the relative amount of exogenous SON F and/or SON E transfected together with the minigene.
- FIG. 7A depicts a schematic diagram of expression constructs used for the experiments presented in panels B— E; V5-tagged SON F (full-length), Flag-tagged SON E, V5-tagged SON E, and the HA-tagged SON C-terminal fragment containing the RS domain and RNA-binding motifs (SR+RB).
- FIG. 7B depicts ChlP-qPCR analysis of SON F-V5 or SON E-Flag binding on target regions near the transcription start site of CDKN1 A.
- Five ⁇ g (+) of SON F- V5 constructs plus increasing amounts of SON E-Flag, 5 (+) or 10 ⁇ g (++) constructs were used for transfection into K562 cells. Shown are representative results of three independent experiments (mean ⁇ S.D. from triplicate).
- Asterisks indicate statistical significances of SON F-V5 or SON E-Flag enrichment, as compared to the background signal from negative control samples (SON E Flag-only sample in V5-ChlP and SON F-V5-only sample in Flag ChIP). Inter-peak asterisks indicate statistical significance of the difference between two indicated samples.
- FIG. 7C depicts co-immunoprecipitation (IP) experiments demonstrating the lack of menin-binding ability of SON E. Lysates of HEK 293 cells co-transfected with Flag-menin and SON F-V5 or SON E-V5 were subjected to IP with anti-V5, followed by Western blotting with indicated antibodies.
- IP co-immunoprecipitation
- FIG. 7D depicts the C-terminus of SON interacts with menin.
- the HA- tagged SR+RB fragment was expressed in HEK 293 cells with or without Flag-menin, and subjected to HA-IP followed by Western blotting with HA or Flag antibodies.
- FIG. 7E depicts SON E overexpression enhances the interactions between MLL complex components, while SON F overexpression shows inhibitory effects.
- Empty vector (pcDNA3), SON F-V5, or SON E-Flag were transfected into K562 cells (without depletion of endogenous SON) and nuclear extract was used for immunoprecipitation with MLL-N antibody followed by Western blotting with indicated antibodies.
- FIG. 7F depicts a Western blot verifying overexpression of SON E in primary mouse bone marrow cells infected with lentivirus carrying flag-tagged SON E.
- FIG. 7G depicts number of colony-forming units in serial replating assay using primary mouse bone marrow cells infected with lentivirus carrying the control vector or SON E. The results represent 3 independent experiments. *p ⁇ 0.05, **p ⁇ 0.01.
- FIG. 7H depicts representative images of control cells and the colony formed by SON E-overexpressing cells in the third round of the replating assay.
- FIG. 71 depicts models for SON and its alternative spliced isoform function in transcription.
- SON binds G/C-rich sequence near the transcription start site to inhibit transcriptional activity.
- Interaction of the SON C-terminal region with menin diminishes MLL complex assembly and its recruitment to target DNA, leading to the low level of H3K4me3 and transcriptional repression.
- MLL complex formation is facilitated, resulting in enhanced recruitment of the MLL complex and subsequent increase of H3K4me3 and promoter activity.
- SON B and SON E In leukemic condition, a high level of short SON isoforms (SON B and SON E) interrupts DNA-binding of full- length SON, but cannot interfere with MLL-menin interaction, resulting in abrogation of the full-length SON function and de-repression/activation of multiple leukemia-associated genes.
- FIG. 7J depicts co-immunoprecipitation of HA-tagged SON SR+RB with Myc-tagged full-length/wild type (WT) or several partial fragments of menin after transient transfections of the constructs into 293T cells as indicated. Immunoprecipitations were performed using an HA antibody and analyzed by Western blotting using a Myc antibody. Wild type and several partial fragments of menin were detected in the precipitates (upper panel, lane 3, 5, 6, and 8; arrowheads). Asterisks indicate IgG.
- FIG. 7K depicts a schematic diagram shows the structure of the menin and its deletion constructs used in FIG. 7J. The region previously shown to interact with MLL is indicated with a bar.
- FIG. 7L depicts a Western blot verified expression of V5-tagged SON F (SON F-V5) and Flag-tagged SON E (SON E-Flag) in K562 cells used for the experiment presented in FIG. 7E.
- FIG. 7M depicts schematic structures of the lentiviral construct for SON E overexpression and the illustration of the experimental design.
- Bone marrow (BM) cells were isolated from mice, infected with empty lentivirus (Control) or Flag-tagged SON E— expressing lentivirus (SON E) and subjected to the colony forming and serial replating assays. For each round of plating, a total of 2 ⁇ 10 4 BM cells were cultured in MethoCult (M3434).
- FIG. 7N depicts representative photomicrographs of colonies from control and SON E lentivirus-infected BM cells from the first plating, demonstrating the high efficiency of infection (GFP-positive cells). Scale bars on pictures represent 100 ⁇ . DETAILED DESCRIPTION
- SON previously known as an RNA splicing factor, has been found to also control MLL complex-mediated transcriptional initiation. SON binds to DNA near transcription start sites, interacts with menin, and inhibits MLL complex assembly, resulting in decreased histone 3 lysine 4 (H3K4me3) and transcriptional repression. Alternatively-spliced short isoforms of SON are markedly upregulated in acute myeloid leukemia.
- the short isoforms compete with full- length SON for chromatin occupancy, but lack the menin-binding ability, thereby antagonizing full-length SON function in transcriptional repression while not impairing full- length SON-mediated RNA splicing. Furthermore, overexpression of a short isoform of SON enhances replating potential of hematopoietic progenitors.
- SON is a ubiquitously expressed nuclear protein recently identified as an SR-like splicing co-factor. SON is required for proper RNA splicing of selective genes (Ahn et al., 201 1; Hickey et al., 2014; Lu et al., 2013; Lu et al., 2014; Martello, 2013; Sharma et al., 2011). Knockdown of SON leads to splicing defects in transcripts of other genes containing weak splice sites, and many of the affected genes are necessary for cell cycle progression and epigenetic modification (Ahn et al., 201 1; Sharma et al., 201 1).
- SON is highly expressed in human embryonic stem cells and is an essential factor in stem cell pluripotency (Chia et al., 2010; Lu et al., 2013). Further RNA-seq analyses revealed that SON knockdown causes intron retention and exon skipping at several pluripotency genes, such as OCT4, PRDM14 and E4F 1 (Lu et al., 2013).
- SON may function in transcriptional regulation.
- SON has been implicated in DNA-binding (Mattioni et al., 1992; Sun et al., 2001), and SON suppresses the promoter activity of the miR-23a ⁇ 27a ⁇ 24-2 cluster (Ahn et al., 2013).
- SON has an unexpected role in interacting with menin and regulating MLL complex activity, H3K4me3, and transcriptional initiation of multiple leukemia-associated genes. Described herein are significant increases in acute myeloid leukemia in short splice variants of SON, which lack menin-interacting ability, and their functional significance in blocking full-length SON function in transcriptional repression while not impairing full-length SON-mediated RNA splicing.
- Applicant has discovered that an increase in the ratio of the level of a truncated alternatively-spliced transcript of a SON gene, such as SON E and/or SON B transcripts of the SON gene, to the level of a full length transcript of the SON gene, such as a SON F transcript can indicate a subject having a hematopoietic malignancy, or the likelihood of the subject developing a hematopoietic malignancy.
- a truncated alternatively-spliced transcript of a SON gene can indicate a subject having a hematopoietic malignancy, or the likelihood of the subject developing a hematopoietic malignancy.
- an increase in the levels of additional markers, such CDKN1A, GFI1 and ATF3 can also indicate a subject having a hematopoietic malignancy, or the likelihood of the subject developing a hematopoietic malignancy.
- Some embodiments of the methods and compositions provided herein include methods for the diagnosis of a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy in a test subject.
- the hematopoietic malignancy can include acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS).
- AML acute myeloid leukemia
- MDS myelodysplastic syndrome
- the hematopoietic malignancy is an AML subtype including M2 without t(8;21), and t(8;21)(q22;q22).
- the short polypeptide is a truncated alternatively-spliced transcript of a SON gene. See for example, TABLE 1.
- the truncated alternatively-spliced transcript of a SON gene comprises a SON E variant of a SON gene, or a SON B variant of a SON gene.
- the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10.
- the short polypeptide comprises SEQ ID NOs:88 or 90.
- the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
- the long polypeptide is encoded by a full length transcript of the SON gene. See e.g., TABLE 1.
- the full length transcript of the SON gene comprises a sequence which specifically binds to an oligonucleotide comprising a sequence selected from SEQ ID NOs:13-18.
- the long polypeptide comprises SEQ ID NO: 86.
- the nucleic acid encoding the long polypeptide comprises SEQ ID NO:85.
- the SON gene is a human SON gene.
- Some embodiments include determining a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject.
- the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene
- the long polypeptide is encoded by a full length transcript of the SON gene.
- Some embodiments include methods of detecting a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject.
- Such methods can include obtaining a nucleic acid or polypeptide sample from the test subject; contacting the nucleic acid or polypeptide sample with an agent which indicates the level of the short polypeptide or a nucleic acid encoding the short polypeptide in the test subject; contacting the nucleic acid or polypeptide sample with an agent which indicates the level of the long polypeptide or a nucleic acid encoding the long polypeptide in the test subject; and determining the ratio for the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of the long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject.
- the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene
- the long polypeptide is encoded by a full length transcript of the SON gene.
- a ratio of the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject can be indicative of a test subject having a diagnosis for a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy.
- the ratio is greater than about 30%, 40%, 50%, 60%, 70%, 80%, and 90%, or any range between any two of the foregoing numbers.
- Some embodiments include methods of detecting an increase in the level of a short polypeptide or a nucleic acid encoding the polypeptide in a test subject. Such methods can include obtaining a nucleic acid or polypeptide sample from the test subject; contacting the nucleic acid or polypeptide sample with an agent which indicates an increase in the level of the short polypeptide or a nucleic acid encoding the short polypeptide in the test subject; and determining the level the short polypeptide or a nucleic acid encoding the short polypeptide in the test subject.
- the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
- Some embodiments include methods of diagnosing a hematopoietic malignancy, or the likelihood of developing a hematopoietic malignancy in a test subject. Such embodiments can include determining the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a subject. In such embodiments, the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
- Some methods also include comparing the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a test subject with the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a subject not having a hematopoietic malignancy.
- an increase in the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a subject is indicative of a test subject having a diagnosis for a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy.
- the increase is at least about 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 100%, and 200%, or any range between any two of the foregoing numbers, compared to the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a subject not having a hematopoietic malignancy.
- Some methods also include determining the level of a long polypeptide or a nucleic acid encoding the long polypeptide in the test subject, wherein the long polypeptide is encoded by a full length transcript of the SON gene. Some such methods also include determining a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject. In the foregoing methods, a ratio can be indicative of a test subject having a diagnosis for a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy. In some embodiments, the ratio is greater than about 30%, 40%, 50%, 60%, 70%, 80%, and 90%, or any range between any two of the foregoing numbers.
- Some methods also include determining the level of a polypeptide in addition to the level of the short form of the SON polypeptide or the level of a nucleic acid encoding the additional polypeptide in addition to the level of a nucleic acid encoding the short form of the SON polypeptide, wherein the additional polypeptide is encoded by a gene selected from CDKN1A, GFI1 and ATF3. Some methods also include comparing the level of an additional polypeptide or the level of a nucleic acid encoding the additional polypeptide in the test subject with the level of an additional polypeptide or the level of a nucleic acid encoding the additional polypeptide in a subject not having a hematopoietic malignancy.
- the increase is at least about 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 100%, and 200%, or any range between any two of the foregoing numbers, compared to the level of an additional polypeptide or the level of a nucleic acid encoding the additional polypeptide in a subject not having a hematopoietic malignancy.
- Some embodiments also include determining the level of a polypeptide encoded by a SON target gene or a nucleic acid encoded by a SON target gene in a sample from a test subject.
- the SON target gene has a SON binding site.
- an increased level of a polypeptide encoded by a SON target gene or a nucleic acid encoded by a SON target gene in the sample from the test subject compared to the level of a polypeptide encoded by a SON target gene or a nucleic acid encoded by a SON target gene in a sample from a subject not having a hematopoietic malignancy can be indicative of a diagnosis of a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy in a test subject.
- the SON target gene is selected from the group consisting of FOX03, NOTCH2NL, SRC, GADD45A, CDKN1A, ATF3, GFT1 , EGR1 , SRC, and GFT1 B.
- Some embodiments also include determining the level of a MLL multi- protein complex in a sample from a test subject. Some embodiments also include comparing the level of the MLL multi-protein complex in the sample from the test subject with the level of a MLL multi-protein complex in a sample from a sample from a subject not having a hematopoietic malignancy. In some embodiments, an increase in the level of a MLL multi- protein complex in the sample from a test subject can be indicative of a diagnosis of a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy in a test subject.
- the MLL is selected from the group consisting of MLLl, MLL-N, MLL-C, and MLL2.
- the multiprotein complex includes a protein selected from menin, ASH2L, LEDGF, and WDR5.
- Some embodiments also include determining the level of a replating activity of a hematopoietic progenitor cell of a sample from a test subject. Some embodiments also include comparing the level of replating activity of a hematopoietic progenitor cell of the sample from the test subject with the level of replating activity of a hematopoietic progenitor cell of the sample from the test subject from a subject not having a hematopoietic malignancy. In some embodiments, an increase in the level of replating activity of a hematopoietic progenitor cell of the sample from the test subject can be indicative of a hematopoietic malignancy. In some embodiments, the hematopoietic progenitor cell is a bone marrow cell.
- Some embodiments also include determining the level of binding of a protein at the or adjacent to the transcription start site of a SON target gene in a sample from a test subject. Some embodiments also include comparing the level the level of binding of a protein at the or adjacent to the transcription start site of a SON target gene in a sample from a test subject with the level of binding of a protein at the or adjacent to the transcription start site of a SON target gene in a sample from a subject not having a hematopoietic malignancy. In some embodiments, an increased binding of a protein at the or adjacent to the transcription start site of a SON target gene in the sample from the test subject can be indicative of a hematopoietic malignancy.
- the protein can include MLL1, MLL-N, MLL-C, MLL2, WDR5, ASH2L, menin, SET1A, SET1B, ASC2, and SUZ12.
- the SON target gene can include CDKN1 A, GADD45A, NOTCH2NL, ATF3, GFI1, EGR1, and SRC.
- Some embodiments also include determining the level in H3K4me3 in the sample from the test subject. Some embodiments also include comparing the level in H3K4me3 in the sample from the test subject with the level in H3K4me3 in the sample from a subject not having a hematopoietic malignancy. In some embodiments an increased level in H3K4me3 in a sample from a test subject can be indicative of a hematopoietic malignancy. In some embodiments, the level in H3K4me3 is increased at the or adjacent to a SON target gene including CDKN1A, GADD45A, NOTCH2NL, ATF3, GFI1, EGR1, and SRC.
- a SON target gene including CDKN1A, GADD45A, NOTCH2NL, ATF3, GFI1, EGR1, and SRC.
- Some embodiments also include a sample from the test subject.
- the sample can include nucleic acids or polypeptides.
- the sample comprises bone marrow mononuclear cells (BM-MNCs), or peripheral blood mononuclear cells (PB-MNCs).
- BM-MNCs bone marrow mononuclear cells
- PB-MNCs peripheral blood mononuclear cells
- Some embodiments also include contacting the sample with an agent which specifically binds to the short polypeptide or a nucleic acid encoding the short polypeptide. Some embodiments also include contacting the sample with an agent which specifically binds to the long polypeptide or a nucleic acid encoding the long polypeptide.
- the agent is a primer or a hybridization probe. In some embodiments, the primer or the hybridization probe comprises an oligonucleotide comprising SEQ ID NOs:06, 10 or 13-18.
- Some embodiments also include contacting the sample with an agent which specifically binds to a SON target gene, such as a SON target gene having a SON binding site.
- a SON target gene such as a SON target gene having a SON binding site.
- the SON target includes FOX03, NOTCH2NL, SRC, GADD45A, CDKN1A, ATF3, GFI1, EGR1, SRC, and GFI1B.
- Some embodiments also include contacting the sample with an agent which specifically binds to a protein selected from MLL1, MLL-N, MLL-C, MLL2, menin, ASH2L, LEDGF, WDR5, SET1A, SET1B, ASC2, SUZ12, and H3K4me3.
- an agent which specifically binds to a protein selected from MLL1, MLL-N, MLL-C, MLL2, menin, ASH2L, LEDGF, WDR5, SET1A, SET1B, ASC2, SUZ12, and H3K4me3.
- the agent is an antibody or antigen-binding fragment thereof which specifically binds to the short polypeptide.
- the agent can be immobilized on a solid support.
- solid supports include a test well of a microtiter plate, a membrane, such as nitrocellulose membrane, a particle, such as a bead, comprising glass, fiberglass, latex, or a plastic material, such as polystyrene or polyvinylchloride.
- the solid support can include a magnetic particle.
- the agent can be immobilized on the solid support by a noncovalent association, such as adsorption, and/or a covalent attachment which can include a direct linkage between the agent and functional groups on the solid support, or may be a linkage by way of a cross-linking agent.
- Methods to determine the level of a nucleic acid such as a transcript of a SON gene, such as a truncated alternatively-spliced transcript of a SON gene, such as SON B variant, and SON E variant; a full length transcript of a SON gene, such as SON F; and nucleic acids encoded by CDKN1A, GFI1 and ATF3 are also well known in the art.
- a nucleic acid such as a transcript of a SON gene, such as a truncated alternatively-spliced transcript of a SON gene, such as SON B variant, and SON E variant
- a full length transcript of a SON gene such as SON F
- nucleic acids encoded by CDKN1A, GFI1 and ATF3 are also well known in the art. Examples of the foregoing methods include hybridizing a hybridization probe or primer with a nucleic acid. Methods can include PCR, quantative methods of PCR, such as
- the hybridization probe or primer can include an oligonucleotide comprising any one of SEQ ID NOs:06, 10, 13-18.
- Methods to determine the level of a polypeptide such as a polypeptide encoded by a truncated alternatively-spliced transcript of a SON gene, such as SON B variant, and SON E variant; a full length transcript of a SON gene, such as SON F; and nucleic acids encoded by CDKN1A, GFI1 and ATF3 are well known in the art.
- Examples of the foregoing methods include contacting the polypeptide with an agent that that specifically binds to the polypeptide.
- the agent comprises an antibody or antigen- binding fragment thereof.
- Some embodiments of the methods and compositions provided herein also include a method of diagnosing a hematopoietic malignancy or a likelihood of developing a hematopoietic malignancy in a test subject that includes detecting increased levels of binding of a short polypeptide with a nucleic acid having a SON binding site in a sample from a test subject, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
- Some embodiments of the methods and compositions provided herein also include methods of ameliorating a hematopoietic malignancy. Some such methods include treat, preventing, and/or reducing the symptoms of the hematopoietic malignancy. Some embodiments include increasing the level of a full length SON polypeptide or a nucleic acid encoding a full length SON polypeptide in a cell of a subject. In some embodiments, the expression level of a nucleic acid encoding SON or the expression level of SON protein is increased by administering an isolated nucleic acid to the subject. In some embodiments, the nucleic acid comprises a sequence encoding a full length SON polypeptide.
- the SON polypeptide is a human SON polypeptide.
- the nucleic acid comprises SEQ ID NO.:85.
- the polypeptide comprises SEQ ID NO.: 86.
- the hematopoietic malignancy is selected from acute myeloid leukemia (AML) and myelodysplasia syndrome (MDS).
- the hematopoietic malignancy is an acute myeloid leukemia (AML) subtype selected from M2 without t(8;21), and t(8;21)(q22;q22).
- the subject is mammalian. In some embodiments, the subject is human.
- Methods to increase the level of a polypeptide or a nucleic acid encoding a polypeptide, such as a full length SON polypeptide, in a cell of a subject are well known in the art.
- some methods can include administering an expression vector to the subject.
- the expression vector can include the nucleic acid and a promoter.
- the promoter can be tissue-specific, constitutive, and/or inducible.
- compositions that include the nucleic acid, such as an expression vector including the nucleic acid, and a pharmaceutically effective carrier.
- Pharmaceutically acceptable carriers are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. Accordingly, there is a wide variety of suitable formulations of pharmaceutical compositions available (see, e.g., Remington's Pharmaceutical Sciences, 17th ed., 1989).
- compositions can be administered by various means known in the art.
- vectors can be delivered to cells ex vivo, such as cells explanted from an individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy), followed by re-implantation of the cells into a subject, usually after selection for cells which have incorporated the vector.
- Therapies where cells are genetically modified ex vivo, and then re-introduced into a subject can be referred to as cell-based therapies or cell therapies.
- cells are isolated from the subject organism, transduced with an exogenous gene (gene or cDNA) according to the present disclosure, and re-infused back into the subject (e.g., a patient).
- Expression vectors can include plasmid vectors, retroviral vectors, lentiviral vectors, adenovirus vectors, helper-dependent adenovirus, poxvirus vectors; herpesvirus vectors and adeno-associated virus vectors, etc. See, also, U.S. Pat. Nos. 6,534,261; 6,607,882; 6,824,978; 6,933,1 13; 6,979,539; 7,013,219; and 7,163,824, incorporated by reference herein in their entireties.
- Non-viral vector delivery systems include DNA plasmids, naked nucleic acid, and nucleic acid complexed with a delivery vehicle such as a liposome or poloxamer.
- Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell.
- Methods of non-viral delivery of nucleic acids include electroporation, lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, dendrimers, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, agent-enhanced uptake of DNA or use of macromolecules such as dendrimers (see Wijagkanalen et al (201 1) Pharm Res 28(7) p. 1500-19). Sonoporation using, e.g., the Sonitron 2000 system (Rich-Mar) can also be used for delivery of nucleic acids.
- nucleic acid delivery systems include those provided by Amaxa Biosystems (Cologne, Germany), Maxcyte, Inc. (Rockville, Md.), BTX Molecular Delivery Systems (Holliston, Mass.) and Copernicus Therapeutics Inc, (see for example U.S. Pat. No. 6,008,336).
- Lipofection is described in e.g., U.S. Pat. Nos. 5,049,386; 4,946,787; and 4,897,355) and lipofection reagents are sold commercially (e.g., TransfectamTM and LipofectinTM).
- Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Feigner, WO 91/17424, WO 91/16024.
- Adenoviral based systems can be used.
- Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and high levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system.
- Adeno -associated virus (“AAV”) vectors are also used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West et al., Virology 160:38-47 (1987); U.S. Pat. No.
- Gene therapy vectors such as expression vectors comprising a nucleic acid encoding a full length SON polypeptide, can be delivered in vivo by administration to an individual patient, typically by systemic administration (e.g., intravenous, intraperitoneal, intramuscular, subdermal, or intracranial infusion) or topical application, as described below.
- vectors can be delivered to cells ex vivo, such as cells explanted from an individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy) or universal donor hematopoietic stem/progenitor cells, followed by reimplantation of the cells into a patient, usually after selection for cells which have incorporated the vector.
- Vectors e.g., retroviruses, adenoviruses, liposomes, etc.
- donor construct such as expression vectors comprising a nucleic acid encoding a full length SON polypeptide
- naked DNA complexed/formulated with a delivery vehicle e.g. liposome or poloxamer
- Administration is by any of the routes normally used for introducing a molecule into ultimate contact with blood or tissue cells including, but not limited to, injection, infusion, topical application and electroporation. Suitable methods of administering such nucleic acids are available and well known to those of skill in the art, and, although more than one route can be used to administer a particular composition, a particular route can often provide a more immediate and more effective reaction than another route.
- Formulations for both ex vivo and in vivo administrations include suspensions in liquid or emulsified liquids.
- the active ingredients often are mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient.
- Suitable excipients include, for example, water, saline, dextrose, glycerol, ethanol or the like, and combinations thereof.
- the composition may contain minor amounts of auxiliary substances, such as, wetting or emulsifying agents, pH buffering agents, stabilizing agents or other reagents that enhance the effectiveness of the pharmaceutical composition.
- kits for the diagnosis of a hematopoietic malignancy, or likelihood of developing a hematopoietic malignancy in a test subject can include an agent that specifically binds to a short polypeptide or a nucleic acid encoding the short polypeptide.
- the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
- Some such embodiments can include an agent that specifically binds to a long polypeptide or a nucleic acid encoding the long polypeptide.
- the long polypeptide is encoded by a full length transcript of the SON gene.
- Some embodiments also include an agent that specifically binds to an additional polypeptide or a nucleic acid encoding the additional polypeptide in the test subject, wherein the additional polypeptide is encoded by a gene selected from CDKN1A, GFI1 and ATF3.
- the truncated alternatively-spliced transcript of a SON gene comprises a SON E variant of a SON gene. In some embodiments, the truncated alternatively-spliced transcript of a SON gene comprises a SON B variant of a SON gene.
- the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10. In some embodiments, the short polypeptide comprises SEQ ID NOs:88 or 90. In some embodiments, the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
- the full length transcript of the SON gene comprises a sequence which specifically binds to an oligonucleotide comprising a sequence selected from SEQ ID NOs: 13-18.
- the long polypeptide comprises SEQ ID NO:86.
- the nucleic acid encoding the long polypeptide comprises SEQ ID NO:85.
- the SON gene is a human SON gene.
- Some embodiments also include an agent that specifically binds to a SON target gene, wherein the SON target gene has a SON binding site.
- the SON target gene can include FOX03, NOTCH2NL, SRC, GADD45A, CDKN1A, ATF3, GFI1, EGR1, SRC, and GFI1B.
- Some embodiments also include an agent that specifically binds to a includingMLLl, MLL-N, MLL-C, MLL2, menin, ASH2L, LEDGF, WDR5, SET1A, SET IB, ASC2, SUZ12, and H3K4me3.
- the agent is a primer or a hybridization probe.
- the primer or the hybridization probe comprises an oligonucleotide comprising SEQ ID NOs:06, 10, or 13-18.
- the agent is an antibody or antigen-binding fragment thereof which specifically binds to the short polypeptide.
- the agent is attached to a solid support.
- the ChlP-seq peaks located 5 kb upstream and downstream ( ⁇ 5kb) from the transcription start site were analyzed, which were mainly localized in the promoter, the 5' UTR, and the first exon or intron of the target genes (FIG. 1C).
- the heat map and motif analysis confirmed that SON binding sites are indeed enriched near transcription start sites (FIG. ID) and contain repetitive G or C tracts as well as GC dinucleotide repeats (FIG. IE and FIG. II).
- the genes bearing SON peaks at their transcription start site have functions in DNA-binding / transcription (e.g.
- ATF3, GFI1, EGR1, FOX03A), receptor signal transduction (NOTCH2NL, SRC) and cell cycle regulation (CDKN1A, GADD45A) (FIG. IF).
- Altered expression of these genes has been implicated in perturbed hematopoiesis, leukemia and other cancers (Janz et al., 2006; Joslin et al., 2007; Khandanpour et al., 2013; Liebermann et al., 201 1; Phelan et al., 2010; Viale et al., 2009). Enrichment of SON near transcription start sites of these genes was further confirmed by ChlP-qPCR (FIG.
- FIG. 2E knockdown of SON by siRNA (FIG. 2G) caused upregulation of SON ChIP target genes (FIG. 2A), indicating that SON interaction with transcription start sites of target genes exerts inhibitory effects on transcription.
- the genomic regions bearing SON peaks near transcription start sites show a low level of conventional nucleosomes and the presence of variant histone, H2A.Z, indicating the active chromatin status (FIG. 2J).
- a close look at the peak regions revealed that the high peaks of H3K4me3 were precisely aligned with the valleys of SON peaks, and vice versa (FIG. 2D, FIG. 2J).
- SON depletion leads to increased recruitment of MLL complex components to the SON target genes and enhanced MLL complex formation
- MLL itself is a weak H3K4 methyltransferase
- its interaction with the multiple subunit proteins greatly increases the ability to induce H3K4me3 (Dou et al., 2006; Steward et al., 2006).
- IP immunoprecipitation
- MLL-N MLL1 N-terminus
- SON interacts with menin, an MLL1/2 complex component, and SON-menin interaction diminishes MLL-menin interaction
- menin-MLL-N interaction was assessed as well as menin-SON interaction by IP with menin antibody.
- SON overexpression increased the menin-SON complex formation and at the same time, menin interaction with MLL-N was significantly decreased (FIG. 4H).
- Size-exclusion chromatography further demonstrated that a higher amount of menin was detectable in the MLL 1 -containing fractions when SON was depleted by siRNA (FIG. 41).
- BM-MNCs Bone marrow mononuclear cells
- FAB subtype M2 Bone marrow mononuclear cells
- TABLE 2 healthy donors
- FIG. 5A Similar to previous results (Ahn et al., 2013), most patient samples showed high levels of total SON (exon 1—3 region).
- expression levels of the exon 9— 12 region specific for SON F did not show significant upregulation, the expression level of alternative exon (exon 5a or 7a)-containing transcripts were significantly increased in AML patient BM-MNC samples (FIG.
- SON F is the major form of SON in normal human BM-MNCs, and the short isoforms (SON B and SON E) occupy -20% of total SON (FIG. 5D).
- the portion of SON E was remarkably increased in AML patient BM-MNCs, resulting in SON E accounting for 30 - 70% of total SON (FIG. 5D).
- PB-MNCs were also analyzed for relative ratios of full-length and SON isoforms. While PB-MNCs from 7 normal healthy donors represent almost identical patterns showing that only -10% of total SON is taken by SON E and SON B, the percentage of SON E and especially SON B are markedly increased in AML and MDS patients (FIG.
- SON E attenuates the full-length SON function in transcriptional repression, resulting in derepression of SON ChIP target genes, but does not impair full-length SON-mediated RNA splicing
- SON ChIP target genes CDKN1A, GFI1 and ATF3 were selected (FIG. 6A) since the importance of tight regulation of these genes in leukemia and other cancers has been demonstrated (Abbas and Dutta, 2009; Janz et al., 2006; Khandanpour et al., 2013; Phelan et al., 2010; Viale et al., 2009).
- These target genes were indeed significantly upregulated in BM-MNCs from AML patients compared with healthy donors (FIG. 6A), indicating that increased expression of SON short isoforms is associated with de-repression of SON ChIP target genes in AML.
- FIG. 6G To examine the exact effect of SON short isoforms on SON target gene expression, a specific siRNA targeting the SON E-specific exon 5a was developed (FIG. 6G), and confirmed that this siRNA lowers the level of SON E, but not SON F (FIG. 6B and FIG. 6C). Interestingly, while the total SON siRNA that targets all forms of SON significantly upregulated SON ChIP target genes, transfection of SON E-specific siRNA led to downregulation of theses target genes in both K562 cells and human CD34+ bone marrow (BM) cells (FIG. 6B and FIG. 6C), indicating that a stronger repression of target genes occurred in the absence of SON E.
- BM bone marrow
- SON E overexpression resultsed in upregulation of those target genes in human CD34+ BM cells (FIG. 6D). These results strongly support the concept that SON E weakens the inhibitory effect of full-length SON on target gene transcription.
- SON E expression affects SON F function in repressing H3K4me3 was examined.
- SON F (siRNA-resistant form) expression could lower the H3K4me3 level, which was increased by SON siRNA, near the transcription start sites of the CDKN1A and GFI1 genes (FIG. 6E).
- expression of SON E alone failed to reduce the H3K4me3 level, indicating its lack of ability to suppress H3K4me3.
- SON E was co-expressed with SON F (FIG. 61)
- SON E could attenuate the repressive effect of SON F on H3K4me3 in a dose-dependent manner (FIG. 6E).
- a minigene splicing assay was performed using the TUBG1 exon 7-8 minigene model (Ahn et al., 201 1) and various ratios of SON F and SON E transfection to assess the effect of SON E expression on SON F-mediated RNA splicing (FIG. 6J).
- SON E expression did not attenuate SON F-mediated RNA splicing (FIG. 6F and FIG. 6K).
- SON E competes with full-length SON for DNA-binding, but lacks the menin-binding ability, thus attenuating the inhibitory effect of full-length SON on MLL complex assembly
- SON E was overexpressed in mouse primary bone marrow cells through lenviral transduction (FIG. 7F, FIG. 7M, and FIG. 7N), and measured the colony forming ability in methylcellulose replating assays (FIG. 7M).
- SON E-overexpressing cells were able to produce significantly increased numbers of colonies and were able to maintain colony forming cells even in the 5th round of plating (FIG. 7G and FIG. 7H), indicating higher clonogenic potential.
- the findings demonstrate that increased expression of SON short isoforms enhances the preservation of sternness in normal hematopoietic progenitors, suggesting its potential contribution to stem cell self-renewal and development and/or maintenance of leukemic stem cells.
- SON was previously known as an RNA splicing co-factor.
- This study demonstrated that SON and its splice variants regulate MLL complex assembly and H3K4me3, affect gene expression of multiple leukemia-associated genes, and affect replating potential of hematopoietic progenitors.
- the data suggest that although the short splice variants, such as SON E, interact with target DNA, they cannot exert an inhibitory effect on MLL complex assembly due to the lack of menin-binding ability. Therefore, overexpression of "short SON" in pathological conditions, such as AML, attenuates the inhibitory effects of full-length SON on MLL complex assembly, resulting in activation of multiple target gene transcription (modeled in FIG. 71).
- MLL-menin interaction is also involved in promoting hepatocellular carcinoma development (Xu et al., 2013).
- MLL-menin interaction Given the significance of MLL-menin interaction in disease-associated gene expression, identification of endogenous regulatory factors affecting this interaction would generate valuable information for understanding and targeting the MLL complex.
- menin In addition to MLL, menin also interacts with other transcription factors and hormone receptors, such as JUND, estrogen receptor and androgen receptor (Agarwal et al., 1999; Dreijerink et al., 2006; Malik et al., 2015).
- Short splice variants of SON their effects on two arms of SON function and clinical significance
- SON ChIP target genes identified include critical factors for cell-cycle and hematopoietic differentiation, such as CDKN1A, GFI1 and ATF3, which are aberrantly upregulated in AML patients with elevated levels of short SON isoforms (FIG. 5 and FIG. 6). Therefore, increased SON isoforms in hematopoietic stem cells or progenitors could potentially lead to failures in dosage controls of multiple leukemia- associated target genes.
- overexpression of SON E specifically weakened the full-length SON function in transcriptional initiation, but does not impair full-length SON- mediated RNA splicing.
- SON E overexpression markedly enhances replating capacity of primary hematopoietic progenitors, shedding light on functional significance of aberrant upregulation of "short SON" in AML. This study suggests that overexpression of short SON alone may not be sufficient to drive oncogenic transformation of hematopoietic cells, but implies potential roles of short SON in self-renewal and clonogenic abilities of leukemic cells.
- ChlP-Sequencing ChlP-seq
- ChlP-qPCR ChlP-qPCR
- ChIP Chromatin Immunoprecpitation
- ChlP-seq libraries were generated using Gnomegen Library Preparation Kits (Gnomegen), and ChlP-seq libraries were cluster-amplified and sequenced with the Illumina HiSeq2000 sequencer (50-nucleotide pair-ended read).
- K562, MV4; 1 1 and ML-2 cell lines were cultured in RPMI 1640 medium with 10% fetal bovine serum.
- Primary human CD34+ bone marrow cells were purchased from Lonza and were cultured in STEMSPAN SFEM (StemCell Technologies) supplemented with 1% penicillin and streptomycin, 100 ng/ml of recombinant human SCF, 100 ng/ml of recombinant human FLT3L and 100 ng/ml of recombinant human TPO for 1 to 2 days before transfection.
- Plasmids containing siRNA-resistant full-length SON cDNA (siRR-SON F) and SON SR+RB (serine/arginine-rich region and RNA-binding domain) were created as previously described (Ahn et al., 201 1).
- the plasmid containing SON E was constructed as follows.
- the region from internal Hpal site to the 3' end of SON E cDNA was amplified using the forward primer (SON Hpal-F: 5'-GAATCTTCAATTACGTTAACA-3') (SEQ ID NO:77) and the reverse primer (Flag-SON E Notl-R: 5'- TTTGCGGCCGCTATTTGTCATCGTCATCCTTGTAGTCTGGCCGGCC AAACTCAGTTTAGTTCTTCTATAGT AGCTCCTCCTG-3 ' ) (SEQ ID NO:78).
- the sequence for the Flag tag was added as indicated in the reverse primer.
- GC AAGTGATGTTGGACGTGACAGATC-3 ' (SEQ ID NO:79) and V5-SON F Notl-R: 5'- TTTGCGGCCGCtacgtagaatcgagaccgaggagagggttagggataggcttaccTGGCCGGCCatacctatt caagaaaacatacaatt-3' (SEQ ID NO:80)) and subcloned into plasmids.
- Full-length cDNA clones of human menin (pcDNA3 Flag-menin, #32079) and MLL-ENL (pMSCV Flag-MLL- pl-ENL, #20873) were purchased from Addgene).
- RT-qPCR Reverse transcription and Quantitative PCR
- RT-qPCR Quantitative real time PCR
- the antibodies used for ChIP, IP, and Western blotting were the following; SON antibody recognizing the N-terminus of SON (SON-N Ab) was generated against amino acids 74-88 of human SON. SON-N antibody was used for both ChIP and Western blot.
- Anti- SON (SON-C Ab, abl21759, for ChIP), anti-H3K27ac (ab4729), anti-H3K4me3 (ab8580), anti-SUZ12 (ab l2073), anti-WDR5 (ab56919), anti-H3 (abl791), and anti-H3K4mel (ab8895), anti-H3K79me2 (ab3594) were purchased from Abeam.
- Anti-TRX2/MLL2 (A300- 1 13A), anti-ASH2L (A300-489A), anti-menin (A300-105A), anti-MLL-C (MLL1, C- terminus, A300-374A), anti-LEDGF (A300-848A), anti-SETIA (A300-289A), anti-SETIB (A302-281A), anti-CFPl (A303-161A), and anti-MLL-N for ChIP (MLL1 , N-terminus, A300-086A) were purchased from Bethyl Laboratories. ASC2 antibody was a gift from Dr. Jae W Lee (Oregon Health & Science University).
- Anti-H3K27me3 (07-4490, Millipore), anti-MLL-N for IP and WB (MLL1 N-terminus, 39829, Active motif), anti-V5 (R960-25, Invitrogen), anti-HA (#2367, Cell Signaling Technology), anti-Flag M2 (F3165, Sigma), and anti-Actin (A5441 , Sigma) were purchased from the indicated companies.
- K562 cells were incubated with 1% formaldehyde in 5 ml growth medium for 10 min at room temperature and cross-linking reaction was terminated by incubation with 125 raM glycine for 10 min. Subsequently cells were incubated for 15 min at 4°C with lysis buffer (5 mM PIPES pH 8.0 / 85 mM KC1 / 0.5% NP-40 / lx Complete Protease Inhibitor Cocktail (Roche)), collected by centrifugation for 5 min at 3,000g and resuspended in RIPA buffer (150 mM NaCl / 50 mM Tris-HCl, pH 8.0/ 1 mM EDTA / 1% sodium deoxycholate / 0.1% SDS / 1% Triton X-100 / lx Complete Protease Inhibitor Cocktail).
- lysis buffer 5 mM PIPES pH 8.0 / 85 mM KC1 / 0.5% NP-40 / l
- the magnetic beads were washed 5 times for lOmin at 4°C on a rotating platform with 1ml wash buffer (100 mM Tris pH 7.5 / 500 mM LiCl / 1% NP-40 / 1% Sodium deoxycholate) and washed once with TE (10 mM Tris pH 7.5 / 0.1 mM EDTA). After washing, the washed beads were eluted by heating for 2hr at 65 °C in elution buffer (1% SDS / 0.1 M NaHC03) with proteinase K. ChIP DNA were purified and concentrated using the QIAquick PCR Purification Kit (Qiagen). siRNA and Plasmid Transfection
- Total SON siRNAs directed against human SON (siSON #1 : GCAUUUGGCCCAUCUGAGAtt, (SEQ ID NO:81) Ahn et al, 2011 ; siSON #2: UGAGCGCUCUAUGAUGUCAtt, (SEQ ID NO:82) Lu et al, 2013), human SON E-specific siRNA (siSON E: CACCGGAGCUUGGAAAUUAtt (SEQ ID NO:83)), and negative control siRNA (UAACGACGCGACGACGUAAtt (SEQ ID NO:84)) were custom synthesis products by Life Technologies (Silencer Select siRNA).
- 0.5 ⁇ 10 6 cells were nucleofected with 100 - 200 pmol of siRNA or 5 ⁇ g of plasmid using Cell Line Nucleofector Kit V (Lonza) according to the manufacturer's instructions.
- MV4;11 cells 0.5x 106 cells were nucleofected with 150 - 200 pmol of siRNA using Cell Line Nucleofector Kit L (Lonza) according to the manufacturer's instructions.
- human CD34+ bone marrow cells 0.4x 106 cells were nucleofected with 80 - 150 pmol of siRNA or 4 ⁇ g of plasmid using Human CD34+ Cell Nucleofector Kit (Lonza) according to the manufacturer's instructions.
- Human embryonic kidney (HEK) 293 cells were transiently transfected with 5 ⁇ g of plasmids using PEL
- ChlP-sequencing data files were aligned to the hgl9 human reference genome using Bowtie (version 0.12.9) and standard parameters. Peak calling was performed with MACS vl .4.2 software using default parameters. To identify high confidence SON binding peaks, the MACS peak calling output from two different experimental samples were used. BigWig files were generated by first extending the 5' ends of uniquely aligned, non- duplicate ChlP-seq reads by the average DNA fragment length (150 bp for histone marks, 250 bp for transcription factors) in the 3' direction using BEDtools. Identified peaks were then annotated to the nearest transcription start site.
- FDR false discovery rate
- TD tag density
- the genomic distributions of binding sites were analyzed using the cis-regulatory element annotation system (CEAS v 1.0.2).
- the genes closest to the binding site on both strands were classified into functional categories such as promoter (from - lkb to +100bp), 5' UTR, first exon, first intron, exon, intron, 3' UTR, and intergenic region.
- the genes were also divided into defined groups according to the enrichment of the SON across transcription start site regions (5 kb window surrounding the transcription start site).
- Tag density heatmap (FIG.
- FIG. 2F were generated using IGV tools and were viewed in Integrative Genomics Viewer (IGV) (http://www.broadinstitute.org/igv/).
- IGV Integrative Genomics Viewer
- Previously published ChlP-seq data for H3K27ac, H3K4me3, and H2A.Z from K562 (downloaded from ENCODE/Broad Institute) and MNase- seq data for nucleosome position from K562 (downloaded from ENCODE/Stanford/BYU) were analyzed. Enriched ChlP-seq regions at promoters (5 kb window surrounding the transcription start site) for SON and histone marks were combined together to generate a unified track consisting of all merged enriched regions.
- Nuclear extracts were prepared from control, siRNA- or plasmid- transfected K562 or MV4; 1 1 cells using the Dignam protocol (Dignam et al., 1983). In brief, harvested cells were resuspended in three packed cell volumes of buffer A (10 mM HEPES pH 7.9 / 1.5 mM MgCl 2 / 10 mM KC1 / 1 mM DTT / 0.1% NP-40 / Protease Inhibitor Cocktail) and homogenized using needle and syringe with 25 to 30 gentle strokes.
- buffer A (10 mM HEPES pH 7.9 / 1.5 mM MgCl 2 / 10 mM KC1 / 1 mM DTT / 0.1% NP-40 / Protease Inhibitor Cocktail
- Lysed cells were centrifuged at 13,000xg for 10 min and the nuclei pellet was resuspended in two packed volumes of buffer C (20 mm HEPES pH 7.9 / 420 mM KC1 / 1.5 mM MgCl 2 / 1 mM DTT / 25% glycerol / Protease Inhibitor Cocktail).
- the nuclei suspension was gently stirred for 30 min at 4°C and centrifuged 15 min at 13,000 g to remove debris.
- Sufficient volume of buffer D (20 mM HEPES pH 7.9 / 0.5 mM DTT / 25% glycerol / Protease Inhibitor Cocktail) was added to the nuclei extract.
- nuclear extracts were pre-cleared with protein A-sepharose beads (Life Technologies) for 1 hour and incubated either with rabbit IgG or SON antibody at 4°C overnight on a rotator. Beads were washed four times with wash buffer (20 mM HEPES pH 7.9 / 150 mM NaCl / 0.05% (v/v) NP-40) and eluted by boiling in SDS buffer and analyzed by SDS-PAGE.
- wash buffer (20 mM HEPES pH 7.9 / 150 mM NaCl / 0.05% (v/v) NP-40
- HEK293 cells were co- transfected with V5 and HA-tagged plasmids encoding wild type SON F or its alternative spliced variant SON E and either Flag-tagged menin plasmid or empty vector.
- Cells were lysed with lysis buffer (50 mM Tris-HCl, pH 8.0 / 150 mM NaCl / 0.5% NP-40 / 10% glycerol / Protease Inhibitor Cocktail / 50 U/mL of Benzonase nuclease (Sigma)) for 1 hour.
- lysis buffer 50 mM Tris-HCl, pH 8.0 / 150 mM NaCl / 0.5% NP-40 / 10% glycerol / Protease Inhibitor Cocktail / 50 U/mL of Benzonase nuclease (Sigma)
- the minigene construct containing TUBG1 exon 7- intron 7- exon 8 were used to examine SON-mediated splicing as described previously (Ahn et al., 201 1). Briefly, HeLa cells in 6 well plates were transfected with 100 pmol of control siRNA or SON siRNA. The next day, minigene alone or minigene plus various amounts of SON F or SON E constructs (siRNA-resistant form) were transfected as indicated in each experiment. RNAs were isolated 30h after minigene transfection, and RT-PCR was performed using forward primer targeting TUBG1 exon 7 and the reverse primer targeting the bovine growth hormone (BGH) terminator sequence present in the pcDNA vector.
- BGH bovine growth hormone
- the transcription terminating site of the primary transcript of SON E was determined by 3' RACE PCR.
- K562 total RNA was reverse transcribed by Superscript III First Strand Synthesis Kit (Life Technologies) with the adapter oligo-dT primer.
- the first PCR was performed using the first adapter primer (Adapter Reverse Parti) and SON E primer (SON El), which specifically bind with SON exon 5a region.
- a second PCR was achieved using the second adapter primer (Adapter Reverse Part2) and SON E primer (SON E2). PCR products were visualized on a 1% agarose gel and cDNA fragments were cloned and sequenced.
- the primer sets for RACE PCR are listed in TABLE 3.
- the bone marrow mononuclear cells and/or peripheral blood mononuclear cells from AML patients were obtained from the Stem Cell and Xenotransplantation Core Facility of the University of Pennsylvania.
- Peripheral blood samples from additional 5 patients diagnosed with AML or MDS (P12— PI 6) and healthy donors (N5— Nl 1) were obtained from the BioBank of Mitchell Cancer Institute, University of South Alabama. Cells were purified by Ficoll-Paque (GE Healthcare) density-gradient centrifugation and frozen as viable cells. Details of the patient samples were listed in TABLE 2.
- Mouse leukemic blasts were obtained from C57BL/6 mice with AML1- ET09a-induced leukemia generated by retroviral transduction-transplantation (Yan et al., 2006).
- MLL-AF9 leukemic blasts were obtained from the spleen and bone marrow of MLL- AF9a knock-in mice (Corral et al., 1996).
- Normal mouse Lin-/c-Kit+ bone marrow cells were prepared by Lineage Cell Depletion Kit and CD1 17 MicroBeads (Miltenyl Biotec).
- Lentiviral vector for SON E overexpression was prepared by subcloning SON E cDNAs into the pCDH-MCS-T2A-copGFP-MSCV vector (System Bioscience). Lentivirus was produced by cotransfection of HEK 293T cells with expression plasmid, pMDLg/pRRE, pRSV-REV, and pVSVG. Viral supernatants were collected after 48 h and clarified by filtration before use. Ultracentrifugation was performed for lentivirus concentration with the Optima L-100 XP centrifuge (Beckman) using an SW55TI rotor (Beckman) at 19,400 rpm for 2 hr at 20°C. Supernatant was completely removed and virus pellets resuspended in PBS.
- Mouse total bone marrow cells were transduced with recombinant lentivirus in the 4 ug/ml polybrene. Infected cells were incubated overnight in the above mouse BM culture media. After 3 days of culture, 2 x 10 4 cells were plated in methylcellulose medium (Methocult GF M3434, StemCell Technologies). Colony number counting and re- plating were repeated every 7-10 days. The colony-forming units (CFUs) were quantified as the average and standard deviation of at least triplicate determinations.
- CFUs colony-forming units
- Histone methylase MLLl has critical roles in tumor growth and angiogenesis and its knockdown suppresses tumor growth in vivo. Oncogene 32, 3359-3370.
- MLL-rearranged leukemia is dependent on aberrant H3K79 methylation by DOT1L. Cancer cell 20, 66-78.
- RNAi screen reveals determinants of human embryonic stem cell identity. Nature 468, 316-320.
- An M11-AF9 fusion gene made by homologous recombination causes acute leukemia in chimeric mice: a method to create fusion oncogenes. Cell 85, 853-861.
- SON connects the splicing-regulatory network with pluripotency in human embryonic stem cells. Nature cell biology 15, 1 141-1 152.
- Gene expression profiling identifies BAX-delta as a novel tumor antigen in acute lymphoblastic leukemia. Cancer Res 65, 10050- 10058.
- COMPASS a complex of proteins associated with a trithorax-related SET domain protein. Proceedings of the National Academy of Sciences of the United States of America 98, 12902-12907.
- the menin tumor suppressor protein is an essential oncogenic cofactor for MLL-associated leukemogenesis. Cell 123, 207-218.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Immunology (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Molecular Biology (AREA)
- Pathology (AREA)
- Analytical Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Hematology (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Urology & Nephrology (AREA)
- Hospice & Palliative Care (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Physics & Mathematics (AREA)
- Oncology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Cell Biology (AREA)
- Biophysics (AREA)
- Food Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- General Physics & Mathematics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Embodiments of the present invention relate to the diagnosis of hematopoietic malignancies. Some embodiments include methods and kits for the diagnosis of hematopoietic malignancies, such as acute myeloid leukemia. Some embodiments relate to the presence of alternatively-spliced isoforms of a SON gene.
Description
METHODS AND COMPOSITIONS FOR THE DIAGNOSIS
OF ACUTE MYELOID LEUKEMIA
STATEMENT REGARDING FEDERALLY SPONSORED R&D
[0001] This invention was made with government support under CA 190688 and CA185818 awarded by the National Institutes of Health. The government has certain rights in the invention.
REFERENCE TO SEQUENCE LISTING
[0002] The present application is being filed along with a Sequence Listing in electronic format. The Sequence Listing is provided as a file entitled USA.018WO_SEQLIST, created November 30, 2016, which is approximately 183 Kb in size. The information in the electronic format of the Sequence Listing is incorporated herein by reference in its entirety.
FIELD OF THE INVENTION
[0003] Embodiments of the present invention relate to the diagnosis of hematopoietic malignancies. Some embodiments include methods and kits for the diagnosis of hematopoietic malignancies, such as acute myeloid leukemia. Some embodiments relate to the presence of alternatively-spliced isoforms of a SON gene.
BACKGROUND OF THE INVENTION
[0004] Methyl ation of lysine residues of histone H3 is an event dictating active or repressed status of chromatin. Tri-methylation of histone 3 lysine 4 (H3K4me3) near transcription start sites is associated with active transcription (Barski et al., 2007; Guenther et al., 2007), and in mammals, this modification is mediated by the SET1 and mixed lineage leukemia (MLL) family methyltransferases, SET1A, SET1B, and MLLl^t (Miller et al., 2001 ; Shilatifard, 2012). The SET1/MLL proteins are associated with multiple subunit proteins, such as WDR5, ASH2L and RBBP5, to acquire a maximum activity in methylation of H3K4 (Cao et al., 2010; Dou et al., 2006; Ernst and Vakoc, 2012). The N-terminal portion
of the MLL1/2 protein interacts with the scaffold protein menin, facilitating LEDGF interaction and chromatin binding of the MLL complex (Yokoyama and Cleary, 2008). The MLL-menin interaction is required for leukemia-associated target gene expression (Yokoyama et al., 2005), suggesting that this interaction is involved in activating oncogenic MLL-target genes. Pharmacological inhibition of the MLL-menin interaction has been shown to effectively block leukemia progression (Borkin et al., 2015) and prostate cancer growth (Malik et al., 2015), indicating that MLL-menin interaction could be a promising target for cancer therapy. However, cellular factors that regulate MLL-menin interaction and MLL complex assembly are largely unknown.
SUMMARY OF THE INVENTION
[0005] Some embodiments of the methods and compositions provided herein include a method of diagnosing a hematopoietic malignancy or a likelihood of developing a hematopoietic malignancy in a test subject comprising: determining a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene, and the long polypeptide is encoded by a full length transcript of the SON gene.
[0006] Some embodiments of the methods and compositions provided herein include a method of detecting a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject comprising: contacting a nucleic acid or polypeptide sample from the test subject with an agent which indicates the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject; contacting the nucleic acid or polypeptide sample with an agent which indicates the level of the long polypeptide or a nucleic acid encoding the long polypeptide in the test subject; and determining the ratio for the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of the long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test
subject, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene, and the long polypeptide is encoded by a full length transcript of the SON gene. In some embodiments, the ratio is greater than about 30%, 40%, or 50%.
[0007] Some embodiments of the methods and compositions provided herein include a method of determining the level of a short polypeptide or a nucleic acid encoding the polypeptide in a nucleic acid or polypeptide sample from a test subject comprising: contacting the nucleic acid or polypeptide sample from the test subject with an agent which indicates the level of the short polypeptide or which indicates the level of a nucleic acid encoding the short polypeptide in the sample; and determining the level of the short polypeptide or a nucleic acid encoding the short polypeptide in the sample, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
[0008] Some embodiments of the methods and compositions provided herein include a method of diagnosing a hematopoietic malignancy or a likelihood of developing a hematopoietic malignancy in a test subject comprising: determining the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a sample obtained from a test subject, wherein the short polypeptide is encoded by a truncated alternatively- spliced transcript of a SON gene.
[0009] Some embodiments also include comparing the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a sample obtained from the test subject with the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a subject not having a hematopoietic malignancy.
[0010] Some embodiments also include determining the level of a long polypeptide or a nucleic acid encoding the long polypeptide in the test subject, wherein the long polypeptide is encoded by a full length transcript of the SON gene.
[0011] Some embodiments also include determining a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the sample from the test subject. In some embodiments, the ratio is greater than about 30%, 40%, or 50%.
[0012] Some embodiments also include determining the level of an additional polypeptide in addition to the level of the short form of the SON polypeptide or the level of a nucleic acid encoding the additional polypeptide in addition to the level of a nucleic acid encoding the short form of the SON polypeptide in a test subject, wherein the additional polypeptide is encoded by a gene selected from CDKN1A, GFI1 and ATF3.
[0013] Some embodiments also include comparing the level of an additional polypeptide or the level of a nucleic acid encoding the additional polypeptide in the test subject with the level of an additional polypeptide or the level of a nucleic acid encoding the additional polypeptide in a subject not having a hematopoietic malignancy.
[0014] In some embodiments, the truncated alternatively-spliced transcript of a SON gene includes a SON E variant of a SON gene, and/or a SON B variant of a SON gene. In some embodiments, the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10. In some embodiments, the short polypeptide comprises SEQ ID NOs:88 or 90. In some embodiments, the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
[0015] In some embodiments, the full length transcript of the SON gene comprises a sequence which specifically binds to an oligonucleotide comprising a sequence selected from SEQ ID NOs: 13-18. In some embodiments, the long polypeptide comprises SEQ ID NO:86. In some embodiments, the nucleic acid encoding the long polypeptide comprises SEQ ID NO:85.
[0016] In some embodiments, the SON gene is a human SON gene.
[0017] Some embodiments also include obtaining a sample from the test subject.
[0018] Some embodiments also include contacting the sample with an agent which specifically binds to the short polypeptide or a nucleic acid encoding the short polypeptide.
[0019] Some embodiments also include contacting the sample with an agent which specifically binds to the long polypeptide or a nucleic acid encoding the long polypeptide.
[0020] In some embodiments, the agent is a primer or a hybridization probe. In some embodiments, the primer or the hybridization probe comprises an oligonucleotide comprising SEQ ID NOs:06, 10 or 13-18
[0021] In some embodiments, the agent is an antibody or antigen-binding fragment thereof which specifically binds to the short polypeptide. In some embodiments, the agent is an antibody or antigen-binding fragment thereof which specifically binds to the long polypeptide.
[0022] In some embodiments, the agent is attached to a solid support.
[0023] In some embodiments, the sample comprises nucleic acids or polypeptides.
[0024] Some embodiments also include determining an increased level of a polypeptide encoded by a SON target gene or a nucleic acid encoded by a SON target gene in the sample from the test subject, wherein the SON target gene has a SON binding site. In some embodiments, the SON target gene is selected from the group consisting of FOX03, NOTCH2NL, SRC, GADD45A, CDKN1A, ATF3, GFI1, EGR1, SRC, and GFI1B.
[0025] Some embodiments also include determining an increased level of a MLL multi-protein complex in the sample from the test subject, wherein the MLL is selected from the group consisting of MLL1, MLL-N, MLL-C, and MLL2. In some embodiments, the multiprotein complex comprises a protein selected from menin, ASH2L, LEDGF, and WDR5.
[0026] Some embodiments also include determining an increased replating activity of a hematopoietic progenitor cell of the sample from the test subject.
[0027] In some embodiments, the hematopoietic progenitor cell is a bone marrow cell.
[0028] Some embodiments also include determining an increased binding of a protein at the or adjacent to the transcription start site of a SON target gene in the sample from the test subject. In some embodiments, the protein is selected from the group consisting of MLL1, MLL-N, MLL-C, MLL2, WDR5, ASH2L, menin, SET1A, SET1B, ASC2, and SUZ12. In some embodiments, the SON target gene is selected from the group consisting of CDKN1A, GADD45A, NOTCH2NL, ATF3, GFI1, EGR1, and SRC.
[0029] Some embodiments also include determining an increased level in H3K4me3 in the sample from the test subject. In some embodiments, the level in H3K4me3 is increased at the or adjacent to a SON target gene selected from the group consisting of CDKN1A, GADD45A, NOTCH2NL, ATF3, GFI1, EGR1, and SRC.
[0030] In some embodiments, the sample comprises bone marrow mononuclear cells (BM-MNCs), or peripheral blood mononuclear cells (PB-MNCs).
[0031] In some embodiments, the hematopoietic malignancy is selected from acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS). In some embodiments, the hematopoietic malignancy is an acute myeloid leukemia (AML) subtype selected from M2 without t(8;21), and t(8;21)(q22;q22).
[0032] In some embodiments, the test subject is mammalian. In some embodiments, the test subject is human.
[0033] Some embodiments of the methods and compositions provided herein include a method of diagnosing a hematopoietic malignancy or a likelihood of developing a hematopoietic malignancy in a test subject comprising: detecting increased levels of binding of a short polypeptide with a nucleic acid having a SON binding site in a sample from a test subject, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
[0034] In some embodiments, the truncated alternatively-spliced transcript of a SON gene includes a SON E variant of a SON gene, and/or a SON B variant of a SON gene. In some embodiments, the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10. In some embodiments, the short polypeptide comprises SEQ ID NOs:88 or 90. In some embodiments, the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
[0035] In some embodiments, the hematopoietic malignancy is selected from acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS). In some embodiments, the hematopoietic malignancy is an acute myeloid leukemia (AML) subtype selected from M2 without t(8;21), and t(8;21)(q22;q22).
[0036] In some embodiments, the subject is mammalian. In some embodiments, subject is human.
[0037] Some embodiments of the methods and compositions provided herein include a method of ameliorating a hematopoietic malignancy comprising: increasing the level of a full length SON polypeptide or a nucleic acid encoding a full length SON polypeptide in a cell of a subject.
[0038] In some embodiments, the expression level of a nucleic acid encoding said full length SON polypeptide or the expression level of said full length SON polypeptide is increased by administering a composition comprising a nucleic acid to the subject.
[0039] In some embodiments, the nucleic acid comprises a sequence encoding a full length SON polypeptide. In some embodiments, the SON polypeptide is a human SON polypeptide. In some embodiments, the nucleic acid comprises SEQ ID NO.:85. In some embodiments, the polypeptide comprises SEQ ID NO.:86.
[0040] In some embodiments, the hematopoietic malignancy is selected from acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS). In some embodiments, the hematopoietic malignancy is an acute myeloid leukemia (AML) subtype selected from M2 without t(8;21), and t(8;21)(q22;q22).
[0041] In some embodiments, the subject is mammalian. In some embodiments, the subject is human.
[0042] Some embodiments of the methods and compositions provided herein include a kit for the diagnosis of a hematopoietic malignancy in a test subject comprising: an agent that specifically binds to a short polypeptide or a nucleic acid encoding the short polypeptide, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
[0043] Some embodiments also include an agent that specifically binds to a long polypeptide or a nucleic acid encoding the long polypeptide, wherein the long polypeptide is encoded by a full length transcript of the SON gene.
[0044] Some embodiments also include an agent that specifically binds to an additional polypeptide in addition to the short polypeptide or a nucleic acid encoding the additional polypeptide in addition to the nucleic acid encoding the short polypeptide in the test subject, wherein the additional polypeptide is encoded by a gene selected from CDKN1A, GFI1 and ATF3.
[0045] In some embodiments, the truncated alternatively-spliced transcript of a SON gene includes a SON E variant of a SON gene, and/or a SON B variant of a SON gene.
[0046] In some embodiments, the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10. In some embodiments, the short polypeptide comprises SEQ ID NOs:88 or 90. In some embodiments, the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
[0047] In some embodiments, the full length transcript of the SON gene comprises a sequence which specifically binds to an oligonucleotide comprising a sequence selected from SEQ ID NOs: 13-18. In some embodiments, the long polypeptide comprises SEQ ID NO:86. In some embodiments, the nucleic acid encoding the long polypeptide comprises SEQ ID NO:85.
[0048] In some embodiments, the SON gene is a human SON gene.
[0049] Some embodiments also include an agent that specifically binds to a SON target gene, wherein the SON target gene has a SON binding site. In some embodiments, the SON target gene is selected from the group consisting of FOX03, NOTCH2NL, SRC, GADD45A, CDKN1A, ATF3, GFI1, EGR1, SRC, and GFI1B.
[0050] Some embodiments also include an agent that specifically binds to a protein selected from MLL1, MLL-N, MLL-C, MLL2, menin, ASH2L, LEDGF, WDR5, SET1A, SET1B, ASC2, SUZ12, and H3K4me3.
[0051] In some embodiments, the agent is a primer or a hybridization probe. In some embodiments, the primer or the hybridization probe comprises an oligonucleotide comprising SEQ ID NOs:06, 10, or 13-18. In some embodiments, the agent is an antibody or antigen-bind fragment thereof which specifically binds to the short polypeptide. In some embodiments, the agent is attached to a solid support.
BRIEF DESCRIPTION OF THE DRAWINGS
[0052] FIG. 1A depicts a schematic summarizing chromatin-immunoprecipitation (ChIP) using two different SON antibodies (Abs) and DN A- sequencing to determine overlap
peaks. SON-N and SON-C Abs specifically bind to the N- and the C-terminus of SON, respectively.
[0053] FIG. IB depicts genomic distribution of SON-binding sites determined by SON-N and SON-C ChIP overlap peaks. The pie graphs show the percentage of peaks located at specific genomic regions indicated (transcription start site, transcription start site).
[0054] FIG. 1C depicts a Venn diagram showing the number of SON ChIP peaks (SON-N, SON-C and overlap) within ±5 kb from the transcription start site (Top). Pie graph illustrating genomic locations of overlap peaks within ±5kb from the transcription start site (Bottom).
[0055] FIG. ID depicts a heat map showing the overlap peak signal of SON ChIP around the transcription start site of genes.
[0056] FIG. IE depicts the top five DNA sequence motifs identified in the SON- N and SON-C overlap peaks within ± 5 kb of the transcription start site.
[0057] FIG. IF depicts gene ontology (GO) term enrichment analysis using DAVID for the genes in which SON-N and SON-C overlap peaks were found within ±5 kb from the transcription start site.
[0058] FIG. 1G depicts the result of a pilot ChlP-seq study to verify specificity of two SON antibodies, SON-N Ab and SON-C Ab. The numbers of ChlP-seq peaks are summarized.
[0059] FIG. 1H depicts a Venn diagram showing number of SON peaks determined by ChlP-seq using SON-N Ab and SON-C Ab.
[0060] FIG. II depicts top five motifs enriched in SON-binding sites analyzed by HOMER.
[0061] FIG. 2A depicts qPCR analyses of SON ChlP-seq target genes in K562 cells transfected with control siRNA and two different SON siRNAs. GFI1B served as a negative control which does not have SON binding sites near the transcription start site. Values represent mean ± SD of four independent experiments. *p < 0.01.
[0062] FIG. 2B depicts average signal profiles of indicated histone modifications and CpG islands around the SON-binding sites.
[0063] FIG. 2C depicts Integrative Genomics Viewer (IGV) images representing SON (SON-N), H3K4me3, and H3K27ac ChlP-seq read counts at the target gene locus in K562 cells.
[0064] FIG. 2D depicts close-up images of ChlP-seq peaks of SON-N, SON-C, H3K4me3, and H3K27ac in representative SON target genes.
[0065] FIG. 2E depicts SON ChlP-qPCR analysis (with SON-N Ab) confirming SON enrichment near the transcription start site of indicated genes in K562 cells. The number in the parenthesis indicates the base-pair counts from the transcription start site of each gene, and the primers for qPCR (TABLE 3) were designed to detect SON enrichment around these positions. *p-values <0.01.
[0066] FIG. 2F depicts HA ChlP-qPCR analysis measuring DNA-binding ability of FLA-tagged full-length SON and several deletion mutants in K562 cells; Full-length SON (SON F), potential DNA-binding region-deleted SON (SON ΑΌΒ, amino acids 1 ,263 - 1,818 deleted), G-patch-deleted SON (SON AG-patch), and double-stranded RNA binding motif- deleted SON (SON ADSRM). Potential DNA-binding region that has been shown to interact with human hepatitis B virus genome (Sun et al., 201 1) is indicated. Empty vector transfected sample was used as a control. *p-values <0.01.
[0067] FIG. 2G depicts a Western blot that confirmed knockdown efficiency of two different siRNAs (#1 and #2) against SON in K562 cells.
[0068] FIG. 2H depicts average signal profiles of H3K4mel and H3K27ac around the intergenic SON-binding sites.
[0069] FIG. 21 depicts an IGV browser image of density profiles of SON (SON-N Ab ChIP), CpG islands, H3K4me3 and H3K27ac at several examples of SON target genes. Scale bar lOkb.
[0070] FIG. 2J depicts close-up images of ChlP-seq peaks of SON-N, SON-C, H3K4me3, H3K27ac, H2A.Z and MNase-seq (nucleosome) peaks in representative SON target genes. Scale bar,lkb.
[0071] FIG. 3A depicts ChlP-qPCR analyses of various histone modification levels at the SON-binding regions near the transcription start sites of the indicated genes upon SON knockdown in K562 cells. Histone H3 ChlP-qPCR was used as a control.
[0072] FIG. 3B depicts ChlP-qPCR analyses MLL and SET1 complex protein recruitment (MLL-N; MLL1 N-terminus region, MLL-C; MLL1 C-terminus region, MLL2, WDR5, ASH2L, menin, SET1A, SET1B and ASC2) to the regions near the transcription start sites of the indicated genes upon SON knockdown. Recruitment of SUZ12, a polycomb complex protein, was also examined, and H3 ChIP was done as a control. Depletion of SON at the transcription start site of indicated target regions upon SON siRNA-transfection was verified by SON ChIP. Signals for each experiment were represented as percentage of input chromatin. Results are expressed as mean ± SD of three biological replicates. * p < 0.01.
[0073] FIG. 3C depicts SON Depletion Leads to H3K4me3 Modification and MLL Complex Recruitment. ChlP-qPCR analysis of two SON target genes (CDKN1A and GADD45A), were conducted using indicated antibodies in K562 cells transfected with control or SON siRNA-#2. ChlP-qPCR results are plotted as percentage of input DNA. *p- values <0.05.
[0074] FIG. 3D depicts SON Depletion Leads to H3K4me3 Modification and MLL Complex Recruitment. ChlP-qPCR analysis of two non-targets (GFI1B and ATF4; B), were conducted using indicated antibodies in K562 cells transfected with control or SON siRNA-#l . ChlP-qPCR results are plotted as percentage of input DNA. *p-values <0.05.
[0075] FIG. 4A depicts a Western blot verified SON knockdown by SON siRNA transfection in K562 cells.
[0076] FIG. 4B depicts a co-immunoprecipitation experiment with MLL-N antibodies.
[0077] FIG. 4C depicts a co-immunoprecipitation experiment with WDR5 antibodies.
[0078] FIG. 4D depicts a co-immunoprecipitation experiment with LEGF antibodies.
[0079] FIG. 4E depicts a co-immunoprecipitation experiment with SET1A antibodies.
[0080] FIG. 4F depicts interaction of SON with menin. K562 nuclear extracts were immunoprecipitated with control IgG or SON antibody (SON-N Ab) and several components of the MLL complex were examined by Western blot.
[0081] FIG. 4G depicts verification of the SON-menin interaction. HEK 293 cells transfected with HA-SON, Flag-menin or pcDNA3-control as indicated were used for co- immunoprecipitation with HA antibody followed by Western blotting with HA or Flag antibodies.
[0082] FIG. 4H depicts an immunoprecipitation experiment in K562 cells transfected with V5-tagged SON indicates that SON outcompetes MLL (MLL-N) for menin interaction in a dose-dependent manner. For plasm id transfection, two different amounts of SON-V5 construct, 5 μg (+) or 10 μg (++), were used.
[0083] FIG. 41 depicts an experiment in which nuclear extracts from control and SON knockdown-K562 cells were size- fractionated on FPLC and analyzed for MLL1 N- terminus (MLL-N) and menin distribution by Western blotting using indicated antibodies (bottom panels). Top panels are elution profiles: fraction numbers on elution profile (top) indicate MLL-N-enriched fractions (left panel of WB, fractions 12-19) and trace on elution profile (top) indicates further downstream fractions (right panel of WB, fractions 20-41).
[0084] FIG. 4J depicts a Western blot confirmed knockdown efficiency of SON siRNA in MV4; 11 cells.
[0085] FIG. 4K depicts nuclear extracts from MV4; 11 transfected by control and SON siRNA were used for immunoprecipitation with MLL-N antibody and Western blotted with indicated antibodies. The N-terminus of wild-type MLL1 is marked by an asterisk and the MLL-AF4 fusion protein is marked by a red arrow head.
[0086] FIG. 4L depicts 293T cells transiently co-expressing Flag-MLL-ENL and Myc-menin were transfected with control vector or SON-V5 construct and subjected to co- immunoprecipitation assays using a Flag antibody. Immunoprecipitated Flag-MLL-ENL and Myc-menin were analyzed by Western blotting.
[0087] FIG. 4M depicts expression of target genes of MLL fusion proteins (HOXA9 and MEIS1) or SON protein (CDKN1A, ATF3, and GFI1) measured by real-time qPCR in the control and SON siRNA-transfected MLL-rearranged leukemic cell lines, MV4;11 and ML-2. Note that the genes identified by our SON ChlP-seq in K562 (e.g. CDKN1A, ATF3 and GFI1) were also upregulated upon SON knockdown in MV4;1 1 cells (heterozygous for MLL-AF4) which retain one copy of wild-type MLL. However, these genes
were not changed upon SON knockdown in ML2 cells which do not have wild-type MLL (homozygous for MLL-AF6), suggesting that the negative effect of SON on those target genes is exerted through inhibition of the MLL wild-type complex. Data represent three replicates from three independent experiments. *p values < 0.01.
[0088] FIG. 5A depicts a schematic representation of the SON gene and the SON proteins. Full-length SON (SON F) is generated by alignment of 12 constitutive exons. Inclusion of alternative exons (exon 7a, and exon 5a) produces two different alternatively spliced isoforms, SON B and E. Horizontal arrows indicate the specific position of the primers used in qPCR. Arrowheads, polyadenylation signal sequences; Hatched boxes, untranslated regions.
[0089] FIG. 5B depicts qPCR analysis of specific exon regions of SON in BM- MNCs of AML patients (P, red dots; n = 10) and healthy normal donors (N, black dots; n = 4). GAPDH was used for normalization. Horizontal bars indicating the median expression level of specific exon regions of SON are presented with the error bars indicating ±SD. *p < 0.05, **p < 0.01.
[0090] FIG. 5C depicts qPCR analysis of specific exon regions of SON in PB- MNCs of AML (n = 8) and MDS (n = 2) patients (P) and healthy normal donors (N; n = 7). GAPDH was used for normalization. Horizontal bars indicating the median expression level of specific exon regions of SON are presented with the error bars indicating ±SD. *p < 0.05, **p < 0.01.
[0091] FIG. 5D depicts relative ratio of SON F, SON B and SON E in the BM- MNCs from AML patients (PI - P10) and healthy normal donors (Nl - N4).
[0092] FIG. 5E depicts relative ratio of SON F, SON B and SON E in the PB- MNCs from AML patients (PI, P5, P9-14), MDS patients (P15 - 16) and healthy normal donors (N5 - Ni l).
[0093] FIG. 5F depicts analyses of SON isoform expression in normal mouse Lin- /c-Kit+ bone marrow (BM) cells and leukemic blasts from mice with AMLl-ET09a- and MLL-AF9-induced leukemia. Specific exon regions indicated in each graph were determined by qPCR with the primer sets indicated in FIG. 5H. Data are represented as mean ± SD of three independent experiments. *p < 0.01.
[0094] FIG. 5G depicts relative ratios of three forms of Son in leukemic blasts (freshly isolated from the animals and also collected after methylcellulose culture) and normal Lin-/c-Kit+ mouse BM cells were determined.
[0095] FIG. 5H depicts a schematic representation of the mouse Son genes and the Son proteins. Similar to human SON, full-length mouse Son (Son f; mouse Son isoform 1) is generated by alignment of 12 constitutive exons. Inclusion of alternative exons (exon 7a; exons 5a) produces two different alternatively spliced isoforms, Son b (a predicted isoform) and Son e (mouse Son isoform 2). The primer sets used for mouse Son qPCR (presented in FIG. 5F) are indicated with horizontal arrows.
[0096] FIG. 51 depicts a structure of the SON E mRNA and position of PCR primers for 3' RACE PCR to confirm the 3' UTR sequence of SON E. Arrows indicate primers for 1st RACE PCR and arrows indicate primers for 2nd RACE PCR. EtBr-stained agarose gel showed 3' RACE-amplified PCR products (asterisk) that were used for sequencing. Decreased amount of the 3' RACE PCR product in total SON siRNA-transfected K562 cells, compared to control K562 cells, indicates the specificity of 3' RACE PCR.
[0097] FIG. 5J depicts sequencing result of the 3' RACE PCR product from human SON E transcripts.
[0098] FIG. 5K depicts a schematic strategy for measurement of relative ratio of SON isoforms presented in FIG. 5D, FIG. 5E and FIG. 5G. To calculate relative expression level of SON isoforms, each qPCR analysis was performed using same forward primer and two different reverse primers. At the 1st qPCR, the ratio of exon 5a-included transcripts (SON E) and exon 5-included transcripts (total of SON F and SON B) was determined using a common forward primer (F3) and exon-specific reverse primers (R5 or R5a). At the 2nd qPCR, the ratio of exon 7a-included transcripts (SON B) and exon 7-included transcript (SON F) was determined using a common forward primer (F5) and exon-specific reverse primers (R7 or R7a). Based on two qPCR analyses, relative levels of SON F, SON B and SON E were determined.
[0099] FIG. 5L depicts a schematic of genomic structure of full-length SON and SON B. Each bar with the probe set number indicates the specific position of DNA probes used in microarray analysis (Affymetrix U133A microarrays).
[0100] FIG. 5M depicts relative expression levels of SON detected by three different probe sets. The microarray data were from Stegmaier Leukemia Dataset (Stegmaier et al., 2004) from Oncomine database. The values of log2 median-centered intensity detected by indicated probe sets were displayed as a boxplot according to Oncomine output.
[0101] FIG. 5N depicts relative expression levels of SON detected by three different probe sets. The microarray data were from Maia Leukemia Dataset (Maia et al., 2005) from Oncomine database. The values of log2 median-centered intensity detected by indicated probe sets were displayed as a boxplot according to Oncomine output.
[0102] FIG. 6A depicts a qPCR analysis of leukemia-associated SON target genes, CDKN1A, ATF3 and GFI1, in BM-MNCs from healthy normal donors (Nl - N4) and AML patients (P2 - P10). Black Broken lines indicate the average level of each gene in normal donor samples.
[0103] FIG. 6B depicts the effects of total SON knockdown (with SON siRNA-1) and SON E-specific knockdown (with SON E siRNA) on expression of leukemia-associated, ChlP-seq target genes in K562 cells. In addition, previously identified SON target genes regulated by RNA splicing function of SON (TUBG1, HDAC6 and AKT1) were also examined, revealing the effect of SON E on regulation of ChlP-seq target genes, but not RNA splicing target genes. *p < 0.05, **p < 0.01.
[0104] FIG. 6C depicts the effects of total SON knockdown (with SON siRNA-1) and SON E-specific knockdown (with SON E siRNA) on expression of leukemia-associated, ChlP-seq target genes in primary human CD34+ BM cells.
[0105] FIG. 6D depicts effects of SON F and SON E overexpression on SON target gene expression (ChlP-seq target genes and RNA splicing target genes) in human CD34+ BM cells. Error bars represent SD from three independent experiments. *p < 0.05, **p < 0.01.
[0106] FIG. 6E depicts a ChlP-qPCR assay to analyze the H3K4me3 levels at SON-binding sites near the promoter of the CDKN1A and GFI1 genes upon SON F and SON E expression. K562 cells were transfected with indicated siRNA and SON constructs (siRNA-resistant form). For plasmid transfection, 3 μg (+) of SON F constructs plus increasing amounts of SON E construct, 0 (-), 3 (+) or 6 μg (++), or 3 μg of SON E alone
were used as indicated. H3K4me3 antibody or H3 antibody were used for ChlP. Asterisks indicate statistical significances of H3K4me3 reduction, as compared to the sample transfected with SON siRNA alone (the second lane). Inter-peak asterisks indicate statistical significance of the difference between two indicated samples. *p < 0.01.
[0107] FIG. 6F depicts TUBG1 exon 7-8 minigene splicing assay demonstrating that SON E does not interfere with SON F-mediated RNA splicing.
[0108] FIG. 6G depicts the location of the target regions of total SON siRNAs (siRNAs #1 and #2, targeting exon 3) and SON E siRNA (targeting exon 5a). SON siRNA sequences are described in herein.
[0109] FIG. 6H depicts verification of SON F and SON E overexpression in primary human CD34+ bone marrow cells after nucleofection of the expression constructs. Exogenous expression of SON was determined by qPCR using the forward primers (F- hSON-Exonl2 for SON F, F-hSON-Exon4 for SON E) together with a reverse primer targeting the V5 sequence (R-V5-exo) in the plasmid. **p-values <0.01.
[0110] FIG. 61 depicts the expression level of SON F-V5 and SON E-Flag in the K562 cells used for V5-CMP and Flag-ChIP presented in FIG. 6E.
[0111] FIG. 6 J depicts strategies of minigene assay with the TUBG1 exon7- intron 7 - exon8 minigene model (containing SON-dependent splicing sites; Ahn et al., 201 1; Lu et al., 2013). Splicing efficiencies of transfected SON F and SON E (expressed by the siRNA-resistant form of cDNA; Ahn et al., 201 1) were accessed by RT-PCR using a forward primer (F) and a reverse primer (R) to detect unspliced and spliced RNA.
[0112] FIG. 6K depicts RT-PCR results demonstrating minigene splicing efficiencies. The numbers above the each lane indicates the relative amount of exogenous SON F and/or SON E transfected together with the minigene.
[0113] FIG. 7A depicts a schematic diagram of expression constructs used for the experiments presented in panels B— E; V5-tagged SON F (full-length), Flag-tagged SON E, V5-tagged SON E, and the HA-tagged SON C-terminal fragment containing the RS domain and RNA-binding motifs (SR+RB).
[0114] FIG. 7B depicts ChlP-qPCR analysis of SON F-V5 or SON E-Flag binding on target regions near the transcription start site of CDKN1 A. Five μg (+) of SON F-
V5 constructs plus increasing amounts of SON E-Flag, 5 (+) or 10 μg (++) constructs were used for transfection into K562 cells. Shown are representative results of three independent experiments (mean ± S.D. from triplicate). Asterisks indicate statistical significances of SON F-V5 or SON E-Flag enrichment, as compared to the background signal from negative control samples (SON E Flag-only sample in V5-ChlP and SON F-V5-only sample in Flag ChIP). Inter-peak asterisks indicate statistical significance of the difference between two indicated samples. *p < 0.05, **p < 0.01.
[0115] FIG. 7C depicts co-immunoprecipitation (IP) experiments demonstrating the lack of menin-binding ability of SON E. Lysates of HEK 293 cells co-transfected with Flag-menin and SON F-V5 or SON E-V5 were subjected to IP with anti-V5, followed by Western blotting with indicated antibodies.
[0116] FIG. 7D depicts the C-terminus of SON interacts with menin. The HA- tagged SR+RB fragment was expressed in HEK 293 cells with or without Flag-menin, and subjected to HA-IP followed by Western blotting with HA or Flag antibodies.
[0117] FIG. 7E depicts SON E overexpression enhances the interactions between MLL complex components, while SON F overexpression shows inhibitory effects. Empty vector (pcDNA3), SON F-V5, or SON E-Flag were transfected into K562 cells (without depletion of endogenous SON) and nuclear extract was used for immunoprecipitation with MLL-N antibody followed by Western blotting with indicated antibodies.
[0118] FIG. 7F depicts a Western blot verifying overexpression of SON E in primary mouse bone marrow cells infected with lentivirus carrying flag-tagged SON E.
[0119] FIG. 7G depicts number of colony-forming units in serial replating assay using primary mouse bone marrow cells infected with lentivirus carrying the control vector or SON E. The results represent 3 independent experiments. *p < 0.05, **p < 0.01.
[0120] FIG. 7H depicts representative images of control cells and the colony formed by SON E-overexpressing cells in the third round of the replating assay.
[0121] FIG. 71 depicts models for SON and its alternative spliced isoform function in transcription. Under normal conditions, SON binds G/C-rich sequence near the transcription start site to inhibit transcriptional activity. Interaction of the SON C-terminal region with menin diminishes MLL complex assembly and its recruitment to target DNA,
leading to the low level of H3K4me3 and transcriptional repression. Upon SON depletion, MLL complex formation is facilitated, resulting in enhanced recruitment of the MLL complex and subsequent increase of H3K4me3 and promoter activity. In leukemic condition, a high level of short SON isoforms (SON B and SON E) interrupts DNA-binding of full- length SON, but cannot interfere with MLL-menin interaction, resulting in abrogation of the full-length SON function and de-repression/activation of multiple leukemia-associated genes.
[0122] FIG. 7J depicts co-immunoprecipitation of HA-tagged SON SR+RB with Myc-tagged full-length/wild type (WT) or several partial fragments of menin after transient transfections of the constructs into 293T cells as indicated. Immunoprecipitations were performed using an HA antibody and analyzed by Western blotting using a Myc antibody. Wild type and several partial fragments of menin were detected in the precipitates (upper panel, lane 3, 5, 6, and 8; arrowheads). Asterisks indicate IgG.
[0123] FIG. 7K depicts a schematic diagram shows the structure of the menin and its deletion constructs used in FIG. 7J. The region previously shown to interact with MLL is indicated with a bar.
[0124] FIG. 7L depicts a Western blot verified expression of V5-tagged SON F (SON F-V5) and Flag-tagged SON E (SON E-Flag) in K562 cells used for the experiment presented in FIG. 7E.
[0125] FIG. 7M depicts schematic structures of the lentiviral construct for SON E overexpression and the illustration of the experimental design. Bone marrow (BM) cells were isolated from mice, infected with empty lentivirus (Control) or Flag-tagged SON E— expressing lentivirus (SON E) and subjected to the colony forming and serial replating assays. For each round of plating, a total of 2 χ 104 BM cells were cultured in MethoCult (M3434).
[0126] FIG. 7N depicts representative photomicrographs of colonies from control and SON E lentivirus-infected BM cells from the first plating, demonstrating the high efficiency of infection (GFP-positive cells). Scale bars on pictures represent 100 μιη.
DETAILED DESCRIPTION
[0127] Dysregulation of MLL complex-mediated histone methylation plays a pivotal role in gene expression associated with diseases, but little is known about cellular factors modulating MLL complex activity. As described herein, SON, previously known as an RNA splicing factor, has been found to also control MLL complex-mediated transcriptional initiation. SON binds to DNA near transcription start sites, interacts with menin, and inhibits MLL complex assembly, resulting in decreased histone 3 lysine 4 (H3K4me3) and transcriptional repression. Alternatively-spliced short isoforms of SON are markedly upregulated in acute myeloid leukemia. The short isoforms compete with full- length SON for chromatin occupancy, but lack the menin-binding ability, thereby antagonizing full-length SON function in transcriptional repression while not impairing full- length SON-mediated RNA splicing. Furthermore, overexpression of a short isoform of SON enhances replating potential of hematopoietic progenitors. These findings define SON as a fine-tuner of the MLL-menin interaction and reveal short SON overexpression as a marker indicating aberrant transcriptional initiation in leukemia.
[0128] SON is a ubiquitously expressed nuclear protein recently identified as an SR-like splicing co-factor. SON is required for proper RNA splicing of selective genes (Ahn et al., 201 1; Hickey et al., 2014; Lu et al., 2013; Lu et al., 2014; Martello, 2013; Sharma et al., 2011). Knockdown of SON leads to splicing defects in transcripts of other genes containing weak splice sites, and many of the affected genes are necessary for cell cycle progression and epigenetic modification (Ahn et al., 201 1; Sharma et al., 201 1). Interestingly, SON is highly expressed in human embryonic stem cells and is an essential factor in stem cell pluripotency (Chia et al., 2010; Lu et al., 2013). Further RNA-seq analyses revealed that SON knockdown causes intron retention and exon skipping at several pluripotency genes, such as OCT4, PRDM14 and E4F 1 (Lu et al., 2013).
[0129] While the role of SON in RNA-binding and splicing has been highlighted, a few studies have also suggested that SON may function in transcriptional regulation. SON has been implicated in DNA-binding (Mattioni et al., 1992; Sun et al., 2001), and SON suppresses the promoter activity of the miR-23a~27a~24-2 cluster (Ahn et al., 2013). In addition, microarray and RNA-seq experiments have shown that SON knockdown not only
leads to gene downregulation which is mainly due to splicing defects, but also upregulates a substantial number of genes (Ahn et al., 2011; Lu et al., 2013; Sharma et al., 2011), strongly suggesting that SON has a repressive function in gene expression. However, whether SON is directly associated with chromosomal loci in the mammalian genome and how SON regulates transcription is completely unknown. Interestingly, in addition to full-length SON, various splice isoforms of SON have been predicted based on analyses of expressed sequence tags (ESTs) and genomic DNA sequence. Nevertheless, the functional significance of SON isoforms remains unexplored.
[0130] As described herein, SON has an unexpected role in interacting with menin and regulating MLL complex activity, H3K4me3, and transcriptional initiation of multiple leukemia-associated genes. Described herein are significant increases in acute myeloid leukemia in short splice variants of SON, which lack menin-interacting ability, and their functional significance in blocking full-length SON function in transcriptional repression while not impairing full-length SON-mediated RNA splicing.
[0131] Applicant has discovered that an increase in the ratio of the level of a truncated alternatively-spliced transcript of a SON gene, such as SON E and/or SON B transcripts of the SON gene, to the level of a full length transcript of the SON gene, such as a SON F transcript can indicate a subject having a hematopoietic malignancy, or the likelihood of the subject developing a hematopoietic malignancy. Applicant has also discovered that increases in the level of a truncated alternatively-spliced transcript of a SON gene, such as SON E and/or SON B transcripts of the SON gene, can indicate a subject having a hematopoietic malignancy, or the likelihood of the subject developing a hematopoietic malignancy. Applicant has also discovered that an increase in the levels of additional markers, such CDKN1A, GFI1 and ATF3 can also indicate a subject having a hematopoietic malignancy, or the likelihood of the subject developing a hematopoietic malignancy.
Methods for diagnosis
[0132] Some embodiments of the methods and compositions provided herein include methods for the diagnosis of a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy in a test subject. In some embodiments the
hematopoietic malignancy can include acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS). In some embodiments, the hematopoietic malignancy is an AML subtype including M2 without t(8;21), and t(8;21)(q22;q22).
[0133] Some embodiments of the methods and compositions provided herein relate to a short polypeptide or a nucleic acid encoding the short polypeptide. In some embodiments, the short polypeptide is a truncated alternatively-spliced transcript of a SON gene. See for example, TABLE 1. In some embodiments, the truncated alternatively-spliced transcript of a SON gene comprises a SON E variant of a SON gene, or a SON B variant of a SON gene. In some embodiments, the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10. In some embodiments, the short polypeptide comprises SEQ ID NOs:88 or 90. In some embodiments, the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
[0134] Some embodiments of the methods and compositions provided herein relate to a long polypeptide or a nucleic acid encoding the long polypeptide. In some embodiments, the long polypeptide is encoded by a full length transcript of the SON gene. See e.g., TABLE 1. In some embodiments, the full length transcript of the SON gene comprises a sequence which specifically binds to an oligonucleotide comprising a sequence selected from SEQ ID NOs:13-18. In some embodiments, the long polypeptide comprises SEQ ID NO: 86. In some embodiments, the nucleic acid encoding the long polypeptide comprises SEQ ID NO:85.
[0135] In some embodiments, the SON gene is a human SON gene.
[0136] Some embodiments include determining a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject. In some such embodiments, the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene, and the long polypeptide is encoded by a full length transcript of the SON gene.
[0137] Some embodiments include methods of detecting a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a test subject
to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject. Such methods can include obtaining a nucleic acid or polypeptide sample from the test subject; contacting the nucleic acid or polypeptide sample with an agent which indicates the level of the short polypeptide or a nucleic acid encoding the short polypeptide in the test subject; contacting the nucleic acid or polypeptide sample with an agent which indicates the level of the long polypeptide or a nucleic acid encoding the long polypeptide in the test subject; and determining the ratio for the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of the long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject. In such embodiments, the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene, and the long polypeptide is encoded by a full length transcript of the SON gene.
[0138] In the foregoing methods, a ratio of the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject can be indicative of a test subject having a diagnosis for a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy. In some embodiments, the ratio is greater than about 30%, 40%, 50%, 60%, 70%, 80%, and 90%, or any range between any two of the foregoing numbers.
[0139] Some embodiments include methods of detecting an increase in the level of a short polypeptide or a nucleic acid encoding the polypeptide in a test subject. Such methods can include obtaining a nucleic acid or polypeptide sample from the test subject; contacting the nucleic acid or polypeptide sample with an agent which indicates an increase in the level of the short polypeptide or a nucleic acid encoding the short polypeptide in the test subject; and determining the level the short polypeptide or a nucleic acid encoding the short polypeptide in the test subject. In such embodiments, the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
[0140] Some embodiments include methods of diagnosing a hematopoietic malignancy, or the likelihood of developing a hematopoietic malignancy in a test subject. Such embodiments can include determining the level of a short polypeptide or the level of a
nucleic acid encoding the short polypeptide in a subject. In such embodiments, the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
[0141] Some methods also include comparing the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a test subject with the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a subject not having a hematopoietic malignancy. In some embodiments, an increase in the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a subject is indicative of a test subject having a diagnosis for a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy. In some embodiments, the increase is at least about 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 100%, and 200%, or any range between any two of the foregoing numbers, compared to the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a subject not having a hematopoietic malignancy.
[0142] Some methods also include determining the level of a long polypeptide or a nucleic acid encoding the long polypeptide in the test subject, wherein the long polypeptide is encoded by a full length transcript of the SON gene. Some such methods also include determining a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject. In the foregoing methods, a ratio can be indicative of a test subject having a diagnosis for a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy. In some embodiments, the ratio is greater than about 30%, 40%, 50%, 60%, 70%, 80%, and 90%, or any range between any two of the foregoing numbers.
[0143] Some methods also include determining the level of a polypeptide in addition to the level of the short form of the SON polypeptide or the level of a nucleic acid encoding the additional polypeptide in addition to the level of a nucleic acid encoding the short form of the SON polypeptide, wherein the additional polypeptide is encoded by a gene selected from CDKN1A, GFI1 and ATF3. Some methods also include comparing the level of an additional polypeptide or the level of a nucleic acid encoding the additional polypeptide in the test subject with the level of an additional polypeptide or the level of a nucleic acid
encoding the additional polypeptide in a subject not having a hematopoietic malignancy. In some embodiments, the increase is at least about 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 100%, and 200%, or any range between any two of the foregoing numbers, compared to the level of an additional polypeptide or the level of a nucleic acid encoding the additional polypeptide in a subject not having a hematopoietic malignancy.
[0144] Some embodiments also include determining the level of a polypeptide encoded by a SON target gene or a nucleic acid encoded by a SON target gene in a sample from a test subject. In some embodiments, the SON target gene has a SON binding site. In some embodiments, an increased level of a polypeptide encoded by a SON target gene or a nucleic acid encoded by a SON target gene in the sample from the test subject compared to the level of a polypeptide encoded by a SON target gene or a nucleic acid encoded by a SON target gene in a sample from a subject not having a hematopoietic malignancy can be indicative of a diagnosis of a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy in a test subject. In some embodiments, the SON target gene is selected from the group consisting of FOX03, NOTCH2NL, SRC, GADD45A, CDKN1A, ATF3, GFT1 , EGR1 , SRC, and GFT1 B.
[0145] Some embodiments also include determining the level of a MLL multi- protein complex in a sample from a test subject. Some embodiments also include comparing the level of the MLL multi-protein complex in the sample from the test subject with the level of a MLL multi-protein complex in a sample from a sample from a subject not having a hematopoietic malignancy. In some embodiments, an increase in the level of a MLL multi- protein complex in the sample from a test subject can be indicative of a diagnosis of a hematopoietic malignancy, or a likelihood of developing a hematopoietic malignancy in a test subject. In some embodiments, the MLL is selected from the group consisting of MLLl, MLL-N, MLL-C, and MLL2. In some embodiments, the multiprotein complex includes a protein selected from menin, ASH2L, LEDGF, and WDR5.
[0146] Some embodiments also include determining the level of a replating activity of a hematopoietic progenitor cell of a sample from a test subject. Some embodiments also include comparing the level of replating activity of a hematopoietic progenitor cell of the sample from the test subject with the level of replating activity of a
hematopoietic progenitor cell of the sample from the test subject from a subject not having a hematopoietic malignancy. In some embodiments, an increase in the level of replating activity of a hematopoietic progenitor cell of the sample from the test subject can be indicative of a hematopoietic malignancy. In some embodiments, the hematopoietic progenitor cell is a bone marrow cell.
[0147] Some embodiments also include determining the level of binding of a protein at the or adjacent to the transcription start site of a SON target gene in a sample from a test subject. Some embodiments also include comparing the level the level of binding of a protein at the or adjacent to the transcription start site of a SON target gene in a sample from a test subject with the level of binding of a protein at the or adjacent to the transcription start site of a SON target gene in a sample from a subject not having a hematopoietic malignancy. In some embodiments, an increased binding of a protein at the or adjacent to the transcription start site of a SON target gene in the sample from the test subject can be indicative of a hematopoietic malignancy. In some embodiments, the protein can include MLL1, MLL-N, MLL-C, MLL2, WDR5, ASH2L, menin, SET1A, SET1B, ASC2, and SUZ12. In some embodiments, the SON target gene can include CDKN1 A, GADD45A, NOTCH2NL, ATF3, GFI1, EGR1, and SRC.
[0148] Some embodiments also include determining the level in H3K4me3 in the sample from the test subject. Some embodiments also include comparing the level in H3K4me3 in the sample from the test subject with the level in H3K4me3 in the sample from a subject not having a hematopoietic malignancy. In some embodiments an increased level in H3K4me3 in a sample from a test subject can be indicative of a hematopoietic malignancy. In some embodiments, the level in H3K4me3 is increased at the or adjacent to a SON target gene including CDKN1A, GADD45A, NOTCH2NL, ATF3, GFI1, EGR1, and SRC.
[0149] Some embodiments also include a sample from the test subject. In some embodiments, the sample can include nucleic acids or polypeptides. In some embodiments, the sample comprises bone marrow mononuclear cells (BM-MNCs), or peripheral blood mononuclear cells (PB-MNCs).
[0150] Some embodiments also include contacting the sample with an agent which specifically binds to the short polypeptide or a nucleic acid encoding the short
polypeptide. Some embodiments also include contacting the sample with an agent which specifically binds to the long polypeptide or a nucleic acid encoding the long polypeptide. In some embodiments, the agent is a primer or a hybridization probe. In some embodiments, the primer or the hybridization probe comprises an oligonucleotide comprising SEQ ID NOs:06, 10 or 13-18.
[0151] Some embodiments also include contacting the sample with an agent which specifically binds to a SON target gene, such as a SON target gene having a SON binding site. In some embodiments, the SON target includes FOX03, NOTCH2NL, SRC, GADD45A, CDKN1A, ATF3, GFI1, EGR1, SRC, and GFI1B.
[0152] Some embodiments also include contacting the sample with an agent which specifically binds to a protein selected from MLL1, MLL-N, MLL-C, MLL2, menin, ASH2L, LEDGF, WDR5, SET1A, SET1B, ASC2, SUZ12, and H3K4me3.
[0153] In some embodiments, the agent is an antibody or antigen-binding fragment thereof which specifically binds to the short polypeptide.
[0154] In some embodiments, the agent can be immobilized on a solid support. Examples of solid supports include a test well of a microtiter plate, a membrane, such as nitrocellulose membrane, a particle, such as a bead, comprising glass, fiberglass, latex, or a plastic material, such as polystyrene or polyvinylchloride. The solid support can include a magnetic particle. The agent can be immobilized on the solid support by a noncovalent association, such as adsorption, and/or a covalent attachment which can include a direct linkage between the agent and functional groups on the solid support, or may be a linkage by way of a cross-linking agent.
[0155] Methods to determine the level of a nucleic acid, such as a transcript of a SON gene, such as a truncated alternatively-spliced transcript of a SON gene, such as SON B variant, and SON E variant; a full length transcript of a SON gene, such as SON F; and nucleic acids encoded by CDKN1A, GFI1 and ATF3 are also well known in the art. Examples of the foregoing methods include hybridizing a hybridization probe or primer with a nucleic acid. Methods can include PCR, quantative methods of PCR, such as real time PCR, and Northern blot analysis. Techniques for both PCR based assays and hybridization assays are well known in the art {See e.g., Mullis et ah, Cold Spring Harbor Symp. Quant.
Biol., 51 :263, 1987; Erlich ed., PCR Technology, Stockton Press, NY, 1989). In some embodiments, the hybridization probe or primer can include an oligonucleotide comprising any one of SEQ ID NOs:06, 10, 13-18.
[0156] Methods to determine the level of a polypeptide, such as a polypeptide encoded by a truncated alternatively-spliced transcript of a SON gene, such as SON B variant, and SON E variant; a full length transcript of a SON gene, such as SON F; and nucleic acids encoded by CDKN1A, GFI1 and ATF3 are well known in the art. Examples of the foregoing methods include contacting the polypeptide with an agent that that specifically binds to the polypeptide. In some embodiments, the agent comprises an antibody or antigen- binding fragment thereof.
[0157] Some embodiments of the methods and compositions provided herein also include a method of diagnosing a hematopoietic malignancy or a likelihood of developing a hematopoietic malignancy in a test subject that includes detecting increased levels of binding of a short polypeptide with a nucleic acid having a SON binding site in a sample from a test subject, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
Methods of treatment
[0158] Some embodiments of the methods and compositions provided herein also include methods of ameliorating a hematopoietic malignancy. Some such methods include treat, preventing, and/or reducing the symptoms of the hematopoietic malignancy. Some embodiments include increasing the level of a full length SON polypeptide or a nucleic acid encoding a full length SON polypeptide in a cell of a subject. In some embodiments, the expression level of a nucleic acid encoding SON or the expression level of SON protein is increased by administering an isolated nucleic acid to the subject. In some embodiments, the nucleic acid comprises a sequence encoding a full length SON polypeptide. In some embodiments, the SON polypeptide is a human SON polypeptide. In some embodiments, the nucleic acid comprises SEQ ID NO.:85. In some embodiments, the polypeptide comprises SEQ ID NO.: 86. In some embodiments, the hematopoietic malignancy is selected from acute myeloid leukemia (AML) and myelodysplasia syndrome (MDS). In some embodiments, the
hematopoietic malignancy is an acute myeloid leukemia (AML) subtype selected from M2 without t(8;21), and t(8;21)(q22;q22). In some embodiments, the subject is mammalian. In some embodiments, the subject is human.
[0159] Methods to increase the level of a polypeptide or a nucleic acid encoding a polypeptide, such as a full length SON polypeptide, in a cell of a subject are well known in the art. For example, some methods can include administering an expression vector to the subject. The expression vector can include the nucleic acid and a promoter. The promoter can be tissue-specific, constitutive, and/or inducible.
[0160] Some embodiments include pharmaceutical compositions that include the nucleic acid, such as an expression vector including the nucleic acid, and a pharmaceutically effective carrier. Pharmaceutically acceptable carriers are determined in part by the particular composition being administered, as well as by the particular method used to administer the composition. Accordingly, there is a wide variety of suitable formulations of pharmaceutical compositions available (see, e.g., Remington's Pharmaceutical Sciences, 17th ed., 1989).
[0161] The pharmaceutical compositions can be administered by various means known in the art. For example, vectors can be delivered to cells ex vivo, such as cells explanted from an individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy), followed by re-implantation of the cells into a subject, usually after selection for cells which have incorporated the vector. Therapies where cells are genetically modified ex vivo, and then re-introduced into a subject can be referred to as cell-based therapies or cell therapies. In a preferred embodiment, cells are isolated from the subject organism, transduced with an exogenous gene (gene or cDNA) according to the present disclosure, and re-infused back into the subject (e.g., a patient).
[0162] Expression vectors can include plasmid vectors, retroviral vectors, lentiviral vectors, adenovirus vectors, helper-dependent adenovirus, poxvirus vectors; herpesvirus vectors and adeno-associated virus vectors, etc. See, also, U.S. Pat. Nos. 6,534,261; 6,607,882; 6,824,978; 6,933,1 13; 6,979,539; 7,013,219; and 7,163,824, incorporated by reference herein in their entireties. Conventional viral and non-viral based gene transfer methods can be used to introduce nucleic acids encoding a full length SON polynucleotide in cells (e.g., mammalian cells) and target tissues. Non-viral vector delivery
systems include DNA plasmids, naked nucleic acid, and nucleic acid complexed with a delivery vehicle such as a liposome or poloxamer. Viral vector delivery systems include DNA and RNA viruses, which have either episomal or integrated genomes after delivery to the cell. For a review of gene therapy procedures, see Anderson, Science 256:808-813 (1992); Nabel & Feigner, T1BTECH 1 1:211-217 (1993); Mitani & Caskey, T1BTECH 1 1 : 162-166 (1993); Dillon, TIBTECH 1 1 :167-175 (1993); Miller, Nature 357:455-460 (1992); Van Brunt, Biotechnology 6(10):1 149-1 154 (1988); Vigne, Restorative Neurology and Neuroscience 8:35-36 (1995); Kremer & Perricaudet, British Medical Bulletin 51(1):31- 44 (1995); Haddada et al., in Current Topics in Microbiology and Immunology Doerfler and Bohm (eds.) (1995); and Yu et al., Gene Therapy 1 :13-26 (1994). Methods of non-viral delivery of nucleic acids include electroporation, lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, dendrimers, polycation or lipid:nucleic acid conjugates, naked DNA, artificial virions, agent-enhanced uptake of DNA or use of macromolecules such as dendrimers (see Wijagkanalen et al (201 1) Pharm Res 28(7) p. 1500-19). Sonoporation using, e.g., the Sonitron 2000 system (Rich-Mar) can also be used for delivery of nucleic acids.
[0163] Additional exemplary nucleic acid delivery systems include those provided by Amaxa Biosystems (Cologne, Germany), Maxcyte, Inc. (Rockville, Md.), BTX Molecular Delivery Systems (Holliston, Mass.) and Copernicus Therapeutics Inc, (see for example U.S. Pat. No. 6,008,336). Lipofection is described in e.g., U.S. Pat. Nos. 5,049,386; 4,946,787; and 4,897,355) and lipofection reagents are sold commercially (e.g., Transfectam™ and Lipofectin™). Cationic and neutral lipids that are suitable for efficient receptor-recognition lipofection of polynucleotides include those of Feigner, WO 91/17424, WO 91/16024.
[0164] In applications in which transient expression is preferred, adenoviral based systems can be used. Adenoviral based vectors are capable of very high transduction efficiency in many cell types and do not require cell division. With such vectors, high titer and high levels of expression have been obtained. This vector can be produced in large quantities in a relatively simple system. Adeno -associated virus ("AAV") vectors are also used to transduce cells with target nucleic acids, e.g., in the in vitro production of nucleic acids and peptides, and for in vivo and ex vivo gene therapy procedures (see, e.g., West et al.,
Virology 160:38-47 (1987); U.S. Pat. No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); Muzyczka, J. Clin. Invest. 94: 1351 (1994). Construction of recombinant AAV vectors are described in a number of publications, including U.S. Pat. No. 5,173,414; Tratschin et al., Mol. Cell. Biol. 5:3251-3260 (1985); Tratschin, et al., Mol. Cell. Biol. 4:2072-2081 (1984); Hermonat & Muzyczka, PNAS 81 :6466-6470 (1984); and Samulski et al., J. Virol. 63:03822-3828 (1989).
[0165] Gene therapy vectors, such as expression vectors comprising a nucleic acid encoding a full length SON polypeptide, can be delivered in vivo by administration to an individual patient, typically by systemic administration (e.g., intravenous, intraperitoneal, intramuscular, subdermal, or intracranial infusion) or topical application, as described below. Alternatively, vectors can be delivered to cells ex vivo, such as cells explanted from an individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy) or universal donor hematopoietic stem/progenitor cells, followed by reimplantation of the cells into a patient, usually after selection for cells which have incorporated the vector.
[0166] Vectors (e.g., retroviruses, adenoviruses, liposomes, etc.) containing donor construct, such as expression vectors comprising a nucleic acid encoding a full length SON polypeptide, can also be administered directly to an organism for transduction of cells in vivo. Alternatively, naked DNA complexed/formulated with a delivery vehicle (e.g. liposome or poloxamer) can be administered. Administration is by any of the routes normally used for introducing a molecule into ultimate contact with blood or tissue cells including, but not limited to, injection, infusion, topical application and electroporation. Suitable methods of administering such nucleic acids are available and well known to those of skill in the art, and, although more than one route can be used to administer a particular composition, a particular route can often provide a more immediate and more effective reaction than another route.
[0167] Formulations for both ex vivo and in vivo administrations include suspensions in liquid or emulsified liquids. The active ingredients often are mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient. Suitable excipients include, for example, water, saline, dextrose, glycerol, ethanol or the like, and combinations thereof. In addition, the composition may contain minor amounts of
auxiliary substances, such as, wetting or emulsifying agents, pH buffering agents, stabilizing agents or other reagents that enhance the effectiveness of the pharmaceutical composition.
Kits
[0168] Some embodiments of the methods and compositions provided herein include kits for the diagnosis of a hematopoietic malignancy, or likelihood of developing a hematopoietic malignancy in a test subject. Some such embodiments can include an agent that specifically binds to a short polypeptide or a nucleic acid encoding the short polypeptide. In such embodiments, the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene. Some such embodiments can include an agent that specifically binds to a long polypeptide or a nucleic acid encoding the long polypeptide. In such embodiments, the long polypeptide is encoded by a full length transcript of the SON gene.
[0169] Some embodiments also include an agent that specifically binds to an additional polypeptide or a nucleic acid encoding the additional polypeptide in the test subject, wherein the additional polypeptide is encoded by a gene selected from CDKN1A, GFI1 and ATF3.
[0170] In some embodiments, the truncated alternatively-spliced transcript of a SON gene comprises a SON E variant of a SON gene. In some embodiments, the truncated alternatively-spliced transcript of a SON gene comprises a SON B variant of a SON gene. In some embodiments, the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10. In some embodiments, the short polypeptide comprises SEQ ID NOs:88 or 90. In some embodiments, the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
[0171] In some embodiments, the full length transcript of the SON gene comprises a sequence which specifically binds to an oligonucleotide comprising a sequence selected from SEQ ID NOs: 13-18. In some embodiments, the long polypeptide comprises SEQ ID NO:86. In some embodiments, the nucleic acid encoding the long polypeptide comprises SEQ ID NO:85.
[0172] In some embodiments, the SON gene is a human SON gene.
[0173] Some embodiments also include an agent that specifically binds to a SON target gene, wherein the SON target gene has a SON binding site. In some embodiments, the SON target gene can include FOX03, NOTCH2NL, SRC, GADD45A, CDKN1A, ATF3, GFI1, EGR1, SRC, and GFI1B.
[0174] Some embodiments also include an agent that specifically binds to a includingMLLl, MLL-N, MLL-C, MLL2, menin, ASH2L, LEDGF, WDR5, SET1A, SET IB, ASC2, SUZ12, and H3K4me3.
[0175] In some embodiments, the agent is a primer or a hybridization probe. In some embodiments, the primer or the hybridization probe comprises an oligonucleotide comprising SEQ ID NOs:06, 10, or 13-18. In some embodiments, the agent is an antibody or antigen-binding fragment thereof which specifically binds to the short polypeptide. In some embodiments, the agent is attached to a solid support.
EXAMPLES
Genome-wide analyses of SON binding sites revealed SON interaction with G/C-rich sequences near transcription start sites and SON depletion caused activation of SON target genes
[0176] To explore SON function in genome-wide DNA-binding and gene regulation, chromatin immunoprecipitation and sequencing (ChlP-seq) in K562 leukemic cells was performed with two different SON antibodies (SON-N and SON-C Abs) (FIG. 1 A). Through pilot experiments, these antibodies were validated to be suitable for ChIP experiments based on enrichment and reproducibility of peaks (FIG. 1G, FIG. 1H). The results from ChlP-seq with SON-N and SON-C Abs identified the genomic distribution of SON binding sites at both intergenic and intragenic regions, with a significant portion of the intragenic peaks located at promoters and introns (FIG. IB). To focus on SON function near transcription start sites, the ChlP-seq peaks located 5 kb upstream and downstream (± 5kb) from the transcription start site were analyzed, which were mainly localized in the promoter, the 5' UTR, and the first exon or intron of the target genes (FIG. 1C). The heat map and motif analysis confirmed that SON binding sites are indeed enriched near transcription start sites (FIG. ID) and contain repetitive G or C tracts as well as GC dinucleotide repeats (FIG. IE
and FIG. II). The genes bearing SON peaks at their transcription start site have functions in DNA-binding / transcription (e.g. ATF3, GFI1, EGR1, FOX03A), receptor signal transduction (NOTCH2NL, SRC) and cell cycle regulation (CDKN1A, GADD45A) (FIG. IF). Altered expression of these genes has been implicated in perturbed hematopoiesis, leukemia and other cancers (Janz et al., 2006; Joslin et al., 2007; Khandanpour et al., 2013; Liebermann et al., 201 1; Phelan et al., 2010; Viale et al., 2009). Enrichment of SON near transcription start sites of these genes was further confirmed by ChlP-qPCR (FIG. 2E) and deletion of the potential DNA-binding region (Sun et al., 2001) reduced SON interaction with target DNA (FIG. 2F). Interestingly, knockdown of SON by siRNA (FIG. 2G) caused upregulation of SON ChIP target genes (FIG. 2A), indicating that SON interaction with transcription start sites of target genes exerts inhibitory effects on transcription.
SON binding sites near the transcription start site overlap with the locations of the histone modification H3K4me3 and SON depletion leads to increased levels of H3K4me3
[0177] To elucidate the function of SON near the transcription start site, the genomic location of SON peaks were compared with the location of several histone modifications which are involved in regulation of transcriptional initiation. Interestingly, while the regions with intergenic SON peaks were enriched with mono-methylation of histone 3 lysine 4 (H3K4mel) (FIG. 2H), SON peaks near transcription start sites were closely associated with the locations of H3K4me3, a marker of active or potentially active promoters, and CpG islands (FIG. 2B). Concurrence of SON peaks with H3K4me3 peaks and CpG islands were further visualized and confirmed in several SON target genes (FIG. 2C, FIG. 21). The genomic regions bearing SON peaks near transcription start sites show a low level of conventional nucleosomes and the presence of variant histone, H2A.Z, indicating the active chromatin status (FIG. 2J). A close look at the peak regions revealed that the high peaks of H3K4me3 were precisely aligned with the valleys of SON peaks, and vice versa (FIG. 2D, FIG. 2J). These observations suggest that SON may bind to the region between nucleosomes and modify histones within adjacent nucleosomes.
[0178] Extensive ChlP-qPCR analyses revealed that the level of H3K4me3 at SON target sites near the transcription start site was significantly increased (FIG. 3A, FIG.
3C, FIG. 3D), while H3K4me3 at the transcription start site of unrelated genes (non-targets) did not show any significant changes upon SON knockdown (FIG. 3D). The levels of H3K4mel and H3K27ac also showed an increase in a few sites, while H3K27me3 did not show any changes (FIG. 3A, FIG. 3C). Taken together, these data demonstrate that SON functions to lower the level of H3K4me3 near transcription start sites.
SON depletion leads to increased recruitment of MLL complex components to the SON target genes and enhanced MLL complex formation
[0179] To understand the mechanism of the increased H3K4me3 upon SON knockdown, the occupancy of MLL and SET1 complex components at SON target sites near the transcription start site was measured. Interestingly, significant increases in occupancies of MLL (MLL1 N-terminus and C-terminus), MLL2, WDR5, ASH2L and menin at SON target genes were detected in SON-depleted cells (FIG. 3B, FIG. 3C, FIG. 3D). In contrast, recruitment of SET1A, SET1B and ASC2/NCOA6 (a component of MLL3/4 complex) to the SON target sites was not increased upon SON knockdown (FIG. 3B). These results indicate that SON inhibits recruitment of MLL 1, MLL2 and their associated components to the target chromatin region.
[0180] While MLL itself is a weak H3K4 methyltransferase, its interaction with the multiple subunit proteins greatly increases the ability to induce H3K4me3 (Dou et al., 2006; Steward et al., 2006). Since increases in both chromatin occupancy of the MLL complex and H3K4me3 in SON-depleted cells were detected, it was hypothesized that the protein interactions between the MLL complex subunits may be altered by SON knockdown. To this end, immunoprecipitation (IP) with an MLL1 N-terminus (MLL-N) antibody was performed and the presence of other MLL complex components by Western blotting was examined. Surprisingly, depletion of SON (FIG. 4A) significantly facilitated MLL-N interaction with MLL1 C-terminus (MLL-C), WDR5, ASH2L, menin and LEDGF (FIG. 4B). Immunoprecipitation with a WDR5 antibody further confirmed the enhanced interaction of WDR5 with MLL-C, MLL2, and menin upon SON depletion (FIG. 4C). Increased LEDGF interaction with MLL-N and menin in SON siRNA-transfected cells was also confirmed by LEDGF IP (FIG. 4D). In contrast, both WDR5 IP and SET1A IP experiments demonstrated
that the protein interactions within the SET1A complex and the MLL3/4 complex were not affected upon SON knockdown (FIG. 4C and FIG. 4E). These findings revealed that SON exerts its inhibitory effect specifically on MLL1/2 complex assembly.
SON interacts with menin, an MLL1/2 complex component, and SON-menin interaction diminishes MLL-menin interaction
[0181] The findings on the inhibitory effect of SON on MIX 1/2 complex formation prompted the examination of the possibility of physical association of SON with MLL1/2 complex subunits. Immunoprecipitation with SON antibody to detect SON- associated proteins revealed that SON interacts with menin, an MLL1/2 complex component involved in MLL function in oncogenesis (FIG. 4F). SON interaction with menin was further confirmed by overexpression and IP experiments (FIG. 4G). However, none of other components of the MLL complex, such as MLL-N, MLL-C, ASH2L, WDR5 and LEDGF, were detected in SON co-IP (FIG. 4F), suggesting that the SON-menin complex is not incorporated into a complete MLL complex. To examine the effect of SON on the interaction of menin with its direct binding partner MLL-N, menin-MLL-N interaction was assessed as well as menin-SON interaction by IP with menin antibody. Surprisingly, SON overexpression increased the menin-SON complex formation and at the same time, menin interaction with MLL-N was significantly decreased (FIG. 4H). Size-exclusion chromatography further demonstrated that a higher amount of menin was detectable in the MLL 1 -containing fractions when SON was depleted by siRNA (FIG. 41). These findings demonstrate an inhibitory effect of menin-SON interaction on menin-MLL interaction.
[0182] An increased interaction between MLL and menin upon SON knockdown was also observed in MV4; 11 cells which express MLL-fusion protein (MLL-AF4) as well as wild-type MLL (FIG. 4J and FIG. 4K). SON having an inhibitory effect on the interaction between menin and the MLL-fusion protein MLL-ENL was demonstrated (FIG. 4L). HOXA9 and MEIS 1, well-known transcriptional targets of MLL-fusion proteins, were also upregulated upon SON knockdown in MLL-rearranged cell lines (FIG. 4M), indicating inhibitory effects of SON on expression of MLL-fusion protein target genes.
Short isoforms of SON generated by alternative splicing are upregulated in acute myeloid leukemia
[0183] In addition to full-length SON (isoform F; SON F hereafter), several splice variants of SON have been predicted in genome databases (TABLE 1). An alternative exon is located between exon 6 and exon 7 of the human SON gene (labeled as exon 7a), and inclusion of exon 7a generates the isoform B (SON B). There is another alternative exon within intron 4 (labeled as exon 5a), and inclusion of this exon generates the isoform E (SON E). Both SON B and SON E are C-terminally truncated forms. Similar forms of full-length and the short splice variants (Son f, Son b and Son e) have been predicted in mice (FIG. 5A, FIG. 5H, and TABLE 1).
[0184] Interestingly, exon 5a is extremely well conserved between human and mouse (Wynn et al., 2000), suggesting functional significance of the isoforms made by inclusion of this alternative exon. Despite such information, no experimental data have proven the expression of these SON splice variants. 3' rapid amplification of cDNA ends (3' RACE) in K562 cells was performed and confirmed that mRNA of SON E with its own poly(A) site is indeed expressed (FIG. 51 and FIG. 5J).
[0185] To address the functional significance of SON splice variants, whether SON splice variants are differentially expressed in AML was examined. Bone marrow mononuclear cells (BM-MNCs) from AML patients (FAB subtype M2) and healthy donors (TABLE 2) were analyzed by qPCR using several primer sets (FIG. 5A). Similar to previous results (Ahn et al., 2013), most patient samples showed high levels of total SON (exon 1—3 region). Surprisingly, while expression levels of the exon 9— 12 region specific for SON F did not show significant upregulation, the expression level of alternative exon (exon 5a or 7a)-containing transcripts were significantly increased in AML patient BM-MNC samples (FIG. 5B). the SON isoform levels in peripheral blood mononuclear cells (PB-MNCs) isolated from AML and myelodysplastic syndrome (MDS) patients were measured (TABLE 2), and significant upregulation of exon 5 a- and exon 7a-containing transcripts were observed (FIG. 5C). These findings revealed that the SON upregulation in AML patients is largely attributable to upregulation of short isoforms.
[0186] The relative proportions of full-length SON and the isoforms in normal donors and AML patients were measured (FIG. 5K). The results revealed that SON F is the major form of SON in normal human BM-MNCs, and the short isoforms (SON B and SON E) occupy -20% of total SON (FIG. 5D). In contrast, the portion of SON E was remarkably increased in AML patient BM-MNCs, resulting in SON E accounting for 30 - 70% of total SON (FIG. 5D). PB-MNCs were also analyzed for relative ratios of full-length and SON isoforms. While PB-MNCs from 7 normal healthy donors represent almost identical patterns showing that only -10% of total SON is taken by SON E and SON B, the percentage of SON E and especially SON B are markedly increased in AML and MDS patients (FIG. 5E). Further evidence of short isoform expression was demonstrated in leukemic blasts isolated from mouse models of AMLl-ET09a-mediated leukemia and MLL-AF9-induced leukemia (FIG. 5F and FIG. 5G). In addition, microarray data available from Oncomine revealed that expression of exon 7a detected by the exon 7a-specific probe sets (FIG. 5L) was significantly increased in leukemia and lymphoma samples (FIG. 5M and FIG. 5N). Collectively, all of these results demonstrated the C-terminally truncated short isoforms of SON are aberrantly upregulated in hematopoietic malignancies, particularly in AML.
SON E attenuates the full-length SON function in transcriptional repression, resulting in derepression of SON ChIP target genes, but does not impair full-length SON-mediated RNA splicing
[0187] Whether the increase of SON short isoforms has any effect on expression of SON target genes in AML patients was investigated. Among SON ChIP target genes, CDKN1A, GFI1 and ATF3 were selected (FIG. 6A) since the importance of tight regulation of these genes in leukemia and other cancers has been demonstrated (Abbas and Dutta, 2009; Janz et al., 2006; Khandanpour et al., 2013; Phelan et al., 2010; Viale et al., 2009). These target genes were indeed significantly upregulated in BM-MNCs from AML patients compared with healthy donors (FIG. 6A), indicating that increased expression of SON short isoforms is associated with de-repression of SON ChIP target genes in AML.
[0188] To examine the exact effect of SON short isoforms on SON target gene expression, a specific siRNA targeting the SON E-specific exon 5a was developed (FIG. 6G),
and confirmed that this siRNA lowers the level of SON E, but not SON F (FIG. 6B and FIG. 6C). Interestingly, while the total SON siRNA that targets all forms of SON significantly upregulated SON ChIP target genes, transfection of SON E-specific siRNA led to downregulation of theses target genes in both K562 cells and human CD34+ bone marrow (BM) cells (FIG. 6B and FIG. 6C), indicating that a stronger repression of target genes occurred in the absence of SON E. Furthermore, unlike SON F overexpression which causes SON ChIP target gene repression, SON E overexpression (FIG. 6H) resulted in upregulation of those target genes in human CD34+ BM cells (FIG. 6D). These results strongly support the concept that SON E weakens the inhibitory effect of full-length SON on target gene transcription.
[0189] Since the critical role of SON in RNA splicing of a group of genes has been previously demonstrated (Ahn et al., 201 1; Lu et al., 2013; Sharma et al., 2011), how those genes are regulated upon SON E knockdown and overexpression was examined. While total SON siRNA significantly reduced the level of SON's splicing target genes, such as TUBG1, HDAC6 and A T1, knockdown of SON E caused no change or a marginal decrease in the expression of these splicing target genes (FIG. 6B). Furthermore, SON E overexpression in human CD34+ BM cells did not affect the level of SON's RNA splicing target genes (FIG. 6D), indicating that short SON isoforms do not exert an inhibitory effect on the expression of the target genes undergoing SON-mediated RNA splicing.
[0190] Whether SON E expression affects SON F function in repressing H3K4me3 was examined. SON F (siRNA-resistant form) expression could lower the H3K4me3 level, which was increased by SON siRNA, near the transcription start sites of the CDKN1A and GFI1 genes (FIG. 6E). In contrast, expression of SON E alone (siRNA- resistant form) failed to reduce the H3K4me3 level, indicating its lack of ability to suppress H3K4me3. Interestingly, when SON E was co-expressed with SON F (FIG. 61), SON E could attenuate the repressive effect of SON F on H3K4me3 in a dose-dependent manner (FIG. 6E).
[0191] A minigene splicing assay was performed using the TUBG1 exon 7-8 minigene model (Ahn et al., 201 1) and various ratios of SON F and SON E transfection to assess the effect of SON E expression on SON F-mediated RNA splicing (FIG. 6J). SON E expression did not attenuate SON F-mediated RNA splicing (FIG. 6F and FIG. 6K). Taken
together, these results demonstrate that SON E overexpression causes dysregulation of SON- mediated transcriptional repression, but not SON-mediated RNA splicing.
SON E competes with full-length SON for DNA-binding, but lacks the menin-binding ability, thus attenuating the inhibitory effect of full-length SON on MLL complex assembly
[0192] To gain further mechanistic insights, the DNA-binding ability of SON E was investigated. ChIP was performed to pull down V5-tagged SON F and Flag-tagged SON E (FIG. 7A) after expression of different amounts of SON E together with SON F. ChlP- qPCR revealed the enrichment of transfected SON E at SON target sites near transcription start sites of the CDKN1A and ATF3 genes (FIG. 7B), demonstrating that SON E indeed retains its ability to associate with target DNA. Increased SON E binding to the target sites concomitantly led to decreased SON F binding to the same sites (FIG. 7B). These data demonstrate that SON F and SON E share the common target DNA for binding, and they compete with each other to occupy the same chromatin region. Therefore, overexpression of SON E results in displacement of full-length SON from the target DNA.
[0193] Since the data demonstrated that SON F interaction with menin abrogates MLL-menin interaction (FIG. 4F), whether SON E retains the menin-binding ability was investigated. Surprisingly, unlike SON F, SON E does not interact with menin (FIG. 7C). It was demonstrated that the C-terminus of SON containing the RS domain and RNA-binding motifs (SR+RB in FIG. 7A) indeed interacts with menin (FIG. 7D) through the central region of menin (FIG. 7J and FIG. 7K). This region has been known to be important for menin interaction with MLL (Murai et al., 2011), suggesting that the C-terminal region of SON potentially occupies the MLL-binding site within menin.
[0194] The effects of SON F and SON E overexpression (FIG. 7L) on MLL interaction with menin and MLL complex formation were examined. While SON F overexpression inhibits MLL-N interaction with menin, SON E overexpression further enhanced the interactions between MLL-N with menin. SON E overexpression also strengthened MLL-N interaction with MLL-C and WDR5, indicating that the whole MLL complex assembly is enhanced upon increased SON E expression (FIG. 7E).
SON E overexpression enhances in vitro replating capacity of primary mouse bone marrow cells
[0195] To further assess the functional significance of SON E in hematopoietic cells, SON E was overexpressed in mouse primary bone marrow cells through lenviral transduction (FIG. 7F, FIG. 7M, and FIG. 7N), and measured the colony forming ability in methylcellulose replating assays (FIG. 7M). Surprisingly, SON E-overexpressing cells were able to produce significantly increased numbers of colonies and were able to maintain colony forming cells even in the 5th round of plating (FIG. 7G and FIG. 7H), indicating higher clonogenic potential. The findings demonstrate that increased expression of SON short isoforms enhances the preservation of sternness in normal hematopoietic progenitors, suggesting its potential contribution to stem cell self-renewal and development and/or maintenance of leukemic stem cells.
Discussion
[0196] SON was previously known as an RNA splicing co-factor. This study demonstrated that SON and its splice variants regulate MLL complex assembly and H3K4me3, affect gene expression of multiple leukemia-associated genes, and affect replating potential of hematopoietic progenitors. The data suggest that although the short splice variants, such as SON E, interact with target DNA, they cannot exert an inhibitory effect on MLL complex assembly due to the lack of menin-binding ability. Therefore, overexpression of "short SON" in pathological conditions, such as AML, attenuates the inhibitory effects of full-length SON on MLL complex assembly, resulting in activation of multiple target gene transcription (modeled in FIG. 71). These current findings suggest that there are two arms for the biological function of SON: one supporting efficient RNA splicing and the other antagonizing MLL complex-mediated transcriptional initiation.
SON as a negative regulator of MLL-menin interaction and MLL complex assembly
[0197] It has been demonstrated that the direct interaction between MLL and menin is required for MLL target gene expression and oncogenic property of the MLL fusion protein. The importance of the MLL-menin interaction was highlighted by a recent study showing that pharmacologic inhibition of this interaction is able to block progression of
MLL-rearranged leukemia (Borkin et al., 2015; Grembecka et al., 2012). In addition, blocking MLL-menin interaction with a small molecule inhibitor could effectively block growth of hormone-refractory prostate cancers (Malik et al., 2015). MLL-menin interaction is also involved in promoting hepatocellular carcinoma development (Xu et al., 2013). Given the significance of MLL-menin interaction in disease-associated gene expression, identification of endogenous regulatory factors affecting this interaction would generate valuable information for understanding and targeting the MLL complex. Our discoveries of SON as a menin-binding protein and a negative regulator of MLL-menin interaction support a potential clinical impact of SON. In addition to MLL, menin also interacts with other transcription factors and hormone receptors, such as JUND, estrogen receptor and androgen receptor (Agarwal et al., 1999; Dreijerink et al., 2006; Malik et al., 2015).
Short splice variants of SON: their effects on two arms of SON function and clinical significance
[0198] A finding in this study is identification of functional significance of SON isoforms generated by alternative splicing. This data demonstrated that a short isoform of SON (SON E) which is expressed in both human and mouse has a functional defect in menin interaction and transcriptional repression but retains the ability to interact with target DNA sequences. Although this study did not examine the function of SON B, another short isoform of SON, SON B may have similar functions as SON E based on the role of the C-terminus of SON in menin interaction (FIG. 7D and FIG. 7E). SON ChIP target genes identified include critical factors for cell-cycle and hematopoietic differentiation, such as CDKN1A, GFI1 and ATF3, which are aberrantly upregulated in AML patients with elevated levels of short SON isoforms (FIG. 5 and FIG. 6). Therefore, increased SON isoforms in hematopoietic stem cells or progenitors could potentially lead to failures in dosage controls of multiple leukemia- associated target genes. Interestingly, overexpression of SON E specifically weakened the full-length SON function in transcriptional initiation, but does not impair full-length SON- mediated RNA splicing. These results suggest that overexpression of short SON observed in leukemia patients would selectively affect SON's transcription target genes, leading to de-
repression, while not blocking full-length SON-mediated RNA splicing which is involved in cell proliferation and survival.
[0199] So far, the biological significance of the MLL complex in leukemia has been studied mainly in MLL-rearranged leukemia (Bernt et al., 2011; Dorrance et al., 2006; Popovic and Licht, 2012). Dysregulation of wild-type MLL function has not been clearly linked to diseases, although wild-type MLL function was recently shown to be necessary for growth and survival of MLL-fusion leukemia (Thiel et al., 2010) and other solid tumors (Ansari et al., 2013). This work suggests that MLL functions could be altered in cancer patients without MLL-rearrangement when "short SON" is overexpressed. In addition, the increased level of short SON isoforms could serve as a biomarker indicating dysregulation of MLL-mediated H3K4me3.
[0200] SON E overexpression markedly enhances replating capacity of primary hematopoietic progenitors, shedding light on functional significance of aberrant upregulation of "short SON" in AML. This study suggests that overexpression of short SON alone may not be sufficient to drive oncogenic transformation of hematopoietic cells, but implies potential roles of short SON in self-renewal and clonogenic abilities of leukemic cells.
Experimental Procedures
ChlP-Sequencing (ChlP-seq) and ChlP-qPCR
[0201] Chromatin Immunoprecpitation (ChIP) was performed with the methods adapted from the protocol of the laboratory of Richard M. Myers which has been used in the part of the ENCODE project (http://research.hudsonalpha.org/Myers/?page_id=142). For ChlP-qPCR, all ChIP signals were normalized to total input and each experiment was performed 3 times independently. The primer sets for qPCR on the specific region of the target genes are listed in TABLE 3.
TABLE 3
[0202] ChlP-seq libraries were generated using Gnomegen Library Preparation Kits (Gnomegen), and ChlP-seq libraries were cluster-amplified and sequenced with the Illumina HiSeq2000 sequencer (50-nucleotide pair-ended read).
Primary Patient Samples
[0203] Cells were purified by Ficoll-Paque (GE Healthcare) density-gradient centrifugation and frozen as viable cells. Details of the patient samples are listed in TABLE 2.
Cell Lines and Cell Culture
[0204] K562, MV4; 1 1 and ML-2 cell lines were cultured in RPMI 1640 medium with 10% fetal bovine serum. Primary human CD34+ bone marrow cells were purchased from Lonza and were cultured in STEMSPAN SFEM (StemCell Technologies) supplemented with 1% penicillin and streptomycin, 100 ng/ml of recombinant human SCF, 100 ng/ml of recombinant human FLT3L and 100 ng/ml of recombinant human TPO for 1 to 2 days before transfection. Primary mouse bone marrow cells were cultured in STEMSPAN supplemented with 15% FBS, 1% penicillin and streptomycin, 1% Glutamate, 10 ng/ml recombinant mouse IL-3 50ng/ml recombinant mouse SCF and 50 ng/ml recombinant mouse FLT3L for 1 day after lentivirus infection. All the cytokines were purchased from PeproTech.
Plasmid Construction
[0205] Plasmids containing siRNA-resistant full-length SON cDNA (siRR-SON F) and SON SR+RB (serine/arginine-rich region and RNA-binding domain) were created as previously described (Ahn et al., 201 1). The plasmid containing SON E was constructed as follows. The region from internal Hpal site to the 3' end of SON E cDNA was amplified using the forward primer (SON Hpal-F: 5'-GAATCTTCAATTACGTTAACA-3') (SEQ ID
NO:77) and the reverse primer (Flag-SON E Notl-R: 5'- TTTGCGGCCGCTATTTGTCATCGTCATCCTTGTAGTCTGGCCGGCC AAACTCAGTTTAGTTCTTCTATAGT AGCTCCTCCTG-3 ' ) (SEQ ID NO:78). The sequence for the Flag tag was added as indicated in the reverse primer. Then, the C-terminus of SON F in pcDNA3 HA-siRR SON F- Flag was replaced by the amplified SON E C- terminal fragment at the Hpal and Notl sites to create pcDNA3 HA-siRR SON E-Flag construct. To construct V5-tagged plasmids, the C-terminus region was amplified from cDNA using specific primers (SON BmgBl-F: 5'-
GC AAGTGATGTTGGACGTGACAGATC-3 ' (SEQ ID NO:79) and V5-SON F Notl-R: 5'- TTTGCGGCCGCtacgtagaatcgagaccgaggagagggttagggataggcttaccTGGCCGGCCatacctatt caagaaaaacatacaatt-3' (SEQ ID NO:80)) and subcloned into plasmids. Full-length cDNA clones of human menin (pcDNA3 Flag-menin, #32079) and MLL-ENL (pMSCV Flag-MLL- pl-ENL, #20873) were purchased from Addgene).
Reverse transcription and Quantitative PCR (RT-qPCR)
[0206] Human leukemic cell lines from one 6-well, human CD34+ BM cells from one 24-well, and patient samples were lysed and total RNA was isolated using the RNeasy Mini Kit (Qiagen). Total RNA from each sample was treated with RNase-free DNase I and used as a template to produce cDNA with oligo-dT using the Superscript III First Strand Synthesis Kit (Life Technologies). Quantitative real time PCR (RT-qPCR) was performed in triplicate reactions on the iQ5 Real Time PCR Detection System (Bio-Rad) using the Fast Start Universal SYBR Green Master (Roche) and standard deviations were calculated. All PCR reactions were finished under the following program: initial denaturation step was 95°C for 10 min, followed by 40 cycles of denaturation at 95°C for 15 seconds, and annealing at 60°C for 60 seconds. PCR primers were listed in TABLE 3 in the supplemental material. Gene expression levels were normalized to GAPDH.
Antibodies
[0207] The antibodies used for ChIP, IP, and Western blotting were the following; SON antibody recognizing the N-terminus of SON (SON-N Ab) was generated against amino acids 74-88 of human SON. SON-N antibody was used for both ChIP and Western blot. Anti-
SON (SON-C Ab, abl21759, for ChIP), anti-H3K27ac (ab4729), anti-H3K4me3 (ab8580), anti-SUZ12 (ab l2073), anti-WDR5 (ab56919), anti-H3 (abl791), and anti-H3K4mel (ab8895), anti-H3K79me2 (ab3594) were purchased from Abeam. Anti-TRX2/MLL2 (A300- 1 13A), anti-ASH2L (A300-489A), anti-menin (A300-105A), anti-MLL-C (MLL1, C- terminus, A300-374A), anti-LEDGF (A300-848A), anti-SETIA (A300-289A), anti-SETIB (A302-281A), anti-CFPl (A303-161A), and anti-MLL-N for ChIP (MLL1 , N-terminus, A300-086A) were purchased from Bethyl Laboratories. ASC2 antibody was a gift from Dr. Jae W Lee (Oregon Health & Science University). Anti-H3K27me3 (07-4490, Millipore), anti-MLL-N for IP and WB (MLL1 N-terminus, 39829, Active motif), anti-V5 (R960-25, Invitrogen), anti-HA (#2367, Cell Signaling Technology), anti-Flag M2 (F3165, Sigma), and anti-Actin (A5441 , Sigma) were purchased from the indicated companies.
Chromatin Immunoprecpitation (ChIP)
[0208] K562 cells were incubated with 1% formaldehyde in 5 ml growth medium for 10 min at room temperature and cross-linking reaction was terminated by incubation with 125 raM glycine for 10 min. Subsequently cells were incubated for 15 min at 4°C with lysis buffer (5 mM PIPES pH 8.0 / 85 mM KC1 / 0.5% NP-40 / lx Complete Protease Inhibitor Cocktail (Roche)), collected by centrifugation for 5 min at 3,000g and resuspended in RIPA buffer (150 mM NaCl / 50 mM Tris-HCl, pH 8.0/ 1 mM EDTA / 1% sodium deoxycholate / 0.1% SDS / 1% Triton X-100 / lx Complete Protease Inhibitor Cocktail). To shear chromatin to lengths ranging between 200—500 base pairs, crude nuclei were sonicated with the Ultrasonic disintegrator Sonicator S-4000 (Misonix). Sonicated DNA from each sample were incubated at 4°C overnight with 1—5 μg of specific antibodies or normal immunoglobulin G (IgG) as controls and magnetic bead protein A or G (Dynabeads Protein A or Protein G, Life Technologies). The magnetic beads were washed 5 times for lOmin at 4°C on a rotating platform with 1ml wash buffer (100 mM Tris pH 7.5 / 500 mM LiCl / 1% NP-40 / 1% Sodium deoxycholate) and washed once with TE (10 mM Tris pH 7.5 / 0.1 mM EDTA). After washing, the washed beads were eluted by heating for 2hr at 65 °C in elution buffer (1% SDS / 0.1 M NaHC03) with proteinase K. ChIP DNA were purified and concentrated using the QIAquick PCR Purification Kit (Qiagen).
siRNA and Plasmid Transfection
[0209] Total SON siRNAs directed against human SON (siSON #1 : GCAUUUGGCCCAUCUGAGAtt, (SEQ ID NO:81) Ahn et al, 2011 ; siSON #2: UGAGCGCUCUAUGAUGUCAtt, (SEQ ID NO:82) Lu et al, 2013), human SON E-specific siRNA (siSON E: CACCGGAGCUUGGAAAUUAtt (SEQ ID NO:83)), and negative control siRNA (UAACGACGCGACGACGUAAtt (SEQ ID NO:84)) were custom synthesis products by Life Technologies (Silencer Select siRNA). For K562 and ML2 cells, 0.5χ 106 cells were nucleofected with 100 - 200 pmol of siRNA or 5 μg of plasmid using Cell Line Nucleofector Kit V (Lonza) according to the manufacturer's instructions. For MV4;11 cells, 0.5x 106 cells were nucleofected with 150 - 200 pmol of siRNA using Cell Line Nucleofector Kit L (Lonza) according to the manufacturer's instructions. For human CD34+ bone marrow cells, 0.4x 106 cells were nucleofected with 80 - 150 pmol of siRNA or 4 μg of plasmid using Human CD34+ Cell Nucleofector Kit (Lonza) according to the manufacturer's instructions. Human embryonic kidney (HEK) 293 cells were transiently transfected with 5 μg of plasmids using PEL
ChlP-Seq Analysis
[0210] ChlP-sequencing data files were aligned to the hgl9 human reference genome using Bowtie (version 0.12.9) and standard parameters. Peak calling was performed with MACS vl .4.2 software using default parameters. To identify high confidence SON binding peaks, the MACS peak calling output from two different experimental samples were used. BigWig files were generated by first extending the 5' ends of uniquely aligned, non- duplicate ChlP-seq reads by the average DNA fragment length (150 bp for histone marks, 250 bp for transcription factors) in the 3' direction using BEDtools. Identified peaks were then annotated to the nearest transcription start site. Peaks that were identified in both experimental samples (overlapping peaks) with a false discovery rate (FDR) of 0.001 and a tag density (TD) of 12 in at least one of the experimental samples were identified as significant high confidence peaks. When determining the peak overlaps from each analysis, an in-house script was used to determine the percentage of the region of the smaller peak that overlapped with the larger peak. Overlap cut-off threshold was set to 50%, such that 50% of
the smaller peak in one replicate was required to overlap with the peak in the other replicate to be considered an overlapping peak.
[0211] The genomic distributions of binding sites (FIG. IB and FIG. 1C) were analyzed using the cis-regulatory element annotation system (CEAS v 1.0.2). The genes closest to the binding site on both strands were classified into functional categories such as promoter (from - lkb to +100bp), 5' UTR, first exon, first intron, exon, intron, 3' UTR, and intergenic region. The genes were also divided into defined groups according to the enrichment of the SON across transcription start site regions (5 kb window surrounding the transcription start site). Tag density heatmap (FIG. ID) was generated using the R package pheatmap vO.7.7 (peak extensions 5 kb upstream and 5 kb downstream of the peak summit and bin size 10 bp). Motif analyses (FIG. IE and FIG. II) were performed using HOMER, the methods of which are freely available at http://biowhat.ucsd.edu/homer/. Gene-associated region annotations (FIG. IF) were obtained with Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.7. ChIP density plots (FIG. 2B and FIG. 2H) were made using HOMER by calculating average tag densities across ±5kb regions surrounding indicated peaks. ChTP-seq read density files (FTG. 2C, FIG. 2D, FIG. 2F) were generated using IGV tools and were viewed in Integrative Genomics Viewer (IGV) (http://www.broadinstitute.org/igv/). Previously published ChlP-seq data for H3K27ac, H3K4me3, and H2A.Z from K562 (downloaded from ENCODE/Broad Institute) and MNase- seq data for nucleosome position from K562 (downloaded from ENCODE/Stanford/BYU) were analyzed. Enriched ChlP-seq regions at promoters (5 kb window surrounding the transcription start site) for SON and histone marks were combined together to generate a unified track consisting of all merged enriched regions.
Preparation of Nuclear Extraction and Immunoprecipitation (IP)
[0212] Nuclear extracts were prepared from control, siRNA- or plasmid- transfected K562 or MV4; 1 1 cells using the Dignam protocol (Dignam et al., 1983). In brief, harvested cells were resuspended in three packed cell volumes of buffer A (10 mM HEPES pH 7.9 / 1.5 mM MgCl2 / 10 mM KC1 / 1 mM DTT / 0.1% NP-40 / Protease Inhibitor Cocktail) and homogenized using needle and syringe with 25 to 30 gentle strokes. Lysed cells
were centrifuged at 13,000xg for 10 min and the nuclei pellet was resuspended in two packed volumes of buffer C (20 mm HEPES pH 7.9 / 420 mM KC1 / 1.5 mM MgCl2 / 1 mM DTT / 25% glycerol / Protease Inhibitor Cocktail). The nuclei suspension was gently stirred for 30 min at 4°C and centrifuged 15 min at 13,000 g to remove debris. Sufficient volume of buffer D (20 mM HEPES pH 7.9 / 0.5 mM DTT / 25% glycerol / Protease Inhibitor Cocktail) was added to the nuclei extract. For IP with K562 cells, nuclear extracts were pre-cleared with protein A-sepharose beads (Life Technologies) for 1 hour and incubated either with rabbit IgG or SON antibody at 4°C overnight on a rotator. Beads were washed four times with wash buffer (20 mM HEPES pH 7.9 / 150 mM NaCl / 0.05% (v/v) NP-40) and eluted by boiling in SDS buffer and analyzed by SDS-PAGE.
[0213] For Co-IP with exogenously expressed proteins, HEK293 cells were co- transfected with V5 and HA-tagged plasmids encoding wild type SON F or its alternative spliced variant SON E and either Flag-tagged menin plasmid or empty vector. Cells were lysed with lysis buffer (50 mM Tris-HCl, pH 8.0 / 150 mM NaCl / 0.5% NP-40 / 10% glycerol / Protease Inhibitor Cocktail / 50 U/mL of Benzonase nuclease (Sigma)) for 1 hour. Whole cell extracts were pre-cleared and incubated with the V5 or HA antibody and protein G-sepharose beads (Life technologies) overnight. Co-IP complexes precipitated by bead were eluted by boiling in SDS buffer and subjected to Western blot analyses.
Size Fractionation and Analysis
[0214] Gel filtration chromatography was carried out at 4 °C using AKTA system (GE Healthcare). Nuclear extract from control or SON siRNA transfected K562 cells were prepared fresh, passed through a 0.22-μηι pore size MILLEX-GS filter (Millipore) and size- fractionated by fast-protein liquid chromatography (FPLC). 5mg of nuclear protein was applied to a Superose 6 10/300 GL column (GE Healthcare) equilibrated in FPLC buffer (20 mM Tris-HCl / 0.2 mM EDTA / 5 mM MgC12 / 0.1 M KC1 / 10% glycerol / 0.5 mM DTT / 1 mM benzamidine / 0.2 mM PMSF / pH 7.9) at 0.3 mL/min. 0.5 mL of elutes were collected and prepared for Western blot.
Analysis of Minigene Splicing
[0215] The minigene construct containing TUBG1 exon 7- intron 7- exon 8 were used to examine SON-mediated splicing as described previously (Ahn et al., 201 1). Briefly, HeLa cells in 6 well plates were transfected with 100 pmol of control siRNA or SON siRNA. The next day, minigene alone or minigene plus various amounts of SON F or SON E constructs (siRNA-resistant form) were transfected as indicated in each experiment. RNAs were isolated 30h after minigene transfection, and RT-PCR was performed using forward primer targeting TUBG1 exon 7 and the reverse primer targeting the bovine growth hormone (BGH) terminator sequence present in the pcDNA vector.
3' RACE PCR
[0216] The transcription terminating site of the primary transcript of SON E was determined by 3' RACE PCR. For the 3' RACE PCR, K562 total RNA was reverse transcribed by Superscript III First Strand Synthesis Kit (Life Technologies) with the adapter oligo-dT primer. The first PCR was performed using the first adapter primer (Adapter Reverse Parti) and SON E primer (SON El), which specifically bind with SON exon 5a region. A second PCR was achieved using the second adapter primer (Adapter Reverse Part2) and SON E primer (SON E2). PCR products were visualized on a 1% agarose gel and cDNA fragments were cloned and sequenced. The primer sets for RACE PCR are listed in TABLE 3.
Primary Patient Samples
[0217] The bone marrow mononuclear cells and/or peripheral blood mononuclear cells from AML patients (FAB subtype M2, Pl - Pl l) as well as bone marrow mononuclear cells from healthy donors (Nl — N4) were obtained from the Stem Cell and Xenotransplantation Core Facility of the University of Pennsylvania. Peripheral blood samples from additional 5 patients diagnosed with AML or MDS (P12— PI 6) and healthy donors (N5— Nl 1) were obtained from the BioBank of Mitchell Cancer Institute, University of South Alabama. Cells were purified by Ficoll-Paque (GE Healthcare) density-gradient centrifugation and frozen as viable cells. Details of the patient samples were listed in TABLE 2.
Preparation of Leukemic Blasts and Lin- / c-Kit+ Bone Marrow Cells from Mice
[0218] Mouse leukemic blasts were obtained from C57BL/6 mice with AML1- ET09a-induced leukemia generated by retroviral transduction-transplantation (Yan et al., 2006). MLL-AF9 leukemic blasts were obtained from the spleen and bone marrow of MLL- AF9a knock-in mice (Corral et al., 1996). Normal mouse Lin-/c-Kit+ bone marrow cells were prepared by Lineage Cell Depletion Kit and CD1 17 MicroBeads (Miltenyl Biotec).
Lentiviral Vector Construction and Lentivirus Production
[0219] Lentiviral vector for SON E overexpression was prepared by subcloning SON E cDNAs into the pCDH-MCS-T2A-copGFP-MSCV vector (System Bioscience). Lentivirus was produced by cotransfection of HEK 293T cells with expression plasmid, pMDLg/pRRE, pRSV-REV, and pVSVG. Viral supernatants were collected after 48 h and clarified by filtration before use. Ultracentrifugation was performed for lentivirus concentration with the Optima L-100 XP centrifuge (Beckman) using an SW55TI rotor (Beckman) at 19,400 rpm for 2 hr at 20°C. Supernatant was completely removed and virus pellets resuspended in PBS.
Colony Forming Unit Assay and Serial Replating
[0220] Mouse total bone marrow cells were transduced with recombinant lentivirus in the 4 ug/ml polybrene. Infected cells were incubated overnight in the above mouse BM culture media. After 3 days of culture, 2 x 104 cells were plated in methylcellulose medium (Methocult GF M3434, StemCell Technologies). Colony number counting and re- plating were repeated every 7-10 days. The colony-forming units (CFUs) were quantified as the average and standard deviation of at least triplicate determinations.
Oncomine Database Analysis
[0221] Analysis of SON isoform expression in leukemia and normal cells were conducted using the Oncomine database (www.oncomine.org). The Oncomine database was used as previously described (Rhodes et al., 2004).
Statistical Analysis
[0222] In all graphs data are expressed as mean ± SD of three independent experiments, except when otherwise indicated. Differences were analyzed by Student's t test. P-values < 0.05 were considered significant.
[0223] Each of the following references is incorporated herein by reference in its entirety.
References
Abbas, T., and Dutta, A. (2009). p21 in cancer: intricate networks and multiple activities. Nature reviews. Cancer 9, 400-414.
Agarwal, S.K., Guru, S.C., Heppner, C, Erdos, M.R., Collins, R.M., Park, S.Y., Saggar, S., Chandrasekharappa, S.C., Collins, F.S., Spiegel, A.M., et al. (1999). Menin interacts with the API transcription factor JunD and represses JunD-activated transcription. Cell 96, 143-152.
Ahn, E.E., Higashi, T., Yan, M., Matsuura, S., Hickey, C.J., Lo, M.C., Shia, W.J., DeKelver, R.C., and Zhang, D.E. (2013). SON protein regulates GATA-2 through transcriptional control of the microRNA 23a~27a~24-2 cluster. The Journal of biological chemistry 288, 5381-5388.
Ahn, E.Y., DeKelver, R.C., Lo, M.C., Nguyen, T.A., Matsuura, S., Boyapati, A., Pandit, S., Fu, X.D., and Zhang, D.E. (201 1). SON controls cell-cycle progression by coordinated regulation of RNA splicing. Molecular cell 42, 185-198.
Ansari, K.I., Kasiri, S., and Mandal, S.S. (2013). Histone methylase MLLl has critical roles in tumor growth and angiogenesis and its knockdown suppresses tumor growth in vivo. Oncogene 32, 3359-3370.
Barski, A., Cuddapah, S., Cui, K., Roh, T.Y., Schones, D.E., Wang, Z., Wei, G., Chepelev, I., and Zhao, K. (2007). High-resolution profiling of histone methylations in the human genome. Cell 129, 823-837.
Bernt, K.M., Zhu, N., Sinha, A.U., Vempati, S., Faber, J., Krivtsov, A.V., Feng, Z., Punt, N., Daigle, A., Bullinger, L., et al. (201 1). MLL-rearranged leukemia is dependent on aberrant H3K79 methylation by DOT1L. Cancer cell 20, 66-78.
Borkin, D., He, S., Miao, H., Kempinska, K., Pollock, J., Chase, J., Purohit, T., Malik, B., Zhao, T., Wang, J., et al. (2015). Pharmacologic Inhibition of the Menin-MLL Interaction Blocks Progression of MLL Leukemia In Vivo. Cancer cell 27, 589-602.
Cao, F., Chen, Y., Cierpicki, T., Liu, Y., Basrur, V., Lei, M., and Dou, Y. (2010). An Ash2L/RbBP5 heterodimer stimulates the MLLl methyltransferase activity through coordinated substrate interactions with the MLLl SET domain. PloS one 5, el4102.
Chia, N.Y., Chan, Y.S., Feng, B., Lu, X., Orlov, Y.L., Moreau, D., Kumar, P., Yang, L., Jiang, J., Lau, M.S., et al. (2010). A genome-wide RNAi screen reveals determinants of human embryonic stem cell identity. Nature 468, 316-320.
Corral, J., Lavenir, I., Impey, H., Warren, A. J., Forster, A., Larson, T. A., Bell, S., McKenzie, A. N., King, G., and Rabbitts, T. H. (1996). An M11-AF9 fusion gene made by
homologous recombination causes acute leukemia in chimeric mice: a method to create fusion oncogenes. Cell 85, 853-861.
Dignam, J. D., Lebovitz, R. M., and Roeder, R. G. (1983). Accurate transcription initiation by RNA polymerase II in a soluble extract from isolated mammalian nuclei. Nucleic Acids Res 11, 1475-1489.
Dorrance, A.M., Liu, S., Yuan, W., Becknell, B., Arnoczky, K.J., Guimond, M., Strout, M.P., Feng, L., Nakamura, T., Yu, L., et al. (2006). Mil partial tandem duplication induces aberrant Hox expression in vivo via specific epigenetic alterations. The Journal of clinical investigation 116, 2707-2716.
Dou, Y., Milne, T.A., Ruthenburg, A.J., Lee, S., Lee, J.W., Verdine, G.L., Allis, CD., and Roeder, R.G. (2006). Regulation of MLL1 H3K4 methyltransferase activity by its core components. Nature structural & molecular biology 13, 713-719.
Dreijerink, K.M., Mulder, K.W., Winkler, G.S., Hoppener, J.W., Lips, C.J., and Timmers, H.T. (2006). Menin links estrogen receptor activation to histone H3K4 trimethylation. Cancer research 66, 4929-4935.
Ernst, P., and Vakoc, C.R. (2012). WRAD: enabler of the SET 1 -family of H3K4 methyltransferases. Briefings in functional genomics 1 1 , 217-226.
Grembecka, J., He, S., Shi, A., Purohit, T., Muntean, A.G., Sorenson, R.J., Showalter, H.D., Murai, M.J., Belcher, A.M., Hartley, T., et al. (2012). Menin-MLL inhibitors reverse oncogenic activity of MLL fusion proteins in leukemia. Nature chemical biology 8, 277-284.
Guenther, M.G., Levine, S.S., Boyer, L.A., Jaenisch, R., and Young, R.A. (2007). A chromatin landmark and transcription initiation at most promoters in human cells. Cell 130, 77-88.
Hickey, C.J., Kim, J.H., and Ahn, E.Y. (2014). New discoveries of old SON: a link between RNA splicing and cancer. Journal of cellular biochemistry 115, 224-231.
Janz, M., Hummel, M., Truss, M., Wollert-Wulf, B., Mathas, S., Johrens, K., Hagemeier, C, Bommert, K., Stein, H., Dorken, B., et al. (2006). Classical Hodgkin lymphoma is characterized by high constitutive expression of activating transcription factor 3 (ATF3), which promotes viability of Hodgkin/Reed-Sternberg cells. Blood 107, 2536-2539.
Joslin, J.M., Fernald, A.A., Tennant, T.R., Davis, E.M., Kogan, S.C., Anastasi, J., Crispino, J.D., and Le Beau, M.M. (2007). Haploinsufficiency of EGR1, a candidate gene in the del(5q), leads to the development of myeloid disorders. Blood 110, 719-726.
Khandanpour, C, Phelan, J.D., Vassen, L., Schutte, J., Chen, R., Horman, S.R., Gaudreau, M.C., Krongold, J., Zhu, J., Paul, W.E., et al. (2013). Growth factor independence 1 antagonizes a p53-induced DNA damage response pathway in lymphoblastic leukemia. Cancer cell 23, 200-214.
Liebermann, D.A., Tront, J.S., Sha, X., Mukherjee, K., Mohamed-Hadley, A., and Hoffman, B. (201 1). Gadd45 stress sensors in malignancy and leukemia. Critical reviews in oncogenesis 16, 129-140.
Lu, X., Goke, J., Sachs, F., Jacques, P.E., Liang, H., Feng, B., Bourque, G., Bubulya, P.A., and Ng, H.H. (2013). SON connects the splicing-regulatory network with pluripotency in human embryonic stem cells. Nature cell biology 15, 1 141-1 152.
Lu, X., Ng, H.H., and Bubulya, P.A. (2014). The role of SON in splicing, development, and disease. Wiley interdisciplinary reviews. RNA.
Maia, S., Haining, W. N., Ansen, S., Xia, Z., Armstrong, S. A., Seth, N. P., Ghia, P., den Boer, M. L., Pieters, R., Sallan, S. E., et al. (2005). Gene expression profiling identifies BAX-delta as a novel tumor antigen in acute lymphoblastic leukemia. Cancer Res 65, 10050- 10058.
Malik, R., Khan, A.P., Asangani, I.A., Cieslik, M., Prensner, J.R., Wang, X., Iyer, M.K., Jiang, X., Borkin, D., Escara-Wilke, J., et al. (2015). Targeting the MLL complex in castration-resistant prostate cancer. Nature medicine 21, 344-352.
Martello, G. (2013). Let's sp(l)ice up pluripotency! The EMBO journal 32, 2903-2904.
Mattioni, T., Hume, C.R., Konigorski, S., Hayes, P., Osterweil, Z., and Lee, J.S. (1992). A cDNA clone for a novel nuclear protein with DNA binding activity. Chromosoma 101, 618-624.
Miller, T., Krogan, N.J., Dover, J., Erdjument-Bromage, H., Tempst, P., Johnston, M., Greenblatt, J.F., and Shilatifard, A. (2001). COMPASS: a complex of proteins associated with a trithorax-related SET domain protein. Proceedings of the National Academy of Sciences of the United States of America 98, 12902-12907.
Murai, M.J., Chruszcz, M., Reddy, G., Grembecka, J., and Cierpicki, T. (201 1). Crystal structure of menin reveals binding site for mixed lineage leukemia (MLL) protein. The Journal of biological chemistry 286, 31742-31748.
Phelan, J.D., Shroyer, N.F., Cook, T., Gebelein, B., and Grimes, H.L. (2010). Gfil- cells and circuits: unraveling transcriptional networks of development and disease. Current opinion in hematology 17, 300-307.
Popovic, R., and Licht, J.D. (2012). Emerging epigenetic targets and therapies in cancer medicine. Cancer discovery 2, 405-413.
Rhodes, D. R., Yu, J., Shanker, K., Deshpande, N., Varambally, R., Ghosh, D., Barrette, T., Pandey, A., and Chinnaiyan, A. M. (2004). ONCOMINE: a cancer microarray database and integrated data-mining platform. Neoplasia 6, 1-6.
Sharma, A., Markey, M., Torres-Munoz, K., Varia, S., Kadakia, M., Bubulya, A., and Bubulya, P.A. (201 1). Son maintains accurate splicing for a subset of human pre-mRNAs. Journal of cell science 124, 4286-4298.
Shilatifard, A. (2012). The COMPASS family of histone H3K4 methylases: mechanisms of regulation in development and disease pathogenesis. Annual review of biochemistry 81 , 65-95.
Stegmaier, K., Ross, K. N., Colavito, S. A., O'Malley, S., Stockwell, B. R., and Golub, T. R. (2004). Gene expression-based high-throughput screening(GE-HTS) and application to leukemia differentiation. Nat Genet 36, 257-263.
Steward, M.M., Lee, J.S., O'Donovan, A., Wyatt, M., Bernstein, B.E., and Shilatifard, A. (2006). Molecular regulation of H3K4 trimethylation by ASH2L, a shared subunit of MLL complexes. Nature structural & molecular biology 13, 852-854.
Sun, C.T., Lo, W.Y., Wang, I.H., Lo, Y.H., Shiou, S.R., Lai, C.K., and Ting, L.P. (2001). transcription repression of human hepatitis B virus genes by negative regulatory element-binding protein/SON. The Journal of biological chemistry 276, 24059-24067.
Thiel, A.T., Blessington, P., Zou, T., Feather, D., Wu, X., Yan, J., Zhang, H., Liu, Z., Ernst, P., Koretzky, G.A., et al. (2010). MLL-AF9-induced leukemogenesis requires coexpression of the wild-type Mil allele. Cancer cell 17, 148-159.
Viale, A., De Franco, F., Orleth, A., Cambiaghi, V., Giuliani, V., Bossi, D., Ronchini, C, Ronzoni, S., Muradore, I., Monestiroli, S., et al. (2009). Cell-cycle restriction limits DNA damage and maintains self-renewal of leukaemia stem cells. Nature 457, 51-56.
Wynn, S.L., Fisher, R.A., Pagel, C, Price, M., Liu, Q.Y., Khan, I.M., Zammit, P., Dadrah, K., Mazrani, W., Kessling, A., et al. (2000). Organization and conservation of the GART/SON/DONSON locus in mouse and human genomes. Genomics 68, 57-62.
Xu, B., Li, S.H., Zheng, R., Gao, S.B., Ding, L.H., Yin, Z.Y., Lin, X., Feng, Z.J., Zhang, S., Wang, X.M., et al. (2013). Menin promotes hepatocellular carcinogenesis and epigenetically up-regulates Yapl transcription. Proceedings of the National Academy of Sciences of the United States of America 110, 17480-17485.
Yan, M., Kanbe, E., Peterson, L. F., Boyapati, A., Miao, Y., Wang, Y., Chen, I. M., Chen, Z., Rowley, J. D., Willman, C. L., and Zhang, D. E. (2006). A previously unidentified alternatively spliced isoform of t(8;21) transcript promotes leukemogenesis. Nat Med 12, 945-949.
Yokoyama, A., and Cleary, M.L. (2008). Menin critically links MLL proteins with LEDGF on cancer-associated target genes. Cancer cell 14, 36-46.
Yokoyama, A., Somervaille, T.C., Smith, K.S., Rozenblatt-Rosen, O., Meyerson, M., and Cleary, M.L. (2005). The menin tumor suppressor protein is an essential oncogenic cofactor for MLL-associated leukemogenesis. Cell 123, 207-218.
[0224] The term "comprising" as used herein is synonymous with "including," "containing," or "characterized by," and is inclusive or open-ended and does not exclude additional, unrecited elements or method steps.
[0225] The above description discloses several methods and materials of the present invention. This invention is susceptible to modifications in the methods and materials, as well as alterations in the fabrication methods and equipment. Such modifications will become apparent to those skilled in the art from a consideration of this disclosure or practice of the invention disclosed herein. Consequently, it is not intended that this invention be limited to the specific embodiments disclosed herein, but that it cover all modifications and alternatives coming within the true scope and spirit of the invention.
[0226] All references cited herein, including but not limited to published and unpublished applications, patents, and literature references, are incorporated herein by reference in their entirety and are hereby made a part of this specification. To the extent publications and patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.
Claims
1. A method of diagnosing a hematopoietic malignancy or a likelihood of developing a hematopoietic malignancy in a test subject comprising:
determining a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject,
wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene, and the long polypeptide is encoded by a full length transcript of the SON gene.
2. A method of detecting a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject comprising:
contacting a nucleic acid or polypeptide sample from the test subject with an agent which indicates the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject;
contacting the nucleic acid or polypeptide sample with an agent which indicates the level of the long polypeptide or a nucleic acid encoding the long polypeptide in the test subject; and
determining the ratio for the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of the long polypeptide or the level of a nucleic acid encoding the long polypeptide in the test subject,
wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene, and the long polypeptide is encoded by a full length transcript of the SON gene.
3. The method of claim 1 or 2, wherein the ratio is greater than about 30%.
4. The method of any one of claims 1-3, wherein the ratio is greater than about
40%.
5. The method of any one of claims 1-4, wherein the ratio is greater than about
50%.
6. A method of determining the level of a short polypeptide or a nucleic acid encoding the polypeptide in a nucleic acid or polypeptide sample from a test subject comprising:
contacting the nucleic acid or polypeptide sample from the test subject with an agent which indicates the level of the short polypeptide or which indicates the level of a nucleic acid encoding the short polypeptide in the sample; and
determining the level of the short polypeptide or a nucleic acid encoding the short polypeptide in the sample, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
7. A method of diagnosing a hematopoietic malignancy or a likelihood of developing a hematopoietic malignancy in a test subject comprising: determining the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in a sample obtained from a test subject, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
8. The method of any one of claims 1-7, further comprising comparing the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a sample obtained from the test subject with the level of the short polypeptide or the level of a nucleic acid encoding the short polypeptide in a subject not having a hematopoietic malignancy.
9. The method of any one of claims 1-8, further comprising determining the level of a long polypeptide or a nucleic acid encoding the long polypeptide in the test subject, wherein the long polypeptide is encoded by a full length transcript of the SON gene.
10. The method of claim 9, further comprising determining a ratio for the level of a short polypeptide or the level of a nucleic acid encoding the short polypeptide in the test subject to the level of a long polypeptide or the level of a nucleic acid encoding the long polypeptide in the sample from the test subject
1 1. The method of claim 10, wherein the ratio is greater than about 30%.
12. The method of claim 10, wherein the ratio is greater than about 40%.
13. The method of claim 10, wherein the ratio is greater than about 50%.
14. The method of any one of claims 1-13, further comprising determining the level of an additional polypeptide in addition to the level of the short form of the SON polypeptide or the level of a nucleic acid encoding the additional polypeptide in addition to the level of a nucleic acid encoding the short form of the SON polypeptide in a test subject, wherein the additional polypeptide is encoded by a gene selected from CDK 1A, GF11 and ATF3.
15. The method of claim 14, further comprising comparing the level of an additional polypeptide or the level of a nucleic acid encoding the additional polypeptide in the test subject with the level of an additional polypeptide or the level of a nucleic acid encoding the additional polypeptide in a subject not having a hematopoietic malignancy.
16. The method of any one of claims 1-15, wherein the truncated alternatively- spliced transcript of a SON gene comprises a SON E variant of a SON gene.
17. The method of any one of claims 1-16, wherein the truncated alternatively- spliced transcript of a SON gene comprises a SON B variant of a SON gene.
18. The method of claim any one of claims 1-17, wherein the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10.
19. The method of any one of claims 1-18, wherein the short polypeptide comprises SEQ ID NOs:88 or 90.
20. The method of any one of claims 1-19, wherein the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
21. The method of any one of claims 1 -20, wherein the full length transcript of the SON gene comprises a sequence which specifically binds to an oligonucleotide comprising a sequence selected from SEQ ID NOs: 13-18.
22. The method of any one of claims 1 -21 , wherein the long polypeptide comprises SEQ ID NO:86.
23. The method of any one of claims 1-22, wherein the nucleic acid encoding the long polypeptide comprises SEQ ID NO:85.
24. The method of any one of claims 1-23, wherein the SON gene is a human SON gene.
25. The method of any one of claims 1-24, further comprising obtaining a sample from the test subject.
26. The method of claim 25, further comprising contacting the sample with an agent which specifically binds to the short polypeptide or a nucleic acid encoding the short polypeptide.
27. The method of claim 25, further comprising contacting the sample with an agent which specifically binds to the long polypeptide or a nucleic acid encoding the long polypeptide.
28. The method of claim 26 or 27, wherein the agent is a primer or a hybridization probe.
29. The method of claim 28, wherein the primer or the hybridization probe comprises an oligonucleotide comprising SEQ ID NOs:06, 10 or 13-18
30. The method of claim 26 or 27, wherein the agent is an antibody or antigen- binding fragment thereof which specifically binds to the short polypeptide.
31. The method of claim 26 or 27, wherein the agent is an antibody or antigen- binding fragment thereof which specifically binds to the long polypeptide.
32. The method of any one of claims 26-31, wherein the agent is attached to a solid support.
33. The method of any one of claims 2-32, wherein the sample comprises nucleic acids or polypeptides.
34. The method of any one of claims 1 -33, further comprising determining an increased level of a polypeptide encoded by a SON target gene or a nucleic acid encoded by a SON target gene in the sample from the test subject, wherein the SON target gene has a SON binding site.
35. The method of claim 34, wherein the SON target gene is selected from the group consisting of FOX03, NOTCH2NL, SRC, GADD45A, CD N1A, ATF3, GFI1, EGR1, SRC, and GFI1B.
36. The method of any one of claims 1-35, further comprising determining an increased level of a MLL multi-protein complex in the sample from the test subject, wherein the MLL is selected from the group consisting of MLL1, MLL-N, MLL-C, and MLL2.
37. The method of claim 36, wherein the multiprotein complex comprises a protein selected from menin, ASH2L, LEDGF, and WDR5.
38. The method of any one of claims 1-37 further comprising determining an increased replating activity of a hematopoietic progenitor cell of the sample from the test subject.
39. The method of claim 38, wherein the hematopoietic progenitor cell is a bone marrow cell.
40. The method of any one of claims 1-39, further comprising determining an increased binding of a protein at the or adjacent to the transcription start site of a SON target gene in the sample from the test subject.
41. The methods of claim 40, wherein the protein is selected from the group consisting of MLL1, MLL-N, MLL-C, MLL2, WDR5, ASH2L, menin, SET1A, SET IB, ASC2, and SUZ12.
42. The method of claim 40 or 41, wherein the SON target gene is selected from the group consisting of CDKN1A, GADD45A, NOTCH2NL, ATF3, GFI1, EGR1, and SRC.
43. The method of any one of claims 1-42 further comprising determining an increased level in H3K4me3 in the sample from the test subject.
44. The method of claim 43, wherein the level in H3K4me3 is increased at the or adjacent to a SON target gene selected from the group consisting of CDKN1 A, GADD45A, NOTCH2NL, ATF3, GFI1, EGR1, and SRC.
45. The method of any one of claims 2-44, wherein the sample comprises bone marrow mononuclear cells (BM-MNCs), or peripheral blood mononuclear cells (PB-MNCs).
46. The method of any one of claims 1-45, wherein the hematopoietic malignancy is selected from acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS).
47. The method of any one of claims 1-46, wherein the hematopoietic malignancy is an acute myeloid leukemia (AML) subtype selected from M2 without t(8;21), and t(8;21)(q22;q22).
48. The method of any one of claims 1-47, wherein the test subject is mammalian.
49. The method of any one of claims 1-48, wherein the test subject is human.
50. A method of diagnosing a hematopoietic malignancy or a likelihood of developing a hematopoietic malignancy in a test subject comprising: detecting increased levels of binding of a short polypeptide with a nucleic acid having a SON binding site in a sample from a test subject, wherein the short polypeptide is encoded by a truncated alternatively-spliced transcript of a SON gene.
51. The method of claim 50, wherein the truncated alternatively-spliced transcript of a SON gene comprises a SON E variant of a SON gene.
52. The method of any one of claims 50-51, wherein the truncated alternatively- spliced transcript of a SON gene comprises a SON B variant of a SON gene.
53. The method of claim any one of claims 50-52, wherein the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10.
54. The method of any one of claims 50-53, wherein the short polypeptide comprises SEQ ID NOs:88 or 90.
55. The method of any one of claims 50-54, wherein the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
56. The method of any one of claims 50-55, wherein the hematopoietic malignancy is selected from acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS).
57. The method of any one of claims 50-56, wherein the hematopoietic malignancy is an acute myeloid leukemia (AML) subtype selected from M2 without t(8;21), and t(8;21)(q22;q22).
58. The method of any one of claims 50-57, wherein the subject is mammalian.
59. The method of any one of claims 50-58, wherein subject is human.
60. A method of ameliorating a hematopoietic malignancy comprising: increasing the level of a full length SON polypeptide or a nucleic acid encoding a full length SON polypeptide in a cell of a subject.
61. The method of claim 60, wherein the expression level of a nucleic acid encoding said full length SON polypeptide or the expression level of said full length SON polypeptide is increased by administering a composition comprising a nucleic acid to the subject.
62. The method of claim 60 or 61 , wherein the nucleic acid comprises a sequence encoding a full length SON polypeptide.
63. The method of any one of claims 60-62, wherein the SON polypeptide is a human SON polypeptide.
64. The method of any one of claims 60-63, wherein the nucleic acid comprises SEQ ID NO.:85.
65. The method of any one of claims 60-64, wherein the polypeptide comprises SEQ ID NO.:86.
66. The method of any one of claims 60-65, wherein the hematopoietic malignancy is selected from acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS).
67. The method of any one of claims 60-66, wherein the hematopoietic malignancy is an acute myeloid leukemia (AML) subtype selected from M2 without t(8;21), and t(8;21)(q22;q22).
68. The method of any one of claims 60-67, wherein the subject is mammalian.
69. The method of any one of claims 60-68, wherein subject is human.
70. A kit for the diagnosis of a hematopoietic malignancy in a test subject comprising: an agent that specifically binds to a short polypeptide or a nucleic acid encoding the short polypeptide, wherein the short polypeptide is encoded by a truncated alternatively- spliced transcript of a SON gene.
71. The kit of claim 70, further comprising an agent that specifically binds to a long polypeptide or a nucleic acid encoding the long polypeptide, wherein the long polypeptide is encoded by a full length transcript of the SON gene.
72. The kit of any one of claims 70-71, further comprising an agent that specifically binds to an additional polypeptide in addition to the short polypeptide or a nucleic acid encoding the additional polypeptide in addition to the nucleic acid encoding the short polypeptide in the test subject, wherein the additional polypeptide is encoded by a gene selected from CDK 1A, GF11 and ATF3.
73. The kit of any one of claims 70-72, wherein the truncated alternatively-spliced transcript of a SON gene comprises a SON E variant of a SON gene.
74. The kit of any one of claims 70-73, wherein the truncated alternatively-spliced transcript of a SON gene comprises a SON B variant of a SON gene.
75. The kit of any one of claims 70-74, wherein the nucleic acid encoding the short polypeptide comprises a sequence which specifically binds to an oligonucleotide comprising SEQ ID NOs:06 or 10.
76. The kit of any one of claims 70-75, wherein the short polypeptide comprises SEQ ID NOs:88 or 90.
77. The kit of any one of claims 70-76, wherein the nucleic acid encoding the short polypeptide comprises SEQ ID NOs:87 or 89.
78. The kit of any one of claims 70-77, wherein the full length transcript of the SON gene comprises a sequence which specifically binds to an oligonucleotide comprising a sequence selected from SEQ ID NOs: 13-18.
79. The kit of any one of claims 70-78, wherein the long polypeptide comprises SEQ ID NO:86.
80. The kit of any one of claims 70-79, wherein the nucleic acid encoding the long polypeptide comprises SEQ ID NO:85.
81. The kit of any one of claims 70-80, wherein the SON gene is a human SON gene.
82. The kit of any one of claims 70-81, further comprising an agent that specifically binds to a SON target gene, wherein the SON target gene has a SON binding site.
83. The kit of claim 82, wherein the SON target gene is selected from the group consisting of FOX03, NOTCH2NL, SRC, GADD45A, CDKN 1 A, ATF3, GFI1, EGR1, SRC, and GFI1B.
84. The kit of any one of claims 70-83, further comprising an agent that specifically binds to a protein selected from MLL1, MLL-N, MLL-C, MLL2, menin, ASH2L, LEDGF, WDR5, SET1A, SET IB, ASC2, SUZ12, and H3K4me3.
85. The kit of any one of claims 70-84, wherein the agent is a primer or a hybridization probe.
86. The kit of any one of claims 70-85, wherein the primer or the hybridization probe comprises an oligonucleotide comprising SEQ ID NOs:06, 10, or 13-18.
87. The kit of any one of claims 70-86, wherein the agent is an antibody or antigen-bind fragment thereof which specifically binds to the short polypeptide.
88. The kit of any one of claims 70-87, wherein the agent is attached to a solid support.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2016/064442 WO2018101950A1 (en) | 2016-12-01 | 2016-12-01 | Methods and compositions for the diagnosis of acute myeloid leukemia |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2016/064442 WO2018101950A1 (en) | 2016-12-01 | 2016-12-01 | Methods and compositions for the diagnosis of acute myeloid leukemia |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018101950A1 true WO2018101950A1 (en) | 2018-06-07 |
Family
ID=62241821
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2016/064442 WO2018101950A1 (en) | 2016-12-01 | 2016-12-01 | Methods and compositions for the diagnosis of acute myeloid leukemia |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2018101950A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPWO2020013233A1 (en) * | 2018-07-10 | 2021-08-02 | 国立大学法人 東京大学 | ALS biomarkers and methods for diagnosing ALS |
EP4232020A4 (en) * | 2020-10-21 | 2024-08-07 | Kura Oncology, Inc. | TREATMENT OF HEMATOLOGICAL MALIGNANCIES WITH MENIN INHIBITORS |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5643729A (en) * | 1994-02-24 | 1997-07-01 | Boehringer Ingelheim International Gmbh | Methods for diagnosing cancer, precancerous state, or susceptibility to other forms of diseases by detecting an acceleration of exon skipping in IRF-1 mRNA |
JP2010133880A (en) * | 2008-12-05 | 2010-06-17 | Hokkaido Univ | Method for detecting leukemia cell and method for inspecting resistivity to therapy of leukemia |
JP5371778B2 (en) * | 2007-12-28 | 2013-12-18 | Lsipファンド運営合同会社 | Method for detecting leukemia cells |
US20160060709A1 (en) * | 2013-04-30 | 2016-03-03 | Université de Montréal | Novel biomarkers for acute myeloid leukemia |
-
2016
- 2016-12-01 WO PCT/US2016/064442 patent/WO2018101950A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5643729A (en) * | 1994-02-24 | 1997-07-01 | Boehringer Ingelheim International Gmbh | Methods for diagnosing cancer, precancerous state, or susceptibility to other forms of diseases by detecting an acceleration of exon skipping in IRF-1 mRNA |
JP5371778B2 (en) * | 2007-12-28 | 2013-12-18 | Lsipファンド運営合同会社 | Method for detecting leukemia cells |
JP2010133880A (en) * | 2008-12-05 | 2010-06-17 | Hokkaido Univ | Method for detecting leukemia cell and method for inspecting resistivity to therapy of leukemia |
US20160060709A1 (en) * | 2013-04-30 | 2016-03-03 | Université de Montréal | Novel biomarkers for acute myeloid leukemia |
Non-Patent Citations (1)
Title |
---|
KIM ET AL.: "SON and its alternatively spliced isoforms control MLL complex-mediated H3K4me3 and transcription of leukemia-associated genes", MOLECULAR CELL, vol. 61, no. 6, 17 March 2016 (2016-03-17), pages 859 - 873, XP029464133 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPWO2020013233A1 (en) * | 2018-07-10 | 2021-08-02 | 国立大学法人 東京大学 | ALS biomarkers and methods for diagnosing ALS |
JP7255897B2 (en) | 2018-07-10 | 2023-04-11 | 国立大学法人 東京大学 | Biomarkers for ALS and methods for diagnosing ALS |
EP4232020A4 (en) * | 2020-10-21 | 2024-08-07 | Kura Oncology, Inc. | TREATMENT OF HEMATOLOGICAL MALIGNANCIES WITH MENIN INHIBITORS |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Pan et al. | A novel protein encoded by exosomal CircATG4B induces oxaliplatin resistance in colorectal cancer by promoting autophagy | |
Pang et al. | Peptide SMIM30 promotes HCC development by inducing SRC/YES1 membrane anchoring and MAPK pathway activation | |
Wu et al. | A novel protein encoded by circular SMO RNA is essential for Hedgehog signaling activation and glioblastoma tumorigenicity | |
Liu et al. | PRMT5-dependent transcriptional repression of c-Myc target genes promotes gastric cancer progression | |
Wan et al. | Splicing factor SRSF1 promotes pancreatitis and KRASG12D-mediated pancreatic cancer | |
Qi et al. | OVOL2 links stemness and metastasis via fine-tuning epithelial-mesenchymal transition in nasopharyngeal carcinoma | |
Qi et al. | CDK13 upregulation-induced formation of the positive feedback loop among circCDK13, miR-212-5p/miR-449a and E2F5 contributes to prostate carcinogenesis | |
AU2014333513B2 (en) | Method for the prognosis and treatment of metastasizing cancer of the bone originating from breast cancer | |
WO2011133424A2 (en) | Genetic amplification of iqgap1 in cancer | |
Chen et al. | TBC1D8 amplification drives tumorigenesis through metabolism reprogramming in ovarian cancer | |
JP2015109806A (en) | Method for detecting new ret fused body | |
US20190338290A1 (en) | Treatment of sarcoma | |
Dong et al. | Rictor promotes cell migration and actin polymerization through regulating ABLIM1 phosphorylation in Hepatocellular Carcinoma | |
Bullock et al. | Thyroid transcription factor FOXE1 interacts with ETS factor ELK1 to co-regulate TERT | |
Bie et al. | HOTAIR competitively binds MiRNA330 as a molecular sponge to increase the resistance of gastric cancer to trastuzumab | |
WO2018101950A1 (en) | Methods and compositions for the diagnosis of acute myeloid leukemia | |
Leach et al. | Simultaneous inhibition of TRIM24 and TRIM28 sensitises prostate cancer cells to antiandrogen therapy, decreasing VEGF signalling and angiogenesis | |
Raguz et al. | Production of P‐glycoprotein from the MDR1 upstream promoter is insufficient to affect the response to first‐line chemotherapy in advanced breast cancer | |
JP7109040B2 (en) | fibrosis inhibitor | |
US20240068037A1 (en) | Methods and compositions for diagnosing and treating prostate cancer based on long noncoding rna overlapping the lck gene that regulates prostate cancer cell growth | |
Xu et al. | Engrailed‐1 Promotes Pancreatic Cancer Metastasis | |
CN106636368A (en) | Application of miR-130a to diagnosis, treatment and prognosis of ovarian cancer | |
Dong et al. | Methyltransferase-like 3–catalysed N6-methyladenosine methylation facilitates the contribution of vascular smooth muscle cells to atherosclerosis | |
Raju et al. | NFAT2 drives both Orai3 transcription and protein degradation by harnessing the differences in epigenetic landscape of MARCH8 E3 ligase | |
An et al. | Age associated high level of major vault protein is p53 dependent |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16923098 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 16923098 Country of ref document: EP Kind code of ref document: A1 |