WO1997037041A9 - Dna sequencing by mass spectrometry - Google Patents
Dna sequencing by mass spectrometryInfo
- Publication number
- WO1997037041A9 WO1997037041A9 PCT/US1997/004394 US9704394W WO9737041A9 WO 1997037041 A9 WO1997037041 A9 WO 1997037041A9 US 9704394 W US9704394 W US 9704394W WO 9737041 A9 WO9737041 A9 WO 9737041A9
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- mass
- modified
- nucleic acid
- fragments
- base
- Prior art date
Links
- 238000004949 mass spectrometry Methods 0.000 title claims abstract description 113
- 238000001712 DNA sequencing Methods 0.000 title abstract description 52
- 239000012634 fragment Substances 0.000 claims abstract description 141
- 238000000034 method Methods 0.000 claims abstract description 141
- 239000001226 triphosphate Substances 0.000 claims abstract description 64
- 239000000523 sample Substances 0.000 claims abstract description 57
- 235000011178 triphosphate Nutrition 0.000 claims abstract description 55
- 108091034117 Oligonucleotide Proteins 0.000 claims abstract description 42
- 238000004458 analytical method Methods 0.000 claims abstract description 24
- 239000003153 chemical reaction reagent Substances 0.000 claims abstract description 12
- 150000007523 nucleic acids Chemical class 0.000 claims description 197
- 108020004707 nucleic acids Proteins 0.000 claims description 164
- 102000039446 nucleic acids Human genes 0.000 claims description 164
- 125000003729 nucleotide group Chemical group 0.000 claims description 92
- 238000012163 sequencing technique Methods 0.000 claims description 90
- 239000002773 nucleotide Substances 0.000 claims description 83
- 239000007787 solid Substances 0.000 claims description 47
- 230000008569 process Effects 0.000 claims description 32
- 239000000203 mixture Substances 0.000 claims description 29
- 150000002500 ions Chemical class 0.000 claims description 24
- 238000003795 desorption Methods 0.000 claims description 19
- 239000011159 matrix material Substances 0.000 claims description 17
- 230000000295 complement effect Effects 0.000 claims description 13
- 238000013467 fragmentation Methods 0.000 claims description 13
- 238000006062 fragmentation reaction Methods 0.000 claims description 13
- 238000005406 washing Methods 0.000 claims description 13
- 150000003212 purines Chemical class 0.000 claims description 11
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 claims description 10
- 239000006227 byproduct Substances 0.000 claims description 9
- 239000000376 reactant Substances 0.000 claims description 9
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical class O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 claims description 7
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical class O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 claims description 7
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical class NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 claims description 6
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 claims description 6
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 claims description 6
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical class CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 claims description 6
- 239000002243 precursor Substances 0.000 claims description 5
- 239000002213 purine nucleotide Substances 0.000 claims description 5
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical class NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 claims description 4
- 230000004069 differentiation Effects 0.000 claims description 4
- 230000002441 reversible effect Effects 0.000 claims description 4
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 claims description 3
- 229930024421 Adenine Natural products 0.000 claims description 2
- 229960000643 adenine Drugs 0.000 claims description 2
- 230000001143 conditioned effect Effects 0.000 claims description 2
- 230000003100 immobilizing effect Effects 0.000 claims 2
- 125000000561 purinyl group Chemical group N1=C(N=C2N=CNC2=C1)* 0.000 claims 2
- PEHVGBZKEYRQSX-UHFFFAOYSA-N 6-amino-7-deazapurine Natural products NC1=NC=NC2=C1C=CN2 PEHVGBZKEYRQSX-UHFFFAOYSA-N 0.000 claims 1
- DPRSKJHWKNHBOW-UHFFFAOYSA-N 7-Deazainosine Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2C=C1 DPRSKJHWKNHBOW-UHFFFAOYSA-N 0.000 claims 1
- DPRSKJHWKNHBOW-KCGFPETGSA-N 7-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-1h-pyrrolo[2,3-d]pyrimidin-4-one Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(NC=NC2=O)=C2C=C1 DPRSKJHWKNHBOW-KCGFPETGSA-N 0.000 claims 1
- GPJICFPVOAERJL-RAWIJENESA-N 9-deazainosine Chemical compound O[C@@H]1[C@@H](O)[C@H](CO)O[C@H]1C1=CN=C2C(=O)NC=N[C]12 GPJICFPVOAERJL-RAWIJENESA-N 0.000 claims 1
- 108020004414 DNA Proteins 0.000 abstract description 98
- -1 nucleoside triphosphates Chemical class 0.000 abstract description 48
- 239000002777 nucleoside Substances 0.000 abstract description 39
- 238000012986 modification Methods 0.000 abstract description 31
- 238000007480 sanger sequencing Methods 0.000 abstract description 31
- 238000009396 hybridization Methods 0.000 abstract description 11
- 238000001962 electrophoresis Methods 0.000 abstract description 10
- 238000005516 engineering process Methods 0.000 abstract description 9
- 230000006872 improvement Effects 0.000 abstract description 6
- 238000006467 substitution reaction Methods 0.000 abstract description 4
- 238000000816 matrix-assisted laser desorption--ionisation Methods 0.000 abstract description 2
- 239000013615 primer Substances 0.000 description 147
- 239000000047 product Substances 0.000 description 87
- 238000006243 chemical reaction Methods 0.000 description 78
- 239000000243 solution Substances 0.000 description 52
- 238000003752 polymerase chain reaction Methods 0.000 description 46
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 44
- SJRJJKPEHAURKC-UHFFFAOYSA-N N-Methylmorpholine Chemical compound CN1CCOCC1 SJRJJKPEHAURKC-UHFFFAOYSA-N 0.000 description 38
- 238000001228 spectrum Methods 0.000 description 33
- 239000011324 bead Substances 0.000 description 31
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 26
- 108091028043 Nucleic acid sequence Proteins 0.000 description 26
- 238000001840 matrix-assisted laser desorption--ionisation time-of-flight mass spectrometry Methods 0.000 description 25
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 24
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 24
- 239000011541 reaction mixture Substances 0.000 description 23
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 21
- 230000015572 biosynthetic process Effects 0.000 description 21
- 230000004048 modification Effects 0.000 description 21
- 238000003786 synthesis reaction Methods 0.000 description 21
- 238000013459 approach Methods 0.000 description 20
- 239000000872 buffer Substances 0.000 description 20
- 230000000875 corresponding effect Effects 0.000 description 20
- 239000002299 complementary DNA Substances 0.000 description 19
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 18
- YXFVVABEGXRONW-UHFFFAOYSA-N Toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 description 18
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 17
- 125000002264 triphosphate group Chemical class [H]OP(=O)(O[H])OP(=O)(O[H])OP(=O)(O[H])O* 0.000 description 17
- 239000012528 membrane Substances 0.000 description 16
- 238000001514 detection method Methods 0.000 description 15
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 15
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 14
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 14
- NQRYJNQNLNOLGT-UHFFFAOYSA-N Piperidine Chemical compound C1CCNCC1 NQRYJNQNLNOLGT-UHFFFAOYSA-N 0.000 description 14
- JUJWROOIHBZHMG-UHFFFAOYSA-N Pyridine Chemical compound C1=CC=NC=C1 JUJWROOIHBZHMG-UHFFFAOYSA-N 0.000 description 14
- 238000001254 matrix assisted laser desorption--ionisation time-of-flight mass spectrum Methods 0.000 description 14
- 238000011282 treatment Methods 0.000 description 14
- AVBGNFCMKJOFIN-UHFFFAOYSA-N triethylammonium acetate Chemical compound CC(O)=O.CCN(CC)CC AVBGNFCMKJOFIN-UHFFFAOYSA-N 0.000 description 14
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 13
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 13
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 13
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 12
- 230000006820 DNA synthesis Effects 0.000 description 12
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 12
- OFBQJSOFQDEBGM-UHFFFAOYSA-N Pentane Chemical compound CCCCC OFBQJSOFQDEBGM-UHFFFAOYSA-N 0.000 description 12
- 238000003776 cleavage reaction Methods 0.000 description 12
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 12
- 238000000746 purification Methods 0.000 description 12
- 238000010561 standard procedure Methods 0.000 description 12
- 239000013598 vector Substances 0.000 description 12
- 239000000499 gel Substances 0.000 description 11
- 239000002904 solvent Substances 0.000 description 11
- 241000894007 species Species 0.000 description 11
- 238000012546 transfer Methods 0.000 description 11
- JGFZNNIVVJXRND-UHFFFAOYSA-N N,N-Diisopropylethylamine (DIPEA) Chemical compound CCN(C(C)C)C(C)C JGFZNNIVVJXRND-UHFFFAOYSA-N 0.000 description 10
- 150000001793 charged compounds Chemical class 0.000 description 10
- 239000007858 starting material Substances 0.000 description 10
- 239000000126 substance Substances 0.000 description 10
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 9
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 9
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 9
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 9
- ZMANZCXQSJIPKH-UHFFFAOYSA-N Triethylamine Chemical compound CCN(CC)CC ZMANZCXQSJIPKH-UHFFFAOYSA-N 0.000 description 9
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical group NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 9
- 230000027832 depurination Effects 0.000 description 9
- 230000007017 scission Effects 0.000 description 9
- 238000010898 silica gel chromatography Methods 0.000 description 9
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 8
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 8
- 102000004190 Enzymes Human genes 0.000 description 8
- 108090000790 Enzymes Proteins 0.000 description 8
- 238000003559 RNA-seq method Methods 0.000 description 8
- 239000002253 acid Substances 0.000 description 8
- 238000007792 addition Methods 0.000 description 8
- 230000003247 decreasing effect Effects 0.000 description 8
- 229960004592 isopropanol Drugs 0.000 description 8
- 230000035945 sensitivity Effects 0.000 description 8
- LBSDTBJWUJIFBO-UHFFFAOYSA-N (2,3,4,5,6-pentafluorophenyl) 2-(9h-fluoren-9-ylmethoxycarbonylamino)acetate Chemical compound FC1=C(F)C(F)=C(F)C(F)=C1OC(=O)CNC(=O)OCC1C2=CC=CC=C2C2=CC=CC=C21 LBSDTBJWUJIFBO-UHFFFAOYSA-N 0.000 description 7
- 239000004471 Glycine Substances 0.000 description 7
- HDRRAMINWIWTNU-NTSWFWBYSA-N [[(2s,5r)-5-(2-amino-6-oxo-3h-purin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@H]1CC[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HDRRAMINWIWTNU-NTSWFWBYSA-N 0.000 description 7
- QGZKDVFQNNGYKY-UHFFFAOYSA-N ammonia Natural products N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 7
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 7
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 7
- URGJWIFLBWJRMF-JGVFFNPUSA-N ddTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)CC1 URGJWIFLBWJRMF-JGVFFNPUSA-N 0.000 description 7
- 238000001502 gel electrophoresis Methods 0.000 description 7
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 7
- 238000010348 incorporation Methods 0.000 description 7
- 230000003993 interaction Effects 0.000 description 7
- 239000000463 material Substances 0.000 description 7
- 239000008188 pellet Substances 0.000 description 7
- 108090000623 proteins and genes Proteins 0.000 description 7
- UMJSCPRVCHMLSP-UHFFFAOYSA-N pyridine Natural products COC1=CC=CN=C1 UMJSCPRVCHMLSP-UHFFFAOYSA-N 0.000 description 7
- 150000003254 radicals Chemical class 0.000 description 7
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 7
- QSECPQCFCWVBKM-UHFFFAOYSA-N 2-iodoethanol Chemical compound OCCI QSECPQCFCWVBKM-UHFFFAOYSA-N 0.000 description 6
- 108010017826 DNA Polymerase I Proteins 0.000 description 6
- 102000004594 DNA Polymerase I Human genes 0.000 description 6
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 6
- 238000012408 PCR amplification Methods 0.000 description 6
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- 108010090804 Streptavidin Proteins 0.000 description 6
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical class O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 6
- ARLKCWCREKRROD-POYBYMJQSA-N [[(2s,5r)-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)CC1 ARLKCWCREKRROD-POYBYMJQSA-N 0.000 description 6
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 229960002685 biotin Drugs 0.000 description 6
- 235000020958 biotin Nutrition 0.000 description 6
- 239000011616 biotin Substances 0.000 description 6
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 6
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 6
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 230000002255 enzymatic effect Effects 0.000 description 6
- 238000001704 evaporation Methods 0.000 description 6
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 6
- 229910052757 nitrogen Inorganic materials 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- OAKPWEUQDVLTCN-NKWVEPMBSA-N 2',3'-Dideoxyadenosine-5-triphosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1CC[C@@H](CO[P@@](O)(=O)O[P@](O)(=O)OP(O)(O)=O)O1 OAKPWEUQDVLTCN-NKWVEPMBSA-N 0.000 description 5
- XNWFRZJHXBZDAG-UHFFFAOYSA-N 2-METHOXYETHANOL Chemical compound COCCO XNWFRZJHXBZDAG-UHFFFAOYSA-N 0.000 description 5
- ZKHQWZAMYRWXGA-KQYNXXCUSA-N Adenosine triphosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-N 0.000 description 5
- 102100033215 DNA nucleotidylexotransferase Human genes 0.000 description 5
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 5
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 5
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 5
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 5
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical group [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 5
- 150000007513 acids Chemical class 0.000 description 5
- 238000000137 annealing Methods 0.000 description 5
- 238000005119 centrifugation Methods 0.000 description 5
- 230000003750 conditioning effect Effects 0.000 description 5
- 238000004925 denaturation Methods 0.000 description 5
- 230000036425 denaturation Effects 0.000 description 5
- 238000001035 drying Methods 0.000 description 5
- 230000008020 evaporation Effects 0.000 description 5
- 238000010438 heat treatment Methods 0.000 description 5
- 125000005647 linker group Chemical group 0.000 description 5
- 238000001819 mass spectrum Methods 0.000 description 5
- 238000001906 matrix-assisted laser desorption--ionisation mass spectrometry Methods 0.000 description 5
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 5
- 150000003833 nucleoside derivatives Chemical class 0.000 description 5
- 238000002515 oligonucleotide synthesis Methods 0.000 description 5
- 229920001223 polyethylene glycol Polymers 0.000 description 5
- 238000002360 preparation method Methods 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- 238000000163 radioactive labelling Methods 0.000 description 5
- 238000000926 separation method Methods 0.000 description 5
- 229910052717 sulfur Inorganic materials 0.000 description 5
- 238000004885 tandem mass spectrometry Methods 0.000 description 5
- 238000004809 thin layer chromatography Methods 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- JWDFQMWEFLOOED-UHFFFAOYSA-N (2,5-dioxopyrrolidin-1-yl) 3-(pyridin-2-yldisulfanyl)propanoate Chemical compound O=C1CCC(=O)N1OC(=O)CCSSC1=CC=CC=N1 JWDFQMWEFLOOED-UHFFFAOYSA-N 0.000 description 4
- SBASXUCJHJRPEV-UHFFFAOYSA-N 2-(2-methoxyethoxy)ethanol Chemical compound COCCOCCO SBASXUCJHJRPEV-UHFFFAOYSA-N 0.000 description 4
- BRARRAHGNDUELT-UHFFFAOYSA-N 3-hydroxypicolinic acid Chemical compound OC(=O)C1=NC=CC=C1O BRARRAHGNDUELT-UHFFFAOYSA-N 0.000 description 4
- 108090001008 Avidin Proteins 0.000 description 4
- 108010008488 Glycylglycine Proteins 0.000 description 4
- 229920005654 Sephadex Polymers 0.000 description 4
- 239000012507 Sephadex™ Substances 0.000 description 4
- 102000004142 Trypsin Human genes 0.000 description 4
- 108090000631 Trypsin Proteins 0.000 description 4
- 230000001133 acceleration Effects 0.000 description 4
- 125000000217 alkyl group Chemical group 0.000 description 4
- 230000029936 alkylation Effects 0.000 description 4
- 238000005804 alkylation reaction Methods 0.000 description 4
- 238000005515 capillary zone electrophoresis Methods 0.000 description 4
- 150000001768 cations Chemical class 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 230000008878 coupling Effects 0.000 description 4
- 238000010168 coupling process Methods 0.000 description 4
- 238000005859 coupling reaction Methods 0.000 description 4
- 230000007423 decrease Effects 0.000 description 4
- 238000010511 deprotection reaction Methods 0.000 description 4
- 235000011180 diphosphates Nutrition 0.000 description 4
- 150000002148 esters Chemical class 0.000 description 4
- 238000012869 ethanol precipitation Methods 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 125000000524 functional group Chemical group 0.000 description 4
- 229940043257 glycylglycine Drugs 0.000 description 4
- 229910052736 halogen Inorganic materials 0.000 description 4
- 150000002367 halogens Chemical class 0.000 description 4
- 125000003835 nucleoside group Chemical group 0.000 description 4
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 4
- 238000005580 one pot reaction Methods 0.000 description 4
- 150000004713 phosphodiesters Chemical class 0.000 description 4
- 239000002718 pyrimidine nucleoside Substances 0.000 description 4
- 238000012552 review Methods 0.000 description 4
- 239000000741 silica gel Substances 0.000 description 4
- 229910002027 silica gel Inorganic materials 0.000 description 4
- 229960001866 silicon dioxide Drugs 0.000 description 4
- 239000011593 sulfur Substances 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 238000001269 time-of-flight mass spectrometry Methods 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- YWYZEGXAUVWDED-UHFFFAOYSA-N triammonium citrate Chemical compound [NH4+].[NH4+].[NH4+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O YWYZEGXAUVWDED-UHFFFAOYSA-N 0.000 description 4
- 239000012588 trypsin Substances 0.000 description 4
- HKRARRDDNQDLOY-UHFFFAOYSA-N (2,3,4,5,6-pentafluorophenyl) 3-(9h-fluoren-9-ylmethoxycarbonylamino)propanoate Chemical compound FC1=C(F)C(F)=C(F)C(F)=C1OC(=O)CCNC(=O)OCC1C2=CC=CC=C2C2=CC=CC=C21 HKRARRDDNQDLOY-UHFFFAOYSA-N 0.000 description 3
- NKCBLVMXMCSJQL-HISDBWNOSA-N (2r,3r,4s,5r)-5-[[[[(2r,3s,4r,5r)-5-(6-aminopurin-9-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-hydroxyphosphoryl]oxymethyl]-2-(3-aminopyridin-1-ium-1-yl)-4-hydroxyoxolan-3-olate Chemical compound NC1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)[O-])=C1 NKCBLVMXMCSJQL-HISDBWNOSA-N 0.000 description 3
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 3
- 239000003298 DNA probe Substances 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 3
- 239000002202 Polyethylene glycol Substances 0.000 description 3
- 230000006819 RNA synthesis Effects 0.000 description 3
- 229920002684 Sepharose Polymers 0.000 description 3
- 101710137500 T7 RNA polymerase Proteins 0.000 description 3
- 108010006785 Taq Polymerase Proteins 0.000 description 3
- 238000002835 absorbance Methods 0.000 description 3
- 239000008351 acetate buffer Substances 0.000 description 3
- 239000012491 analyte Substances 0.000 description 3
- 238000005571 anion exchange chromatography Methods 0.000 description 3
- 125000003118 aryl group Chemical group 0.000 description 3
- 238000000376 autoradiography Methods 0.000 description 3
- 125000000852 azido group Chemical group *N=[N+]=[N-] 0.000 description 3
- 229940000635 beta-alanine Drugs 0.000 description 3
- 244000309464 bull Species 0.000 description 3
- 230000015556 catabolic process Effects 0.000 description 3
- 238000005341 cation exchange Methods 0.000 description 3
- 239000001913 cellulose Substances 0.000 description 3
- 229920002678 cellulose Polymers 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 238000001360 collision-induced dissociation Methods 0.000 description 3
- 238000004891 communication Methods 0.000 description 3
- 238000004590 computer program Methods 0.000 description 3
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 3
- 229940093499 ethyl acetate Drugs 0.000 description 3
- 235000019439 ethyl acetate Nutrition 0.000 description 3
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 3
- 238000001914 filtration Methods 0.000 description 3
- 229910052740 iodine Inorganic materials 0.000 description 3
- 238000005342 ion exchange Methods 0.000 description 3
- ZBKFYXZXZJPWNQ-UHFFFAOYSA-N isothiocyanate group Chemical group [N-]=C=S ZBKFYXZXZJPWNQ-UHFFFAOYSA-N 0.000 description 3
- 238000002372 labelling Methods 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 229910052751 metal Inorganic materials 0.000 description 3
- 239000002184 metal Substances 0.000 description 3
- SWFDTFBWXCNRGN-UHFFFAOYSA-N phosphonato phosphate;tributylazanium Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O.CCCC[NH+](CCCC)CCCC.CCCC[NH+](CCCC)CCCC.CCCC[NH+](CCCC)CCCC.CCCC[NH+](CCCC)CCCC SWFDTFBWXCNRGN-UHFFFAOYSA-N 0.000 description 3
- 229920003023 plastic Polymers 0.000 description 3
- 239000004033 plastic Substances 0.000 description 3
- 229920002401 polyacrylamide Polymers 0.000 description 3
- 239000002244 precipitate Substances 0.000 description 3
- 238000001556 precipitation Methods 0.000 description 3
- 108090000765 processed proteins & peptides Proteins 0.000 description 3
- 125000001436 propyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])[H] 0.000 description 3
- 235000018102 proteins Nutrition 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 230000002285 radioactive effect Effects 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- 229910052709 silver Inorganic materials 0.000 description 3
- 229910021642 ultra pure water Inorganic materials 0.000 description 3
- 239000012498 ultrapure water Substances 0.000 description 3
- 238000003260 vortexing Methods 0.000 description 3
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 3
- MRUKYOQQKHNMFI-XVFCMESISA-N 1-[(2r,3r,4s,5r)-3-azido-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione Chemical compound [N-]=[N+]=N[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 MRUKYOQQKHNMFI-XVFCMESISA-N 0.000 description 2
- FALRKNHUBBKYCC-UHFFFAOYSA-N 2-(chloromethyl)pyridine-3-carbonitrile Chemical compound ClCC1=NC=CC=C1C#N FALRKNHUBBKYCC-UHFFFAOYSA-N 0.000 description 2
- QMQXVNHINOVNSD-UHFFFAOYSA-N 2-[chloro(diphenyl)methyl]-5,5-dimethoxycyclohexa-1,3-diene Chemical compound C1=CC(OC)(OC)CC=C1C(Cl)(C=1C=CC=CC=1)C1=CC=CC=C1 QMQXVNHINOVNSD-UHFFFAOYSA-N 0.000 description 2
- VKIGAWAEXPTIOL-UHFFFAOYSA-N 2-hydroxyhexanenitrile Chemical compound CCCCC(O)C#N VKIGAWAEXPTIOL-UHFFFAOYSA-N 0.000 description 2
- UYEMGAFJOZZIFP-UHFFFAOYSA-N 3,5-dihydroxybenzoic acid Chemical compound OC(=O)C1=CC(O)=CC(O)=C1 UYEMGAFJOZZIFP-UHFFFAOYSA-N 0.000 description 2
- CQVWOJSAGPFDQL-UHFFFAOYSA-N 3-iodopropan-1-ol Chemical compound OCCCI CQVWOJSAGPFDQL-UHFFFAOYSA-N 0.000 description 2
- 239000003155 DNA primer Substances 0.000 description 2
- QOSSAOTZNIDXMA-UHFFFAOYSA-N Dicylcohexylcarbodiimide Chemical compound C1CCCCC1N=C=NC1CCCCC1 QOSSAOTZNIDXMA-UHFFFAOYSA-N 0.000 description 2
- 108060002716 Exonuclease Proteins 0.000 description 2
- CTKINSOISVBQLD-UHFFFAOYSA-N Glycidol Chemical compound OCC1CO1 CTKINSOISVBQLD-UHFFFAOYSA-N 0.000 description 2
- HWMVXEKEEAIYGB-UHFFFAOYSA-K Isocitric acid, DL- Chemical compound [Na+].[Na+].[Na+].[O-]C(=O)C(O)C(C([O-])=O)CC([O-])=O HWMVXEKEEAIYGB-UHFFFAOYSA-K 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 2
- 229920002873 Polyethylenimine Polymers 0.000 description 2
- 241000722234 Pseudococcus Species 0.000 description 2
- 102000009609 Pyrophosphatases Human genes 0.000 description 2
- 108010009413 Pyrophosphatases Proteins 0.000 description 2
- 108010065868 RNA polymerase SP6 Proteins 0.000 description 2
- 229910007161 Si(CH3)3 Chemical group 0.000 description 2
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 2
- PMZURENOXWZQFD-UHFFFAOYSA-L Sodium Sulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=O PMZURENOXWZQFD-UHFFFAOYSA-L 0.000 description 2
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 2
- 101000865057 Thermococcus litoralis DNA polymerase Proteins 0.000 description 2
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical compound OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 2
- 239000007984 Tris EDTA buffer Substances 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- DBEKJYUAWIEWGT-YXINZVNLSA-N [(2r,3s,4r,5r)-4-amino-2-[[(4,4-dimethoxycyclohexa-1,5-dien-1-yl)-diphenylmethoxy]methyl]-5-(2,4-dioxopyrimidin-1-yl)oxolan-3-yl] acetate Chemical compound C1=CC(OC)(OC)CC=C1C(C=1C=CC=CC=1)(C=1C=CC=CC=1)OC[C@@H]1[C@@H](OC(C)=O)[C@@H](N)[C@H](N2C(NC(=O)C=C2)=O)O1 DBEKJYUAWIEWGT-YXINZVNLSA-N 0.000 description 2
- 238000006640 acetylation reaction Methods 0.000 description 2
- 150000001351 alkyl iodides Chemical class 0.000 description 2
- 239000002168 alkylating agent Substances 0.000 description 2
- 229940100198 alkylating agent Drugs 0.000 description 2
- 229910052782 aluminium Inorganic materials 0.000 description 2
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 229910021529 ammonia Inorganic materials 0.000 description 2
- 238000004873 anchoring Methods 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 238000000211 autoradiogram Methods 0.000 description 2
- 239000012148 binding buffer Substances 0.000 description 2
- 230000006287 biotinylation Effects 0.000 description 2
- 238000007413 biotinylation Methods 0.000 description 2
- 229910052794 bromium Inorganic materials 0.000 description 2
- 239000007795 chemical reaction product Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 238000009833 condensation Methods 0.000 description 2
- 230000005494 condensation Effects 0.000 description 2
- 235000009508 confectionery Nutrition 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 239000005546 dideoxynucleotide Substances 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- CTSPAMFJBXKSOY-UHFFFAOYSA-N ellipticine Chemical compound N1=CC=C2C(C)=C(NC=3C4=CC=CC=3)C4=C(C)C2=C1 CTSPAMFJBXKSOY-UHFFFAOYSA-N 0.000 description 2
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 2
- 229960005542 ethidium bromide Drugs 0.000 description 2
- 102000013165 exonuclease Human genes 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000003818 flash chromatography Methods 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 238000001215 fluorescent labelling Methods 0.000 description 2
- 238000004108 freeze drying Methods 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 229960002449 glycine Drugs 0.000 description 2
- 229910052737 gold Inorganic materials 0.000 description 2
- 239000010931 gold Substances 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 239000001257 hydrogen Substances 0.000 description 2
- 229910052739 hydrogen Inorganic materials 0.000 description 2
- AFQIYTIJXGTIEY-UHFFFAOYSA-N hydrogen carbonate;triethylazanium Chemical compound OC(O)=O.CCN(CC)CC AFQIYTIJXGTIEY-UHFFFAOYSA-N 0.000 description 2
- 230000007062 hydrolysis Effects 0.000 description 2
- 238000006460 hydrolysis reaction Methods 0.000 description 2
- 238000011065 in-situ storage Methods 0.000 description 2
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 2
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 description 2
- 238000000752 ionisation method Methods 0.000 description 2
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 2
- 238000000608 laser ablation Methods 0.000 description 2
- 125000005524 levulinyl group Chemical group 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000002844 melting Methods 0.000 description 2
- 230000008018 melting Effects 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- CMWYAOXYQATXSI-UHFFFAOYSA-N n,n-dimethylformamide;piperidine Chemical compound CN(C)C=O.C1CCNCC1 CMWYAOXYQATXSI-UHFFFAOYSA-N 0.000 description 2
- FFUDCIXZEUGJLO-DCMFLLSESA-N n-[9-[(2r,4s,5r)-5-[[bis(4-methoxyphenyl)-phenylmethoxy]methyl]-4-hydroxyoxolan-2-yl]-8-bromopurin-6-yl]benzamide Chemical compound C1=CC(OC)=CC=C1C(C=1C=CC(OC)=CC=1)(C=1C=CC=CC=1)OC[C@@H]1[C@@H](O)C[C@H](N2C3=NC=NC(NC(=O)C=4C=CC=CC=4)=C3N=C2Br)O1 FFUDCIXZEUGJLO-DCMFLLSESA-N 0.000 description 2
- 239000006199 nebulizer Substances 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 230000005257 nucleotidylation Effects 0.000 description 2
- 239000012074 organic phase Substances 0.000 description 2
- 238000010647 peptide synthesis reaction Methods 0.000 description 2
- KHIWWQKSHDUIBK-UHFFFAOYSA-N periodic acid Chemical compound OI(=O)(=O)=O KHIWWQKSHDUIBK-UHFFFAOYSA-N 0.000 description 2
- 239000012071 phase Substances 0.000 description 2
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 238000006116 polymerization reaction Methods 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 238000011176 pooling Methods 0.000 description 2
- 239000002987 primer (paints) Substances 0.000 description 2
- 230000037452 priming Effects 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- ZCCUUQDIBDJBTK-UHFFFAOYSA-N psoralen Chemical group C1=C2OC(=O)C=CC2=CC2=C1OC=C2 ZCCUUQDIBDJBTK-UHFFFAOYSA-N 0.000 description 2
- 230000005855 radiation Effects 0.000 description 2
- 238000001829 resonance ionisation spectroscopy Methods 0.000 description 2
- 239000004332 silver Substances 0.000 description 2
- SQGYOTSLMSWVJD-UHFFFAOYSA-N silver(1+) nitrate Chemical compound [Ag+].[O-]N(=O)=O SQGYOTSLMSWVJD-UHFFFAOYSA-N 0.000 description 2
- 238000004088 simulation Methods 0.000 description 2
- JQWHASGSAFIOCM-UHFFFAOYSA-M sodium periodate Chemical compound [Na+].[O-]I(=O)(=O)=O JQWHASGSAFIOCM-UHFFFAOYSA-M 0.000 description 2
- 239000011343 solid material Substances 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 238000004611 spectroscopical analysis Methods 0.000 description 2
- 229940014800 succinic anhydride Drugs 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 125000002221 trityl group Chemical group [H]C1=C([H])C([H])=C([H])C([H])=C1C([*])(C1=C(C(=C(C(=C1[H])[H])[H])[H])[H])C1=C([H])C([H])=C([H])C([H])=C1[H] 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- 238000010626 work up procedure Methods 0.000 description 2
- WMSUFWLPZLCIHP-UHFFFAOYSA-N (2,5-dioxopyrrolidin-1-yl) 9h-fluoren-9-ylmethyl carbonate Chemical compound C12=CC=CC=C2C2=CC=CC=C2C1COC(=O)ON1C(=O)CCC1=O WMSUFWLPZLCIHP-UHFFFAOYSA-N 0.000 description 1
- NJBIVXMQFIQOGE-KVQBGUIXSA-N (2r,3s,5r)-5-(6-amino-8-bromopurin-9-yl)-2-(hydroxymethyl)oxolan-3-ol Chemical class BrC1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](CO)O1 NJBIVXMQFIQOGE-KVQBGUIXSA-N 0.000 description 1
- ADFXKUOMJKEIND-UHFFFAOYSA-N 1,3-dicyclohexylurea Chemical compound C1CCCCC1NC(=O)NC1CCCCC1 ADFXKUOMJKEIND-UHFFFAOYSA-N 0.000 description 1
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 description 1
- LLIPTMWIZVIUSX-XVFCMESISA-N 1-[(2r,3r,4s,5r)-3-amino-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione Chemical class N[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 LLIPTMWIZVIUSX-XVFCMESISA-N 0.000 description 1
- IOXIQPZHBHEJKZ-SXMVTHIZSA-N 1-[(2s,4s,5r)-2-(3-aminopropanoyl)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione Chemical class C1=CC(=O)NC(=O)N1[C@@]1(C(=O)CCN)C[C@H](O)[C@@H](CO)O1 IOXIQPZHBHEJKZ-SXMVTHIZSA-N 0.000 description 1
- XLEYFDVVXLMULC-UHFFFAOYSA-N 2',4',6'-trihydroxyacetophenone Chemical compound CC(=O)C1=C(O)C=C(O)C=C1O XLEYFDVVXLMULC-UHFFFAOYSA-N 0.000 description 1
- MXHRCPNRJAMMIM-SHYZEUOFSA-N 2'-deoxyuridine Chemical group C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-SHYZEUOFSA-N 0.000 description 1
- LDQREKGAFSJNJC-UHFFFAOYSA-N 2-(2-azanylethyldisulfanyl)ethanamine Chemical compound NCCSSCCN.NCCSSCCN LDQREKGAFSJNJC-UHFFFAOYSA-N 0.000 description 1
- BZQAWHMCGZYKMH-UHFFFAOYSA-N 2-aminoethanethiol Chemical compound NCCS.NCCS BZQAWHMCGZYKMH-UHFFFAOYSA-N 0.000 description 1
- VXGRJERITKFWPL-UHFFFAOYSA-N 4',5'-Dihydropsoralen Natural products C1=C2OC(=O)C=CC2=CC2=C1OCC2 VXGRJERITKFWPL-UHFFFAOYSA-N 0.000 description 1
- VRICZARLROAIDW-UHFFFAOYSA-N 4-iodobutan-1-ol Chemical compound OCCCCI VRICZARLROAIDW-UHFFFAOYSA-N 0.000 description 1
- BTJIUGUIPKRLHP-UHFFFAOYSA-N 4-nitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1 BTJIUGUIPKRLHP-UHFFFAOYSA-N 0.000 description 1
- XVMSFILGAMDHEY-UHFFFAOYSA-N 6-(4-aminophenyl)sulfonylpyridin-3-amine Chemical compound C1=CC(N)=CC=C1S(=O)(=O)C1=CC=C(N)C=N1 XVMSFILGAMDHEY-UHFFFAOYSA-N 0.000 description 1
- ZGXJTSGNIOSYLO-UHFFFAOYSA-N 88755TAZ87 Chemical compound NCC(=O)CCC(O)=O ZGXJTSGNIOSYLO-UHFFFAOYSA-N 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- 108091093088 Amplicon Proteins 0.000 description 1
- 244000233967 Anethum sowa Species 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- PCDQPRRSZKQHHS-XVFCMESISA-N CTP Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 PCDQPRRSZKQHHS-XVFCMESISA-N 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- 229920002271 DEAE-Sepharose Polymers 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 108010043461 Deep Vent DNA polymerase Proteins 0.000 description 1
- AHCYMLUZIRLXAA-SHYZEUOFSA-N Deoxyuridine 5'-triphosphate Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 AHCYMLUZIRLXAA-SHYZEUOFSA-N 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 102000005593 Endopeptidases Human genes 0.000 description 1
- 108010059378 Endopeptidases Proteins 0.000 description 1
- 241000701832 Enterobacteria phage T3 Species 0.000 description 1
- 101000686777 Escherichia phage T7 T7 RNA polymerase Proteins 0.000 description 1
- JNCMHMUGTWEVOZ-UHFFFAOYSA-N F[CH]F Chemical group F[CH]F JNCMHMUGTWEVOZ-UHFFFAOYSA-N 0.000 description 1
- 108010026389 Gramicidin Proteins 0.000 description 1
- XKMLYUALXHKNFT-UUOKFMHZSA-N Guanosine-5'-triphosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XKMLYUALXHKNFT-UUOKFMHZSA-N 0.000 description 1
- ZIXGXMMUKPLXBB-UHFFFAOYSA-N Guatambuinine Natural products N1C2=CC=CC=C2C2=C1C(C)=C1C=CN=C(C)C1=C2 ZIXGXMMUKPLXBB-UHFFFAOYSA-N 0.000 description 1
- 108010081348 HRT1 protein Hairy Chemical group 0.000 description 1
- 102100021881 Hairy/enhancer-of-split related with YRPW motif protein 1 Human genes 0.000 description 1
- 101000610640 Homo sapiens U4/U6 small nuclear ribonucleoprotein Prp3 Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 102000004877 Insulin Human genes 0.000 description 1
- 108090001061 Insulin Proteins 0.000 description 1
- 125000001176 L-lysyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C([H])([H])C([H])([H])C([H])([H])C(N([H])[H])([H])[H] 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 229910021380 Manganese Chloride Inorganic materials 0.000 description 1
- GLFNIEUTAYBVOC-UHFFFAOYSA-L Manganese chloride Chemical compound Cl[Mn]Cl GLFNIEUTAYBVOC-UHFFFAOYSA-L 0.000 description 1
- NQTADLQHYWFPDB-UHFFFAOYSA-N N-Hydroxysuccinimide Chemical compound ON1C(=O)CCC1=O NQTADLQHYWFPDB-UHFFFAOYSA-N 0.000 description 1
- 102000002250 NAD+ Nucleosidase Human genes 0.000 description 1
- 108010000193 NAD+ Nucleosidase Proteins 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 239000004952 Polyamide Substances 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 239000004743 Polypropylene Substances 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- SUYXJDLXGFPMCQ-INIZCTEOSA-N SJ000287331 Natural products CC1=c2cnccc2=C(C)C2=Nc3ccccc3[C@H]12 SUYXJDLXGFPMCQ-INIZCTEOSA-N 0.000 description 1
- 101001110823 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) 60S ribosomal protein L6-A Proteins 0.000 description 1
- 101000712176 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) 60S ribosomal protein L6-B Proteins 0.000 description 1
- 229910000831 Steel Inorganic materials 0.000 description 1
- 241000589500 Thermus aquaticus Species 0.000 description 1
- 102000018690 Trypsinogen Human genes 0.000 description 1
- 108010027252 Trypsinogen Proteins 0.000 description 1
- 102100040374 U4/U6 small nuclear ribonucleoprotein Prp3 Human genes 0.000 description 1
- PGAVKCOVUIYSFO-XVFCMESISA-N UTP Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 PGAVKCOVUIYSFO-XVFCMESISA-N 0.000 description 1
- 238000012793 UV/ Vis spectrometry Methods 0.000 description 1
- 229910052770 Uranium Inorganic materials 0.000 description 1
- 241001265414 Vieraea Species 0.000 description 1
- KSGWVKXHBOLJGB-NNGGQVLBSA-N [[(2r,3s,4r,5r)-4-amino-4-[2-[(2-aminoacetyl)amino]acetyl]-5-(2,4-dioxopyrimidin-1-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound NCC(=O)NCC(=O)[C@@]1(N)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 KSGWVKXHBOLJGB-NNGGQVLBSA-N 0.000 description 1
- ZKHQWZAMYRWXGA-KNYAHOBESA-N [[(2r,3s,4r,5r)-5-(6-aminopurin-9-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] dihydroxyphosphoryl hydrogen phosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)O[32P](O)(O)=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KNYAHOBESA-N 0.000 description 1
- AZJLCKAEZFNJDI-DJLDLDEBSA-N [[(2r,3s,5r)-5-(4-aminopyrrolo[2,3-d]pyrimidin-7-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical group C1=CC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 AZJLCKAEZFNJDI-DJLDLDEBSA-N 0.000 description 1
- CIBAUQOHZMSSMP-UIISKDMLSA-N [[(2r,3s,5r)-5-[6-amino-8-(2-aminoacetyl)purin-9-yl]-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound NCC(=O)C1=NC2=C(N)N=CN=C2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 CIBAUQOHZMSSMP-UIISKDMLSA-N 0.000 description 1
- XXFXTBNFFMQVKJ-UHFFFAOYSA-N [diphenyl(trityloxy)methyl]benzene Chemical group C=1C=CC=CC=1C(C=1C=CC=CC=1)(C=1C=CC=CC=1)OC(C=1C=CC=CC=1)(C=1C=CC=CC=1)C1=CC=CC=C1 XXFXTBNFFMQVKJ-UHFFFAOYSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 125000000218 acetic acid group Chemical group C(C)(=O)* 0.000 description 1
- YFHNDHXQDJQEEE-UHFFFAOYSA-N acetic acid;hydrazine Chemical compound NN.CC(O)=O YFHNDHXQDJQEEE-UHFFFAOYSA-N 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 230000002152 alkylating effect Effects 0.000 description 1
- 235000001014 amino acid Nutrition 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 229960002749 aminolevulinic acid Drugs 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 150000001450 anions Chemical class 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- XKRFYHLGVUSROY-UHFFFAOYSA-N argon Substances [Ar] XKRFYHLGVUSROY-UHFFFAOYSA-N 0.000 description 1
- 229910052786 argon Inorganic materials 0.000 description 1
- 125000003710 aryl alkyl group Chemical group 0.000 description 1
- 239000012298 atmosphere Substances 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 1
- 230000029918 bioluminescence Effects 0.000 description 1
- 238000005415 bioluminescence Methods 0.000 description 1
- 229920001222 biopolymer Polymers 0.000 description 1
- OMWQUXGVXQELIX-UHFFFAOYSA-N bitoscanate Chemical compound S=C=NC1=CC=C(N=C=S)C=C1 OMWQUXGVXQELIX-UHFFFAOYSA-N 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 238000010504 bond cleavage reaction Methods 0.000 description 1
- 239000012888 bovine serum Substances 0.000 description 1
- 239000000337 buffer salt Substances 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000002144 chemical decomposition reaction Methods 0.000 description 1
- 229910052801 chlorine Inorganic materials 0.000 description 1
- 229910001914 chlorine tetroxide Inorganic materials 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000010549 co-Evaporation Methods 0.000 description 1
- 238000002485 combustion reaction Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 239000007859 condensation product Substances 0.000 description 1
- 239000005289 controlled pore glass Substances 0.000 description 1
- 229910052802 copper Inorganic materials 0.000 description 1
- 239000010949 copper Substances 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- OOTFVKOQINZBBF-UHFFFAOYSA-N cystamine Chemical compound CCSSCCN OOTFVKOQINZBBF-UHFFFAOYSA-N 0.000 description 1
- 229940099500 cystamine Drugs 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- GYOZYWVXFNDGLU-XLPZGREQSA-N dTMP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 GYOZYWVXFNDGLU-XLPZGREQSA-N 0.000 description 1
- 238000005202 decontamination Methods 0.000 description 1
- 230000003588 decontaminative effect Effects 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000005549 deoxyribonucleoside Substances 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 238000006642 detritylation reaction Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 150000004985 diamines Chemical class 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 230000003292 diminished effect Effects 0.000 description 1
- NHERQRIKJNWOOV-UHFFFAOYSA-N diphosphono hydrogen phosphate;7h-purine Chemical class C1=NC=C2NC=NC2=N1.OP(O)(=O)OP(O)(=O)OP(O)(O)=O NHERQRIKJNWOOV-UHFFFAOYSA-N 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 230000009881 electrostatic interaction Effects 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000007515 enzymatic degradation Effects 0.000 description 1
- DNJIEGIFACGWOD-UHFFFAOYSA-N ethyl mercaptane Natural products CCS DNJIEGIFACGWOD-UHFFFAOYSA-N 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 108010052305 exodeoxyribonuclease III Proteins 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 229910052731 fluorine Inorganic materials 0.000 description 1
- 125000004216 fluoromethyl group Chemical group [H]C([H])(F)* 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- 230000005021 gait Effects 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 239000003365 glass fiber Substances 0.000 description 1
- 150000002332 glycine derivatives Chemical class 0.000 description 1
- 150000002334 glycols Chemical class 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- IUAYMJGZBVDSGL-XNNAEKOYSA-N gramicidin S Chemical compound C([C@@H]1C(=O)N2CCC[C@H]2C(=O)N[C@H](C(=O)N[C@@H](CCCN)C(=O)N[C@H](C(N[C@H](CC=2C=CC=CC=2)C(=O)N2CCC[C@H]2C(=O)N[C@H](C(=O)N[C@@H](CCCN)C(=O)N[C@@H](CC(C)C)C(=O)N1)C(C)C)=O)CC(C)C)C(C)C)C1=CC=CC=C1 IUAYMJGZBVDSGL-XNNAEKOYSA-N 0.000 description 1
- 229950009774 gramicidin s Drugs 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- PYGSKMBEVAICCR-UHFFFAOYSA-N hexa-1,5-diene Chemical group C=CCCC=C PYGSKMBEVAICCR-UHFFFAOYSA-N 0.000 description 1
- NAQMVNRVTILPCV-UHFFFAOYSA-N hexane-1,6-diamine Chemical compound NCCCCCCN NAQMVNRVTILPCV-UHFFFAOYSA-N 0.000 description 1
- 125000004051 hexyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- OAKJQQAXSVQMHS-UHFFFAOYSA-O hydrazinium(1+) Chemical compound [NH3+]N OAKJQQAXSVQMHS-UHFFFAOYSA-O 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- COQRGFWWJBEXRC-UHFFFAOYSA-N hydron;methyl 2-aminoacetate;chloride Chemical compound Cl.COC(=O)CN COQRGFWWJBEXRC-UHFFFAOYSA-N 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 229940125396 insulin Drugs 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000009830 intercalation Methods 0.000 description 1
- 230000002687 intercalation Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 238000005040 ion trap Methods 0.000 description 1
- 238000001038 ionspray mass spectrometry Methods 0.000 description 1
- 238000004989 laser desorption mass spectroscopy Methods 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 239000011565 manganese chloride Substances 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 125000001434 methanylylidene group Chemical group [H]C#[*] 0.000 description 1
- XXNZVJWJWMVONK-UHFFFAOYSA-N methyl 2-[(2-aminoacetyl)amino]acetate Chemical compound COC(=O)CNC(=O)CN XXNZVJWJWMVONK-UHFFFAOYSA-N 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 239000013081 microcrystal Substances 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 230000037230 mobility Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- PSHKMPUSSFXUIA-UHFFFAOYSA-N n,n-dimethylpyridin-2-amine Chemical compound CN(C)C1=CC=CC=N1 PSHKMPUSSFXUIA-UHFFFAOYSA-N 0.000 description 1
- 229940124276 oligodeoxyribonucleotide Drugs 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- XSXHWVKGUXMUQE-UHFFFAOYSA-N osmium dioxide Inorganic materials O=[Os]=O XSXHWVKGUXMUQE-UHFFFAOYSA-N 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- VLTRZXGMWDSKGL-UHFFFAOYSA-M perchlorate Chemical compound [O-]Cl(=O)(=O)=O VLTRZXGMWDSKGL-UHFFFAOYSA-M 0.000 description 1
- QKFJKGMPGYROCL-UHFFFAOYSA-N phenyl isothiocyanate Chemical group S=C=NC1=CC=CC=C1 QKFJKGMPGYROCL-UHFFFAOYSA-N 0.000 description 1
- NMHMNPHRMNGLLB-UHFFFAOYSA-N phloretic acid Chemical compound OC(=O)CCC1=CC=C(O)C=C1 NMHMNPHRMNGLLB-UHFFFAOYSA-N 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000002985 plastic film Substances 0.000 description 1
- 229920006255 plastic film Polymers 0.000 description 1
- 229910052697 platinum Inorganic materials 0.000 description 1
- 229920002647 polyamide Polymers 0.000 description 1
- 229920000573 polyethylene Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 229920001155 polypropylene Polymers 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 1
- 125000002577 pseudohalo group Chemical group 0.000 description 1
- WHMDPDGBKYUEMW-UHFFFAOYSA-N pyridine-2-thiol Chemical compound SC1=CC=CC=N1 WHMDPDGBKYUEMW-UHFFFAOYSA-N 0.000 description 1
- 150000003230 pyrimidines Chemical class 0.000 description 1
- 125000001453 quaternary ammonium group Chemical group 0.000 description 1
- 239000002901 radioactive waste Substances 0.000 description 1
- 238000003608 radiolysis reaction Methods 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000004007 reversed phase HPLC Methods 0.000 description 1
- 238000002390 rotary evaporation Methods 0.000 description 1
- 239000012488 sample solution Substances 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 238000007086 side reaction Methods 0.000 description 1
- 229910001961 silver nitrate Inorganic materials 0.000 description 1
- PCMORTLOPMLEFB-ONEGZZNKSA-N sinapic acid Chemical compound COC1=CC(\C=C\C(O)=O)=CC(OC)=C1O PCMORTLOPMLEFB-ONEGZZNKSA-N 0.000 description 1
- PCMORTLOPMLEFB-UHFFFAOYSA-N sinapinic acid Natural products COC1=CC(C=CC(O)=O)=CC(OC)=C1O PCMORTLOPMLEFB-UHFFFAOYSA-N 0.000 description 1
- 108010062513 snake venom phosphodiesterase I Proteins 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 235000017557 sodium bicarbonate Nutrition 0.000 description 1
- 229910052938 sodium sulfate Inorganic materials 0.000 description 1
- 235000011152 sodium sulphate Nutrition 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 239000011877 solvent mixture Substances 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 239000010959 steel Substances 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 238000005987 sulfurization reaction Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 150000005691 triesters Chemical class 0.000 description 1
- ZIBGPFATKBEMQZ-UHFFFAOYSA-N triethylene glycol Chemical class OCCOCCOCCO ZIBGPFATKBEMQZ-UHFFFAOYSA-N 0.000 description 1
- JLGLQAWTXXGVEM-UHFFFAOYSA-N triethylene glycol monomethyl ether Chemical compound COCCOCCOCCO JLGLQAWTXXGVEM-UHFFFAOYSA-N 0.000 description 1
- 125000002023 trifluoromethyl group Chemical group FC(F)(F)* 0.000 description 1
- 238000005866 tritylation reaction Methods 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
- 238000009736 wetting Methods 0.000 description 1
Definitions
- DNA sequencing is one of the most fundamental technologies in molecular biology and the life sciences in general. The ease and the rate by which DNA sequences can be obtained greatly affects related technologies such as development and production of new therapeutic agents and new and useful varieties of plants and microorganisms via recombinant DNA technology. In particular, unraveling the DNA sequence helps in understanding human pathological conditions including genetic disorders, cancer and AIDS.
- DNA sequencing is performed by either the chemical degradation method of Maxam and Gilbert (Methods in Enzvmology 6_5_, 499-560 (1980)) or the enzymatic dideoxynucleotide termination method of Sanger et al. (Proc. Natl. Acad. Sci. USA 74. 5463-67 (1977)).
- base specific modifications result in a base specific cleavage of the radioactive or fluorescently labeled DNA fragment
- four sets of nested fragments are produced which are separated according to length by polyacrylamide gel electrophoresis (PAGE). After autoradiography, the sequence can be read directly since each band (fragment) in the gel originates from a base specific cleavage event. Thus, the fragment lengths in the four "ladders” directly translate into a specific position in the DNA sequence.
- the four base specific sets of DNA fragments are formed by starting with a primer/template system elongating the primer into the unknown DNA sequence area and thereby copying the template and synthesizing a complementary strand by DNA polymerases, such as Klenow fragment of E. coli DNA polymerase I, a DNA polymerase from Thermus aquaticus, Taq DNA polymerase, or a modified T7 DNA polymerase, Sequenase (Tabor et al., Proc. Natl. Acad. Sci. USA 84, 4767-4771 (1987)), in the presence of chain-terminating reagents.
- DNA polymerases such as Klenow fragment of E. coli DNA polymerase I, a DNA polymerase from Thermus aquaticus, Taq DNA polymerase, or a modified T7 DNA polymerase, Sequenase (Tabor et al., Proc. Natl. Acad. Sci. USA 84, 4767-4771 (1987)
- the chain-terminating event is achieved by incorporating into the four separate reaction mixtures in addition to the four normal deoxynucleoside triphosphates, dATP, dGTP, dTTP and dCTP, only one of the chain-terminating dideoxynucleoside triphosphates, ddATP, ddGTP, ddTTP or ddCTP, respectively, in a limiting small concentration.
- the four sets of resulting fragments produce, after electrophoresis, four base specific ladders from which the DNA sequence can be determined.
- a recent modification of the Sanger sequencing strategy involves the degradation of phosphorothioate-containing DNA fragments obtained by using alpha-thio dNTP instead of the normally used ddNTPs during the primer extension reaction mediated by DNA polymerase (Labeit et ⁇ /.. DNA 5 I 173-177 (1986); Amersham, PCT- Application GB86/00349; Eckstein et al., Nucleic Acids Res. 16 9947 (1988)).
- the four sets of base-specific sequencing ladders are obtained by limited digestion with exonuclease III or snake venom phosphodiesterase, subsequent separation on PAGE and visualization by radioisotopic labeling of either the primer or one of the dNTPs.
- the base-specific cleavage is achieved by alkylating the sulphur atom in the modified phosphodiester bond followed by a heat treatment (Max-Planck-technik, DE 3930312 Al). Both methods can be combined with the amplification of the DNA via the Polymerase Chain Reaction (PCR). On the upfront end, the DNA to be sequenced has to be fragmented into sequencable pieces of currently not more than 500 to 1000 nucleotides.
- this is a multi-step process involving cloning and subcloning steps using different and appropriate cloning vectors such as YAC, cosmids, plasmids and Ml 3 vectors (Sambrook et al, Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, 1989).
- the fragments of about 500 to 1000 base pairs are integrated into a specific restriction site of the replicative form I (RF I) of a derivative of the M13 bacteriophage (Vieria and Messing, Gene 19, 259 (1982)) and then the double-stranded form is transformed to the single-stranded circular form to serve as a template for the Sanger sequencing process having a binding site for a universal primer obtained by chemical DNA synthesis (Sinha, Biernat, McManus and Koster. Nucleic Acids Res. 12. 4539-57 (1984); U.S. Patent No. 4725677 upstream of the restriction site into which the unknown DNA fragment has been inserted.
- RF I replicative form I
- the DNA sequence in the interested region most be known at least to the extent to bind a sequencing primer.
- detectable labels have to be used in either the primer (very often at the 5 '-end) or in one of the deoxynucleoside triphosphates, dNTP.
- radioisotopes such as 32 P, 33 P or 35 S is still the most frequently used technique. After PAGE, the gels are exposed to X-ray films and silver grain exposure is analyzed. The use of radioisotopic labeling creates several problems.
- DNA using chemiluminescence triggerable and amplifyable by enzymes have been developed (Beck, O'Keefe, Coull and K ⁇ ster. Nucleic Acids Res. 17. 5115-5123 (1989) and Beck and K ⁇ ster, Anal. Chem. 62, 2258-2270 (1990)). These labeling methods were combined with multiplex DNA sequencing (Church et al. Science 240, 185-188 (1988) to provide for a strategy aimed at high throughput DNA sequencing (K ⁇ ster et al ,
- the primer extension products synthesized on the immobilized template strand are purified of enzymes, other sequencing reagents and by-products by a washing step and then released under denaturing conditions by loosing the hydrogen bonds between the Watson-Crick base pairs and subjected to PAGE separation.
- the primer extension products (not the template) from a DNA sequencing reaction are bound to a solid support via biotin/avidin (Du Pont De Nemours, PCT Application WO 91/11533).
- biotin/avidin Du Pont De Nemours, PCT Application WO 91/11533
- the interaction between biotin and avidin is overcome by employing denaturing conditions (formamide/EDTA) to release the primer extension products of the sequencing reaction from the solid support for PAGE separation.
- beads e.g., magnetic beads (Dynabeads) and Sepharose beads
- filters e.g., glass beads
- capillaries e.g., glass beads
- plastic dipsticks e.g., polystyrene strips
- microtiter wells e.g., microtiter wells
- PAGE polyacrylamide gel electrophoresis
- CZE capillary zone electrophoresis
- hybridization or fragmentation sequencing (Bains, Biotechnology 10, 757-58 ( 1992) and Mirzabekov et al , FEBS Letters 256 : 1 18- 122 ( 1989)) utilizing the specific hybridization of known short oligonucleotides (e.g., octadeoxynucleotides which gives 65,536 different sequences) to a complementary DNA sequence. Positive hybridization reveals a short stretch of the unknown sequence. Repeating this process by performing hybridizations with all possible octadeoxynucleotides should theoretically determine the sequence.
- known short oligonucleotides e.g., octadeoxynucleotides which gives 65,536 different sequences
- the enzymes used and the DNA are held in place by solid phases (DEAE-Sepharose and Sepharose) either by ionic interactions or by covalent attachment.
- the amount of pyrophosphate is determined via bioluminescence (luciferase).
- a synthesis approach to DNA sequencing is also used by Tsien et al (PCT Application No. WO 91/06678).
- the incoming dNTP's are protected at the 3'-end by various blocking groups such as acetyl or phosphate groups and are removed before the next elongation step, which makes this process very slow compared to standard sequencing methods.
- the template DNA is immobilized on a polymer support.
- a fluorescent or radioactive label is additionally incorporated into the modified dNTP's.
- the same patent application also describes an apparatus designed to automate the process.
- Mass spectrometry in general, provides a means of "weighing" individual molecules by ionizing the molecules in vacuo and making them “fly” by volatilization. Under the influence of combinations of electric and magnetic fields, the ions follow trajectories depending on their individual mass (m) and charge (z). In the range of molecules with low molecular weight, mass spectrometry has long been part of the routine physical-organic repertoire for analysis and characterization of organic molecules by the determination of the mass of the parent molecular ion.
- MALDI mass spectrometry in contrast, can be particularly attractive when a time-of-flight (TOF) configuration is used as a mass analyzer.
- TOF time-of-flight
- the MALDI-TOF mass spectrometry has been introduced by Hillenkamp et al ("Matrix Assisted UV-Laser Desorption/ionization: A New Approach to Mass Spectrometry of Large Biomolecules," Biological Mass Spectrometry (Burlingame and McCloskey, editors), Elsevier Science Publishers, Amsterdam, pp. 49-60, 1990.) Since, in most cases, no multiple molecular ion peaks are produced with this technique, the mass spectra, in principle, look simpler compared to ES mass spectrometry.
- NTP's, dNTP's and, as terminating nucleotides, ddNTP's which are substituted at the 5'- position of the sugar moiety with one or a combination of the isotopes The polynucleotides obtained are degraded to 3'- nucleotides, cleaved at the N-glycosidic linkage and the isotopically labeled 5'- functionality removed by periodate oxidation and the resulting formaldehyde species determined by mass spectrometry.
- a specific combination of isotopes serves to discriminate base-specifically between internal nucleotides originating from the incorporation of NTP's and dNTP's and terminal nucleotides caused by linking ddNTP's to the end of the polynucleotide chain.
- a series of RNA/DNA fragments is produced, and in one embodiment, separated by electrophoresis, and, with the aid of the so-called matrix method of analysis, the sequence is deduced.
- the sulfur isotopes can be located either in the base or at the alpha-position of the triphosphate moiety whereas the halogen isotopes are located either at the base or at the 3'-position of the sugar ring
- the sequencing reaction mixtures are separated by an electrophoretic technique such as
- the SO2 generated with masses of 64, 65, 66 or 68 is determined on-line by mass spectrometry using, e.g., as mass analyzer, a quadrupole with a single ion-multiplier to detect the ion current.
- EPO Patent Applications No. 0360676 Al and 0360677 Al also describe Sanger sequencing using stable isotope substitutions in the ddNTP's such as D, ⁇ . , or functional groups such as CF3 or Si(CH3)3 at the base, the sugar or the alpha position of the triphosphate moiety according to chemical functionality.
- the Sanger sequencing reaction mixtures are separated by tube gel electrophoresis.
- the effluent is converted into an aerosol by the electrospray/thermospray nebulizer method and then atomized and ionized by a hot plasma (7000 to 8000 K) and analyzed by a simple mass analyzer.
- An instrument is proposed which enables one to automate the analysis of the Sanger sequencing reaction mixture consisting of tube electrophoresis, a nebulizer and a mass analyzer.
- the invention describes a new method to sequence DNA.
- the improvements over the existing DNA sequencing technologies include high speed, high throughput, no required electrophoresis (and, thus, no gel reading artifacts due to the complete absence of an electrophoretic step), and no costly reagents involving various substitutions with stable isotopes.
- the invention utilizes the Sanger sequencing strategy and assembles the sequence information by analysis of the nested fragments obtained by base-specific chain termination via their different molecular masses using mass spectrometry, for example, MALDI or ES mass spectrometry.
- a further increase in throughput can be obtained by introducing mass modifications in the oligonucleotide primer, the chain-terminating nucleoside triphosphates and/or the chain-elongating nucleoside triphosphates, as well as using integrated tag sequences which allow multiplexing by hybridization of tag specific probes with mass differentiated molecular weights.
- FIGURE 1 is a representation of a process to generate the samples to be analyzed by mass spectrometry.
- This process entails insertion of a DNA fragment of unknown sequence into a cloning vector such as derivatives of M13, pUC or phagemids; transforming the double-stranded form into the single-stranded form; performing the four Sanger sequencing reactions; linking the base-specifically terminated nested fragment family temporarily to a solid support; removing by a washing step all by-products; conditioning the nested DNA or RNA fragments by, for example, cation-ion exchange or modification reagent and presenting the immobilized nested fragments either directly to mass spectrometric analysis or cleaving the purified fragment family off the support and evaporating the cleavage reagent.
- a cloning vector such as derivatives of M13, pUC or phagemids
- FIGURE 2A shows the Sanger sequencing products using ddTTP as terminating deoxynucleoside triphosphate of a hypothetical DNA fragment of 50 nucleotides (SEQ LD NO:3) in length with approximately equally balanced base composition. The molecular masses of the various chain terminated fragments are given.
- FIGURE 2B shows an idealized mass spectrum of such a DNA fragment mixture.
- FIGURES 3A and 3B show, in analogy to FIGURES 2A and 2B, data for the same model sequence (SEQ ID NO:3) with ddATP as chain terminator.
- FIGURES 4A and 4B show data, analogous to FIGURES 2A and 2B when ddGTP is used as a chain terminator for the same model sequence (SEQ L ⁇ NO:3).
- FIGURES 5A and 5B illustrate the results obtained where chain termination is performed with ddCTP as a chain terminator, in a similar way as shown in FIGURES 2 A and 2B for the same model sequence (SEQ LD NO:3).
- FIGURE 6 summarizes the results of FIGURES 2A to 5B, showing the correlation of molecular weights of the nested four fragment families to the DNA sequence (SEQ ID NO:3).
- FIGURE 7 illustrates the general structure of mass-modified sequencing nucleic acid primers or tag sequencing probes for either Sanger DNA or Sanger RNA sequencing.
- FIGURE 8 shows the general structure for the mass-modified triphosphates for either Sanger DNA or Sanger RNA sequencing. General formulas of the chain-elongating and the chain-terminating nucleoside triphosphates are demonstrated.
- FIGURE 9 outlines various linking chemistries (X) with either polyethylene glycol or terminally monoalkylated polyethylene glycol (R) as an example.
- FIGURE 10 illustrates similar linking chemistries as shown in FIGURE 8 and depicts various mass modifying moieties (R).
- FIGURE 1 1 outlines how multiplex mass spectrometric sequencing can work using the mass-modified nucleic acid primer (UP).
- FIGURE 12 shows the process of multiplex mass spectrometric sequencing employing mass-modified chain-elongating and/or terminating nucleoside triphosphates.
- FIGURE 13 shows multiplex mass spectrometric sequencing by involving the hybridization of mass-modified tag sequence specific probes.
- FIGURE 14 shows a MALDI-TOF spectrum of a mixture of oligothymidylic acids, d(pT) 12-I8
- FIGURE 15 shows a superposition of MALDI-TOF spectra of the 50-mer d(TAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGCT) (SEQ LD NO:3) (500 fmol) and dT(pdT) 99 (500 fmol).
- FIGURE 16 shows the MALDI-TOF spectra of all 13 DNA sequences representing the nested dT-terminated fragments of the Sanger DNA sequencing simulation of Figure 2, 500 fmol each.
- FIGURE 17 shows the superposition of the spectra of FIGURE 16. The two panels show two different scales and the spectra analyzed at that scale
- FIGURE 18 shows the superimposed MALDI-TOF spectra from MALDI- MS analysis of mass-modified oligonucleotides as described in Example 21.
- FIGURE 19 illustrates various linking chemistries between the solid support (P) and the nucleic acid primer (NA) through a strong electrostatic interaction.
- FIGURE 20 illustrates various linking chemistries between the solid support (P) and the nucleic acid primer (NA) through a charge transfer complex of a charge transfer acceptor (A) and a charge transfer donor (D).
- FIGURE 21 illustrates various linking chemistries between the solid support (P) and the nucleic acid primer (NA) through a stable organic radical
- FIGURE 22 illustrates a possible linking chemistry between the solid support (P) and the nucleic acid primer (NA) through Watson-Crick base pairing
- FIGURE 23 illustrates linking the solid support (P) and the nucleic acid primer (NA) through a photo lytically cleavable bond.
- FIGURE 24 shows the portion of the sequence of pRFcl DNA, which was used as template for PCR amplification of unmodified and 7-deazapurine containing 99-mer and 200-mer nucleic acids as well as the sequences of the 19-mer primers and the two 18-mer reverse primers.
- FIGURE 25 shows the portion of the nucleotide sequence of M13mpl8 RFI DNA which was used for PCR amplification of unmodified and 7-deazapurine containing 103-mer nucleic acids. Also shown are nucleotide sequences of the 17-mer primers used in the PCR.
- FIGURE 26 shows the result of a polyacrylamide gel electrophoresis of PCR products purified and concentrated for MALDI-TOF MS analysis.
- M chain length marker
- lane 1 7-deazapurine containing 99-mer PCR product
- lane 2 unmodified 99- mer
- lane 3 7-deazapurine containing 103-mer
- lane 4 unmodified 103-mer PCR product.
- FIGURE 27 an autoradiogram of polyacrylamide gel electrophoresis of
- Lanes 1 and 2 unmodified and 7 -deazapurine modified 103-mer PCR product (53321 and 23520 counts)
- lanes 3 and 4 unmodified and 7-deazapurine modified 200-mer (71123 and 39582 counts)
- lanes 5 and 6 unmodified and 7-deazapurine modified 99-mer (173216 and 94400 counts).
- FIGURE 28 a) MALDI-TOF mass spectrum of the unmodified 103-mer PCR products (sum of twelve single shot spectra). The mean value of the masses calculated for the two single strands (31768 u and 31759 u) is 31763 u. Mass resolution: 18. b) MALDI-TOF mass spectrum of 7-deazapurine containing 103-mer PCR product
- FIGURE 29 a) MALDI-TOF mass spectrum of the unmodified 99-mer PCR product (sum of twenty single shot spectra). Values of the masses calculated for the two single strands: 30261 u and 30794 u. b) MALDI-TOF mass spectrum of the 7- deazapurine containing 99-mer PCR product (sum of twelve single shot spectra). Values of the masses calculated for the two single strands: 30224 u and 30750 u.
- FIGURE 30 a) MALDI-TOF mass spectrum of the unmodified 200-mer PCR product (sum of 30 single shot spectra). The mean value of the masses calculated for the two single strands (61873 u and 61595 u) is 61734 u. Mass resolution: 28. b)
- MALDI-TOF mass spectrum of 7-deazapurine containing 200-mer PCR product (sum of 30 single shot spectra). The mean value of the masses calculated for the two single strands (61772 u and 61514 u) is 61643 u. Mass resolution: 39.
- FIGURE 31 a) MALDI-TOF mass spectrum of 7-deazapurine containing 100-mer PCR product with ribomodified primers. The mean value of the masses calculated for the two single strands (30529 u and 31095 u) is 30812 u. b) MALDI-TOF mass spectrum of the PCR-product after hydrolytic primer-cleavage. The mean value of the masses calculated for the two single strands (25104 u and 25229 u) is 25167 u. The mean value of the cleaved primers (5437 u and 5918 u) is 5677 u.
- FIGURE 32 A-D shows the MALDI-TOF mass spectrum of the four sequencing ladders obtained from a 39 -mer template (SEQ. LD. No. 13), which was immobilized to streptavidin beads via a 3' biotinylation.
- a 14-mer primer (SEQ. ID. NO. 14) was used in the sequencing.
- FIGURE 33 shows a MALDI-TOF mass spectrum of a solid state sequencing of a 78-mer template (SEQ. JJD. No. 15), which was immobilized to streptavidin beads via a 3' biotinylation.
- a 18-mer primer (SEQ LD No. 16) and ddGTP were used in the sequencing.
- FIGURE 34 shows a scheme in which duplex DNA probes with single- stranded overhang capture specific DNA templates and also serve as primers for solid state sequencing.
- FIGURE 35 A-D shows MALDI-TOF mass spectra obtained from a 5' fluorescent labeled 23-mer (SEQ. LD. No. 19) annealed to an 3' biotinylated 18-mer (SEQ. LD. No. 20), leaving a 5-base overhang, which captured a 15-mer template (SEQ. LD. No. 21).
- FIGURE 36 shows a stacking flurogram of the same products obtained from the reaction described in FIGURE 35, but run on a conventional DNA sequencer.
- This invention describes an improved method of sequencing DNA.
- this invention employs mass spectrometry to analyze the Sanger sequencing reaction mixtures.
- the DNA sequence can be assigned via superposition (e.g., interpolation) of the molecular weight peaks of the four individual experiments.
- the molecular weights of the four specifically terminated fragment families can be determined simultaneously by MS, either by mixing the products of all four reactions run in at least two separate reaction vessels (i.e., all run separately, or two together, or three together) or by running one reaction having all four chain-terminating nucleotides (e.g., a reaction mixture comprising dTTP, ddTTP, dATP, ddATP, dCTP, ddCTP, dGTP, ddGTP) in one reaction vessel.
- the molecular weight values have been, in effect, interpolated. Comparison of the mass difference measured between fragments with the known masses of each chain-terminating nucleotide allows the assignment of sequence to be carried out. In some instances, it may be desirable to mass modify, as discussed below, the chain-terminating nucleotides so as to expand the difference in molecular weight between each nucleotide. It will be apparent to those skilled in the art when mass-modification of the chain-terminating nucleotides is desirable and can depend, for instance, on the resolving ability of the particular spectrometer employed. By way of example, it may be desirable to produce four chain-
- chain-elongating nucleotides and chain-terminating nucleotides are well known in the art.
- chain-elongating nucleotides include 2'-deoxyribonucleotides and chain-terminating nucleotides include 2', 3'-dideoxyribonucleotides.
- chain-elongating nucleotides include ribonucelotides and chain-terminating nucleotides include 3'-deoxyribonucleotides.
- nucleotide is also well known in the art.
- nucleotides include nucleoside mono-, di-, and triphosphates. Nucleotides also include modified nucleotides such as phosphorothioate nucleotides.
- mass spectrometry is a serial method, in contrast to currently used slab gel electrophoresis which allows several samples to be processed in parallel
- a further improvement can be achieved by multiplex mass spectrometric DNA sequencing to allow simultaneous sequencing of more than one DNA or RNA fragment.
- the range of about 300 mass units between one nucleotide addition can be utilized by employing either mass- modified nucleic acid sequencing primers or chain-elongating and/or terminating nucleoside triphosphates so as to shift the molecular weight of the base-specifically terminated fragments of a particular DNA or RNA species being sequenced in a predetermined manner.
- several sequencing reactions can be mass spectrometrically analyzed in parallel.
- multiplex mass spectrometric DNA sequencing can be performed by mass modifying the fragment families through specific oligonucleotides (tag probes) which hybridize to specific tag sequences within each of the fragment families.
- tag probe can be covalently attached to the individual and specific tag sequence prior to mass spectrometry.
- Preferred mass spectrometer formats for use in the invention are matrix assisted laser desorption ionization (MALDI), electrospray (ES), ion cyclotron resonance (ICR) and Fourier Transform.
- MALDI matrix assisted laser desorption ionization
- ES electrospray
- ICR ion cyclotron resonance
- ABI atmospheric pressure ionization interface
- MS/MS quadrupole configuration In MALDI mass spectrometry, various mass analyzers can be used, e g , magnetic sector/magnetic deflection instruments in single or triple quadrupole mode (MS/MS), Fourier transform and time-of-flight (TOF) configurations as is known in the art of mass spectrometry. For the desorption/ionization process, numerous matrix/laser combinations can be used. Ion-trap and reflectron configurations can also be employed. In one embodiment of the invention, the molecular weight values of at least two base-specifically terminated fragments are determined concurrently using mass spectrometry.
- the molecular weight values of preferably at least five and more preferably at least ten base-specifically terminated fragments are determined by mass spectrometry. Also included in the invention are determinations of the molecular weight values of at least 20 base-specifically terminated fragments and at least 30 base- specifically terminated fragments. Further, the nested base-specifically terminated fragments in a specific set can be purified of all reactants and by-products but are not separated from one another. The entire set of nested base-specifically terminated fragments is analyzed concurrently and the molecular weight values are determined. At least two base-specifically terminated fragments are analyzed concurrently by mass spectrometry when the fragments are contained in the same sample.
- the overall mass spectrometric DNA sequencing process will start with a library of small genomic fragments obtained after first randomly or specifically cutting the genomic DNA into large pieces which then, in several subcloning steps, are reduced in size and inserted into vectors like derivatives of Ml 3 or pUC (e.g., M 13 mp 18 or M 13 mp 19) (see FIGURE 1 ).
- the fragments inserted in vectors, such as Ml 3 are obtained via subcloning starting with a cDNA library.
- the DNA fragments to be sequenced are generated by the polymerase chain reaction (e.g., Higuchi et al, "A General Method of in vitro Preparation and Mutagenesis of DNA Fragments: Study of Protein and DNA Interactions," Nucleic Acids Res. ⁇ 16, 7351-67 (1988)).
- Sanger sequencing can start from one nucleic acid primer (UP) binding to the plus-strand or from another nucleic acid primer binding to the opposite minus- strand.
- either the complementary sequence of both strands of a given unknown DNA sequence can be obtained (providing for reduction of ambiguity in the sequence determination) or the length of the sequence information obtainable from one clone can be extended by generating sequence information from both ends of the unknown vector-inserted DNA fragment.
- the nucleic acid primer carries, preferentially at the 5'-end, a linking functionality, L, which can include a spacer of sufficient length and which can interact with a suitable functionality, L', on a solid support to form a reversible linkage such as a photocleavable bond. Since each of the four Sanger sequencing families starts with a nucleic acid primer (L-UP; FIGURE 1) this fragment family can be bound to the solid support by reacting with functional groups, L', on the surface of a solid support and then intensively washed to remove all buffer salts, triphosphates, enzymes, reaction by- products, etc.
- L-UP nucleic acid primer
- the temporary linkage can be such that it is cleaved under the conditions of mass spectrometry, i.e., a photocleavable bond such as a charge transfer complex or a stable organic radical.
- the linkage can be formed with L' being a quaternary ammonium group (some examples are given in FIGURE 19).
- the surface of the solid support carries negative charges which repel the negatively charged nucleic acid backbone and thus facilitates desorption.
- Desorption will take place either by the heat created by the laser pulse and/or, depending on L,' by specific absorption of laser energy which is in resonance with the L' chromophore (see, e.g., examples given in FIGURE 19).
- the functionalities, L and L,' can also form a charge transfer complex and thereby form the temporary L-L' linkage.
- Various examples for appropriate functionalities with either acceptor or donator properties are depicted without limitation in FIGURE 20. Since in many cases the "charge- transfer band" can be determined by UV/vis spectrometry (see e.g. Organic Charge Transfer Complexes by R.
- the laser energy can be tuned to the corresponding energy of the charge-transfer wavelength and, thus, a specific desorption off the solid support can be initiated.
- the donor functionality can be either on the solid support or coupled to the nested Sanger DNA/RNA fragments or vice versa.
- the temporary linkage L-L' can be generated by homolytically forming relatively stable radicals as exemplified in FIGURE 21.
- FIGURE 21 a combination of the approaches using charge-transfer complexes and stable organic radicals is shown.
- the nested Sanger DNA/RNA fragments are captured via the formation of a charge transfer complex.
- the nested Sanger DNA/RNA fragments are captured via Watson-Crick base pairing to a solid support- bound oligonucleotide complementary to either the sequence of the nucleic acid primer or the tag oligonucleotide sequence (see FIGURE 22).
- the duplex formed will be cleaved under the influence of the laser pulse and desorption can be initiated.
- the solid support- bound base sequence can be presented through natural oligoribo- or oligodeoxyribonucleotide as well as analogs (e.g. thio-modified phosphodiester or phosphotriester backbone) or employing oligonucleotide mimetics such as PNA analogs (see e.g. Nielsen et al, Science.
- nucleic acids can be "conditioned" by adding positive or negative charges, i.e. charge tags (CTs). CTs increase the mass spectrometer detection sensitivity by increasing the degree of ionization during the mass spectrometric
- a CT can be linked either to the external 3' or 5' position or internally e.g. at the 2' position or at the base, e.g. at C-5 in uracil, C-5 methylgroup of thymine, C-5 at cytosine, at C 7 or C* of guanine, adenine and hypoxanthine or at the phosphate ester moiety.
- Charge tags, CTs can function molecules with permanent (i.e. pH-independent) ionization, such as:
- the trityl group is used to anchor the oligonucleotide to a solid support via the tertiary carbon and this bond is cleaved during mass spectrometry (e.g. MALDI), leaving a positive charge on the desorbing and high
- a charge tag array in conjunction with another conditioning means.
- Particularly preferred means to be used in conjunction with the CT include treating the phosphodiester bond with trialkylsilyl halides or the phosphomonothiodiester bond with alkyliodides to render the polyanionic backbone neutral.
- Another example of conditioning is modification of the phosphodiester backbone of the nucleic acid molecule (e.g. cation exchange), which can be useful for eliminating peak broadening due to a heterogeneity in the cations bound per nucleotide unit.
- a nucleic acid molecule can be contacted with an alkylating agent such as alkyliodide, iodoacetamide, ⁇ -iodoethanol, or 2,3-epoxy-l-propanol, the monothio phosphodiester bonds of a nucleic acid molecule can be transformed into a phosphotriester bond. Likewise, phosphodiester bonds may be transformed to uncharged derivatives employing trialkylsilyl chlorides.
- alkylating agent such as alkyliodide, iodoacetamide, ⁇ -iodoethanol, or 2,3-epoxy-l-propanol
- nucleotides which reduce sensitivity for depurination (fragmentation during MS) such as N7- or N9-deazapurine nucleotides, or RNA building blocks or using oligonucleotide triesters or incorporating phosphorothioate functions which are alkylated or employing oligonucleotide mimetics such as PNA
- Modification of the phosphodiester backbone can be accomplished by, for example, using alpha-thio modified nucleotides for chain elongation and termination.
- alkylating agents such as akyliodides, iodoacetamide, ⁇ -iodoethanol, 2,3-epoxy-l- propanol (see FIGURE 10)
- the monothio phosphodiester bonds of the nested Sanger fragments are transformed into phosphotriester bonds.
- Multiplexing by mass modification in this case is obtained by mass-modifying the nucleic acid primer (UP) or the nucleoside triphosphates at the sugar or the base moiety.
- UP nucleic acid primer
- nucleoside triphosphates at the sugar or the base moiety.
- the linking chemistry allows one to cleave off the so- purified nested DNA enzymatically, chemically or physically.
- the L- L' chemistry can be of a type of disulfide bond (chemically cleavable, for example, by mercaptoethanol or dithioerythrol), a biotin/streptavidin system, a heterobifunctional derivative of a trityl ether group (K ⁇ ster et al, "A Versatile Acid-Labile Linker for
- the purification process and/or ion exchange process can be carried out by a number of other methods instead of, or in conjunction with, immobilization on a solid support.
- the base-specifically terminated products can be separated from the reactants by dialysis, filtration (including ultrafiltration), and chromatography.
- these techniques can be used to exchange the cation of the phosphate backbone with a counter-ion which reduces peak broadening.
- the base-specifically terminated fragment families can be generated by standard Sanger sequencing using the Large Klenow fragment of E. coli DNA polymerase I, by Sequenase, Taq DNA polymerase and other DNA polymerases suitable for this purpose, thus generating nested DNA fragments for the mass spectrometric analysis.
- RNA polymerases such as the SP6 or the T7 RNA polymerase can be used on appropriate vectors containing, for example, the SP6 or the T7 promoters (e.g. Axelrod et al, "Transcription from Bacteriophage T7 and SP6 RNA Polymerase Promoters in the Presence of 3'- Deoxyribonucleoside 5'-triphosphate Chain Terminators," Biochemistry 24, 5716-23 (1985)).
- the unknown DNA sequence fragments are inserted downstream from such promoters.
- nucleic acid primer Pitulle et al, "Initiator Oligonucleotides for the Combination of Chemical and Enzymatic RNA Synthesis," Gene 1 12. 101-105 (1992)
- L linking functionalities
- various solid supports can be used, e.g., beads (silica gel, controlled pore glass, magnetic beads, Sephadex/Sepharose beads, cellulose beads, etc.), capillaries, glass fiber filters, glass surfaces, metal surfaces or plastic material.
- useful plastic materials include membranes in filter or microtiter plate formats, the latter allowing the automation of the purification process by employing microtiter plates which, as one embodiment of the invention, carry a permeable membrane in the bottom of the well functionalized with L'.
- Membranes can be based on polyethylene, polypropylene, polyamide, polyvinylidenedifluoride and the like.
- suitable metal surfaces include steel, gold, silver, aluminum, and copper.
- purification, cation exchange, and/or modification of the phosphodiester backbone of the L-L' bound nested Sanger fragments they can be cleaved off the solid support chemically, enzymatically or physically.
- the L-L' bound fragments can be cleaved from the support when they are subjected to mass spectrometric analysis by using appropriately chosen L-L' linkages and corresponding laser energies/intensities as described above and in FIGURES 19-23
- the highly purified, four base-specifically terminated DNA or RNA fragment families are then analyzed with regard to their fragment lengths via determination of their respective molecular weights by MALDI or ES mass spectrometry.
- the samples dissolved in water or in a volatile buffer, are injected either continuously or discontinuously into an atmospheric pressure ionization interface (API) and then mass analyzed by a quadrupole.
- API atmospheric pressure ionization interface
- the molecular weight peaks are searched for the known molecular weight of the nucleic acid primer (UP) and determined which of the four chain-terminating nucleotides has been added to the UP. This represents the first nucleotide of the unknown sequence.
- the second, the third, the n extension product can be identified in a similar manner and, by this, the nucleotide sequence is assigned.
- the generation of multiple ion peaks which can be obtained using ES mass spectrometry can increase the accuracy of the mass determination.
- various mass analyzers can be used, e.g., magnetic sector/magnetic deflection instruments in single or triple quadrupole mode (MS/MS), Fourier transform and time-of-flight (TOF) configurations as is known in the art of mass spectrometry.
- FIGURES 2A through 6 are given as an example of the data obtainable when sequencing a hypothetical DNA fragment of 50 nucleotides in length (SEQ ID NO:3) and having a molecular weight of 15,344.02 daltons.
- the molecular weights calculated for the ddT (FIGURES 2A and 2B), ddA (FIGURES 3A and 3B), ddG (FIGURES 4A and 4B) and ddC (FIGURES 5A and 5B) terminated products are given (corresponding to fragments of SEQ LD NO.3) and the idealized four MALDI-TOF mass spectra shown. All four spectra are superimposed, and from this, the DNA sequence can be generated.
- nucleic acid primer as used herein encompasses primers for both DNA and RNA Sanger sequencing.
- FIGURE 7 presents a general formula of the nucleic acid primer (UP) and the tag probes (TP).
- the mass modifying moiety can be attached, for instance, to either the 5'-end of the oligonucleotide (M ), to the nucleobase (or bases)
- Primer length can vary between 1 and 50 nucleotides in length.
- the primer is preferentially in the range of about 15 to 30 nucleotides in length.
- the length of the primer is preferentially in the range of about 2 to 6 nucleotides. If a tag probe (TP) is to hybridize to the integrated tag sequence of a family chain- terminated fragments, its preferential length is about 20 nucleotides.
- the table in FIGURE 7 depicts some examples of mass-modified primer/tag probe configurations for DNA, as well as RNA, Sanger sequencing. This list is, however, not meant to be limiting, since numerous other combinations of mass-modifying functions and positions within the oligonucleotide molecule are possible and are deemed part of the invention.
- the mass-modifying functionality can be, for example, a halogen, an azido, or of the type, XR, wherein X is a linking group and R is a mass-modifying functionality.
- the mass-modifying functionality can thus be used to introduce defined mass increments into the oligonucleotide molecule.
- nucleotides used for chain-elongation and/or termination are mass-modified. Examples of such modified nucleotides are shown in FIGURE 8. Here the mass-modifying moiety, M, can be attached either to the
- the mass-modifying functionality can be added so as to affect chain termination, such as by attaching it to the 3 '-position of the sugar ring in the nucleoside triphosphate, M 5 .
- the list in FIGURE 8 represents examples of possible configurations for generating chain-terminating nucleoside triphosphates for
- FIGURE 9 gives a more detailed description of particular examples of how the mass-modification, M, can be introduced for X in XR as well as using oligo-/polyethylene glycol derivatives for R.
- the oligo/polyethylene glycols can also be monoalkylated by a lower alkyl such as methyl, ethyl, propyl, isopropyl, t- butyl and the like.
- a selection of linking functionalities, X are also illustrated.
- Other chemistries can be used in the mass-modified compounds, as for example, those described recently in Oligonucleotides and Analogues. A Practical Approach. F. Eckstein, editor, LRL Press, Oxford, 1991.
- various mass-modifying functionalities, R can be selected and attached via appropriate linking chemistries, X.
- suitable linking chemistries, X can be selected and attached via appropriate linking chemistries, X.
- FIGURE 10 A simple mass-modification can be achieved by substituting H for halogens like F, Cl, Br and/or I, or pseudohalogens such as SCN, NCS, or by using different alkyl, aryl or aralkyl moieties such as methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, phenyl, substituted phenyl, benzyl, or functional groups such as CH2F, CHF2, CF3, Si(CH 3 )3, Si(CH3) 2 (C 2 H 5 ), Si(CH3)(C 2 H 5 ) 2 , Si(C 2 H 5 ) 3 .
- halogens like F, Cl, Br and/or I, or pseudohalogens such as SCN
- mass-modification can be obtained by attaching homo- or heteropeptides through X to the UP, TP or nucleoside triphosphates.
- the superscript 0-i designates i + 1 mass differentiated nucleotides, primers or tags.
- the superscript 0 e.g., NTP , UP
- the superscript i e.g., NTP ,
- NTP 1 , NTP 2 , etc. can designate the i-th mass-modified species of that reactant. If, for example, more than one species of nucleic acids (e.g., DNA clones) are to be concurrently sequenced by multiplex DNA sequencing, then i + 1 different mass-modified nucleic acid primers (UP 0 , UP 1 ... UP i ) can be used to distinguish each set of base- specifically terminated fragments, wherein each species of mass-modified UP can be distinguished by mass spectrometry from the rest.
- i + 1 different mass-modified nucleic acid primers UP 0 , UP 1 ... UP i
- the first reaction mixture is obtained by standard Sanger DNA sequencing having unknown DNA fragment 1 (clone 1) integrated in an appropriate vector (e.g., M13mpl8), employing an unmodified nucleic acid primer UP , and a standard mixture of the four unmodified deoxynucleoside triphosphates, dNTP , and with l/10th of one of the four dideoxynucleoside triphosphates, ddNTP
- a second reaction mixture for DNA fragment 2 (clone 2) is obtained by employing a mass-modified nucleic acid primer UP and, as before, the four unmodified nucleoside triphosphates, dNTP , containing in each separate Sanger reaction l/10th of the chain-terminating unmodified dideoxynucleoside triphosphates ddNTP .
- an appropriate vector e.g., M13mpl8
- RNA polymerase e.g., SP6 or T7 RNA polymerase
- NTP and 3 '-dNTP the DNA sequence is being determined by Sanger RNA sequencing.
- FIGURE 12 illustrates the process of multiplexing by mass-modified chain- elongating or/and terminating nucleoside triphosphates in which three different DNA fragments (3 clones) are mass analyzed simultaneously.
- the first DNA Sanger sequencing reaction (DNA fragment 1, clone 1) is the standard mixture employing
- 0 0 1 0 0 2 clone 3 have the following contents: UP , dNTP , ddNTP and UP , dNTP , ddNTP
- an amplification of the mass increment in mass-modifying the extended DNA fragments can be achieved by either using an equally
- dNTP deoxynucleoside triphosphate
- dNTP deoxynucleoside triphosphate
- the contents of the reaction mixtures can be as follows: either UP°/dNTP 0 /ddNTP°, w ⁇ dNir ⁇ ddNTP 0 and UP°/dNTP 2 /ddNTP° or UP°/dNTP 0 /ddNTP°, UP°/dNTP * /ddNTP * and
- DNA sequencing can be performed by
- Sanger RNA sequencing employing unmodified nucleic acid primers, UP , and an appropriate mixture of chain-elongating and terminating nucleoside triphosphates.
- the mass-modification can be again either in the chain-terminating nucleoside triphosphate alone or in conjunction with mass-modified chain-elongating nucleoside triphosphates.
- Multiplexing is achieved by pooling the three base-specifically terminated sequencing reactions (e.g., the ddTTP terminated products) and simultaneously analyzing the pooled products by mass spectrometry.
- the first extension products of the known nucleic acid primer sequence are assigned, e.g., via a computer program. Mass/sequence assignments are possible even in the worst case in which the nucleic acid primer is extended/terminated by the same nucleotide, e.g., ddT, in all three clones.
- the following configurations thus obtained can be well differentiated by their different mass-
- DNA sequencing by multiplex mass spectrometry can be achieved by cloning the DNA fragments to be sequenced in "plex-vectors" containing vector specific "tag sequences" as described (K ⁇ ster et al,
- a further increase in multiplexing can be achieved by using, in addition to the tag probe/tag sequence interaction, mass-modified nucleic acid primers (FIGURE 7) and/or mass-modified deoxynucleoside, dNTP ' and/or dideoxynucleoside triphosphates, ddNTP .
- FOGURE 7 mass-modified nucleic acid primers
- dNTP ' and/or dideoxynucleoside triphosphates ddNTP .
- the tag sequence/tag probe multiplexing approach is not limited to Sanger DNA sequencing generating nested DNA fragments with DNA polymerases.
- the DNA sequence can also be determined by transcribing the unknown DNA sequence from appropriate promoter-containing vectors (see above) with various RNA polymerases and mixtures of NTP /3'-dNTP , thus generating nested RNA fragments.
- the mass-modifying functionality can be introduced by a two or multiple step process.
- kits for sequencing nucleic acids by mass spectrometry which include combinations of the above-described sequencing reactants.
- the kit comprises reactants for multiplex mass spectrometric sequencing of several different species of nucleic acid.
- the kit can include a solid support having a linking functionality (L ) for immobilization of the base- specifically terminated products; at least one nucleic acid primer having a linking group (L) for reversibly and temporarily linking the primer and solid support through, for example, a photocleavable bond; a set of chain-elongating nucleotides (e.g., dATP, dCTP, dGTP and dTTP, or ATP, CTP, GTP and UTP); a set of chain-terminating nucleotides (such as 2',3'-dideoxynucleotides for DNA synthesis or 3'-deoxynucleotides for RNA synthesis); and an appropriate polymerase for synthesizing complementary nucle
- Primers and/or terminating nucleotides can be mass-modified so that the base-specifically terminated fragments generated from one of the species of nucleic acids to be sequenced can be distinguished by mass spectrometry from all of the others
- a set of tag probes (as described above) can be included in the kit.
- the kit can also include appropriate buffers as well as instructions for performing multiplex mass spectrometry to concurrently sequence multiple species of nucleic acids.
- a nucleic acid sequencing kit can comprise a solid support as described above, a primer for initiating synthesis of complementary nucleic acid fragments, a set of chain-elongating nucleotides and an appropriate polymerase.
- the mass-modified chain-terminating nucleotides are selected so that the addition of one of the chain terminators to a growing complementary nucleic acid can be distinguished by mass spectrometry.
- the present invention is further illustrated by the following examples which should not be construed as limiting in any way.
- the contents of all cited references including literature references, issued patents, published patent applications (including international patent application Publication Number WO 94/16101, entitled “DNA Sequencing by Mass Spectrometry” by H. Koester; and international patent application Publication Number WO 94/21822 entitled “DNA Sequencing by Mass Spectrometry Via Exonuclease Degradation” by H. Koester), and co-pending patent applications, (including U.S Patent Application Serial No. 08/406,199, entitled “DNA Diagnostics Based on Mass Spectrometry” by H. Koester), as cited throughout this application are hereby expressly incorporated by reference.
- Sequelon membranes (Millipore Corp., Bedford, MA) with phenyl isothiocyanate groups are used as a starting material.
- the membrane disks with a diameter of 8 mm, are wetted with a solution of N-methylmorpholine/water/2- propanol (NMM solution) (2/49/49 v/v/v), the excess liquid removed with filter paper and placed on a piece of plastic film or aluminum foil located on a heating block set to 55 C.
- NMM solution N-methylmorpholine/water/2- propanol
- a solution of 1 mM 2-mercaptoethylamine (cysteamine) or 2, 2'-dithio- bis(ethylamine) (cystamine) or S-(2-thiopyridyl)-2-thio-ethylamine (10 ul, 10 nmol) in NMM is added per disk and heated at 55 C. After 15 min, 10 ul of NMM solution are added per disk and heated for another 5 min. Excess of isothiocyanate groups may be removed by treatment with 10 ul of a 10 mM solution of glycine in NMM solution.
- the disks are treated with 10 ul of a solution of 1M aqueous dithiothreitol (DTT)/2-propanol (1 :1 v/v) for 15 min at room temperature. Then, the disks are thoroughly washed in a filtration manifold with 5 aliquots of 1 ml each of the NMM solution, then with 5 aliquots of 1 ml acetonitrile/water (1/1 v/v) and subsequently dried.
- DTT dithiothreitol
- the disks are stored with free thiol groups in a solution of 1M aqueous dithiothreitol/2-propanol (1 : 1 v/v) and, before use, DTT is removed by three washings with 1 ml each of the NMM solution.
- the primer oligonucleotides with 5'-SH functionality can be prepared by various methods (e.g., B.C.F Chu et al, Nucleic Acids Res. 14. 5591-5603 (1986), Sproat et al. Nucleic Acids Res 15 4837-48 (1987) and Oligonucleotides and Analogues: A Practical Approach (F Eckstein, editor), LRL Press Oxford, 1991).
- Sequencing reactions according to the Sanger protocol are performed in a standard way (e.g., H. Swerdlow et al, Nucleic Acids Res. 18, 1415-19 (1990)).
- the free 5'-thiol primer can be used; in other cases, the SH functionality can be protected, e.g., by a trityl group during the Sanger sequencing reactions and removed prior to anchoring to the support in the following way.
- the four sequencing reactions (150 ul each in an Eppendorf tube) are terminated by a 10 min incubation at 70 C to denature the DNA polymerase (such as
- Klenow fragment, Sequenase and the reaction mixtures are ethanol precipitated.
- the supernatants are removed and the pellets vortexed with 25 ul of an 1M aqueous silver nitrate solution, and after one hour at room temperature, 50 ul of an 1 M aqueous solution of DTT is added and mixed by vortexing. After 15 min, the mixtures are centrifuged and the pellets are washed twice with 100 ul ethylacetate by vortexing and centrifugation to remove excess DTT.
- the primer extension products with free S'-thiol group are now coupled to the thiolated membrane supports under mild oxidizing conditions.
- the oligonucleotide primer is functionalized with an amino group at the 5'-end which is introduced by standard procedures during automated DNA synthesis.
- the primary amino group is reacted with 3-(2-pyridyldithio) propionic acid N-hydroxysuccinimide ester (SPDP) and subsequently coupled to the thiolated supports and monitored by the release of pyridyl-2-thione as described above.
- SPDP 3-(2-pyridyldithio) propionic acid N-hydroxysuccinimide ester
- the primer-extension products are purified by washing the membrane disks three times each with 100 ul NMM solution and three times with 100 ul each of 10 mM TEAA buffer pH 7.2.
- the purified primer-extension products are released by three successive treatments with 10 ul of 10 mM 2-mercaptoethanol in 10 mM TEAA buffer pH 7.2, lyophilized and analyzed by either ES or MALDI mass spectrometry.
- This procedure can also be used for the mass-modified nucleic acid primers UP in an analogous and appropriate way, taking into account the chemical properties of the mass-modifying functionalities.
- the four reaction mixtures (150 ul each in an Eppendorf tube) are heated to 70 C for 10 min to inactivate the DNA polymerase, ethanol precipitated, centrifuged and resuspended in 10 ul of 10 mM TEAA buffer pH 7.2. 10 ul of a 2 mM solution of the Fmoc-5-aminolevulinyI-NHS ester in 10 mM TEAA buffer is added, vortexed and incubated at 25 C for 30 min.
- the excess of the reagent is removed by ethanol precipitation and centrifugation
- the Fmoc group is cleaved off by resuspending the pellets in 10 ul of a solution of 20% piperidine in N,N-dimethylformamide/water (1 : 1 v/v). After 15 min at 25 C, piperidine is thoroughly removed by three precipitations/centrifugations with 100 ul each of ethanol, the pellets are resuspended in 10 ul of a solution of N-methylmorpholine, 2-propanol and water
- RNA extension products are immobilized in an analogous way. The procedure can be applied to other solid supports with isothiocyanate groups in a similar manner.
- the immobilized primer-extension products are extensively washed three times with 100 ul each of NMM solution and three times with 100 ul 10 mM TEAA buffer pH 7.2.
- the purified primer-extension products are released by three successive treatments with 10 ul of 100 mM hydrazinium acetate buffer pH 6.5, lyophilized and analyzed by either ES or MALDI mass spectrometry.
- Sequelon DITC membrane disks of 8 mm diameter (Millipore Corp., Bedford, MA) are wetted with 10 ul of NMM solution (N-methylmorpholine/propanaol- 2/water; 2/49/49 v/v/v) and a linker arm introduced by reaction with 10 ul of a 10 mM solution of 1,6-diaminohexane in NMM
- NMM solution N-methylmorpholine/propanaol- 2/water; 2/49/49 v/v/v
- linker arm introduced by reaction with 10 ul of a 10 mM solution of 1,6-diaminohexane in NMM
- the excess diamine is removed by three washing steps with 100 ul of NMM solution.
- the four Sanger DNA sequencing reaction mixtures (150 ul each in Eppendorf tubes) are heated for 10 min at 70 C to inactivate the DNA polymerase, ethanol precipitated, and the pellets resuspended in 10 ul of a solution of N-methylmorpholine, 2-propanol and water (2/10/88 v/v/v). This solution is transferred to the Lys-Lys-DITC membrane disks and coupled on a heating block set at 55 C. After drying, 10 ul of NMM solution is added and the drying process repeated.
- the immobilized primer-extension products are extensively washed three times with 100 ul each of NMM solution and three times with 100 ul each of 10 mM TEAA buffer pH 7.2.
- the bond between the primer- extension products and the solid support is cleaved by treatment with trypsin under standard conditions and the released products analyzed by either ES or MALDI mass spectrometry with trypsin serving as an internal mass standard
- DITC Sequelon membrane disks of 8 mm diameter
- disks of 8 mm diameter are prepared as described in EXAMPLE 3 and 10 ul of a 10 mM solution of 3-aminopyridine adenine dinucleotide (APAD) (Sigma) in NMM solution added.
- APAD 3-aminopyridine adenine dinucleotide
- the excess APAD is removed by a 10 ul wash of NMM solution and the disks are treated with 10 ul of 10 mM sodium periodate in NMM solution (15 min, 25 C).
- primer-extension products are extensively washed with the NMM solution (3 times with 100 ul each) and 10 mM TEAA buffer pH 7.2 (3 times with 100 ul each) and the purified primer-extension products are released by treatment with either NADase or pyrophosphatase in 10 mM TEAA buffer at pH 7.2 at 37 C for 15 min, lyophilized and analyzed by either ES or MALDI mass spectrometry, the enzymes serving as internal mass standards.
- Oligonucleotides are synthesized by standard automated DNA synthesis using ⁇ -cyanoethylphosphoamidites (H. K ⁇ ster et al., Nucleic Acids Res. )2, 4539 (1984)) and a 5'-amino group is introduced at the end of solid phase DNA synthesis (e.g. Agrawal et al, Nucleic Acids Res. 14, 6227-45 (1986) or Sproat et al, Nucleic Acids Res. 15. 6181-96 (1987)).
- oligonucleotide synthesis starting with 0.25 umol CPG-bound nucleoside, is deprotected with concentrated aqueous ammonia, purified via OligoPAK T M Cartridges (Millipore Corp., Bedford, MA) and lyophilized. This material with a 5'-terminal amino group is dissolved in 100 ul absolute
- N,N-dimethylformamide DMF
- N-Fmoc-glycine pentafluorophenyl ester for 60 min at 25 C.
- the Fmoc group is cleaved off by a 10 min treatment with 100 ul of a solution of 20% piperidine in N,N-dimethylformamide.
- Excess piperidine, DMF and the cleavage product from the Fmoc group are removed by ethanol precipitation and the precipitate lyophilized from 10 mM TEAA buffer pH 7.2.
- This material is now either used as primer for the Sanger DNA sequencing reactions or one or more glycine residues (or other suitable protected amino acid active esters) are added to create a series of mass- modified primer oligonucleotides suitable for Sanger DNA or RNA sequencing. Immobilization of these mass-modified nucleic acid primers UP after primer-extension during the sequencing process can be achieved as described, e.g., in EXAMPLES 1 to 4.
- the Fmoc group is removed at the end of the solid phase synthesis with a 20 min treatment with a 20 % solution of piperidine in DMF at room temperature. DMF is removed by a washing step with acetonitrile and the oligonucleotide deprotected and purified in the standard way
- the mass-modifying functionality was obtained as follows: 7.61 g (100.0 mmole) freshly distilled ethylene glycol monomethyl ether dissolved in 50 ml absolute pyridine was reacted with 10.01 g (100.0 mmole) recrystallized succinic anhydride in the presence of 1.22 g (10 0 mmole) 4-N,N- dimethylaminopyridine overnight at room temperature The reaction was terminated by the addition of water (5 0 ml), the reaction mixture evaporated in vacuo, co-evaporated twice with dry toluene (20 ml each) and the residue redissolved in 100 ml dichloromethane The solution was extracted successively, twice with 10 % aqueous citric acid (2 x 20 ml) and once with water (20 ml) and the organic phase dried over anhydrous sodium sulfate.
- the reaction mixture was evaporated in vacuo, co-evaporated with toluene, redissolved in dichloromethane and chromatographed on silicagel (Si60, Merck, column 4x50 cm) with dichloromethane/methanol mixtures The fractions containing the desired compound were collected, evaporated, redissolved in 25 ml dichloromethane and precipitated into 250 ml pentane
- the dried precipitate of 5-(3-N-(O-succinyl ethylene glycol monomethyl ether)-amidopropynyl-l)-2'-deoxyuridine (yield 65 %) is 5'-O-dimethoxytritylated and transformed into the nucleoside-3 '-O- ⁇ -cyanoethyl-N, N-diisopropylphosphoamidite and incorporated as a building block in the automated oligonucleotide synthesis according to standard procedures.
- the mass-modified nucleotide can
- nucleosidic starting material was as in previous examples, 5-(3- aminopropynyl-l)-2'-deoxyuridine.
- the mass-modifying functionality was obtained similar to EXAMPLE 8. 12.02 g (100.0 mmole) freshly distilled diethylene glycol monomethyl ether dissolved in 50 ml absolute pyridine was reacted with 10.01 g (100.0 mmole) recrystallized succinic anhydride in the presence of 1.22 g (10.0 mmole) 4-N, N- dimethylaminopyridine (DMAP) overnight at room temperature.
- DMAP N- dimethylaminopyridine
- the mass-modified building block is incorporated into automated chemical DNA synthesis according to standard procedures.
- one or more of the thymidine/uridine residues can be substituted by this mass-modified nucleotide.
- the nucleic acid primers of EXAMPLES 8 and 9 would have a mass difference of 44.05 daltons.
- the product fractions were combined, the solvent evaporated, the fractions dissolved in 5 ml dichloromethane and precipitated into 100 ml pentane. Yield was 487 mg (0.76 mmole, 76 %). Transformation into the corresponding nucleoside- ⁇ -cyanoethylphosphoamidite and integration into automated chemical DNA synthesis is performed under standard conditions. During final deprotection with aqueous concentrated ammonia, the methyl group is removed from the glycine moiety.
- the mass-modified building block can substitute one or more deoxyadenosine/adenosine residues in the nucleic acid primer sequence.
- This derivative was prepared in analogy to the glycine derivative of
- the mass-modified deoxythymidine derivative can substitute for one or more of the thymidine residues in the nucleic acid primer.
- the 4-nitrophenyl ester of succinylated diethylene glycol monomethyl ether see EXAMPLE 9
- triethylene glycol monomethyl ether the corresponding mass-modified oligonucleotides are prepared.
- the mass difference between the ethylene, diethylene and triethylene glycol derivatives is 44.05, 88.1 and 132.15 daltons respectively.
- the alkylated oligonucleotide was purified by standard reversed phase HPLC (RP-18 Ultraphere, Beckman; column: 4.5 x 250 mm; 100 mM triethylammonium acetate, pH 7.0 and a gradient of 5 to 40 % acetonitrile).
- the nucleic acid primer containing one or more phosphorothioate phosphodiester bond is used in the Sanger sequencing reactions.
- the primer-extension products of the four sequencing reactions are purified as exemplified in EXAMPLES 1 - 4, cleaved off the solid support, lyophilized and dissolved in 4 ⁇ l each of TE buffer pH 8.0 and alkylated by addition of 2 ⁇ l of a 20 mM solution of 2-iodoethanol in DMF. It is then analyzed by ES and/or MALDI mass spectrometry.
- 4-iodobutanol mass-modified nucleic acid primer are obtained with a mass difference of 14.03, 28.06 and 42.03 daltons respectively compared to the unmodified phosphorothioate phosphodiester-containing oligonucleotide.
- Solvents were removed by evaporation in vacuo and the residue purified by silica gel chromatography. Yield was 71 1 mg (0.71 mmole, 82 %). Detritylation was achieved by a one hour treatment with 80% aqueous acetic acid at room temperature. The residue was evaporated to dryness, co-evaporated twice with toluene, suspended in 1 ml dry acetonitrile and 5'-phosphorylated with POCI3 according to literature (Yoshikawa et al. , Bull Chem. Soc. Japan 42, 3505 (1969) and Sowa et al, Bull. Chem. Soc.
- Japan 48, 2084 (1975) and directly transformed in a one-pot reaction to the 5'-triphosphate using 3 ml of a 0.5 M solution (1.5 mmole) tetra (tri-n-butylammonium) pyrophosphate in DMF according to literature (e.g. Seela et al, Helvetica Chimica Acta 24, 1048 (1991)).
- the Fmoc and the 3'-O-acetyl groups were removed by a one-hour treatment with concentrated aqueous ammonia at room temperature and the reaction mixture evaporated and lyophilized.
- a glycyl-glycine modified 2'-amino-2'-deoxyuridine-5 '-triphosphate was obtained by removing the Fmoc group from 5'-O-(4,4-dimethoxytrityI)-3'-O-acetyl-2'-N- (N-9-fluorenylmethyloxycarbonyl-glycyl)-2'-amino-2'-deoxyuridine by a one-hour treatment with a 20% solution of piperidine in DMF at room temperature, evaporation of solvents, two-fold co-evaporation with toluene and subsequent condensation with N- Fmoc-glycine pentafluorophenyl ester.
- the mass difference between the glycine, ⁇ -alanine and glycyl-glycine mass-modified nucleosides is, per nucleotide inco ⁇ orated, 58.06, 72.09 and 115.1 daltons respectively.
- mass- modified nucleoside triphosphates serve as a terminating nucleotide unit in the Sanger DNA sequencing reactions providing a mass difference per terminated fragment of 58.06, 72.09 and 1 15.1 daltons respectively when used in the multiplexing sequencing mode.
- the mass-differentiated fragments can then be analyzed by ES and/or MALDI mass spectrometry.
- EXAMPLE 15 Synthesis of deoxyuridine-5'-triphosphate mass-modified at C-5 of the heterocyciic base with glycine, glycyl-glycine and ⁇ -alanine residues.
- Mass-modification of Sanger DNA sequencing fragment ladders by incorporation of chain-elongating 2'-deoxy- and chain-terminating 2',3'-dideoxythymidine-5'- (alpha-S-)-triphosphate and subsequent alkylation with 2-iodoethanol and 3- iodopropanoi 2',3'-Dideoxythymidine-5'-(alpha-S)-triphosphate was prepared according to published procedures (e.g., for the alpha-S-triphosphate moiety: Eckstein et al, Biochemistry 15, 1685 (1976) and Accounts Chem. Res.
- the template (2 pmole) and the nucleic acid M13 sequencing primer (4 pmole) modified according to EXAMPLE 1 are annealed by heating to 65 C in 100 ul of 10 mM Tris-H ⁇ pH 7.5, 10 mM MgCl 2 , 50 mM NaCI, 7 mM dithiothreitol (DTT) for 5 min and slowly brought to 37 C during a one hour period.
- the sequencing reaction mixtures contain, as exemplified for the T-specific termination reaction, in a final volume of 150 ul, 200 uM (final concentration) each of dATP, dCTP, dTTP, 300 uM c7-deaza-dGTP, 5 uM 2',3'- dideoxythymidine-5'-(alpha-S)-triphosphate and 40 units Sequenase (United States Biochemicals). Polymerization is performed for 10 min at 37 C, the reaction mixture heated to 70 C to inactivate the Sequenase, ethanol precipitated and coupled to thiolated
- Sequelon membrane disks (8 mm diameter) as described in EXAMPLE 1. Alkylation is performed by treating the disks with 10 ul of 10 mM solution of either 2-iodoethanol or
- Oligothymidylic acid oligo p(dT) 12-18
- a matrix solution of 0.5 M in ethanol was prepared.
- Various matrices were used for this Example and Examples 19- 21 such as 3,5-dihydroxybenzoic acid, sinapinic acid, 3-hydroxypicolinic acid, 2,4,6- trihydroxyacetophenone.
- Oligonucleotides were lyophilized after purification by HPLC and taken up in ultrapure water (MilliQ, Millipore) using amounts to obtain a concentration of 10 pmoles/ ⁇ l as stock solution.
- MALDI-TOF spectra were obtained for this Example and Examples 19-21 on different commercial instruments such as Vision 2000 (Finnigan-MAT), VG TofSpec (Fisons Instruments), LaserTec Research (Vestec). The conditions for this Example were linear negative ion mode with an acceleration voltage of 25 kV.
- the MALDI-TOF spectrum generated is shown in FIGURE 14. Mass calibration was done externally and generally achieved by using defined peptides of appropriate mass range such as insulin, gramicidin S, trypsinogen, bovine serum albumen, and cytochrome C. All spectra were generated by employing a nitrogen laser with 5 nsec pulses at a wavelength of 337 nm.
- oligonucleotides Two large oligonucleotides were analyzed by mass spectrometry.
- the 50- mer d (TAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGCT) (SEQ ID NO:3) and dT(pdT) 99 were used.
- the oligodeoxynucleotides were synthesized using ⁇ -cyanoethylphosphoamidites and purified using published procedures. (e.g. N.D. Sinha, J. Biernat, J. McManus and H. K ⁇ ster, Nucleic Acids Res .
- Example 19 The 13 DNA sequences representing the nested dT-terminated fragments of the Sanger DNA sequencing for the 50-mer described in Example 19 (SEQ ID NO:3) were synthesized as described in Example 19. The samples were treated and 500 fmol of each fragment was analyzed by MALDI-MS as described in Example 18. The resulting MALDI-TOF spectra are shown in FIGURE 16. The conditions were reflectron positive ion mode with an acceleration of 5 kV and postacceleration of 20 kV. Calculated molecular masses and experimental molecular masses are shown in Table 1.
- the samples were prepared and 500 fmol of each modified 17-mer was analyzed using MALDI-MS as described in Example 18.
- the conditions used were reflectron positive ion mode with an acceleration of 5 kV and postacceleration of 20 kV.
- the MALDI-TOF spectra which were generated were superimposed and are shown in FIGURE 18.
- oligodeoxynucleotide primers were either synthesized according to standard phosphoamidite chemistry (Sinha, N.D,. et al., (1983) Tetrahedron Let. Vol. 24, Pp. 5843-5846; Sinha, N.D., et al., (1984) Nucleic Acids Res, Vol. 12, Pp. 4539-4557) on a MilliGen 7500 DNA synthesizer (Millipore, Bedford, MA USA) in 200 nmol scales or purchased from MWG-Biotech (Ebersberg, Germany, primer 3) and Biometra (Goettingen, Germany, primers 6-7).
- primer 1 5 ' - GTCACCCTCGACCTGCAG SEQ. LD. NO. 6); primer 2: 5 ' - TTGTAAAACGACGGCCAGT (SEQ. LD. NO. 7); primer 3: 5 ' - CTTCCACCGCGATGTTGA (SEQ. LD. NO. 8); primer 4: 5 ' - CAGGAAACAGCTATGAC (SEQ. LD. NO. 9); primer 5: 5 ' - GTAAAACGACGGCCAGT (SEQ. LD. NO. 10); primer 6: 5 ' - GTCACCCTCGACCTGCAgC (g: RiboG) (SEQ. LD. NO. 11); primer 7: 5 ' - GTTGTAAAACGAGGGCCAgT (g: RiboG) (SEQ. LD. NO. 12);
- the 103-mer DNA strands (modified and unmodified) were amplified from M13mp18 RFI DNA (100 ng, Pharmacia, Freiburg, Germany) in 100 ⁇ L reaction volume using primers 4 and 5 all other concentrations were unchanged.
- the reaction was performed using the cycle: denaturation at 95°C for 1 min., annealing at 40°C for 1 min. and extension at 72 °C for 1 min. After 30 cycles for the unmodified and 40 cycles for the modified 103-mer respectively, the samples were incubated for additional 10 min. at 72°C.
- Vent DNA polymerase were able to incorporate c -dATP and c -dGTP during PCR as well.
- the overall performance turned out to be best for the exo(-)Pfu DNA polymerase giving least side products during amplification. Using all three polymerases,
- RNA polymerases such as the SP6 or the T7 RNA polymerase, must be used
- the 99-mer, 103-mer and 200-mer PCR products were analyzed by MALDI-TOF MS. Based on past experience, it was known that the degree of depurination depends on the laser energy used for desorption and ionization of the analyte. Since the influence of 7-deazapurine modification on fragmentation due to depurination was to be investigated, all spectra were measured at the same relative laser energy.
- Figures 28a and 28b show the mass spectra of the modified and unmodified 103-mer nucleic acids. In case of the modified 103-mer, fragmentation
- the modified 103-mer still contains about 20% A and G from the oligonucleotide primers, it shows less fragmentation which is featured by much more narrow and symmetric signals. Especially peak tailing on the lower mass side due to depurination, is substantially reduced. Hence, the difference between measured and calculated mass is strongly reduced although it is
- a complete 7-deaza purine modification of nucleic acids may be achieved either using modified primers in PCR or cleaving the unmodified primers from the partially modified PCR product. Since disadvantages are associated with modified primers, as described above, a 100-mer was synthesized using primers with a ribo- modification The primers were cleaved hydrolytically with NaOH according to a method developed earlier in our laboratory (Koester, H. et al , Z Physiol. Chem. 359 1570- 1589) Figures 31 a and 3 lb display the spectra of the PCR product before and after primer cleavage.
- Oligonucleotides were purchased from Operon Technologies (Alameda, CA) in an unpurified form. Their sequences are listed in Table III. Sequencing reactions were performed on a solid surface using reagents from the sequencing kit for Sequenase Version 2.0 (Amersham, Arlington Heights, Illinois). Sequencing a 39-mer target Sequencing complex:
- template strand DNA11683 was 3'-biotinylated by terminal deoxynucleotidyl transferase.
- a 30 ⁇ l reaction containing 60 pmol of DNA1 1683, 1.3 nmol of biotin 14-dATP (GLBCO BRL, Grand Island, NY), 30 units of terminal transferase (Amersham, Arlington Heights, Illinois), and lx reaction buffer (supplied with enzyme), was incubated at 37°C for 1 hour. The reaction was stopped by heat inactivation of the terminal transferase at 70 °C for 10 min. The resulting product was desalted by passing through a TE-10 spin column (Clonetech).
- Biotin- 14-d ATP More than one molecules of biotin- 14-d ATP could be added to the 3 '-end of DNA1 1683.
- the biotinylated DNA1 1683 was incubated with 0.3 mg of Dynal streptavidin beads in 30 ⁇ l lx binding and washing buffer at ambient temperature for 30 min. The beads were washed twice with TE and redissolved in 30 ⁇ l TE, 10 ⁇ l aliquot (containing 0.1 mg of beads) was used for sequencing reactions.
- the 0.1 mg beads from previous step were resuspended in a lO ⁇ l volume containing 2 ⁇ l of 5x Sequenase buffer (200 mM Tris-HCI, pH 7.5, 100 mM MgC12, and 250 mM NaCI) from the Sequenase kit and 5 pmol of corresponding primer PNA16/DNA.
- the annealing mixture was heated to 70 °C and allowed to cool slowly to room temperature over a 20-30 min time period. Then 1 ⁇ l 0.1 M dithiothreitol solution, 1 ⁇ l Mn buffer (0.15 M sodium isocitrate and 0.1 M McC 12), and 2 ⁇ l of diluted
- Sequenase (3.25 units) were added.
- the reaction mixture was divided into four aliquots of 3 ⁇ l each and mixed with termination mixes (each consists of 3 ⁇ l of the appropriate termination mix: 32 ⁇ M c7dATP, 32 ⁇ M dCTP, 32 ⁇ M c7dGTP, 32 ⁇ M dTTP and 3.2 ⁇ M of one of the four ddTNPs, in 50 mM NaCI).
- the reaction mixtures were incubated at 37°C for 2 min. After the completion of extension, the beads were precipitated and the supernatant was removed. The beads were washed twice and resuspended in TE and kept at 4°C.
- the target TNR.PLASM2 was biotinylated and sequenced using procedures similar to those described in previous section (sequencing a 39-mer target).
- CM1B3B was immobilized on Dynabeads M280 with streptavidin (Dynal, Norway) by incubating 60 pmol of CM1B3B with 0.3 magnetic beads in 30 ⁇ l 1M NaCI and TE (lx binding and washing buffer) at room temperature for 30 min. The beads were washed twice with TE and redissolved in 30 ⁇ l TE, 10 or 20 ⁇ l aliquot (containing 0.1 or 0.2 mg of beads respectively) was used for sequencing reactions.
- the duplex was formed by annealing corresponding aliquot of beads from previous step with 10 pmol of DFl la5F (or 20 pmol of DFl la5F for 0.2 mg of beads) in a 9 ⁇ l volume containing 2 ⁇ l of 5x Sequenase buffer (200 mM Tris-HCI, pH 7.5, 100 mM MgCll, and 250 mM NaCI) from the Sequenase kit.
- the annealing mixture was heated to 65 °C and allowed to cool slowly to 37°C over a 20-30 min time period.
- the duplex primer was then mixed with 10 pmol of TSlo (20 pmol of TS10 for 0.2 mg of beads) in 1 ⁇ l volume, and the resulting mixture was further incubated at 37° C for 5 min, room temperature for 5-10 min. Then 1 ⁇ l 0.1 M dithiothreitol solution, 1 ⁇ l Mn buffer (0.15 M sodium isocitrate and 0.1 M MnCl 2 ), and 2 ⁇ l of diluted Sequenase (3.25 units) were added.
- the reaction mixture was divided into four aliquots of 3 ⁇ l each and mixed with termination mixes (each consists of 4 ⁇ l of the appropriate termination mix: 16 ⁇ M dATP, 16 ⁇ M dCTP, 16 ⁇ M dGTP, 16 ⁇ M dTTP and 1.6 ⁇ M of one of the four ddNTPs, in 50 mM NaCI).
- the reaction mixtures were incubated at room temperature for 5 min, and 37°C for 5 min. After the completion of extension, the beads were precipitated and the supernatant was removed. The beads were resuspended in 20 ⁇ l TE and kept at 4°C.
- the sequencing ladder loaded magnetic beads were washed twice using 50 mM ammonium citrate and resuspended in 0.5 ⁇ l pure water. The suspension was then loaded onto the sample target of the mass spectrometer and 0.5 ⁇ l of saturated matrix solution (3-hydropicolinic acid (HPA): ammonium citrate
- the reflectron TOFMS mass spectrometer (Vision 2000, Finnigan MAT, Bremen, Germany) was used for analysis. 5 kV was applied in the ion source and 20 kV was applied for postacceleration. All spectra were taken in the positive ion mode and a nitrogen laser was used. Normally, each spectrum was averaged for more than 100 shots and a standard 25-point smoothing was applied.
- a primer is directly annealed to the template and then extended and terminated in a Sanger dideoxy sequencing.
- a biotinylated primer is used and the sequencing ladders are captured by streptavidin- coated magnetic beads. After washing, the products are eluted from the beads using
- a 39-mer template (SEQ LD No. 13) was first biotinylated at the 3' end by adding biotin- 14-d ATP with terminal transferase More than one biotin- 14-d ATP molecule could be added by the enzyme However, since the template was immobilized and remained on the beads during MALDI, the number of biotin- 14-dATP would not affect the mass spectra
- a 14-mer primer (SEQ. LD No 14) was used for the solid-state sequencing MALDI-TOF mass spectra of the four sequencing ladders are shown in Figure 32, and the expected theoretical values are shown in Table III. The sequencing reaction produced a relatively homogenous ladder, and the full-length sequence was determined easily.
- a 78-mer template containing a CTG repeat (SEQ. ID. No. 15) was 3'-biotinylated by adding biotin- 14-d ATP with terminal transferase.
- An 18-mer primer (SEQ. ID. No. 16) was annealed right outside the CTG repeat so that the repeat could be sequenced immediately after primer extension.
- the four reactions were washed and analyzed by MALDI-TOFMS as usual.
- An example of the G-reaction is shown in Figure 33 and the expected sequencing ladder is shown in Table IV with theoretical mass values for each ladder component. All sequencing peaks were well resolved except the last component (theoretical value 20577.4) was indistinguishable from the background.
- Duplex DNA probes with single-stranded overhang have been demonstrated to be able to capture specific DNA templates and also serve as primers for solid-state sequencing.
- the scheme is shown in Figure 34. Stacking interactions between a duplex probe and a single- stranded template allow only 5-base overhand to be sufficient for capturing. Based on this format, a 5' fluorescent-labeled 23-mer (5*-GAT GAT CCG ACG CAT CAC AGC TC) (SEQ. ID. No. 19) was annealed to a 3'-biotinylated 18-mer (5'-GTG ATG CGT CGG ATC ATC) (SEQ. ID. NO.
Abstract
The invention describes a new method to sequence DNA. The improvements over the existing DNA sequencing technologies are high speed, high throughput, no electrophoresis and gel reading artifacts due to the complete absence of an electrophoretic step, and no costly reagents involving various substitutions with stable isotopes. The invention utilizes the Sanger sequencing strategy and assembles the sequence information by analysis of the nested fragments obtained by base-specific chain termination via their different molecular masses using mass spectrometry, as for example, MALDI or ES mass spectrometry. A futher increase in throughtput can be obtained by introducing mass-modifications in the oligonucleotide primer, chain-terminating nucleoside triphosphates and/or in the chain-elongating nucleoside triphosphates, as well as using integrated tag sequences which allow multiplexing by hybridization of tag specific probes with mass differentiated molecular weights.
Description
DNA SEQUENCING BY MASS SPECTROMETRY
Related Applications
This application is a continuation-in-part of U.S. Application Serial Number 08/617,010, which is a continuation-in-part of U.S. Application Serial Number 08/178,216, which issued as U.S. Patent No. 5,547,835, and which itself is a continuation-in-part of U.S. Application Serial Number 08/001,323 filed January 7, 1993, which is now abandoned. The contents of all related applications are incorporated herein by reference.
Background of the Invention
Since the genetic information is represented by the sequence of the four DNA building blocks deoxyadenosine- (dpA), deoxyguanosine- (dpG), deoxycytidine- (dpC) and deoxythymidine-5'-phosphate (dpT), DNA sequencing is one of the most fundamental technologies in molecular biology and the life sciences in general. The ease and the rate by which DNA sequences can be obtained greatly affects related technologies such as development and production of new therapeutic agents and new and useful varieties of plants and microorganisms via recombinant DNA technology. In particular, unraveling the DNA sequence helps in understanding human pathological conditions including genetic disorders, cancer and AIDS. In some cases, very subtle differences such as a one nucleotide deletion, addition or substitution can create serious, in some cases even fatal, consequences. Recently, DNA sequencing has become the core technology of the Human Genome Sequencing Project (e.g., J.E. Bishop and M. Waldholz, 1991, Genome: The Story of the Most Astonishing Scientific Adventure of Our Time - The Attempt to Map All the Genes in the Human Body. Simon & Schuster, New York). Knowledge of the complete human genome DNA sequence will certainly help to understand, to diagnose, to prevent and to treat human diseases. To be able to tackle successfully the determination of the approximately 3 billion base pairs of the human genome in a reasonable time frame and in an economical way, rapid, reliable,
sensitive and inexpensive methods need to be developed, which also offer the possibility of automation. The present invention provides such a technology.
Recent reviews of today's methods together with future directions and trends are given by Barrell (The FASEB Journal 5_, 40-45 (1991)), and Trainor (Anal. Chem. 62. 418-26 (1990)).
Currently, DNA sequencing is performed by either the chemical degradation method of Maxam and Gilbert (Methods in Enzvmology 6_5_, 499-560 (1980)) or the enzymatic dideoxynucleotide termination method of Sanger et al. (Proc. Natl. Acad. Sci. USA 74. 5463-67 (1977)). In the chemical method, base specific modifications result in a base specific cleavage of the radioactive or fluorescently labeled DNA fragment With the four separate base specific cleavage reactions, four sets of nested fragments are produced which are separated according to length by polyacrylamide gel electrophoresis (PAGE). After autoradiography, the sequence can be read directly since each band (fragment) in the gel originates from a base specific cleavage event. Thus, the fragment lengths in the four "ladders" directly translate into a specific position in the DNA sequence.
In the enzymatic chain termination method, the four base specific sets of DNA fragments are formed by starting with a primer/template system elongating the primer into the unknown DNA sequence area and thereby copying the template and synthesizing a complementary strand by DNA polymerases, such as Klenow fragment of E. coli DNA polymerase I, a DNA polymerase from Thermus aquaticus, Taq DNA polymerase, or a modified T7 DNA polymerase, Sequenase (Tabor et al., Proc. Natl. Acad. Sci. USA 84, 4767-4771 (1987)), in the presence of chain-terminating reagents. Here, the chain-terminating event is achieved by incorporating into the four separate reaction mixtures in addition to the four normal deoxynucleoside triphosphates, dATP, dGTP, dTTP and dCTP, only one of the chain-terminating dideoxynucleoside triphosphates, ddATP, ddGTP, ddTTP or ddCTP, respectively, in a limiting small concentration. The four sets of resulting fragments produce, after electrophoresis, four base specific ladders from which the DNA sequence can be determined. A recent modification of the Sanger sequencing strategy involves the degradation of phosphorothioate-containing DNA fragments obtained by using alpha-thio dNTP instead of the normally used ddNTPs during the primer extension reaction
mediated by DNA polymerase (Labeit et α/.. DNA 5I 173-177 (1986); Amersham, PCT- Application GB86/00349; Eckstein et al., Nucleic Acids Res. 16 9947 (1988)). Here, the four sets of base-specific sequencing ladders are obtained by limited digestion with exonuclease III or snake venom phosphodiesterase, subsequent separation on PAGE and visualization by radioisotopic labeling of either the primer or one of the dNTPs. In a further modification, the base-specific cleavage is achieved by alkylating the sulphur atom in the modified phosphodiester bond followed by a heat treatment (Max-Planck- Gesellschaft, DE 3930312 Al). Both methods can be combined with the amplification of the DNA via the Polymerase Chain Reaction (PCR). On the upfront end, the DNA to be sequenced has to be fragmented into sequencable pieces of currently not more than 500 to 1000 nucleotides. Starting from a genome, this is a multi-step process involving cloning and subcloning steps using different and appropriate cloning vectors such as YAC, cosmids, plasmids and Ml 3 vectors (Sambrook et al, Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, 1989). Finally, for Sanger sequencing, the fragments of about 500 to 1000 base pairs are integrated into a specific restriction site of the replicative form I (RF I) of a derivative of the M13 bacteriophage (Vieria and Messing, Gene 19, 259 (1982)) and then the double-stranded form is transformed to the single-stranded circular form to serve as a template for the Sanger sequencing process having a binding site for a universal primer obtained by chemical DNA synthesis (Sinha, Biernat, McManus and Koster. Nucleic Acids Res. 12. 4539-57 (1984); U.S. Patent No. 4725677 upstream of the restriction site into which the unknown DNA fragment has been inserted. Under specific conditions, unknown DNA sequences integrated into supercoiled double- stranded plasmid DNA can be sequenced directly by the Sanger method (Chen and Seeburg, DNA 4, 165-170 (1985)) and Lim et al., Gene Anal. Techn. 5_, 32-39 (1988), and, with the Polymerase Chain Reaction (PCR) (PCR Protocols: A Guide to Methods and Applications, Innis et al., editors, Academic Press, San Diego (1990)) cloning or subcloning steps could be omitted by directly sequencing off chromosomal DNA by first amplifying the DNA segment by PCR and then applying the Sanger sequencing method (Innis et al. , Proc. Natl. Acad. Sci. USA 85, 9436-9440 (1988)). In this case, however, the DNA sequence in the interested region most be known at least to the extent to bind a sequencing primer.
In order to be able to read the sequence from PAGE, detectable labels have to be used in either the primer (very often at the 5 '-end) or in one of the deoxynucleoside triphosphates, dNTP. Using radioisotopes such as 32P, 33P or 35S is still the most frequently used technique. After PAGE, the gels are exposed to X-ray films and silver grain exposure is analyzed. The use of radioisotopic labeling creates several problems.
Most labels useful for autoradiographic detection of sequencing fragements have relatively short half-lives which can limit the useful time of the labels. The emission high energy beta radiation, particularly from 32P, can lead to breakdown of the products via radiolysis so that the sample should be used very quickly after labeling. In addition, high energy radiation can also cause a deterioration of band sharpness by scattering. Some of these problems can be reduced by using the less energetic isotopes such as 33P or 35S
(see, e.g., Ornstein et al, Biotechniques 3. 476 (1985)). Here, however, longer exposure times have to be tolerated. Above all, the use of radioisotopes poses significant health risks to the experimentalist and, in heavy sequencing projects, decontamination and handling the radioactive waste are other severe problems and burdens.
In response to the above mentioned problems related to the use of radioactive labels, non-radioactive labeling techniques have been explored and, in recent years, integrated into partly automated DNA sequencing procedures. All these improvements utilize the Sanger sequencing strategy. The fuorescent label can be tagged to the primer (Smith et al, Nature 321, 674-679 (1986) and EPO Patent No.
87300998.9; Du Pont De Nemours EPO Application No. 0359225; Ansorge et al. J. Biochem. Biophys. Methods 13, 325-32 (1986)) or to the chain-terminating dideoxynucloside triphosphates (Prober et al. Science 238. 336-41 (1987); Applied Biosystems, PCT Application WO 91/05060). Based on either labeling the primer or the ddNTP, systems have been developed by Applied Biosystems (Smith et al, Science 235. G89 (1987); U.S. Patent Nos. 570973 and 689013), Du Pont De Nemours (Prober et al. Science 238. 336-341 (1987); U.S. Patents Nos. 881372 and 57566), Pharmacia-LKB (Ansorge et al Nucleic Acids Res. 15 , 4593-4602 (1987) and EMBL Patent Application DE P3724442 and P3805808.1) and Hitachi (JP 1-90844 and DE 4011991 Al). A somewhat similar approach was developed by Brumbaugh et al (Proc. Natl. Sci. USA 85, 5610-14 (1988) and U.S. Patent No. 4,729,947). An improved method for the Du Pont system using two electrophoretic lanes with two different specific labels per lane is
described (PCT Application WO92/02635). A different approach uses fluorescently labeled avidin and biotin labeled primers. Here, the sequencing ladders ending with biotin are reacted during electrophoresis with the labeled avidin which results in the detection of the individual sequencing bands (Brumbaugh et al, U.S. Patent No. 594676). More recently even more sensitive non-radioactive labeling techniques for
DNA using chemiluminescence triggerable and amplifyable by enzymes have been developed (Beck, O'Keefe, Coull and Kόster. Nucleic Acids Res. 17. 5115-5123 (1989) and Beck and Kόster, Anal. Chem. 62, 2258-2270 (1990)). These labeling methods were combined with multiplex DNA sequencing (Church et al. Science 240, 185-188 (1988) to provide for a strategy aimed at high throughput DNA sequencing (Kόster et al ,
Nucleic Acids Res Symposium Ser No 24. 318-321 (1991), University of Utah, PCT Application No. WO 90/15883); this strategy still suffers from the disadvantage of being very laborious and difficult to automate.
In an attempt to simplify DNA sequencing, solid supports have been introduced. In most cases published so far, the template strand for sequencing (with or without PCR amplification) is immobilized on a solid support most frequently utilizing the strong biotin-avidin/streptavidin interaction (Orion- Yhtyma Oy, U.S. Patent No. 277643; M. Uhlen et al Nucleic Acids Res. 16, 3025-38 (1988); Cemu Bioteknik, PCT Application No. WO 89/09282 and Medical Research Council, GB, PCT Application No. WO 92/03575). The primer extension products synthesized on the immobilized template strand are purified of enzymes, other sequencing reagents and by-products by a washing step and then released under denaturing conditions by loosing the hydrogen bonds between the Watson-Crick base pairs and subjected to PAGE separation. In a different approach, the primer extension products (not the template) from a DNA sequencing reaction are bound to a solid support via biotin/avidin (Du Pont De Nemours, PCT Application WO 91/11533). In contrast to the above mentioned methods, here, the interaction between biotin and avidin is overcome by employing denaturing conditions (formamide/EDTA) to release the primer extension products of the sequencing reaction from the solid support for PAGE separation. As solid supports, beads, (e.g., magnetic beads (Dynabeads) and Sepharose beads), filters, capillaries, plastic dipsticks (e.g., polystyrene strips) and microtiter wells are being proposed.
All methods discussed so far have one central step in common: polyacrylamide gel electrophoresis (PAGE). In many instances, this represents a major drawback and limitation for each of these methods. Preparing a homogeneous gel by polymerization, loading of the samples, the electrophoresis itself, detection of the sequence pattern (e.g., by autoradiography), removing the gel and cleaning the glass plates to prepare another gel are very laborious and time-consuming procedures. Moreover, the whole process is error-prone, difficult to automate, and, in order to improve reproducibility and reliability, highly trained and skilled personnel are required. In the case of radioactive labeling, autoradiography itself can consume from hours to days. In the case of fluorescent labeling, at least the detection of the sequencing bands is being performed automatically when using the laser-scanning devices integrated into commercial available DNA sequencers. One problem related to the fluorescent labeling is the influence of the four different base-specific fluorescent tags on the mobility of the fragments during electrophoresis and a possible overlap in the spectral bandwidth of the four specific dyes reducing the discriminating power between neighboring bands, hence, increasing the probability of sequence ambiguities. Artifacts are also produced by base- specific interactions with the polyacrylamide gel matrix (Frank and Kόster, Nucleic Acids Res. 6, 2069 (1979)) and by the formation of secondary structures which result in "band compressions" and hence do not allow one to read the sequence. This problem has, in part, been overcome by using 7-deazadeoxyguanosine triphosphates (Barr et al, Biotechniques 4. 428 (1986)). However, the reasons for some artifacts and conspicuous bands are still under investigation and need further improvement of the gel electrophoretic procedure.
A recent innovation in electrophoresis is capillary zone electrophoresis (CZE) (Jorgenson et al. , J Chromatography 3_5_2, 337 (1986); Gesteland et al ,
Nucleic Acids Res. 18. 1415-1419 (1990)) which, compared to slab gel electrophoresis (PAGE), significantly increases the resolution of the separation, reduces the time for an electrophoretic run and allows the analysis of very small samples. Here, however, other problems arise due to the miniaturization of the whole system such as wall effects and the necessity of highly sensitive on-line detection methods Compared to PAGE, another drawback is created by the fact that CZE is only a "one-lane" process, whereas in PAGE samples in multiple lanes can be electrophoresed simultaneously
Due to the severe limitations and problems related to having PAGE as an integral and central part in the standard DNA sequencing protocol, several methods have been proposed to do DNA sequencing without an electrophoretic step. One approach calls for hybridization or fragmentation sequencing (Bains, Biotechnology 10, 757-58 ( 1992) and Mirzabekov et al , FEBS Letters 256: 1 18- 122 ( 1989)) utilizing the specific hybridization of known short oligonucleotides (e.g., octadeoxynucleotides which gives 65,536 different sequences) to a complementary DNA sequence. Positive hybridization reveals a short stretch of the unknown sequence. Repeating this process by performing hybridizations with all possible octadeoxynucleotides should theoretically determine the sequence. In a completely different approach, rapid sequencing of DNA is done by unilaterally degrading one single, immobilized DNA fragment by an exonuclease in a moving flow stream and detecting the cleaved nucleotides by their specific fluorescent tag via laser excitation (Jett et al, J. Biomolecular Structure & Dynamics 2, 301-309, (1989); United States Department of Energy, PCT Application No. WO 89/03432). In another system proposed by Hyman (Anal. Biochem. 174, 423-436 (1988)), the pyrophosphate generated when the correct nucleotide is attached to the growing chain on a primer-template system is used to determine the DNA sequence. The enzymes used and the DNA are held in place by solid phases (DEAE-Sepharose and Sepharose) either by ionic interactions or by covalent attachment. In a continuous flow-through system, the amount of pyrophosphate is determined via bioluminescence (luciferase). A synthesis approach to DNA sequencing is also used by Tsien et al (PCT Application No. WO 91/06678). Here, the incoming dNTP's are protected at the 3'-end by various blocking groups such as acetyl or phosphate groups and are removed before the next elongation step, which makes this process very slow compared to standard sequencing methods. The template DNA is immobilized on a polymer support. To detect incorporation, a fluorescent or radioactive label is additionally incorporated into the modified dNTP's. The same patent application also describes an apparatus designed to automate the process.
Mass spectrometry, in general, provides a means of "weighing" individual molecules by ionizing the molecules in vacuo and making them "fly" by volatilization. Under the influence of combinations of electric and magnetic fields, the ions follow
trajectories depending on their individual mass (m) and charge (z). In the range of molecules with low molecular weight, mass spectrometry has long been part of the routine physical-organic repertoire for analysis and characterization of organic molecules by the determination of the mass of the parent molecular ion. In addition, by arranging collisions of this parent molecular ion with other particles (e.g., argon atoms), the molecular ion is fragmented forming secondary ions by the so-called collision induced dissociation (CID). The fragmentation pattern/pathway very often allows the derivation of detailed structural information. Many applications of mass spectrometric methods in the known in the art, particularly in biosciences, and can be found summarized in Methods in Enzymologv. Vol. 193 : "Mass Spectrometry" (J A. McCloskey, editor), 1990, Academic Press, New York.
Due to the apparent analytical advantages of mass spectrometry in providing high detection sensitivity, accuracy of mass measurements, detailed structural information by CLD in conjunction with an MS/MS configuration and speed, as well as on-line data transfer to a computer, there has been considerable interest in the use of mass spectrometry for the structural analysis of nucleic acids. Recent reviews summarizing this field include K. H. Schram, "Mass Spectrometry of Nucleic Acid Components, Biomedical Applications of Mass Spectrometry" 3_4, 203-287 (1990); and P F. Crain, "Mass Spectrometric Techniques in Nucleic Acid Research," Mass Spectrometry Reviews 9, 505-554 (1990). The biggest hurdle to applying mass spectrometry to nucleic acids is the difficulty of volatilizing these very polar biopolymers. Therefore, "sequencing" has been limited to low molecular weight synthetic oligonucleotides by determining the mass of the parent molecular ion and through this, confirming the already known sequence, or alternatively, confirming the known sequence through the generation of secondary ions (fragment ions) via CID in an MS/MS configuration utilizing, in particular, for the ionization and volatilization, the method of fast atomic bombardment (FAB mass spectrometry) or plasma desorption (PD mass spectrometry). As an example, the application of FAB to the analysis of protected dimeric blocks for chemical synthesis of oligodeoxynucleotides has been described (Kόster et al Biomedical Environmental Mass Spectrometry 14. 111-116 (1987)).
Two more recent ionization/desorption techniques are electrospray/ionspray (ES) and matrix-assisted laser desorption/ionization (MALDI). ES mass spectrometry has been introduced by Fenn et al. (J Phys. Chem 88, 4451-59 (1984); PCT Application No. WO 90/14148) and current applications are summarized in recent review articles (R.D. Smith et al , Anal. Chem £2, 882-89 ( 1990) and B. Ardrey, Electrospray Mass
Spectrometry, Spectroscopy Europe. 4, 10-18 (1992)). The molecular weights of the tetradecanucleotide d(CATGCCATGGCATG) (SEQ ID NO: l) (Covey et al "The Determination of Protein, Oligonucleotide and Peptide Molecular Weights by Ionspray Mass Spectrometry," Rapid Communications in Mass Spectrometry. 2, 249-256 (1988)), of the 21-mer d(AAATTGTGCACATCCTGCAGC) (SEQ ID NO:2) and without giving details of that of a tRNA with 76 nucleotides (Methods in Enzymology. 193, "Mass Spectrometry" (McCloskey, editor), p. 425, 1990, Academic Press, New York) have been published. As a mass analyzer, a quadrupole is most frequently used. The determination of molecular weights in femtomole amounts of sample is very accurate due to the presence of multiple ion peaks which all could be used for the mass calculation.
MALDI mass spectrometry, in contrast, can be particularly attractive when a time-of-flight (TOF) configuration is used as a mass analyzer. The MALDI-TOF mass spectrometry has been introduced by Hillenkamp et al ("Matrix Assisted UV-Laser Desorption/ionization: A New Approach to Mass Spectrometry of Large Biomolecules," Biological Mass Spectrometry (Burlingame and McCloskey, editors), Elsevier Science Publishers, Amsterdam, pp. 49-60, 1990.) Since, in most cases, no multiple molecular ion peaks are produced with this technique, the mass spectra, in principle, look simpler compared to ES mass spectrometry. Although DNA molecules up to a molecular weight of 410,000 daltons could be desorbed and volatilized (Williams et al, "Volatilization of High Molecular Weight DNA by Pulsed Laser Ablation of Frozen Aqueous Solutions," Science. 246, 1585-87 (1989)), this technique has so far only been used to determine the molecular weights of relatively small oligonucleotides of known sequence, e.g., oligothymidylic acids up to 18 nucleotides (Huth-Fehre et al. , "Matrix- Assisted Laser Desorption Mass Spectrometry of Oligodeoxythymidylic Acids," Rapid Communications in Mass Spectrometry. 6, 209-13 (1992)) and a double- stranded DNA of 28 base pairs (Williams et al, "Time-of-Flight Mass Spectrometry of Nucleic
Acids by Laser Ablation and Ionization from a Frozen Aqueous Matrix," Rapid Communications in Mass Spectrometry. 4, 348-351 (1990)). In one publication (Huth- Fehre et al, 1992 , supra), it was shown that a mixture of all the oligothymidylic acids from n=12 to n=18 nucleotides could be resolved. In U.S. Patent No. 5,064,754, RNA transcripts extended by DNA both of which are complementary to the DNA to be sequenced are prepared by incorporating
NTP's, dNTP's and, as terminating nucleotides, ddNTP's which are substituted at the 5'- position of the sugar moiety with one or a combination of the isotopes
The polynucleotides obtained are degraded to 3'- nucleotides, cleaved at the N-glycosidic linkage and the isotopically labeled 5'- functionality removed by periodate oxidation and the resulting formaldehyde species determined by mass spectrometry. A specific combination of isotopes serves to discriminate base-specifically between internal nucleotides originating from the incorporation of NTP's and dNTP's and terminal nucleotides caused by linking ddNTP's to the end of the polynucleotide chain. A series of RNA/DNA fragments is produced, and in one embodiment, separated by electrophoresis, and, with the aid of the so-called matrix method of analysis, the sequence is deduced.
In Japanese Patent No. 59-131909, an instrument is described which detects nucleic acid fragments separated either by electrophoresis, liquid chromatography or high speed gel filtration. Mass spectrometric detection is achieved by incorporating into the nucleic acids atoms which normally do not occur in DNA such as S, Br, I or Ag, Au, Pt, Os, Hg. The method, however, is not applied to sequencing of DNA using the Sanger method. In particular, it does not propose a base-specific correlation of such elements to an individual ddNTP. PCT Application No. WO 89/12694 (Brennan et al. , Proc. SPIE-Int Soc
Opt. Eng. 1206, (New Techno!. Cvtom. Mol. Biol ). pp. 60-77 (1990); and Brennan, U.S. Patent No. 5,003,059) employs the Sanger methodology for DNA sequencing by using a combination of either the four stable isotopes
to specifically label the chain-terminating ddNTP's. The sulfur isotopes can be located either in the base or at the alpha-position of the triphosphate moiety whereas the halogen isotopes are located either at the base or at the 3'-position of the sugar ring
The sequencing reaction mixtures are separated by an electrophoretic technique such as
CZE, transferred to a combustion unit in which the sulfur isotopes of the incorporated ddNTP's are transformed at about 900 C in an oxygen atmosphere. The SO2 generated with masses of 64, 65, 66 or 68 is determined on-line by mass spectrometry using, e.g., as mass analyzer, a quadrupole with a single ion-multiplier to detect the ion current.
A similar approach is proposed in U.S. Patent No. 5,002,868 (Jacobson et al, Proc. SPIE-Int. Soc. Pot. Eng. 1435, (Opt. Methods Ultrasensitive Detect. Anal.
Tech. Appl.). 26-35 (1991)) using Sanger sequencing with four ddNTP's specifically substituted at the alpha-position of the triphosphate moiety with one of the four stable sulfur isotopes as described above and subsequent separation of the four sets of nested sequences by tube gel electrophoresis. The only difference is the use of resonance ionization spectroscopy (RIS) in conjunction with a magnetic sector mass analyzer as disclosed in U.S. Patent No. 4,442,354 to detect the sulfur isotopes corresponding to the specific nucleotide terminators, and by this, allowing the assignment of the DNA sequence.
EPO Patent Applications No. 0360676 Al and 0360677 Al also describe
Sanger sequencing using stable isotope substitutions in the ddNTP's such as D, Λ . ,
or functional groups such as CF3 or Si(CH3)3 at the base, the sugar or the alpha position of the triphosphate moiety according to chemical functionality. The Sanger sequencing reaction mixtures are separated by tube gel electrophoresis. The effluent is converted into an aerosol by the electrospray/thermospray nebulizer method and then atomized and ionized by a hot plasma (7000 to 8000 K) and analyzed by a simple mass analyzer. An instrument is proposed which enables one to automate the analysis of the Sanger sequencing reaction mixture consisting of tube electrophoresis, a nebulizer and a mass analyzer.
The application of mass spectrometry to perform DNA sequencing by the hybridization/fragment method (see above) has been recently suggested (Bains, "DNA
Sequencing by Mass Spectrometry: Outline of a Potential Future Application," Chimicaoggi 9. 13-16 (1991)).
Summary of the Invention
The invention describes a new method to sequence DNA. The improvements over the existing DNA sequencing technologies include high speed, high throughput, no required electrophoresis (and, thus, no gel reading artifacts due to the complete absence of an electrophoretic step), and no costly reagents involving various substitutions with stable isotopes. The invention utilizes the Sanger sequencing strategy and assembles the sequence information by analysis of the nested fragments obtained by base-specific chain termination via their different molecular masses using mass spectrometry, for example, MALDI or ES mass spectrometry. A further increase in throughput can be obtained by introducing mass modifications in the oligonucleotide primer, the chain-terminating nucleoside triphosphates and/or the chain-elongating nucleoside triphosphates, as well as using integrated tag sequences which allow multiplexing by hybridization of tag specific probes with mass differentiated molecular weights.
Brief Description of the FIGURES
FIGURE 1 is a representation of a process to generate the samples to be analyzed by mass spectrometry. This process entails insertion of a DNA fragment of unknown sequence into a cloning vector such as derivatives of M13, pUC or phagemids; transforming the double-stranded form into the single-stranded form; performing the four Sanger sequencing reactions; linking the base-specifically terminated nested fragment family temporarily to a solid support; removing by a washing step all by-products; conditioning the nested DNA or RNA fragments by, for example, cation-ion exchange or modification reagent and presenting the immobilized nested fragments either directly to mass spectrometric analysis or cleaving the purified fragment family off the support and evaporating the cleavage reagent.
FIGURE 2A shows the Sanger sequencing products using ddTTP as terminating deoxynucleoside triphosphate of a hypothetical DNA fragment of 50 nucleotides (SEQ LD NO:3) in length with approximately equally balanced base composition. The molecular masses of the various chain terminated fragments are given.
FIGURE 2B shows an idealized mass spectrum of such a DNA fragment mixture.
FIGURES 3A and 3B show, in analogy to FIGURES 2A and 2B, data for the same model sequence (SEQ ID NO:3) with ddATP as chain terminator. FIGURES 4A and 4B show data, analogous to FIGURES 2A and 2B when ddGTP is used as a chain terminator for the same model sequence (SEQ Lϋ NO:3).
FIGURES 5A and 5B illustrate the results obtained where chain termination is performed with ddCTP as a chain terminator, in a similar way as shown in FIGURES 2 A and 2B for the same model sequence (SEQ LD NO:3). FIGURE 6 summarizes the results of FIGURES 2A to 5B, showing the correlation of molecular weights of the nested four fragment families to the DNA sequence (SEQ ID NO:3).
FIGURE 7 illustrates the general structure of mass-modified sequencing nucleic acid primers or tag sequencing probes for either Sanger DNA or Sanger RNA sequencing.
FIGURE 8 shows the general structure for the mass-modified triphosphates for either Sanger DNA or Sanger RNA sequencing. General formulas of the chain-elongating and the chain-terminating nucleoside triphosphates are demonstrated. FIGURE 9 outlines various linking chemistries (X) with either polyethylene glycol or terminally monoalkylated polyethylene glycol (R) as an example.
FIGURE 10 illustrates similar linking chemistries as shown in FIGURE 8 and depicts various mass modifying moieties (R).
FIGURE 1 1 outlines how multiplex mass spectrometric sequencing can work using the mass-modified nucleic acid primer (UP).
FIGURE 12 shows the process of multiplex mass spectrometric sequencing employing mass-modified chain-elongating and/or terminating nucleoside triphosphates.
FIGURE 13 shows multiplex mass spectrometric sequencing by involving the hybridization of mass-modified tag sequence specific probes.
FIGURE 14 shows a MALDI-TOF spectrum of a mixture of oligothymidylic acids, d(pT) 12-I8
FIGURE 15 shows a superposition of MALDI-TOF spectra of the 50-mer d(TAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGCT) (SEQ LD NO:3) (500 fmol) and dT(pdT)99 (500 fmol).
FIGURE 16 shows the MALDI-TOF spectra of all 13 DNA sequences representing the nested dT-terminated fragments of the Sanger DNA sequencing simulation of Figure 2, 500 fmol each.
FIGURE 17 shows the superposition of the spectra of FIGURE 16. The two panels show two different scales and the spectra analyzed at that scale
FIGURE 18 shows the superimposed MALDI-TOF spectra from MALDI- MS analysis of mass-modified oligonucleotides as described in Example 21.
FIGURE 19 illustrates various linking chemistries between the solid support (P) and the nucleic acid primer (NA) through a strong electrostatic interaction. FIGURE 20 illustrates various linking chemistries between the solid support (P) and the nucleic acid primer (NA) through a charge transfer complex of a charge transfer acceptor (A) and a charge transfer donor (D).
FIGURE 21 illustrates various linking chemistries between the solid support (P) and the nucleic acid primer (NA) through a stable organic radical FIGURE 22 illustrates a possible linking chemistry between the solid support (P) and the nucleic acid primer (NA) through Watson-Crick base pairing
FIGURE 23 illustrates linking the solid support (P) and the nucleic acid primer (NA) through a photo lytically cleavable bond.
FIGURE 24 shows the portion of the sequence of pRFcl DNA, which was used as template for PCR amplification of unmodified and 7-deazapurine containing 99-mer and 200-mer nucleic acids as well as the sequences of the 19-mer primers and the two 18-mer reverse primers.
FIGURE 25 shows the portion of the nucleotide sequence of M13mpl8 RFI DNA which was used for PCR amplification of unmodified and 7-deazapurine containing 103-mer nucleic acids. Also shown are nucleotide sequences of the 17-mer primers used in the PCR.
FIGURE 26 shows the result of a polyacrylamide gel electrophoresis of PCR products purified and concentrated for MALDI-TOF MS analysis. M: chain length marker, lane 1 : 7-deazapurine containing 99-mer PCR product, lane 2: unmodified 99- mer, lane 3: 7-deazapurine containing 103-mer and lane 4: unmodified 103-mer PCR product.
FIGURE 27: an autoradiogram of polyacrylamide gel electrophoresis of
32 PCR reactions carried out with 5'-[ P]-labeled primers 1 and 4. Lanes 1 and 2: unmodified and 7 -deazapurine modified 103-mer PCR product (53321 and 23520 counts), lanes 3 and 4: unmodified and 7-deazapurine modified 200-mer (71123 and 39582 counts) and lanes 5 and 6: unmodified and 7-deazapurine modified 99-mer (173216 and 94400 counts).
FIGURE 28: a) MALDI-TOF mass spectrum of the unmodified 103-mer PCR products (sum of twelve single shot spectra). The mean value of the masses calculated for the two single strands (31768 u and 31759 u) is 31763 u. Mass resolution: 18. b) MALDI-TOF mass spectrum of 7-deazapurine containing 103-mer PCR product
(sum of three single shot spectra). The mean value of the masses calculated for the two single strands (31727 u and 31719 u) is 31723 u. Mass resolution: 67.
FIGURE 29: a) MALDI-TOF mass spectrum of the unmodified 99-mer PCR product (sum of twenty single shot spectra). Values of the masses calculated for the two single strands: 30261 u and 30794 u. b) MALDI-TOF mass spectrum of the 7- deazapurine containing 99-mer PCR product (sum of twelve single shot spectra). Values of the masses calculated for the two single strands: 30224 u and 30750 u.
FIGURE 30: a) MALDI-TOF mass spectrum of the unmodified 200-mer PCR product (sum of 30 single shot spectra). The mean value of the masses calculated for the two single strands (61873 u and 61595 u) is 61734 u. Mass resolution: 28. b)
MALDI-TOF mass spectrum of 7-deazapurine containing 200-mer PCR product (sum of 30 single shot spectra). The mean value of the masses calculated for the two single strands (61772 u and 61514 u) is 61643 u. Mass resolution: 39.
FIGURE 31 : a) MALDI-TOF mass spectrum of 7-deazapurine containing 100-mer PCR product with ribomodified primers. The mean value of the masses calculated for the two single strands (30529 u and 31095 u) is 30812 u. b) MALDI-TOF
mass spectrum of the PCR-product after hydrolytic primer-cleavage. The mean value of the masses calculated for the two single strands (25104 u and 25229 u) is 25167 u. The mean value of the cleaved primers (5437 u and 5918 u) is 5677 u.
FIGURE 32 A-D shows the MALDI-TOF mass spectrum of the four sequencing ladders obtained from a 39 -mer template (SEQ. LD. No. 13), which was immobilized to streptavidin beads via a 3' biotinylation. A 14-mer primer (SEQ. ID. NO. 14) was used in the sequencing.
FIGURE 33 shows a MALDI-TOF mass spectrum of a solid state sequencing of a 78-mer template (SEQ. JJD. No. 15), which was immobilized to streptavidin beads via a 3' biotinylation. A 18-mer primer (SEQ LD No. 16) and ddGTP were used in the sequencing.
FIGURE 34 shows a scheme in which duplex DNA probes with single- stranded overhang capture specific DNA templates and also serve as primers for solid state sequencing. FIGURE 35 A-D shows MALDI-TOF mass spectra obtained from a 5' fluorescent labeled 23-mer (SEQ. LD. No. 19) annealed to an 3' biotinylated 18-mer (SEQ. LD. No. 20), leaving a 5-base overhang, which captured a 15-mer template (SEQ. LD. No. 21).
FIGURE 36 shows a stacking flurogram of the same products obtained from the reaction described in FIGURE 35, but run on a conventional DNA sequencer.
Detailed Description of the Invention
This invention describes an improved method of sequencing DNA. In particular, this invention employs mass spectrometry to analyze the Sanger sequencing reaction mixtures.
In Sanger sequencing, four families of chain-terminated fragments are obtained. The mass difference per nucleotide addition is 289.19 for dpC, 313.21 for dpA, 329.21 for dpG and 304.2 for dpT, respectively.
In one embodiment, through the separate determination of the molecular weights of the four base-specifically terminated fragment families, the DNA sequence can be assigned via superposition (e.g., interpolation) of the molecular weight peaks of the
four individual experiments. In another embodiment, the molecular weights of the four specifically terminated fragment families can be determined simultaneously by MS, either by mixing the products of all four reactions run in at least two separate reaction vessels (i.e., all run separately, or two together, or three together) or by running one reaction having all four chain-terminating nucleotides (e.g., a reaction mixture comprising dTTP, ddTTP, dATP, ddATP, dCTP, ddCTP, dGTP, ddGTP) in one reaction vessel. By simultaneously analyzing all four base-specifically terminated reaction products, the molecular weight values have been, in effect, interpolated. Comparison of the mass difference measured between fragments with the known masses of each chain-terminating nucleotide allows the assignment of sequence to be carried out. In some instances, it may be desirable to mass modify, as discussed below, the chain-terminating nucleotides so as to expand the difference in molecular weight between each nucleotide. It will be apparent to those skilled in the art when mass-modification of the chain-terminating nucleotides is desirable and can depend, for instance, on the resolving ability of the particular spectrometer employed. By way of example, it may be desirable to produce four chain-
1 2 3 1 terminating nucleotides, ddTTP, ddCTP , ddATP and ddGTP where ddCTP ,
2 3 ddATP and ddGTP have each been mass-modified so as to have molecular weights resolvable from one another by the particular spectrometer being used.
The terms chain-elongating nucleotides and chain-terminating nucleotides are well known in the art. For DNA, chain-elongating nucleotides include 2'-deoxyribonucleotides and chain-terminating nucleotides include 2', 3'-dideoxyribonucleotides. For RNA, chain-elongating nucleotides include ribonucelotides and chain-terminating nucleotides include 3'-deoxyribonucleotides. The term nucleotide is also well known in the art. For the purposes of this invention, nucleotides include nucleoside mono-, di-, and triphosphates. Nucleotides also include modified nucleotides such as phosphorothioate nucleotides.
Since mass spectrometry is a serial method, in contrast to currently used slab gel electrophoresis which allows several samples to be processed in parallel, in another embodiment of this invention, a further improvement can be achieved by multiplex mass spectrometric DNA sequencing to allow simultaneous sequencing of more than one DNA or RNA fragment. As described in more detail below, the range of about
300 mass units between one nucleotide addition can be utilized by employing either mass- modified nucleic acid sequencing primers or chain-elongating and/or terminating nucleoside triphosphates so as to shift the molecular weight of the base-specifically terminated fragments of a particular DNA or RNA species being sequenced in a predetermined manner. For the first time, several sequencing reactions can be mass spectrometrically analyzed in parallel. In yet another embodiment of this invention, multiplex mass spectrometric DNA sequencing can be performed by mass modifying the fragment families through specific oligonucleotides (tag probes) which hybridize to specific tag sequences within each of the fragment families. In another embodiment, the tag probe can be covalently attached to the individual and specific tag sequence prior to mass spectrometry.
Preferred mass spectrometer formats for use in the invention are matrix assisted laser desorption ionization (MALDI), electrospray (ES), ion cyclotron resonance (ICR) and Fourier Transform. For ES, the samples, dissolved in water or in a volatile buffer, are injected either continuously or discontinuously into an atmospheric pressure ionization interface (API) and then mass analyzed by a quadrupole The generation of multiple ion peaks which can be obtained using ES mass spectrometry can increase the accuracy of the mass determination. Even more detailed information on the specific structure can be obtained using an MS/MS quadrupole configuration In MALDI mass spectrometry, various mass analyzers can be used, e g , magnetic sector/magnetic deflection instruments in single or triple quadrupole mode (MS/MS), Fourier transform and time-of-flight (TOF) configurations as is known in the art of mass spectrometry. For the desorption/ionization process, numerous matrix/laser combinations can be used. Ion-trap and reflectron configurations can also be employed In one embodiment of the invention, the molecular weight values of at least two base-specifically terminated fragments are determined concurrently using mass spectrometry. The molecular weight values of preferably at least five and more preferably at least ten base-specifically terminated fragments are determined by mass spectrometry. Also included in the invention are determinations of the molecular weight values of at least 20 base-specifically terminated fragments and at least 30 base- specifically terminated fragments. Further, the nested base-specifically terminated
fragments in a specific set can be purified of all reactants and by-products but are not separated from one another. The entire set of nested base-specifically terminated fragments is analyzed concurrently and the molecular weight values are determined. At least two base-specifically terminated fragments are analyzed concurrently by mass spectrometry when the fragments are contained in the same sample.
In general, the overall mass spectrometric DNA sequencing process will start with a library of small genomic fragments obtained after first randomly or specifically cutting the genomic DNA into large pieces which then, in several subcloning steps, are reduced in size and inserted into vectors like derivatives of Ml 3 or pUC (e.g., M 13 mp 18 or M 13 mp 19) (see FIGURE 1 ). In a different approach, the fragments inserted in vectors, such as Ml 3, are obtained via subcloning starting with a cDNA library. In yet another approach, the DNA fragments to be sequenced are generated by the polymerase chain reaction (e.g., Higuchi et al, "A General Method of in vitro Preparation and Mutagenesis of DNA Fragments: Study of Protein and DNA Interactions," Nucleic Acids Res.τ 16, 7351-67 (1988)). As is known in the art, Sanger sequencing can start from one nucleic acid primer (UP) binding to the plus-strand or from another nucleic acid primer binding to the opposite minus- strand. Thus, either the complementary sequence of both strands of a given unknown DNA sequence can be obtained (providing for reduction of ambiguity in the sequence determination) or the length of the sequence information obtainable from one clone can be extended by generating sequence information from both ends of the unknown vector-inserted DNA fragment.
The nucleic acid primer carries, preferentially at the 5'-end, a linking functionality, L, which can include a spacer of sufficient length and which can interact with a suitable functionality, L', on a solid support to form a reversible linkage such as a photocleavable bond. Since each of the four Sanger sequencing families starts with a nucleic acid primer (L-UP; FIGURE 1) this fragment family can be bound to the solid support by reacting with functional groups, L', on the surface of a solid support and then intensively washed to remove all buffer salts, triphosphates, enzymes, reaction by- products, etc. Furthermore, for mass spectrometric analysis, it can be of importance at this stage to exchange the cation at the phosphate backbone of the DNA fragments in
order to eliminate peak broadening due to a heterogeneity in the cations bound per nucleotide unit. Since the L-L' linkage is only of a temporary nature with the purpose to capture the nested Sanger DNA or RNA fragments to properly condition them for mass spectrometric analysis, there are different chemistries which can serve this purpose. In addition to the examples given in which the nested fragments are coupled covalently to the solid support, washed, and cleaved off the support for mass spectrometric analysis, the temporary linkage can be such that it is cleaved under the conditions of mass spectrometry, i.e., a photocleavable bond such as a charge transfer complex or a stable organic radical. Furthermore, the linkage can be formed with L' being a quaternary ammonium group (some examples are given in FIGURE 19). In this case, preferably, the surface of the solid support carries negative charges which repel the negatively charged nucleic acid backbone and thus facilitates desorption. Desorption will take place either by the heat created by the laser pulse and/or, depending on L,' by specific absorption of laser energy which is in resonance with the L' chromophore (see, e.g., examples given in FIGURE 19). The functionalities, L and L,' can also form a charge transfer complex and thereby form the temporary L-L' linkage. Various examples for appropriate functionalities with either acceptor or donator properties are depicted without limitation in FIGURE 20. Since in many cases the "charge- transfer band" can be determined by UV/vis spectrometry (see e.g. Organic Charge Transfer Complexes by R. Foster, Academic Press, 1969), the laser energy can be tuned to the corresponding energy of the charge-transfer wavelength and, thus, a specific desorption off the solid support can be initiated. Those skilled in the art will recognize that several combinations can serve this purpose and that the donor functionality can be either on the solid support or coupled to the nested Sanger DNA/RNA fragments or vice versa. In yet another approach, the temporary linkage L-L' can be generated by homolytically forming relatively stable radicals as exemplified in FIGURE 21. In example 4 of FIGURE 21, a combination of the approaches using charge-transfer complexes and stable organic radicals is shown. Here, the nested Sanger DNA/RNA fragments are captured via the formation of a charge transfer complex. Under the influence of the laser pulse, desorption (as discussed above) as well as ionization will take place at the radical position. In the other examples of FIGURE 21 under the influence of the laser pulse, the
L-L' linkage will be cleaved and the nested Sanger DNA/RNA fragments desorbed and subsequently ionized at the radical position formed. Those skilled in the art will recognize that other organic radicals can be selected and that, in relation to the dissociation energies needed to homolytically cleave the bond between them, a corresponding laser wavelength can be selected (see e.g. Reactive Molecules by C. Wentrup, John Wiley & Sons, 1984). In yet another approach, the nested Sanger DNA/RNA fragments are captured via Watson-Crick base pairing to a solid support- bound oligonucleotide complementary to either the sequence of the nucleic acid primer or the tag oligonucleotide sequence (see FIGURE 22). The duplex formed will be cleaved under the influence of the laser pulse and desorption can be initiated. The solid support- bound base sequence can be presented through natural oligoribo- or oligodeoxyribonucleotide as well as analogs (e.g. thio-modified phosphodiester or phosphotriester backbone) or employing oligonucleotide mimetics such as PNA analogs (see e.g. Nielsen et al, Science. 254, 1497 (1991)) which render the base sequence less susceptible to enzymatic degradation and hence increases overall stability of the solid support-bound capture base sequence. With appropriate bonds, L-L', a cleavage can be obtained directly with a laser tuned to the energy necessary for bond cleavage. Thus, the immobilized nested Sanger fragments can be directly ablated during mass spectrometric analysis. Prior to mass spectrometric analysis, it may be useful to "condition" nucleic acid molecules, for example to decrease the laser energy required for volatization, to minimize fragmentation or to otherwise increase the sensitivity of mass spectrometeric detection. For example, nucleic acids can be "conditioned" by adding positive or negative charges, i.e. charge tags (CTs). CTs increase the mass spectrometer detection sensitivity by increasing the degree of ionization during the mass spectrometric
(e.g.MALDI) process. A CT can be linked either to the external 3' or 5' position or internally e.g. at the 2' position or at the base, e.g. at C-5 in uracil, C-5 methylgroup of thymine, C-5 at cytosine, at C7 or C* of guanine, adenine and hypoxanthine or at the phosphate ester moiety. Charge tags, CTs, can function molecules with permanent (i.e. pH-independent) ionization, such as:
or molecules which generate a positive charge upon MALDI and which are stabilized by delocalization of the positive charge by mesomeric effects in unsaturated and/or aromatic systems such as:
wherein, R, R1, R2 = H,OAl (wherein Al= e.g. lower alkyl, methyl, ethyl, propyl), NO2, CN, CO2H, CO2 active ester, or halogen; and
X = -O-, -NH-, -S-, C=O, OCO either in the para or meta position.
For example, the positive charge of a trityl cation is produced during MALDI by the removal of a moiety such as: -OR, where R = a lower alkyl, or an anion such as ClO4 ", SbF6\ BF4' and the like.
In an alternative scheme, the trityl group is used to anchor the oligonucleotide to a solid support via the tertiary carbon and this bond is cleaved during mass spectrometry (e.g. MALDI), leaving a positive charge on the desorbing and high
One of skill in the art can readily appreciate several variations to the schemes described above. In addition to employing the charge tag array alone, one of skill in the art can employ a charge tag array in conjunction with another conditioning means. Particularly preferred means to be used in conjunction with the CT include treating the phosphodiester bond with trialkylsilyl halides or the phosphomonothiodiester bond with alkyliodides to render the polyanionic backbone neutral.
Another example of conditioning is modification of the phosphodiester backbone of the nucleic acid molecule (e.g. cation exchange), which can be useful for eliminating peak broadening due to a heterogeneity in the cations bound per nucleotide unit. In addition, a nucleic acid molecule can be contacted with an alkylating agent such as alkyliodide, iodoacetamide, β-iodoethanol, or 2,3-epoxy-l-propanol, the monothio phosphodiester bonds of a nucleic acid molecule can be transformed into a phosphotriester bond. Likewise, phosphodiester bonds may be transformed to uncharged derivatives employing trialkylsilyl chlorides. Further conditioning involves incorporating nucleotides which reduce sensitivity for depurination (fragmentation during MS) such as N7- or N9-deazapurine nucleotides, or RNA building blocks or using oligonucleotide triesters or incorporating phosphorothioate functions which are alkylated or employing oligonucleotide mimetics such as PNA
Modification of the phosphodiester backbone can be accomplished by, for example, using alpha-thio modified nucleotides for chain elongation and termination. With alkylating agents such as akyliodides, iodoacetamide, β-iodoethanol, 2,3-epoxy-l- propanol (see FIGURE 10), the monothio phosphodiester bonds of the nested Sanger fragments are transformed into phosphotriester bonds. Multiplexing by mass modification in this case is obtained by mass-modifying the nucleic acid primer (UP) or the nucleoside triphosphates at the sugar or the base moiety. To those skilled in the art, other modifications of the nested Sanger fragments can be envisioned. In one embodiment of the invention, the linking chemistry allows one to cleave off the so- purified nested DNA enzymatically, chemically or physically. By way of example, the L- L' chemistry can be of a type of disulfide bond (chemically cleavable, for example, by mercaptoethanol or dithioerythrol), a biotin/streptavidin system, a heterobifunctional derivative of a trityl ether group (Kόster et al, "A Versatile Acid-Labile Linker for
Modification of Synthetic Biomolecules." Tetrahedron Letters 31 7095 (1990)) which can be cleaved under mildly acidic conditions, a levulinyl group cleavable under almost neutral conditions with a hydrazinium/acetate buffer, an arginine-arginine or lysine-lysine bond cleavable by an endopeptidase enzyme like trypsin or a pyrophosphate bond cleavable by a pyrophosphatase, a photocleavable bond which can be, for example, physically cleaved and the like (see, e.g., FIGURE 23). Optionally, another cation
exchange can be performed prior to mass spectrometric analysis. In the instance that an enzyme-cleavable bond is utilized to immobilize the nested fragments, the enzyme used to cleave the bond can serve as an internal mass standard during MS analysis.
The purification process and/or ion exchange process can be carried out by a number of other methods instead of, or in conjunction with, immobilization on a solid support. For example, the base-specifically terminated products can be separated from the reactants by dialysis, filtration (including ultrafiltration), and chromatography. Likewise, these techniques can be used to exchange the cation of the phosphate backbone with a counter-ion which reduces peak broadening. The base-specifically terminated fragment families can be generated by standard Sanger sequencing using the Large Klenow fragment of E. coli DNA polymerase I, by Sequenase, Taq DNA polymerase and other DNA polymerases suitable for this purpose, thus generating nested DNA fragments for the mass spectrometric analysis. It is, however, part of this invention that base-specifically terminated RNA transcripts of the DNA fragments to be sequenced can also be utilized for mass spectrometric sequence determination. In this case, various RNA polymerases such as the SP6 or the T7 RNA polymerase can be used on appropriate vectors containing, for example, the SP6 or the T7 promoters (e.g. Axelrod et al, "Transcription from Bacteriophage T7 and SP6 RNA Polymerase Promoters in the Presence of 3'- Deoxyribonucleoside 5'-triphosphate Chain Terminators," Biochemistry 24, 5716-23 (1985)). In this case, the unknown DNA sequence fragments are inserted downstream from such promoters. Transcription can also be initiated by a nucleic acid primer (Pitulle et al, "Initiator Oligonucleotides for the Combination of Chemical and Enzymatic RNA Synthesis," Gene 1 12. 101-105 (1992)) which carries, as one embodiment of this invention, appropriate linking functionalities, L, which allow the immobilization of the nested RNA fragments, as outlined above, prior to mass spectrometric analysis for purification and/or appropriate modification and/or conditioning.
For this immobilization process of the DNA/RNA sequencing products for mass spectrometric analysis, various solid supports can be used, e.g., beads (silica gel, controlled pore glass, magnetic beads, Sephadex/Sepharose beads, cellulose beads, etc.), capillaries, glass fiber filters, glass surfaces, metal surfaces or plastic material. Examples
of useful plastic materials include membranes in filter or microtiter plate formats, the latter allowing the automation of the purification process by employing microtiter plates which, as one embodiment of the invention, carry a permeable membrane in the bottom of the well functionalized with L'. Membranes can be based on polyethylene, polypropylene, polyamide, polyvinylidenedifluoride and the like. Examples of suitable metal surfaces include steel, gold, silver, aluminum, and copper. After purification, cation exchange, and/or modification of the phosphodiester backbone of the L-L' bound nested Sanger fragments, they can be cleaved off the solid support chemically, enzymatically or physically. Also, the L-L' bound fragments can be cleaved from the support when they are subjected to mass spectrometric analysis by using appropriately chosen L-L' linkages and corresponding laser energies/intensities as described above and in FIGURES 19-23
The highly purified, four base-specifically terminated DNA or RNA fragment families are then analyzed with regard to their fragment lengths via determination of their respective molecular weights by MALDI or ES mass spectrometry. For ES, the samples, dissolved in water or in a volatile buffer, are injected either continuously or discontinuously into an atmospheric pressure ionization interface (API) and then mass analyzed by a quadrupole. With the aid of a computer program, the molecular weight peaks are searched for the known molecular weight of the nucleic acid primer (UP) and determined which of the four chain-terminating nucleotides has been added to the UP. This represents the first nucleotide of the unknown sequence. Then,
the second, the third, the n extension product can be identified in a similar manner and, by this, the nucleotide sequence is assigned. The generation of multiple ion peaks which can be obtained using ES mass spectrometry can increase the accuracy of the mass determination. In MALDI mass spectrometry, various mass analyzers can be used, e.g., magnetic sector/magnetic deflection instruments in single or triple quadrupole mode (MS/MS), Fourier transform and time-of-flight (TOF) configurations as is known in the art of mass spectrometry. FIGURES 2A through 6 are given as an example of the data obtainable when sequencing a hypothetical DNA fragment of 50 nucleotides in length (SEQ ID NO:3) and having a molecular weight of 15,344.02 daltons. The molecular weights calculated for the ddT (FIGURES 2A and 2B), ddA (FIGURES 3A and 3B),
ddG (FIGURES 4A and 4B) and ddC (FIGURES 5A and 5B) terminated products are given (corresponding to fragments of SEQ LD NO.3) and the idealized four MALDI-TOF mass spectra shown. All four spectra are superimposed, and from this, the DNA sequence can be generated. This is shown in the summarizing FIGURE 6, demonstrating how the molecular weights are correlated with the DNA sequence. MALDI-TOF spectra have been generated for the ddT terminated products (FIGURE 16) corresponding to those shown in FIGURE 2 and these spectra have been superimposed (FIGURE 17). The correlation of calculated molecular weights of the ddT fragments and their experimentally-verified weights are shown in Table 1. Likewise, if all four chain- terminating reactions are combined and then analyzed by mass spectrometry, the molecular weight difference between two adjacent peaks can be used to determine the sequence. For the desorption/ionization process, numerous matrix/laser combinations can be used.
TABLE I
Correlation of calculated and experimentally verified molecular weights of the 13 DNA fragments of FIGURES 2 and 16. Fragment calculated mass experimental mass difference
(n-mer)
7-mer 2104.45 21 19.9 + 15.4 10-mer 301 1.04 3026.1 + 15.1 1 1-mer 3315.24 3330.1 + 14.9 19-mer 5771.82 5788.0 + 16.2 20-mer 6076.02 6093.8 + 17.8 24-mer 731 1.82 7374.9 +63.1 26-mer 7945.22 7960.9 + 15.7 33-mer 10112.63 10125.3 +12.7 37-mer 11348.43 11361.4 + 13.0 38-mer 11652.62 1 1670.2 +17.6 42-mer 12872.42 12888.3 + 15.9 46-mer 14108.22 14125.0 + 16.8 50-mer 15344.02 15362.6 + 18.6
In order to increase throughput to a level necessary for high volume genomic and cDNA sequencing projects, a further embodiment of the present invention is to utilize multiplex mass spectrometry to simultaneously determine more than one sequence. This can be achieved by several, albeit different, methodologies, the basic principle being the mass modification of the nucleic acid primer (UP), the chain- elongating and/or terminating nucleoside triphosphates, or by using mass-differentiated tag probes hybridizable to specific tag sequences. The term "nucleic acid primer" as used herein encompasses primers for both DNA and RNA Sanger sequencing.
By way of example, FIGURE 7 presents a general formula of the nucleic acid primer (UP) and the tag probes (TP). The mass modifying moiety can be attached, for instance, to either the 5'-end of the oligonucleotide (M ), to the nucleobase (or bases)
2 7 3
(M , M ), to the phosphate backbone (M ), and to the 2'-position of the nucleoside
4 6 5
(nucleosides) (M , M ) or/and to the terminal 3'-position (M ). Primer length can vary between 1 and 50 nucleotides in length. For the priming of DNA Sanger sequencing, the primer is preferentially in the range of about 15 to 30 nucleotides in length. For artificially priming the transcription in a RNA polymerase-mediated Sanger sequencing reaction, the length of the primer is preferentially in the range of about 2 to 6 nucleotides. If a tag probe (TP) is to hybridize to the integrated tag sequence of a family chain- terminated fragments, its preferential length is about 20 nucleotides. The table in FIGURE 7 depicts some examples of mass-modified primer/tag probe configurations for DNA, as well as RNA, Sanger sequencing. This list is, however, not meant to be limiting, since numerous other combinations of mass-modifying functions and positions within the oligonucleotide molecule are possible and are deemed part of the invention. The mass-modifying functionality can be, for example, a halogen, an azido, or of the type, XR, wherein X is a linking group and R is a mass-modifying functionality.
The mass-modifying functionality can thus be used to introduce defined mass increments into the oligonucleotide molecule.
In another embodiment, the nucleotides used for chain-elongation and/or termination are mass-modified. Examples of such modified nucleotides are shown in FIGURE 8. Here the mass-modifying moiety, M, can be attached either to the
2 7 7 nucleobase, M (in case of the c -deazanucleosides also to C-7, M ), to the triphosphate
group at the alpha phosphate, M3 , or to the 2'-position of the sugar ring of the nucleoside triphosphate, M4 and M6 . Furthermore, the mass-modifying functionality can be added so as to affect chain termination, such as by attaching it to the 3 '-position of the sugar ring in the nucleoside triphosphate, M5 . The list in FIGURE 8 represents examples of possible configurations for generating chain-terminating nucleoside triphosphates for
RNA or DNA Sanger sequencing. For those skilled in the art, however, it is clear that many other combinations can serve the purpose of the invention equally well. In the same way, those skilled in the art will recognize that chain-elongating nucleoside triphosphates can also be mass-modified in a similar fashion with numerous variations ar.α combinations in functionality and attachment positions.
Without limiting the scope of the invention, FIGURE 9 gives a more detailed description of particular examples of how the mass-modification, M, can be introduced for X in XR as well as using oligo-/polyethylene glycol derivatives for R. The mass-modifying increment in this case is 44, i.e. five different mass-modified species can be generated by just changing m from 0 to 4 thus adding mass units of 45 (m=0), 89 (m=l), 133 (m=2), 177 (m=3) and 221 (m=4) to the nucleic acid primer (UP), the tag probe (TP) or the nucleoside triphosphates respectively. The oligo/polyethylene glycols can also be monoalkylated by a lower alkyl such as methyl, ethyl, propyl, isopropyl, t- butyl and the like. A selection of linking functionalities, X, are also illustrated. Other chemistries can be used in the mass-modified compounds, as for example, those described recently in Oligonucleotides and Analogues. A Practical Approach. F. Eckstein, editor, LRL Press, Oxford, 1991.
In yet another embodiment, various mass-modifying functionalities, R, other than oligo/polyethylene glycols, can be selected and attached via appropriate linking chemistries, X. Without any limitation, some examples are given in FIGURE 10. A simple mass-modification can be achieved by substituting H for halogens like F, Cl, Br and/or I, or pseudohalogens such as SCN, NCS, or by using different alkyl, aryl or aralkyl moieties such as methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, phenyl, substituted phenyl, benzyl, or functional groups such as CH2F, CHF2, CF3, Si(CH3)3, Si(CH3)2(C2H5), Si(CH3)(C2H5)2, Si(C2H5)3 . Yet another mass-modification can be obtained by attaching homo- or heteropeptides through X to the UP, TP or nucleoside
triphosphates. One example useful in generating mass-modified species with a mass increment of 57 is the attachment of oligoglycines, e.g., mass-modifications of 74 (r=l, m=O), 131 (r=l, m=2), 188 (r=l, m=3), 245 (r=l, m=4) are achieved. Simple oligoamides also can be used, e.g., mass-modifications of 74 (r=l, m=0), 88 (r=2, nv=0), 102 (r=3, m=0), 116 (r=4, m=0), etc. are obtainable. For those skilled in the art, it will be obvious that there are numerous possibilities in addition to those given in FIGURE 10 and the above mentioned reference (Oligonucleotides and Analogues. F. Eckstein, 1991), for introducing, in a predetermined manner, many different mass-modifying functionalities to UP, TP and nucleoside triphosphates which are acceptable for DNA and RNA Sanger sequencing.
As used herein, the superscript 0-i designates i + 1 mass differentiated nucleotides, primers or tags. In some instances, the superscript 0 (e.g., NTP , UP ) can designate an unmodified species of a particular reactant, and the superscript i (e.g., NTP ,
NTP1, NTP2 , etc.) can designate the i-th mass-modified species of that reactant. If, for example, more than one species of nucleic acids (e.g., DNA clones) are to be concurrently sequenced by multiplex DNA sequencing, then i + 1 different mass-modified nucleic acid primers (UP0, UP1... UPi ) can be used to distinguish each set of base- specifically terminated fragments, wherein each species of mass-modified UP can be distinguished by mass spectrometry from the rest. As illustrative embodiments of this invention, three different basic processes for multiplex mass spectrometric DNA sequencing employing the described mass- modified reagents are described below:
A) Multiplexing by the use of mass-modified nucleic acid primers (UP) for Sanger DNA or RNA sequencing (see for example FIGURE 1 1); B) Multiplexing by the use of mass-modified nucleoside triphosphates as chain elongators and/or chain terminators for Sanger DNA or RNA sequencing (see for example FIGURE 12); and
C) Multiplexing by the use of tag probes which specifically hybridize to tag sequences which are integrated into part of the four Sanger DNA/RNA base-specifically terminated fragment families. Mass modification here can be achieved as described for FIGURES 7, 9 and 10,
or alternately, by designing different oligonucleotide sequences having the same or different length with unmodified nucleotides which, in a predetermined way, generate appropriately differentiated molecular weights (see for example FIGURE 13). The process of multiplexing by mass-modified nucleic acid primers (UP) is illustrated by way of example in FIGURE 11 for mass analyzing four different DNA clones simultaneously. The first reaction mixture is obtained by standard Sanger DNA sequencing having unknown DNA fragment 1 (clone 1) integrated in an appropriate vector (e.g., M13mpl8), employing an unmodified nucleic acid primer UP , and a standard mixture of the four unmodified deoxynucleoside triphosphates, dNTP , and with l/10th of one of the four dideoxynucleoside triphosphates, ddNTP A second reaction mixture for DNA fragment 2 (clone 2) is obtained by employing a mass-modified nucleic acid primer UP and, as before, the four unmodified nucleoside triphosphates, dNTP , containing in each separate Sanger reaction l/10th of the chain-terminating unmodified dideoxynucleoside triphosphates ddNTP . In the other two experiments, the
2 four Sanger reactions have the following compositions: DNA fragment 3 (clone 3), UP , dNTP , ddNTP and DNA fragment 4 (clone 4), UP , dNTP , ddNTP . For mass spectrometric DNA sequencing, all base-specifically terminated reactions of the four clones are pooled and mass analyzed. The various mass peaks belonging to the four dideoxy-terminated (e.g., ddT-terminated) fragment families are assigned to specifically elongated and ddT-terminated fragments by searching (such as by a computer program)
0 1 2 3 for the known molecular ion peaks of UP , UP , UP and UP extended by either one of the four dideoxynucleoside triphosphates, UP -ddN , UP -ddN , UP -ddN and UP - ddN . In this way, the first nucleotides of the four unknown DNA sequences of clone 1 to 4 are determined. The process is repeated, having memorized the molecular masses of the four specific first extension products, until the four sequences are assigned.
Unambiguous mass/sequence assignments are possible even in the worst case scenario in which the four mass-modified nucleic acid primers are extended by the same dideoxynucleoside triphosphate, the extension products then being, for example, UP -
1 2 3 ddT, UP -ddT, UP -ddT and UP -ddT, which differ by the known mass increment differentiating the four nucleic acid primers. In another embodiment of this invention, an
analogous technique is employed using different vectors containing, for example, the SP6 and/or T7 promoter sequences, and performing transcription with the nucleic acid
0 1 2 3 primers UP , UP , UP and UP and either an RNA polymerase (e.g., SP6 or T7 RNA polymerase) with chain-elongating and terminating unmodified nucleoside triphosphates
0 0. NTP and 3 '-dNTP Here, the DNA sequence is being determined by Sanger RNA sequencing.
FIGURE 12 illustrates the process of multiplexing by mass-modified chain- elongating or/and terminating nucleoside triphosphates in which three different DNA fragments (3 clones) are mass analyzed simultaneously. The first DNA Sanger sequencing reaction (DNA fragment 1, clone 1) is the standard mixture employing
0 0 unmodified nucleic acid primer UP , dNTP and in each of the four reactions one of the four ddNTP . The second (DNA fragment 2, clone 2) and the third (DNA fragment 3,
0 0 1 0 0 2 clone 3) have the following contents: UP , dNTP , ddNTP and UP , dNTP , ddNTP
, respectively. In a variation of this process, an amplification of the mass increment in mass-modifying the extended DNA fragments can be achieved by either using an equally
1 2 mass-modified deoxynucleoside triphosphate (i.e., dNTP , dNTP ) for chain elongation alone or in conjunction with the homologous equally mass-modified dideoxynucleoside triphosphate. For the three clones depicted above, the contents of the reaction mixtures can be as follows: either UP°/dNTP0/ddNTP°, wΛdNirΛddNTP0 and UP°/dNTP2/ddNTP° or UP°/dNTP0/ddNTP°, UP°/dNTP * /ddNTP * and
0 2 2
UP /dNTP /ddNTP . As described above, DNA sequencing can be performed by
Sanger RNA sequencing employing unmodified nucleic acid primers, UP , and an appropriate mixture of chain-elongating and terminating nucleoside triphosphates. The mass-modification can be again either in the chain-terminating nucleoside triphosphate alone or in conjunction with mass-modified chain-elongating nucleoside triphosphates.
Multiplexing is achieved by pooling the three base-specifically terminated sequencing reactions (e.g., the ddTTP terminated products) and simultaneously analyzing the pooled products by mass spectrometry. Again, the first extension products of the known nucleic acid primer sequence are assigned, e.g., via a computer program. Mass/sequence assignments are possible even in the worst case in which the nucleic acid primer is extended/terminated by the same nucleotide, e.g., ddT, in all three clones. The following
configurations thus obtained can be well differentiated by their different mass-
0 0 0 1 0 2 modifications: UP -ddT , UP -ddT , UP -ddT .
In yet another embodiment of this invention, DNA sequencing by multiplex mass spectrometry can be achieved by cloning the DNA fragments to be sequenced in "plex-vectors" containing vector specific "tag sequences" as described (Kόster et al,
"Oligonucleotide Synthesis and Multiplex DNA Sequencing Using Chemiluminescent
Detection," Nucleic Acids Res. Symposium Ser. No. 24, 318-321 (1991)); then pooling clones from different plex-vectors for DNA preparation and the four separate Sanger sequencing reactions using standard dNTP /ddNTP and nucleic acid primer UP ; purifying the four multiplex fragment families via linking to a solid support through the linking group, L, at the 5'-end of UP; washing out all by-products, and cleaving the purified multiplex DNA fragments off the support or using the L-L' bound nested Sanger fragments as such for mass spectrometric analysis as described above; performing de¬ multiplexing by one-by-one hybridization of specific "tag probes"; and subsequently analyzing by mass spectrometry (see, for example, FIGURE 13). As a reference point, the four base-specifically terminated multiplex DNA fragment families are run by the
. 0 0 0 0 mass spectrometer and all ddT -, ddA -, ddC - and ddG -terminated molecular ion peaks are respectively detected and memorized. Assignment of, for example, ddT - terminated DNA fragments to a specific fragment family is accomplished by another mass spectrometric analysis after hybridization of the specific tag probe (TP) to the corresponding tag sequence contained in the sequence of this specific fragment family. Only those molecular ion peaks which are capable of hybridizing to the specific tag probe are shifted to a higher molecular mass by the same known mass increment (e.g. of the tag probe). These shifted ion peaks, by virtue of all hybridizing to a specific tag probe, belong to the same fragment family. For a given fragment family, this is repeated for the remaining chain terminated fragment families with the same tag probe to assign the complete DNA sequence. This process is repeated i-1 times corresponding to i clones multiplexed (the i-th clone is identified by default). The differentiation of the tag probes for the different multiplexed clones can be obtained just by the DNA sequence and its ability to Watson-Crick base pair to the tag
sequence. It is well known in the art how to calculate stringency conditions to provide for specific hybridization of a given tag probe with a given tag sequence (see, for example, Molecular Cloning: A laboratory manual 2ed, ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: NY, 1989, Chapter 11). Furthermore, differentiation can be obtained by designing the tag sequence for each plex-vector to have a sufficient mass difference so as to be unique just by changing the length or base composition or by mass-modifications according to FIGURES 7, 9 and 10. In order to keep the duplex between the tag sequence and the tag probe intact during mass spectrometric analysis, it is another embodiment of the invention to provide for a covalent attachment mediated by, for example, photoreactive groups such as psoralen and ellipticine and by other methods known to those skilled in the art (see, for example, Helene et al, Nature 344, 358 (1990) and Thuong et al. "Oligonucleotides Attached to Intercalators, Photoreactive and Cleavage Agents" in F. Eckstein, Oligonucleotides and Analogues: A Practical Approach. LRL Press, Oxford 1991, 283-306). The DNA sequence is unraveled again by searching for the lowest molecular weight molecular ion peak corresponding to the known UP -tag sequence/tag probe molecular weight plus the first extension product, e.g., ddT , then the second, the third, etc.
In a combination of the latter approach with the previously described multiplexing processes, a further increase in multiplexing can be achieved by using, in addition to the tag probe/tag sequence interaction, mass-modified nucleic acid primers (FIGURE 7) and/or mass-modified deoxynucleoside, dNTP ' and/or dideoxynucleoside triphosphates, ddNTP . Those skilled in the art will realize that the tag sequence/tag probe multiplexing approach is not limited to Sanger DNA sequencing generating nested DNA fragments with DNA polymerases. The DNA sequence can also be determined by transcribing the unknown DNA sequence from appropriate promoter-containing vectors (see above) with various RNA polymerases and mixtures of NTP /3'-dNTP , thus generating nested RNA fragments.
In yet another embodiment of this invention, the mass-modifying functionality can be introduced by a two or multiple step process. In this case, the nucleic acid primer, the chain-elongating or terminating nucleoside triphosphates and/or
the tag probes are, in a first step, modified by a precursor functionality such as azido, - N3, or modified with a functional group in which the R in XR is H (FIGURE 7, 9) thus providing temporary functions, e.g., but not limited to -OH, -NH2, -NHR, -SH, -NCS, -OCO(CH2)rCOOH (r = 1-20), -NHCO(CH2)rCOOH (r = 1-20), -OSO2OH, -OCO(CH2)rI (r = 1-20), -OP(O-Alkyl)N(Alkyl)2. These less bulky functionalities result in better substrate properties for the enzymatic DNA or RNA synthesis reactions of the DNA sequencing process. The appropriate mass-modifying functionality is then introduced after the generation of the nested base-specifically terminated DNA or RNA fragments prior to mass spectrometry. Several examples of compounds which can serve as mass-modifying functionalities are depicted in FIGURES 9 and 10 without limiting the scope of this invention.
Another aspect of this invention concerns kits for sequencing nucleic acids by mass spectrometry which include combinations of the above-described sequencing reactants. For instance, in one embodiment, the kit comprises reactants for multiplex mass spectrometric sequencing of several different species of nucleic acid. The kit can include a solid support having a linking functionality (L ) for immobilization of the base- specifically terminated products; at least one nucleic acid primer having a linking group (L) for reversibly and temporarily linking the primer and solid support through, for example, a photocleavable bond; a set of chain-elongating nucleotides (e.g., dATP, dCTP, dGTP and dTTP, or ATP, CTP, GTP and UTP); a set of chain-terminating nucleotides (such as 2',3'-dideoxynucleotides for DNA synthesis or 3'-deoxynucleotides for RNA synthesis); and an appropriate polymerase for synthesizing complementary nucleotides. Primers and/or terminating nucleotides can be mass-modified so that the base-specifically terminated fragments generated from one of the species of nucleic acids to be sequenced can be distinguished by mass spectrometry from all of the others Alternative to the use of mass-modified synthesis reactants, a set of tag probes (as described above) can be included in the kit. The kit can also include appropriate buffers as well as instructions for performing multiplex mass spectrometry to concurrently sequence multiple species of nucleic acids. In another embodiment, a nucleic acid sequencing kit can comprise a solid support as described above, a primer for initiating synthesis of complementary nucleic
acid fragments, a set of chain-elongating nucleotides and an appropriate polymerase. The mass-modified chain-terminating nucleotides are selected so that the addition of one of the chain terminators to a growing complementary nucleic acid can be distinguished by mass spectrometry. The present invention is further illustrated by the following examples which should not be construed as limiting in any way. The contents of all cited references (including literature references, issued patents, published patent applications (including international patent application Publication Number WO 94/16101, entitled "DNA Sequencing by Mass Spectrometry" by H. Koester; and international patent application Publication Number WO 94/21822 entitled "DNA Sequencing by Mass Spectrometry Via Exonuclease Degradation" by H. Koester), and co-pending patent applications, (including U.S Patent Application Serial No. 08/406,199, entitled "DNA Diagnostics Based on Mass Spectrometry" by H. Koester), as cited throughout this application are hereby expressly incorporated by reference.
EXAMPLE 1
Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectrometric analysis via disulfide bonds. As a solid support, Sequelon membranes (Millipore Corp., Bedford, MA) with phenyl isothiocyanate groups are used as a starting material. The membrane disks, with a diameter of 8 mm, are wetted with a solution of N-methylmorpholine/water/2- propanol (NMM solution) (2/49/49 v/v/v), the excess liquid removed with filter paper and placed on a piece of plastic film or aluminum foil located on a heating block set to 55 C. A solution of 1 mM 2-mercaptoethylamine (cysteamine) or 2, 2'-dithio- bis(ethylamine) (cystamine) or S-(2-thiopyridyl)-2-thio-ethylamine (10 ul, 10 nmol) in NMM is added per disk and heated at 55 C. After 15 min, 10 ul of NMM solution are added per disk and heated for another 5 min. Excess of isothiocyanate groups may be removed by treatment with 10 ul of a 10 mM solution of glycine in NMM solution. For cystamine, the disks are treated with 10 ul of a solution of 1M aqueous dithiothreitol (DTT)/2-propanol (1 :1 v/v) for 15 min at room temperature. Then, the disks are
thoroughly washed in a filtration manifold with 5 aliquots of 1 ml each of the NMM solution, then with 5 aliquots of 1 ml acetonitrile/water (1/1 v/v) and subsequently dried. If not used immediately the disks are stored with free thiol groups in a solution of 1M aqueous dithiothreitol/2-propanol (1 : 1 v/v) and, before use, DTT is removed by three washings with 1 ml each of the NMM solution. The primer oligonucleotides with 5'-SH functionality can be prepared by various methods (e.g., B.C.F Chu et al, Nucleic Acids Res. 14. 5591-5603 (1986), Sproat et al. Nucleic Acids Res 15 4837-48 (1987) and Oligonucleotides and Analogues: A Practical Approach (F Eckstein, editor), LRL Press Oxford, 1991). Sequencing reactions according to the Sanger protocol are performed in a standard way (e.g., H. Swerdlow et al, Nucleic Acids Res. 18, 1415-19 (1990)). In the presence of about 7-10 mM DTT the free 5'-thiol primer can be used; in other cases, the SH functionality can be protected, e.g., by a trityl group during the Sanger sequencing reactions and removed prior to anchoring to the support in the following way. The four sequencing reactions (150 ul each in an Eppendorf tube) are terminated by a 10 min incubation at 70 C to denature the DNA polymerase (such as
Klenow fragment, Sequenase) and the reaction mixtures are ethanol precipitated. The supernatants are removed and the pellets vortexed with 25 ul of an 1M aqueous silver nitrate solution, and after one hour at room temperature, 50 ul of an 1 M aqueous solution of DTT is added and mixed by vortexing. After 15 min, the mixtures are centrifuged and the pellets are washed twice with 100 ul ethylacetate by vortexing and centrifugation to remove excess DTT. The primer extension products with free S'-thiol group are now coupled to the thiolated membrane supports under mild oxidizing conditions. In general, it is sufficient to add the 5'-thiolated primer extension products dissolved in 10 ul 10 mM de-aerated triethylammonium acetate buffer (TEAA) pH 7.2 to the thiolated membrane supports. Coupling is achieved by drying the samples onto the membrane disks with a cold fan. This process can be repeated by wetting the membrane with 10 ul of 10 mM TEAA buffer pH 7.2 and drying as before. When using the 2- thiopyridyl derivatized compounds, anchoring can be monitored by the release of pyridine-2-thione spectrophotometrically at 343 nm. In another variation of this approach, the oligonucleotide primer is functionalized with an amino group at the 5'-end which is introduced by standard
procedures during automated DNA synthesis. After primer extension, during the Sanger sequencing process, the primary amino group is reacted with 3-(2-pyridyldithio) propionic acid N-hydroxysuccinimide ester (SPDP) and subsequently coupled to the thiolated supports and monitored by the release of pyridyl-2-thione as described above. After denaturation of DNA polymerase and ethanol precipitation of the sequencing products, the supernatants are removed and the pellets dissolved in 10 ul 10 mM TEAA buffer pH 7.2 and 10 ul of a 2 mM solution of SPDP in 10 mM TEAA are added. The reaction mixture is vortexed and incubated for 30 min at 25 C. Excess SPDP is then removed by three extractions (vortexing, centrifugation) with 50 ul each of ethanol and the resulting pellets are dissolved in 10 ul 10 mM TEAA buffer pH 7.2 and coupled to the thiolated supports (see above).
The primer-extension products are purified by washing the membrane disks three times each with 100 ul NMM solution and three times with 100 ul each of 10 mM TEAA buffer pH 7.2. The purified primer-extension products are released by three successive treatments with 10 ul of 10 mM 2-mercaptoethanol in 10 mM TEAA buffer pH 7.2, lyophilized and analyzed by either ES or MALDI mass spectrometry.
This procedure can also be used for the mass-modified nucleic acid primers UP in an analogous and appropriate way, taking into account the chemical properties of the mass-modifying functionalities.
EXAMPLE 2
Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectrometric analysis via the levulinyl group 5-Aminolevulinic acid is protected at the primary amino group with the
Fmoc group using 9-fluorenylmethyl N-succinimidyl carbonate and is then transformed into the N-hydroxysuccinimide ester (NHS ester) using N-hydroxysuccinimide and dicyclohexyl carbodiimide under standard conditions. For the Sanger sequencing reactions, nucleic acid primers, UP , are used which are functionalized with a primary amino group at the 5'-end introduced by standard procedures during automated DNA synthesis with aminolinker phosphoamidites as the final synthetic step. Sanger
sequencing is performed under standard conditions (see above). The four reaction mixtures (150 ul each in an Eppendorf tube) are heated to 70 C for 10 min to inactivate the DNA polymerase, ethanol precipitated, centrifuged and resuspended in 10 ul of 10 mM TEAA buffer pH 7.2. 10 ul of a 2 mM solution of the Fmoc-5-aminolevulinyI-NHS ester in 10 mM TEAA buffer is added, vortexed and incubated at 25 C for 30 min. The excess of the reagent is removed by ethanol precipitation and centrifugation The Fmoc group is cleaved off by resuspending the pellets in 10 ul of a solution of 20% piperidine in N,N-dimethylformamide/water (1 : 1 v/v). After 15 min at 25 C, piperidine is thoroughly removed by three precipitations/centrifugations with 100 ul each of ethanol, the pellets are resuspended in 10 ul of a solution of N-methylmorpholine, 2-propanol and water
(2/10/88 v/v/v) and are coupled to the solid support carrying an isothiocyanate group. In the case of the DITC-Sequelon membrane (Millipore Corp., Bedford, MA), the membranes are prepared as described in EXAMPLE 1 and coupling is achieved on a heating block at 55 C as described above. RNA extension products are immobilized in an analogous way. The procedure can be applied to other solid supports with isothiocyanate groups in a similar manner.
The immobilized primer-extension products are extensively washed three times with 100 ul each of NMM solution and three times with 100 ul 10 mM TEAA buffer pH 7.2. The purified primer-extension products are released by three successive treatments with 10 ul of 100 mM hydrazinium acetate buffer pH 6.5, lyophilized and analyzed by either ES or MALDI mass spectrometry.
EXAMPLE 3
Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectrometric analysis via a trypsin sensitive linkage
Sequelon DITC membrane disks of 8 mm diameter (Millipore Corp., Bedford, MA) are wetted with 10 ul of NMM solution (N-methylmorpholine/propanaol- 2/water; 2/49/49 v/v/v) and a linker arm introduced by reaction with 10 ul of a 10 mM solution of 1,6-diaminohexane in NMM The excess diamine is removed by three washing steps with 100 ul of NMM solution. Using standard peptide synthesis protocols,
two L-lysine residues are attached by two successive condensations with N-Fmoc-N- tBoc-L-lysine pentafluorophenylester, the terminal Fmoc group is removed with piperidine in NMM and the free α-amino group coupled to 1,4-phenylene diisothiocyanate (DITC). Excess DITC is removed by three washing steps with 100 ul 2- propanol and the N-tBoc groups removed with trifluoroacetic acid according to standard peptide synthesis procedures. The nucleic acid primer-extension products are prepared from oligonucleotides which carry a primary amino group at the 5'-terminus. The four Sanger DNA sequencing reaction mixtures (150 ul each in Eppendorf tubes) are heated for 10 min at 70 C to inactivate the DNA polymerase, ethanol precipitated, and the pellets resuspended in 10 ul of a solution of N-methylmorpholine, 2-propanol and water (2/10/88 v/v/v). This solution is transferred to the Lys-Lys-DITC membrane disks and coupled on a heating block set at 55 C. After drying, 10 ul of NMM solution is added and the drying process repeated.
The immobilized primer-extension products are extensively washed three times with 100 ul each of NMM solution and three times with 100 ul each of 10 mM TEAA buffer pH 7.2. For mass spectrometric analysis, the bond between the primer- extension products and the solid support is cleaved by treatment with trypsin under standard conditions and the released products analyzed by either ES or MALDI mass spectrometry with trypsin serving as an internal mass standard
EXAMPLE 4
Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectrometric analysis via pyrophosphate linkage The DITC Sequelon membrane (disks of 8 mm diameter) are prepared as described in EXAMPLE 3 and 10 ul of a 10 mM solution of 3-aminopyridine adenine dinucleotide (APAD) (Sigma) in NMM solution added. The excess APAD is removed by a 10 ul wash of NMM solution and the disks are treated with 10 ul of 10 mM sodium periodate in NMM solution (15 min, 25 C). Excess periodate is removed and the primer-extension products of the four Sanger DNA sequencing reactions (150 ul each in Eppendorf tubes) employing nucleic acid primers with a primary amino group at the 5'-
M nd are ethanol precipitated, dissolved in 10 ul of a solution of N-methylmorpholine/2- propanol/ water (2/10/88 v/v/v) and coupled to the 2' 3'-diaIdehydo groups of the immobilized NAD analog.
The primer-extension products are extensively washed with the NMM solution (3 times with 100 ul each) and 10 mM TEAA buffer pH 7.2 (3 times with 100 ul each) and the purified primer-extension products are released by treatment with either NADase or pyrophosphatase in 10 mM TEAA buffer at pH 7.2 at 37 C for 15 min, lyophilized and analyzed by either ES or MALDI mass spectrometry, the enzymes serving as internal mass standards.
EXAMPLE S
Synthesis of nucleic acid primers mass-modified by glycine residues at the 5'- position of the sugar moiety of the terminal nucleoside Oligonucleotides are synthesized by standard automated DNA synthesis using β-cyanoethylphosphoamidites (H. Kόster et al., Nucleic Acids Res. )2, 4539 (1984)) and a 5'-amino group is introduced at the end of solid phase DNA synthesis (e.g. Agrawal et al, Nucleic Acids Res. 14, 6227-45 (1986) or Sproat et al, Nucleic Acids Res. 15. 6181-96 (1987)). The total amount of an oligonucleotide synthesis, starting with 0.25 umol CPG-bound nucleoside, is deprotected with concentrated aqueous ammonia, purified via OligoPAK T M Cartridges (Millipore Corp., Bedford, MA) and lyophilized. This material with a 5'-terminal amino group is dissolved in 100 ul absolute
N,N-dimethylformamide (DMF) and condensed with 10 μmole N-Fmoc-glycine pentafluorophenyl ester for 60 min at 25 C. After ethanol precipitation and centrifugation, the Fmoc group is cleaved off by a 10 min treatment with 100 ul of a solution of 20% piperidine in N,N-dimethylformamide. Excess piperidine, DMF and the cleavage product from the Fmoc group are removed by ethanol precipitation and the precipitate lyophilized from 10 mM TEAA buffer pH 7.2. This material is now either used as primer for the Sanger DNA sequencing reactions or one or more glycine residues (or other suitable protected amino acid active esters) are added to create a series of mass- modified primer oligonucleotides suitable for Sanger DNA or RNA sequencing.
Immobilization of these mass-modified nucleic acid primers UP after primer-extension during the sequencing process can be achieved as described, e.g., in EXAMPLES 1 to 4.
EXAMPLE 6
Synthesis of nucleic acid primers mass-modified at C-5 of the heterocyciic base of a pyrimidine nucleoside with glycine residues
Starting material was 5-(3-aminopropynyl-l)-3' 5'-di-p-tolyldeoxyuridine prepared and 3' 5'-de-O-acylated according to literature procedures (Haralambidis et al, Nucleic Acids Res. 15. 4857-76 (1987)). 0.281 g (1.0 mmole) 5-(3-aminopropynyl-l)-2'- deoxyuridine were reacted with 0.927 g (2.0 mmole) N-Fmoc-glycine pentafluorophenylester in 5 ml absolute N,N-dimethylformamide in the presence of 0.129 g (1 mmole; 174 ul) N,N-diisopropylethylamine for 60 min at room temperature. Solvents were removed by rotary evaporation and the product was purified by silica gel chromatography (Kieselgel 60, Merck; column: 2.5x 50 cm, elution with chloroform/methanol mixtures). Yield was 0.44 g (0.78 mmole, 78 %). In order to add another glycine residue, the Fmoc group is removed with a 20 min treatment with 20% solution of piperidine in DMF, evaporated in vacuo and the remaining solid material extracted three times with 20 ml ethylacetate. After having removed the remaining ethylacetate, N-Fmoc-glycine pentafluorophenylester is coupled as described above. 5- (3-(N-Fmoc-glycyl)-amidopropynyl-l)-2'-deoxyuridine is transformed into the 5'-O- dimethoxytritylated nucleoside-3'-O-β-cyanoethyl-N,N-diisopropylphosphoamidite and incorporated into automated oligonucleotide synthesis by standard procedures (H. Kόster et al. Nucleic Acids Res. 12. 2261 (1984)). This glycine modified thymidine analogue building block for chemical DNA synthesis can be used to substitute one or more of the thymidine/uridine nucleotides in the nucleic acid primer sequence. The Fmoc group is removed at the end of the solid phase synthesis with a 20 min treatment with a 20 % solution of piperidine in DMF at room temperature. DMF is removed by a washing step with acetonitrile and the oligonucleotide deprotected and purified in the standard way
EXAMPLE 7
Synthesis of a nucleic acid primer mass-modified at C-5 of the heterocyciic base of a pyrimidine nucleoside with β-alanine residues
Starting material was the same as in EXAMPLE 6. 0.281 g (1.0 mmole) 5 -(3 - Aminopropynyl- 1 )-2'-deoxyuridine was reacted with N-Fmoc-β-alanine pentafluorophenylester (0.955 g, 2.0 mmole) in 5 ml N,N-dimethylformamide (DMF) in the presence of 0.129 g (174 ul; 1.0 mmole) N,N-disopropylethylamine for 60 min at room temperature. Solvents were removed and the product purified by silica gel chromatography as described in EXAMPLE 6. Yield was 0.425 g (0.74 mmole, 74 %). Another β-alanine moiety can be added in exactly the same way after removal of the Fmoc group. The preparation of the 5'-O-dimethoxytritylated nucleoside-3'-O-β- cyanoethyl-N,N-diisopropylphosphoamidite from 5-(3-(N-Fmoc-β-alanyl)- amidopropynyl- 1 )-2'-deoxyuridine and incorporation into automated oligonucleotide synthesis is performed under standard conditions. This building block can substitute for any of the thymidine/uridine residues in the nucleic acid primer sequence. In the case of only one incorporated mass-modified nucleotide, the nucleic acid primer molecules prepared according to EXAMPLES 6 and 7 would have a mass difference of 14 daltons.
EXAMPLE 8
Synthesis of a nucleic acid primer mass-modified at C-5 of the heterocyciic base of a pyrimidine nucleoside with ethylene glycol monomethyl ether As a nucleosidic component, 5-(3-aminopropynyl-l)-2,-deoxyuridine was used in this example (see EXAMPLES 6 and 7). The mass-modifying functionality was obtained as follows: 7.61 g (100.0 mmole) freshly distilled ethylene glycol monomethyl ether dissolved in 50 ml absolute pyridine was reacted with 10.01 g (100.0 mmole) recrystallized succinic anhydride in the presence of 1.22 g (10 0 mmole) 4-N,N- dimethylaminopyridine overnight at room temperature The reaction was terminated by the addition of water (5 0 ml), the reaction mixture evaporated in vacuo, co-evaporated twice with dry toluene (20 ml each) and the residue redissolved in 100 ml dichloromethane The solution was extracted successively, twice with 10 % aqueous citric acid (2 x 20 ml) and once with water (20 ml) and the organic phase dried over anhydrous sodium sulfate. The organic phase was evaporated in vacuo, the residue redissolved in 50 ml dichloromethane and precipitated into 500 ml pentane and the precipitate dried in vacuo Yield was 13.12 g (74 0 mmole; 74 %) 8 86 g (50 0 mmole) of succinylated ethylene glycol monomethyl ether was dissolved in 100 ml dioxane containing 5% dry pyridine (5 ml) and 6 96 g (50 0 mmole) 4-nitrophenol and 10 32 g (50 0 mmole) dicyclohexylcarbodiimide was added and the reaction run at room temperature for 4 hours Dicyclohexylurea was removed by filtration, the filtrate evaporated in vacuo and the residue redissolved in 50 ml anhydrous DMF 12.5 ml (about 12 5 mmole 4-nitrophenylester) of this solution was used to dissolve 2 81 g (10 0 mmole) 5-(3-aminopropynyl-l)-2'-deoxyuridine The reaction was performed in the presence of 1 01 g (10 0 mmole, 1.4 ml) triethylamine at room temperature overnight
The reaction mixture was evaporated in vacuo, co-evaporated with toluene, redissolved in dichloromethane and chromatographed on silicagel (Si60, Merck, column 4x50 cm) with dichloromethane/methanol mixtures The fractions containing the desired compound were collected, evaporated, redissolved in 25 ml dichloromethane and precipitated into 250 ml pentane The dried precipitate of 5-(3-N-(O-succinyl ethylene glycol monomethyl ether)-amidopropynyl-l)-2'-deoxyuridine (yield 65 %) is 5'-O-dimethoxytritylated and
transformed into the nucleoside-3 '-O-β-cyanoethyl-N, N-diisopropylphosphoamidite and incorporated as a building block in the automated oligonucleotide synthesis according to standard procedures. The mass-modified nucleotide can substitute for one or more of the thymidine/uridine residues in the nucleic acid primer sequence. Deprotection and purification of the primer oligonucleotide also follows standard procedures.
EXAMPLE 9
Synthesis of a nucleic acid primer mass-modified at C-5 of the heterocyciic base of a pyrimidine nucleoside with diethylene glycol monomethyl ether Nucleosidic starting material was as in previous examples, 5-(3- aminopropynyl-l)-2'-deoxyuridine. The mass-modifying functionality was obtained similar to EXAMPLE 8. 12.02 g (100.0 mmole) freshly distilled diethylene glycol monomethyl ether dissolved in 50 ml absolute pyridine was reacted with 10.01 g (100.0 mmole) recrystallized succinic anhydride in the presence of 1.22 g (10.0 mmole) 4-N, N- dimethylaminopyridine (DMAP) overnight at room temperature. The work-up was as described in EXAMPLE 8. Yield was 18.35 g (82.3 mmole, 82.3 %). 1 1.06 g (50.0 mmole) of succinylated diethylene glycol monomethyl ether was transformed into the 4- nitrophenylester and, subsequently, 12.5 mmole was reacted with 2.81 g (10.0 mmole) of 5-(3-aminopropynyl-l)-2'-deoxyuridine as described in EXAMPLE 8. Yield after silica gel column chromatography and precipitation into pentane was 3.34 g (6.9 mmole, 69 %). After dimethoxytritylation and transformation into the nucleoside-β- cyanoethylphosphoamidite, the mass-modified building block is incorporated into automated chemical DNA synthesis according to standard procedures. Within the sequence of the nucleic acid primer UP , one or more of the thymidine/uridine residues can be substituted by this mass-modified nucleotide. In the case of only one incorporated mass-modified nucleotide, the nucleic acid primers of EXAMPLES 8 and 9 would have a mass difference of 44.05 daltons.
EXAMPLE 10
Synthesis of a nucleic acid primer mass-modified at C-8 of the heterocyciic base of deoxyadenosine with glycine Starting material was N6 -benzoyl-8-bromo-5'-O-(4,4'-dimethoxytrityl)-2'- deoxyadenosine prepared according to literature (Singh et al, Nucleic Acids Res. 18, 3339-45 (1990)). 632.5 mg (1.0 mmole) of this 8-bromo-deoxyadenosine derivative was suspended in 5 ml absolute ethanol and reacted with 251.2 mg (2.0 mmole) glycine
methyl ester (hydrochloride) in the presence of 241.4 mg (2.1 mmole; 366 ul) N, N- diisopropylethylamine and refluxed until the starting nucleosidic material had disappeared (4-6 hours) as checked by thin layer chromatography (TLC). The solvent was evaporated and the residue purified by silica gel chromatography (column 2.5x50 cm) using solvent mixtures of chloroform/methanol containing 0.1 % pyridine. The product fractions were combined, the solvent evaporated, the fractions dissolved in 5 ml dichloromethane and precipitated into 100 ml pentane. Yield was 487 mg (0.76 mmole, 76 %). Transformation into the corresponding nucleoside-β-cyanoethylphosphoamidite and integration into automated chemical DNA synthesis is performed under standard conditions. During final deprotection with aqueous concentrated ammonia, the methyl group is removed from the glycine moiety. The mass-modified building block can substitute one or more deoxyadenosine/adenosine residues in the nucleic acid primer sequence.
EXAMPLE 11
Synthesis of a nucleic acid primer mass-modified at C-8 of the heterocyciic base of deoxyadenosine with glycylglycine
This derivative was prepared in analogy to the glycine derivative of
6 EXAMPLE 10. 632.5 mg (1.0 mmole) N -Benzoyl-8-bromo-5'-O-(4,4'- dimethoxytrityl)-2'-deoxyadenosine was suspended in 5 ml absolute ethanol and reacted with 324.3 mg (2.0 mmole) glycyl-glycine methyl ester in the presence of 241.4 mg (2.1 mmole, 366 μl)
N, N-diisopropylethylamine. The mixture was refluxed and completeness of the reaction checked by TLC. Work-up and purification was similar to that described in EXAMPLE
10. Yield after silica gel column chromatography and precipitation into pentane was 464 mg (0.65 mmole, 65 %). Transformation into the nucleoside-β- cyanoethylphosphoamidite and into synthetic oligonucleotides is done according to standard procedures. In the case where only one of the deoxyadenosine/adenosine residues in the nucleic acid primer is substituted by this mass-modified nucleotide, the
mass difference between the nucleic acid primers of EXAMPLES 10 and 11 is 57.03 daltons.
EXAMPLE 12
Synthesis of a nucleic acid primer mass-modified at the C-2' of the sugar moiety of 2'-anιino-2'-deoxythymidine with ethylene glycol monomethyl ether residues
Starting material was 5'-O-(4,4-dimethoxytrityl)-2'-amino-2'- deoxythymidine synthesized according to published procedures (e.g., Verheyden et al, I. Org Chem. 36, 250-254 (1971); Sasaki et al, J. Org. Chem. 41, 3138-3143 (1976); Imazawa et al. J. Org Chem. 44, 2039-2041 (1979); Hobbs et al, J. Org. Chem. 42, 714-719 (1976); Ikehara et al. Chem Pharm. Bull. Japan 26 240-244 (1978); see also PCT Application WO 88/00201). 5'-O-(4,4-Dimethoxytrityl)-2'-amino-2'- deoxythymidine (559.62 mg; 1.0 mmole) was reacted with 2.0 mmole of the 4- nitrophenyl ester of succinylated ethylene glycol monomethyl ether (see EXAMPLE 8) in 10 ml dry DMF in the presence of 1.0 mmole (140 μl) triethylamine for 18 hours at room temperature. The reaction mixture was evaporated in vacuo, co-evaporated with toluene, redissolved in dichloromethane and purified by silica gel chromatography (Si60, Merck; column: 2.5x50 cm; eluent: chloroform/methanol mixtures containing 0.1 % triethylamine). The product containing fractions were combined, evaporated and precipitated into pentane. Yield was 524 mg (0.73 mmol; 73 %). Transformation into the nucleoside-β-cyanoethyl-N,N-diisopropylphosphoamidite and incorporation into the automated chemical DNA synthesis protocol is performed by standard procedures. The mass-modified deoxythymidine derivative can substitute for one or more of the thymidine residues in the nucleic acid primer. In an analogous way, by employing the 4-nitrophenyl ester of succinylated diethylene glycol monomethyl ether (see EXAMPLE 9) and triethylene glycol monomethyl ether, the corresponding mass-modified oligonucleotides are prepared. In the case of only one incorporated mass-modified nucleoside within the sequence, the mass difference between the ethylene, diethylene and triethylene glycol derivatives is 44.05, 88.1 and 132.15 daltons respectively.
EXAMPLE 13
Synthesis of a nucleic acid primer mass-modified in the internucleotidic linkage via alkylation of phosphorothioate groups Phosphorothioate-containing oligonucleotides were prepared according to standard procedures (see e.g. Gait et al, Nucleic Acids Res lg 1183 (1991)). One, several or all internucleotide linkages can be modified in this way. The (-)-M13 nucleic acid primer sequence (17-mer) 5'-dGTAAAACGACGGCCAGT was synthesized in 0.25 μmole scale on a DNA synthesizer and one phosphorothioate group introduced after the final synthesis cycle (G to T coupling). Sulfurization, deprotection and purification followed standard protocols. Yield was 31.4 nmole (12.6 % overall yield), corresponding to 31.4 nmole phosphorothioate groups. Alkylation was performed by dissolving the residue in 31.4 μl TE buffer (0.01 M Tris pH 8.0, 0.001 M EDTA) and by adding 16 μl of a solution of 20 mM solution of 2-iodoethanol (320 nmole; i.e., 10-fold excess with respect to phosphorothioate diesters) in N,N-dimethylformamide (DMF). The alkylated oligonucleotide was purified by standard reversed phase HPLC (RP-18 Ultraphere, Beckman; column: 4.5 x 250 mm; 100 mM triethylammonium acetate, pH 7.0 and a gradient of 5 to 40 % acetonitrile).
In a variation of this procedure, the nucleic acid primer containing one or more phosphorothioate phosphodiester bond is used in the Sanger sequencing reactions The primer-extension products of the four sequencing reactions are purified as exemplified in EXAMPLES 1 - 4, cleaved off the solid support, lyophilized and dissolved in 4 μl each of TE buffer pH 8.0 and alkylated by addition of 2 μl of a 20 mM solution of 2-iodoethanol in DMF. It is then analyzed by ES and/or MALDI mass spectrometry. In an analogous way, employing instead of 2-iodoethanoI, e.g., 3- iodopropanol, 4-iodobutanol mass-modified nucleic acid primer are obtained with a mass difference of 14.03, 28.06 and 42.03 daltons respectively compared to the unmodified phosphorothioate phosphodiester-containing oligonucleotide.
EXAMPLE 14
Synthesis of 2'-amino-2,-deoxyuridine-5,-triphosphate and 3'-amino-2',3'- dideoxythymidine-5'-triphosphate mass-modified at the 2'- or 3'-amino function with glycine or β-alanine residues
Starting material was 2'-azido-2'-deoxyuridine prepared according to literature (Verheyden et al. J. Org. Chem. 36 , 250 (1971)), which was 4,4- dimethoxytritylated at 5'-OH with 4,4-dimethoxytrityl chloride in pyridine and acetylated at 3'-OH with acetic anhydride in a one-pot reaction using standard reaction conditions. With 191 mg (0.71 mmole) 2'-azido-2'-deoxyuridine as starting material, 396 mg (0.65 mmol, 90.8 %) 5'-O-(4,4-dimethoxytrityl)-3'-O-acetyl-2'-azido-2'-deoxuridine was obtained after purification via silica gel chromatography. Reduction of the azido group was performed using published conditions (Barta et al. Tetrahedron 46. 587-594 (1990)). Yield of 5'-O-(4,4-dimethoxytrityl)-3'-O-acetyl-2'-amino-2'-deoxyuridine after silica gel chromatography was 288 mg (0.49 mmole; 76 %). This protected 2'-amino-2'- deoxyuridine derivative (588 mg, 1.0 mmole) was reacted with 2 equivalents (927 mg, 2.0 mmole) N-Fmoc-glycine pentafluorophenyl ester in 10 ml dry DMF overnight at room temperature in the presence of 1.0 mmole (174 μl) N,N-diisopropylethylamine. Solvents were removed by evaporation in vacuo and the residue purified by silica gel chromatography. Yield was 71 1 mg (0.71 mmole, 82 %). Detritylation was achieved by a one hour treatment with 80% aqueous acetic acid at room temperature. The residue was evaporated to dryness, co-evaporated twice with toluene, suspended in 1 ml dry acetonitrile and 5'-phosphorylated with POCI3 according to literature (Yoshikawa et al. , Bull Chem. Soc. Japan 42, 3505 (1969) and Sowa et al, Bull. Chem. Soc. Japan 48, 2084 (1975)) and directly transformed in a one-pot reaction to the 5'-triphosphate using 3 ml of a 0.5 M solution (1.5 mmole) tetra (tri-n-butylammonium) pyrophosphate in DMF according to literature (e.g. Seela et al, Helvetica Chimica Acta 24, 1048 (1991)). The Fmoc and the 3'-O-acetyl groups were removed by a one-hour treatment with concentrated aqueous ammonia at room temperature and the reaction mixture evaporated and lyophilized. Purification also followed standard procedures by using anion-exchange chromatography on DEAE-Sephadex with a linear gradient of triethylammonium bicarbonate (0.1 M - 1.0 M). Triphosphate containing fractions (checked by thin layer
chromatography on polyethyleneimine cellulose plates) were collected, evaporated and lyophilized. Yield (by UV-absorbance of the uracil moiety) was 68% (0.48 mmole).
A glycyl-glycine modified 2'-amino-2'-deoxyuridine-5 '-triphosphate was obtained by removing the Fmoc group from 5'-O-(4,4-dimethoxytrityI)-3'-O-acetyl-2'-N- (N-9-fluorenylmethyloxycarbonyl-glycyl)-2'-amino-2'-deoxyuridine by a one-hour treatment with a 20% solution of piperidine in DMF at room temperature, evaporation of solvents, two-fold co-evaporation with toluene and subsequent condensation with N- Fmoc-glycine pentafluorophenyl ester. Starting with 1.0 mmole of the 2'-N-glycyl-2'- amino-2'-deoxyuridine derivative and following the procedure described above, 0.72 mmole (72%) of the corresponding 2'-(N-glycyl-glycyl)-2'-amino-2'-deoxyuridine-5'- triphosphate was obtained.
Starting with 5'-O-(4,4-dimethoxytrityl)-3'-O-acetyl-2'-amino-2'- deoxyuridine and coupling with N-Fmoc-β-alanine pentafluorophenyl ester, the corresponding 2'-(N-β-alanyl)-2'-amino-2'-deoxyuridine-5'-triphosphate can be synthesized. These modified nucleoside triphosphates are incorporated during the Sanger
DNA sequencing process in the primer-extension products. The mass difference between the glycine, β-alanine and glycyl-glycine mass-modified nucleosides is, per nucleotide incoφorated, 58.06, 72.09 and 115.1 daltons respectively.
When starting with 5'-O-(4,4-dimethoxytrityl)-3'-amino-2',3'- dideoxythymidine (obtained by published procedures, see EXAMPLE 12), the corresponding 3'-(N-glycyl)-3'-amino-/ 3'-(-N-gIycyl-glycyl)-3'-amino-/ and 3'-(N-β- alanyl)-3'-amino-2',3'-dideoxythymidine-5'-triphosphates can be obtained. These mass- modified nucleoside triphosphates serve as a terminating nucleotide unit in the Sanger DNA sequencing reactions providing a mass difference per terminated fragment of 58.06, 72.09 and 1 15.1 daltons respectively when used in the multiplexing sequencing mode. The mass-differentiated fragments can then be analyzed by ES and/or MALDI mass spectrometry.
EXAMPLE 15
Synthesis of deoxyuridine-5'-triphosphate mass-modified at C-5 of the heterocyciic base with glycine, glycyl-glycine and β-alanine residues.
0.281 g (1.0 mmole) 5-(3-Aminopropynyl-l)-2,-deoxyuridine (see EXAMPLE 6) was reacted with either 0.927 g (2.0 mmole) N-Fmoc-glycine pentafluorophenylester or 0.955g (2.0 mmole) N-Fmoc-β-alanine pentafluorophenyl ester in 5 ml dry DMF in the presence of 0.129 g N, N-diisopropylethylamine (174 ul, 1.0 mmole) overnight at room temperature. Solvents were removed by evaporation in vacuo and the condensation products purified by flash chromatography on silica gel (Still et al, I Org. Chem. 41, 2923-2925 (1978)). Yields were 476 mg (0.85 mmole: 85%) for the glycine and 436 mg (0.76 mmole; 76%) for the β-alanine derivatives. For the synthesis of the glycyl-glycine derivative, the Fmoc group of 1.0 mmole Fmoc-glycine-deoxyuridine derivative was removed by one-hour treatment with 20% piperidine in DMF at room temperature. Solvents were removed by evaporation in vacuo, the residue was co- evaporated twice with toluene and condensed with 0.927 g (2.0 mmole) N-Fmoc-glycine pentafluorophenyl ester and purified as described above. Yield was 445 mg (0.72 mmole; 72%) The glycyl-, glycyl-glycyl- and β-alanyl-2'-deoxyuridine derivatives, N-protected with the Fmoc group were transformed to the 3'-O-acetyl derivatives by tritylation with 4,4-dimethoxytrityl chloride in pyridine and acetylation with acetic anhydride in pyridine in a one-pot reaction and subsequently detritylated by one hour treatment with 80% aqueous acetic acid according to standard procedures. Solvents were removed, the residues dissolved in 100 ml chloroform and extracted twice with 50 ml 10% sodium bicarbonate and once with 50 ml water, dried with sodium sulfate, the solvent evaporated and the residues purified by flash chromatography on silica gel. Yields were 361 mg (0.60 mmole; 71%) for the glycyl-, 351 mg (0.57 mmole; 75%) for the β-alanyl- and 323 mg (0.49 mmole; 68%) for the glycyl-glycyl-3-O'-acetyl-2'-deoxyuridine derivatives respectively. Phosphorylation at the 5'-OH with POCI3, transformation into the 5'- triphosphate by in-situ reaction with tetra(tri-n-butylammonium) pyrophosphate in DMF, 3'-de-O-acetylation, cleavage of the Fmoc group, and final purification by anion-exchange chromatography on DEAE-Sephadex was performed as described in EXAMPLE 14. Yields according to UV-absorbance of the uracil moiety were 0.41 mmole 5-(3-(N- glycyl)-amidopropynyl-l)-2'-deoxyuridine-5'-triphosphate (84%), 0 43 mmole 5-(3-(N-β-
alanyl)-amidopropynyl-l)-2'-deoxyuridine-5 '-triphosphate (75%) and 0.38 mmole 5-(3- (N-glycyl-glycyl)-amidopropynyl- 1 )-2'-deoxyuridine-5'-triphosphate (78%) .
These mass-modified nucleoside triphosphates were incorporated during the Sanger DNA sequencing primer-extension reactions. When using 5-(3-aminopropynyl- 1 )-2',3'-dideoxyuridine as starting material and following an analogous reaction sequence the corresponding glycyl-, glycyl-glycyl- and β-alanyl-2',3,-dideoxyuridine-5'-triphosphates were obtained in yields of 69, 63 and 71% respectively. These mass-modified nucleoside triphosphates serve as chain- terminating nucleotides during the Sanger DNA sequencing reactions. The mass- modified sequencing ladders are analyzed by either ES or MALDI mass spectrometry
EXAMPLE 16
Synthesis of 8-glycyl- and 8-glycyl-glycyI-2'-deoxyadenosine-5'-triphosphate 727 mg (1.0 mmole) of N -(4-tert-butylphenoxyacetyl)-8-glycyl-5'-(4,4-
6 dimethoxytπtyl)-2'- deoxyadenosine or 800 mg (1.0 mmole) N -(4-tert- butylphenoxyacetyl)-8-glycyl-glycyl-5'-(4,4-dimethoxytrityl)-2'-deoxyadenosine prepared according to EXAMPLES 10 and 1 1 and literature (Kόster et al, Tetrahedron 3_2, 362
(1981)) were acetylated with acetic anhydride in pyridine at the 3 '-OH, detritylated at the 5'-position with 80% acetic acid in a one-pot reaction and transformed into the 5'- triphosphates via phosphorylation with POCI3 and reaction in-situ with tetra(tri-n- butylammonium) pyrophosphate as described in EXAMPLE 14. Deprotection ofthe N - tert-butylphenoxyacetyl, the 3'-O-acetyl and the O-methyl group at the glycine residues was achieved with concentrated aqueous ammonia for ninety minutes at room temperature. Ammonia was removed by lyophilization and the residue washed with dichloromethane, solvent removed by evaporation in vacuo and the remaining solid material purified by anion-exchange chromatography on DEAE-Sephadex using a linear gradient of triethylammonium bicarbonate from 0.1 to 1.0 M. The nucleoside triphosphate containing fractions (checked by TLC on polyethyleneimine cellulose plates) were combined and lyophillized. Yield of the 8-glycyl-2'-deoxyadenosine-5'-triphosphate
(determined by UV-absorbance of the adenine moiety) was 57% (0.57 mmole). The yield for the 8-glycyl-glycyl-2'-deoxyadenosine-5'-triphosphate was 51% (0.51 mmole).
These mass-modified nucleoside triphosphates were incorporated during primer-extension in the Sanger DNA sequencing reactions. When using the corresponding N6-(4-tert-butylphenoxyacetyl)-8-glycyl- or
-glycyl-glycyl-5'-O-(4,4-dimethoxytrityl)-2',3'-dideoxyadenosine derivatives as starting materials prepared according to standard procedures (see, e.g., for the introduction of the 2', 3 '-function : Seela et al , Helvetica Chimica Acta 24, 1048- 1058 ( 1991 )) and using an analogous reaction sequence as described above, the chain-terminating mass-modified nucleoside triphosphates 8-glycyl- and 8-glycyl-glycyl-2'.3'-dideoxyadenosine-5'- triphosphates were obtained in 53 and 47% yields respectively. The mass-modified sequencing fragment ladders are analyzed by either ES or MALDI mass spectrometry.
EXAMPLE 17
Mass-modification of Sanger DNA sequencing fragment ladders by incorporation of chain-elongating 2'-deoxy- and chain-terminating 2',3'-dideoxythymidine-5'- (alpha-S-)-triphosphate and subsequent alkylation with 2-iodoethanol and 3- iodopropanoi 2',3'-Dideoxythymidine-5'-(alpha-S)-triphosphate was prepared according to published procedures (e.g., for the alpha-S-triphosphate moiety: Eckstein et al, Biochemistry 15, 1685 (1976) and Accounts Chem. Res. 12, 204 (1978) and for the 2',3'- dideoxy moiety: Seela et al, Helvetica Chimica Acta. 24, 1048-1058 (1991)). Sanger DNA sequencing reactions employing 2'-deoxythymidine-5'-(alpha-S)-triphosphate are performed according to standard protocols (e.g. Eckstein, Ann. Rev. Biochem. 5A, 367 (1985)). When using 2',3'-dideoxythymidine-5'-(alpha-S)-triphosphates, this is used instead of the unmodified 2',3'-dideoxythymidine-5'-triphosphate in standard Sanger DNA sequencing (see e.g. Swerdlow et al. Nucleic Acids Res. 18. 1415-1419 (1990)). The template (2 pmole) and the nucleic acid M13 sequencing primer (4 pmole) modified according to EXAMPLE 1 are annealed by heating to 65 C in 100 ul of 10 mM Tris-H< pH 7.5, 10 mM MgCl2, 50 mM NaCI, 7 mM dithiothreitol (DTT) for 5 min and slowly
brought to 37 C during a one hour period. The sequencing reaction mixtures contain, as exemplified for the T-specific termination reaction, in a final volume of 150 ul, 200 uM (final concentration) each of dATP, dCTP, dTTP, 300 uM c7-deaza-dGTP, 5 uM 2',3'- dideoxythymidine-5'-(alpha-S)-triphosphate and 40 units Sequenase (United States Biochemicals). Polymerization is performed for 10 min at 37 C, the reaction mixture heated to 70 C to inactivate the Sequenase, ethanol precipitated and coupled to thiolated
Sequelon membrane disks (8 mm diameter) as described in EXAMPLE 1. Alkylation is performed by treating the disks with 10 ul of 10 mM solution of either 2-iodoethanol or
3-iodopropanol in NMM (N-methylmorpholine/water/2-propanol, 2/49/49, v/v/v) (three times), washing with 10 ul NMM (three times) and cleaving the alkylated T-terminated primer-extension products off the support by treatment with DTT as described in
EXAMPLE 1. Analysis of the mass-modified fragment families is performed with either
ES or MALDI mass spectrometry.
EXAMPLE 18
Analysis of a Mixture of Oligothymidylic Acids
Oligothymidylic acid, oligo p(dT)12-18, is commercially available (United States Biochemical, Cleveland, OH). Generally, a matrix solution of 0.5 M in ethanol was prepared. Various matrices were used for this Example and Examples 19- 21 such as 3,5-dihydroxybenzoic acid, sinapinic acid, 3-hydroxypicolinic acid, 2,4,6- trihydroxyacetophenone. Oligonucleotides were lyophilized after purification by HPLC and taken up in ultrapure water (MilliQ, Millipore) using amounts to obtain a concentration of 10 pmoles/μl as stock solution. An aliquot (1 μl) of this concentration or a dilution in ultrapure water was mixed with 1 μl of the matrix solution on a flat metal surface serving as the probe tip and dried with a fan using cold air. In some experiments, cation-ion exchange beads in the acid form were added to the mixture of matrix and sample solution.
MALDI-TOF spectra were obtained for this Example and Examples 19-21 on different commercial instruments such as Vision 2000 (Finnigan-MAT), VG TofSpec (Fisons Instruments), LaserTec Research (Vestec). The conditions for this Example were
linear negative ion mode with an acceleration voltage of 25 kV. The MALDI-TOF spectrum generated is shown in FIGURE 14. Mass calibration was done externally and generally achieved by using defined peptides of appropriate mass range such as insulin, gramicidin S, trypsinogen, bovine serum albumen, and cytochrome C. All spectra were generated by employing a nitrogen laser with 5 nsec pulses at a wavelength of 337 nm.
6 7 2
Laser energy varied between 10 and 10 W/cm . To improve signal-to-noise ratio generally, the intensities of 10 to 30 laser shots were accumulated.
EXAMPLE 19
Mass Spectrometric Analysis of a 50-mer and a 99-mer
Two large oligonucleotides were analyzed by mass spectrometry. The 50- mer d (TAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGCT) (SEQ ID NO:3) and dT(pdT)99 were used. The oligodeoxynucleotides were synthesized using β-cyanoethylphosphoamidites and purified using published procedures. (e.g. N.D. Sinha, J. Biernat, J. McManus and H. Kόster, Nucleic Acids Res . 12, 4539 (1984)) employing commercially available DNA synthesizers from either Millipore (Bedford, MA) or Applied Biosystems (Foster City, CA) and HPLC equipment and RP18 reverse phase columns from Waters (Milford, MA). The samples for mass spectrometric analysis were prepared as described in Example 18. The conditions used for MALDI-MS analysis of each oligonucleotide were 500 fmol of each oligonucleotide, reflectron positive ion mode with an acceleration of 5 kV and postacceleration of 20 kV. The MALDI-TOF spectra generated were superimposed and are shown in FIGURE 15.
EXAMPLE 20
Simulation of the DNA Sequencing Results of FIGURE 2
The 13 DNA sequences representing the nested dT-terminated fragments of the Sanger DNA sequencing for the 50-mer described in Example 19 (SEQ ID NO:3) were synthesized as described in Example 19. The samples were treated and 500 fmol of each fragment was analyzed by MALDI-MS as described in Example 18. The resulting MALDI-TOF spectra are shown in FIGURE 16. The conditions were reflectron positive ion mode with an acceleration of 5 kV and postacceleration of 20 kV. Calculated molecular masses and experimental molecular masses are shown in Table 1.
The MALDI-TOF spectra were superimposed (FIGURE 17) to demonstrate that the individual peaks are resolvable even between the 10-mer and 1 1 -mer (upper panel) and the 37-mer and 38-mer (lower panel). The two panels show two different scales and the spectra analyzed at that scale.
EXAMPLE 21
MALDI-MS Analysis of a Mass-Modified Oligonucleotide A 17-mer was mass-modified at C-5 of one or two deoxyuridine moieties.
5-[ 13-(2-Methoxyethoxyl)-tridecyne- 1 -yl]-5'-O-(4,4'-dimethoxytrityl)-2'-deoxyuridine-3 '- β-cyanoethyl-N, N-diisopropylphosphoamidite was used to synthesize the modified 17- mers using the methods described in Example 19.
The modified 17-mers were
(unmodified 17-mer: molecular mass: 5273)
The samples were prepared and 500 fmol of each modified 17-mer was analyzed using MALDI-MS as described in Example 18. The conditions used were reflectron positive ion mode with an acceleration of 5 kV and postacceleration of 20 kV. The MALDI-TOF spectra which were generated were superimposed and are shown in FIGURE 18.
EXAMPLE 22
Detection of Polymerase Chain Reaction Products Containing 7-Deazapurine
MATERIALS AND METHODS PCR amplifications
The following oligodeoxynucleotide primers were either synthesized according to standard phosphoamidite chemistry (Sinha, N.D,. et al., (1983) Tetrahedron Let. Vol. 24, Pp. 5843-5846; Sinha, N.D., et al., (1984) Nucleic Acids Res, Vol. 12, Pp. 4539-4557) on a MilliGen 7500 DNA synthesizer (Millipore, Bedford, MA USA) in 200
nmol scales or purchased from MWG-Biotech (Ebersberg, Germany, primer 3) and Biometra (Goettingen, Germany, primers 6-7).
primer 1 : 5 ' - GTCACCCTCGACCTGCAG SEQ. LD. NO. 6); primer 2: 5 ' - TTGTAAAACGACGGCCAGT (SEQ. LD. NO. 7); primer 3: 5 ' - CTTCCACCGCGATGTTGA (SEQ. LD. NO. 8); primer 4: 5 ' - CAGGAAACAGCTATGAC (SEQ. LD. NO. 9); primer 5: 5 ' - GTAAAACGACGGCCAGT (SEQ. LD. NO. 10); primer 6: 5 ' - GTCACCCTCGACCTGCAgC (g: RiboG) (SEQ. LD. NO. 11); primer 7: 5 ' - GTTGTAAAACGAGGGCCAgT (g: RiboG) (SEQ. LD. NO. 12);
The 99-mer and 200-mer DNA strands (modified and unmodified) as well as the ribo- and 7-deaza-modified 100-mer were amplified from pRFcl DNA (10 ng, generously supplied S. Feyerabend, University of Hamburg) in 100 μL reaction volume containing 10 mmoiVL KCl, 10 mmol/L (NH4)2SO4, 20 mmol/L Tris HCI (pH = 8.8), 2 mmol/L MgSO4, (exo(-)Pseudococcus furiosus (Pfu) -Buffer, Pharmacia, Freiburg,
Germany), 0.2 mmol/L each dNTP (Pharmacia, Freiburg, Germany), 1 μmol/L of each primer and 1 unit of exo(-)Pfu DNA polymerase (Stratagene, Heidelberg, Germany).
For the 99-mer primers 1 and 2, for the 200-mer primers 1 and 3 and for the 100-mer primers 6 and 7 were used. To obtain 7-deazapurine modified nucleic acids, during PCR-amplification dATP and dGTP were replaced with 7-deaza-dATP and 7- deaza-dGTP. The reaction was performed in a thermal cycler (OmniGene, MWG- Biotech, Ebersberg, Germany) using the cycle: denaturation at 95 °C for 1 min., annealing at 51 °C for 1 min. and extension at 72°C for 1 min. For all PCRs the number of reaction cycles was 30. The reaction was allowed to extend for additional 10 min. at 72 °C after the last cycle
The 103-mer DNA strands (modified and unmodified) were amplified from M13mp18 RFI DNA (100 ng, Pharmacia, Freiburg, Germany) in 100 μL reaction volume using primers 4 and 5 all other concentrations were unchanged. The reaction was performed using the cycle: denaturation at 95°C for 1 min., annealing at 40°C for 1 min. and extension at 72 °C for 1 min. After 30 cycles for the unmodified and 40 cycles for
the modified 103-mer respectively, the samples were incubated for additional 10 min. at 72°C.
Synthesis of 5'- ^-P] -labeled PCR-primers Primers 1 and 4 were 5'-[ 32 -P]-labeled employing T4-polynucleotidkinase
(Epicentre Technologies) and (γ-32P)-ATP. (BLU/NGG/502A, Dupont, Germany) according to the protocols of the manufacturer. The reactions were performed substituting 10% of primer 1 and 4 in PCR with the labeled primers under otherwise unchanged reaction-conditions. The amplified DNAs were separated by gel electrophoresis on a 10% polyacrylamide gel. The appropriate bands were excised and counted on a Packard TRI-CARB 460C liquid scintillation system (Packard, CT, USA).
Primer-cleavage from ribo-modified PCR-product The amplified DNA was purified using Ultrafree-MC filter units (30,000 NMWL), it was then redissolved in 100 μl of 0.2 mol/L NaOH and heated at 95 °C for 25 minutes. The solution was then acidified with HCI (1 mol/L) and further purified for MALDI-TOF analysis employing Ultrafree-MC filter units (10,000 NMWL) as described below.
Purification of PCR products
All samples were purified and concentrated using Ultrafree-MC units 30000 NMWL (Millipore, Eschborn, Germany) according to the manufacturer's description. After lyophilisation, PCR products were redissolved in 5 μL (3 μL for the 200-mer) of ultrapure water. This analyte solution was directly used for MALDI-TOF measurements.
MALDI-TOF MS
Aliquots of 0.5 μL of analyte solution and 0.5 μL of matrix solution (0.7 mol/L 3-HPA and 0.07 mol/L ammonium citrate in acetonitrile/water (1 : 1, v/v)) were mixed on a flat metallic sample support. After drying at ambient temperature the sample was introduced into the mass spectrometer for analysis. The MALDI-TOF mass
spectrometer used was a Finnigan MAT Vision 2000 (Finnigan MAT, Bremen, Germany). Spectra were recorded in the positive ion reflector mode with a 5 keV ion source and 20 keV postacceleration. The instrument was equipped with a nitrogen laser
-8 (337 nm wavelength). The vacuum of the system was 3-4* 10 hPa in the analyzer
-7 region and 1-4* 10 hPa in the source region. Spectra of modified and unmodified DNA samples were obtained with the same relative laser power; external calibration was performed with a mixture of synthetic oligodeoxynucleotides (7-to50-mer).
RESULTS AND DISCUSSION Enzymatic synthesis of 7-deazapurine nucleotide containing nucleic acids by PCR
In order to demonstrate the feasibility of MALDI-TOF MS for the rapid, gel-free analysis of short PCR products and to investigate the effect of 7-deazapurine modification of nucleic acids under MALDI-TOF conditions, two different primer- template systems were used to synthesize DNA fragments. Sequences are displayed in Figures 24 and 25. While the two single strands of the 103-mer PCR product had nearly equal masses (Δm= 8 u), the two single strands of the 99-mer differed by 526 u.
Considering the facts that 7-deaza purine nucleotide building blocks for chemical DNA synthesis are approximately 160 times more expensive than regular ones (Product Information, Glen Research Corporation, Sterling, VA) and their application in standard β-cyano-phosphoamidite chemistry is not trivial (Product Information, Glen Research Corporation, Sterling, VA; Schneider , K and B.T. Chait (1995) Nucleic Acids Res.23, 1570) the cost of 7-deaza purine modified primers would be very high Therefore, to increase the applicability and scope of the method, all PCRs were performed using unmodified oligonucleotide primers which are routinely available
7 7
Substituting dATP and dGTP by c -dATP and c -dGTP in polymerase chain reaction led to products containing approximately 80% 7-deaza-purine modified nucleosides for the
99-mer and 103-mer; and about 90% for the 200-mer, respectively. Table II shows the base composition of all PCR products.
TABLE II
Base composition of the 99-mer, 103-mer and 200-mer PCR amplification products
(unmodified and 7-deaza purine modified)
"s" and "a" describe "sense" and "antisense" strands of the double-stranded PCR product. 2 indicates relative modification as percentage of 7-deaza purine modified nucleotides of total amount of purine nucleotides.
However, it remained to be determined whether 80-90% 7-deaza-purine modification would be sufficient for accurate mass spectrometer detection, lt was therefore important to determine whether all purine nucleotides could be substituted during the enzymatic amplification step. It was found that exo(-)Pseudococcus furiosus
(Pfu) DNA polymerase indeed could accept c 7 -dATP and c 7 -dGTP in the absence of unmodified purine triphosphates. However, the incorporation was less efficient leading to a lower yield of PCR product (Figure 26). Ethidium-bromide stains by intercalation with the stacked bases of the DNA-doublestrand. Therefore lower band intensities in the
ethidium-bromide stained gel might be artifacts since the modified DNA-strands do not necessarily need to give the same band intensities as the unmodified ones.
32 To verify these results, the PCRs with [ P]-labeled primers were repeated. The autoradiogram (Figure 27) clearly shows lower yields for the modified
PCR-products. The bands were excised from the gel and counted. For all PCR products the yield of the modified nucleic acids was about 50%, referring to the corresponding unmodified amplification product. Further experiments showed that exo(-)DeepVent and
7 7
Vent DNA polymerase were able to incorporate c -dATP and c -dGTP during PCR as well. The overall performance, however, turned out to be best for the exo(-)Pfu DNA polymerase giving least side products during amplification. Using all three polymerases,
7 7 it was found that such PCRs employing c -dATP and c -dGTP instead of their isosteres showed less side-reactions giving a cleaner PCR-product. Decreased occurrence of amplification side products may be explained by a reduction of primer mismatches due to a lower stability of the complex formed from the primer and the 7-deaza-purine containing template which is synthesized during PCR. Decreased melting point for DNA duplexes containing 7-deaza-purine have been described (Mizusawa, S. et al., (1986) Nucleic Acids Res., 14, 1319-1324). In addition to the three polymerases specified above (exo(-) Deep Vent DNA polymerase, Vent DNA polymerase and exo(-) (Pfu) DNA polymerase), it is anticipated that other polymerases, such as the Large Klenow fragment of E. coli DNA polymerase, Sequenase, Taq DNA polymerase, and U AmpliTaq, AmpliTaq or AmpliTaq TS DNA polymerase can be used. In addition, where RNA is the template, RNA polymerases, such as the SP6 or the T7 RNA polymerase, must be used
MALDI- TOF mass spectrometry of modified and unmodified PCR products.
The 99-mer, 103-mer and 200-mer PCR products were analyzed by MALDI-TOF MS. Based on past experience, it was known that the degree of depurination depends on the laser energy used for desorption and ionization of the analyte. Since the influence of 7-deazapurine modification on fragmentation due to
depurination was to be investigated, all spectra were measured at the same relative laser energy.
Figures 28a and 28b show the mass spectra of the modified and unmodified 103-mer nucleic acids. In case of the modified 103-mer, fragmentation
+ causes a broad (M+H) signal. The maximum of the peak is shifted to lower masses so
+ that the assigned mass represents a mean value of (M+H) signal and signals of
+ fragmented ions, rather than the (M+H) signal itself. Although the modified 103-mer still contains about 20% A and G from the oligonucleotide primers, it shows less fragmentation which is featured by much more narrow and symmetric signals. Especially peak tailing on the lower mass side due to depurination, is substantially reduced. Hence, the difference between measured and calculated mass is strongly reduced although it is
+ still below the expected mass. For the unmodified sample a (M+H) signal of 31670 was observed, which is a 97 u or 0.3% difference to the calculated mass. While, in case of the modified sample this mass difference diminished to 10 u or 0.03% (31713 u found, 31723 u calculated). These observations are verified by a significant increase in mass resolution
+ of the (M+H) signal of the two signal strands (m/Δm = 67 as opposed to 18 for the unmodified sample with Δm = full width at half maximum, fwhm). Because of the low mass difference between the two single strands (8 u) their individual signals were not resolved. With the results of the 99 base pair DNA fragments the effects of increased mass resolution for 7-deazapurine containing DNA becomes even more evident. The two single strands in the unmodified sample were not resolved even though the mass difference between the two strands of the PCR product was very high with 526 u due to unequal distribution of purines and pyrimidines (figure 29a). In contrast to this, the modified DNA showed distinct peaks for the two single strands (figure 29b) which makes the superiority of this approach for the determination of molecular weights to gel electrophoretic methods even more profound. Although base line resolution was not obtained the individual masses were abled to be assigned with an accuracy of 0.1%: Δm = 27 u for the lighter (calc. mass = 30224 u) and Δm = 14 u for the heavier strand (calc. mass = 30750 u). Again, it was found that the full width at half maximum was substantially decreased for the 7-deazapurine containing sample.
In case of both the 99-mer and 103-mer the 7-deazapurine containing nucleic acids seem to give higher sensitivity despite the fact that they still contain about
20% unmodified purine nucleotides. To get comparable signal-to-noise ratio at similar
+ intensities for the (M+H) signals, the unmodified 99-mer required 20 laser shots in contrast to 12 for the modified one and the 103-mer required 12 shots for the unmodified sample as opposed to three for the 7-deazapurine nucleoside-containing PCR product.
Comparing the spectra of the modified and unmodified 200-mer amplicons, improved mass resolution was again found for the 7-deazapurine containing sample as well as increased signal intensities (figures 30a and 30b). While the signal of the single strands predominates in the spectrum of the modified sample the DNA-suplex and dimers of the single strands gave the strongest signal for the unmodified sample
A complete 7-deaza purine modification of nucleic acids may be achieved either using modified primers in PCR or cleaving the unmodified primers from the partially modified PCR product. Since disadvantages are associated with modified primers, as described above, a 100-mer was synthesized using primers with a ribo- modification The primers were cleaved hydrolytically with NaOH according to a method developed earlier in our laboratory (Koester, H. et al , Z Physiol. Chem. 359 1570- 1589) Figures 31 a and 3 lb display the spectra of the PCR product before and after primer cleavage. Figure 3 lb shows that the hydrolysis was successful" Both hydrolyzed PCR product as well as the two released primers could be detected together with a small signal from residual uncleaved 100-mer This procedure is especially useful for the MALDI-TOF analysis of very short PCR-products since the share of unmodified purines originating from the primer increases with decreasing length of the amplified sequence
The remarkable properties of 7-deazapurine modified nucleic acids can be explained by either more effective desorption and/or ionization, increased ion stability and/or a lower denaturation energy of the double stranded purine modified nucleic acid The exchange of the N-7 for a methine group results in the loss of one acceptor for a hydrogen bond which influences the ability of the nucleic acid to form secondary structures due to non-Watson-Crick base pairing (Seela, F and A Kehne (1987) Biochemistry, 26, 2232-2238.), which should be a reason for better desorption during the MALDI process In addition to this the aromatic system of 7-deazapurine has a lower
electron density that weakens Watson-Crick base pairing resulting in a decreased melting point (Mizusawa, S. et al., (1986) Nucleic Acids Res., 14, 1319-1324) of the double- strand. This effect may decrease the energy needed for denaturation of the duplex in the MALDI process. These aspects as well as the loss of a site which probably will carry a positive charge on the N-7 nitrogen renders the 7-deazapurine modified nucleic acid less polar and may promote the effectiveness of desorption.
Because of the absence of N-7 as proton acceptor and the decreased polarizaiton of the C-N bond in 7-deazapurine nucleosides depurination following the mechanisms established for hydrolysis in solution is prevented. Although a direct correlation of reactions in solution and in the gas phase is problematic, less fragmentation due to depurination of the modified nucleic acids can be expected in the MALDI process. Depurination may either be accompanied by loss of charge which decreases the total yield of charged species or it may produce charged fragmentation products which decreases the intensity of the non fragmented molecular ion signal. The observation of both increased sensitivity and decreased peak tailing of the (M+H) signals on the lower mass side due to decreased fragmentation of the 7- deazapurine containing samples indicate that the N-7 atom indeed is essential for the mechanism of depurination in the MALDI-TOF process In conclusion, 7-deazapurine containing nucleic acids show distinctly increased ion-stability and sensitivity under MALDI-TOF conditions and therefore provide for higher mass accuracy and mass resolution.
EXAMPLE 23
Solid State Sequencing and Mass Spectrometer Detection
MATERIALS AND METHODS
Oligonucleotides were purchased from Operon Technologies (Alameda, CA) in an unpurified form. Their sequences are listed in Table III. Sequencing reactions were performed on a solid surface using reagents from the sequencing kit for Sequenase Version 2.0 (Amersham, Arlington Heights, Illinois).
Sequencing a 39-mer target Sequencing complex:
5 ' -TCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA- (Ab) a-3 ' (DNA11683) (SEQ. LD. No. 13)
3 ' TCAACACTGCATGT-5 ' (PNA16/DNA) (SEQ. LD. No. 14)
In order to perform solid-state DNA sequencing, template strand DNA11683 was 3'-biotinylated by terminal deoxynucleotidyl transferase. A 30 μl reaction, containing 60 pmol of DNA1 1683, 1.3 nmol of biotin 14-dATP (GLBCO BRL, Grand Island, NY), 30 units of terminal transferase (Amersham, Arlington Heights, Illinois), and lx reaction buffer (supplied with enzyme), was incubated at 37°C for 1 hour. The reaction was stopped by heat inactivation of the terminal transferase at 70 °C for 10 min. The resulting product was desalted by passing through a TE-10 spin column (Clonetech). More than one molecules of biotin- 14-d ATP could be added to the 3 '-end of DNA1 1683. The biotinylated DNA1 1683 was incubated with 0.3 mg of Dynal streptavidin beads in 30 μl lx binding and washing buffer at ambient temperature for 30 min. The beads were washed twice with TE and redissolved in 30 μl TE, 10 μl aliquot (containing 0.1 mg of beads) was used for sequencing reactions. The 0.1 mg beads from previous step were resuspended in a lOμl volume containing 2 μl of 5x Sequenase buffer (200 mM Tris-HCI, pH 7.5, 100 mM MgC12, and 250 mM NaCI) from the Sequenase kit and 5 pmol of corresponding primer PNA16/DNA. The annealing mixture was heated to 70 °C and allowed to cool slowly to room temperature over a 20-30 min time period. Then 1 μl 0.1 M dithiothreitol solution, 1 μl Mn buffer (0.15 M sodium isocitrate and 0.1 M McC 12), and 2 μl of diluted
Sequenase (3.25 units) were added. The reaction mixture was divided into four aliquots of 3 μl each and mixed with termination mixes (each consists of 3 μl of the appropriate termination mix: 32 μM c7dATP, 32 μM dCTP, 32 μM c7dGTP, 32 μM dTTP and 3.2 μM of one of the four ddTNPs, in 50 mM NaCI). The reaction mixtures were incubated at 37°C for 2 min. After the completion of extension, the beads were precipitated and
the supernatant was removed. The beads were washed twice and resuspended in TE and kept at 4°C.
Sequencing a 78-mer target Sequencing complex:
5'-AAGATCTGACCAGGGATTCGGTTAGCGTGACTGCTGCTGCTGCTGCTGCTGC TGGATGATCCGACGCATCAGATCTGG- (Ab)n-3 (SEQ. LD. NO.15)(TNR.PLASM2) 3'-CTACTAGGCTGCGTAGTC-5' (CM1) (SEQ. LD.NO.16)
The target TNR.PLASM2 was biotinylated and sequenced using procedures similar to those described in previous section (sequencing a 39-mer target).
Sequencing a 15-mer target with partially duplex probe
Sequencing complex:
5 3' (SEQ. LD. No.17)
'-F-GATGATCCGACGCATCACAGCTC
3 ' 3 ' (SEQ. ID. No. 18) -b-CTACTAGGCTGCGTAGTGTCGAGAACCTTGGCT
CM1B3B was immobilized on Dynabeads M280 with streptavidin (Dynal, Norway) by incubating 60 pmol of CM1B3B with 0.3 magnetic beads in 30 μl 1M NaCI and TE (lx binding and washing buffer) at room temperature for 30 min. The beads were washed twice with TE and redissolved in 30 μl TE, 10 or 20 μl aliquot (containing 0.1 or 0.2 mg of beads respectively) was used for sequencing reactions.
The duplex was formed by annealing corresponding aliquot of beads from previous step with 10 pmol of DFl la5F (or 20 pmol of DFl la5F for 0.2 mg of beads) in a 9 μl volume containing 2 μl of 5x Sequenase buffer (200 mM Tris-HCI, pH 7.5, 100 mM MgCll, and 250 mM NaCI) from the Sequenase kit. The annealing mixture was heated to 65 °C and allowed to cool slowly to 37°C over a 20-30 min time period. The duplex primer was then mixed with 10 pmol of TSlo (20 pmol of TS10 for 0.2 mg of beads) in 1 μl volume, and the resulting mixture was further incubated at 37° C for 5 min, room temperature for 5-10 min. Then 1 μl 0.1 M dithiothreitol solution, 1 μl Mn buffer (0.15 M sodium isocitrate and 0.1 M MnCl2), and 2 μl of diluted Sequenase (3.25 units) were added. The reaction mixture was divided into four aliquots of 3 μl each and mixed
with termination mixes (each consists of 4 μl of the appropriate termination mix: 16 μM dATP, 16 μM dCTP, 16 μM dGTP, 16 μM dTTP and 1.6 μM of one of the four ddNTPs, in 50 mM NaCI). The reaction mixtures were incubated at room temperature for 5 min, and 37°C for 5 min. After the completion of extension, the beads were precipitated and the supernatant was removed. The beads were resuspended in 20 μl TE and kept at 4°C. An aliquot of 2 μl (out of 20 μl) from each tube was taken and mixed with 8 μl of formamide, the resulting samples were denatured at 90-95 °C for 5 min and 2 μl (out of 10 μl total) was applied to an ALF DNA sequencer (Pharmacia, Piscataway, NJ) using a 10% polyacrylamide gel containing 7 M urea and 0.6x TBE. The remaining aliquot was used for MALDI-TOFMS analysis.
MALDI sample preparation and instrumentation
Before MALDI analysis, the sequencing ladder loaded magnetic beads were washed twice using 50 mM ammonium citrate and resuspended in 0.5 μl pure water. The suspension was then loaded onto the sample target of the mass spectrometer and 0.5 μl of saturated matrix solution (3-hydropicolinic acid (HPA): ammonium citrate
= 10 1 mole ratio in 50% acetonitrile) was added. The mixture was allowed to dry prior to mass spectometer analysis.
The reflectron TOFMS mass spectrometer (Vision 2000, Finnigan MAT, Bremen, Germany) was used for analysis. 5 kV was applied in the ion source and 20 kV was applied for postacceleration. All spectra were taken in the positive ion mode and a nitrogen laser was used. Normally, each spectrum was averaged for more than 100 shots and a standard 25-point smoothing was applied.
RESULTS AND DISCUSSIONS
Conventional solid-state sequencing
In conventional sequencing methods, a primer is directly annealed to the template and then extended and terminated in a Sanger dideoxy sequencing. Normally, a biotinylated primer is used and the sequencing ladders are captured by streptavidin- coated magnetic beads. After washing, the products are eluted from the beads using
EDTA and formamide. However, our previous findings indicated that only the annealed
strand of a duplex is desorbed and the immobilized strand remains on the beads (Tang, K et al , (1995) Nucleic Acids Research 23:3126-3131 ). Therefore, it is advantageous to immobilize the template and anneal the primer. After the sequencing reaction and washing, the beads with the immobilized template and annealed sequencing ladder can be loaded directly onto the mass spectrometer target and mix with matrix In MALDI, only the annealed sequencing ladder will be desorbed and ionized, and the immobilized template will remain on the target
A 39-mer template (SEQ LD No. 13) was first biotinylated at the 3' end by adding biotin- 14-d ATP with terminal transferase More than one biotin- 14-d ATP molecule could be added by the enzyme However, since the template was immobilized and remained on the beads during MALDI, the number of biotin- 14-dATP would not affect the mass spectra A 14-mer primer (SEQ. LD No 14) was used for the solid-state sequencing MALDI-TOF mass spectra of the four sequencing ladders are shown in Figure 32, and the expected theoretical values are shown in Table III. The sequencing reaction produced a relatively homogenous ladder, and the full-length sequence was determined easily. One peak around 5150 appeared in all reactions are not identified A possible explanation is that a small portion of the template formed some kind of secondary structure, such as a loop, which hindered sequenase extension Mis- incorporation is of minor importance, since the intensity of these peaks were much lower than that of the sequencing ladders Although 7-deaza purines were used in the sequencing reaction, which could stabilize the N-glycosidic bond and prevent depurination, minor base losses were still observed since the primer was not substituted by 7-deazapurines The full length ladder, with a ddA at the 3' end, appeared in the A reaction with an apparent mass of 1 1899 8 However, a more intense peak of 122 appeared in all four reactions and is likely due to an addition of an extra nucleotide by the
TABLE I I I CONTINUED
A-reaetion C-reaction G-reaction T-reaction i
1.
2. 4223.8 4223.8 4223.8 4223 . 8
3. 4521.1
4. 4809.2
5. 5122.4
6. 5434.6
7. 5737 . 8
8. 6051.1
9. 6379.2
10. 6704.4
11. 6995.6
12. 7284.8 ro
13. 7574.0
14. 7878 . 2
15. 8207.4
16. 8495.6
17. 8808.8
18. 9097.0
19. 9386.2
20. 9699.4
21. 10027.6
22. 10355.8
23. 10644.0
24. 10933.2
25. 11246.4 π
26. 11574.6
27. 11886.8 CΛ vβ -J * δ- w
The same technique could be used to sequence longer DNA fragments. A 78-mer template containing a CTG repeat (SEQ. ID. No. 15) was 3'-biotinylated by adding biotin- 14-d ATP with terminal transferase. An 18-mer primer (SEQ. ID. No. 16) was annealed right outside the CTG repeat so that the repeat could be sequenced immediately after primer extension. The four reactions were washed and analyzed by MALDI-TOFMS as usual. An example of the G-reaction is shown in Figure 33 and the expected sequencing ladder is shown in Table IV with theoretical mass values for each ladder component. All sequencing peaks were well resolved except the last component (theoretical value 20577.4) was indistinguishable from the background. Two neighboring sequencing peaks (a 62-mer and a 63-mer) were also separated indicating that such sequencing analysis could be applicable to longer templates. Again, an addition of an extra nucleotide by the Sequenase enzyme was observed in this spectrum. This addition is not template specific and appeared in all four reactions which makes it easy to be identified. Compared to the primer peak, the sequencing peaks were at much lower intensity in the long template case. Further optimization of the sequencing reaction may be required.
35 . 3 ' -CAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5 '
36. 3'-CCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
37. 3'-GCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5 ' 38. 3'-AGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
39. 3'-AAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
40. 3'-TAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
41. 3 ' -CTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
42. 3'-CCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5 ' 43. 3'-CCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
44. 3 '-TCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5 '
45. 3*-GTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5' 46. 3 '-GGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5 ' 47. 3 ' -TGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5 ' 48. 3 ' -CTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
59. 3' -ACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
50. 3'-GACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
51. 3 '-AGACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
52. 3'-TAGACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5' 53. 3 -CTAGACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
54. 3'-TCTAGACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5' 55.3'-TTCTAGACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5'
TABLE IV (Continued) ddAT ddCTP ddGTP ddTTP
1. 5491.6 5491.6 5491.6 5491.6
2. 5764.8
3. 6078.0
4. 6407.2
5. 6696.4
6. 7009.6
7. 7338.8
8. 7628.0
9. 7941.2
10. 8270.4
11. 8559.6
12. 8872.8
13. 9202.0
14. 9491.2
15. 9804.4
16. 10133.6
17. 10422.8
18. 10736.0
19. 11065.2
20. 11354.4
21. 11667.6
22. 1196.8
23. 12286.0
24. 12599.2
25. 12928.4
26. 13232.6
27. 13521.8
28. 13835.0
29. 14124.2
30. 14453.4
31. 14742.6
32. 15046.8
33. 15360.0
34. 15673.2
35. 15962.4
36. 16251.6
37. 16580.8
38. 16894.0
39. 17207.2
40. 17511.4
41. 17800.6
42. 18089.8
43. 18379.0
44. 18683.2
45. 19012.4
46. 19341.6
47. 19645.8
48. 19935.0
49. 20248.2
50. 20577.4
51. 20890.6
52. 21194.8
53. 21484.0
54. 21788.2
55. 22092.4
Sequencing using duplex DNA probes for capturing andpriming Duplex DNA probes with single-stranded overhang have been demonstrated to be able to capture specific DNA templates and also serve as primers for solid-state sequencing. The scheme is shown in Figure 34. Stacking interactions between a duplex probe and a single- stranded template allow only 5-base overhand to be sufficient for capturing. Based on this format, a 5' fluorescent-labeled 23-mer (5*-GAT GAT CCG ACG CAT CAC AGC TC) (SEQ. ID. No. 19) was annealed to a 3'-biotinylated 18-mer (5'-GTG ATG CGT CGG ATC ATC) (SEQ. ID. NO. 20), leaving a 5-base overhang. A 15-mer template (5'-TCG GTT CCA AGA GCT) (SEQ ID. No. 21) was captured by the duplex and sequencing reactions were performed by extension of the 5-base overhang MALDI-TOF mass spectra of the reactions are shown in Figure 35 All sequencing peaks were resolved although at relatively low intensities. The last peak in each reaction is due to unspecific addition of one nucleotide to the full length extension product by the Sequenase enzyme. For comparison, the same products were run on a conventional DNA sequencer and a stacking fluorogram of the results is shown in Figure 36. As can be seen from the Figure, the mass spectra had the same pattern as the fluorogram with sequencing peaks at much lower intensity compared to the 23-mer primer.
Improvements of MALDI- TOF mass spectrometry as a detection technique Sample distribution can be made more homogenous and signal intensity could potentially be increased by implementing the picoliter vial technique. In practice, the samples can be loaded on small pits with square openings of 100 um size. The beads used in the solid- state sequencing is less than 10 um in diameter, so they should fit well in the microliter vials Microcrystals of matrix and DNA containing "sweet spots" will be confined in the vial Since the laser spot size is about 100 μm in diameter, it will cover the entire opening of the vial. Therefore, searching for sweet spots will be unnecessary and high repetition-rate laser (e.g. >10Hz) can be used for acquiring spectra. An earlier report has shown that this device is capable of increasing the detection sensitivity of peptides and proteins by several orders of magnitude compared to conventional MALDI sample preparation technique.
Resolution of MALDI on DNA needs to be further improved in order to extend the sequencing range beyond 100 bases. Currently, using 3-HP A/ammonium citrate as matrix and a reflectron TOF mass spectrometer with 5kV ion source and 20 kV postacceleration, the resolution of the run-through peak in Figure 33 (73-mer) is greater than 200 (FWHM) which is enough for sequence determination in this case. This resolution is also the highest reported for
MALDI desorbed DNA ions above the 70-mer range. Use of the delayed extraction technique may further enhance resolution.
All of the above-cited references and publications are hereby incoφorated by reference. EQUIVALENTS
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, numerous equivalents to the specific procedures described herein. Such equivalents are considered to be within the scope of this invention and are covered by the following claims.
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Koster, Hubert
(ii) TITLE OF INVENTION: DNA SEQUENCING BY MASS SPECTROMETRY
(iii) NUMBER OF SEQUENCES: 21
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Patent Group
Foley, Hoag & Eliot LLP
(B) STREET: One Post Office Square (C) CITY: Boston
(D) STATE: MA
(E) COUNTRY: USA
(F) ZIP: 02109-2170
(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: ASCII (text)
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE: 18-MAR-1997
(C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 08/617,010
(B) FILING DATE: 18-MAR-1996
(viii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: 08/178,216
(B) FILING DATE: 06-JAN-1994
(C) CLASSIFICATION:
(ix) ATTORNEY/AGENT INFORMATION:
(A) NAME: Arnold, Beth E.
(B) REGISTRATION NUMBER: 35,430
(C) REFERENCE/DOCKET NUMBER: SQA-3.25.27
(X) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (617) 832-1294
(B) TELEFAX: (617) 832-7000
(2) INFORMATION FOR SEQ ID NO: 1 :
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 14 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL: YES
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l:
CATGCCATGG CATG 14
(2) INFORMATION FOR SEQ ID NO:2 :
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL: YES
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
AAATTGTGCA CATCCTGCAG C 21
(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 50 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL: YES
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
TAACGGTCAT TACGGCCATT GACTGTAGGA CCTGCATTAC ATGACTAGCT 50
(2) INFORMATION FOR SEQ ID NO: 4 :
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL: YES
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4
TAAAACGACG GGCCAGXG 17
(2) INFORMATION FOR SEQ ID NO: 5
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 base pairs (B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL: YES
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
XAAAACGACG GGCCAGXG 17
(2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
GTCACCCTCG ACCTGCAG 18
(2) INFORMATION FOR SEQ ID NO:7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
TTGTAAAACG ACGGCCAGT 19
(2) INFORMATION FOR SEQ ID NO: 8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CTTCCACCGC GATGTTGA 18
(2) INFORMATION FOR SEQ ID NO: 9:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CAGGAAACAG CTATGAC 17
(2) INFORMATION FOR SEQ ID NO: 10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
GTAAAACGAC GGCCAGT 17
(2) INFORMATION FOR SEQ ID NO:11:
(l) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: mιsc_feature
(B) LOCATION: 1..19
(D) OTHER INFORMATION: /note= "All lowercase letters represent RiboG"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GTCACCCTCG ACCTGCAgC 19
(2) INFORMATION FOR SEQ ID NO: 12:
(l) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(n) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: mιsc_feature (B) LOCATION: 1..20
(D) OTHER INFORMATION: /note= "All lowercase letters represent RiboG"
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GTTGTAAAAC GAGGGCCAgT 20
(2) INFORMATION FOR SEQ ID NO: 13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
TCTGGCCTGG TGCAGGGCCT ATTGTAGTTG TGACGTACA 39
(2) INFORMATION FOR SEQ ID NO: 14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14
TCAACACTGC ATGT 14
(2) INFORMATION FOR SEQ ID NO: 15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 78 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
AAGATCTGAC CAGGGATTCG GTTAGCGTGA CTGCTGCTGC TGCTGCTGCT GCTGGATGAT 60
CCGACGCATC AGATCTGG 78
(2) INFORMATION FOR SEQ ID NO: 16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
CTACTAGGCT GCGTAGTC
(2) INFORMATION FOR SEQ ID NO: 17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid (C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
GATGATCCGA CGCATCACAG CTC 23
(2) INFORMATION FOR SEQ ID NO: 18:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
CTACTAGGCT GCGTAGTGTC GAGAACCTTG GCT 33
(2) INFORMATION FOR SEQ ID NO: 19:
(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
GATGATCCGA CGCATCACAG CTC 23
(2) INFORMATION FOR SEQ ID NO: 20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:
GTGATGCGTC GGATCATC 18
(2) INFORMATION FOR SEQ ID NO:21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: cDNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:
TCGGTTCCAA GAGCT 15
Claims
1. A method for determining the sequence of a nucleic acid, comprising the steps of:
a) generating at least two base-specifically terminated nucleic acid fragments containing modified purine nucleotides that are relatively resistant to fragmentation during mass spectrometry;
b) determining the molecular weight value of each base-specifically terminated fragment by mass spectrometry, wherein the molecular weight values of at least two base-specifically terminated fragments are determined concurrently; and
c) determining the sequence of the nucleic acid by aligning the base-specifically terminated nucleic acid fragments according to molecular weight.
2. The method according to claim 1 wherein the nucleic acid fragments are purified before the step of determining the molecular weight values by mass spectrometry.
3. The method according to claim 2 wherein the nucleic acid fragments are purified, comprising the steps of:
a) reversibly immobilizing the nucleic acid fragments on a solid support; and
b) washing out all remaining reactants and by-products.
4. The method according to claim 3, further comprising the step of removing the nucleic acid fragments from the solid support
5. The method of claim 1, wherein the fragments contain deazapurine moieties.
6. The method of claim 1, wherein the deaza purine moieties are selected from the group consisting of: C7-deazaadenine, C7-deazaguanine, 7-deazainosine triphosphate, C9-deazaadenine, C9-deazaguanine and C9-deazainosine triphosphate.
7. The method of claim 1, wherein at least about 50% of the purine nucleotides are modified within the nucleotide fragment.
8. A process of claim 1 wherein the mass spectrometer is selected from the group consisting of: Matrix- Assisted Laser Desorption/ionization Time-of-Flight (MALDI-TOF), Electrospray (ES), Ion Cyclotron Resonance (ICR),and Fourier Transform and combinations thereof.
9. The method according to claim 1, wherein more than one species of nucleic acid are concurrently sequenced by multiplex mass spectrometric nucleic acid sequencing employing nucleic acid primers, chain-elongating nucleotides, and chain-terminating nucleotides, wherein one of the sets of base-specifically terminated fragments is unmodified and the other sets of base-specifically terminated nucleic acid fragments are mass modified, and each of the sets of base-specifically terminated nucleic acid fragments has a sufficient mass difference to be distinguished from the others by mass spectrometry
10. The method according to claim 9, wherein at least one of the sets of mass- modified base-specifically terminated fragments is modified with a mass- modifying functionality at a heterocyciic base of at least one nucleotide.
11. The method according to claim 10, wherein the heterocyciic base-modified nucleotide is selected from the group consisting of a cytosine nucleotide modified at C-5, a thymine nucleotide modified at C-5, a thymine nucleotide
modified at the C-5 methyl group, a uracil nucleotide modified at C-5, an adenine nucleotide modified at C-8, an adenine nucleotide modified at C-7, a c7- deazaadenine modified at C-8, a c7-deazaadenine modified at C-7, a guanine nucleotide modified at C-8, a guanine nucleotide modified at C-7, a c7- deazaguanine modified at C-8, a c7-deazaguanine modified at C-7, a hypoxanthine modified at C-8, a c7-deazahypoxanthine modified at C-7, and a c7- deazahypoxanthine modified at C-8.
12. The method according to claim 9, wherein at least one of the sets of mass- modified base-specifically terminated nucleic acid fragments is modified with a mass-modifying functionality attached to one or more phosphate moieties of the internucleotidic linkages of the fragments.
13. The method according to claim 9, wherein at least one of the sets of mass- modified base-specifically terminated nucleic acid fragments is modified with a mass-modifying functionality attached to one or more sugar moieties of nucleotides within the set of mass modified base-specifically terminated fragments at at least one sugar position selected from the group consisting of a C-2' position, an external C- 3' position, and an external C-5' position.
14. The method according to claim 9, wherein at least one of the sets of mass- modified base-specifically terminated nucleic acid fragments is modified with a mass-modifying functionality (M) attached to the sugar moiety of a 5 '-terminal nucleotide and wherein the mass-modifying function (M) is the linking functionality (L).
15 The method according to claim 9, wherein a mass-modifying functionality (M) is attached to a set of base-specifically terminated nucleic acid fragments subsequent to generating the base-specifically terminated nucleic acid fragments and prior to determining the molecular weight values for the nested fragments by mass spectrometry.
16. The method according to claim 15, wherein the base-specifically terminated nucleic acid fragments are generated using at least one reagent selected from the group consisting of a nucleic acid primer, a chain-elongating nucleotide, a chain- terminating nucleotide and a tag probe which has been modified with a precursor of the mass-modifying functionality, M; and a subsequent step comprises modifying the precursor of the mass-modifying functionality to generate the mass-modifying functionality, M, prior to mass spectrometric analysis.
17. The method according to claim 9, wherein mass differentiation of the tag probes is achieved by changing the nucleotide composition of at least one of the tag probes and complementary tag sequence in the species of nucleic acid.
18. The method according to claim 9, wherein the tag probes are covalently bound to the corresponding complementary tag sequence prior to mass spectrometric analysis.
19. The method according to claim 18, wherein binding between the tag probes and the corresponding complementary tag sequences is achieved photochemically via photoactivatable groups.
20. A method of sequencing a nucleic acid, comprising the steps of: a) reversibly linking an oligonucleotide primer to a solid support;
b) generating at least two base-specifically terminated nucleic acid fragments containing nucleotides that are relatively resistant to fragmentation during mass spectrometry;
c) determining the molecular weight value of each nested fragment in each of the four sets of base-specifically terminated fragments of the nucleic acid by matrix assisted laser desorption/ionization mass spectrometry wherein the molecular weight values of at least two base-specifically terminated
fragments are determined concurrently and wherein the nested fragments are cleaved from the solid support by a laser during mass spectrometry; and
d) determining the nucleotide sequence by aligning the base specifically terminated fragments according to molecular weight
21. The method according to claim 20 wherein the nucleic acid fragments are purified before the step of determining the molecular weight values by mass spectrometry
22 The method according to claim 21 wherein the nucleic acid fragments are purified, comprising the steps of
a) reversibly immobilizing the nucleic acid fragments on a solid support; and
b) washing out all remaining reactants and by-products
23 The method according to claim 22, further comprising the step of removing the nucleic acid fragments from the solid support.
24 The method of claim 20, wherein the fragments contain deazapurine moieties
25 The method of claim 20, wherein the deaza purine moieties are selected
7 7 from the group consisting of. C -deazaadenine, C -deazaguanine, 7-
9 . 9 9 deazainosine triphosphate, C -deazaadenine, C -deazaguanine and C - deazainosine triphosphate.
26 The method of claim 20, wherein at least about 50% of the purine nucleotides are modified within the nucleotide fragment
27. A process of claim 20 wherein the mass spectrometer is selected from the group consisting of: Matrix-Assisted Laser Desorption/ionization Time-of-Flight (MALDI-TOF), Electrospray (ES), Ion Cyclotron Resonance (ICR), and Fourier Transform and combinations thereof.
28. The method according to claim 20, wherein more than one species of nucleic acid are concurrently sequenced by multiplex mass spectrometric nucleic acid sequencing employing nucleic acid primers, chain-elongating nucleotides, and chain-terminating nucleotides, wherein one of the sets of base-specifically terminated fragments is unmodified and the other sets of base-specifically terminated nucleic acid fragments are mass modified, and each of the sets of base-specifically terminated nucleic acid fragments has a sufficient mass difference to be distinguished from the others by mass spectrometry.
29. The method according to claim 28, wherein at least one of the sets of mass- modified base-specifically terminated fragments is modified with a mass- modifying functionality (M) at a heterocyciic base of at least one nucleotide.
30. The method according to claim 29, wherein the heterocyciic base-modified nucleotide is selected from the group consisting of a cytosine nucleotide modified at C-5, a thymine nucleotide modified at C-5, a thymine nucleotide modified at the C-5 methyl group, a uracil nucleotide modified at C-5, an adenine
7 nucleotide modified at C-8, an adenine nucleotide modified at C-7, a c -
7 deazaadenine modified at C-8, a c -deazaadenine modified at C-7, a guanine
7 nucleotide modified at C-8, a guanine nucleotide modified at C-7, a c -
7 deazaguanine modified at C-8, a c -deazaguanine modified at C-7, a
7 hypoxanthine modified at C-8, a c -deazahypoxanthme modified at C-7, and a
7 c -deazahypoxanthine modified at C-8.
31. The method according to claim 28, wherein at least one of the sets of mass- modified base-specifically terminated nucleic acid fragments is modified with a
mass-modifying functionality (M) attached to one or more phosphate moieties of the internucleotidic linkages of the fragments.
32. The method according to claim 28, wherein at least one of the sets of mass- modified base-specifically terminated nucleic acid fragments is modified with a mass-modifying functionality (M) attached to one or more sugar moieties of nucleotides within the set of mass modified base-specifically terminated fragments at at least one sugar position selected from the group consisting of a C-21 position, an external C- 3' position, and an external C-5' position.
33. The method according to claim 28, wherein at least one of the sets of mass- modified base-specifically terminated nucleic acid fragments is modified with a mass-modifying functionality (M) attached to the sugar moiety of a 5'-terminal nucleotide and wherein the mass-modifying function (M) is the linking functionality (L) .
34. The method according to claim 28, wherein a mass-modifying functionality (M) is attached to a set of base-specifically terminated nucleic acid fragments subsequent to generating the base-specifically terminated nucleic acid fragments and prior to determining the molecular weight values for the nested fragments by mass spectrometry.
35. The method according to claim 34, wherein the base-specifically terminated nucleic acid fragments are generated using at least one reagent selected from the group consisting of a nucleic acid primer, a chain-elongating nucleotide, a chain-terminating nucleotide and a tag probe which has been modified with a precursor of the mass-modifying functionality, M; and a subsequent step comprises modifying the precursor of the mass-modifying functionality, M, to generate the mass-modifying functionality, M, prior to mass spectrometric analysis.
36. The method according to claim 28, wherein mass differentiation of the tag probes is achieved by changing the nucleotide composition of at least one of the tag probes and complementary tag sequence in the species of nucleic acid.
37. The method according to claim 28, wherein the tag probes are covalently bound to the corresponding complementary tag sequence prior to mass spectrometric analysis.
38. The method according to claim 37, wherein binding between the tag probes and the corresponding complementary tag sequences is achieved photochemically via photoactivatable groups.
39. A method of multiplex analysis of nucleic acid sequences, comprising the steps of: a) reversibly linking a nucleic acid primer to a solid support; b) generating at least two conditioned, base-specifically terminated nucleic acid fragments containing modified purine nucleotides that are relatively resistant to fragmentation during mass spectrometry;
c) determining the molecular weight value of each fragment by matrix assisted laser desorption/ionization mass spectrometry wherein the molecular weight values of at least two base-specifically terminated fragments are determined concurrently and wherein the fragments are cleaved from the solid support by a laser during mass spectrometry; and
d) determining the nucleotide sequence by aligning the fragments according to molecular weight; wherein at least one reagent selected from a group consisting of, a nucleic acid primer, a chain-elongating nucleotide, and a chain-terminating nucleotide which has been mass-modified; wherein each set of base-specifically terminated fragments has a sufficient mass difference from the other sets of base- specifically terminated fragments so as to be unique; and wherein the molecular
weight values of the nested fragments of two or more sets of unseparated base- specifically terminated fragments are determined concurrently.
40 The method according to claim 39, wherein the reversible linkage is aphotocleavable bond.
41. The method according to claim 39 wherein the base-specifically terminated fragments are cleaved from the solid support prior to mass spectrometry.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU22175/97A AU2217597A (en) | 1996-03-18 | 1997-03-18 | Dna sequencing by mass spectrometry |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US08/617,010 US6194144B1 (en) | 1993-01-07 | 1996-03-18 | DNA sequencing by mass spectrometry |
| US08/617,010 | 1996-03-18 | ||
| US54783596A | 1996-08-20 | 1996-08-20 | |
| US5/547,835 | 1996-08-20 |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| WO1997037041A2 WO1997037041A2 (en) | 1997-10-09 |
| WO1997037041A3 WO1997037041A3 (en) | 1997-12-04 |
| WO1997037041A9 true WO1997037041A9 (en) | 1997-12-31 |
Family
ID=27068672
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US1997/004394 WO1997037041A2 (en) | 1996-03-18 | 1997-03-18 | Dna sequencing by mass spectrometry |
Country Status (2)
| Country | Link |
|---|---|
| AU (1) | AU2217597A (en) |
| WO (1) | WO1997037041A2 (en) |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6949633B1 (en) | 1995-05-22 | 2005-09-27 | Sequenom, Inc. | Primers useful for sizing nucleic acids |
| US6991903B2 (en) | 1992-11-06 | 2006-01-31 | Sequenom, Inc. | Solid phase sequencing of double-stranded nucleic acids |
| US7108974B2 (en) | 2001-03-02 | 2006-09-19 | Isis Pharmaceuticals, Inc. | Method for rapid detection and identification of bioagents |
| US7217510B2 (en) | 2001-06-26 | 2007-05-15 | Isis Pharmaceuticals, Inc. | Methods for providing bacterial bioagent characterizing information |
| USRE41005E1 (en) | 1996-11-06 | 2009-11-24 | Sequenom, Inc. | Beads bound to a solid support and to nucleic acids |
| US9034798B2 (en) | 2003-01-16 | 2015-05-19 | Caprotec Bioanalytics Gmbh | Capture compounds, collections thereof and methods for analyzing the proteome and complex compositions |
Families Citing this family (111)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5795714A (en) | 1992-11-06 | 1998-08-18 | Trustees Of Boston University | Method for replicating an array of nucleic acid probes |
| JPH08509857A (en) | 1993-01-07 | 1996-10-22 | シーケノム・インコーポレーテッド | DNA sequencing method by mass spectrometry |
| US6194144B1 (en) | 1993-01-07 | 2001-02-27 | Sequenom, Inc. | DNA sequencing by mass spectrometry |
| US5605798A (en) | 1993-01-07 | 1997-02-25 | Sequenom, Inc. | DNA diagnostic based on mass spectrometry |
| US6146854A (en) * | 1995-08-31 | 2000-11-14 | Sequenom, Inc. | Filtration processes, kits and devices for isolating plasmids |
| US5777324A (en) * | 1996-09-19 | 1998-07-07 | Sequenom, Inc. | Method and apparatus for maldi analysis |
| AU735416B2 (en) | 1996-11-06 | 2001-07-05 | Sequenom, Inc. | Dna diagnostics based on mass spectrometry |
| JP2001524808A (en) | 1996-12-10 | 2001-12-04 | ジーントレイス・システムズ・インコーポレイテッド | Releasable non-volatile mass labeling molecules |
| US6207370B1 (en) | 1997-09-02 | 2001-03-27 | Sequenom, Inc. | Diagnostics based on mass spectrometric detection of translated target polypeptides |
| NZ503289A (en) * | 1997-09-15 | 2002-11-26 | Xzillion Gmbh & Co Kg | Characterising nucleic acid fragments by mass spectrometry |
| US6268131B1 (en) * | 1997-12-15 | 2001-07-31 | Sequenom, Inc. | Mass spectrometric methods for sequencing nucleic acids |
| US7875440B2 (en) | 1998-05-01 | 2011-01-25 | Arizona Board Of Regents | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
| US6780591B2 (en) | 1998-05-01 | 2004-08-24 | Arizona Board Of Regents | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
| WO1999061910A1 (en) * | 1998-05-26 | 1999-12-02 | Board Of Trustees Of The University Of Illinois | Screening of compounds using ultrafiltration and mass spectometry |
| US6218118B1 (en) | 1998-07-09 | 2001-04-17 | Agilent Technologies, Inc. | Method and mixture reagents for analyzing the nucleotide sequence of nucleic acids by mass spectrometry |
| US6270976B1 (en) | 1998-09-15 | 2001-08-07 | Brax Group Limited | Characterizing nucleic acid by mass spectrometry |
| DE19905082C1 (en) * | 1999-01-29 | 2000-05-18 | Epigenomics Gmbh | Identification of methylation patterns of cytosine in genome DNA comprises chemical treatment to produce different base pairing behavior between cytosine and 5-methylcytosine |
| US6225061B1 (en) | 1999-03-10 | 2001-05-01 | Sequenom, Inc. | Systems and methods for performing reactions in an unsealed environment |
| IL145421A0 (en) * | 1999-03-18 | 2002-06-30 | Exiqon As | Use of lna in mass spectrometry |
| US20020009394A1 (en) | 1999-04-02 | 2002-01-24 | Hubert Koster | Automated process line |
| US6818395B1 (en) | 1999-06-28 | 2004-11-16 | California Institute Of Technology | Methods and apparatus for analyzing polynucleotide sequences |
| US7501245B2 (en) | 1999-06-28 | 2009-03-10 | Helicos Biosciences Corp. | Methods and apparatuses for analyzing polynucleotide sequences |
| US7668658B2 (en) | 1999-10-13 | 2010-02-23 | Sequenom, Inc. | Methods for generating databases and databases for identifying polymorphic genetic markers |
| JP2003519829A (en) | 1999-10-13 | 2003-06-24 | シークエノム・インコーポレーテツド | Methods for creating a database and a database for identifying polymorphic genetic markers |
| US7917301B1 (en) | 2000-09-19 | 2011-03-29 | Sequenom, Inc. | Method and device for identifying a biological sample |
| DE19963536C2 (en) * | 1999-12-20 | 2003-04-10 | Epigenomics Ag | Procedure for the analysis of nucleic acid sequences |
| US20020009727A1 (en) * | 2000-02-02 | 2002-01-24 | Schultz Gary A. | Detection of single nucleotide polymorphisms |
| GB0006141D0 (en) | 2000-03-14 | 2000-05-03 | Brax Group Ltd | Mass labels |
| US6958214B2 (en) | 2000-07-10 | 2005-10-25 | Sequenom, Inc. | Polymorphic kinase anchor proteins and nucleic acids encoding the same |
| US20020142483A1 (en) | 2000-10-30 | 2002-10-03 | Sequenom, Inc. | Method and apparatus for delivery of submicroliter volumes onto a substrate |
| US20040121314A1 (en) | 2002-12-06 | 2004-06-24 | Ecker David J. | Methods for rapid detection and identification of bioagents in containers |
| US7226739B2 (en) | 2001-03-02 | 2007-06-05 | Isis Pharmaceuticals, Inc | Methods for rapid detection and identification of bioagents in epidemiological and forensic investigations |
| US20040121309A1 (en) | 2002-12-06 | 2004-06-24 | Ecker David J. | Methods for rapid detection and identification of bioagents in blood, bodily fluids, and bodily tissues |
| US7666588B2 (en) | 2001-03-02 | 2010-02-23 | Ibis Biosciences, Inc. | Methods for rapid forensic analysis of mitochondrial DNA and characterization of mitochondrial DNA heteroplasmy |
| WO2002072892A1 (en) | 2001-03-12 | 2002-09-19 | California Institute Of Technology | Methods and apparatus for analyzing polynucleotide sequences by asynchronous base extension |
| US20020155587A1 (en) | 2001-04-20 | 2002-10-24 | Sequenom, Inc. | System and method for testing a biological sample |
| EP1401850A1 (en) * | 2001-06-20 | 2004-03-31 | Nuevolution A/S | Nucleoside derivatives for library preparation |
| EP1450957A1 (en) | 2001-10-26 | 2004-09-01 | Sequenom, Inc. | Method and apparatus for high-throughput sample handling process line |
| WO2003093296A2 (en) | 2002-05-03 | 2003-11-13 | Sequenom, Inc. | Kinase anchor protein muteins, peptides thereof, and related methods |
| DE10240746A1 (en) * | 2002-09-01 | 2004-03-18 | Epigenomics Ag | Method for the detection of nucleic acid sequences using cleavable probe molecules |
| CA2507189C (en) | 2002-11-27 | 2018-06-12 | Sequenom, Inc. | Fragmentation-based methods and systems for sequence variation detection and discovery |
| JP2006516193A (en) | 2002-12-06 | 2006-06-29 | アイシス・ファーマシューティカルス・インコーポレーテッド | Rapid identification of pathogens in humans and animals |
| DE10304219B3 (en) * | 2003-01-30 | 2004-08-19 | Epigenomics Ag | Method for the detection of cytosine methylation patterns with high sensitivity |
| US8158354B2 (en) | 2003-05-13 | 2012-04-17 | Ibis Biosciences, Inc. | Methods for rapid purification of nucleic acids for subsequent analysis by mass spectrometry by solution capture |
| US7964343B2 (en) | 2003-05-13 | 2011-06-21 | Ibis Biosciences, Inc. | Method for rapid purification of nucleic acids for subsequent analysis by mass spectrometry by solution capture |
| WO2005024068A2 (en) | 2003-09-05 | 2005-03-17 | Sequenom, Inc. | Allele-specific sequence variation analysis |
| US7956175B2 (en) | 2003-09-11 | 2011-06-07 | Ibis Biosciences, Inc. | Compositions for use in identification of bacteria |
| US8097416B2 (en) | 2003-09-11 | 2012-01-17 | Ibis Biosciences, Inc. | Methods for identification of sepsis-causing bacteria |
| US8546082B2 (en) | 2003-09-11 | 2013-10-01 | Ibis Biosciences, Inc. | Methods for identification of sepsis-causing bacteria |
| US7169560B2 (en) | 2003-11-12 | 2007-01-30 | Helicos Biosciences Corporation | Short cycle methods for sequencing polynucleotides |
| US7666592B2 (en) | 2004-02-18 | 2010-02-23 | Ibis Biosciences, Inc. | Methods for concurrent identification and quantification of an unknown bioagent |
| EP1716254B1 (en) | 2004-02-19 | 2010-04-07 | Helicos Biosciences Corporation | Methods for analyzing polynucleotide sequences |
| US7608394B2 (en) | 2004-03-26 | 2009-10-27 | Sequenom, Inc. | Methods and compositions for phenotype identification based on nucleic acid methylation |
| AU2005230936B2 (en) | 2004-03-26 | 2010-08-05 | Agena Bioscience, Inc. | Base specific cleavage of methylation-specific amplification products in combination with mass analysis |
| CA2567839C (en) | 2004-05-24 | 2011-06-28 | Isis Pharmaceuticals, Inc. | Mass spectrometry with selective ion filtration by digital thresholding |
| US7476734B2 (en) | 2005-12-06 | 2009-01-13 | Helicos Biosciences Corporation | Nucleotide analogs |
| US20050266411A1 (en) | 2004-05-25 | 2005-12-01 | Hofstadler Steven A | Methods for rapid forensic analysis of mitochondrial DNA |
| ATE507305T1 (en) | 2004-05-25 | 2011-05-15 | Helicos Biosciences Corp | METHOD FOR NUCLEIC ACID IMMOBILIZATION |
| US7811753B2 (en) | 2004-07-14 | 2010-10-12 | Ibis Biosciences, Inc. | Methods for repairing degraded DNA |
| WO2006135400A2 (en) | 2004-08-24 | 2006-12-21 | Isis Pharmaceuticals, Inc. | Methods for rapid identification of recombinant organisms |
| US7220549B2 (en) | 2004-12-30 | 2007-05-22 | Helicos Biosciences Corporation | Stabilizing a nucleic acid for nucleic acid sequencing |
| US7482120B2 (en) | 2005-01-28 | 2009-01-27 | Helicos Biosciences Corporation | Methods and compositions for improving fidelity in a nucleic acid synthesis reaction |
| WO2006094238A2 (en) | 2005-03-03 | 2006-09-08 | Isis Pharmaceuticals, Inc. | Compositions for use in identification of adventitious viruses |
| US8084207B2 (en) | 2005-03-03 | 2011-12-27 | Ibis Bioscience, Inc. | Compositions for use in identification of papillomavirus |
| EP1904655A2 (en) | 2005-07-21 | 2008-04-02 | Isis Pharmaceuticals, Inc. | Methods for rapid identification and quantitation of nucleic acid variants |
| US7666593B2 (en) | 2005-08-26 | 2010-02-23 | Helicos Biosciences Corporation | Single molecule sequencing of captured nucleic acids |
| US7397546B2 (en) | 2006-03-08 | 2008-07-08 | Helicos Biosciences Corporation | Systems and methods for reducing detected intensity non-uniformity in a laser beam |
| US8088582B2 (en) | 2006-04-06 | 2012-01-03 | Ibis Biosciences, Inc. | Compositions for the use in identification of fungi |
| US8679741B2 (en) | 2006-05-31 | 2014-03-25 | Sequenom, Inc. | Methods and compositions for the extraction and amplification of nucleic acid from a sample |
| AU2007353877B2 (en) | 2006-09-14 | 2012-07-19 | Ibis Biosciences, Inc. | Targeted whole genome amplification method for identification of pathogens |
| US7902345B2 (en) | 2006-12-05 | 2011-03-08 | Sequenom, Inc. | Detection and quantification of biomolecules using mass spectrometry |
| EP2126132B1 (en) | 2007-02-23 | 2013-03-20 | Ibis Biosciences, Inc. | Methods for rapid foresnsic dna analysis |
| US9598724B2 (en) | 2007-06-01 | 2017-03-21 | Ibis Biosciences, Inc. | Methods and compositions for multiple displacement amplification of nucleic acids |
| ATE549419T1 (en) | 2007-08-29 | 2012-03-15 | Sequenom Inc | METHODS AND COMPOSITIONS FOR UNIVERSAL SIZE-SPECIFIC POLYMERASE CHAIN REACTION |
| US20090180931A1 (en) | 2007-09-17 | 2009-07-16 | Sequenom, Inc. | Integrated robotic sample transfer device |
| EP4450642A3 (en) | 2008-01-17 | 2025-01-08 | Sequenom, Inc. | Single molecule nucleic acid sequence analysis processes and compositions |
| CA3073079C (en) | 2008-09-16 | 2023-09-26 | Sequenom, Inc. | Processes and compositions for methylation-based enrichment of fetal nucleic acid from a maternal sample useful for non-invasive prenatal diagnoses |
| EP2349549B1 (en) | 2008-09-16 | 2012-07-18 | Ibis Biosciences, Inc. | Mixing cartridges, mixing stations, and related kits, and system |
| US8534447B2 (en) | 2008-09-16 | 2013-09-17 | Ibis Biosciences, Inc. | Microplate handling systems and related computer program products and methods |
| EP2347254A2 (en) | 2008-09-16 | 2011-07-27 | Ibis Biosciences, Inc. | Sample processing units, systems, and related methods |
| US8962247B2 (en) | 2008-09-16 | 2015-02-24 | Sequenom, Inc. | Processes and compositions for methylation-based enrichment of fetal nucleic acid from a maternal sample useful for non invasive prenatal diagnoses |
| US8476013B2 (en) | 2008-09-16 | 2013-07-02 | Sequenom, Inc. | Processes and compositions for methylation-based acid enrichment of fetal nucleic acid from a maternal sample useful for non-invasive prenatal diagnoses |
| WO2010080616A1 (en) | 2008-12-19 | 2010-07-15 | Abbott Laboratories | Molecular assay for diagnosis of malaria |
| US8158936B2 (en) | 2009-02-12 | 2012-04-17 | Ibis Biosciences, Inc. | Ionization probe assemblies |
| WO2010104798A1 (en) | 2009-03-08 | 2010-09-16 | Ibis Biosciences, Inc. | Bioagent detection methods |
| US9393564B2 (en) | 2009-03-30 | 2016-07-19 | Ibis Biosciences, Inc. | Bioagent detection systems, devices, and methods |
| EP2414545B1 (en) | 2009-04-03 | 2017-01-11 | Sequenom, Inc. | Nucleic acid preparation compositions and methods |
| WO2011008971A1 (en) | 2009-07-17 | 2011-01-20 | Ibis Biosciences, Inc. | Lift and mount apparatus |
| US9194877B2 (en) | 2009-07-17 | 2015-11-24 | Ibis Biosciences, Inc. | Systems for bioagent indentification |
| WO2011014811A1 (en) | 2009-07-31 | 2011-02-03 | Ibis Biosciences, Inc. | Capture primers and capture sequence linked solid supports for molecular diagnostic tests |
| EP2462244B1 (en) | 2009-08-06 | 2016-07-20 | Ibis Biosciences, Inc. | Non-mass determined base compositions for nucleic acid detection |
| EP2957641B1 (en) | 2009-10-15 | 2017-05-17 | Ibis Biosciences, Inc. | Multiple displacement amplification |
| EP3088532B1 (en) | 2009-12-22 | 2019-10-30 | Sequenom, Inc. | Processes and kits for identifying aneuploidy |
| US9758840B2 (en) | 2010-03-14 | 2017-09-12 | Ibis Biosciences, Inc. | Parasite detection via endosymbiont detection |
| CN110564819A (en) | 2011-05-19 | 2019-12-13 | 基纳生物技术有限公司 | Products and methods for multiplex nucleic acid identification |
| EP4155401A1 (en) | 2012-03-02 | 2023-03-29 | Sequenom, Inc. | Methods and processes for non-invasive assessment of genetic variations |
| US10504613B2 (en) | 2012-12-20 | 2019-12-10 | Sequenom, Inc. | Methods and processes for non-invasive assessment of genetic variations |
| US9920361B2 (en) | 2012-05-21 | 2018-03-20 | Sequenom, Inc. | Methods and compositions for analyzing nucleic acid |
| US20140004105A1 (en) | 2012-06-29 | 2014-01-02 | Sequenom, Inc. | Age-related macular degeneration diagnostics |
| AU2013290102B2 (en) | 2012-07-13 | 2018-11-15 | Sequenom, Inc. | Processes and compositions for methylation-based enrichment of fetal nucleic acid from a maternal sample useful for non-invasive prenatal diagnoses |
| AU2013326980B2 (en) | 2012-10-04 | 2019-08-15 | Sequenom, Inc. | Methods and processes for non-invasive assessment of genetic variations |
| US11060145B2 (en) | 2013-03-13 | 2021-07-13 | Sequenom, Inc. | Methods and compositions for identifying presence or absence of hypermethylation or hypomethylation locus |
| EP3071706B1 (en) | 2013-11-21 | 2018-04-25 | Assistance Publique-Hôpitaux de Paris | Method for detecting chromosomal rearrangements |
| US11365447B2 (en) | 2014-03-13 | 2022-06-21 | Sequenom, Inc. | Methods and processes for non-invasive assessment of genetic variations |
| HK1250747A1 (en) | 2015-04-24 | 2019-01-11 | Agena Bioscience, Inc. | Multiplexed method for the identification and quantitation of minor alleles and polymorphisms |
| CN107787371B (en) | 2015-04-24 | 2022-02-01 | 基纳生物技术有限公司 | Parallel method for detecting and quantifying minor variants |
| US10426424B2 (en) | 2017-11-21 | 2019-10-01 | General Electric Company | System and method for generating and performing imaging protocol simulations |
| US12188083B2 (en) | 2018-06-01 | 2025-01-07 | Agena Bioscience, Inc. | Products and processes for nucleic acid detection and quantification |
| US20210301342A1 (en) | 2018-09-07 | 2021-09-30 | Sequenom, Inc. | Methods, and systems to detect transplant rejection |
| US20220093208A1 (en) | 2019-02-19 | 2022-03-24 | Sequenom, Inc. | Compositions, methods, and systems to detect hematopoietic stem cell transplantation status |
| US20230120825A1 (en) | 2020-02-28 | 2023-04-20 | Laboratory Corporation Of America Holdings | Compositions, Methods, and Systems for Paternity Determination |
Family Cites Families (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH08509857A (en) * | 1993-01-07 | 1996-10-22 | シーケノム・インコーポレーテッド | DNA sequencing method by mass spectrometry |
| AU687801B2 (en) * | 1993-03-19 | 1998-03-05 | Sequenom, Inc. | DNA sequencing by mass spectrometry via exonuclease degradation |
| WO1995014108A1 (en) * | 1993-11-17 | 1995-05-26 | Amersham International Plc | Primer extension mass spectroscopy nucleic acid sequencing method |
-
1997
- 1997-03-18 AU AU22175/97A patent/AU2217597A/en not_active Abandoned
- 1997-03-18 WO PCT/US1997/004394 patent/WO1997037041A2/en active Application Filing
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6991903B2 (en) | 1992-11-06 | 2006-01-31 | Sequenom, Inc. | Solid phase sequencing of double-stranded nucleic acids |
| US6949633B1 (en) | 1995-05-22 | 2005-09-27 | Sequenom, Inc. | Primers useful for sizing nucleic acids |
| USRE41005E1 (en) | 1996-11-06 | 2009-11-24 | Sequenom, Inc. | Beads bound to a solid support and to nucleic acids |
| USRE44693E1 (en) | 1996-11-06 | 2014-01-07 | Sequenom, Inc. | Beads bound to a solid support and to nucleic acids |
| US7108974B2 (en) | 2001-03-02 | 2006-09-19 | Isis Pharmaceuticals, Inc. | Method for rapid detection and identification of bioagents |
| US7217510B2 (en) | 2001-06-26 | 2007-05-15 | Isis Pharmaceuticals, Inc. | Methods for providing bacterial bioagent characterizing information |
| US9034798B2 (en) | 2003-01-16 | 2015-05-19 | Caprotec Bioanalytics Gmbh | Capture compounds, collections thereof and methods for analyzing the proteome and complex compositions |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6194144B1 (en) | DNA sequencing by mass spectrometry | |
| EP0679196B1 (en) | Dna sequencing by mass spectrometry | |
| WO1997037041A9 (en) | Dna sequencing by mass spectrometry | |
| WO1997037041A2 (en) | Dna sequencing by mass spectrometry | |
| US6428955B1 (en) | DNA diagnostics based on mass spectrometry | |
| US7501251B2 (en) | DNA diagnostics based on mass spectrometry | |
| US6043031A (en) | DNA diagnostics based on mass spectrometry | |
| AU725966B2 (en) | Mass label linked hybridisation probes | |
| US20050042625A1 (en) | Mass label linked hybridisation probes | |
| CA2218188A1 (en) | Solid phase sequencing of biopolymers | |
| WO1996029431A9 (en) | Dna diagnostics based on mass spectrometry | |
| US6699668B1 (en) | Mass label linked hybridisation probes | |
| AU694940C (en) | DNA sequencing by mass spectrometry | |
| AU738203B2 (en) | DNA sequencing by mass spectrometry | |
| HK1126821A (en) | Dna diagnostics based on mass spectrometry | |
| HK1126822A (en) | Dna diagnostics based on mass spectrometry |