WO1999006839A1 - Procede d'identification et de developpement de chefs de file de medicaments - Google Patents
Procede d'identification et de developpement de chefs de file de medicaments Download PDFInfo
- Publication number
- WO1999006839A1 WO1999006839A1 PCT/US1998/015943 US9815943W WO9906839A1 WO 1999006839 A1 WO1999006839 A1 WO 1999006839A1 US 9815943 W US9815943 W US 9815943W WO 9906839 A1 WO9906839 A1 WO 9906839A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- protein
- proteins
- descriptors
- interest
- reactivity
- Prior art date
Links
- 229940079593 drug Drugs 0.000 title claims abstract description 94
- 239000003814 drug Substances 0.000 title claims abstract description 94
- 238000000034 method Methods 0.000 title claims description 66
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 420
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 415
- 150000001875 compounds Chemical class 0.000 claims abstract description 134
- 108091023037 Aptamer Proteins 0.000 claims abstract description 89
- 230000009257 reactivity Effects 0.000 claims abstract description 81
- 230000001404 mediated effect Effects 0.000 claims abstract description 4
- 239000003153 chemical reaction reagent Substances 0.000 claims description 82
- 239000003446 ligand Substances 0.000 claims description 71
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 48
- 230000004071 biological effect Effects 0.000 claims description 35
- 239000002773 nucleotide Substances 0.000 claims description 33
- 125000003729 nucleotide group Chemical group 0.000 claims description 33
- 108091034117 Oligonucleotide Proteins 0.000 claims description 29
- 238000012216 screening Methods 0.000 claims description 20
- 108010043958 Peptoids Proteins 0.000 claims description 2
- 230000000694 effects Effects 0.000 abstract description 43
- 235000018102 proteins Nutrition 0.000 description 327
- 230000027455 binding Effects 0.000 description 109
- 238000006243 chemical reaction Methods 0.000 description 66
- 235000001014 amino acid Nutrition 0.000 description 64
- 229940024606 amino acid Drugs 0.000 description 63
- 150000001413 amino acids Chemical class 0.000 description 61
- -1 phospho- Chemical class 0.000 description 40
- 239000000126 substance Substances 0.000 description 39
- 102000005962 receptors Human genes 0.000 description 37
- 108020003175 receptors Proteins 0.000 description 37
- 150000007523 nucleic acids Chemical class 0.000 description 33
- 102000004196 processed proteins & peptides Human genes 0.000 description 33
- ROFVEXUMMXZLPA-UHFFFAOYSA-N Bipyridyl Chemical compound N1=CC=CC=C1C1=CC=CC=N1 ROFVEXUMMXZLPA-UHFFFAOYSA-N 0.000 description 32
- 239000005557 antagonist Substances 0.000 description 32
- 108020004414 DNA Proteins 0.000 description 30
- 102000039446 nucleic acids Human genes 0.000 description 29
- 108020004707 nucleic acids Proteins 0.000 description 29
- 239000000556 agonist Substances 0.000 description 28
- 238000013459 approach Methods 0.000 description 28
- 229940049706 benzodiazepine Drugs 0.000 description 26
- 230000003647 oxidation Effects 0.000 description 22
- 238000007254 oxidation reaction Methods 0.000 description 22
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 20
- 230000015572 biosynthetic process Effects 0.000 description 19
- 230000003993 interaction Effects 0.000 description 19
- 238000003786 synthesis reaction Methods 0.000 description 19
- 235000006109 methionine Nutrition 0.000 description 18
- 230000036515 potency Effects 0.000 description 15
- 102000004190 Enzymes Human genes 0.000 description 14
- 108090000790 Enzymes Proteins 0.000 description 14
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 14
- 229940088598 enzyme Drugs 0.000 description 14
- 229930182817 methionine Natural products 0.000 description 14
- 230000000144 pharmacologic effect Effects 0.000 description 14
- 230000007017 scission Effects 0.000 description 14
- 125000001424 substituent group Chemical group 0.000 description 14
- SVUOLADPCWQTTE-UHFFFAOYSA-N 1h-1,2-benzodiazepine Chemical compound N1N=CC=CC2=CC=CC=C12 SVUOLADPCWQTTE-UHFFFAOYSA-N 0.000 description 13
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 13
- 238000003556 assay Methods 0.000 description 13
- 150000001557 benzodiazepines Chemical class 0.000 description 13
- 230000037361 pathway Effects 0.000 description 13
- 239000000758 substrate Substances 0.000 description 13
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 12
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 12
- 210000004027 cell Anatomy 0.000 description 12
- 230000004048 modification Effects 0.000 description 12
- 238000012986 modification Methods 0.000 description 12
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 11
- 239000000203 mixture Substances 0.000 description 11
- 239000002904 solvent Substances 0.000 description 11
- 102000016943 Muramidase Human genes 0.000 description 10
- 108010014251 Muramidase Proteins 0.000 description 10
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 10
- 238000004458 analytical method Methods 0.000 description 10
- 238000003776 cleavage reaction Methods 0.000 description 10
- 108020002326 glutamine synthetase Proteins 0.000 description 10
- 102000005396 glutamine synthetase Human genes 0.000 description 10
- 239000003112 inhibitor Substances 0.000 description 10
- 229960000274 lysozyme Drugs 0.000 description 10
- 235000010335 lysozyme Nutrition 0.000 description 10
- 239000004325 lysozyme Substances 0.000 description 10
- 230000027756 respiratory electron transport chain Effects 0.000 description 10
- 229940054025 carbamate anxiolytics Drugs 0.000 description 9
- 150000004657 carbamic acid derivatives Chemical class 0.000 description 9
- 239000002131 composite material Substances 0.000 description 9
- 239000013612 plasmid Substances 0.000 description 9
- 150000003384 small molecules Chemical class 0.000 description 9
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 8
- 230000004913 activation Effects 0.000 description 8
- 230000008859 change Effects 0.000 description 8
- 238000007385 chemical modification Methods 0.000 description 8
- 125000000524 functional group Chemical group 0.000 description 8
- 230000014509 gene expression Effects 0.000 description 8
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 8
- 230000003287 optical effect Effects 0.000 description 8
- 238000012360 testing method Methods 0.000 description 8
- 229910052723 transition metal Inorganic materials 0.000 description 8
- 150000003624 transition metals Chemical class 0.000 description 8
- 230000002596 correlated effect Effects 0.000 description 7
- 239000012634 fragment Substances 0.000 description 7
- 239000011159 matrix material Substances 0.000 description 7
- 230000036961 partial effect Effects 0.000 description 7
- 235000000346 sugar Nutrition 0.000 description 7
- 238000012549 training Methods 0.000 description 7
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 6
- KXDHJXZQYSOELW-UHFFFAOYSA-M Carbamate Chemical compound NC([O-])=O KXDHJXZQYSOELW-UHFFFAOYSA-M 0.000 description 6
- 108010067902 Peptide Library Proteins 0.000 description 6
- 239000002168 alkylating agent Substances 0.000 description 6
- 229940100198 alkylating agent Drugs 0.000 description 6
- 238000004364 calculation method Methods 0.000 description 6
- 230000001419 dependent effect Effects 0.000 description 6
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 230000007246 mechanism Effects 0.000 description 6
- 239000007800 oxidant agent Substances 0.000 description 6
- 229920000642 polymer Polymers 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 230000004044 response Effects 0.000 description 6
- 238000001228 spectrum Methods 0.000 description 6
- 108020005087 unfolded proteins Proteins 0.000 description 6
- MHAJPDPJQMAIIY-UHFFFAOYSA-N Hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 description 5
- 101710173438 Late L2 mu core protein Proteins 0.000 description 5
- 101710188315 Protein X Proteins 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 125000003275 alpha amino acid group Chemical group 0.000 description 5
- 230000004075 alteration Effects 0.000 description 5
- 238000002869 basic local alignment search tool Methods 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 238000010504 bond cleavage reaction Methods 0.000 description 5
- 230000000875 corresponding effect Effects 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 230000008034 disappearance Effects 0.000 description 5
- 201000010099 disease Diseases 0.000 description 5
- 238000009826 distribution Methods 0.000 description 5
- 239000001257 hydrogen Substances 0.000 description 5
- 229910052739 hydrogen Inorganic materials 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 230000005764 inhibitory process Effects 0.000 description 5
- 238000006317 isomerization reaction Methods 0.000 description 5
- 238000004949 mass spectrometry Methods 0.000 description 5
- 238000005259 measurement Methods 0.000 description 5
- 108091008104 nucleic acid aptamers Proteins 0.000 description 5
- 238000005457 optimization Methods 0.000 description 5
- 230000002829 reductive effect Effects 0.000 description 5
- 239000000523 sample Substances 0.000 description 5
- 125000000430 tryptophan group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C2=C([H])C([H])=C([H])C([H])=C12 0.000 description 5
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 4
- 102100037685 60S ribosomal protein L22 Human genes 0.000 description 4
- 108010032595 Antibody Binding Sites Proteins 0.000 description 4
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 4
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 4
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 4
- 230000009471 action Effects 0.000 description 4
- 125000001931 aliphatic group Chemical group 0.000 description 4
- 239000002249 anxiolytic agent Substances 0.000 description 4
- 235000013877 carbamide Nutrition 0.000 description 4
- 238000012512 characterization method Methods 0.000 description 4
- 239000003795 chemical substances by application Substances 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 235000018417 cysteine Nutrition 0.000 description 4
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 4
- 238000006356 dehydrogenation reaction Methods 0.000 description 4
- 238000007876 drug discovery Methods 0.000 description 4
- 230000002209 hydrophobic effect Effects 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 229910052751 metal Inorganic materials 0.000 description 4
- 239000002184 metal Substances 0.000 description 4
- 150000004713 phosphodiesters Chemical group 0.000 description 4
- 230000003389 potentiating effect Effects 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 238000010791 quenching Methods 0.000 description 4
- 230000000171 quenching effect Effects 0.000 description 4
- 230000009467 reduction Effects 0.000 description 4
- 238000006722 reduction reaction Methods 0.000 description 4
- 210000003705 ribosome Anatomy 0.000 description 4
- 238000011524 similarity measure Methods 0.000 description 4
- 229940124530 sulfonamide Drugs 0.000 description 4
- ZFXYFBGIUFBOJW-UHFFFAOYSA-N theophylline Chemical compound O=C1N(C)C(=O)N(C)C2=C1NC=N2 ZFXYFBGIUFBOJW-UHFFFAOYSA-N 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- GUJAGMICFDYKNR-UHFFFAOYSA-N 1,4-benzodiazepine Chemical group N1C=CN=CC2=CC=CC=C12 GUJAGMICFDYKNR-UHFFFAOYSA-N 0.000 description 3
- AVRPFRMDMNDIDH-UHFFFAOYSA-N 1h-quinazolin-2-one Chemical compound C1=CC=CC2=NC(O)=NC=C21 AVRPFRMDMNDIDH-UHFFFAOYSA-N 0.000 description 3
- 108010078791 Carrier Proteins Proteins 0.000 description 3
- 102000014914 Carrier Proteins Human genes 0.000 description 3
- 108091035707 Consensus sequence Proteins 0.000 description 3
- 108091008102 DNA aptamers Proteins 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- RPTUSVTUFVMDQK-UHFFFAOYSA-N Hidralazin Chemical class C1=CC=C2C(NN)=NN=CC2=C1 RPTUSVTUFVMDQK-UHFFFAOYSA-N 0.000 description 3
- 102100034343 Integrase Human genes 0.000 description 3
- 101710163270 Nuclease Proteins 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 108010079855 Peptide Aptamers Proteins 0.000 description 3
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 3
- 241000700159 Rattus Species 0.000 description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 3
- 239000000370 acceptor Substances 0.000 description 3
- 238000007792 addition Methods 0.000 description 3
- 150000001408 amides Chemical group 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 108091008324 binding proteins Proteins 0.000 description 3
- 125000002915 carbonyl group Chemical group [*:2]C([*:1])=O 0.000 description 3
- 239000003054 catalyst Substances 0.000 description 3
- 210000000170 cell membrane Anatomy 0.000 description 3
- 238000007621 cluster analysis Methods 0.000 description 3
- 125000004122 cyclic group Chemical group 0.000 description 3
- 230000006378 damage Effects 0.000 description 3
- 230000007423 decrease Effects 0.000 description 3
- 229960003529 diazepam Drugs 0.000 description 3
- AAOVKJBEBIDNHE-UHFFFAOYSA-N diazepam Chemical compound N=1CC(=O)N(C)C2=CC=C(Cl)C=C2C=1C1=CC=CC=C1 AAOVKJBEBIDNHE-UHFFFAOYSA-N 0.000 description 3
- 239000012636 effector Substances 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000005281 excited state Effects 0.000 description 3
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 239000010931 gold Substances 0.000 description 3
- 125000005842 heteroatom Chemical group 0.000 description 3
- 150000001469 hydantoins Chemical class 0.000 description 3
- TUJKJAMUKRIRHC-UHFFFAOYSA-N hydroxyl Chemical compound [OH] TUJKJAMUKRIRHC-UHFFFAOYSA-N 0.000 description 3
- 230000002401 inhibitory effect Effects 0.000 description 3
- 238000000816 matrix-assisted laser desorption--ionisation Methods 0.000 description 3
- 239000002547 new drug Substances 0.000 description 3
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 3
- 230000001590 oxidative effect Effects 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 235000004400 serine Nutrition 0.000 description 3
- 238000004611 spectroscopical analysis Methods 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 238000006276 transfer reaction Methods 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- MAOBFOXLCJIFLV-UHFFFAOYSA-N (2-aminophenyl)-phenylmethanone Chemical compound NC1=CC=CC=C1C(=O)C1=CC=CC=C1 MAOBFOXLCJIFLV-UHFFFAOYSA-N 0.000 description 2
- KWTSXDURSIMDCE-QMMMGPOBSA-N (S)-amphetamine Chemical compound C[C@H](N)CC1=CC=CC=C1 KWTSXDURSIMDCE-QMMMGPOBSA-N 0.000 description 2
- NGNBDVOYPDDBFK-UHFFFAOYSA-N 2-[2,4-di(pentan-2-yl)phenoxy]acetyl chloride Chemical compound CCCC(C)C1=CC=C(OCC(Cl)=O)C(C(C)CCC)=C1 NGNBDVOYPDDBFK-UHFFFAOYSA-N 0.000 description 2
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 2
- FTOAOBMCPZCFFF-UHFFFAOYSA-N 5,5-diethylbarbituric acid Chemical compound CCC1(CC)C(=O)NC(=O)NC1=O FTOAOBMCPZCFFF-UHFFFAOYSA-N 0.000 description 2
- FWEOQOXTVHGIFQ-UHFFFAOYSA-N 8-anilinonaphthalene-1-sulfonic acid Chemical compound C=12C(S(=O)(=O)O)=CC=CC2=CC=CC=1NC1=CC=CC=C1 FWEOQOXTVHGIFQ-UHFFFAOYSA-N 0.000 description 2
- LRFVTYWOQMYALW-UHFFFAOYSA-N 9H-xanthine Chemical class O=C1NC(=O)NC2=C1NC=N2 LRFVTYWOQMYALW-UHFFFAOYSA-N 0.000 description 2
- LXEKPEMOWBOYRF-QDBORUFSSA-N AAPH Chemical compound Cl.Cl.NC(=N)C(C)(C)\N=N\C(C)(C)C(N)=N LXEKPEMOWBOYRF-QDBORUFSSA-N 0.000 description 2
- RZVAJINKPMORJF-UHFFFAOYSA-N Acetaminophen Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 description 2
- 101800001982 Cholecystokinin Proteins 0.000 description 2
- 102100025841 Cholecystokinin Human genes 0.000 description 2
- 238000010499 C–H functionalization reaction Methods 0.000 description 2
- 230000001482 DNA photocleavage Effects 0.000 description 2
- 241000713800 Feline immunodeficiency virus Species 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- 101000721661 Homo sapiens Cellular tumor antigen p53 Proteins 0.000 description 2
- 102000004310 Ion Channels Human genes 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- NPPQSCRMBWNHMW-UHFFFAOYSA-N Meprobamate Chemical compound NC(=O)OCC(C)(CCC)COC(N)=O NPPQSCRMBWNHMW-UHFFFAOYSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 108010021466 Mutant Proteins Proteins 0.000 description 2
- 102000008300 Mutant Proteins Human genes 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 102000009516 Protein Serine-Threonine Kinases Human genes 0.000 description 2
- 108010009341 Protein Serine-Threonine Kinases Proteins 0.000 description 2
- MUMGGOZAMZWBJJ-DYKIIFRCSA-N Testostosterone Chemical compound O=C1CC[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CCC2=C1 MUMGGOZAMZWBJJ-DYKIIFRCSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- 101710100170 Unknown protein Proteins 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 125000002015 acyclic group Chemical group 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- 150000001298 alcohols Chemical class 0.000 description 2
- 150000001412 amines Chemical group 0.000 description 2
- 150000001414 amino alcohols Chemical class 0.000 description 2
- 229940025084 amphetamine Drugs 0.000 description 2
- 229940035676 analgesics Drugs 0.000 description 2
- 125000000129 anionic group Chemical group 0.000 description 2
- 230000008485 antagonism Effects 0.000 description 2
- 239000000730 antalgic agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 229940125681 anticonvulsant agent Drugs 0.000 description 2
- 239000001961 anticonvulsive agent Substances 0.000 description 2
- 230000000949 anxiolytic effect Effects 0.000 description 2
- 229940005530 anxiolytics Drugs 0.000 description 2
- 238000003491 array Methods 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 125000004429 atom Chemical group 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 2
- 229940125717 barbiturate Drugs 0.000 description 2
- 230000004888 barrier function Effects 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- RYYVLZVUVIJVGH-UHFFFAOYSA-N caffeine Chemical compound CN1C(=O)N(C)C(=O)C2=C1N=CN2C RYYVLZVUVIJVGH-UHFFFAOYSA-N 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 239000004202 carbamide Substances 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 239000007795 chemical reaction product Substances 0.000 description 2
- ANTSCNMPPGJYLG-UHFFFAOYSA-N chlordiazepoxide Chemical compound O=N=1CC(NC)=NC2=CC=C(Cl)C=C2C=1C1=CC=CC=C1 ANTSCNMPPGJYLG-UHFFFAOYSA-N 0.000 description 2
- 229960004782 chlordiazepoxide Drugs 0.000 description 2
- 150000001805 chlorine compounds Chemical class 0.000 description 2
- 229940107137 cholecystokinin Drugs 0.000 description 2
- 150000001923 cyclic compounds Chemical class 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- VAYGXNSJCAHWJZ-UHFFFAOYSA-N dimethyl sulfate Chemical compound COS(=O)(=O)OC VAYGXNSJCAHWJZ-UHFFFAOYSA-N 0.000 description 2
- 239000006185 dispersion Substances 0.000 description 2
- 230000009881 electrostatic interaction Effects 0.000 description 2
- 230000008030 elimination Effects 0.000 description 2
- 238000003379 elimination reaction Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000000556 factor analysis Methods 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- CGIGDMFJXJATDK-UHFFFAOYSA-N indomethacin Chemical compound CC1=C(CC(O)=O)C2=CC(OC)=CC=C2N1C(=O)C1=CC=C(Cl)C=C1 CGIGDMFJXJATDK-UHFFFAOYSA-N 0.000 description 2
- 239000000543 intermediate Substances 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 150000002611 lead compounds Chemical class 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000012067 mathematical method Methods 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 229960004815 meprobamate Drugs 0.000 description 2
- 229930014626 natural product Natural products 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- ADIMAYPTOBDMTL-UHFFFAOYSA-N oxazepam Chemical compound C12=CC(Cl)=CC=C2NC(=O)C(O)N=C1C1=CC=CC=C1 ADIMAYPTOBDMTL-UHFFFAOYSA-N 0.000 description 2
- 229960004535 oxazepam Drugs 0.000 description 2
- 108010083127 phage repressor proteins Proteins 0.000 description 2
- 238000006303 photolysis reaction Methods 0.000 description 2
- 230000015843 photosynthesis, light reaction Effects 0.000 description 2
- 230000006461 physiological response Effects 0.000 description 2
- 229920000447 polyanionic polymer Polymers 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 235000013930 proline Nutrition 0.000 description 2
- 230000004952 protein activity Effects 0.000 description 2
- 230000006318 protein oxidation Effects 0.000 description 2
- 102000016914 ras Proteins Human genes 0.000 description 2
- 108010014186 ras Proteins Proteins 0.000 description 2
- 239000000376 reactant Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 238000007363 ring formation reaction Methods 0.000 description 2
- IZTQOLKUZKXIRV-YRVFCXMDSA-N sincalide Chemical compound C([C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(N)=O)NC(=O)[C@@H](N)CC(O)=O)C1=CC=C(OS(O)(=O)=O)C=C1 IZTQOLKUZKXIRV-YRVFCXMDSA-N 0.000 description 2
- 239000012279 sodium borohydride Substances 0.000 description 2
- 229910000033 sodium borohydride Inorganic materials 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 238000010532 solid phase synthesis reaction Methods 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 229940065721 systemic for obstructive airway disease xanthines Drugs 0.000 description 2
- 229960000278 theophylline Drugs 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000000844 transformation Methods 0.000 description 2
- JWZZKOKVBUJMES-UHFFFAOYSA-N (+-)-Isoprenaline Chemical compound CC(C)NCC(O)C1=CC=C(O)C(O)=C1 JWZZKOKVBUJMES-UHFFFAOYSA-N 0.000 description 1
- RKGFWMYLTBPZGL-YUMQZZPRSA-N (2s)-2-amino-1-[(2s)-2-amino-4-methylsulfanylbutanoyl]sulfinyl-4-methylsulfanylbutan-1-one Chemical compound CSCC[C@H](N)C(=O)S(=O)C(=O)[C@@H](N)CCSC RKGFWMYLTBPZGL-YUMQZZPRSA-N 0.000 description 1
- DNXIKVLOVZVMQF-UHFFFAOYSA-N (3beta,16beta,17alpha,18beta,20alpha)-17-hydroxy-11-methoxy-18-[(3,4,5-trimethoxybenzoyl)oxy]-yohimban-16-carboxylic acid, methyl ester Natural products C1C2CN3CCC(C4=CC=C(OC)C=C4N4)=C4C3CC2C(C(=O)OC)C(O)C1OC(=O)C1=CC(OC)=C(OC)C(OC)=C1 DNXIKVLOVZVMQF-UHFFFAOYSA-N 0.000 description 1
- UCTWMZQNUQWSLP-VIFPVBQESA-N (R)-adrenaline Chemical compound CNC[C@H](O)C1=CC=C(O)C(O)=C1 UCTWMZQNUQWSLP-VIFPVBQESA-N 0.000 description 1
- 229930182837 (R)-adrenaline Natural products 0.000 description 1
- WSPOMRSOLSGNFJ-AUWJEWJLSA-N (Z)-chlorprothixene Chemical compound C1=C(Cl)C=C2C(=C/CCN(C)C)\C3=CC=CC=C3SC2=C1 WSPOMRSOLSGNFJ-AUWJEWJLSA-N 0.000 description 1
- KANAPVJGZDNSCZ-UHFFFAOYSA-N 1,2-benzothiazole 1-oxide Chemical class C1=CC=C2S(=O)N=CC2=C1 KANAPVJGZDNSCZ-UHFFFAOYSA-N 0.000 description 1
- NOLHRFLIXVQPSZ-UHFFFAOYSA-N 1,3-thiazolidin-4-one Chemical class O=C1CSCN1 NOLHRFLIXVQPSZ-UHFFFAOYSA-N 0.000 description 1
- ZRHUHDUEXWHZMA-UHFFFAOYSA-N 1,4-dihydropyrazol-5-one Chemical compound O=C1CC=NN1 ZRHUHDUEXWHZMA-UHFFFAOYSA-N 0.000 description 1
- JWDYCNIAQWPBHD-UHFFFAOYSA-N 1-(2-methylphenyl)glycerol Chemical compound CC1=CC=CC=C1OCC(O)CO JWDYCNIAQWPBHD-UHFFFAOYSA-N 0.000 description 1
- YUEATNYGWBDCEL-UHFFFAOYSA-N 1-hydroxypropyl carbamate Chemical class CCC(O)OC(N)=O YUEATNYGWBDCEL-UHFFFAOYSA-N 0.000 description 1
- LLSKXGRDUPMXLC-UHFFFAOYSA-N 1-phenylpiperidine Chemical class C1CCCCN1C1=CC=CC=C1 LLSKXGRDUPMXLC-UHFFFAOYSA-N 0.000 description 1
- WJFKNYWRSNBZNX-UHFFFAOYSA-N 10H-phenothiazine Chemical compound C1=CC=C2NC3=CC=CC=C3SC2=C1 WJFKNYWRSNBZNX-UHFFFAOYSA-N 0.000 description 1
- FUFLCEKSBBHCMO-UHFFFAOYSA-N 11-dehydrocorticosterone Natural products O=C1CCC2(C)C3C(=O)CC(C)(C(CC4)C(=O)CO)C4C3CCC2=C1 FUFLCEKSBBHCMO-UHFFFAOYSA-N 0.000 description 1
- IAIGWBZFHIEWJI-UHFFFAOYSA-N 1h-1,4-benzodiazepine-2,5-dione Chemical group N1C(=O)C=NC(=O)C2=CC=CC=C21 IAIGWBZFHIEWJI-UHFFFAOYSA-N 0.000 description 1
- WRAUXDQDRDJTKM-UHFFFAOYSA-N 2-(1h-indol-2-yl)ethanamine Chemical class C1=CC=C2NC(CCN)=CC2=C1 WRAUXDQDRDJTKM-UHFFFAOYSA-N 0.000 description 1
- HVAUUPRFYPCOCA-AREMUKBSSA-N 2-O-acetyl-1-O-hexadecyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCCOC[C@@H](OC(C)=O)COP([O-])(=O)OCC[N+](C)(C)C HVAUUPRFYPCOCA-AREMUKBSSA-N 0.000 description 1
- IZXIZTKNFFYFOF-UHFFFAOYSA-N 2-Oxazolidone Chemical class O=C1NCCO1 IZXIZTKNFFYFOF-UHFFFAOYSA-N 0.000 description 1
- CFNXYWSRCYVSTD-UHFFFAOYSA-N 2-acetyloxybenzoic acid;4-aminophenol Chemical compound NC1=CC=C(O)C=C1.CC(=O)OC1=CC=CC=C1C(O)=O CFNXYWSRCYVSTD-UHFFFAOYSA-N 0.000 description 1
- SPCKHVPPRJWQRZ-UHFFFAOYSA-N 2-benzhydryloxy-n,n-dimethylethanamine;2-hydroxypropane-1,2,3-tricarboxylic acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O.C=1C=CC=CC=1C(OCCN(C)C)C1=CC=CC=C1 SPCKHVPPRJWQRZ-UHFFFAOYSA-N 0.000 description 1
- JIVPVXMEBJLZRO-CQSZACIVSA-N 2-chloro-5-[(1r)-1-hydroxy-3-oxo-2h-isoindol-1-yl]benzenesulfonamide Chemical compound C1=C(Cl)C(S(=O)(=O)N)=CC([C@@]2(O)C3=CC=CC=C3C(=O)N2)=C1 JIVPVXMEBJLZRO-CQSZACIVSA-N 0.000 description 1
- KISZTEOELCMZPY-UHFFFAOYSA-N 3,3-diphenylpropylamine Chemical class C=1C=CC=CC=1C(CCN)C1=CC=CC=C1 KISZTEOELCMZPY-UHFFFAOYSA-N 0.000 description 1
- USSIQXCVUWKGNF-UHFFFAOYSA-N 6-(dimethylamino)-4,4-diphenylheptan-3-one Chemical compound C=1C=CC=CC=1C(CC(C)N(C)C)(C(=O)CC)C1=CC=CC=C1 USSIQXCVUWKGNF-UHFFFAOYSA-N 0.000 description 1
- PQJUJGAVDBINPI-UHFFFAOYSA-N 9H-thioxanthene Chemical compound C1=CC=C2CC3=CC=CC=C3SC2=C1 PQJUJGAVDBINPI-UHFFFAOYSA-N 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 235000002198 Annona diversifolia Nutrition 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- BHELIUBJHYAEDK-OAIUPTLZSA-N Aspoxicillin Chemical compound C1([C@H](C(=O)N[C@@H]2C(N3[C@H](C(C)(C)S[C@@H]32)C(O)=O)=O)NC(=O)[C@H](N)CC(=O)NC)=CC=C(O)C=C1 BHELIUBJHYAEDK-OAIUPTLZSA-N 0.000 description 1
- 102000019260 B-Cell Antigen Receptors Human genes 0.000 description 1
- 108010012919 B-Cell Antigen Receptors Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108010077805 Bacterial Proteins Proteins 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 229940123208 Biguanide Drugs 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 235000003351 Brassica cretica Nutrition 0.000 description 1
- 235000003343 Brassica rupestris Nutrition 0.000 description 1
- 241000219193 Brassicaceae Species 0.000 description 1
- 102000000584 Calmodulin Human genes 0.000 description 1
- 108010041952 Calmodulin Proteins 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 241000700198 Cavia Species 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 229930186147 Cephalosporin Natural products 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 108050001186 Chaperonin Cpn60 Proteins 0.000 description 1
- 102000052603 Chaperonins Human genes 0.000 description 1
- PCLITLDOTJTVDJ-UHFFFAOYSA-N Chlormethiazole Chemical compound CC=1N=CSC=1CCCl PCLITLDOTJTVDJ-UHFFFAOYSA-N 0.000 description 1
- JZUFKLXOESDKRF-UHFFFAOYSA-N Chlorothiazide Chemical compound C1=C(Cl)C(S(=O)(=O)N)=CC2=C1NCNS2(=O)=O JZUFKLXOESDKRF-UHFFFAOYSA-N 0.000 description 1
- 235000016795 Cola Nutrition 0.000 description 1
- 244000228088 Cola acuminata Species 0.000 description 1
- 235000011824 Cola pachycarpa Nutrition 0.000 description 1
- 102000008186 Collagen Human genes 0.000 description 1
- 108010035532 Collagen Proteins 0.000 description 1
- MFYSYFVPBJMHGN-ZPOLXVRWSA-N Cortisone Chemical compound O=C1CC[C@]2(C)[C@H]3C(=O)C[C@](C)([C@@](CC4)(O)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 MFYSYFVPBJMHGN-ZPOLXVRWSA-N 0.000 description 1
- MFYSYFVPBJMHGN-UHFFFAOYSA-N Cortisone Natural products O=C1CCC2(C)C3C(=O)CC(C)(C(CC4)(O)C(=O)CO)C4C3CCC2=C1 MFYSYFVPBJMHGN-UHFFFAOYSA-N 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- 108090000395 Cysteine Endopeptidases Proteins 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 102000016559 DNA Primase Human genes 0.000 description 1
- 108010092681 DNA Primase Proteins 0.000 description 1
- 102000011724 DNA Repair Enzymes Human genes 0.000 description 1
- 108010076525 DNA Repair Enzymes Proteins 0.000 description 1
- 230000005778 DNA damage Effects 0.000 description 1
- 231100000277 DNA damage Toxicity 0.000 description 1
- 230000008836 DNA modification Effects 0.000 description 1
- 230000007018 DNA scission Effects 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108010016626 Dipeptides Proteins 0.000 description 1
- 229940123907 Disease modifying antirheumatic drug Drugs 0.000 description 1
- 206010061818 Disease progression Diseases 0.000 description 1
- 206010013710 Drug interaction Diseases 0.000 description 1
- VWLHWLSRQJQWRG-UHFFFAOYSA-O Edrophonum Chemical compound CC[N+](C)(C)C1=CC=CC(O)=C1 VWLHWLSRQJQWRG-UHFFFAOYSA-O 0.000 description 1
- 102000016942 Elastin Human genes 0.000 description 1
- 108010014258 Elastin Proteins 0.000 description 1
- 101100001669 Emericella variicolor andD gene Proteins 0.000 description 1
- 108010062466 Enzyme Precursors Proteins 0.000 description 1
- 102000010911 Enzyme Precursors Human genes 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 102000009123 Fibrin Human genes 0.000 description 1
- 108010073385 Fibrin Proteins 0.000 description 1
- BWGVNKXGVNDBDI-UHFFFAOYSA-N Fibrin monomer Chemical compound CNC(=O)CNC(=O)CN BWGVNKXGVNDBDI-UHFFFAOYSA-N 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000005744 Glycoside Hydrolases Human genes 0.000 description 1
- 108010031186 Glycoside Hydrolases Proteins 0.000 description 1
- 102000051366 Glycosyltransferases Human genes 0.000 description 1
- 108700023372 Glycosyltransferases Proteins 0.000 description 1
- 102100020948 Growth hormone receptor Human genes 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 241001272567 Hominoidea Species 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 108010071893 Human Immunodeficiency Virus rev Gene Products Proteins 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- 108010058683 Immobilized Proteins Proteins 0.000 description 1
- 102000010790 Interleukin-3 Receptors Human genes 0.000 description 1
- 108010038452 Interleukin-3 Receptors Proteins 0.000 description 1
- 108090001007 Interleukin-8 Proteins 0.000 description 1
- LPHGQDQBBGAPDZ-UHFFFAOYSA-N Isocaffeine Natural products CN1C(=O)N(C)C(=O)C2=C1N(C)C=N2 LPHGQDQBBGAPDZ-UHFFFAOYSA-N 0.000 description 1
- 102000004195 Isomerases Human genes 0.000 description 1
- 108090000769 Isomerases Proteins 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- WTDRDQBEARUVNC-UHFFFAOYSA-N L-Dopa Natural products OC(=O)C(N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-ZXPFJRLXSA-N L-methionine (R)-S-oxide Chemical compound C[S@@](=O)CC[C@H]([NH3+])C([O-])=O QEFRNWWLZKMPFJ-ZXPFJRLXSA-N 0.000 description 1
- QEFRNWWLZKMPFJ-UHFFFAOYSA-N L-methionine sulphoxide Natural products CS(=O)CCC(N)C(O)=O QEFRNWWLZKMPFJ-UHFFFAOYSA-N 0.000 description 1
- 241000282838 Lama Species 0.000 description 1
- NNJVILVZKWQKPM-UHFFFAOYSA-N Lidocaine Chemical compound CCN(CC)CC(=O)NC1=C(C)C=CC=C1C NNJVILVZKWQKPM-UHFFFAOYSA-N 0.000 description 1
- 108090001060 Lipase Proteins 0.000 description 1
- 108090001030 Lipoproteins Proteins 0.000 description 1
- 102000004317 Lyases Human genes 0.000 description 1
- 108090000856 Lyases Proteins 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- XADCESSVHJOZHK-UHFFFAOYSA-N Meperidine Chemical compound C=1C=CC=CC=1C1(C(=O)OCC)CCN(C)CC1 XADCESSVHJOZHK-UHFFFAOYSA-N 0.000 description 1
- 108010063312 Metalloproteins Proteins 0.000 description 1
- 102000010750 Metalloproteins Human genes 0.000 description 1
- 108010041559 Methionine Sulfoxide Reductases Proteins 0.000 description 1
- 102000000532 Methionine Sulfoxide Reductases Human genes 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 238000000342 Monte Carlo simulation Methods 0.000 description 1
- 108010085220 Multiprotein Complexes Proteins 0.000 description 1
- 102000007474 Multiprotein Complexes Human genes 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- DJDFFEBSKJCGHC-UHFFFAOYSA-N Naphazoline Chemical compound Cl.C=1C=CC2=CC=CC=C2C=1CC1=NCCN1 DJDFFEBSKJCGHC-UHFFFAOYSA-N 0.000 description 1
- SNIOPGDIGTZGOP-UHFFFAOYSA-N Nitroglycerin Chemical compound [O-][N+](=O)OCC(O[N+]([O-])=O)CO[N+]([O-])=O SNIOPGDIGTZGOP-UHFFFAOYSA-N 0.000 description 1
- 239000000006 Nitroglycerin Substances 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 108020005497 Nuclear hormone receptor Proteins 0.000 description 1
- 102000007399 Nuclear hormone receptor Human genes 0.000 description 1
- 102000011931 Nucleoproteins Human genes 0.000 description 1
- 108010061100 Nucleoproteins Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- FTLDJPRFCGDUFH-UHFFFAOYSA-N Oxethazaine Chemical compound C=1C=CC=CC=1CC(C)(C)N(C)C(=O)CN(CCO)CC(=O)N(C)C(C)(C)CC1=CC=CC=C1 FTLDJPRFCGDUFH-UHFFFAOYSA-N 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000417 Oxygenases Proteins 0.000 description 1
- 102000004020 Oxygenases Human genes 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- 108090000279 Peptidyltransferases Proteins 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- QZVCTJOXCFMACW-UHFFFAOYSA-N Phenoxybenzamine Chemical compound C=1C=CC=CC=1CN(CCCl)C(C)COC1=CC=CC=C1 QZVCTJOXCFMACW-UHFFFAOYSA-N 0.000 description 1
- CXOFVDLJLONNDW-UHFFFAOYSA-N Phenytoin Chemical compound N1C(=O)NC(=O)C1(C=1C=CC=CC=1)C1=CC=CC=C1 CXOFVDLJLONNDW-UHFFFAOYSA-N 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- 108010089430 Phosphoproteins Proteins 0.000 description 1
- 102000007982 Phosphoproteins Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- PIJVFDBKTWXHHD-UHFFFAOYSA-N Physostigmine Natural products C12=CC(OC(=O)NC)=CC=C2N(C)C2C1(C)CCN2C PIJVFDBKTWXHHD-UHFFFAOYSA-N 0.000 description 1
- 108010003541 Platelet Activating Factor Proteins 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 1
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 1
- 102000009609 Pyrophosphatases Human genes 0.000 description 1
- 108010009413 Pyrophosphatases Proteins 0.000 description 1
- 238000004617 QSAR study Methods 0.000 description 1
- 108091008103 RNA aptamers Proteins 0.000 description 1
- 101710086015 RNA ligase Proteins 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 108090000783 Renin Proteins 0.000 description 1
- 102100028255 Renin Human genes 0.000 description 1
- LCQMZZCPPSWADO-UHFFFAOYSA-N Reserpilin Natural products COC(=O)C1COCC2CN3CCc4c([nH]c5cc(OC)c(OC)cc45)C3CC12 LCQMZZCPPSWADO-UHFFFAOYSA-N 0.000 description 1
- QEVHRUUCFGRFIF-SFWBKIHZSA-N Reserpine Natural products O=C(OC)[C@@H]1[C@H](OC)[C@H](OC(=O)c2cc(OC)c(OC)c(OC)c2)C[C@H]2[C@@H]1C[C@H]1N(C2)CCc2c3c([nH]c12)cc(OC)cc3 QEVHRUUCFGRFIF-SFWBKIHZSA-N 0.000 description 1
- 241000710122 Rhinovirus B14 Species 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- KJTLSVCANCCWHF-UHFFFAOYSA-N Ruthenium Chemical compound [Ru] KJTLSVCANCCWHF-UHFFFAOYSA-N 0.000 description 1
- 239000012327 Ruthenium complex Substances 0.000 description 1
- CNIIGCLFLJGOGP-UHFFFAOYSA-N SJ000285664 Natural products C=1C=CC2=CC=CC=C2C=1CC1=NCCN1 CNIIGCLFLJGOGP-UHFFFAOYSA-N 0.000 description 1
- 101710184528 Scaffolding protein Proteins 0.000 description 1
- QZAYGJVTTNCVMB-UHFFFAOYSA-N Serotonin Natural products C1=C(O)C=C2C(CCN)=CNC2=C1 QZAYGJVTTNCVMB-UHFFFAOYSA-N 0.000 description 1
- 108091027568 Single-stranded nucleotide Proteins 0.000 description 1
- 108010068542 Somatotropin Receptors Proteins 0.000 description 1
- 238000006619 Stille reaction Methods 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 102000005262 Sulfatase Human genes 0.000 description 1
- 108091008874 T cell receptors Proteins 0.000 description 1
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 1
- 102000006463 Talin Human genes 0.000 description 1
- 108010083809 Talin Proteins 0.000 description 1
- 108010017842 Telomerase Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 244000269722 Thea sinensis Species 0.000 description 1
- 108090001109 Thermolysin Proteins 0.000 description 1
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Natural products O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 1
- 101710183280 Topoisomerase Proteins 0.000 description 1
- 102000003929 Transaminases Human genes 0.000 description 1
- 108090000340 Transaminases Proteins 0.000 description 1
- 108010040002 Tumor Suppressor Proteins Proteins 0.000 description 1
- 102000001742 Tumor Suppressor Proteins Human genes 0.000 description 1
- 102100031988 Tumor necrosis factor ligand superfamily member 6 Human genes 0.000 description 1
- 108050002568 Tumor necrosis factor ligand superfamily member 6 Proteins 0.000 description 1
- 101150104684 UL44 gene Proteins 0.000 description 1
- 108010046334 Urease Proteins 0.000 description 1
- 238000005411 Van der Waals force Methods 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- LXEKPEMOWBOYRF-UHFFFAOYSA-N [2-[(1-azaniumyl-1-imino-2-methylpropan-2-yl)diazenyl]-2-methylpropanimidoyl]azanium;dichloride Chemical compound Cl.Cl.NC(=N)C(C)(C)N=NC(C)(C)C(N)=N LXEKPEMOWBOYRF-UHFFFAOYSA-N 0.000 description 1
- SAZUGELZHZOXHB-UHFFFAOYSA-N acecarbromal Chemical compound CCC(Br)(CC)C(=O)NC(=O)NC(C)=O SAZUGELZHZOXHB-UHFFFAOYSA-N 0.000 description 1
- 229960001409 acecarbromal Drugs 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 108091005764 adaptor proteins Proteins 0.000 description 1
- 230000001800 adrenalinergic effect Effects 0.000 description 1
- 239000000464 adrenergic agent Substances 0.000 description 1
- 239000000674 adrenergic antagonist Substances 0.000 description 1
- 230000001780 adrenocortical effect Effects 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 230000008484 agonism Effects 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 150000003973 alkyl amines Chemical class 0.000 description 1
- 230000002152 alkylating effect Effects 0.000 description 1
- 230000029936 alkylation Effects 0.000 description 1
- 238000005804 alkylation reaction Methods 0.000 description 1
- 230000008860 allosteric change Effects 0.000 description 1
- 108090000861 alpha Adrenergic Receptors Proteins 0.000 description 1
- 102000004305 alpha Adrenergic Receptors Human genes 0.000 description 1
- OMHBPUNFVFNHJK-UHFFFAOYSA-P ambenonium Chemical compound C=1C=CC=C(Cl)C=1C[N+](CC)(CC)CCNC(=O)C(=O)NCC[N+](CC)(CC)CC1=CC=CC=C1Cl OMHBPUNFVFNHJK-UHFFFAOYSA-P 0.000 description 1
- 229960000451 ambenonium Drugs 0.000 description 1
- 150000001409 amidines Chemical class 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000002269 analeptic agent Substances 0.000 description 1
- 229940051881 anilide analgesics and antipyretics Drugs 0.000 description 1
- 150000003931 anilides Chemical class 0.000 description 1
- 230000000507 anthelmentic effect Effects 0.000 description 1
- 239000000921 anthelmintic agent Substances 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000001466 anti-adreneric effect Effects 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000001754 anti-pyretic effect Effects 0.000 description 1
- 230000003356 anti-rheumatic effect Effects 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- 230000000840 anti-viral effect Effects 0.000 description 1
- 229940125713 antianxiety drug Drugs 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 239000000739 antihistaminic agent Substances 0.000 description 1
- 229940125715 antihistaminic agent Drugs 0.000 description 1
- 239000003430 antimalarial agent Substances 0.000 description 1
- 229940033495 antimalarials Drugs 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 230000006851 antioxidant defense Effects 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 239000002221 antipyretic Substances 0.000 description 1
- 229940125716 antipyretic agent Drugs 0.000 description 1
- 239000003435 antirheumatic agent Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 150000004982 aromatic amines Chemical class 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 229940072107 ascorbate Drugs 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
- 229940009098 aspartate Drugs 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 238000005311 autocorrelation function Methods 0.000 description 1
- NYGHGTMKALXFIA-UHFFFAOYSA-N azapetine Chemical compound C1N(CC=C)CC2=CC=CC=C2C2=CC=CC=C21 NYGHGTMKALXFIA-UHFFFAOYSA-N 0.000 description 1
- 229960004779 azapetine Drugs 0.000 description 1
- 235000015241 bacon Nutrition 0.000 description 1
- 229960002319 barbital Drugs 0.000 description 1
- 125000003310 benzodiazepinyl group Chemical group N1N=C(C=CC2=C1C=CC=C2)* 0.000 description 1
- BNBQRQQYDMDJAH-UHFFFAOYSA-N benzodioxan Chemical class C1=CC=C2OCCOC2=C1 BNBQRQQYDMDJAH-UHFFFAOYSA-N 0.000 description 1
- 150000001558 benzoic acid derivatives Chemical class 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 230000008238 biochemical pathway Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 230000008512 biological response Effects 0.000 description 1
- FFSAXUULYPJSKH-UHFFFAOYSA-N butyrophenone Chemical compound CCCC(=O)C1=CC=CC=C1 FFSAXUULYPJSKH-UHFFFAOYSA-N 0.000 description 1
- VJEONQKOZGKCAK-UHFFFAOYSA-N caffeine Natural products CN1C(=O)N(C)C(=O)C2=C1C=CN2C VJEONQKOZGKCAK-UHFFFAOYSA-N 0.000 description 1
- 229960001948 caffeine Drugs 0.000 description 1
- 150000001728 carbonyl compounds Chemical class 0.000 description 1
- 150000001735 carboxylic acids Chemical class 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 229940124587 cephalosporin Drugs 0.000 description 1
- 238000009614 chemical analysis method Methods 0.000 description 1
- 229960002155 chlorothiazide Drugs 0.000 description 1
- SOYKEARSMXGVTM-UHFFFAOYSA-N chlorphenamine Chemical compound C=1C=CC=NC=1C(CCN(C)C)C1=CC=C(Cl)C=C1 SOYKEARSMXGVTM-UHFFFAOYSA-N 0.000 description 1
- 229960003291 chlorphenamine Drugs 0.000 description 1
- ZPEIMTDSQAKGNT-UHFFFAOYSA-N chlorpromazine Chemical compound C1=C(Cl)C=C2N(CCCN(C)C)C3=CC=CC=C3SC2=C1 ZPEIMTDSQAKGNT-UHFFFAOYSA-N 0.000 description 1
- 229960001076 chlorpromazine Drugs 0.000 description 1
- 229960001552 chlorprothixene Drugs 0.000 description 1
- 229960001523 chlortalidone Drugs 0.000 description 1
- 230000001713 cholinergic effect Effects 0.000 description 1
- PUFQVTATUTYEAL-UHFFFAOYSA-N cinchocaine Chemical compound C1=CC=CC2=NC(OCCCC)=CC(C(=O)NCCN(CC)CC)=C21 PUFQVTATUTYEAL-UHFFFAOYSA-N 0.000 description 1
- 229960001747 cinchocaine Drugs 0.000 description 1
- 229960004414 clomethiazole Drugs 0.000 description 1
- 239000005515 coenzyme Substances 0.000 description 1
- 229920001436 collagen Polymers 0.000 description 1
- 229940000425 combination drug Drugs 0.000 description 1
- 238000006258 combinatorial reaction Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 150000004696 coordination complex Chemical class 0.000 description 1
- BPLKXBNWXRMHRE-UHFFFAOYSA-N copper;1,10-phenanthroline Chemical compound [Cu].C1=CN=C2C3=NC=CC=C3C=CC2=C1 BPLKXBNWXRMHRE-UHFFFAOYSA-N 0.000 description 1
- 238000010219 correlation analysis Methods 0.000 description 1
- 229960004544 cortisone Drugs 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000037029 cross reaction Effects 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000002484 cyclic voltammetry Methods 0.000 description 1
- UVKZSORBKUEBAZ-UHFFFAOYSA-N cyclizine Chemical compound C1CN(C)CCN1C(C=1C=CC=CC=1)C1=CC=CC=C1 UVKZSORBKUEBAZ-UHFFFAOYSA-N 0.000 description 1
- 229960003564 cyclizine Drugs 0.000 description 1
- YLRNESBGEGGQBK-UHFFFAOYSA-N cyclomethycaine Chemical compound CC1CCCCN1CCCOC(=O)C(C=C1)=CC=C1OC1CCCCC1 YLRNESBGEGGQBK-UHFFFAOYSA-N 0.000 description 1
- 229960004741 cyclomethycaine Drugs 0.000 description 1
- 229960003263 cyclopentamine Drugs 0.000 description 1
- HFXKQSZZZPGLKQ-UHFFFAOYSA-N cyclopentamine Chemical compound CNC(C)CC1CCCC1 HFXKQSZZZPGLKQ-UHFFFAOYSA-N 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- RWZVPVOZTJJMNU-UHFFFAOYSA-N demarcarium Chemical compound C=1C=CC([N+](C)(C)C)=CC=1OC(=O)N(C)CCCCCCCCCCN(C)C(=O)OC1=CC=CC([N+](C)(C)C)=C1 RWZVPVOZTJJMNU-UHFFFAOYSA-N 0.000 description 1
- 229960004656 demecarium Drugs 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 229960003957 dexamethasone Drugs 0.000 description 1
- UREBDLICKHMUKA-CXSFZGCWSA-N dexamethasone Chemical compound C1CC2=CC(=O)C=C[C@]2(C)[C@]2(F)[C@@H]1[C@@H]1C[C@@H](C)[C@@](C(=O)CO)(O)[C@@]1(C)C[C@@H]2O UREBDLICKHMUKA-CXSFZGCWSA-N 0.000 description 1
- 229910003460 diamond Inorganic materials 0.000 description 1
- 239000010432 diamond Substances 0.000 description 1
- RAABOESOVLLHRU-UHFFFAOYSA-N diazene Chemical class N=N RAABOESOVLLHRU-UHFFFAOYSA-N 0.000 description 1
- LUZRJRNZXALNLM-JGRZULCMSA-N dihydroergotamine Chemical compound C([C@H]1C(=O)N2CCC[C@H]2[C@]2(O)O[C@@](C(N21)=O)(C)NC(=O)[C@H]1CN([C@H]2[C@@H](C=3C=CC=C4NC=C(C=34)C2)C1)C)C1=CC=CC=C1 LUZRJRNZXALNLM-JGRZULCMSA-N 0.000 description 1
- 229960004704 dihydroergotamine Drugs 0.000 description 1
- 125000005056 dihydrothiazolyl group Chemical group S1C(NC=C1)* 0.000 description 1
- 230000003292 diminished effect Effects 0.000 description 1
- 229960000520 diphenhydramine Drugs 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 229940120889 dipyrone Drugs 0.000 description 1
- 229940042396 direct acting antivirals thiosemicarbazones Drugs 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- OQALFHMKVSJFRR-UHFFFAOYSA-N dityrosine Chemical class OC(=O)C(N)CC1=CC=C(O)C(C=2C(=CC=C(CC(N)C(O)=O)C=2)O)=C1 OQALFHMKVSJFRR-UHFFFAOYSA-N 0.000 description 1
- 239000002934 diuretic Substances 0.000 description 1
- 229940030606 diuretics Drugs 0.000 description 1
- 229940000406 drug candidate Drugs 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 238000009509 drug development Methods 0.000 description 1
- 238000007877 drug screening Methods 0.000 description 1
- 239000003596 drug target Substances 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 229960003748 edrophonium Drugs 0.000 description 1
- 230000002888 effect on disease Effects 0.000 description 1
- 229920002549 elastin Polymers 0.000 description 1
- 238000002848 electrochemical method Methods 0.000 description 1
- 238000007336 electrophilic substitution reaction Methods 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 239000002532 enzyme inhibitor Substances 0.000 description 1
- 229940125532 enzyme inhibitor Drugs 0.000 description 1
- 229960005139 epinephrine Drugs 0.000 description 1
- 229940011871 estrogen Drugs 0.000 description 1
- 239000000262 estrogen Substances 0.000 description 1
- 150000002169 ethanolamines Chemical class 0.000 description 1
- 150000002171 ethylene diamines Chemical class 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 229940032465 fenethylline Drugs 0.000 description 1
- NMCHYWGKBADVMK-UHFFFAOYSA-N fenetylline Chemical compound C1=NC=2N(C)C(=O)N(C)C(=O)C=2N1CCNC(C)CC1=CC=CC=C1 NMCHYWGKBADVMK-UHFFFAOYSA-N 0.000 description 1
- 229950003499 fibrin Drugs 0.000 description 1
- 239000004052 folic acid antagonist Substances 0.000 description 1
- 150000004675 formic acid derivatives Chemical class 0.000 description 1
- 101150002378 gC gene Proteins 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 238000004817 gas chromatography Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 239000003862 glucocorticoid Substances 0.000 description 1
- 229960003711 glyceryl trinitrate Drugs 0.000 description 1
- 229930182470 glycoside Natural products 0.000 description 1
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 1
- VKYKSIONXSXAKP-UHFFFAOYSA-N hexamethylenetetramine Chemical compound C1N(C2)CN3CN1CN2C3 VKYKSIONXSXAKP-UHFFFAOYSA-N 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229960002474 hydralazine Drugs 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- 238000005984 hydrogenation reaction Methods 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 125000001165 hydrophobic group Chemical group 0.000 description 1
- 239000003326 hypnotic agent Substances 0.000 description 1
- 230000000147 hypnotic effect Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 150000002460 imidazoles Chemical class 0.000 description 1
- BCGWQEUPMDMJNV-UHFFFAOYSA-N imipramine Chemical compound C1CC2=CC=CC=C2N(CCCN(C)C)C2=CC=CC=C21 BCGWQEUPMDMJNV-UHFFFAOYSA-N 0.000 description 1
- 229960004801 imipramine Drugs 0.000 description 1
- 238000003365 immunocytochemistry Methods 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000005462 in vivo assay Methods 0.000 description 1
- 230000000937 inactivator Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 229960000905 indomethacin Drugs 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 239000013038 irreversible inhibitor Substances 0.000 description 1
- YGSFZBYOMFZJPV-UHFFFAOYSA-N isobucaine Chemical compound CC(C)CNC(C)(C)COC(=O)C1=CC=CC=C1 YGSFZBYOMFZJPV-UHFFFAOYSA-N 0.000 description 1
- 239000012948 isocyanate Substances 0.000 description 1
- 150000002513 isocyanates Chemical class 0.000 description 1
- 125000000741 isoleucyl group Chemical group [H]N([H])C(C(C([H])([H])[H])C([H])([H])C([H])([H])[H])C(=O)O* 0.000 description 1
- 229940039009 isoproterenol Drugs 0.000 description 1
- 102000048260 kappa Opioid Receptors Human genes 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 229960004502 levodopa Drugs 0.000 description 1
- 229960004194 lidocaine Drugs 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 239000003589 local anesthetic agent Substances 0.000 description 1
- 229960005015 local anesthetics Drugs 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 229960003861 mephenesin Drugs 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- DJGAAPFSPWAYTJ-UHFFFAOYSA-M metamizole sodium Chemical compound [Na+].O=C1C(N(CS([O-])(=O)=O)C)=C(C)N(C)N1C1=CC=CC=C1 DJGAAPFSPWAYTJ-UHFFFAOYSA-M 0.000 description 1
- 229960001797 methadone Drugs 0.000 description 1
- TWXDDNPPQUTEOV-FVGYRXGTSA-N methamphetamine hydrochloride Chemical compound Cl.CN[C@@H](C)CC1=CC=CC=C1 TWXDDNPPQUTEOV-FVGYRXGTSA-N 0.000 description 1
- VRQVVMDWGGWHTJ-CQSZACIVSA-N methotrimeprazine Chemical compound C1=CC=C2N(C[C@H](C)CN(C)C)C3=CC(OC)=CC=C3SC2=C1 VRQVVMDWGGWHTJ-CQSZACIVSA-N 0.000 description 1
- 229940042053 methotrimeprazine Drugs 0.000 description 1
- 125000000956 methoxy group Chemical group [H]C([H])([H])O* 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 125000004573 morpholin-4-yl group Chemical group N1(CCOCC1)* 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 235000010460 mustard Nutrition 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 229960005016 naphazoline Drugs 0.000 description 1
- 230000003533 narcotic effect Effects 0.000 description 1
- 229960002362 neostigmine Drugs 0.000 description 1
- LULNWZDBKTWDGK-UHFFFAOYSA-M neostigmine bromide Chemical compound [Br-].CN(C)C(=O)OC1=CC=CC([N+](C)(C)C)=C1 LULNWZDBKTWDGK-UHFFFAOYSA-M 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 239000002858 neurotransmitter agent Substances 0.000 description 1
- 150000002828 nitro derivatives Chemical class 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 230000036963 noncompetitive effect Effects 0.000 description 1
- 108020004017 nuclear receptors Proteins 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 230000000269 nucleophilic effect Effects 0.000 description 1
- 238000010534 nucleophilic substitution reaction Methods 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- HGASFNYMVGEKTF-UHFFFAOYSA-N octan-1-ol;hydrate Chemical compound O.CCCCCCCCO HGASFNYMVGEKTF-UHFFFAOYSA-N 0.000 description 1
- 229940099990 ogen Drugs 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- 150000001475 oxazolidinediones Chemical class 0.000 description 1
- 229960000986 oxetacaine Drugs 0.000 description 1
- 230000004792 oxidative damage Effects 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 125000004430 oxygen atom Chemical group O* 0.000 description 1
- 229940094443 oxytocics prostaglandins Drugs 0.000 description 1
- NFHFRUOZVGFOOS-UHFFFAOYSA-N palladium;triphenylphosphane Chemical compound [Pd].C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1.C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1.C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1.C1=CC=CC=C1P(C=1C=CC=CC=1)C1=CC=CC=C1 NFHFRUOZVGFOOS-UHFFFAOYSA-N 0.000 description 1
- 229960005489 paracetamol Drugs 0.000 description 1
- 229940055076 parasympathomimetics choline ester Drugs 0.000 description 1
- 239000004031 partial agonist Substances 0.000 description 1
- 238000005192 partition Methods 0.000 description 1
- NRNCYVBFPDDJNE-UHFFFAOYSA-N pemoline Chemical compound O1C(N)=NC(=O)C1C1=CC=CC=C1 NRNCYVBFPDDJNE-UHFFFAOYSA-N 0.000 description 1
- 229960000761 pemoline Drugs 0.000 description 1
- 230000000149 penetrating effect Effects 0.000 description 1
- 108010082406 peptide permease Proteins 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 229940097156 peroxyl Drugs 0.000 description 1
- 229960000482 pethidine Drugs 0.000 description 1
- 239000002831 pharmacologic agent Substances 0.000 description 1
- DDBREPKUVSBGFI-UHFFFAOYSA-N phenobarbital Chemical compound C=1C=CC=CC=1C1(CC)C(=O)NC(=O)NC1=O DDBREPKUVSBGFI-UHFFFAOYSA-N 0.000 description 1
- 229960002695 phenobarbital Drugs 0.000 description 1
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N phenol group Chemical group C1(=CC=CC=C1)O ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 1
- 229950000688 phenothiazine Drugs 0.000 description 1
- 150000002990 phenothiazines Chemical class 0.000 description 1
- 229960003418 phenoxybenzamine Drugs 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- 229960002895 phenylbutazone Drugs 0.000 description 1
- VYMDGNCVAMGZFE-UHFFFAOYSA-N phenylbutazonum Chemical compound O=C1C(CCCC)C(=O)N(C=2C=CC=CC=2)N1C1=CC=CC=C1 VYMDGNCVAMGZFE-UHFFFAOYSA-N 0.000 description 1
- 229960001802 phenylephrine Drugs 0.000 description 1
- SONNWYBIRXJNDC-VIFPVBQESA-N phenylephrine Chemical compound CNC[C@H](O)C1=CC=CC(O)=C1 SONNWYBIRXJNDC-VIFPVBQESA-N 0.000 description 1
- 229960002036 phenytoin Drugs 0.000 description 1
- 239000002953 phosphate buffered saline Substances 0.000 description 1
- 150000003904 phospholipids Chemical class 0.000 description 1
- 230000000704 physical effect Effects 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- PIJVFDBKTWXHHD-HIFRSBDPSA-N physostigmine Chemical compound C12=CC(OC(=O)NC)=CC=C2N(C)[C@@H]2[C@@]1(C)CCN2C PIJVFDBKTWXHHD-HIFRSBDPSA-N 0.000 description 1
- 229960001697 physostigmine Drugs 0.000 description 1
- 150000004885 piperazines Chemical class 0.000 description 1
- LYKMMUBOEFYJQG-UHFFFAOYSA-N piperoxan Chemical compound C1OC2=CC=CC=C2OC1CN1CCCCC1 LYKMMUBOEFYJQG-UHFFFAOYSA-N 0.000 description 1
- 229950007976 piperoxan Drugs 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- XOFYZVNMUHMLCC-ZPOLXVRWSA-N prednisone Chemical compound O=C1C=C[C@]2(C)[C@H]3C(=O)C[C@](C)([C@@](CC4)(O)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 XOFYZVNMUHMLCC-ZPOLXVRWSA-N 0.000 description 1
- 229960004618 prednisone Drugs 0.000 description 1
- 229960003910 promethazine Drugs 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- ULWHHBHJGPPBCO-UHFFFAOYSA-N propane-1,1-diol Chemical class CCC(O)O ULWHHBHJGPPBCO-UHFFFAOYSA-N 0.000 description 1
- 150000003180 prostaglandins Chemical class 0.000 description 1
- 125000006239 protecting group Chemical group 0.000 description 1
- 239000003528 protein farnesyltransferase inhibitor Substances 0.000 description 1
- 230000006916 protein interaction Effects 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 230000007026 protein scission Effects 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000000649 purine antagonist Substances 0.000 description 1
- DNTVKOMHCDKATN-UHFFFAOYSA-N pyrazolidine-3,5-dione Chemical compound O=C1CC(=O)NN1 DNTVKOMHCDKATN-UHFFFAOYSA-N 0.000 description 1
- 239000003790 pyrimidine antagonist Substances 0.000 description 1
- 150000003235 pyrrolidines Chemical class 0.000 description 1
- 238000000163 radioactive labelling Methods 0.000 description 1
- 239000002287 radioligand Substances 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 238000000611 regression analysis Methods 0.000 description 1
- 230000008263 repair mechanism Effects 0.000 description 1
- QEVHRUUCFGRFIF-MDEJGZGSSA-N reserpine Chemical compound O([C@H]1[C@@H]([C@H]([C@H]2C[C@@H]3C4=C(C5=CC=C(OC)C=C5N4)CCN3C[C@H]2C1)C(=O)OC)OC)C(=O)C1=CC(OC)=C(OC)C(OC)=C1 QEVHRUUCFGRFIF-MDEJGZGSSA-N 0.000 description 1
- 229960003147 reserpine Drugs 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- MDMGHDFNKNZPAU-UHFFFAOYSA-N roserpine Natural products C1C2CN3CCC(C4=CC=C(OC)C=C4N4)=C4C3CC2C(OC(C)=O)C(OC)C1OC(=O)C1=CC(OC)=C(OC)C(OC)=C1 MDMGHDFNKNZPAU-UHFFFAOYSA-N 0.000 description 1
- 229910052707 ruthenium Inorganic materials 0.000 description 1
- YAYGSLOSTXKUBW-UHFFFAOYSA-N ruthenium(2+) Chemical compound [Ru+2] YAYGSLOSTXKUBW-UHFFFAOYSA-N 0.000 description 1
- 150000003873 salicylate salts Chemical class 0.000 description 1
- 230000002000 scavenging effect Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 229940076279 serotonin Drugs 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000007614 solvation Methods 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 238000006250 specific catalysis Methods 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 238000000528 statistical test Methods 0.000 description 1
- 230000003335 steric effect Effects 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 235000021286 stilbenes Nutrition 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 230000007019 strand scission Effects 0.000 description 1
- KZNICNPSHKQLFF-UHFFFAOYSA-N succinimide Chemical class O=C1CCC(=O)N1 KZNICNPSHKQLFF-UHFFFAOYSA-N 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 108060007951 sulfatase Proteins 0.000 description 1
- MBDNRNMVTZADMQ-UHFFFAOYSA-N sulfolene Chemical class O=S1(=O)CC=CC1 MBDNRNMVTZADMQ-UHFFFAOYSA-N 0.000 description 1
- 150000003456 sulfonamides Chemical class 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 229910052715 tantalum Inorganic materials 0.000 description 1
- 150000003511 tertiary amides Chemical class 0.000 description 1
- 229960003604 testosterone Drugs 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 239000003451 thiazide diuretic agent Substances 0.000 description 1
- 150000003548 thiazolidines Chemical class 0.000 description 1
- 150000003573 thiols Chemical class 0.000 description 1
- ANRHNWWPFJCPAZ-UHFFFAOYSA-M thionine Chemical compound [Cl-].C1=CC(N)=CC2=[S+]C3=CC(N)=CC=C3N=C21 ANRHNWWPFJCPAZ-UHFFFAOYSA-M 0.000 description 1
- DKGYESBFCGKOJC-UHFFFAOYSA-N thiophen-3-amine Chemical class NC=1C=CSC=1 DKGYESBFCGKOJC-UHFFFAOYSA-N 0.000 description 1
- 229930192474 thiophene Natural products 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 230000036962 time dependent Effects 0.000 description 1
- JIVZKJJQOZQXQB-UHFFFAOYSA-N tolazoline Chemical compound C=1C=CC=CC=1CC1=NCCN1 JIVZKJJQOZQXQB-UHFFFAOYSA-N 0.000 description 1
- 229960002312 tolazoline Drugs 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- IRYJRGCIQBGHIV-UHFFFAOYSA-N trimethadione Chemical compound CN1C(=O)OC(C)(C)C1=O IRYJRGCIQBGHIV-UHFFFAOYSA-N 0.000 description 1
- 229960004453 trimethadione Drugs 0.000 description 1
- 230000005641 tunneling Effects 0.000 description 1
- 150000003668 tyrosines Chemical class 0.000 description 1
- 238000009827 uniform distribution Methods 0.000 description 1
- 238000004148 unit process Methods 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 229940124549 vasodilator Drugs 0.000 description 1
- 239000003071 vasodilator agent Substances 0.000 description 1
- 210000003462 vein Anatomy 0.000 description 1
- 125000000391 vinyl group Chemical group [H]C([*])=C([H])[H] 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 108020001588 κ-opioid receptors Proteins 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/94—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving narcotics or drugs or pharmaceuticals, neurotransmitters or associated receptors
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
Definitions
- This invention relates to an improvement in the art of using combinatorial chemistry to develop drug leads.
- Combinatorial chemistry- permits the rapid and relatively inexpensive synthesis of large numbers of compounds in the small quantities suitable for automated assays directed at molecular targets.
- Numerous small companies and academic laboratories have successfully engineered combinatorial chemical libraries with a significant range of diversity (reviewed in Doyle, 1995, Gordon et al , 1994a, Gordon et al , 1994b) .
- Combinatorial Libraries In a combinatorial library, chemical building blocks are randomly combined into a large number (as high as 10E15) of different compounds, which are then simultaneously screened for binding (or other) activity against one or more targets . Libraries of thousands, even millions, of random oligopeptides have been prepared by chemical synthesis (Houghten et al . , Nature, 354:84-6(1991)), or gene expression (Marks et al . , J Mol Biol, 222:581-97(1991)), displayed on chromatographic supports (Lam et al . , Nature, 354:82-4(1991)), inside bacterial cells (Colas et al .
- the first combinatorial libraries were composed of peptides or proteins, in which all or selected amino acid positions were randomized. Peptides and proteins can exhibit high and specific binding activity, and can act as catalysts. In consequence, they are of great importance in biological systems. Unfortunately, peptides per se have limited utility for use as therapeutic entities. They are costly to synthesize, unstable in the presence of proteases and in general do not transit cellular membranes. Other classes of compounds have better properties for drug candidates. Nucleic acids have also been used in combinatorial libraries. Their great advantage is the ease with which a nucleic acid with appropriate binding activity can be amplified. As a result, combinatorial libraries composed of nucleic acids can be of low redundancy and hence, of high diversity.
- oligonucleotides are not suitable as drugs for several reasons.
- the oligonucleotides have high molecular weights and cannot be synthesized conveniently in large quantities.
- deoxy- and ribo-nucleotides are hydrolytically digested by nucleases that occur in all living systems and are therefore usually decomposed before reaching the target .
- Structure descriptors can be based on a variety of structural features . These approaches provide arrays of molecular descriptors that can be used to assess the similarity of molecules in a library.
- the "activity" database (A) contains the activities against 60 cell lines for 60,000 compounds that have been screened at NCI.
- the similarity in the activity profile against the panel of cell lines can then be calculated for any two compounds, and is generally assessed by a pairwise correlation coefficient (PCC) , which is determined by an algorithm called COMPARE, which calculates the similarity of all of the compounds in the database to a user-supplied "seed" compound.
- PCC pairwise correlation coefficient
- the "target” database (T) has been created for 100 proteins (targets) whose level of expression was determined in the same 60 cell lines. These expression levels were assessed by standard biological techniques that determine either the quantity of expressed protein (e.g., by Western blots or immunocytochemistry) or the quantity of messenger RNA (e.g., by quantitative PCR or Northern blots) for each protein in each cell line. Relation of the A and T databases then provides information on the molecular pharmacology of the compounds in A; inhibition of one of the heavily expressed proteins emerges as a possible mechanism for the activity of the compound.
- a "structure" database (S) contains structural descriptors for a library of 460,000 compounds that includes the compounds in A. Similarities between the structural descriptors can be calculated for all of the compounds in S, so for a given active compound in A, unscreened, but structurally similar, compounds can be identified in S. These unscreened compounds have an increased likelihood of being active in the cell lines for which the screened compounds are active. The latter process therefore provides a means for "lead optimization" after a compound with a given biological activity has been identified.
- the NCI approach in defining the target database (T) is significantly different from that described here in that it relies solely on biological activity assays.
- Kauvar, et al . , Chemistry & Biology, 2: 107-118 (1995) "fingerprinted” over 5,000 compounds by the binding potency (concentration needed to inhibit 50% of the protein's activity) of each compound to each member of a reference panel of eight proteins. (These proteins were selected on the basis of readily assayable activity, broad cross-reactivity with small organic molecules, and low correlation between each other in binding patterns.)
- a screening library of 54 compounds was then selected based on the diversity in their "fingerprints" (inhibitory activity against the reference panel proteins) .
- This "training set” was used to evaluate the similarity of the ligand binding characteristics of a new protein to one of the reference panel proteins.
- a computational surrogate (a weighted sum of two or more reference panel proteins) for the new protein is determined.
- the activity of all fingerprinted compounds to inhibit the activity of the new protein is predicted as the sum of their appropriately weighted inhibitory activities against the component reference proteins of the computational surrogate. Predictions may be improved by testing additional sets of compounds against the new protein. See also L. M. Kauvar, H. 0. Villar. Method to identify binding partners. US Patent 5587293.
- proteins are fingerprinted on the basis of their chemical reactivity, in the presence or absence of a binding partner, rather than on the basis of their biological activity. Therefore, we do not need to identify a large number of diverse affinity molecules for each reference protein.
- proteins are fingerprinted on the basis of their affinity for peptides or nucleic acids in a high-diversity library.
- This library provides a far greater range of conformational variation than is provided by Kauvar' s training set .
- Biomolecule reactivi ty Chemical reactions that modify nucleotides have been extremely successful in probing the structures of complex DNA and RNA molecules ( 16, 17) .
- a reagent that oxidizes nucleic acids by a particular reaction pathway is used to create backbone lesions in the polyanion. The sites of modification are then determined by high-resolution gel electrophoresis.
- a diplatinum compound, PtPop has been used to footprint DNA. Breiner, K. M. , M. A. Daugherty, T. G. Oas and H. H.
- Levine et al oxidized glutamine synthetase by a metal-catalyzed system, and found that a significant number of methionine residues could be oxidized without an increase in proteolytic susceptibility.
- Levine et al suggested engineering proteins to increase the number of surface methionines (for longer half-life) .
- a prospective "query" protein is characterized by a "reactivity descriptor", by which it is related to previously studied proteins (called “library” or “reference”, or “database” proteins) for which both a "reactivity descriptor” and one or more drug leads are known.
- a combinatorial chemical library enriched in or even limited to chemical compounds similar to the drug leads previously identified for the related proteins in the database is then synthesized and screened for binding or other activity against the "query protein” . This invention thereby reduces the number of chemical compounds that must be screened against an unknown protein target, and/or increases the likelihood of "hits".
- the reactivity descriptors here contemplated relate to the reactivity of the target protein (the term "target protein” refers to both “query” and “reference” proteins, both being described by their chemical reactivity) , especially in both a ligand bound and in a free state, with one or more chemical reagents.
- target protein refers to both "query” and “reference” proteins, both being described by their chemical reactivity
- the difference in the two reactivities is characteristic of the part of the protein which is occluded or otherwise shielded as a result of the ligand binding.
- the protected portion of the protein will include the actual ligand binding site.
- the ligands used to define the binding site of the protein may be the natural ligands therefor, if known, or they may be "surrogate ligands".
- a surrogate ligand is a molecule which binds the target protein, and has the potential of binding it at a site at which the target protein is bound by one of its natural ligands.
- Surrogate ligands may be obtained by synthesizing and screening combinatorial libraries.
- the library may be chosen from the point of view of obtaining the most structural diversity for the least synthetic effort, ignoring the suitability of the library members as drugs. For this reason, a preferred surrogate ligand library is a peptide ("BioKey") or oligonucleotide library.
- a query protein is related to reference proteins by its ability to bind similar surrogate ligands.
- a combinatorial oligomeric library of surrogate ligands typically peptides or nucleic acids, is screened, and the oligomers which bind the target protein (and thus are called "aptamers") are characterized to yield "aptamer" or descriptors.
- the aptamer descriptors sequences and, possibly, additional information such as contact points and secondary structure
- identified for the query protein are compared to those identified for the database proteins, and the drug leads previously identified for the database proteins characterized by the most similar surrogate ligands are favored .
- the aptamer are nucleic acids
- the bases involved in protein binding are determined by footprinting techniques.
- the epitope of the nucleic acid may be characterized as a sequence whose elements are unpaired G, A, T, or C, or any of the sixteen possible pairings (matched or mismatched) of those four bases. This sequence, for each surrogate ligand binding a reference protein, may be compared to that of the epitope of each aptamer binding the query protein.
- characterization of the proteins by reactivity descriptors be combined with its characterization by aptamer descriptors (surrogate ligand binding) .
- the work involved may be reduced by first screening a potential surrogate ligand library to obtain "aptamer descriptors" for the target protein, and then using the bound molecules (aptamers) from that library to modulate the chemical reactivity of the protein and thereby help characterize its binding site(s) by means of chemical reactivity descriptors.
- a potential surrogate ligand library to obtain "aptamer descriptors" for the target protein
- the bound molecules (aptamers) from that library to modulate the chemical reactivity of the protein and thereby help characterize its binding site(s) by means of chemical reactivity descriptors.
- our preference is to use peptides to alter reactivity and nucleic acids to generate aptamer descriptors.
- the similarity of the query protein to each of the reference proteins is determined.
- one or more drug leads are known, so these drug leads may be rated or ranked, as drug leads for modulators of the query protein, based on the similarity of their reference protein to the query protein, and, optionally, their own drug characteristics (e.g., potency, half-life, side effects).
- a combinatorial library is then synthesized which is enriched for members which are structurally similar to the aforementioned drug leads.
- Structurally similar members may be identified in a formal manner by use of the chemical structure descriptors available in the art, or more informally through a chemist's expert judgment of structural similarity.
- the chemist may also develop the drug leads without resorting to a combinatorial library, i.e., by synthesizing lead analogues on an individual, noncombinatorial basis.
- the lead analogues are then screened for the ability to modulate the target protein's activity, in vitro or in vivo.
- Successful analogues are added to the database as leads associated with the target protein, which now becomes a reference protein.
- the initial lead discovery occurs through querying a database of reference proteins and their associated descriptors and drugs.
- the combinatorial library is screened primarily for purpose of optimization of these leads, although it is likely to be sufficiently different in structure from the lead so that there will be some secondary lead discovery as well .
- Figure 1 Cartoon showing the reactivity of the transition- metal reagents toward nucleotides.
- Ru (typ) (bpy) 0 2+ (RuO) reacts by hydrogen abstraction at 1' and oxygen transfer at guanine C8.
- Pt 2 (pop) 4 4" reacts by hydrogen abstraction at 4' and 5' by outer-sphere electron transfer at guanine.
- Ru (bpy) 3 3 ⁇ reacts only by outer-sphere electron transfer.
- the reactivity of the complexes towards amino acids should therefore be different for all three reagents as well .
- Figure 3 Scheme showing the solvent accessibility of reactive amino acid residues (*) as a function of folding and binding of the BioKey peptide.
- Figure 4 Scheme showing the assay used to determine the relative rates for modification of a protein by a reagent. The presence of the protein decreases the yield of Form II DNA in a manner related to the rate constant for oxidation of the protein by the reagent.
- FIG. 5A shows the secodary structure of an RNA sequence, with protein contact sites marked with an arrow.
- Fi ⁇ . 5B is a two-dimensional grid representation of the same information.
- the present invention is directed to a method for the more efficient identification of small organic molecules, preferably molecules having a molecular weight of less than 500 daltons, which are pharmaceutically acceptable and which are potent modulators of the biological activity of a protein.
- a pharmacologically active substance elicits a physiological response by interacting with a specialized portion, known as a receptor, of the target cell.
- the substances which are able to elicit that response, by specific interaction with a receptor site are known as agonists .
- a substance able to elicit the maximum response is known as a full agonist, and one which elicits only, at most, a lesser (but discernible) response is a partial agonist.
- a pharmacological antagonist is a compound which interacts with the receptor but without eliciting a response. By doing so, it inhibits the receptor from responding to agonists.
- a competitive antagonist is one whose effect can be overcome by increasing the agonist concentration; a noncompetitive antagonist is one whose action is unaffected by agonist concentration .
- Ligands are substances which bind to receptors, and thereby encompass both agonists and pharmacological antagonists. Ligands which activate (agonize) or inhibit (antagonize) the receptor are here termed modulators .
- a "physiological" antagonist could be a substance which directly or indirectly inhibits the production or release of the natural agonist, or directly or indirectly facilitates its elimination from the receptor site.
- a physiological antagonist of one receptor may be a pharmacological agonist of another receptor, such as one which activates an enzyme which degrades the natural ligand of the first receptor. If a disease state is the result of inappropriate activation of a receptor, the disease may be prevented or treated by means of a physiological or pharmacological antagonist. Other disease states may arise through inadequate activation of a receptor, in which case the disease may be prevented by means of a suitable agonist.
- receptors proteins embedded in the phospholipid bilayer of cell membranes.
- the binding of an agonist to the receptor can cause an allosteric change at an intracellular site, altering the receptor's interaction with other biomolecules.
- the physiological response is initiated by the interaction with this "second messenger” (the agonist is the "first messenger") or "effector" molecule.
- the peptides and nucleic acids used in the present invention can act as agonists (binding to the receptor and causing its activation) , as antagonists (binding to the receptor without activating it, and blocking its activation by agonists) , or as pharmacologically neutral species (binding to the receptor without either activating or blocking it) .
- Enzymes are special types of receptors. Receptors interact with agonists to form complexes which elicit a biological response. Ordinary receptors then release the agonist intact. With enzymes, the agonists are enzyme substrates, and the enzymes catalyze a chemical modification of the substrate. Enzymes are not necessarily integral membrane proteins; they may be secreted, or intracellular, proteins.
- enzymes are activated by the action of a receptor's second messenger, or, more indirectly, by the product of an "upstream" enzymatic reaction.
- drugs may also be useful because of their interaction with enzymes.
- the drug may serve as a substrate for the enzyme, as a coenzyme, or as an enzyme inhibitor. (An irreversible inhibitor is an "inactivator” . )
- Drugs may also cause, directly or indirectly, the conversion of a proenzyme into an enzymes. Many disease states are associated with inappropriately low or high activity of particular enzymes.
- the present invention may be used to identify both agonists and antagonists of receptors. It is not unusual for a relatively small structural change to convert an agonist into a pharmacological antagonist, or vice versa. Therefore, even if the drugs known to interact with a reference protein are all agonists, the drugs in question may serve as leads to the identification of both agonists and antagonists of the reference protein and of related proteins. Similarly, known antagonists may serve as drug leads, not only to additional antagonists, but to agonists as well.
- nucleic acid aptamers and BioKey peptides used in developing descriptors are selected only for their ability to bind the target protein; it is not required that they activate the protein, or inhibits its activation. Some will bind at sites at which agonism/antagonism can occur, others will not. Their purpose is to help characterize the target protein surface, not to serve as drug leads themselves (although the practitioner is free to test the nucleic acids and peptides for agonist/antagonist activity and to use the active ones as leads in the design of active analogues which are more suitable than nucleic acids and peptides per se as drugs) .
- binding partners ligands
- ligands proteins
- binding partner of a protein it is relatively straightforward to study how the interaction of the binding protein and its binding partner affects biological activity.
- inhibitors are likely to affect the biological activity of the protein, at least if they can be delivered m. vivo to the site of the interaction.
- the binding protein is a receptor, and the binding partner an effector of the biological activity, then the inhibitor will antagonize the biological activity. If the binding partner is one which, through binding, blocks a biological activity, then an inhibitor of that interaction will, in effect, be an agonist of the biological activity in question.
- binding sites are typically relatively small surface patches.
- the binding characteristics of the protein may often be altered by local modifications at these sites, without denaturing the protein.
- Electrostatic interactions include salt bridges, hydrogen bonds, and van der Waals forces.
- hydrophobic interaction is actually the absence of hydrogen bonding between nonpolar groups and water, rather than a favorable interaction between the nonpolar groups themselves. Hydrophobic interactions are important in stabilizing the conformation of a protein and thus indirectly affect ligand binding, although hydrophobic residues are usually buried and thus not part of the binding site.
- Peptides have been found to bind proteins at the same sites as those by which the proteins interact with other proteins, macromolecules and biologically significant substances e.g. nucleic acids, lipids and enzyme substrates.
- the potency of an antagonist of a protein may be expressed as an IC50, the concentration of the antagonist which causes a 50% inhibition of a protein's binding or biological activity in an in vitro or in vivo assay system.
- a pharmaceutically effective dosage of an antagonist depends on both the IC50 of the antagonist, and the effective concentrations of the protein and its clinically significant binding partner (s).
- Potencies may be categorized as follows:
- the antagonists identified by the present invention are in one of the four higher categories identified above, and are in any event more potent than any antagonist known for the protein in question at the time of filing of this application.
- the potency of an agonist may be quantified as the dosage resulting in 50% of its maximal effect on a receptor.
- drug lead refers to a compound which is a member of a structural class which is generally suitable, in terms of physical characteristics (e.g., solubility) , as a source of drugs, and which has at least some useful pharmacological activity, and which therefore could serve effectively as a starting point for the design of analogues and derivatives which are useful as drugs.
- the "drug lead” may be a useful drug in its own right, or it may be a compound which is deficient as a drug because of inadequate potency or undesirable side effects. In the latter case, analogues and derivatives are sought which overcome these deficiencies. In the former case, one seeks to improve the already useful drug.
- Such analogues and derivatives may be identified by rational drug design, or by screening of combinatorial or noncombinatorial libraries of analogues and derivatives.
- a drug lead is a compound with a molecular weight of less than 1,000, more preferably, less than 750, still more preferably, less than 600, most preferably, less than 500.
- it has a computed log octanol-water partition coefficient in the range of -4 to +14, more preferably, -2 to +7.5.
- the target protein may be a naturally occurring protein, or a subunit or domain thereof, from any natural source, including a virus, a microorganism (including bacterial, fungi, algae, and protozoa) , an invertebrate (including insects and worms) , or the normal or cancerous cells of a vertebrate (especially a mammal, bird or fish and, among mammals, particularly humans, apes, monkeys, cows, pigs, goats, llamas, sheep, rats, mice, rabbits, guinea pigs, cats and dogs) .
- the target protein may be a mutant of a natural protein. Mutations may be introduced to facilitate the labeling or immobilization of the target protein, or to alter its biological activity (An inhibitor of a mutant protein may be useful to selectively inhibit an undesired activity of the mutant protein and leave other activities substantially intact) .
- the target protein may be, inter alia, a glyco-, lipo-, phospho-, or metalloprotein. It may be a nuclear, cytoplasmic, membrane, or secreted protein. It may, but need not, be an enzyme.
- the known binding partners (if any) of the target protein may be, inter alia. other proteins, oligo- or polypeptides, nucleic acids, carbohydrates, lipids, or small organic or inorganic molecules or ions.
- the biological activity or function of the target protein may be, but is not limited to, being a kinase protein kinase tyrosine kinase
- protease endoprotease exoprotease metalloprotease serine endopeptidase cysteine endopeptidase nuclease Deoxyribonuclease ribonuclease endonulcease exonuclease polymerase DNA Dependent RNA polymerase DNA Dependent DNA polymerase telomerase primase
- the binding protein may have more than one paratope and they may be the same or different. Different paratopes may interact with epitopes of different binding partners. An individual paratope may be specific to a particular binding partner, or it may interact with several different binding partners. A protein can bind a particular binding partner through several different binding sites. The binding sites may be continuous or discontinuous (vis-a-vis the primary sequence of the protein) .
- the target (query or reference) protein may be any protein of interest.
- the information is added to the database.
- a particularly preferred initial target protein is glutathione-S-transferase (GST) , which is chosen because it has been crystallographically characterized with and without a large number of bound inhibitors (37) , the level of expression has been measured in NCI's 60 cell lines ( 13) , the activities of many known inhibitors are available (38) , and macroscopic quantities of peptides that bind to the active site are preparable .
- GST glutathione-S-transferase
- Proteins for which peptide ligands exist and for which expression data are available at NCI include ras ( 39) , src ( 40) , and p53 ( 41) , and other promising proteins for which both kinds of data are likely to be available in the future are the UL44 protein from cytomegalovirus, and hMDM2 protein that binds to p53 ( 41) . Crystallographically characterized targets are of particular interest and utility in the early stages.
- library generally refers to a collection of chemical or biological entities which are related in origin, structure, and/or function, and which can be screened simultaneously for a property of interest .
- combininatorial library refers to a library in which the individual members are either systematic or random combinations of a limited set of basic elements, the properties of each member being dependent on the choice and location of the elements incorporated into it. Typically, the members of the library are at least capable of being screened simultaneously. Randomization may be complete or partial; some positions may be randomized and others predetermined, and at random positions, the choices may be limited in a predetermined manner.
- the members of a combinatorial library may be oligomers or polymers of some kind, in which the variation occurs through the choice of monomeric building block at one or more positions of the oligomer or polymer, and possibly in terms of the connecting linkage, or the length of the oligomer or polymer, too.
- the members may be nonoligomeric molecules with a standard core structure, like the 1, 4-benzodiazepine structure, with the variation being introduced by the choice of substituents at particular variable sites on the core structure.
- the members may be nonoligomeric molecules assembled like a jigsaw puzzle, but wherein each piece has both one or more variable moieties (contributing to library diversity) and one or more constant moieties (providing the functionalities for coupling the piece in question to other pieces) .
- Composite combinatorial library is a mixture of two or more simple libraries, e.g., DNAs and peptides, or benzodiazepine and carbamates .
- the number of component simple libraries in a composite library will, of course, normally be smaller than the average number of members in each simple library, as otherwise the advantage of a library over individual synthesis is small.
- An oligonucleotide library is a combinatorial library, at least some of whose members are single-stranded oligonucleotides having three or more nucleotides connected by phosphodiester or analogous bonds.
- the oligonucleotides may be linear, cyclic or branched, and may include non-nucleic acid moieties.
- the nucleotides are not limited to the nucleotides normally found in DNA or RNA.
- nucleotides modified to increase nuclease resistance and chemical stability of aptamers see Chart 1 in Osborne and Ellington, Chem. Rev., 97: 349-70 (1997).
- For screening of RNA see Ellington and Szostak, Nature, 346: 818-22 (1990).
- the libraries of the present invention are preferably composed of oligonucleotides having a length of 3 to 100 bases, more preferably 15 to 35 bases.
- the oligonucleotides in a given library may be of the same or of different lengths.
- Oligonucleotide libraries have the advantage that libraries of very high diversity (e.g., 10 1S ) are feasible, and binding molecules are readily amplified in vitro by polymerase chain reaction (PCR) . Moreover, nucleic acid molecules can have very high specificity and affinity to targets.
- PCR polymerase chain reaction
- this invention prepares and screens oligonucleotide libraries by the SELEX method, as described in King and Famulok, Molec . Biol. Repts . , 20: 97-107 (1994) ; L. Gold, C. Tuerk . Methods of producing nucleic acid ligands, US#5595877; Oliphant et al . Gene 44:177 (1986).
- aptamer is conferred on those oligonucleotides which bind the target protein. Such aptamers may be used to characterize the target protein, both directly (through identification of the aptamer and the points of contact between the aptamer and the protein) and indirectly (by use of the aptamer as a ligand to modify the chemical reactivity of the protein) .
- a peptide library is a combinatorial library, at least some of whose members are peptides having three or more amino acids connected via peptide bonds.
- the peptides may be linear, branched, or cyclic, and may include nonpeptidyl moieties.
- the amino acids are not limited to the naturally occurring amino acids.
- a biased peptide library is one in which one or more (but not all) residues of the peptides are constant residues.
- the individual members are referred to as peptide ligands (PL) .
- an internal residue is constant, so that the peptide sequence may be written as
- Xaa is either any naturally occurring amino acid, or any amino acid except cysteine
- m and n are chosen independently from the range of 2 to 20
- the Xaa may be the same or different
- AA X is the same naturally occurring amino acid for all peptides in the library but may be any amino acid.
- m and n are chosen independently from the range of 4 to 9.
- AA X is located at or near the center of the peptide. More specifically, it is desirable that m and n are not different by more than 2; more preferably m and n are equal. Even if the chosen AA X is required (or at least permissive) of the target protein (TP) binding activity, one may need particular flanking residues to assure that it is properly positioned. If AA X is more or less centrally located, the library presents numerous alternative choices for the flanking residues. If AA X is at an end, this flexibility is diminished.
- the most preferred libraries are those in which AA ! is tryptophan, proline or tyrosine.
- Second most preferred are those in which AA X is phenylalanine, histidine, arginine, aspartate, leucine or isoleucine.
- Third most preferred are those in which AA X is asparagine, serine, alanine or methionine.
- the least preferred choices are cysteine and glycine. These preferences are based on evaluation of the results of screening random peptide libraries for binding to many different TPs .
- Ligands that bind to functional domains tend to have both constant as well as unique features. Therefore, by using “biased” peptide libraries, one can ease the burden of finding ligands. Either “biased” or “unbiased” libraries may be screened to identify "BioKey” peptides for use in developing reactivity descriptors, and, optionally, peptide aptamer descriptors and additional drug leads.
- a “descriptor” (also known as a parameter, character, variable, or variate) is a numerically expressed characteristic of a compound (which may be a protein, or a protein ligand) , which helps to distinguish that compound from others.
- a descriptor value need not be absolutely specific to a compound to be useful.
- the characteristics may be pure structural characteristics (as in a "structural descriptor") or they may refer to the compound's interaction with other compounds, such as a binding interaction (as in an "aptamer descriptor") or a chemical reaction (as in a "reactivity descriptor”) .
- “Paired Descriptors” are descriptors of the same property as measured in two different molecule.
- a “descriptor array”, “list”, or “set” is an array, list or set whose elements are different descriptors for the same molecule.
- a plurality of comparable descriptors for two compounds may be used to calculate a similarity for the two compounds.
- the descriptors used in making this calculation in the present invention, for two proteins, will include (a) at least one chemical reactivity descriptor, and/or (b) at least one peptide or oligonucleotide affinity descriptor, and preferably both.
- the similarity calculation may optionally consider other descriptors, such as structural descriptors of known ligands, as well.
- a set of n-descriptors defines an n-dimensional descriptor space; each compound for which a descriptor set is available may be said to occupy a point in descriptor space.
- the dissimilarity of two compounds may be expressed as a distance between the two points which they occupy in descriptor space.
- a similarity measure or coefficient quantifies the relationship between two individuals (compounds) , given the values of a set of variates (descriptors) common to both. Similarity coefficients are usually defined to take values in the range of 0 to 1.
- One commonly used ' measure of similarity is the product moment correlation coefficient. Its correlation is unity whenever two profiles are parallel, regardless of how far apart they are in level. Two profiles may have correlation of +1 even if they are not parallel, provided that the two sets of scores are linearly related.
- the Jaccard or Sneath coefficient modifies the simple matching coefficient by ignoring bits which in both i and j_ are zero, i.e., by ignoring negative matches (mutual absences). In other words, it is obtained by dividing the number of bits which are set in both descriptor bit strings, and dividing by the total number of bits set in either descriptor string. It is also called the unweighted Tanimoto coefficient.
- Gower has defined a general similarity coef f icient which can be used for binary, qualitative , and quantitative data :
- Descriptors may be quantitative or qualitative. Quantitative descriptors may be integers or real numbers. Qualitative descriptors divide the data into categories which may be, but need not be, expressible as having relative magnitudes. Binary descriptors are a special case of qualitative descriptors, in which there are just two categories, typically representing the presence or absence of a feature. Qualitative data for which the variates have several levels may be treated like binary data with each level of a variate being regarded as a single binary variable (i.e., an eight level variate expressed as eight bits) . Or the levels may be numbered sequentially (i.e., an eight level variable expressed as three bits) . The reactivity descriptors are preferably quantitative in form.
- aptamer descriptors are expressed as nucleic acid sequences, with or without secondary structure and protein contact information, they are qualitative in form. If they are expressed as 2D fingerprints, they are a string of binary data. Gower's coefficient, for qualitative data, only credits exact matches of the variate. For aptamers, it is more useful to evaluate the similarity of the sequences by a BLAST type analysis rather than to simply state whether aptamers are the same or different.
- the greater the distance the less the similarity.
- Descriptors may be weighted (or otherwise transformed) for any of several reasons, including:
- the raw descriptor values may be, but need not be, transformed prior to use in calculating distances.
- Typical transformations are (a) presence (l)/absence (0), (b) ln(x+l), (c) frequency in sample, (d) root, and (e) relative range, i.e., (value-min) / (max-min) .
- Descriptor weights may be adjusted empirically on the basis of specially designed test sets.
- a training set of proteins is identified. Descriptors are evaluated for each protein in the set.
- a training set of compounds including are also tested against each compound in the set. These compounds are chosen so that, for any protein in the set, there is at least one compound which is an agonist or antagonist for it.
- a neural net with the descriptor weights as inputs, is used to predict the activity of each compound against each protein, using the calculated protein similarities. For example, it will calculate the similarity of protein x to all other proteins, then treat the activities of the compounds against the other proteins as "knowns" and use it to predict the activity of the compounds against protein x. This is done repeatedly, with each protein taking on the role of protein x, in turn.
- the training set proteins may be chosen for high diversity, e.g., insignificant sequence similarities and/or unrelated biological activities. If the plan is to use the library for lead optimization, then the training set proteins might be members of a family of homologous proteins.
- the coefficient of variation may be useful in comparing descriptors; it is the standard deviation divided by the mean. If there is no information available about the ultimate significance of a descriptor, one may give a greater weight to descriptors which have a larger CV and hence a more uniform distribution.
- Standard mathematical methods such as cluster analysis, principal components analysis, or partial least squares analysis, may be used to determine which descriptors are strongly correlated and to replace them with a new descriptor which is a weighted sum of the original correlated descriptors.
- One may alternatively choose (perhaps randomly) one of each pair of highly correlated descriptors and simply prune it, thereby reducing the amount of data which must be collected.
- Distances may be calculated on the basis of any of a variety of distance measures known in the statistical arts.
- the "cosine theta” distance is the cosine of the angle between the vector from the origin to point X lk and the vector from the origin to point X_ .
- the Mahalanobis distance allows for correlations between variables; if the variables are uncorrelated, D 2 is equivalent to Euclidean distance measured using standard variables.
- the Calhoun distance uses only rank orders; for molecules i and i, the distance is the proportion of the entire set (excluding i and j.) that have descriptor states intermediate between that for i and that for j_ for one or more of the descriptors k.
- a distance measure may be transformed into a similarity measure by any of a variety of transformations that convert a non-negative number to the range 0..1, e.g.,
- S 13 l- (d 13 /d amax )
- s 13 the fraction of the pairs for which the distance is greater than or equal to d ⁇ . This is a measure of relative similarity.
- the diversity of a set of compounds, as measured by a set of descriptors, may be calculated in several ways.
- a purely geometric method involves assuming that each compound sweeps out a hypersphere in descriptor space, the hypersphere having a radius known as the similarity radius.
- the total hypervolume in descriptor space of points within a unit similarily radius of one or more of the compounds is calculated. This is compared to the hypervolume achievable if none of hypersphere ' s overlap; i.e., to n * volume of a single hypersphere, where n is the number of compounds in the set.
- the swept hypervolume may be determined exactly, or by Monte Carlo methods.
- the ratio of the swept hypervolume to the maximum hypervolume is a measure of compound set diversity, ranging from 1 (maximum) to 1/n (minimum) .
- Another approach is to calculate all of the pairwise distances between compounds in descriptor space.
- the mean distance is a measure of diversity. If desired, this can be scaled by calculating the ratio of the mean distance to the maximum theoretical distance.
- a third approach is to apply cluster analysis to the set of compounds .
- the method used should be one which does not set the number of clusters arbitrarily, but rather decides the number based on some goodness-of-fit criterion.
- the resulting number of cluster is a measure of diversity, as is the ratio of the number of clusters to the number of compounds .
- the sum of H(k) for all k is a measure of overall diversity. Standard techniques may be used to correct for correlation.
- Binding sites recognize ligands based on complementarity of both shape and functionality.
- the functionality in the binding site is an array of recognizable groups (hydrophobic, hydrogen bond donors, hydrogen bond acceptors, IT stacking) that is complementary to the ligand ( 9) .
- Many of these functional groups are reactive towards common chemical modification reagents, such as hydroxyl radical, Mn0 4 " , NaBH 4 , and dimethyl sulfate ( 10) , and towards "designer” reagents usually based on transition-metal complexes ( 11 , 12) .
- reagents can react with functional groups via a wide range of pathways, including one-electron oxidation, oxygen-atom transfer, hydrogen-atom abstraction, hydrogenation, one-electron reduction, hydrolysis, and alkylation.
- the array of functional groups in a binding site will therefore exhibit a unique array of rates of reactivity to each of these reagents. This array of rates will provide a set of descriptors for the chemical functionality of the binding site. Therefore, the set of descriptors can be obtained by measuring the relative rates of the functional groups in the binding site against a panel of chemical modification reagents.
- the reactivity descriptors are binding site specific.
- the structure of lysozyme shows that there are two very important tryptophan residues in the binding site, so it will be important that the measured rate is for those and not for the other trp residues in the protein.
- the binding site trp residues are by far the most accessible and will probably dominate the measured reactivity. The easiest way to assess this point will be to measure the reaction rates with and without a ligand (such as a peptide BioKey) that blocks the binding site. The difference in rates is then the reactivity of the binding site.
- the chemical modification reagent can be covalently attached to the ligand and delivered directly to the binding site.
- the presently preferred reagents are transition-metal complexes -- e.g., Pt 2 (pop) 4 4" , Ru(tpy) (bpy)0 + , and Ru (bpy) 3 3+ -- whose rate constants can be measured by optical spectroscopy or Stern-Volmer quenching.
- the oxoruthenium (IV) system has been very useful in discerning the important kinetic and thermodynamic factors in DNA oxidation, because the precise quantity of oxidant is known and the fate of all oxidizing equivalents can be quantitated.
- the reactivity pathways available to the oxoruthenium (IV) and Pt 2 (pop) 4 4 ⁇ complexes include inner-sphere reactions where there is significant bond- breaking or bond-making in the transition state that involves the metal complex. These are the reaction pathways described above that lead to hydrogen abstraction from nucleotides. Outer-sphere pathways are ones where only an electron is transferred by tunneling from one reactant to another. The advantage of these pathways is that there is a spherically symmetric distance dependence to the reaction probability while inner-sphere pathways generally require a specific approach of the reagent on the reactant. This difference will be important here in defining a diverse set of reaction pathways of the reagents with the protein targets.
- Reactivi ty prof iles The amino acid "reactivity profiles" will be determined for each of the proposed reagents.
- the rate constants for the reagents may be determined with any or with all 20 conventional amino acids by methods described below. These 20 rate constants will then provide a profile for each reagent.
- the desired result is that there is a different profile for each reagent. For example, if Ru (tpy) (bpy) 0 2+ reacts mostly with C-H donors like serine and threonine but Ru(bpy) 3 3+ reacts mostly with one electron donors like tyrosine and tryptophan, then each descriptor and the relationship between the two descriptors will be informative.
- the rate constants will be measured for the free amino acids and a representative set of di- and tri-peptides to show that the individual rate constants add together linearly and can be discriminated by the three reagents .
- the three preferred reagents and their reactivities are well suited to discriminating the 20 amino acids.
- Pt 2 (pop) 4 4" abstracts hydrogen atoms from weak C-H bonds in organic substrates. Shown in Figure 2 are some of the amino acids with the C-H bonds likely to be activated by Pt 2 (pop) 4 4" highlighted.
- the C-H activation chemistry of Ru(tpy) (bpy)0 2+ is distinct from that of Pt 2 (pop) 4 4" in that inner-sphere adducts of the ruthenium-oxo linkage are often formed ( 19, 34) , which favors activation of alcohols over aliphatic functionality.
- Ru(tpy) (bpy)0 2+ is likely to prefer serine, threonine, and cysteine more than Pt 2 (pop) 4 4" .
- Inner-sphere chemistry on the tryptophan and histidine rings or terminal amines of argninine are also likely with Ru(tpy) (bpy)0 2+ .
- the rate constants for a large array of amino acids will be approximately the average for all of the amino acids separately, weighted by the solvent accessibility of the individual groups in the folded protein. Therefore, the rate constant for a dipeptide should be the average of the rate constants for each individual amino acid, and similarly to larger peptides as long as no secondary structure develops to attenuate the reactivity of certain groups that are protected by the tertiary structure. It will be instructive to measure the rate constants for some representative random-coil peptides to show how simply linking the amino acids modulates the observed rate constants.
- the rate constants may be measured for the three transition-metal complexes under three conditions: denatured protein, folded protein, and folded protein with bound surrogate (BioKey) peptide or nucleic acid.
- the objective is for the difference in rate constants for folded and denatured protein to give a quantitative descriptor of the surface area of the folded protein (weighted by the solvent-exposed amino acids) and for the rate constants with and without the bound BioKey molecule to give a quantitative descriptor of the amino acids in the binding site.
- a complication is that the bound BioKey may present new solvent-accessible residues that react with the transition-metal complex.
- BioKey peptides with a scrambled amino acid sequence will be included in the reaction with the protein alone, so that the reactive functionality in the BioKey will be present in both reactions.
- Unfolded proteins may be generated chemically by addition of a denaturing agent, such as urea or guanidinium hydrochloride; we have shown elsewhere that urea does not deactivate our reagents (24) .
- a denaturing agent such as urea or guanidinium hydrochloride
- thermal denaturation in cases where the chemical denaturant is problematic, we will use thermal denaturation; in the case of Pt 2 (pop) 4 " , we have shown that thermal denaturation does not alter the selectivity of the reagent with biomolecules except as modulated by the change in biomolecular structure.
- the rate constant for the peptide may be determined separately and used to correct for additional oxidation of peptide side chains not blocked by the protein.
- the rate constant for a peptide with the same amino acids, but with a scrambled sequence, may also be of interest.
- the protein oxidation rates are measured either by quantitating the disappearance of the reagent by standard analytical methods, quantitating the disappearance of the protein by mass spectrometry, amino acid analysis, changes in fluorescence of probes that bind to proteins, or competition with DNA plasmids that are cleaved by the reagent. Rates are compiled for all proteins for each analytical method and each protein state (folded, unfolded, BioKey) . Each of these rates is a new quantitative descriptor of the protein. If one measures the rate of disappearance of the reagent, this will reflect its action on all amino acids.
- the measurement will be more amino acid-specific. Any or all of these approaches may be used to generate descriptors.
- the reactivities may be weighted by the solvent accessibility of each residue as prescribed by the folded structure, so the difference in rates for the folded and unfolded proteins will give a quantitative description of the degree of folding and of the chemical functionality on the solvent-accessible surface.
- the BioKey peptide When the BioKey peptide is bound to the active site, the active site residues will be blocked. The difference in the rates with and without the BioKey will therefore be a quantitative descriptor of the number and kind of residues in the active site.
- the collected data are used to generate a new database containing in vi tro descriptors of protein targets.
- the database contains entries for relative rates in the unfolded, folded, and ligand-bound states.
- the relative rates are normalized on a scale from zero to one and entered into the database in a three-dimensional matrix where each point corresponds to a particular reagent for a given protein in one of the three states. More dimensions may be added by including amino acid-specific rate measurements, such as surface hydrophobicity changes.
- proteins that exhibit large changes in rate from unfolded to folded states are those with compact structures and a large number of buried residues. Likewise, large changes in rate between folded and BioKey-bound states indicates large active sites.
- the three reagents described thus far can be generalized by substitution and still be amenable to real-time characterization.
- the R (tpy) (bpy) 0 2+ complexes can be substituted with electron-donating and releasing substituents that will increase or decrease the driving force for oxidation and hence the reactivity and selectivity for different groups in the polymer.
- Ru (bpy) 2 (4-NMe 2 -py) 0 2+ only abstracts 1' hydrogens from thymidine sugars and not from A, C, or G whereas the parent Ru(tpy) (bpy)0 2+ complex oxidizes all four nucleotides ( 42) .
- these complexes can be substituted with sterically differentiated groups that will modulate the selectivity based on the solvent accessibility.
- steric effects lower the reactivity towards guanine oxidation in duplex DNA without changing the reactivity toward sugar 1' hydrogens (20) .
- Reactivity of one-electron oxidants based on Ru(bpy) 3 3+ can be modulated by electron-releasing or withdrawing groups or by making the complex significantly larger or smaller, which will change the electron-transfer distance (27) .
- these changes should modulate the reactivity profile and the effects of solvent accessibility. Such changes in profile will create reagents that return differentiated descriptors for the protein targets .
- transition-metal reagents are attractive because of the facility with which they can be modified and the ease with which absolute rate constants can be measured ( 17) ; however, mining the potential information available from the wide range of known chemical modification reagents requires general methods for measuring the relative rates.
- chemical modification reactions have been much more widely applied to study of nucleic acid structure than to study of protein structure because of the greater lability of the phosphodiester backbone. Indeed, reactions that modify DNA nucleotides lead to strand scission if not immediately than almost always after base treatment ( 43) .
- a very sensitive method for measuring DNA modification is by plasmid isomerization ( 44 , 22) . This method is therefore also a sensitive method for measuring the instantaneous concentration of the modification reagent.
- a parallel approach described below is to assess the extent of modification of different families of amino acids.
- the change in surface hydrophobicity of a protein can be assessed by the change in fluorescence of 8-anilino-l- naphthalene-sulfonic acid (ANSA) ( 45) .
- ANSA 8-anilino-l- naphthalene-sulfonic acid
- An increase in hydrophobicity measured this way is observed upon protein oxidation, which is very well correlated with formation of tyrosine dimers and oxidation of methionine ( 36) .
- Hydrophobicity and other methods that monitor modification of specific sets or subsets of amino acids, such as changes in reactive carbonyl group or formation of oxidized methionine
- the one-electron reagents based on Ru(bpy) 3 3+ can also be analyzed by changes in optical spectra.
- Plasmid Isomerization The method for measuring the rate constants by competition with plasmid isomerization is shown in Figure 4. Reaction of the reagent of interest with supercoiled (Form I) DNA leads to nicking of the DNA to produce a nicked plasmid (Form II) , which is readily separated from Form I on an agarose gel ( 44 , 22) . Because the plasmid is as much as 4 kb in length, the plasmid isomerization assay is very sensitive to small amounts of DNA damage.
- reaction is then repeated in the presence of the protein for which a relative rate constant is desired.
- protein is modified instead of the plasmid, less isomerization is observed.
- This assay can then be performed for any reagent that modifies both DNA and amino acids .
- a drawback to the approach in Figure 4 is the that the assay cannot be performed for DNA-binding proteins; however, the simplicity and low quantities of material required are attractive.
- a more general strategy would be to analyze the oxidized protein by matrix-assisted laser-desorption ionization (MALDI) mass spectrometry ( 45) .
- MALDI matrix-assisted laser-desorption ionization
- This method can be used to detect small changes in molecular weight of large proteins on 0.1 pmol of protein (for a recent example, see ( 45) ) .
- the mass spectrometry will simply be used to detect the change in the concentration of the unmodified protein as a means for determining the rate of modification by the reagent.
- a final general approach to measuring the relative rates would be to measure the quantity of reagent before and during reaction with the protein. This method could involve traditional analytical chemistry techniques such as HPLC or GC for suitable substrates. The use of isotopically labeled reagents will provide the desired sensitivity. As discussed above, a separate strategy for measuring the rates would be rather than to measure the total rate for all of the amino acids, to use detection methods that sample a subset of the amino acids. Such assays would involve quantitation of reactive carbonyl in the oxidized protein following the reaction, analysis for individual oxidized amino acids such as methionine sulfoxide, or measuring the change in hydrophobicity by ANSA emission ( 36) . These methods would then provide further specificity in the descriptors that would be meaningful when the same detection methods were compared for different proteins.
- the reactivity desciptors should be predictable from the three- dimensional structure and the reactivity profile, which describes the inherent chemical reactivity of each amino acid towards the reagent.
- the three-dimensional structure can be used to weight each amino acid by its solvent accessibility in the folded protein, which can then be weighted by its inherent reactivity. The ability to perform these calculations carefully will depend on accurate reactivity profiles, which is why choosing transition-metal reagents where absolute rate constants can be measured is vital in the initial studies.
- rates are measured in the presence of the BioKey peptide, again the reactivity should be predictable from the three-dimensional structure and the reactivity profile.
- the amino acid composition of protein binding sites and surfaces has been determined in 50 proteins whose crystal structures with bound ligands are known ( 9) . These studies show that Trp, His, Arg, and Tyr are much more abundant in binding sites contacting bound ligands than in general in the protein. Gly and Ser are often found near the bound ligand as well; however, these residues are generally abundant throughout the protein. Therefore, reagents that are specific for Trp and His, which are found at very low frequencies outside the binding site, will be very informative. Attractive reagents include Ru(tpy) (bpy)0 2+ , which oxidizes ring nitrogens, reagents that alkylate ring nitrogens, or oxidants that form amine oxides at ring nitrogens.
- the collected data will be used to generate a new database containing in vi tro descriptors of protein targets, which we will call R (for reactivity or recognition) to distinguish it from NCI's S, A, and T databases.
- R for reactivity or recognition
- the database preferably contains information regarding reaction rates in the unfolded, folded, and ligand-bound states.
- Rate constants may provided for several different ligand-bound states, i.e., with different ligands.
- the rate information is expressed in relative terms, and normalized on a scale from zero to one.
- the rate information may be expressed in difference form, e.g., by providing the (rate folded-rate unfolded), (rate ligand bound-rate folded) , (rate ligand bound-rate unfolded) , and/or (rate ligand 1 bound-rate ligand 2 bound) .
- the data may be conceptualized as a three-dimensional matrix where each point corresponds to a particular reagent for a given protein in one of the three states, although its most efficient representation is likely to be that of a relational database .
- the various databases are normalized in accordance with standard database programming practice to minimize the duplication of information.
- one database may be of reactions.
- Each record in this database will contain a reaction ID, and will normally contain additional information about the reaction, such as the chemical name of the reagent, the reaction conditions (e.g., temperature, solvent) , and assay method. If a reagent is used under many different reaction conditions, is may be appropriate to also create a reagent database, with a reagent ID field used to link the reaction and reagent databases, and information about the reagent placed in the other fields of the reagent database record.
- Another database may tabulate target proteins.
- Each record in this database will contain a target ID, and may optionally contain additional information about the target protein, such as its name, biological activity, sequence, etc.
- a third database could provide the target protein state.
- Each record of the "state" database will have a state ID field, and additional fields which identify whether the state is a folded or unfolded protein and, if folded, whether a ligand is bound to it.
- a ligand could be identified by a ligand ID acting as a lookup field for retrieval of information from a ligand database .
- the reaction, protein, and protein state databases are relationally linked to the reactivity database, with the reaction ID used for lookup of a reaction and the target protein ID used for lookup of a target protein.
- each record in the reactivity database will contain fields for the reaction ID, the target protein ID, and the result of the reaction (e.g., a rate constant) .
- reaction database There is a many-to-one relationship between the reaction database and the reactivity database, and a one-to-many relationship between the reactivity database and the protein database.
- each record identifies a different type of assay. Proteins may have more than one activity, and have a different spectrum of relevant drugs for each. Therefore, in a preferred interaction database, each record will include a drug ID, a protein ID, an assay ID, and an assay value or potency. Each ID field is a relational link to another database .
- the R database Once the R database is compiled, it will be used to improve the efficiency of lead generation as hereafter described. As discussed above, the initial database will be compiled for proteins that bind known inhibitors and have well understood biology and expression profiles. Once a sufficient number of these proteins are in the database, reactivity descriptors may be determined for proteins that have not been screened against combinatorial libraries, whose three-dimensional structures are not known, and/or whose biological profile has not yet been determined. It will be appreciated that there is no fixed minimum number of proteins which must be in the database before it can provide useful information about a protein of interest.
- the database will provide data on at least 50, more preferably at least 200, still more preferably at least 1000, and most preferably every known protein. Another factor is the diversity of the proteins in the database. The more diverse the proteins, the more likely it is that at least one database protein will have a reasonably similar reactivity.
- the reactivity descriptor for a protein is the set of rate constants (or rate constant differences) for all of the characterizing reactions to which that protein has been subjected.
- the reactivity descriptors will be entered into R and the similarity between the unknown protein and the known proteins in the database will be determined as described in the section on "Descriptors" .
- the utility of a reactivity database is dependent, not only on the number of proteins in the database, but also on the number of reactions to which the protein was subjected, and the number of protein states examined. All else being equal, the more data points, the greater the degree of characterization. Preferably, there are at least 1, more preferably at least 3, still more preferably at least 10, most preferably at least 100, datapoints (target protein/state/reaction triplets) . While less easily defined, the greater the diversity in the characterizing reactions, the more useful the database will be.
- Relation to databases with information on activity (such as A) will predict what types of chemical compounds will most likely bind to the unknown target, because proteins that appear similar in R will bind similar compounds. This procedure will be analogous to calculating similarities between chemical compounds (such as in S) and predicting that compounds similar to a known inhibitor will also inhibit similar targets.
- Relation to an expression database such as T will provide pharmacological information on the unknown target.
- An aptamer-based descriptor is a description of a protein in terms of the aptamers which recognize it. While peptides could serve as aptamers, the preferred aptamers are nucleic acids.
- Such a descriptor is not a simple numerical value. Rather, it is a list of sequences (and, preferably, secondary structures and contact points) for each of the aptamers identified as binding a particular protein. In this section, we will describe how nucleic acid aptamers are identified and characterized, and how the similarity of the aptamer-based descriptors for two different proteins may be calculated.
- a single-stranded oligonucleotide library is screened to identify aptamers which bind a protein with a desired affinity.
- This protein may be a reference protein with known drug antagonists, or the target protein for which such antagonists are to be identified.
- the desired aptamers are amplified and sequenced.
- the aptamers serve to characterize the protein in that only those oligonucleotides which can conform to the surface of the protein will bind to it. One may say that the aptamers take "impressions" of the protein surface. Some of the aptamers may be expected to bind the protein at a site corresponding or, overlapping, or otherwise occluding the functional site of the protein. (Such aptamers may, if desired, be identified by screening the aptamers for antagonist activity.) Others will bind at sites distal to the functional site. Some will bind to the same site, others, to different sites. All contribute to a "picture" of the protein.
- the contact sites (the bases within the aptamer) through which the aptamer contacts the protein are identified.
- a preferred means for such identification is a footprinting reaction where chemical modification of the nucleic acid is performed with and without the bound protein; sites where bound protein blocks chemical modification are contact sites. Footprinting of nucleic acids in this manner can be achieved using enzymes such as DNAse or chemical reagents such as copper-phenanthroline (Papavassiliou, A.G. Biochem. J. 1995, 305, 345-357), Fe(EDTA) 2 (Pogozelski, et al . , J. Am. Chem. Soc .
- the secondary structure of at least the contacting bases of the oligonucleotide is analyzed.
- the secondary structures of the aptamers can be predicted using the approach of Zichi et al . (J.P. Davis, N. Janjic, D. Pribnow, D.A. Zichi, Nucleic Acids Res.
- the descriptor for each aptamer preferably identifies not only the overall sequence of the aptamer, but also the contact site.
- the bases of the contact site are preferably described not only by identifying the base itself, but also indicating whether in the secondary structure of the aptamer it is paired to any base, and if so what.
- two-letter codes are used for each nucleotide.
- the contact nucleotides are followed by the small letter "o", so the contacts are Ao, To, Go, and Co for a DNA aptamer.
- the contact sites are followed by a small letter representing the nucleotide on the opposite strand.
- At the entire list of double-stranded codes.
- An alternative to comparing the footprinted sequence in a linear array would be to develop a two-dimensional projection of the secondary structure of the aptamer. For example, suppose the aptamer at the right is selected for a given target and the sites indicated with an arrow are determined to be protected by the target via Pt-pop footprinting. The aptamer can then me mapped onto a two-dimensional grid where the contact sites are coded as before and the remaining "placeholder" sites are coded as either Nn for a base pair or mismatch site or No for a single-stranded site (see Figure 5) . There is no need to differentiate the base pair and mismatch placeholder sites because this information is already in the contact site codes.
- the two-dimensional grid can now be analyzed for similarity to other two-dimensional representations by graph-theoretical approaches, such as those used for determining compound similarity (as in Patterson et al . , J. Med. Chem., 1996).
- the contact sites and secondary structure are used to develop a consensus "functionality sequence" (epitope) , which represents both the primary sequence and the secondary structure of the nucleic acid moiety which contacts the target protein.
- These functionality sequences are entered into a relational database that is used to analyze functionality sequences for unknown targets.
- Targets with homologous functionality sequences bind small molecules of a similar nature.
- Sequence identity among aptamer contact sites may be determined using an adaptation of BLAST from the National Center for Biotechnology Information.
- the BLAST (Basic Local Alignment Search Tool) algorithm is described in S.F. Altschul, W. Gish, W. Miller, E.W. Myers, andD.J. Lipman, J. Mol. Biol. 215, 403- 10 (1990) .
- a new version that allows the functionality (secondary structure) codes (At, Ta, Ao, etc.) to be compared can be adapted from art in the public domain.
- an identity matrix is used. If the aligned functionality elements are the same, the score for that pair of elements is one, if they are different, the scope is zero. The individual scores are summed and divided by the total number of elements.
- a second approach would be to weight more highly the alignments least likely to occur by chance alone. This would require tabulating predicted secondary structure information for a large number of DNA sequences. This could be done by (1) randomly generating DNA sequences of the length and base composition to be used in the library, (2) predicting the secondary structure of each sequence by standard methods, (3) converting the string of bases into a string of "functionality elements” and (4) calculating the probability of occurrence of each alignment.
- this weighting scheme is not limited to protein contact sites; all bases are considered.
- a base pair (Gc, Cg, At, Ta) is likely to be more common, and hence less informative, than a mismatch (Gt, Gg, Ga, Ag, etc.) or a single-stranded nucleotide (Go, Ao, To, Co) .
- These elements could therefore be weighted appropriately to allow the elements that have the greatest information to be considered most heavily in determining similarity. For one possible weighting, see Ex. 3.
- nucleic acids are preferred, because they can be amplified, peptides may also be used to generate aptamer-based descriptors. Preferably, these peptides are 5-10 a. a. in length.
- the peptide library is screened for binding to the query protein and the peptide aptamers are compared to those of the peptide aptamers found to bind each of the reference proteins.
- BLAST may be used in its standard form to compare the amino acid sequences. Any scoring matrix conventional in the art, e.g., BLOSUM, may be used to score aligned amino acid pairs. The resulting sequence similarity score may then be standardized.
- aptamer similarity on the basis of sequence similarity
- a 2-D fingerprint may be devised in which each bit represents the presence or absence of a particular secondary structure, such as an unbonded region, an unbonded region of a particular length (e.g., 2-6 bases, 7-20 bases, >20 bases), an interior loop, an interior loop of a particular length (e.g., 2-6 bases, 7-20 bases, >20 bases), a bulge loop, a bulge loop of a particular length (e.g.
- a hairpin loop of any length and type a hairpin loop closed by G:C, a hairpin loop closed by A.U, a particular type of hairpin loop of a particular length (e.g., 3 bases, 4-5 bases, 6-7 bases, 8-9 bases, 10-30 bases, >30 bases) , a run of paired bases of a particular length, an overall base composition in a particular range (e.g., 40-60% GC) , etc.
- Tinoco, et al . Nature New Biol., 246:40-41 (1973) and Tinoco, et al .
- m aptamers bind the query protein, and n aptamers binding a reference protein.
- the aptamer-based similarity of the target and reference protein may be calculated on the basis of any or all of the m x n possible comparisons. Many approaches are possible, including, but not limited to
- aptamers are selected for new targets of unknown structure.
- the ligands for these targets are footprinted, e.g., using Pt 2 (pop) 4 4" , and the functionality sequences are determined.
- the functionality sequences are entered into the data base, and connections are drawn between the ligands that bind the unknown target and those already in the data base for known targets.
- Initial screening of small molecule libraries is then directed towards compounds that have similar size and functionality to those that bind the known targets that exhibit high functionality sequence homology to the unknown target of interest .
- this information is entered into a set of relationally linked databases.
- the aptamers database has a many-to-many relationship with the protein database.
- each record includes an aptamer ID field (a link to an aptamer database) and a protein ID field (a link to a protein database) .
- the record may optionally identify the contact site. Secondary structure maybe indicated in either the aptamer record (if invariant) or in the aptamer- protein interaction record (if affected by the protein binding) .
- the database may include additional descriptors for the target proteins. Possible descriptors include the following:
- the compound library is a combinatorial library whose members are suitable for use as drugs if, indeed, they have the ability to mediate a biological activity of the target protein.
- Peptides have certain disadvantages as drugs. These include susceptibility to degradation by serum proteases, and difficulty in penetrating cell membranes.
- all or most of the compounds of the compound library avoid, or at least do not suffer to the same degree, one or more of the pharmaceutical disadvantages of peptides.
- disjunction in which a lead drug is simplified to identify its component pharmacophoric moieties
- conjunction in which two or more known pharmacophoric moieties, which may be the same or different, are associated, covalently or noncovalently, to form a new drug
- alteration in which one moiety is replaced by another which may be similar or different, but which is not in effect a disjunction or conjunction.
- disjunction in which a lead drug is simplified to identify its component pharmacophoric moieties
- conjunction in which two or more known pharmacophoric moieties, which may be the same or different, are associated, covalently or noncovalently, to form a new drug
- alteration in which one moiety is replaced by another which may be similar or different, but which is not in effect a disjunction or conjunction.
- the use of the terms "disjunction”, “conjunction” and “alteration” is intended only to connote the structural relationship of the end product to the original leads, and not how the new drugs are actually synth
- Alterations may modify the size, polarity, or electron distribution of an original moiety. Alterations include ring closing or opening, formation of lower or higher homologues, introduction or saturation of double bands, introduction of optically active centers, introduction, removal or replacement of bulky groups, isosteric or bioisosteric substitution, changes in the position or orientation of a group, introduction of alkylating groups, and introduction, removal or replacement of groups with a view toward inhibiting or promoting inductive (electrostatic or conjugative (resonance) effects.
- the substituents may include electron acceptors and/or electron donors.
- Typical electron donors (+1) include -CH 3 , -CH 2 R, -CHR 2 , -CR 3 and -COO " .
- the substituents may also include those which increase or decrease electronic density in conjugated systems.
- the former (+R) groups include -CH 3 , -CR 3 , -F, -Cl, -Br, -I, -OH, -OR,
- the later (-R) groups include -N0 2 , -CN, -CHC, -COR, -COOH, -COOR, -CONH 2 , -S0 2 R and -CF 3 .
- a compound, or a family of compounds, having one or more pharmacological activities may be disjoined into two or more known or potential pharmacophoric moieties.
- Analogues of each of these moieties may be identified, and mixtures of these analogues reacted so as to reassemble compounds which have some similarity to the original lead compound. It is not necessary that all members of the library possess moieties analogous to all of the moieties of the lead compound.
- Benzodiazepines The design of a library may be illustrated by the example of the benzodiazepines .
- Benzodiazepine drugs including chlordiazepoxide, diazepam and oxazepam, have been used on anti- anxiety drugs.
- Derivatives of benzodiazepines have widespread biological activities; derivatives have been reported to act not only as anxiolytics, but also as anticonvulsants, cholecystokinin (CCK) receptor subtype A or B, kappa opioid receptor, platelet activating factor, and HIV transactivator Tat antagonists, and GPIIblla, reverse transcriptase and ras farnesyltransferase inhibitors.
- CCK cholecystokinin
- the benzodiazepine structure has been disjoined into a 2- aminobenzophenone , an amino acid, and an alkylating agent. See Bunin, et al . , Proc. Nat. Acad. Sci. USA, 91:4708 (1994). Since only a few 2-aminobenzophenone derivatives are commercially available, it was later disjoined into 2-aminoarylstannane, an acid chloride, an amino acid, and an alkylating agent. Bunin, et al., Meth. Enzymol . , 267:448 (1996).
- the arylstannane may be considered the core structure upon which the other moieties are substituted, or all four may be considered equals which are conjoined to make each library member.
- a basic library synthesis plan and member structure is shown in Figure 1 of Fowlkes, et al . , U,S. Serial No. 08/740,671, incorporated by reference in its entirety.
- the acid chloride building block introduces variability at the R 1 site.
- the R 2 site is introduced by the amino acid, and the R 3 site by the alkylating agent.
- the R 4 site is inherent in the arylstannane.
- Bunin, et al . generated a 1, 4 -benzodiazepine library of 11,200 different derivatives prepared from 20 acid chlorides, 35 amino acids, and 16 alkylating agents.
- variable elements included both aliphatic and aromatic groups.
- aliphatic groups both acyclic and cyclic
- the aromatic groups featured either single and multiple rings, fused or not, substituted or not, and with heteroatoms or not.
- the secondary substitutents included -NH 2 , -OH, -OMe, -CN, -Cl, -F, and -COOH. While not used, spacer moieties, such as -O- , -S-, -00-, -CS-, - NH-, and -NR- , could have been incorporated.
- Bunin et al suggest that instead of using a 1, 4- benzodiazepine as a core structure, one may instead use a 1, 4- benzodiazepine-2 , 5-dione structure.
- DeWitt, et al . , Proc. Nat. Acad. Sci. (USA), 90:6909-13 (1993) describes the simultaneous but separate, synthesis of 40 discrete hydantoins and 40 discrete benzodiazepines. They carry out their synthesis on a solid support (inside a gas dispersion tube) , in an array format, as opposed to other conventional simultaneous synthesis techniques (e.g., in a well, or on a pin) .
- the hydantoins were synthesized by first simultaneously deprotecting and then treating each of five amino acid resins with each of eight isocyanates.
- the benzodiazepines were synthesized by treating each of five deprotected amino acid resins with each of eight 2 -amino benzophenone imines .
- a polymer bead-bound aldehyde preparation was "split" into three aliquots, each reacted with one of three different ylide reagents.
- the reaction products were combined, and then divided into three new aliquots, each of which was reacted with a different Michael donor.
- Compound identity was found to be determinable on a single bead basis by gas chromatography/mass spectroscopy analysis.
- hypnotics higher alcohols clomethiazole
- aldehydes chloral hydrate
- carbamates meprobamate
- acetylcarbromal barbiturates
- barbital benzodiazepine
- narcotic analgesics morphines phenylpiperidines (meperidine) diphenylpropylamines (methadone) phenothiazihes (methotrimeprazine)
- analgesics analgesics, antipyretics, antirheumatics salicylates (acetylsalicylic acid) p-aminophenol (acetaminophen)
- anxiolytics propandiol carbamates meprobamate
- benzodiazepines chlordiazepoxide, diazepam, oxazepam
- CNS stimulants xanthines (caffeine, theophylline) phenylalkylamines (amphetamine) (Fenetylline is a conjunction of theophylline and amphetamine) oxazolidinones (pemoline) cholinergics choline esters (acetylcholine) N,N-dimethylcarbamates
- adrenergics aromatic amines epinephrine, isoproterenol, phenylephrine
- alicyclic amines cyclopentamine
- aliphatic amines methylhexaneamine
- imidazolines naphazoline
- antihistamines ethanolamines (diphenhydramine) ethylenediamines (tripelennomine ) alkylamines (chlorpheniramine) piperazines (cyclizine) phenothiazines (promethazine)
- vasodilators polyol nitrates (nitroglycerin)
- antibiotics penicillins cephalosporins octahydronapthacenes (tetracycline) sulfonamides nitrofurans cyclic amines naphthyridines xylenols
- antitumor alkylating agents nitrogen mustards aziridines methanesulfonate esters epoxides amino acid antagonists folic acid antagonists pyrimidine antagonists purine antagonists antiviral adamantanes nucleosides thiosemicarbazones inosines amidines and guanidines isoquinolines benzimidazoles piperazines
- Goth Medical Pharmacology: Principles and Concepts (C.V. Mosby Co. : 8th ed. 1976) ; Korolkovas and Burckhalter, Essentials of Medicinal Chemistry (John Wiley & Sons, Inc. : 1976) .
- the library is preferably synthesized so that the individual members remain identifiable so that, if a member is shown to be active, it is not necessary to analyze it.
- Several methods of identification have been proposed, including:
- each member is synthesized only at a particular coordinate on or in a matrix, or in a particular chamber. This might be, for example, the location of a particular pin, or a particular well on a microtiter plate, or inside a "tea bag".
- the present invention is not limited to any particular form of identification .
- Solid phase synthesis permits greater control over which derivatives are formed. However, the solid phase could interfere with activity. To overcome this problem, some or all of the molecules of each member could be liberated, after synthesis but before screening.
- the user receives a list of reference proteins. For each reference protein a similarity score, and a list of known antagonists (or other modulators) is given. These are the "drug leads" .
- the antagonists are weighted by the similarity scores of their respective proteins. If available, and desired, they may also be weighted by their potency against their corresponding reference protein, and/or by other physicochemical characteristics of interest, e.g., lipophilicity.
- This invention contemplates the construction of a composite combinatorial compound library which is biased in favor of compounds (both scaffoldings and substituents) which are structurally similar to the drug leads.
- the composite library is based on these already available simple libraries .
- Each drug lead is compared, using structural descriptors, to the basic scaffold (or to the most similar member) of each candidate simple combinatorial library.
- the structural descriptors which may be used include, but are limited to, those listed in Patterson, et al . (1996), Klebe and Abraham (1993), Cummins, et al . (1996), and Matter (1997). Conventional mathematical methods may be used to select or weight the descriptors .
- the 2D fingerprint method described in Matter, et al . (1997) is of particular interest. In essence, the compound is analyzed for the presence or absence of particular molecular fragments, the results being encoded in a binary format.
- each fragment was projected using a pseudorandomization algorithmin to a bitstring of limited size (i.e., fewer bits than the total number of unique fragments in the compounds of the database) .
- a pseudorandomization algorithmin to a bitstring of limited size (i.e., fewer bits than the total number of unique fragments in the compounds of the database) .
- the presence of 60 specific functional groups, rings or atoms was encoded in 60 of the total 988 bits. Details are given in UNITY Chemical Information Software, version 2.5. Reference Guide, pp. 45-58, Tripos Inc., 1699 Hanley Rd. , St. Louis, MO 63144.
- the candidate library is assigned a weight which is a function of (1) the drug lead's query score (reflecting the similarity of its reference protein to the query protein and, optionally, the potency or other drug characteristics of the drug lead) and (2) the structural similarity score between the drug lead and the scaffold. These weights then determine the predominance of that candidate library in the ideal composite library. Thus, if benzodiazepines score twice as high as carbamates, the benzodiazepine library screened might be twice as big as the carbamate library.
- Each drug lead may also be used to evaluate possible substituents to be conjugated to the scaffold.
- Each candidate substituent is scored, on the basis of structural descriptors, for its similarity to the drug lead.
- the proportion of the candidate substituent in the combinatorial reaction mix may then be governed by its similarity score.
- Low-scoring candidate substituents may be omitted entirely, or merely reduced in concentration.
- the mix may be limited to high scoring substituents, or their concentrations may merely be increased. The concentration changes need not be strictly proportional to the scores.
- a reference protein is very similar to the query protein, it should have a strong influence on the library composition, and, if its similarity is only modest, its influence should be weak.
- a drug lead which strongly resembles a candidate library component should strongly favor that library, and one which only weakly resembles it should imply a more modest enrichment.
- W L original absolute weight of library type L in composite library [0..1]
- the new relative weights W' L are converted to new absolute weights by dividing each by the sum of W' L for all L.
- the leads could be considered in descending order of structural similarity with a proposed library component, i.e., most similar lead first.
- the user may decide to simply screen the most similar of the candidate simple combinatorial libraries. This is equivalent to giving it a weight of 1 and the others a weight of 0.
- Drug leads are not equal in value.
- the drugs will vary in potency, side effects, deliverability, residence time, ease of synthesis, cost of production etc. Those factors which the chemist wishes to consider may be subsumed into a quality factor (q d ) ranging from 0 to 1, with unity being the most desirable sort of lead.
- a quality factor may be assigned by any rational method. If only potency is considered, then the most potent drug in the database could be assigned a value of 1, and the other drugs assigned relative values based on the logarithms of their potencies. Thus, if the best drug has an IC50 of 10 ⁇ 12 , a drug with an IC50 of 10 ⁇ might have a q d of 1/6. Of course, any function which converts potencies to a zero to one scale in which higher potencies yield higher values might be used.
- Examples of candidate simple libraries which might be evaluated include derivatives of the following: Cyclic Compounds Containing One Hetero Atom Heteronitrogen pyrroles pentasubstituted pyrroles pyrrolidines pyrrolines prolines indoles beta-carbolines pyridines dihydropyridines 1 , 4 -dihydropyridines pyrido [2 , 3 -d] pyrimidines tetrahydro-3H-imidazo [4, 5-c] pyridines Isoquinolines tetrahydroisoquinolines quinolones beta-lactams azabicyclo [4.3.0] nonen-8-one amino acid Heterooxygen furans tetrahydrofurans
- 1,2,3-triazoles purines Heteronitrogen and Heterooxygen dikelomorpholines isoxazoles isoxazolines Heteronitrogen and Heterosulfur thiazolidines
- the library one may consider not only the drug leads retrieved for the high similarity reference proteins, but also structures which are analogues of the BioKey peptides or nucleic acid aptamers known to bind the query protein and/or the higher-ranked reference proteins.
- the present invention is useful, not only in designing a library, but also in deciding which of several query proteins to target. It may be desirable to inhibit a biochemical pathway. Several different proteins may be known to mediate that pathway. To decide which one to develop drugs for, each protein might be used to query the database . The one with the most potent and specific drug leads, or with the drug lead having the greatest resemblance to a readily available combinatorial library, becomes the target of choice.
- the protein is immobilized, e.g, on filter paper, and subjected to binding by a library of random oligonucleotides.
- the oligonucleotides may be DNA, RNA, or a DNA or RNA analogue, e.g., in which the sugar functionality of the nucleotide is modified at the 2' position with substituents such as fluoro-, amino-, or methoxy- .
- the members of the library that bind to the target protein are separated and amplified using the polymerase chain reaction or other amplification scheme. The amplified pool is bound to the immobilized protein again, and the strong binding fraction is selected.
- the affinity of the pool may be determined after each cycle.
- the oligonucleotides are cloned into suitable host cells, e.g., bacteria, and the sequences are determined by automated methods.
- the nucleic acid aptamers for a specific target are resynthesized and 3 ' - or 5 '-labeled with 32 P.
- Each radiolabeled oligomer is then mixed with an excess of the target protein and 100 - 500 ⁇ M Pt 2 (pop) 4 4" and photolyzed at wavelengths between 350 and 500 nm until the reaction with Pt 2 (pop) 4 4" is complete.
- the radiolabeled oligomer is then precipitated.
- a parallel control reaction is run without the protein.
- the aptamers photolyzed with and without the protein are then loaded onto a polyacrylamide sequencing gel.
- the nucleotides where there is significant reaction without the protein but not with the protein are then classified as contact sites. This process is repeated for all of the aptamers for a given target and shows a high degree of similarity in the contact sites between aptamers .
- a DNA library is screened for binding to lysozyme
- This sequence is then synthesized and radiolabeled on the 5' end with 32 P.
- the labeled oligomer is reacted with Pt 2 (pop) 4 4" by photolysis at 400 nm in phosphate buffered saline and analyzed on a sequencing gel, which shows that Pt 2 (pop) 4 4 " induces scission of the oligonucleotide at every base in the sequence, giving a ladder of bands for the oligomer with approximately the same extent of scission at each nucleotide in the sequence.
- the reaction is then repeated in the presence of enough protein to cause the majority of the oligonucleotide to be bound to the protein.
- the Pt 2 (pop) 4 4 ⁇ exhibits some reaction with the protein, so the concentration of the Pt 2 (pop) 4 4" must be higher than in the reaction of the oligonucleotide alone, and the reaction must be performed in a short time so that alteration of the structure of the nucleoprotein complex due to damage of the protein by Pt 2 (pop) 4 4" is not a factor. Nevertheless, only the oligonucleotide is labeled, so only scission of the protein is detected.
- the scission pattern visualized on a sequencing gel then shows the same relative reactivity at the nucleotides that do not contact the protein as in the reaction of the oligomer alone, and greatly attenuated reactivity at sites protected by the protein.
- Studies on crystallographically characterized DNA-protein complexes show that the experiment faithfully indicates the sites of contact of the protein on DNA (see Breiner already cited) .
- the "footprint" of the protein on the DNA is determined by quantitating the extent of cleavage at each nucleotide in both reactions to form two histograms, normalizing the two histograms so that nucleotides that are clearly outside the binding site give the same intensity, and then assigning contact sites as those where addition of the protein attenuates the relative intensity.
- cleavage without the protein will give approximately the same intensity at all nucleotides.
- the cleavage intensity in the reaction with the protein is then normalized for the nucleotides on either end, and the sites where less cleavage occurs is counted as a protein site. For example, if relatively less cleavage is observed at the underlined sites: 5'- TAGCTGGCCAAAGTGCGAACACGGCCTTG. then the cleavage for example on the end nucleotides and in the CGAA loop would be relatively the same with and without the protein, while cleavage in the stem and AAA bulge would be protected by the protein. This result would imply that the protein recognizes the AAA bulge and flanking stem, and this structure would be used to search for proteins that bind similar functionality sequences.
- RNA aptamers shown in Scheme 1A were isolated by SELEX for the reverse transcriptase from Feline Immunodeficiency Virus (Chen, H. ; McBroom, D.G.; Zhu, Y.-Q; Gold, L.; North, T.W. Biochemistry 1996 , 35, 6923-6930) and from the ribosomal L22 protein associated with Epstein-Barr Virus (Dobbelstein, M.; Shenk, T. J. Virology 1995, 69, 8027-8034).
- the homology score is determined by comparing the two sequences according to the (illustrative) scoring system shown in C. By this scheme, a single-stranded residue is given a maximum score of 2 , a mismatch is given a maximum score of 1.8, and a base pair is given a maximum score of 1. The longer of the two sequences is then chosen as the "parent sequence", and the maximum score is calculated, which in this case is 26. This sequence is then aligned with the homolog to be compared, allowing for gaps if necessary; it is important to choose the longer sequence as the parent sequence because this imposes an implicit penalty for gaps. The score is then computed at each nucleotide. For a perfect match, the score is the same as the maximum score.
- the score is 0.5 for transposition of a base pair (i.e., substitution of Ta for At), 0.5 for a base pair/mismatch pair where the same base contacts the protein, and 0.9 for a two mismatches where the contacted base is the same.
- the summed comparison score is then divided by the maximum total score.
- the comparison then of the two aptamers in Scheme 2 gives a score of 11/26 or 0.423. (The score would have been 4/26 for an identity matrix) .
- DNA aptamers would be scored in an identical manner except that T would replace U.
- Related strategies might involve changing in any of the maximum score numbers for single strands, mismatches, and base pairs or any of the partial match scores. It also need not be the case that all four nucleotides (A, T/U, G, C) have the same maximum score or partial match scores. Also, the individual scores could be summed instead as the squares of all the scored and then divided by the sum of the squares of the maximum score.
- Ru(tpy) (bpy)0 2+ has a rate constant with methionine of 15 M "1 s '1 and the rate constant with all of the other amino acids is negligible
- Pt 2 (pop) 4 4 ⁇ has a Stern- Volmer rate constant for tryptophan that is 1.0 x 10 7 M "1 s "1 and that all of the other amino acids give negligible rate constants .
- glutamine synthetase and lysozyme are reacted with the two compounds in the folded, unfolded, and BioKey-bound states.
- the BioKey peptides are engineered to avoid the inclusion of methionine and tryptophan so that cross-reaction with the BioKey is not an issue.
- the rate constants are normalized to the number of the most reactive residue for each reagent, i.e., the Ru(tpy) (bpy)0 2+ rate constant is in moles of methionine and the Pt 2 (pop) 4 4" rate constant is in moles of tryptophan. Now the rate constants are measured in the three states to give (hypothetically) :
- G ven as " s _ w ere t e mo ar concentration is per tota methionine in the two proteins.
- B Given as M “1 s" 1 where the molar concentration is per total tryptophan.
- the rate constants are close to those for the amino acids themselves but slightly attenuated due to some steric constraints imposed by inclusion in the linear polymer.
- the reactivity descriptors are then calculated by comparing the difference in the folded and unfolded or the BioKey and folded rate constants normalized by the unfolded rate constant.
- the protein concentration would be determined as the total protein concentration times ( (0.40 times the fraction of the total amino acids which are of isoleucine residues) plus (0.40 times the fraction which are leucine residues) plus (0.20 times the fraction which are alanine residues)).
- aptamers and reactivity descriptors are determined for a new protein (protein X) .
- Protein X is compared to the other proteins in the database and is determined to be 60% similar to lysozyme, 20% similar to ribosomal L22 protein, 8% similar to glutamine synthetase and ⁇ 3% similar to all of the other proteins in the database.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Urology & Nephrology (AREA)
- Hematology (AREA)
- Biomedical Technology (AREA)
- Chemical & Material Sciences (AREA)
- Immunology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Pathology (AREA)
- Biotechnology (AREA)
- Food Science & Technology (AREA)
- General Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Analytical Chemistry (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Pharmacology & Pharmacy (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Investigating Or Analysing Biological Materials (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP98938201A EP1002235A1 (fr) | 1997-08-01 | 1998-07-30 | Procede d'identification et de developpement de chefs de file de medicaments |
CA002298629A CA2298629A1 (fr) | 1997-08-01 | 1998-07-30 | Procede d'identification et de developpement de chefs de file de medicaments |
AU86781/98A AU8678198A (en) | 1997-08-01 | 1998-07-30 | Method of identifying and developing drug leads |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US90484297A | 1997-08-01 | 1997-08-01 | |
US08/904,842 | 1997-08-01 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1999006839A1 true WO1999006839A1 (fr) | 1999-02-11 |
Family
ID=25419872
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1998/015943 WO1999006839A1 (fr) | 1997-08-01 | 1998-07-30 | Procede d'identification et de developpement de chefs de file de medicaments |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1002235A1 (fr) |
AU (1) | AU8678198A (fr) |
CA (1) | CA2298629A1 (fr) |
WO (1) | WO1999006839A1 (fr) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001036980A3 (fr) * | 1999-11-18 | 2002-03-14 | Melacure Therapeutics Ab | Procede d'identification du site actif dans une cible biologique |
EP1209610A1 (fr) * | 2000-11-28 | 2002-05-29 | Valentin Capital Management | Dispositif et procédé de détermination de données d'affinité entre une cible et un ligand |
US6475726B1 (en) | 1998-01-09 | 2002-11-05 | Cubist Pharmaceuticals, Inc. | Method for identifying validated target and assay combinations for drug development |
US6586190B2 (en) | 2000-08-18 | 2003-07-01 | Syngenta Participations Ag | Parallel high throughput method and kit |
US6589738B1 (en) | 1999-11-09 | 2003-07-08 | Elitra Pharmaceuticals, Inc. | Genes essential for microbial proliferation and antisense thereto |
US6720139B1 (en) | 1999-01-27 | 2004-04-13 | Elitra Pharmaceuticals, Inc. | Genes identified as required for proliferation in Escherichia coli |
US6846625B1 (en) | 1998-01-09 | 2005-01-25 | Cubist Pharmaceuticals, Inc. | Method for identifying validated target and assay combination for drug development |
WO2002044990A3 (fr) * | 2000-11-28 | 2005-04-21 | Valentin Capital Man | Appareil et procede permettant de determiner des donnees d'affinite entre une cible et un ligand |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1993001484A1 (fr) * | 1991-07-11 | 1993-01-21 | The Regents Of The University Of California | Methode permettant d'identifier des sequences de proteines qui se plient pour former une structure en trois dimensions connue |
US5338659A (en) * | 1991-04-02 | 1994-08-16 | Terrapin Technologies, Inc. | Method for determining analyte concentration by cross-reactivity profiling |
WO1995032425A1 (fr) * | 1994-05-23 | 1995-11-30 | Smithkline Beecham Corporation | Bibliotheques combinatoires codees |
WO1997042500A1 (fr) * | 1996-05-09 | 1997-11-13 | 3-Dimensional Pharmaceuticals, Inc. | Procede d'analyse et appareil a changement thermique et a microplaque pour l'optimisation de la mise au point de ligands et de la chimie des proteines a variables multiples |
-
1998
- 1998-07-30 EP EP98938201A patent/EP1002235A1/fr not_active Withdrawn
- 1998-07-30 AU AU86781/98A patent/AU8678198A/en not_active Abandoned
- 1998-07-30 CA CA002298629A patent/CA2298629A1/fr not_active Abandoned
- 1998-07-30 WO PCT/US1998/015943 patent/WO1999006839A1/fr not_active Application Discontinuation
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5338659A (en) * | 1991-04-02 | 1994-08-16 | Terrapin Technologies, Inc. | Method for determining analyte concentration by cross-reactivity profiling |
WO1993001484A1 (fr) * | 1991-07-11 | 1993-01-21 | The Regents Of The University Of California | Methode permettant d'identifier des sequences de proteines qui se plient pour former une structure en trois dimensions connue |
WO1995032425A1 (fr) * | 1994-05-23 | 1995-11-30 | Smithkline Beecham Corporation | Bibliotheques combinatoires codees |
WO1997042500A1 (fr) * | 1996-05-09 | 1997-11-13 | 3-Dimensional Pharmaceuticals, Inc. | Procede d'analyse et appareil a changement thermique et a microplaque pour l'optimisation de la mise au point de ligands et de la chimie des proteines a variables multiples |
Non-Patent Citations (1)
Title |
---|
ELOFSSON M ET AL: "Tightening the nuts and bolts", TRENDS IN BIOTECHNOLOGY, vol. 16, no. 4, April 1998 (1998-04-01), pages 147-149, XP004112297 * |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6475726B1 (en) | 1998-01-09 | 2002-11-05 | Cubist Pharmaceuticals, Inc. | Method for identifying validated target and assay combinations for drug development |
US6846625B1 (en) | 1998-01-09 | 2005-01-25 | Cubist Pharmaceuticals, Inc. | Method for identifying validated target and assay combination for drug development |
US6720139B1 (en) | 1999-01-27 | 2004-04-13 | Elitra Pharmaceuticals, Inc. | Genes identified as required for proliferation in Escherichia coli |
US6589738B1 (en) | 1999-11-09 | 2003-07-08 | Elitra Pharmaceuticals, Inc. | Genes essential for microbial proliferation and antisense thereto |
WO2001036980A3 (fr) * | 1999-11-18 | 2002-03-14 | Melacure Therapeutics Ab | Procede d'identification du site actif dans une cible biologique |
US6586190B2 (en) | 2000-08-18 | 2003-07-01 | Syngenta Participations Ag | Parallel high throughput method and kit |
EP1209610A1 (fr) * | 2000-11-28 | 2002-05-29 | Valentin Capital Management | Dispositif et procédé de détermination de données d'affinité entre une cible et un ligand |
WO2002044990A3 (fr) * | 2000-11-28 | 2005-04-21 | Valentin Capital Man | Appareil et procede permettant de determiner des donnees d'affinite entre une cible et un ligand |
Also Published As
Publication number | Publication date |
---|---|
AU8678198A (en) | 1999-02-22 |
EP1002235A1 (fr) | 2000-05-24 |
CA2298629A1 (fr) | 1999-02-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Leutert et al. | Decoding post-translational modification crosstalk with proteomics | |
Cheng et al. | Recognition of nucleic acid bases and base-pairs by hydrogen bonding to amino acid side-chains | |
Moreira et al. | Hot spots—A review of the protein–protein interface determinant amino‐acid residues | |
Schenone et al. | Target identification and mechanism of action in chemical biology and drug discovery | |
Hargrove | Small molecule–RNA targeting: starting with the fundamentals | |
Duffner et al. | A pipeline for ligand discovery using small-molecule microarrays | |
Yap et al. | Calmodulin target database | |
Wiedemann et al. | Quantification of PDZ domain specificity, prediction of ligand affinity and rational design of super-binding peptides | |
Arkin et al. | Small-molecule inhibitors of protein–protein interactions: progressing towards the dream | |
Lepre | Library design for NMR-based screening | |
Nguyen et al. | Quantitative mapping of protein-peptide affinity landscapes using spectrally encoded beads | |
US20040132100A1 (en) | Pharmacophore recombination for the identification of small molecule drug lead compounds | |
JP2007137887A (ja) | 独立下部構造分析を実行するためのコンピュータ・システムの操作方法 | |
JP2004523726A (ja) | コンフォメーションに敏感な結合ペプチドの同定方法およびその利用 | |
Hubbard | Can drugs be designed? | |
JP2009075116A (ja) | タンパク質−タンパク質相互作用の解析および標識の方法 | |
Kapoor et al. | Exploring the specificity pockets of two homologous SH3 domains using structure-based, split-pool synthesis and affinity-based selection | |
WO1999006839A1 (fr) | Procede d'identification et de developpement de chefs de file de medicaments | |
Ozawa et al. | Application of the fragment molecular orbital method analysis to fragment-based drug discovery of BET (bromodomain and extra-terminal proteins) inhibitors | |
Budayeva et al. | Increasing the throughput and reproducibility of activity-based proteome profiling studies with hyperplexing and intelligent data acquisition | |
Zwillinger et al. | Isotope ratio encoding of sequence-defined oligomers | |
Kauvar et al. | Protein affinity map of chemical space | |
EP1360560A2 (fr) | Representation de la selectivite de recepteurs | |
Jenmalm Jensen et al. | Affinity‐Based Chemoproteomics for Target Identification | |
Froloff et al. | Multiple human taste receptor sites: a molecular modeling approach |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
ENP | Entry into the national phase |
Ref document number: 2298629 Country of ref document: CA Ref country code: CA Ref document number: 2298629 Kind code of ref document: A Format of ref document f/p: F |
|
NENP | Non-entry into the national phase |
Ref country code: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1998938201 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 09485007 Country of ref document: US |
|
WWP | Wipo information: published in national office |
Ref document number: 1998938201 Country of ref document: EP |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 1998938201 Country of ref document: EP |