AU735512B2 - Modulators of BRAC1 activity - Google Patents
Modulators of BRAC1 activity Download PDFInfo
- Publication number
- AU735512B2 AU735512B2 AU38293/97A AU3829397A AU735512B2 AU 735512 B2 AU735512 B2 AU 735512B2 AU 38293/97 A AU38293/97 A AU 38293/97A AU 3829397 A AU3829397 A AU 3829397A AU 735512 B2 AU735512 B2 AU 735512B2
- Authority
- AU
- Australia
- Prior art keywords
- leu
- brca1
- gin
- glu
- lys
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 230000000694 effects Effects 0.000 title description 28
- 108700020463 BRCA1 Proteins 0.000 claims description 339
- 102000036365 BRCA1 Human genes 0.000 claims description 334
- 101150072950 BRCA1 gene Proteins 0.000 claims description 334
- 108090000623 proteins and genes Proteins 0.000 claims description 184
- 102000004169 proteins and genes Human genes 0.000 claims description 110
- 238000000034 method Methods 0.000 claims description 71
- 101710154541 Modulator protein Proteins 0.000 claims description 65
- 239000002299 complementary DNA Substances 0.000 claims description 52
- 239000013598 vector Substances 0.000 claims description 33
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 30
- 150000007523 nucleic acids Chemical group 0.000 claims description 23
- 239000003814 drug Substances 0.000 claims description 4
- 239000001963 growth medium Substances 0.000 claims description 2
- 239000002609 medium Substances 0.000 claims description 2
- 230000008569 process Effects 0.000 claims description 2
- 238000012258 culturing Methods 0.000 claims 1
- 235000018102 proteins Nutrition 0.000 description 105
- 210000004027 cell Anatomy 0.000 description 93
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 64
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 59
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 54
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 52
- 235000001014 amino acid Nutrition 0.000 description 49
- 229940024606 amino acid Drugs 0.000 description 48
- 150000001875 compounds Chemical class 0.000 description 47
- 150000001413 amino acids Chemical class 0.000 description 46
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 43
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 35
- 239000013612 plasmid Substances 0.000 description 35
- 239000012634 fragment Substances 0.000 description 33
- 108090000765 processed proteins & peptides Proteins 0.000 description 32
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 27
- 230000027455 binding Effects 0.000 description 27
- 230000014509 gene expression Effects 0.000 description 26
- 239000000047 product Substances 0.000 description 26
- 241000894007 species Species 0.000 description 26
- 238000012360 testing method Methods 0.000 description 25
- 108091034117 Oligonucleotide Proteins 0.000 description 24
- 238000003556 assay Methods 0.000 description 24
- 108020004414 DNA Proteins 0.000 description 22
- 230000003993 interaction Effects 0.000 description 22
- 102000004196 processed proteins & peptides Human genes 0.000 description 22
- 108020001507 fusion proteins Proteins 0.000 description 21
- 102000037865 fusion proteins Human genes 0.000 description 21
- 102100039556 Galectin-4 Human genes 0.000 description 20
- 125000003275 alpha amino acid group Chemical group 0.000 description 20
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 20
- 125000003729 nucleotide group Chemical group 0.000 description 20
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 19
- 241001465754 Metazoa Species 0.000 description 19
- 239000002773 nucleotide Substances 0.000 description 18
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 17
- 101000608765 Homo sapiens Galectin-4 Proteins 0.000 description 17
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 16
- 206010028980 Neoplasm Diseases 0.000 description 16
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 16
- 239000013604 expression vector Substances 0.000 description 16
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 15
- 229920001184 polypeptide Polymers 0.000 description 15
- 238000013518 transcription Methods 0.000 description 14
- 230000035897 transcription Effects 0.000 description 14
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 13
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 13
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 13
- 230000010261 cell growth Effects 0.000 description 13
- 238000006243 chemical reaction Methods 0.000 description 13
- 238000012216 screening Methods 0.000 description 13
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 12
- 102000001742 Tumor Suppressor Proteins Human genes 0.000 description 12
- 108010040002 Tumor Suppressor Proteins Proteins 0.000 description 12
- 230000004913 activation Effects 0.000 description 12
- 239000000872 buffer Substances 0.000 description 12
- 201000011510 cancer Diseases 0.000 description 12
- 102000039446 nucleic acids Human genes 0.000 description 12
- 108020004707 nucleic acids Proteins 0.000 description 12
- 238000000746 purification Methods 0.000 description 12
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 description 11
- 108700019146 Transgenes Proteins 0.000 description 11
- 108010034529 leucyl-lysine Proteins 0.000 description 11
- 241000701447 unidentified baculovirus Species 0.000 description 11
- 108700028369 Alleles Proteins 0.000 description 10
- 108700008625 Reporter Genes Proteins 0.000 description 10
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 10
- 230000002452 interceptive effect Effects 0.000 description 10
- 239000000203 mixture Substances 0.000 description 10
- 206010006187 Breast cancer Diseases 0.000 description 9
- 208000026310 Breast neoplasm Diseases 0.000 description 9
- 230000004568 DNA-binding Effects 0.000 description 9
- 241000588724 Escherichia coli Species 0.000 description 9
- 201000010099 disease Diseases 0.000 description 9
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 9
- 238000004519 manufacturing process Methods 0.000 description 9
- 230000035772 mutation Effects 0.000 description 9
- 239000011541 reaction mixture Substances 0.000 description 9
- 230000001105 regulatory effect Effects 0.000 description 9
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 9
- 108091026890 Coding region Proteins 0.000 description 8
- 108020004635 Complementary DNA Proteins 0.000 description 8
- 241000282414 Homo sapiens Species 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 8
- 239000007983 Tris buffer Substances 0.000 description 8
- 241000700605 Viruses Species 0.000 description 8
- 238000007792 addition Methods 0.000 description 8
- 239000011324 bead Substances 0.000 description 8
- 238000000338 in vitro Methods 0.000 description 8
- 238000001727 in vivo Methods 0.000 description 8
- 229930182817 methionine Natural products 0.000 description 8
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 8
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 7
- 241000238631 Hexapoda Species 0.000 description 7
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 7
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 7
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 7
- 108010033276 Peptide Fragments Proteins 0.000 description 7
- 102000007079 Peptide Fragments Human genes 0.000 description 7
- 108091006629 SLC13A2 Proteins 0.000 description 7
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 210000000481 breast Anatomy 0.000 description 7
- 238000004113 cell culture Methods 0.000 description 7
- 239000007787 solid Substances 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 230000009261 transgenic effect Effects 0.000 description 7
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 7
- 239000000225 tumor suppressor protein Substances 0.000 description 7
- 238000010396 two-hybrid screening Methods 0.000 description 7
- WRDABNWSWOHGMS-UHFFFAOYSA-N AEBSF hydrochloride Chemical compound Cl.NCCC1=CC=C(S(F)(=O)=O)C=C1 WRDABNWSWOHGMS-UHFFFAOYSA-N 0.000 description 6
- 108010039627 Aprotinin Proteins 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 101000595182 Homo sapiens Podocan Proteins 0.000 description 6
- GDBQQVLCIARPGH-UHFFFAOYSA-N Leupeptin Natural products CC(C)CC(NC(C)=O)C(=O)NC(CC(C)C)C(=O)NC(C=O)CCCN=C(N)N GDBQQVLCIARPGH-UHFFFAOYSA-N 0.000 description 6
- 102100036036 Podocan Human genes 0.000 description 6
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 6
- 229960004405 aprotinin Drugs 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 230000004927 fusion Effects 0.000 description 6
- ZPNFWUPYTFPOJU-LPYSRVMUSA-N iniprol Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@H]2CSSC[C@H]3C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC=4C=CC=CC=4)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC=4C=CC=CC=4)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC2=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]2N(CCC2)C(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N2[C@@H](CCC2)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N2[C@@H](CCC2)C(=O)N3)C(=O)NCC(=O)NCC(=O)N[C@@H](C)C(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H](C(=O)N1)C(C)C)[C@@H](C)O)[C@@H](C)CC)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 ZPNFWUPYTFPOJU-LPYSRVMUSA-N 0.000 description 6
- 230000003834 intracellular effect Effects 0.000 description 6
- GDBQQVLCIARPGH-ULQDDVLXSA-N leupeptin Chemical compound CC(C)C[C@H](NC(C)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C=O)CCCN=C(N)N GDBQQVLCIARPGH-ULQDDVLXSA-N 0.000 description 6
- 108010052968 leupeptin Proteins 0.000 description 6
- 229950000964 pepstatin Drugs 0.000 description 6
- 108010091212 pepstatin Proteins 0.000 description 6
- FAXGPCHRFPCXOO-LXTPJMTPSA-N pepstatin A Chemical compound OC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)CC(C)C FAXGPCHRFPCXOO-LXTPJMTPSA-N 0.000 description 6
- 238000010561 standard procedure Methods 0.000 description 6
- 238000011282 treatment Methods 0.000 description 6
- 238000003160 two-hybrid assay Methods 0.000 description 6
- 241000701161 unidentified adenovirus Species 0.000 description 6
- 239000011701 zinc Substances 0.000 description 6
- 229910052725 zinc Inorganic materials 0.000 description 6
- KLSJWNVTNUYHDU-UHFFFAOYSA-N Amitrole Chemical compound NC1=NC=NN1 KLSJWNVTNUYHDU-UHFFFAOYSA-N 0.000 description 5
- 102100021569 Apoptosis regulator Bcl-2 Human genes 0.000 description 5
- 108700040618 BRCA1 Genes Proteins 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 102000005720 Glutathione transferase Human genes 0.000 description 5
- 108010070675 Glutathione transferase Proteins 0.000 description 5
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 5
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 5
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 5
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- 241000283973 Oryctolagus cuniculus Species 0.000 description 5
- 229920002684 Sepharose Polymers 0.000 description 5
- 108091081024 Start codon Proteins 0.000 description 5
- 230000004075 alteration Effects 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 229940088598 enzyme Drugs 0.000 description 5
- 108010050848 glycylleucine Proteins 0.000 description 5
- 108010025306 histidylleucine Proteins 0.000 description 5
- 108010057821 leucylproline Proteins 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 239000003550 marker Substances 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 230000010076 replication Effects 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 230000003612 virological effect Effects 0.000 description 5
- 229920000936 Agarose Polymers 0.000 description 4
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 4
- JQFZHHSQMKZLRU-IUCAKERBSA-N Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N JQFZHHSQMKZLRU-IUCAKERBSA-N 0.000 description 4
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 4
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 4
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 4
- 108700020978 Proto-Oncogene Proteins 0.000 description 4
- 102000052575 Proto-Oncogene Human genes 0.000 description 4
- 238000012300 Sequence Analysis Methods 0.000 description 4
- 239000012190 activator Substances 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 230000000692 anti-sense effect Effects 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 230000002759 chromosomal effect Effects 0.000 description 4
- DEFVIWRASFVYLL-UHFFFAOYSA-N ethylene glycol bis(2-aminoethyl)tetraacetic acid Chemical compound OC(=O)CN(CC(O)=O)CCOCCOCCN(CC(O)=O)CC(O)=O DEFVIWRASFVYLL-UHFFFAOYSA-N 0.000 description 4
- 230000005764 inhibitory process Effects 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 108010056582 methionylglutamic acid Proteins 0.000 description 4
- 230000000813 microbial effect Effects 0.000 description 4
- 230000002611 ovarian Effects 0.000 description 4
- 102000040430 polynucleotide Human genes 0.000 description 4
- 108091033319 polynucleotide Proteins 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 239000011780 sodium chloride Substances 0.000 description 4
- 238000006467 substitution reaction Methods 0.000 description 4
- 230000001225 therapeutic effect Effects 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 3
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 108010001515 Galectin 4 Proteins 0.000 description 3
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 3
- 101150009006 HIS3 gene Proteins 0.000 description 3
- 101000971171 Homo sapiens Apoptosis regulator Bcl-2 Proteins 0.000 description 3
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 3
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 3
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 3
- AZLASBBHHSLQDB-GUBZILKMSA-N Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(C)C AZLASBBHHSLQDB-GUBZILKMSA-N 0.000 description 3
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 3
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 3
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 3
- 108010047357 Luminescent Proteins Proteins 0.000 description 3
- 102000006830 Luminescent Proteins Human genes 0.000 description 3
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 3
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 3
- IMTUWVJPCQPJEE-IUCAKERBSA-N Met-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN IMTUWVJPCQPJEE-IUCAKERBSA-N 0.000 description 3
- 206010061535 Ovarian neoplasm Diseases 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 3
- BECPPKYKPSRKCP-ZDLURKLDSA-N Thr-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O BECPPKYKPSRKCP-ZDLURKLDSA-N 0.000 description 3
- 108091006088 activator proteins Proteins 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 239000013592 cell lysate Substances 0.000 description 3
- 230000001413 cellular effect Effects 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 230000009918 complex formation Effects 0.000 description 3
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 3
- 238000010828 elution Methods 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 210000004408 hybridoma Anatomy 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 108010027338 isoleucylcysteine Proteins 0.000 description 3
- 101150066555 lacZ gene Proteins 0.000 description 3
- 239000007791 liquid phase Substances 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 239000008194 pharmaceutical composition Substances 0.000 description 3
- 239000012071 phase Substances 0.000 description 3
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 3
- 230000006916 protein interaction Effects 0.000 description 3
- 230000001177 retroviral effect Effects 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- 108010048818 seryl-histidine Proteins 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- 241001515965 unidentified phage Species 0.000 description 3
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 2
- ALBODLTZUXKBGZ-JUUVMNCLSA-N (2s)-2-amino-3-phenylpropanoic acid;(2s)-2,6-diaminohexanoic acid Chemical compound NCCCC[C@H](N)C(O)=O.OC(=O)[C@@H](N)CC1=CC=CC=C1 ALBODLTZUXKBGZ-JUUVMNCLSA-N 0.000 description 2
- HZAXFHJVJLSVMW-UHFFFAOYSA-N 2-Aminoethan-1-ol Chemical compound NCCO HZAXFHJVJLSVMW-UHFFFAOYSA-N 0.000 description 2
- 102000010735 Adenomatous polyposis coli protein Human genes 0.000 description 2
- 108010038310 Adenomatous polyposis coli protein Proteins 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 2
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 2
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 2
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 2
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 2
- DWBZEJHQQIURML-IMJSIDKUSA-N Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O DWBZEJHQQIURML-IMJSIDKUSA-N 0.000 description 2
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 2
- 241001203868 Autographa californica Species 0.000 description 2
- 108091012583 BCL2 Proteins 0.000 description 2
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 2
- YXQDRIRSAHTJKM-IMJSIDKUSA-N Cys-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YXQDRIRSAHTJKM-IMJSIDKUSA-N 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 2
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 2
- TUTIHHSZKFBMHM-WHFBIAKZSA-N Glu-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O TUTIHHSZKFBMHM-WHFBIAKZSA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 2
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 2
- BBBXWRGITSUJPB-YUMQZZPRSA-N Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O BBBXWRGITSUJPB-YUMQZZPRSA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 2
- UQHGAYSULGRWRG-WHFBIAKZSA-N Glu-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UQHGAYSULGRWRG-WHFBIAKZSA-N 0.000 description 2
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 2
- SITLTJHOQZFJGG-XPUUQOCRSA-N Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 2
- 102000005731 Glucose-6-phosphate isomerase Human genes 0.000 description 2
- 108010070600 Glucose-6-phosphate isomerase Proteins 0.000 description 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- MMFKFJORZBJVNF-UWVGGRQHSA-N His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 MMFKFJORZBJVNF-UWVGGRQHSA-N 0.000 description 2
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 2
- JWBXCSQZLLIOCI-GUBZILKMSA-N Ile-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C JWBXCSQZLLIOCI-GUBZILKMSA-N 0.000 description 2
- TUYOFUHICRWDGA-CIUDSAMLSA-N Ile-Met Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCSC TUYOFUHICRWDGA-CIUDSAMLSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- QOOWRKBDDXQRHC-BQBZGAKWSA-N L-lysyl-L-alanine Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN QOOWRKBDDXQRHC-BQBZGAKWSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- NFNVDJGXRFEYTK-YUMQZZPRSA-N Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O NFNVDJGXRFEYTK-YUMQZZPRSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 2
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 2
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 2
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 2
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 2
- RBGLBUDVQVPTEG-DCAQKATOSA-N Met-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N RBGLBUDVQVPTEG-DCAQKATOSA-N 0.000 description 2
- 102000003792 Metallothionein Human genes 0.000 description 2
- 108090000157 Metallothionein Proteins 0.000 description 2
- 241000699666 Mus <mouse, genus> Species 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 108010085793 Neurofibromin 1 Proteins 0.000 description 2
- 102000007530 Neurofibromin 1 Human genes 0.000 description 2
- 108010085839 Neurofibromin 2 Proteins 0.000 description 2
- 102000007517 Neurofibromin 2 Human genes 0.000 description 2
- 101100205189 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-5 gene Proteins 0.000 description 2
- 108700020796 Oncogene Proteins 0.000 description 2
- 102000043276 Oncogene Human genes 0.000 description 2
- 206010033128 Ovarian cancer Diseases 0.000 description 2
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 2
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 2
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 2
- 108091000080 Phosphotransferase Proteins 0.000 description 2
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- 108010003201 RGH 0205 Proteins 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 201000000582 Retinoblastoma Diseases 0.000 description 2
- 101100394989 Rhodopseudomonas palustris (strain ATCC BAA-98 / CGA009) hisI gene Proteins 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 2
- LAFKUZYWNCHOHT-WHFBIAKZSA-N Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O LAFKUZYWNCHOHT-WHFBIAKZSA-N 0.000 description 2
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 2
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 2
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 2
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- UQTNIFUCMBFWEJ-IWGUZYHVSA-N Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O UQTNIFUCMBFWEJ-IWGUZYHVSA-N 0.000 description 2
- BIYXEUAFGLTAEM-WUJLRWPWSA-N Thr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(O)=O BIYXEUAFGLTAEM-WUJLRWPWSA-N 0.000 description 2
- YKRQRPFODDJQTC-CSMHCCOUSA-N Thr-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN YKRQRPFODDJQTC-CSMHCCOUSA-N 0.000 description 2
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 2
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- 241000723873 Tobacco mosaic virus Species 0.000 description 2
- 229920004890 Triton X-100 Polymers 0.000 description 2
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 2
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 238000010171 animal model Methods 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 230000022131 cell cycle Effects 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 230000034994 death Effects 0.000 description 2
- 231100000517 death Toxicity 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000002405 diagnostic procedure Methods 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 108091006047 fluorescent proteins Proteins 0.000 description 2
- 102000034287 fluorescent proteins Human genes 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 238000010363 gene targeting Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 210000004602 germ cell Anatomy 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 235000004554 glutamine Nutrition 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 238000002649 immunization Methods 0.000 description 2
- 230000003053 immunization Effects 0.000 description 2
- 210000003000 inclusion body Anatomy 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000037041 intracellular level Effects 0.000 description 2
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 239000012139 lysis buffer Substances 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 102000020233 phosphotransferase Human genes 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 239000000376 reactant Substances 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 231100000331 toxic Toxicity 0.000 description 2
- 230000002588 toxic effect Effects 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000012384 transportation and delivery Methods 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 1
- RVLOMLVNNBWRSR-KNIFDHDWSA-N (2s)-2-aminopropanoic acid;(2s)-2,6-diaminohexanoic acid Chemical compound C[C@H](N)C(O)=O.NCCCC[C@H](N)C(O)=O RVLOMLVNNBWRSR-KNIFDHDWSA-N 0.000 description 1
- ASWBNKHCZGQVJV-UHFFFAOYSA-N (3-hexadecanoyloxy-2-hydroxypropyl) 2-(trimethylazaniumyl)ethyl phosphate Chemical compound CCCCCCCCCCCCCCCC(=O)OCC(O)COP([O-])(=O)OCC[N+](C)(C)C ASWBNKHCZGQVJV-UHFFFAOYSA-N 0.000 description 1
- HPYLHFWTUAGUNX-BGZSDMPXSA-N (3s)-4-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-oxo-3-[[(2s,6s)-2,6,10-triamino-4-[(diaminomethylideneamino)methyl]-5-oxodecanoyl]amino]butanoic acid Chemical compound NCCCC[C@H](N)C(=O)C(CN=C(N)N)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O HPYLHFWTUAGUNX-BGZSDMPXSA-N 0.000 description 1
- BZSALXKCVOJCJJ-IPEMHBBOSA-N (4s)-4-[[(2s)-2-acetamido-3-methylbutanoyl]amino]-5-[[(2s)-1-[[(2s)-1-[[(2s,3r)-1-[[(2s)-1-[[(2s)-1-[[2-[[(2s)-1-amino-1-oxo-3-phenylpropan-2-yl]amino]-2-oxoethyl]amino]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-1-oxopropan-2-yl]amino]-3-hydroxy Chemical compound CC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCC)C(=O)N[C@@H](CCCC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@H](C(N)=O)CC1=CC=CC=C1 BZSALXKCVOJCJJ-IPEMHBBOSA-N 0.000 description 1
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 1
- 108020004465 16S ribosomal RNA Proteins 0.000 description 1
- UFBJCMHMOXMLKC-UHFFFAOYSA-N 2,4-dinitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1[N+]([O-])=O UFBJCMHMOXMLKC-UHFFFAOYSA-N 0.000 description 1
- 102000013563 Acid Phosphatase Human genes 0.000 description 1
- 108010051457 Acid Phosphatase Proteins 0.000 description 1
- 102100029457 Adenine phosphoribosyltransferase Human genes 0.000 description 1
- 108010024223 Adenine phosphoribosyltransferase Proteins 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- SITWEMZOJNKJCH-WDSKDSINSA-N Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SITWEMZOJNKJCH-WDSKDSINSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- CCUAQNUWXLYFRA-IMJSIDKUSA-N Ala-Asn Chemical compound C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC(N)=O CCUAQNUWXLYFRA-IMJSIDKUSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- JQDFGZKKXBEANU-IMJSIDKUSA-N Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(O)=O JQDFGZKKXBEANU-IMJSIDKUSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- FSHURBQASBLAPO-WDSKDSINSA-N Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)N FSHURBQASBLAPO-WDSKDSINSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 1
- LIWMQSWFLXEGMA-WDSKDSINSA-N Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)N LIWMQSWFLXEGMA-WDSKDSINSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- RFJNDTQGEJRBHO-DCAQKATOSA-N Ala-Val-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)[NH3+] RFJNDTQGEJRBHO-DCAQKATOSA-N 0.000 description 1
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 1
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 1
- WVRUNFYJIHNFKD-WDSKDSINSA-N Arg-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N WVRUNFYJIHNFKD-WDSKDSINSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 1
- QEKBCDODJBBWHV-GUBZILKMSA-N Arg-Arg-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O QEKBCDODJBBWHV-GUBZILKMSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- XUUXCWCKKCZEAW-YFKPBYRVSA-N Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N XUUXCWCKKCZEAW-YFKPBYRVSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- LQJAALCCPOTJGB-YUMQZZPRSA-N Arg-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O LQJAALCCPOTJGB-YUMQZZPRSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- IJYZHIOOBGIINM-WDSKDSINSA-N Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N IJYZHIOOBGIINM-WDSKDSINSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- XNSKSTRGQIPTSE-ACZMJKKPSA-N Arg-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XNSKSTRGQIPTSE-ACZMJKKPSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- IIFDPDVJAHQFSR-WHFBIAKZSA-N Asn-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O IIFDPDVJAHQFSR-WHFBIAKZSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- FFMIYIMKQIMDPK-BQBZGAKWSA-N Asn-His Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 FFMIYIMKQIMDPK-BQBZGAKWSA-N 0.000 description 1
- MQLZLIYPFDIDMZ-HAFWLYHUSA-N Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O MQLZLIYPFDIDMZ-HAFWLYHUSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- MFTVXYMXSAQZNL-DJFWLOJKSA-N Asp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)O)N MFTVXYMXSAQZNL-DJFWLOJKSA-N 0.000 description 1
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- OAMLVOVXNKILLQ-BQBZGAKWSA-N Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O OAMLVOVXNKILLQ-BQBZGAKWSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 1
- NTQDELBZOMWXRS-IWGUZYHVSA-N Asp-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O NTQDELBZOMWXRS-IWGUZYHVSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000304886 Bacilli Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108020004513 Bacterial RNA Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 241000700198 Cavia Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 108050006400 Cyclin Proteins 0.000 description 1
- 102000016736 Cyclin Human genes 0.000 description 1
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 1
- CLDCTNHPILWQCW-CIUDSAMLSA-N Cys-Arg-Glu Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N CLDCTNHPILWQCW-CIUDSAMLSA-N 0.000 description 1
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 1
- NIPJKKSXHSBEMX-CIUDSAMLSA-N Cys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N NIPJKKSXHSBEMX-CIUDSAMLSA-N 0.000 description 1
- QJUDRFBUWAGUSG-SRVKXCTJSA-N Cys-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N QJUDRFBUWAGUSG-SRVKXCTJSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 1
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 1
- WXOFKRKAHJQKLT-BQBZGAKWSA-N Cys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CS WXOFKRKAHJQKLT-BQBZGAKWSA-N 0.000 description 1
- CIVXDCMSSFGWAL-YUMQZZPRSA-N Cys-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N CIVXDCMSSFGWAL-YUMQZZPRSA-N 0.000 description 1
- HSAWNMMTZCLTPY-DCAQKATOSA-N Cys-Met-Leu Chemical compound SC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HSAWNMMTZCLTPY-DCAQKATOSA-N 0.000 description 1
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 101150074155 DHFR gene Proteins 0.000 description 1
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102100029764 DNA-directed DNA/RNA polymerase mu Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 1
- 101000686777 Escherichia phage T7 T7 RNA polymerase Proteins 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 230000010337 G2 phase Effects 0.000 description 1
- 102000018898 GTPase-Activating Proteins Human genes 0.000 description 1
- 108091006094 GTPase-accelerating proteins Proteins 0.000 description 1
- 241001200922 Gagata Species 0.000 description 1
- 208000034826 Genetic Predisposition to Disease Diseases 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 241001678517 Ginaia Species 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- MPZWMIIOPAPAKE-BQBZGAKWSA-N Glu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MPZWMIIOPAPAKE-BQBZGAKWSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- SXGAGTVDWKQYCX-BQBZGAKWSA-N Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SXGAGTVDWKQYCX-BQBZGAKWSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- XMBSYZWANAQXEV-QWRGUYRKSA-N Glu-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-QWRGUYRKSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- JDAYMLXPUJRSDJ-XIRDDKMYSA-N Glu-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 JDAYMLXPUJRSDJ-XIRDDKMYSA-N 0.000 description 1
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- 102000030595 Glucokinase Human genes 0.000 description 1
- 108010021582 Glucokinase Proteins 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- FUESBOMYALLFNI-VKHMYHEASA-N Gly-Asn Chemical compound NCC(=O)N[C@H](C(O)=O)CC(N)=O FUESBOMYALLFNI-VKHMYHEASA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- WSWWTQYHFCBKBT-DVJZZOLTSA-N Gly-Thr-Trp Chemical compound C[C@@H](O)[C@H](NC(=O)CN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O WSWWTQYHFCBKBT-DVJZZOLTSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 102000005548 Hexokinase Human genes 0.000 description 1
- 108700040460 Hexokinases Proteins 0.000 description 1
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 1
- LYSVCKOXIDKEEL-SRVKXCTJSA-N His-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYSVCKOXIDKEEL-SRVKXCTJSA-N 0.000 description 1
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 1
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 1
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 1
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 1
- IDXZDKMBEXLFMB-HGNGGELXSA-N His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 IDXZDKMBEXLFMB-HGNGGELXSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- XMAUFHMAAVTODF-STQMWFEESA-N His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XMAUFHMAAVTODF-STQMWFEESA-N 0.000 description 1
- FHKZHRMERJUXRJ-DCAQKATOSA-N His-Ser-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 FHKZHRMERJUXRJ-DCAQKATOSA-N 0.000 description 1
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 1
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- 101000934870 Homo sapiens Breast cancer type 1 susceptibility protein Proteins 0.000 description 1
- 101000721661 Homo sapiens Cellular tumor antigen p53 Proteins 0.000 description 1
- 101000911753 Homo sapiens Protein FAM107B Proteins 0.000 description 1
- 108090000144 Human Proteins Proteins 0.000 description 1
- 102000003839 Human Proteins Human genes 0.000 description 1
- 241000701109 Human adenovirus 2 Species 0.000 description 1
- 101100321817 Human parvovirus B19 (strain HV) 7.5K gene Proteins 0.000 description 1
- 108010091358 Hypoxanthine Phosphoribosyltransferase Proteins 0.000 description 1
- 102100029098 Hypoxanthine-guanine phosphoribosyltransferase Human genes 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- JSZMKEYEVLDPDO-ACZMJKKPSA-N Ile-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(O)=O JSZMKEYEVLDPDO-ACZMJKKPSA-N 0.000 description 1
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 1
- KTGFOCFYOZQVRJ-ZKWXMUAHSA-N Ile-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O KTGFOCFYOZQVRJ-ZKWXMUAHSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- QNBYCZTZNOVDMI-HGNGGELXSA-N Ile-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QNBYCZTZNOVDMI-HGNGGELXSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- UWBDLNOCIDGPQE-GUBZILKMSA-N Ile-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN UWBDLNOCIDGPQE-GUBZILKMSA-N 0.000 description 1
- IDMNOFVUXYYZPF-DKIMLUQUSA-N Ile-Lys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IDMNOFVUXYYZPF-DKIMLUQUSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- TWVKGYNQQAUNRN-ACZMJKKPSA-N Ile-Ser Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O TWVKGYNQQAUNRN-ACZMJKKPSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- 108700002232 Immediate-Early Genes Proteins 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 102000009786 Immunoglobulin Constant Regions Human genes 0.000 description 1
- 108010009817 Immunoglobulin Constant Regions Proteins 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108020005350 Initiator Codon Proteins 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- 108010079091 KRDS peptide Proteins 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- HFKJBCPRWWGPEY-BQBZGAKWSA-N L-arginyl-L-glutamic acid Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HFKJBCPRWWGPEY-BQBZGAKWSA-N 0.000 description 1
- ZUKPVRWZDMRIEO-VKHMYHEASA-N L-cysteinylglycine Chemical compound SC[C@H]([NH3+])C(=O)NCC([O-])=O ZUKPVRWZDMRIEO-VKHMYHEASA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- HSQGMTRYSIHDAC-BQBZGAKWSA-N Leu-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(O)=O HSQGMTRYSIHDAC-BQBZGAKWSA-N 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 1
- SENJXOPIZNYLHU-IUCAKERBSA-N Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-IUCAKERBSA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- HIZYETOZLYFUFF-BQBZGAKWSA-N Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(O)=O HIZYETOZLYFUFF-BQBZGAKWSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- LCPYQJIKPJDLLB-UWVGGRQHSA-N Leu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C LCPYQJIKPJDLLB-UWVGGRQHSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- DNDWZFHLZVYOGF-KKUMJFAQSA-N Leu-Leu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O DNDWZFHLZVYOGF-KKUMJFAQSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 1
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- IGRMTQMIDNDFAA-UWVGGRQHSA-N Lys-His Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IGRMTQMIDNDFAA-UWVGGRQHSA-N 0.000 description 1
- FMIIKPHLJKUXGE-GUBZILKMSA-N Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN FMIIKPHLJKUXGE-GUBZILKMSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- URBJRJKWSUFCKS-AVGNSLFASA-N Lys-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N URBJRJKWSUFCKS-AVGNSLFASA-N 0.000 description 1
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 1
- AIXUQKMMBQJZCU-IUCAKERBSA-N Lys-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O AIXUQKMMBQJZCU-IUCAKERBSA-N 0.000 description 1
- YSZNURNVYFUEHC-BQBZGAKWSA-N Lys-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YSZNURNVYFUEHC-BQBZGAKWSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- WAAZECNCPVGPIV-RHYQMDGZSA-N Lys-Thr-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O WAAZECNCPVGPIV-RHYQMDGZSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 230000027311 M phase Effects 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- UASDAHIAHBRZQV-YUMQZZPRSA-N Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N UASDAHIAHBRZQV-YUMQZZPRSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 1
- ADHNYKZHPOEULM-BQBZGAKWSA-N Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O ADHNYKZHPOEULM-BQBZGAKWSA-N 0.000 description 1
- OBCRZLRPJFNLAN-DCAQKATOSA-N Met-His-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OBCRZLRPJFNLAN-DCAQKATOSA-N 0.000 description 1
- AEQVPPGEJJBFEE-CYDGBPFRSA-N Met-Ile-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEQVPPGEJJBFEE-CYDGBPFRSA-N 0.000 description 1
- PBOUVYGPDSARIS-IUCAKERBSA-N Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C PBOUVYGPDSARIS-IUCAKERBSA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- AOFZWWDTTJLHOU-ULQDDVLXSA-N Met-Lys-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AOFZWWDTTJLHOU-ULQDDVLXSA-N 0.000 description 1
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 1
- 101100523604 Mus musculus Rassf5 gene Proteins 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 108700010674 N-acetylVal-Nle(7,8)- allatotropin (5-13) Proteins 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 208000003019 Neurofibromatosis 1 Diseases 0.000 description 1
- 208000024834 Neurofibromatosis type 1 Diseases 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 241000282579 Pan Species 0.000 description 1
- 241000282520 Papio Species 0.000 description 1
- 108010087702 Penicillinase Proteins 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 1
- VJEZWOSKRCLHRP-MELADBBJSA-N Phe-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O VJEZWOSKRCLHRP-MELADBBJSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- GLUBLISJVJFHQS-VIFPVBQESA-N Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 GLUBLISJVJFHQS-VIFPVBQESA-N 0.000 description 1
- JWBLQDDHSDGEGR-DRZSPHRISA-N Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWBLQDDHSDGEGR-DRZSPHRISA-N 0.000 description 1
- RFCVXVPWSPOMFJ-STQMWFEESA-N Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RFCVXVPWSPOMFJ-STQMWFEESA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- ZIQQNOXKEFDPBE-BZSNNMDCSA-N Phe-Lys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N ZIQQNOXKEFDPBE-BZSNNMDCSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- 102000001105 Phosphofructokinases Human genes 0.000 description 1
- 108010069341 Phosphofructokinases Proteins 0.000 description 1
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 1
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- 102100026983 Protein FAM107B Human genes 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 108010011939 Pyruvate Decarboxylase Proteins 0.000 description 1
- 102000013009 Pyruvate Kinase Human genes 0.000 description 1
- 108020005115 Pyruvate Kinase Proteins 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 108050002653 Retinoblastoma protein Proteins 0.000 description 1
- 101800000684 Ribonuclease H Proteins 0.000 description 1
- 230000018199 S phase Effects 0.000 description 1
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- RZEQTVHJZCIUBT-WDSKDSINSA-N Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-WDSKDSINSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- SBMNPABNWKXNBJ-BQBZGAKWSA-N Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO SBMNPABNWKXNBJ-BQBZGAKWSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- PBUXMVYWOSKHMF-WDSKDSINSA-N Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO PBUXMVYWOSKHMF-WDSKDSINSA-N 0.000 description 1
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- OVQZAFXWIWNYKA-GUBZILKMSA-N Ser-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)N OVQZAFXWIWNYKA-GUBZILKMSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- LZLREEUGSYITMX-JQWIXIFHSA-N Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(O)=O)=CNC2=C1 LZLREEUGSYITMX-JQWIXIFHSA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- ILVGMCVCQBJPSH-WDSKDSINSA-N Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO ILVGMCVCQBJPSH-WDSKDSINSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000607720 Serratia Species 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- XVNZSJIKGJLQLH-RCWTZXSCSA-N Thr-Arg-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N)O XVNZSJIKGJLQLH-RCWTZXSCSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- IOWJRKAVLALBQB-IWGUZYHVSA-N Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O IOWJRKAVLALBQB-IWGUZYHVSA-N 0.000 description 1
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- VOHWDZNIESHTFW-XKBZYTNZSA-N Thr-Glu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O VOHWDZNIESHTFW-XKBZYTNZSA-N 0.000 description 1
- HEJJDUDEHLPDAW-CUJWVEQBSA-N Thr-His-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N)O HEJJDUDEHLPDAW-CUJWVEQBSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- CKHWEVXPLJBEOZ-VQVTYTSYSA-N Thr-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O CKHWEVXPLJBEOZ-VQVTYTSYSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 101710195626 Transcriptional activator protein Proteins 0.000 description 1
- 102000005924 Triose-Phosphate Isomerase Human genes 0.000 description 1
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 1
- 241000209140 Triticum Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- OSYOKZZRVGUDMO-HSCHXYMDSA-N Trp-Lys-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OSYOKZZRVGUDMO-HSCHXYMDSA-N 0.000 description 1
- ULHASJWZGUEUNN-XIRDDKMYSA-N Trp-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O ULHASJWZGUEUNN-XIRDDKMYSA-N 0.000 description 1
- IMMPMHKLUUZKAZ-WMZOPIPTSA-N Trp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 IMMPMHKLUUZKAZ-WMZOPIPTSA-N 0.000 description 1
- YTVJTXJTNRWJCR-JBACZVJFSA-N Trp-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N YTVJTXJTNRWJCR-JBACZVJFSA-N 0.000 description 1
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 1
- YCQXZDHDSUHUSG-FJHTZYQYSA-N Trp-Thr-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 YCQXZDHDSUHUSG-FJHTZYQYSA-N 0.000 description 1
- UOXPLPBMEPLZBW-WDSOQIARSA-N Trp-Val-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 UOXPLPBMEPLZBW-WDSOQIARSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- AUEJLPRZGVVDNU-STQMWFEESA-N Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-STQMWFEESA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- AOLHUMAVONBBEZ-STQMWFEESA-N Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AOLHUMAVONBBEZ-STQMWFEESA-N 0.000 description 1
- 101710100170 Unknown protein Proteins 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- IBIDRSSEHFLGSD-YUMQZZPRSA-N Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-YUMQZZPRSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- OBTCMSPFOITUIJ-FSPLSTOPSA-N Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O OBTCMSPFOITUIJ-FSPLSTOPSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- PNVLWFYAPWAQMU-CIUDSAMLSA-N Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)C(C)C PNVLWFYAPWAQMU-CIUDSAMLSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- WLHIIWDIDLQTKP-IHRRRGAJSA-N Val-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)C(C)C WLHIIWDIDLQTKP-IHRRRGAJSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- CFIBZQOLUDURST-IHRRRGAJSA-N Val-Tyr-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N CFIBZQOLUDURST-IHRRRGAJSA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 208000008383 Wilms tumor Diseases 0.000 description 1
- PTFCDOFLOPIGGS-UHFFFAOYSA-N Zinc dication Chemical compound [Zn+2] PTFCDOFLOPIGGS-UHFFFAOYSA-N 0.000 description 1
- 239000008351 acetate buffer Substances 0.000 description 1
- FKNHDDTXBWMZIR-GEMLJDPKSA-N acetic acid;(2s)-1-[(2r)-2-amino-3-sulfanylpropanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(O)=O.SC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O FKNHDDTXBWMZIR-GEMLJDPKSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 230000010933 acylation Effects 0.000 description 1
- 238000005917 acylation reaction Methods 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- ZVDPYSVOZFINEE-BQBZGAKWSA-N alpha-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O ZVDPYSVOZFINEE-BQBZGAKWSA-N 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 229940126575 aminoglycoside Drugs 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000004873 anchoring Methods 0.000 description 1
- -1 and secondly Proteins 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 230000003302 anti-idiotype Effects 0.000 description 1
- 230000000340 anti-metabolite Effects 0.000 description 1
- 229940100197 antimetabolite Drugs 0.000 description 1
- 239000002256 antimetabolite Substances 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000001651 autotrophic effect Effects 0.000 description 1
- 150000001540 azides Chemical class 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 102000006635 beta-lactamase Human genes 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000019522 cellular metabolic process Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000000749 co-immunoprecipitation Methods 0.000 description 1
- 238000011490 co-immunoprecipitation assay Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- PGUYAANYCROBRT-UHFFFAOYSA-N dihydroxy-selanyl-selanylidene-lambda5-phosphane Chemical compound OP(O)([SeH])=[Se] PGUYAANYCROBRT-UHFFFAOYSA-N 0.000 description 1
- NAGJZTKCGNOGPW-UHFFFAOYSA-K dioxido-sulfanylidene-sulfido-$l^{5}-phosphane Chemical compound [O-]P([O-])([S-])=S NAGJZTKCGNOGPW-UHFFFAOYSA-K 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000007878 drug screening assay Methods 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 231100000221 frame shift mutation induction Toxicity 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 229960003180 glutathione Drugs 0.000 description 1
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 1
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 208000037824 growth disorder Diseases 0.000 description 1
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical class O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 1
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 102000048580 human BRCA1 Human genes 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000016784 immunoglobulin production Effects 0.000 description 1
- 238000007901 in situ hybridization Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 235000014655 lactic acid Nutrition 0.000 description 1
- 239000004310 lactic acid Substances 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- DVCSNHXRZUVYAM-BQBZGAKWSA-N leu-asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O DVCSNHXRZUVYAM-BQBZGAKWSA-N 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 108010071185 leucyl-alanine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010049589 leucyl-leucyl-leucine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010091798 leucylleucine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- HPNSFSBZBAHARI-UHFFFAOYSA-N micophenolic acid Natural products OC1=C(CC=C(C)CCC(O)=O)C(OC)=C(C)C2=C1C(=O)OC2 HPNSFSBZBAHARI-UHFFFAOYSA-N 0.000 description 1
- 238000009629 microbiological culture Methods 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 230000011278 mitosis Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- HPNSFSBZBAHARI-RUDMXATFSA-N mycophenolic acid Chemical compound OC1=C(C\C=C(/C)CCC(O)=O)C(OC)=C(C)C2=C1C(=O)OC2 HPNSFSBZBAHARI-RUDMXATFSA-N 0.000 description 1
- 229960000951 mycophenolic acid Drugs 0.000 description 1
- XMWREJIIPYGZGZ-UHFFFAOYSA-N n-[2-[4-(acridin-9-ylamino)anilino]-2-oxoethyl]-2-[[2-[[6-[(2-aminoethylamino)methyl]pyridin-2-yl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-n'-(3,4,5,6-tetrahydroxy-1-oxohexan-2-yl)pentanediamide Chemical compound NCCNCC1=CC=CC(NC(CC=2NC=NC=2)C(=O)NC(CCC(=O)NC(C=O)C(O)C(O)C(O)CO)C(=O)NCC(=O)NC=2C=CC(NC=3C4=CC=CC=C4N=C4C=CC=CC4=3)=CC=2)=N1 XMWREJIIPYGZGZ-UHFFFAOYSA-N 0.000 description 1
- 208000002761 neurofibromatosis 2 Diseases 0.000 description 1
- 208000022032 neurofibromatosis type 2 Diseases 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 231100000956 nontoxicity Toxicity 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000001216 nucleic acid method Methods 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 238000011275 oncology therapy Methods 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 229950009506 penicillinase Drugs 0.000 description 1
- 108010091617 pentalysine Proteins 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- PTMHPRAIXMAOOB-UHFFFAOYSA-N phosphoramidic acid Chemical compound NP(O)(O)=O PTMHPRAIXMAOOB-UHFFFAOYSA-N 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 239000000902 placebo Substances 0.000 description 1
- 229940068196 placebo Drugs 0.000 description 1
- 230000036470 plasma concentration Effects 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 229920001983 poloxamer Polymers 0.000 description 1
- 229920000447 polyanionic polymer Polymers 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 108010005636 polypeptide C Proteins 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 239000012723 sample buffer Substances 0.000 description 1
- 238000013341 scale-up Methods 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- JRPHGDYSKGJTKZ-UHFFFAOYSA-K selenophosphate Chemical compound [O-]P([O-])([O-])=[Se] JRPHGDYSKGJTKZ-UHFFFAOYSA-K 0.000 description 1
- 235000004400 serine Nutrition 0.000 description 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 238000012868 site-directed mutagenesis technique Methods 0.000 description 1
- FQENQNTWSFEDLI-UHFFFAOYSA-J sodium diphosphate Chemical compound [Na+].[Na+].[Na+].[Na+].[O-]P([O-])(=O)OP([O-])([O-])=O FQENQNTWSFEDLI-UHFFFAOYSA-J 0.000 description 1
- 229940048086 sodium pyrophosphate Drugs 0.000 description 1
- 230000003381 solubilizing effect Effects 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000010254 subcutaneous injection Methods 0.000 description 1
- 239000007929 subcutaneous injection Substances 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 235000019818 tetrasodium diphosphate Nutrition 0.000 description 1
- 239000001577 tetrasodium phosphonato phosphate Substances 0.000 description 1
- 231100001274 therapeutic index Toxicity 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 235000008521 threonine Nutrition 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 108091006106 transcriptional activators Proteins 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000005820 transferase reaction Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 230000004614 tumor growth Effects 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 235000002374 tyrosine Nutrition 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- YSGSDAIMSCVPHG-UHFFFAOYSA-N valyl-methionine Chemical compound CSCCC(C(O)=O)NC(=O)C(N)C(C)C YSGSDAIMSCVPHG-UHFFFAOYSA-N 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 238000003158 yeast two-hybrid assay Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4702—Regulators; Modulating activity
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P43/00—Drugs for specific purposes, not provided for in groups A61P1/00-A61P41/00
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Veterinary Medicine (AREA)
- Gastroenterology & Hepatology (AREA)
- General Chemical & Material Sciences (AREA)
- Pharmacology & Pharmacy (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Animal Behavior & Ethology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Public Health (AREA)
- Engineering & Computer Science (AREA)
- Toxicology (AREA)
- Zoology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Oscillators With Electromechanical Resonators (AREA)
- Amplifiers (AREA)
Description
WO 98/10066 PCT/US97/13944 MODULATORS OF BRCA1 ACTIVITY Field of the Invention The invention described herein relates generally to the field of human disease, and more specifically to treating and diagnosing disease based on the presence of modulators of BRCA1 activity.
Background Breast cancer is one of the leading causes of cancer deaths of women in the United States, and approximately 170,000 women are affected by the disease each year.
About 5% of these reported cases are thought to result from a patient's genetic predisposition to the disease. Breast cancer is generally considered to be classifiable as early-age onset and late-age onset, the latter being defined as occurring at about age Approximately 25% of patients diagnosed with breast cancer before the age of 40 are thought to be familial, and thus have an underlying genetic component. Late-age onset breast cancer is also often familial although the risks of a family member developing the disease is less compared to early-age onset if relatives have presented with the disease.
As a result of studies involving families with inherited early onset breast and ovarian cancers a gene thought to be involved in these diseases has been mapped to the long arm of chromosome 17 and termed BRCA1, or breast cancer one gene. See, Hitoyuki, et al., Cancer res. vol. 55: 2998-3002. Additional studies on sporadic cases of breast cancer have also established a genetic link with this disease to BRCA1 which was more precisely localized to the chromosomal region 17q21. See, Hall, J. et al.
Science, vol. 250: 1684-1689 (1990).
Recently, the BRCA1 gene has been cloned, and shown to encode a protein having the properties of a tumor suppressor protein. See, Miki, et al Science, vol.
266: 66-71; and W096/05306. It has been known for some time that a variety of cancers are caused, at least in part, by mutations to certain normal genes, termed "protooncogenes." Proto-oncogenes are involved in regulating normal cell growth in ways WO 98/10066 PCT/US97/13944 that are only now beginning to be appreciated at the molecular level. The mutated proto-oncogenes, or cancer causing genes termed "oncogenes," disrupt normal cell growth which ultimately causes the death of the organism, if the cancer is not detected and treated in time. During normal or cancer cell growth, proto-oncogenes or oncogenes, are counterbalanced by growth-regulating proteins which regulate or try to regulate the growth of normal or cancer cells, respectively. Such proteins are termed "tumor suppressor proteins," and include BRCA1, p53, retinoblastoma protein (Rb), adenomatous polyposis coli protein (APC), Wilm's tumor 1 protein (WT1), neurofibromatosis type 1 protein (NF1), and neurofibromatosis type 2 protein (NF2).
BRCA1 cDNA encodes a 1863 amino acid protein with a predicted molecular weight of approximately 207,000. See, Miki, et al. (1994) Science vol. 266, pages 66- 71. The cloning and characterization of BRCA1 has facilitated establishing it as a tumor suppressor protein. For example, recent work by several investigators have shown that transfection and expression of the BRCA1 gene sequence into MCF-7 tumor cells retards tumor growth in vivo, and extends the survival time of tumor bearing animals. See, Holt, J. et al, (1996) Nat. Genet. vol. 12, pages 298-302. Similar results were obtained using a retroviral vector expressing wild-type BRCA1 against an established MCF-7 peritoneal tumor.
Considerable work has been done to identify those regions of BRCA1 that affects its tumor suppressor activity. It appears that different regions of the molecule may affect its tumor suppressor activity differently. For instance, near full length truncated BRCA1 proteins do not inhibit breast cancer cell growth, but do inhibit ovarian cancer cell growth. See, Holt, J. et al, (1996) Nat. Genet. vol. 12, pages 298-302. These observations strongly suggest that different host cell factors, presumably proteins, are interacting with different regions of BRCA1 to affect cell growth.
Over the past several years, the interactions of certain tumor suppressor proteins with host cell proteins have begun to be elucidated. See, Levin, Annu. Rev.
Biochem. 1993, vol. 62: pages 623-651. The identification of proteins involved in these interactions will facilitate the development of novel diagnostic methods, as well as novel therapeutics for identifying and treating cancer. For example, the retinoblastoma tumor suppressor protein is phosphorylated at serine residues adjacent to a proline.
The level of phosphorylation is high through S, G2, and M-phase of the cell cycle. The 2 3 kinase that effects this reaction is, in turn, activated by a cyclin that regulates events in the cell cycle. Subsequently, in late mitosis, a phosphatase removes the phosphate groups from the protein, and returns the retinoblastoma tumor suppressor protein to an unphosphorylated state in Go-G1. Clearly, the identification of drugs that can effect these interactions can be expected to play a critical role in regulating cell growth and thus be useful in the treatment of cancer.
To date, however, there have been few, if any studies on the interaction of proteins with the tumor suppressor protein, BRCA1. In order to better develop methods to diagnosis and treat both breast and ovarian cancer the identification and isolation of such proteins is critical.
Summary of the Invention In one aspect the present invention provides an isolated nucleic acid sequence that encodes a BRCA1 Modulator Protein wherein said sequence is selected from the group consisting of 091-21A31, Sequence ID NO. 1, 091- 1F84, Sequence ID No. 3 and 091-132Q20, Sequence ID No. In a further aspect the present invention provides an isolated nucleic acid sequence that encodes a protein encoded by the cDNA on deposit with the V00.0 0ATCC with accession no. 98141 (091-1F84, Sequence ID No. 3).
In an even further aspect the present invention provides an isolated o *nucleic acid sequence that encodes a protein encoded by the cDNA on deposit with the ATCC with accession no. 98142 (091-21A31, Sequence ID No. 1).
In yet an even further aspect the present invention provides an isolated Oleo 2 nucleic acid sequence that encodes a protein encoded by the cDNA on deposit 25 with the ATCC with accession no. 98143 (091-132Q0, Sequence ID No. The discussion of documents, acts, materials, devices, articles and the like is included in this specification solely for the purpose of providing a context for the present invention. It is not suggested or represented that any or all of these matters formed part of the prior art base or were common general 3A knowledge in the field relevant to the present invention as it existed in Australia before the priority date of each claim of this application.
Brief Description of the Drawings Figure 1 shows the clDNA and amino acid sequence of the BROAl Modulator Protein, depicted in Sequence ID No. 1, 091-21A31.
0@ @0 0
S
0 @000 0e 0 0 00 OS 0 S 0
S.
S
@0 S S S0 0 S@ SO S S 0
S
S.
50 S 5500 0@ 0S S
S
0000 0 0555 0 0 0@ S S .55.
05 S 0 S
S.
0 m c$7 ~~DUANEiiE~SPECI\38293DOC 4 Figure 2 shows the cDNA and amino acid sequence of the BRCA1 Modulator Protein, depicted in Sequence ID No. 3, 091-1F84.
Figure 3 shows the cDNA and amino acid sequence of the BRCA1 Modulator Protein, depicted in Sequence ID No. 5, 091-132Q20.
Figure 4 shows the format of an assay to identify compounds that increase the intracellular levels of BRCA1.
Table 1 shows the regions of BRCA1 that interact with the BRCA1 Modulator Proteins 091-1F84, Sequence ID No. 3, 091-21A31, Sequence ID No. 1 and 091-132Q20, Sequence ID No. 5. The experiment was conducted using the two-hybrid assay as described in U.S. Patent No. 5,283,173, or Chien et al., 1991, Proc. Natl. Acad, Sci. USA, 88:9578-9582. The cDNAs that encode 091-1F84, Sequence ID No. 3, 091-21A31, Sequence ID No. 1 and 091-132Q20, Sequence ID No. 5 were fused to the GAL 4 activation domain, and those regions of BRCA1 shown in the table were fused to the binding domain of GAL4. The sign is a subjective measure of the amount of bgalactosidase activity. One being the lowest, and three being the highest activity.
Table 2 shows regions of the BRCA1 Modulator Protein 091-21A31, 2 Sequence ID No. 1 that interact with regions of BRCA1.
A 20 Detailed Description of the Invention All publications and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.
25 Definitions At the outset it is worth noting that unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly "understood by one of ordinary skill in the art to which this invention belongs.
Generally, the nomenclature used herein and the laboratory procedures described below are those well known and commonly employed in the art.
Standard techniques are used for recombinant nucleic acid methods, q polynucleotide synthesis, and microbial culture and transformation ::\WINWORDUENNYM\SPECNKI 8293-97.DOC electroporation, lipofection). Generally enzymatic reactions and purification steps are performed according to the manufacturer's specifications. The techniques and procedures are generally performed according to conventional methods in the art and various general references (see generally, Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd. edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, which is incorporated herein by reference) which are provided throughout this document. The nomenclature used herein and the laboratory procedures in analytical chemistry, organic synthetic chemistry, and pharmaceutical formulation described below are those well known and commonly employed in the art.
Standard techniques are used for chemical syntheses, chemical analyses, pharmaceutical formulation and delivery, and treatment of patients.
In the formulas representing selected specific embodiments of BRCA1 or BRCA1 Modulator Proteins of the present invention, the amino- and carboxyterminal groups, although often not specifically shown, will be understood to be in the form they would assume at physiological pH values, unless otherwise specified. Thus, the N-terminal H 2 and C-terminal-O0 at physiological pH are :understood to be present though not necessarily specified and shown, either in specific examples or in generic formulas. In the polypeptide notation used 20 herein, the left-hand end of the molecule is the amino terminal end and the right-hand end is the carboxy-terminal end, in accordance with standard usage and convention. Of course, the basic and acid addition salts including those which are formed at nonphysiological pH values are also included in the compounds of the invention. The amino acid residues described herein are 25 preferably in the isomeric form. Stereoisomers D-amino acids) of the twenty conventional amino acids, unnatural amino acids such as a,a-distributed amino acids, N-alkyl amino acids, lactic acid, and other unconventional amino acids may also be suitable components for polypeptides of the present invention, as long as the desired functional property is retained by the polypeptide. For the peptides shown, each encoded residue where appropriate is represented by a three letter designation, corresponding to the trivial name of Sthe conventional amino acid, in keeping with standard polypeptide C:\WINWORDUENNYM\SPECNKI\38293-97.DOC 6 nomenclature (described in J. Biol. Chem., 243:3552-59 (1969) and adopted at 37 CFR Free functional groups, including those at the carboxy- or aminoterminus, referred to as noninterfering substituents, can also be modified by amidation, acylation or other substitution, which can, for example, change the solubility of the compounds without affecting their activity. This may be particularly useful in those instances where BRCA1 Modulator Proteins are known to have certain regions that bind to BRCA1, and it is desirable to make soluble peptides from these regions.
As employed throughout the disclosure, the following terms, unless otherwise indicated, shall be understood to have the following meanings: Throughout the description and claims of the specification the word "comprise" and variations of the word, such as "comprising" and "comprises", is not intended to exclude other additives, components, integers or steps.
The term "isolated protein" referred to herein means a protein of cDNA, recombinant RNA, or synthetic origin or some combination thereof, which by virtue of its origin the "isolated protein" is not substantially associated with proteins found in nature, is substantially free of other proteins from the same source, e.g. free of human proteins, may be expressed by a cell from a 20 different species, or does not occur in nature.
The term "naturally-occurring" as used herein as applied to an object refers to the fact that an object can be found in nature. For example, a polypeptide or polynucleotide sequence that is present in an organism (including viruses) that can be isolated from a source in nature and which has 25 not been intentionally modified by man in the laboratory is naturally-occurring.
C:\WINWORDUJENNYM\SPECNKIl38293-97.DOC WO 98/10066 PCT/US97/13944 The term "polynucleotide" as referred to herein means a polymeric form of nucleotides of at least 10 bases in length, either ribonucleotides or deoxynucleotides or a modified form of either type of nucleotide. The term includes single and double stranded forms of DNA.
The term "oligonucleotide" referred to herein includes naturally occurring, and modified nucleotides linked together by naturally occurring, and non-naturally occurring oligonucleotide linkages. Oligonucleotides are a polynucleotide subset with 200 bases or fewer in length. Preferably oligonucleotides are 10 to 60 bases in length and most preferably 12, 13, 14, 15, 16, 17, 18, 19, or 20 to 40 bases in length.
Oligonucleotides are usually single stranded, e.g. for probes; although oligonucleotides may be double stranded, e.g. for use in the construction of a gene mutant.
Oligonucleotides of the invention can be either sense or antisense oligonucleotides. The term "naturally occurring nucleotides" referred to herein includes deoxyribonucleotides and ribonucleotides. The term "modified nucleotides" referred to herein includes nucleotides with modified or substituted sugar groups and the like. The term "oligonucleotide linkages" referred to herein includes oligonucleotides linkages such as phosphorothioate, phosphorodithioate, phosphoroselenoate, phosphorodiselenoate, phosphoroanilothioate, phoshoroaniladate, phosphoroamidate, and the like. An oligonucleotide can include a label for detection, if desired.
The term "sequence homology" referred to herein describes the proportion of base matches between two nucleic acid sequences or the proportion amino acid matches between two amino acid sequences. When sequence homology is expressed as a percentage, 5 the percentage denotes the proportion of matches over the length of sequence from BRCA 1 that is compared to some other sequence. Gaps (in either of the two sequences) are permitted to maximize matching; gap lengths of 15 bases or less are usually used, 6 bases or less are preferred with 2 bases or less more preferred.
When using oligonucleotides as probes or treatments the sequence homology between the target nucleic acid and the oligonucleotide sequence is generally not less than 17 target base matches out of 20 possible oligonucleotide base pair matches preferably not less than 9 matches out of 10 possible base pair matches and most preferably not less than 19 matches out of 20 possible base pair matches WO 98/10066 PCT/US97/13944 Two amino acid sequences are homologous if there is a partial or complete identity between their sequences. For example, 85% homology means that 85% of the amino acids are identical when the two sequences are aligned for maximum matching.
Gaps (in either of the two sequences being matched) are allowed in maximizing matching; gap lengths of 5 or less are preferred with 2 or less being more preferred.
Alternatively and preferably, two protein sequences (or polypeptide sequences derived from them of at least 30 amino acids in length) are homologous, as this term is used herein, if they have an alignment score of at more than 5 (in standard deviation units) using the program ALIGN with the mutation data matrix and a gap penalty of 6 or greater. See Dayhoff, in Atlas of Protein Sequence and Structure, 1972, volume National Biomedical Research Foundation, pp. 101-110, and Supplement 2 to this volume, pp. 1-10. The two sequences or parts thereof are more preferably homologous if their amino acids are greater than or equal to 50% identical when optimally aligned using the ALIGN program.
One of the properties of a BRCA1 Modulator Protein is the presence of a leucine zipper domain. The latter is defined as a stretch of amino acids rich in leucine residues, generally every seventh residue, which provide a means whereby a protein may dimerize to form either homodimers or heterodimers. Examples of proteins with leucine zippers include Jun and Fos.
An optional property of a BRCA1 Modulator Protein is the presence of a zinc finger domain, preferrably of the type C 3
H
2
C
3
C
3
HC
4 or CX2CX,, 27
,,CXHX
2 H or CX 2
CX
6 ,7CX 2 C; where C, X, and H denote cysteine, an amino acid, and histidine, respectively.
The domain binds zinc ions, and is often associated with proteins that bind DNA. Such domains are readily identified using an appropriate data base known to a skilled practitioner of this art, particularly the Prosite Protein Database.
As used herein, "substantially pure" means an object species is the predominant species present on a molar basis it is more abundant than any other macromolecular individual species in the composition), and preferably a substantially purified fraction is a composition wherein the object species comprises at least about percent (on a molar basis) of all macromolecular species present. Generally, a substantially pure composition will comprise more than about 80 percent of all macromolecular species present in the composition, more preferably more than about 8 WO 98/10066 PCT/US97/13944 90%, 95%, and 99%. Most preferably, the object species is purified to essential homogeneity (contaminant species cannot be detected in the composition by conventional detection methods) wherein the composition consists essentially of a single macromolecular species.
The phrases "Modulator Protein," "Modulator Peptide," or Modulator Polypeptide" refer to proteins or peptides that affect the activity of the BRCA1 gene or the protein encoded by the gene. Each of these definitions is meant to encompass one or more such entities.
Chemistry terms herein are used according to conventional usage in the art, as exemplified by The McGraw-Hill Dictionary of Chemical Terms (ed. Parker, 1985), McGraw-Hill, San Francisco, incorporated herein by reference.
The production of proteins from cloned genes by genetic engineering is well known. See, e.g. U.S. Patent Number 4,761,371 to Bell et al. at column 6, line 3 to column 9, line 65. (The disclosure of all patent references cited herein is to be incorporated herein by reference.) The discussion which follows is accordingly intended as an overview of this field, and is not intended to reflect the full state of the art.
DNA regions are operably linked when they are functionally related to each other. For example: a promoter is operably linked to a coding sequence if it controls the transcription of the sequence; a ribosome binding site is operably linked to a coding sequence if it is positioned so as to permit translation. Generally, operably linked means contiguous and, in the case of leader sequences, contiguous and in reading frame.
Suitable host cells include prokaryotes, yeast cells, or higher eukaryotic cells.
Prokaryotes include gram negative or gram positive organisms, for example Escherichia coli coli) or Bacilli. Higher eukaryotic cells include established cell lines of mammalian origin as described below. Exemplary host cells are DH5a, E. coli W3110 (ATCC 27,325), E coli B, E. coli X1776 (ATCC 31,537) and E. coli 294 (ATCC 31,446).
Pseudomonas species, Bacillus species, and Serratia marcesans are also suitable.
In an insect system, Autographa californica nuclear polyhidrosis virus (AcNPV) may be used as a vector to express foreign genes. see Smith et al., 1983, J. Virol.
46: 584; Smith, U.S. Patent No. 4,215,051). In a specific embodiment described below, WO 98/10066 PCT/US97/13944 Sf9 insect cells are infected with a baculovirus vector expressing a glu-glu epitope tagged BRCA1 Modulator construct. See, Rubinfeld, et al., J. Biol. Chem. vol. 270, no.
pp 5549-5555 (1995). Other epitope tags may be employed that are known in the art including a 6x histidine tag myc, or an EE-tag Glu-Glu-tag). refers to the amino acid glutamine.
A broad variety of suitable microbial vectors are available. Generally, a microbial vector will contain an origin of replication recognized by the intended host, a promoter which will function in the host and a phenotypic selection gene such as a gene encoding proteins conferring antibiotic resistance or supplying an autotrophic requirement. Similar constructs will be manufactured for other hosts. E. coli is typically transformed using pBR322. See Bolivar et al., Gene 2, 95 (1977). pBR322 contains genes for ampicillin and tetracycline resistance and thus provides easy means for identifying transformed cells. Expression vectors should contain a promoter which is recognized by the host organism. This generally means a promoter obtained from the intended host. Promoters most commonly used in recombinant microbial expression vectors include the beta-lactamase (penicillinase) and lactose promoter systems (Chang et al., Nature 275, 615 (1978); and Goeddel et al., Nucleic Acids Res. 8, 4057 (1980) and EPO Application Publication Number 36,776) and the tac promoter De Boer et al., Proc.
Natl. Acad. Sci. USA 80, 21 (1983)). While these are commonly used, other microbial promoters are suitable. Details concerning nucleotide sequences of many promoters have been published, enabling a skilled worker to operably ligate them to DNA encoding BRCA 1 in plasmid or viral vectors (Siebenlist et al., Cell 20, 269, 1980)). The promoter and Shine-Dalgarno (SD) sequence (for prokaryotic host expression) are operably linked to the DNA encoding BRCA 1, i.e. they are positioned so as to promote transcription of the BRCA 1 messenger RNA from the DNA. The SD sequence is thought to promote binding of mRNA to the ribosome by the pairing of bases between the SD sequence and the 3' end of E. coli 16S rRNA (Steitz et al. (1979). In Biological Regulation and Development: Gene Expression (ed. R.F. Goldberger)). To express eukaryotic genes and prokaryotic genes with a weak ribosome-binding site see Sambrook et al. (1989) "Expression of cloned genes in Escherichia coli." In Molecular Cloning: A Laboratory Manual. Furthermore, a bacterial promoter can include naturally occurring promoters of non-bacterial origin that have the ability to bind WO 98/10066 PCT/US97/13944 bacterial RNA polymerase and initiate transcription. A naturally occurring promoter of non-bacterial origin can also be coupled with a compatible RNA polymerase to produce high levels of expression of some genes in prokaryotes. The bacteriophage T7 RNA polymerase/promoter system is an example of a coupled promoter system (Studier et al. (1986) J. Mol. Biol. 189:113; Tabor et al. (1985) Proc. Natl. Acad. Sci. 82:1074). In addition, a hybrid promoter can also be composed of a bacteriophage promoter and an E. coli operator region (EPO Pub. No. 267,851).
BRCA1 Modulators can be expressed intracellularly. A promoter sequence can be directly linked with a BRCA1 Modulator gene or a fragment thereof, in which case the first amino acid at the N-terminus will always be a methionine, which is encoded by the ATG start codon. If desired, methionine at the N-terminus can be cleaved from the protein by in vitro incubation with cyanogen bromide or by either in vivo on in vitro incubation with a bacterial methionine N-terminal peptidase (EPO Pub. No. 219,237).
Eukaryotic microbes such as yeast cultures may be transformed with suitable BRCA1 Modulator vectors. See, e.g. U.S. Patent Number 4,745,057. Saccharomyces cerevisiae is the most commonly used among lower eukaryotic host microorganisms, although a number of other strains are commonly available. Yeast vectors may contain an origin of replication from the 2 micron yeast plasmid or an autonomously replicating sequence (ARS), a promoter, DNA encoding BRCA1 Modulator, sequences for polyadenylation and transcription termination, and a selection gene.
Suitable promoting sequences in yeast vectors include the promoters for metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 255, 2073 (1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7, 149 (1968); and Holland et al., Biochemistry 17, 4900 (1978)), such as enolase, glyceraldehyde-3phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. Suitable vectors and promotes for use in yeast expression are further described in R. Hitzman et al., EPO Publication Number 73,657.
Cultures of cells derived from multicellular organisms are a desirable host for recombinant BRCA1 Modulator synthesis. In principal, any higher eukaryotic cell culture is workable, whether from vertebrate or invertebrate culture. However, 11 WO 98/10066 PCT/US97/13944 mammalian cells are preferred, as illustrated in the Examples. Propagation of such cells in cell culture has become a routine procedure. See Tissue Culture, Academic Press, Kruse and Paterson, editors (1973).
The transcriptional and translational control sequences in expression vectors to be used in transforming vertebrate cells are often provided by viral sources. For example, commonly used promoters are derived from CMV, polyoma, Adenovirus 2, and Simian Virus 40 (SV40). See, U.S. Patent Number 4,599,308.
An origin of replication may be provided either by construction of the vector to include an exogenous origin, such as may be derived from SV40 or other viral source Polyoma, Adenovirus, VSV, or BPV), or may be provided by the host cell chromosomal replication mechanism. If the vector is integrated into the host cell chromosome, the latter may be sufficient.
Identification of BRCA1 Modulators BRCA1 Modulators can be identified using several different techniques for detecting protein-protein interactions. Among the traditional methods which may be employed are co-immunoprecipitation, crosslinking and co-purification through gradients or chromatographic columns of cell lysates, or proteins obtained from cell lysates using BRCA1 to identify proteins in the lysate that interact with BRCA1. Such assays may employ full length BRCA1 or a BRCA1 peptide. Once isolated, such an intracellular protein can be identified and can, in turn, be used, in conjunction with standard techniques, to identify proteins with which it interacts. For example, at least a portion of the amino acid sequence of an intracellular protein which interacts with BRCA1, can be ascertained using techniques well known to those of skill in the art, such as the Edman degradation technique. (See, Creighton, 1983, "Proteins: Structures and Molecular Principles", W.H. Freeman Co., pp.34-49). The amino acid sequence obtained may be used as a guide for the generation of oligonucleotide mixtures that can be used to screen for gene sequences encoding such intracellular proteins. Screening may be accomplished, for example, by standard hybridization or PCR techniques. Techniques for the generation of oligonucleotide mixtures and the screening are well-known. (See, Ausubel, sura., and PR Protocols: A Guide to Methods and Applications, 1990, Innis, M. et al., eds. Academic Press, Inc., New York).
WO 98/10066 PCT/US97/13944 Additionally, methods may be employed which result in the simultaneous identification of genes which encode the intracellular proteins interacting with BRCA1.
These methods include, for example, probing expression libraries, in a manner similar to the well known technique of antibody probing of Xgt11 libraries, using labeled BRCA1 protein, or fusion protein, BRCA1 fused to a marker an enzyme, fluor, luminescent protein, or dye), or an Ig-Fc domain.
One method which detects protein interactions in vivo, the two-hybrid system is described in detail for illustration only and not by way of limitation. This system has been described S. Patent No. 5, 283, 173 Chien et al., 1991, Proc. Natl. Acad. Sci.
USA, 88:9578-9582) and is commercially available from Clontech (Palo Alto, CA).
Briefly, utilizing such a system, plasmids are constructed that encode two hybrid proteins: one plasmid consists of nucleotides encoding the DNA-binding domain of a transcription activator protein fused to a BRCA1 nucleotide sequence encoding BRCA1, or BRCA1 peptide or fusion protein, and the other plasmid consists of nucleotides encoding the transcription activator protein's activation domain fused to a cDNA encoding an unknown protein which has been recombined into this plasmid as a part of the cDNA library. The DNA-binding domain fusion plasmid and the cDNA library are transformed into a strain of the yeast Saccharomyces cerevisiae that contains a reporter gene HIS3 or lacZ) whose regulatory region contain the transcription activator's binding site. Either hybrid protein alone cannot activate transcription of the reporter gene; the DNA-binding domain hybrid cannot because it does not provide activation function, and the activation domain hybrid cannot because it cannot localize to the activator's binding sites. Interaction of the two hybrid proteins reconstitutes the functional activator protein and results in transcriptional activation of the reporter gene, which is detected by an assay for the reporter gene product.
The two-hybrid system or related methodology may be used to screen activation domain libraries for proteins that interact with the "bait" gene product. By way of example, and not by way of limitation, preferrably BRCA1 peptides, or fusion proteins are used as the bait gene product. Full length BRCA1 alone can act as a transcriptional activator protein and thus cannot serve as "bait." Total genomic or cDNA sequences are fused to the DNA encoding an activation domain. This library and a plasmid encoding a hybrid of a bait BRCA1 gene product fused to the DNA-binding domain are 13 WO 98/10066 PCT/US97/13944 cotransformed into a yeast reporter strain, and the resulting tranformants are screened for those that have transcriptionally activated reporter gene. For example, and not by way of limitation, a bait BRCA1 gene sequence, such as the open reading frame of BRCA1 (or a domain of BRCA1) can be cloned into a vector such that it is translationally fused to the DNA encoding the DNA-binding domain of the GAL4 protein. These colonies are purified and the library plasmids responsible for reporter gene transcription are isolated. DNA sequencing is then used to determine the nucleotide sequence of the clones which, in turn, reveals the identity of the protein sequences encoded by the library plasmids.
A cDNA library of the cell line from which proteins that interact with bait BRCA1 gene product are to be detected can be made using methods routinely practiced in the art. According to the particular system described herein, for example, the cDNA fragments can be inserted into a vector such that they are translationally fused to the transcriptional activation domain of GAL4. This library can be co-transformed along with the bait BRCA1 gene-GAL4 fusion plasmid into a yeast strain which contains a lacZ gene driven by a promoter which contains GAL4 activation sequence. A cDNA encoded protein, fused to GAL4 transcriptional activation domain, that interacts with bait BRCA1 gene product will reconstitute an active GAL4 protein and thereby drive expression of the HIS3 gene. Colonies which express HIS3 can be detected by their growth on petri dishes containing semi-solid agar based media lacking histidine. The cDNA can then be purified from these strains, and used to produce and isolate the bait BRCA1 gene-interacting protein using techniques routinely practiced in the art.
Using the above described two-hybrid technique several BRCA1 modulators were identified, and shown to share certain properties including a leucine zipper domain.
BRCA1 Modulator cDNA The cDNA, and deduced amino acid sequences, of three representative BRCA1 Modulator Proteins are shown in Figures 1-3. The cDNAs or the proteins that they encode are hereinafter referred to as 091-21A31, Sequence ID No. 1, 091-1F84, Sequence ID No. 3, and 091-132Q20, Sequence ID No. 5. The cDNAs encode proteins that have calculated molecular weights in the range of about 45-97kd. Particularly noteworthy is the presence of at least one leucine zipper motif, and optionally a zinc finger domain.
14 WO 98/10066 PCT/US97/13944 The BRCA1 Modulator Protein nucleotide sequences of the invention include: (a)the DNA sequences shown in Figures 1-3 or contained in the cDNA clones as deposited with the American Type Culture Collection on August 14, 1996 (ATCC) under accession numbers 98141 (091-1F84, Sequence ID No. 98142 (091-21A31, Sequence ID No. and 98143 (091-132Q20, Sequence ID No. and any nucleotide sequence that hybridizes to the complement of the DNA sequence shown in Figures 1-3 or contained in the cDNA clones as deposited with the ATCC under highly stringent conditions, hybridization to filter-bound DNA in 0.5 M NaHPO 4 7% sodium dodecyl sulfate (SDS), 1mM EDTA at 65 0 C, and washing in 0.1xSSC/0.1% SDS at 68 0
C
(Ausubel F.M. et al., eds., 1989, Current Protocols in Molecular Biology, Vol. I, Green Publishing Associates, Inc., and John Wiley sons, Inc., New York, at p. 2.10.3) and encodes a functionally equivalent gene product; and any nucleotide sequence that hybridizes to the complement of the DNA sequences that encode the amino acid sequence shown in Figures 1-3 or contained in the cDNA clones as deposited with the ATCC, as described above, under less stringent conditions, such as moderately stringent conditions, washing in 0.2xSSC/0.1% SDS at 42 0 C (Ausubel et al., 1989, supra), yet which still encodes a functionally equivalent BRCA1 Modulator Protein gene product. Functional equivalents include naturally occurring BRCA1 Modulator Protein genes present in other species, and mutant BRCA1 Modulator Protein genes whether naturally occurring or engineered which retain at least some of the functional activities of a BRCA1 Modulator Protein binding to BRCA1). The invention also includes degenerate variants of sequences through The invention also includes nucleic acid molecules, preferably DNA molecules, that hybridize to, and are therefore the complements of, the nucleotide sequences through in the preceding paragraph. Such hybridization conditions may be highly stringent or less highly stringent, as described above. In instances wherein the nucleic acid molecules are deoxyoligonucleotides ("oligos"), highly stringent conditions may refer, e.g, to washing in 6xSSC/0.05% sodium pyrophosphate at 37 0 C (for 14-base oligos), 48 0 C (for 17-base oligos), 55 0 C (for 20-base oligos), and 60 0 C (for 23-base oligos).
These nucleic acid molecules may encode or act as BRCA1 Modulator gene antisense molecules, useful, for example, in gene regulation (for and/or as antisense primers in WO 98/10066 PCT/US97/13944 amplification reactions of BRCA1 Modulator gene nucleic acid sequences). Such sequences may be used as part of ribozyme and/or triple helix sequences, also useful for BRCA1 Modulator gene regulation. Still further, such molecules may be used as components of diagnostic methods whereby, for example, the presence of a particular BRCA1 Modulator allele associated with uncontrolled cell growth cancer) may be detected.
Further, it will be appreciated by one skilled in the art that a BRCA1 Modulator gene homolog may be isolated from nucleic acid of an organism of interest by performing PCR using two degenerate oligonucleotide primer pools designed on the basis of amino acid sequences within the BRCA1 Modulator gene product disclosed herein. The template for the reaction may be cDNA obtained by reverse transcription of mRNA prepared from, for example, human or non-human cell lines or cell types, such as breast or ovarian cells, known or suspected to express a BRCA1 Modulator gene allele.
The PCR product may be subcloned and sequenced to ensure that the amplified sequences represent the sequences of a BRCA1 Modulator gene. The PCR fragment may then be used to isolate a full length cDNA clone by a variety of methods. For example, the amplified fragment may be labeled and used to screen a cDNA library, such as a bacteriophage cDNA library. Alternatively, the labeled fragment may be used to isolate genomic clones via the screening of a genomic library.
PCR technology may also be utilized to isolate full length cDNA sequences. For example, RNA may be isolated, following standard procedures, from an appropriate cellular source one known, or suspected, to express a BRCA1 Modulator gene, such as, for example, from breast or ovarian cells). A reverse transcription reaction may be performed on the RNA using an oligonucleotide primer specific for the most 5' end of the amplified fragment for the priming of first strand synthesis. The resulting RNA/DNA hybrid may then be "tailed" with guanines using a standard terminal transferase reaction, the hybrid may be digested with RNAase H, and second strand synthesis may then be primed with a poly-C primer. Thus, cDNA sequences upstream of the amplified fragment may easily be isolated. For a review of cloning strategies which may be used, see Sambrook et al., 1989, supra.
WO 98/10066 PCT/US97/13944 A cDNA of a mutant BRCA1 Modulator gene may also be isolated, for example, by using PCR. In this case, the first cDNA strand may be synthesized by hybridizing an oligo-dT oligonucleotide to mRNA isolated from cells known or suspected to be expressed in an individual putatively carrying the mutant BRCA1 Modulator allele, and by extending the new strand with reverse transcriptase. The second strand of the cDNA is then synthesized using an oligonucleotide that hybridizes specifically to the end of the normal gene. Using these two primers, the product is then amplified via PCR, cloned into a suitable vector, and subjected to DNA sequence analysis through methods well known to those of skill in the art. By comparing the DNA sequence of the mutant BRCA1 Modulator allele to that of the normal BRCA1 Modulator allele, the mutation(s) responsible for the loss or alteration of function of the mutant BRCA1 Modulator gene product can be ascertained.
A genomic library can be constructed using DNA obtained from an individual suspected of or known to carry the mutant BRCA1 Modulator allele, or a cDNA library can be constructed using RNA from a cell type known, or suspected, to express the mutant BRCA1 Modulator allele. The normal BRCA1 Modulator gene or any suitable fragment thereof may then be labeled and used as a probe to identify the corresponding mutant BRCA1 Modulator allele in such libraries. Clones containing the mutant BRCA1 Modulator gene sequences may then be purified and subjected to sequence analysis according to methods well known to those of skill in the art.
Additionally, an expression library can be constructed utilizing cDNA synthesized from, for example, RNA isolated from a cell type known, or suspected, to express a mutant BRCA1 Modulator allele in an individual suspected of or known to carry such a mutant allele. In this manner, gene products made by the putatively mutant cell type may be expressed and screened using standard antibody screening techniques in conjunction with antibodies raised against the normal BRCA1 Modulator gene product, as described, below. (For screening techniques, see, for example, Harlow, E. and Lane, eds., 1988, "Antibodies: A Laboratory Manual", Cold Spring Harbor Press, Cold Spring Harbor.) Additionally, screening can be accomplished by screening with labeled fusion proteins. In cases where a BRCA1 Modulator mutation results in an expressed gene product with altered function as a result of a missense or a frameshift mutation), a polyclonal set of antibodies to a BRCA1 Modulator are likely to 17 WO 98/10066 PCT/US97/13944 cross-react with the BRCA1 Modulator mutant. Such BRCA1 Modulator mutants detected via their reaction with labeled antibodies can be purified and subjected to sequence analysis according to methods well known to those of skill in the art.
The invention also encompasses nucleotide sequences that encode peptide fragments of a BRCA1 Modulator, truncated BRCA1 Modulators, and fusion proteins of a BRCA1 Modulator. Nucleotides encoding fusion proteins may include but are not limited to full length BRCA1 Modulators, truncated BRCA1 Modulators or peptide fragments to an unrelated protein or peptide, such as for example, an epitope tag which aids in purification or detection of the resulting fusion protein; or an enzyme, fluorescent protein, luminescent protein which can be used as a marker. The preferred epitope tag is glu-glu as described by Rubinfeld, et al., J. Biol. Chem. vol. 270, no. pp 5549-5555 (1995), and Grussenmyer, et al., Proc. Natl. Acad. Sci. U. S. A. vol. 82, pp. 7952-7954 (1985).
The invention also encompasses DNA vectors that contain any of the foregoing BRCA1 Modulator coding sequences and/or their complements antisense); DNA expression vectors that contain any of the foregoing BRCA1 Modulator coding sequences operatively associated with a regulatory element that directs the expression of the coding sequences; and genetically engineered host cells that contain any of the foregoing BRCA1 Modulator coding sequences operatively associated with a regulatory element that directs the expression of the coding sequences in the host cell. As used herein, regulatory elements include but are not limited to inducible and non-inducible promoters, enhancers, operators and other elements known to those skilled in the art that drive and regulate expression. Such regulatory elements include but'are not limited to the baculovirus promoter, cytomegalovirus hCMV immediate early gene, the early or late promoters of SV40 adenovirus, the lac system, the trp system, the TAC system, the TRC system, the major operator and promoter regions of phage A, the control regions of fd coat protein, the promoter for 3phosphoglycerate kinase, the promoters of acid phosphatase, and the promoters of the yeast-mating factors.
BRCA1 Modulator Proteins As mentioned above, Figures 1-3 shows the cDNA, and deduced amino acid sequences, of three representative BRCA1 Modulator Proteins; 091-21A31, Sequence ID 18 WO 98/10066 PCTIUS97/13944 No. 1, 091-1F84, Sequence ID No. 3, and 091-132Q20, Sequence ID No. 5. 091-132Q20, Sequence ID No. 5 is not a full length sequence. The proteins have calculated molecular weights in the range of about 45-97kd. Particularly noteworthy is the presence of at least one leucine zipper motif, and optionally a zinc finger domain. For instance, 091-1F84, Sequence ID No. 3 has two leucine zippers, 091-132Q20, Sequence ID No. 5 has a single leucine zipper, while 091-21A31, Sequence ID No. 1 has a single leucine zipper and a zinc finger domain. Such domains are readily identified using the Prosite Protein Database.
The invention BRCA1 Modular Proteins, peptide fragments, mutated, truncated or deleted forms of and fusion proteins of these can be prepared for a variety of uses, including but not limited to the generation of antibodies, as reagents in diagnostic assays, the identification and/or the interaction with other cellular gene products involved in cell growth, as reagents in assays for screening for compounds that can be used in the treatment of unwanted cell growth disorders, including but not limited to cancer, and as pharmaceutical reagents useful in the treatment of such diseases.
By way of example, the 091-21A31, Sequence ID No. 1 BRCA1 Modulator Protein sequence begins with a methionine in a DNA sequence context consistent with a translation initiation site. The predicted molecular mass of this BRCA1 Modulator Protein is 53.3 kD.
The BRCA1 Modulator Protein amino acid sequences of the invention include the amino acid sequence shown in FIG. 1, or the amino acid sequence encoded by the cDNA clone, as deposited with the ATCC, as described above. Further, BRCA1 Modulator Proteins of other species are encompassed by the invention. In fact, any BRCA1 Modulator Protein protein encoded by the cDNAs described above, are within the scope of the invention.
The invention also encompasses proteins that are functionally equivalent to the BRCA1 Modulator Protein encoded by the nucleotide sequences described above, as judged by any of a number of criteria, including but not limited to the ability to bind BRCA1, the binding affinity for BRCA1 a change in cellular metabolism or change in phenotype when the BRCA1 Modulator Protein equivalent is present in an appropriate cell type (such as ovarian or breast cells). Such functionally equivalent BRCA1 Modulator Protein proteins include but are not limited to additions or substitutions of 19 WO 98/10066 PCT/US97/13944 amino acid residues within the amino acid sequence encoded by the BRCA1 Modulator nucleotide sequences described, above, but which result in a silent change, thus producing a functionally equivalent gene product. Amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and negatively charged (acidic) amino acids include aspartic acid and glutamic acid.
While random mutations can be made to BRCA1 Modulator DNA (using random mutagenesis techniques well known to those skilled in the art) and the resulting mutant BRCA1 Modulator Proteins tested for activity, site-directed mutations of the BRCA1 Modulator coding sequence can be engineered (using site-directed mutagenesis techniques well known to those skilled in the art) to generate mutant BRCA1 Modulator Proteins with increased function, altered binding affinity for BRCA1.
For example, mutant BRCA1 Modulator Proteins can be engineered so that regions of interspecies identity are maintained, whereas the variable residues are altered, eg, by deletion or insertion of an amino acid residue(s) or by substitution of one or more different amino acid residues. Conservative alterations at the variable positions can be engineered in order to produce a mutant BRCA1 Modulator Protein that retains function. Non-conservative changes can be engineered at these variable positions to alter function. Alternatively, where alteration of function is desired, deletion or non-conservative alterations of the conserved regions can be engineered.
One of skill in the art may easily test such mutant or deleted BRCA1 Modulator Proteins for these alterations in function using the teachings presented herein.
Other mutations to a BRCA1 Modulator coding sequence can be made to generate BRCA1 Modulator Proteins that are better suited for expression, scale up, etc.
in the host cells chosen. For example, the triplet code for each amino acid can be modified to conform more closely to the preferential codon usage of the host cell's translational machinery.
WO 98/10066 PCT/US97/13944 Peptides corresponding to one or more domains (or a portion of a domain) of a BRCA1 Modulator Protein leucine zippers, zinc fingers), truncated or deleted BRCA1 Modulator Proteins BRCA1 Modulator Proteins in which portions of one or more of the above domains are deleted) as well as fusion proteins in which the full length of a BRCA1 Modulator Protein, a BRCA1 Modulator Protein peptide or truncated BRCA1 Modulator Protein is fused to an unrelated protein are also within the scope of the invention and can be designed on the basis of a BRCA1 Modulator nucleotide and BRCA1 Modulator Protein amino acid sequences disclosed in this Section and above. Such fusion proteins include but are not limited to fusions to an epitope tag (such as is exemplified herein); or fusions to an enzyme, fluorescent protein, or luminescent protein which provide a marker function.
While the BRCA1 Modulator Proteins and peptides can be chemically synthesized see Creighton, 1983, Proteins: Structures and Molecular Principles, W.H. Freeman Co., large polypeptides derived from the BRCA1 Modulator Protein and the full length BRCA1 Modulator Protein itself may advantageously be produced by recombinant DNA technology using techniques well known in the art for expressing nucleic acid containing BRCA1 Modulator gene sequences and/or coding sequences. Such methods can be used to construct expression vectors containing the BRCA1 Modulator nucleotide sequences described above and appropriate transcriptional and translational control signals. These methods include, for example, in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. See, for example, the techniques described in Sambrook et al., 1989, supra, and Ausubel et al., 1989, supra. Alternatively, RNA capable of encoding BRCA1 Modulator nucleotide sequences may be chemically synthesized using, for example, synthesizers. See, for example, the techniques described in "Oligonucleotide Synthesis", 1984, Gait, M.J. ed., IRL Press, Oxford, which is incorporated by reference herein in its entirety.
A variety of host-expression vector systems may be utilized to express the BRCA1 Modulator nucleotide sequences of the invention. Where a BRCA1 Modulator Protein peptide or polypeptide is a soluble secreted derivative the peptide or polypeptide can be recovered from the culture medium. If the BRCA1 Modulator Protein peptide or polypeptide is not secreted, it may be isolated from the host cells.
21 WO 98/10066 PCT/US97/13944 However, such engineered host cells themselves may be used in situations where it is important not only to retain the structural and functional characteristics of a BRCA1 Modulator Protein, but to assess biological activity, in drug screening assays.
The expression systems that may be used for purposes of the invention include but are not limited to microorganisms such as bacteria E. coli, B. subtilis) transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors containing BRCA1 Modulator nucleotide sequences; yeast Saccharomyces, Pichia) transformed with recombinant yeast expression vectors containing the BRCA1 Modulator nucleotide sequences; insect cell systems infected with recombinant virus expression vectors baculovirus) containing the BRCA1 Modulator sequences; plant cell systems infected with recombinant virus expression vectors (e cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with recombinant plasmid expression vectors Ti plasmid) containing BRCA1 Modulator nucleotide sequences; or mammalian cell systems COS, CHO, BHK, 293, 3T3, U937) harboring recombinant expression constructs containing promoters derived from the genome of mammalian cells metallothionein promoter) or from mammalian viruses (eg, the adenovirus late promoter; the vaccinia virus 7.5K promoter).
In bacterial systems, a number of expression vectors may be advantageously selected depending upon the use intended for the BRCA1 Modulator gene product being expressed. For example, when a large quantity of such a protein is to be produced, for the generation of pharmaceutical compositions of BRCA1 Modulator Protein or for raising antibodies to the BRCA1 Modulator Protein, for example, vectors which direct the expression of high levels of fusion protein products that are readily purified may be desirable. Such vectors include, but are not limited, to the E. coli expression vector pUR278 (Ruther et al., 1983, EMBO J. 2:1791), in which the BRCA1 Modulator coding sequence may be ligated individually into the vector in frame with the lacZ coding region so that a fusion protein is produced; pIN vectors (Inouye Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van Heeke Schuster, 1989, J. Biol.
Chem. 264:5503-5509); and the like. pGEX vectors may also be used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST). If the inserted sequence encodes a relatively small polypeptide (less than 25 kD), such fusion proteins WO 98/10066 PCT/US97/13944 are generally soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose beads followed by elution in the presence of free glutathione. The pGEX vectors are designed to include thrombin or factor Xa protease cleavage sites so that the cloned target gene product can be released from the GST moiety. Alternatively, if the resulting fusion protein is insoluble and forms inclusion bodies in the host cell, the inclusion bodies may be purified and the recombinant protein solubilized using techniques well known to one of skill in the art.
In an insect system, Autographa californica nuclear polyhidrosis virus (AcNPV) may be used as a vector to express foreign genes. see Smith et al., 1983, J. Virol.
46: 584; Smith, U.S. Patent No. 4,215,051). In a specific embodiment described below, Sf9 insect cells are infected with a baculovirus vectors expressing either a 6 x HIStagged construct, or an (EE)-tagged BRCA1 Modulator construct.
In mammalian host cells, a number of viral-based expression systems may be utilized. Specific embodiments described more fully below express tagged BRCA1 Modulator cDNA sequences using a CMV promoter to transiently express recombinant protein in U937 cells or in Cos-7 cells. Alternatively, retroviral vector systems well known in the art may be used to insert the recombinant expression construct into host cells. For example, retroviral vector systems for transducing hematopoietic cells are described in published PCT applications WO 96/09400 and WO 94/29438.
In cases where an adenovirus is used as an expression vector, the BRCA1 Modulator nucleotide sequence of interest may be ligated to an adenovirus transcription/translation control complex, the late promoter and tripartite leader sequence. This chimeric gene may then be inserted in the adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region of the viral genome region El or E3) will result in a recombinant virus that is viable and capable of expressing the BRCA1 Modulator gene product in infected hosts. See Logan Shenk, 1984, Proc. Natl. Acad. Sci. USA 81:3655-3659). Specific initiation signals may also be required for efficient translation of inserted BRCA1 Modulator nucleotide sequences. These signals include the ATG initiation codon and adjacent sequences. In cases where an entire BRCA1 Modulator gene or cDNA, including its own initiation codon and adjacent sequences, is inserted into the appropriate expression vector, no additional translational control signals may be needed. However, in cases where only a 23 WO 98/10066 PCT[US97/13944 portion of the BRCA1 Modulator coding sequence is inserted, exogenous translational control signals, including, perhaps, the ATG initiation codon, must be provided.
Furthermore, the initiation codon must be in phase with the reading frame of the desired coding sequence to ensure translation of the entire insert. These exogenous translational control signals and initiation codons can be of a variety of origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer elements, transcription terminators, etc. (See Bittner et al., 1987, Methods in Enzymol. 153:516-544).
In addition, a host cell strain may be chosen which modulates the expression of the inserted sequences, or modifies and processes the gene product in the specific fashion desired. Such modifications glycosylation) and processing cleavage) of protein products may be important for the function of the protein. Different host cells have characteristic and specific mechanisms for the post-translational processing and modification of proteins and gene products. Appropriate cell lines or host systems can be chosen to ensure the correct modification and processing of the foreign protein expressed. To this end, eukaryotic host cells which possess the cellular machinery for proper processing of the primary transcript may be used. Such mammalian host cells include but are not limited to CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, and U937 cells.
For long-term, high-yield production of recombinant proteins, stable expression is preferred. For example, cell lines which stably express the BRCA1 Modulator sequences described above may be engineered. Rather than using expression vectors which contain viral origins of replication, host cells can be transformed with DNA controlled by appropriate expression control elements promoter, enhancer sequences, transcription terminators, polyadenylation sites, etc.), and a selectable marker. Following the introduction of the foreign DNA, engineered cells may be allowed to grow for 1-2 days in an enriched media, and then are switched to a selective media. The selectable marker in the recombinant plasmid confers resistance to the selection and allows cells to stably integrate the plasmid into their chromosomes and grow to form colonies which in turn can be cloned and expanded into cell lines. This method may advantageously be used to engineer cell lines which express the BRCA1 Modulator gene product. Such engineered cell lines may be particularly useful in 24 WO 98/10066 PCT/US97/13944 screening and evaluation of compounds that affect the endogenous activity of the BRCA1 Modulator gene product.
A number of selection systems may be used, including but not limited to the herpes simplex virus thymidine kinase (Wigler et al., 1977, Cell 11:223), hypoxanthineguanine phosphoribosyltransferase (Szybalska Szybalski, 1962, Proc. Natl. Acad. Sci.
USA 48:2026), and adenine phosphoribosyltransferase (Lowy et al., 1980, Cell 22:817) genes can be employed in tk-, hgprt- or aprt- cells, respectively. Also, antimetabolite resistance can be used as the basis of selection for the following genes: dhfr, which confers resistance to methotrexate (Wigler et al., 1980, Natl. Acad. Sci. USA 77:3567; O'Hare et al., 1981, Proc. Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance to mycophenolic acid (Mulligan Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072); neo, which confers resistance to the aminoglycoside G-418 (Colberre-Garapin et al., 1981, J.
Mol. Biol. 150:1); and hygro, which confers resistance to hygromycin (Santerre et al., 1984, Gene 30:147).
The BRCA1 Modulator gene products can also be expressed in transgenic animals. Animals of any species, including, but not limited to, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, goats, and non-human primates, e.g, baboons, monkeys, and chimpanzees may be used to generate BRCA1 Modulator transgenic animals.
Any technique known in the art may be used to introduce the BRCA1 Modulator transgene into animals to produce the founder lines of transgenic animals. Such techniques include, but are not limited to pronuclear microinjection (Hoppe, P.C. and Wagner, 1989, U.S. Pat. No. 4,873,191); retrovirus mediated gene transfer into germ lines (Van der Putten et al., 1985, Proc. Natl. Acad. Sci., USA 82:6148-6152); gene targeting in embryonic stem cells (Thompson et al., 1989, Cell 56:313-321); electroporation of embryos (Lo, 1983, Mol Cell. Biol. 3:1803-1814); and sperm-mediated gene transfer (Lavitrano et al., 1989, Cell 57:717-723); etc. For a review of such techniques, see Gordon, 1989, Transgenic Animals, Intl. Rev. Cytol. 115:171-229, which is incorporated by reference herein in its entirety.
The present invention provides for transgenic animals that carry the BRCA1 Modulator transgene in all their cells, as well as animals which carry the transgene in some, but not all their cells, mosaic animals. The transgene may be integrated as a WO 98/10066 PCT/US97/13944 single transgene or in concatamers, head-to-head tandems or head-to-tail tandems.
The transgene may also be selectively introduced into and activated in a particular cell type by following, for example, the teaching of Lasko et al. (Lasko, M. et al., 1992, Proc.
Natl. Acad. Sci. USA 89: 6232-6236). The regulatory sequences required for such a celltype specific activation will depend upon the particular cell type of interest, and will be apparent to those of skill in the art. When it is desired that the BRCA1 Modulator transgene be integrated into the chromosomal site of the endogenous BRCA1 Modulator gene, gene targeting is preferred. Briefly, when such a technique is to be utilized, vectors containing some nucleotide sequences homologous to the endogenous BRCA1 Modulator gene are designed for the purpose of integrating, via homologous recombination with chromosomal sequences, into and disrupting the function of the nucleotide sequence of the endogenous BRCA1 Modulator gene. In this way, the expression of the endogenous BRCA1 Modulator gene may also be eliminated by inserting non-functional sequences into the endogenous gene. The transgene may also be selectively introduced into a particular cell type, thus inactivating the endogenous BRCA1 Modulator gene in only that cell type, by following, for example, the teaching of Gu et al. (Gu et al., 1994, Science 265: 103-106). The regulatory sequences required for such a cell-type specific inactivation will depend upon the particular cell type of interest, and will be apparent to those of skill in the art.
Once transgenic animals have been generated, the expression of the recombinant BRCA1 Modulator gene may be assayed utilizing standard techniques. Initial screening may be accomplished by Southern blot analysis or PCR techniques to analyze animal tissues to assay whether integration of the transgene has taken place. The level of mRNA expression of the transgene in the tissues of the transgenic animals may also be assessed using techniques which include but are not limited to Northern blot analysis of cell type samples obtained from the animal, in situ hybridization analysis, and RT-PCR.
Samples of BRCA1 Modulator gene-expressing tissue, may also be evaluated immunocytochemically using antibodies specific for the BRCA1 Modulator transgene product, as described below.
Antibodies to BRCA1 Modulator Proteins Antibodies that specifically recognize one or more epitopes of a BRCA1 Modulator Protein, or epitopes of conserved variants, or peptide fragments are also 26 WO 98/10066 PCT/US97/13944 encompassed by the invention. Such antibodies include but are not limited to polyclonal antibodies, monoclonal antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies, Fab fragments, F(ab') 2 fragments, fragments produced by a Fab expression library, anti-idiotypic (anti-Id) antibodies, and epitopebinding fragments of any of the above.
The antibodies of the invention may be used, for example, in the detection of the BRCA1 Modulator Protein in a biological sample and may, therefore, be utilized as part of a diagnostic or prognostic technique whereby patients may be tested for abnormal amounts of these proteins. Such antibodies may also be utilized in conjunction with, for example, compound screening schemes, as described herein for the evaluation of the effect of test compounds on expression and/or activity of the BRCA1 Modulator Protein. Additionally, such antibodies can be used in conjunction with the gene therapy techniques described herein, to, for example, evaluate the normal and/or engineered BRCA1 Modulator Protein expressing cells prior to their introduction into the patient.
Such antibodies may additionally be used as a method for the inhibition of abnormal BRCA1 Modulator Protein activity.
For the production of antibodies, various host animals may be immunized by injection with the BRCA1 Modulator Protein, a BRCA1 Modulator Protein peptide, truncated BRCA1 Modulator Protein polypeptides, functional equivalents of the BRCA1 Modulator Protein or mutants of the BRCA1 Modulator Protein. Such host animals may include but are not limited to rabbits, mice, and rats, to name but a few.
Various adjutants may be used to increase the immunological response, depending on the host species, including but not limited to Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dinitrophenol, and potentially useful human adjutants such as BCG (bacille Calmette-Guerin) and Corynebacterium parvum. Polyclonal antibodies are heterogeneous populations of antibody molecules derived from the sera of the immunized animals.
Monoclonal antibodies, which are homogeneous populations of antibodies to a particular antigen, may be obtained by any technique which provides for the WO 98/10066 PCT/US97/13944 production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma technique of Kohler and Milstein, (1975, Nature 256:495-497; and U.S. Patent No. 4,376,110), the human B-cell hybridoma technique (Kosbor et al., 1983, Immunology Today 4:72; Cole et al., 1983, Proc. Natl. Acad. Sci.
USA 80:2026-2030), and the EBV-hybridoma technique (Cole et al., 1985, Monoclonal Antibodies And Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may be of any immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma producing the mAb of this invention may be cultivated in vitro or in vivo. Production of high titers of mAbs in vivo makes this the presently preferred method of production.
In addition, techniques developed for the production of "chimeric antibodies" (Morrison et al., 1984, Proc. Natl. Acad. Sci. USA, 81:6851-6855; Neuberger et al., 1984, Nature, 312:604-608; Takeda et al., 1985, Nature, 314:452-454) by splicing the genes from a mouse antibody molecule of appropriate antigen specificity together with genes from a human antibody molecule of appropriate biological activity can be used. A chimeric antibody is a molecule in which different portions are derived from different animal species, such as those having a variable region derived from a mAb and a human immunoglobulin constant region.
Alternatively, techniques described for the production of single chain antibodies Patent 4,946,778; Bird, 1988, Science 242:423-426; Huston et al., 1988, Proc. Natl.
Acad. Sci. USA 85:5879-5883; and Ward et al., 1989, Nature 334:544-546) can be adapted to produce single chain antibodies against BRCA1 Modulator Protein gene products.
Single chain antibodies are formed by linking the heavy and light chain fragments of the Fv region via an amino acid bridge, resulting in a single chain polypeptide.
Antibody fragments which recognize specific epitopes may be generated by known techniques. For example, such fragments include but are not limited to: the F(ab') 2 fragments which can be produced by pepsin digestion of the antibody molecule and the Fab fragments which can be generated by reducing the disulfide bridges of the F(ab') 2 fragments. Alternatively, Fab expression libraries may be constructed (Huse et al., 1989, Science, 246:1275-1281) to allow rapid and easy identification of monoclonal Fab fragments with the desired specificity.
WO 98/10066 PCT/US97/13944 Antibodies to the BRCA1 Modulator Protein can, in turn, be utilized to generate anti-idiotype antibodies that "mimic" the BRCA1 Modulator Protein using techniques well known to those skilled in the art. (See, Greenspan Bona, 1993, FASEB J 7(5):437-444; and Nissinoff, 1991, J. Immunol. 147(8):2429-2438).
Identification of Compounds that Increase BRCA1 Levels using BRCA1 Modulators The BRCA1 gene encodes a protein that has been shown to have tumor suppressor activity. See, Holt, J. et al, (1996) Nat. Genet. vol. 12, pages 298-302. Such studies have shown that certain cancer cells have low levels of BRCA1, and that increasing the levels causes a reversion to the normal cell phenotype. Thus, compounds that increase BRCA1 levels will have significant therapeutic use for the treatment of cancer.
An aspect of the instant invention is the description of an assay using BRCA1 and BRCA1 Modulators that facilitates the identification of compounds that increase intracellular levels of BRCA1. One format of the assay is shown in schematic form in Figure 4. Briefly, the assay makes use of two events: firstly, BRCA1 is known to be a general transcriptional activator, and secondly, BRCA1 Modulators bind to BRCA1.
The assay makes use of certain features of the two-hybrid assay described above. Two plasmids are constructed and transfected into a suitable cell line, preferrably a breast or ovarian cell line. A preferred breast cell line would be MCF-7. One plasmid contains the nucleotide sequence recognized by GAL4 operably linked to an activator sequence, and a reporter gene downstream of this sequence. An example of a preferred reporter gene is the gene that encodes luciferase. The second plasmid encodes and expresses the GAL4 DNA binding domain fused to a BRCA1 Modulator. The preferred Modulator is 091-21A31, Sequence ID No. 1.
The GAL4 DNA binding domain-BRCA1 Modulator fusion protein binds to the GAL4 DNA binding domain on the first plasmid which, in turn, recruits any BRCA1 present to form a complex consisting of GAL4 DNA binding domain-BRCA1 Modulator fusion and BRCA1. As part of the complex, BRCA1 is in proximity to the activator sequence which in turn initiates transcription of the reporter gene. Thus, compounds can be tested for their capacity to stimulate the production of BRCA1.
WO 98/10066 PCT/US97/13944 Those that do will cause an increase in the reporter gene product. The above assay is schematically presented in Figure 4.
Identification of Compounds that alter BRCA1 Interaction with BRCA1 Modulators As mentioned above, BRCA1 is a known tumor suppressor. See, Holt, J. et al, (1996) Nat. Genet. vol. 12, pages 298-302. Thus compounds that affect the normal interaction of BRCA1 with BRCA1 Modulator Proteins may affect the tumor suppressor activity of BRCA1. The extent of the effect will, in large part, depend on the chemical properties of the compounds tested. Some may strongly disrupt the interaction of BRCA1 with BRCA1 Modulator Proteins, while others would have a minimal effect.
The former would be reflected in a biological assay for altered tumorgenicity, while the latter would not. The converse is also true, certain compounds may strengthen the interaction of BRCA1 with BRCA1 Modulator Proteins, in which case the opposite biological effect would be anticipated. Thus, it is highly desirable to assay for compounds that affect BRCA1 interactions with BRCA1 Modulator Protein.
The basic principle of the assay systems used to identify such compounds that affect BRCA1 interactions with BRCA1 Modulator Proteins involves preparing a reaction mixture containing BRCA1 protein, polypeptide, peptide or fusion protein as described above, and a BRCA1 Modulator Protein under conditions and for a time sufficient to allow the two to interact and bind, thus forming a complex. In order to test a compound for inhibitory activity, the reaction mixture is prepared in the presence and absence of the test compound. The test compound may be initially included in the reaction mixture, or may be added at a time subsequent to the addition of the BRCA1 moiety and its BRCA1 Modulator Protein. Control reaction mixtures are incubated without the test compound or with a placebo. The formation of any complexes between the BRCA1 moiety and the BRCA1 Modulator Protein is then detected. The formation of a complex in the control reaction, but not in the reaction mixture containing the test compound, indicates that the compound interferes with the interaction of the BRCA1 and the interactive BRCA1 Modulator Protein. Additionally, complex formation within reaction mixtures containing the test compound and normal BRCA1 protein may also be compared to complex formation within reaction mixtures containing the test compound and a mutant BRCA1. This comparison may be important in those cases WO 98/10066 PCTIUS97/13944 wherein it is desirable to identify compounds that disrupt interactions of mutant but not normal BRCA1.
The assay for compounds that interfere with the interaction of the BRCA1 and BRCA1 Modulator Proteins can be conducted in a heterogeneous or homogeneous 5 format. Heterogeneous assays involve anchoring either the BRCA1 moiety or the BRCA1 Modulator Protein onto a solid phase and detecting complexes anchored on the solid phase at the end of the reaction. In homogeneous assays, the entire reaction is carried out in a liquid phase. In either approach, the order of addition of reactants can be varied to obtain different information about the compounds being tested. For example, test compounds that interfere with the interaction by competition can be identified by conducting the reaction in the presence of the test substance; by adding the test substance to the reaction mixture prior to or simultaneously with the BRCA1 moiety and interactive BRCA1 Modulator Protein. Alternatively, test compounds that disrupt preformed complexes, compounds with higher binding constants that displace one of the components from the complex, can be tested by adding the test compound to the reaction mixture after complexes have been formed.
Representative formats are described briefly below.
In a heterogeneous assay system, either the BRCA1 moiety or the interactive BRCA1 Modulator Protein, is anchored onto a solid surface, while the non-anchored species is labeled, either directly or indirectly. In practice, microtiter plates are conveniently utilized. The anchored species may be immobilized by non-covalent or covalent attachments. Non-covalent attachment may be accomplished simply by coating the solid surface with a solution of BRCA1 or BRCA1 Modulator Protein and drying. Alternatively, an immobilized antibody specific for the species to be anchored may be used to anchor the species to the solid surface. The surfaces may be prepared in advance and stored.
In order to conduct the assay, the partner of the immobilized species is exposed to the coated surface with or without the test compound. After the reaction is complete, unreacted components are removed by washing) and any complexes formed will remain immobilized on the solid surface. The detection of complexes anchored on the solid surface can be accomplished in a number of ways. Where the non-immobilized species is pre-labeled, the detection of label immobilized on the surface indicates that WO 98/10066 PCT/US97/13944 complexes were formed. Where the non-immobilized species is not pre-labeled, an indirect label can be used to detect complexes anchored on the surface; using a labeled antibody specific for the initially non-immobilized species (the antibody, in turn, may be directly labeled or indirectly labeled with a labeled anti-Ig antibody).
Depending upon the order of addition of reaction components, test compounds which inhibit complex formation or which disrupt preformed complexes can be detected.
Alternatively, the reaction can be conducted in a liquid phase in the presence or absence of the test compound, the reaction products separated from unreacted components, and complexes detected; using an immobilized antibody specific for one of the binding components to anchor any complexes formed in solution, and a labeled antibody specific for the other partner to detect anchored complexes. Again, depending upon the order of addition of reactants to the liquid phase, test compounds which inhibit complex or which disrupt preformed complexes can be identified.
In an alternate embodiment of the invention, a homogeneous assay can be used.
In this approach, a preformed complex of the BRCA1 moiety and the interactive BRCA1 Modulator Protein is prepared in which either the BRCA1 or its BRCA1 Modulator Proteins is labeled, but the signal generated by the label is quenched due to formation of the complex (see, g. U.S. Patent No.4,109,496 by Rubenstein which utilizes this approach for immunoassays). The addition of a test substance that competes with and displaces one of the species from the preformed complex will result in the generation of a signal above background. In this way, test substances which disrupt BRCA1/intracellular BRCA1 Modulator Protein interaction can be identified.
In a particular embodiment, a BRCA1 fusion protein can be prepared for immobilization. For example, BRCA1 or a peptide fragment can be fused to a glutathione-S-transferase (GST) gene using a fusion vector, such as pGEX-5X-1, in such a manner that its binding activity is maintained in the resulting fusion protein. The interactive BRCA1 Modulator Protein can be purified and used to raise a monoclonal antibody, using methods routinely practiced in the art and described above. This antibody can be labeled with the radioactive isotope 125I, for example, by methods routinely practiced in the art. In a heterogeneous assay, the GST-BRCA1 fusion protein can be anchored to glutathione-agarose beads. The interactive BRCA1 WO 98/10066 PCT/US97/13944 Modulator Protein can then be added in the presence or absence of the test compound in a manner that allows interaction and binding to occur. At the end of the reaction period, unbound material can be washed away, and the labeled monoclonal antibody can be added to the system and allowed to bind to the complexed components. The interaction between the BRCA1 protein and the interactive BRCA1 Modulator Protein can be detected by measuring the amount of radioactivity that remains associated with the glutathione-agarose beads. A successful inhibition of the interaction by the test compound will result in a decrease in measured radioactivity.
Alternatively, the GST-BRCA1 fusion protein and the interactive BRCA1 Modulator Protein can be mixed together in liquid in the absence of the solid glutathione-agarose beads. The test compound can be added either during or after the species are allowed to interact. This mixture can then be added to the glutathioneagarose beads and unbound material is washed away. Again the extent of inhibition of the BRCA1/BRCA1 Modulator Protein interaction can be detected by adding the labeled antibody and measuring the radioactivity associated with the beads.
In another embodiment of the invention, these same techniques can be employed using peptide fragments that correspond to the binding domains of the BRCA1 and/or the interactive or BRCA1 Modulator in place of one or both of the full length proteins.
Any number of methods routinely practiced in the art can be used to identify and isolate the binding domains. Such domains are discussed more fully in the examples, below. These methods include, but are not limited to, mutagenesis of the gene encoding one of the proteins and screening for disruption of binding in a co-immunoprecipitation assay. Compensating mutations in the gene encoding the second species in the complex can then be selected. Sequence analysis of the genes encoding the respective proteins will reveal the mutations that correspond to the region of the protein involved in interactive binding. The two hybrid assay may also be used, as discussed more fully in the examples below. For instance, once the gene coding for the intracellular BRCA1 Modulator Protein is obtained, short gene segments can be engineered to express peptide fragments of the protein, which can then be tested for binding activity and purified or synthesized. purified or synthesized.
WO 98/10066, PCTIUS97/13944 Effective Dose Toxicity and therapeutic efficacy of compounds identified above that affect the interaction of BRCA1 with BRCA1 Modulator Proteins, and thus affect cell growth can be determined by standard pharmaceutical procedures in cell cultures or experimental animals, for determining the LD 5 0 (the dose lethal to 50% of the population) and the ED 5 0 (the dose therapeutically effective in 50% of the population). Numerous model systems are known to the skilled practitioner of the art that can be employed to test the cell growth properties of the instant compounds including growth of cells in soft agar, and effect on tumors in vivo. Such experiments can be conducted on cells cotransfected with BRCA1 and BRCA1 Modulators The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the ratio LD 5 0
/ED
5 0 Compounds which exhibit large therapeutic indices are preferred. While compounds that exhibit toxic side effects may be used, care should be taken to design a delivery system that targets such compounds to the site of affected tissue in order to minimize potential damage to uninfected cells and, thereby, reduce side effects.
The data obtained from the cell culture assays and animal studies can be used in formulating a range of dosage for use in humans. The dosage of such compounds lies preferably within a range of circulating concentrations that include the ED 5 0 with little or no toxicity. The dosage may vary within this range depending upon the dosage form employed and the route of administration utilized. For any compound used in the method of the invention, the therapeutically effective dose can be estimated initially from cell culture assays. A dose may be formulated in animal models to achieve a circulating plasma concentration range that includes the IC 5 0 the concentration of the test compound which achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can be used to more accurately determine useful doses in humans. Levels in plasma may be measured, for example, by high performance liquid chromatography.
The Examples which follow are illustrative of specific embodiments of the invention, and various uses thereof. They are set forth for explanatory purposes only, and are not to be taken as limiting the invention.
34 WO 98/10066 PCT/US97/13944 Example 1 Identification of cDNAs that Encode BRCA1 Modulator Proteins BRCA1 modulators were identified initially using the yeast two hybrid assay system described in U. S. Patent No. 5, 283, 173, or Chien et al., 1991, Proc. Natl. Acad.
Sci. USA, 88:9578-9582. The assay components are also commercially available from Clontech (Palo Alto, CA).
The cDNA encoding human BRCA1 (See, Miki, et al Science, vol. 266: 66-71; and PCT/US95/10202) was digested with MvnI-NheI and the fragment representing BRCA1 amino acids 8-1293 was fused to the GAL4 binding domain in the SmaI-NheI sites of pGBT8 plasmid, which is the pMA424 plasmid of Chien et al. as described in Proc. Natl. Acad. Sci. vol. 88: pages 9578-9582 (1991), modified by the insertion of the sequence CCGGGGATCCCCATGGCTAGCCATATG-3' between the EcoRI and Sail unique sites.
This was transformed into the yeast strain YGH1, and the YGH1 strain carrying the plasmid GAL4-BRCA1 (8-1293) was evaluated for its intrinsic ability to activate the two reporters-growth in histidine minus media and P-galactosidase activity. The YGH1 strain carrying the plasmid GAL4-BRCA1 (8-1293) was able to grow on minus histidine plates but this was controlled by the addition of 7.5mM 3-amino-1,2,4-Triazole (3AT) to the minus histidine plates and the strain had no detectable P-galactosidase activity. The YGH1 strain carrying the plasmid GAL4-BRCA1 (8-1293) was subsequently transformed with a HeLa cell cDNA library fused to the GAL4 activation domain in the pGAD plasmid (Chien et al., Proc. Natl. Acad. Sci. vol. 88: pages 9578-9582 (1991).
When a cDNA encodes a protein that interacts with the BRCA1 protein (amino acids 8- 1293), the YGH1 strain is expected to grow in the absence of histidine supplemented with 7.5mM 3AT and produce p-galactosidase.
Four of the 2.5 X 106 transformants screened grew in the absence of histidine supplemented with 7.5mM 3AT and had P-galactosidase activity. The plasmids recovered from these 4 yeast strains were used to re-transform the original YGH1 GAL4-BRCA1 (8-1293) strain. All the plasmids conferred the ability to grow in the absence of histidine supplemented with 7.5mM 3AT and to produce p-galactosidase.
Upon subsequent screening, three of the four were found to have cDNAs that encode Modulator Proteins that clearly bound to BRCA1. One of the plasmids contained the novel cDNA encoding for the BRCA1 Modulator Protein hereinafter termed, 091-21A31, Sequence ID No. 1. The nucleotide and protein sequence are shown in Figure 1. The calculated molecular weight is about 53kd, and it has an estimated pi of 9.05.
Particularly noteworthy is the presence of a zinc finger domain and a leucine zipper motif.
The nucleotide sequence of the second cDNA and amino acid sequence that it encodes, hereinafter termed, 091-1F84, Sequence ID No. 3, is shown in Figure 2. Note that this clone displays two leucine zipper domains. The protein has a calculated molecular weight of 96, 443. 3 and an estimated pI of 4.95.
The nucleotide sequence of the third cDNA and amino acid sequence that it encodes, hereinafter termed, 091-132Q20, Sequence ID No. 5, is shown in Figure 3. Note that this clone also displays a leucine zipper domain. The protein has a calculated molecular weight of 45,904. 9 and an estimated pi of 6.73.
Example 2 Binding of BRCA1 Domains to BRCA1 Modulators Experiments were conducted to ascertain which regions of BRCA1 interact with the three BRCA1 Modulators described in Example 1. The experiment was conducted using the two-hybrid assay as described in U. S. Patent No. 5, 283, 173, or Chien et al., 20 1991, Proc. Natl. Acad. Sci. USA, 88:9578-9582. The cDNA that encodes the 091-1F84, Sequence ID No. 3, 091-21A31, Sequence ID No. 1, and 091-132Q20, Sequence ID No. was fused to the GAL 4 activation domain, and those regions of BRCA1 shown in Table 1 and that contain BRCA1 amino acids 1-300, 1-600, or 8-1293 were fused to the binding domain of GAL4. Controls consisted of the vector, or bcl-2 fused to the GAL 4 binding S 25 domain (See, U. S. Patent 5, 539,085).
9@ 36a TABLE 1 INTERACTION OF BRCAI WITH TWO HYBRID HITS (091-1 GAL4AD GAL4BD F84 21 A31 132 BRCA1 (1-300) BROAl (1-600) BROA1 (8-1293) vector or BCL2 The BRCAI constructs employed in the above studies were generated using restriction fragments of BROA1, and cloning them into the plasmid pGBT8, which is a derivative of the plasmid pMA424, as described by Chien et al. in Proc. Nati. Acad. Sci. vol. 88: pages 9578-9582 (1991), modified by the insertion of the sequence 5'-CCGGGGATCCCCATGGCTAGCCATATG-3' 10 between the EcoRI and Sall unique sites.
9* 99 9 9 9 9 9999 9* 9* .9 99 9 9* 99 9 99 9 9 9 9 99 99 9.
9 9 9 9 9*99 9* 9 9* 99 9, 99 *9 9 9 9 9 9. 99 9. 9 9 9 9 9 9. 9 9 99 9 99 9 9 WO 98/10066 PCT/US97/13944 Briefly, the construct containing the first 300 amino acids of BRCA1 was generated by subcloning the Ncol-EcoR1 blunted BRCA1 fragment into the blunted EcoR1 site of pGBT8. The BRCA1 containing amino acids 8-1293 was generated as described above.
Lastly, the BRCA1 construct containing amino acids 1-600 was generated by subcloning the Ncol-Spel BRCA1 fragment into the Ncol-Nhel site of pGBT8.
Table 1 shows those regions of BRCA1 that interact with the proteins encoded by 091-1F84, Sequence ID No. 3, 091-21A31, Sequence ID No. 1, and 091-132Q20, Sequence ID No. 5. The sign is a subjective measure of the amount of b-galactosidase activity. One being the lowest, and three being the highest activity. It is apparent from Table 1 that the first 300 amino acids of BRCA1 do not bind to any of the three BRCA1 Modulators, but that all three BRCA1 Modulators bind to the BRCA1 construct containing the first 600 amino acids of BRCA. None of the BRCA1 Modulators bind to the vector or bcl-2 controls, while all the BRCA1 Modulators bound to the near full length BRCA1 construct which has amino acids 8-1293.
The results show that the three BRCA1 Modulators preferrentially bind to the first 600 amino acids of BRCA1.
Example 3 Identification of Interacting Domains of 091-21A31, Sequence ID No. 1 and BRCA1 Two hybrid experiments were conducted to ascertain the regions of the BRCA1 Modulator 091-21A31, Sequence ID No. 1 that interact with BRCA1. The assay was run essentially as described in Example 1. Transformation and growth of yeast cultures were performed essentially as described in U. S. Patent No. 5, 283, 173; Chien et al., 1991, Proc. Natl. Acad. Sci. USA, 88:9578-9582; or Spaargaren, et al., (1994) Biochem. J. 300, 303-307.
Briefly, the YGH1 yeast strain was co-transformed with cDNA encoding 091- 21A31, Sequence ID No. 1, or cDNA encoding 091-21A31, Sequence ID No. 1 fragments containing amino acids 78-469, 1-300, or 300-469 fused to the GAL4 activation domain.
As a control, bcl-2 cDNA (See, U. S. Patent 5, 539, 085) was fused to the GAL4 activation domain. cDNAs encoding BRCA1 fragments having amino acids 1-300, 1- 600, or 8-1293 were fused to the GAL4 binding domain as described in Example 2.
The 091-21A31, Sequence ID No. 1 constructs were generated using the plasmids pGADGH or pGAD424; both are available from Clontech.
The 091-21A31, Sequence ID No. 1 construct containing amino acids 75-469 was generated by subcloning the EcoR1-Xhol 091-21A31, Sequence ID No. 1 fragment into the EcoR1-Sall site of pGAD424.
The 091-21A31, Sequence ID No. 1 construct containing amino acids 1-300 was generated by subcloning the BamH1-Sall 091-21A31, Sequence ID No. 1 fragment into the BamH1-Sall site of pGADGH.
The 091-21A31, Sequence ID No. 1 construct containing amino acids 300-469 was generated by subcloning the BamH1 blunted-Sail 091-21A31, Sequence ID No. 1 fragment into the Sail blunted-Xhol site of pGADGH.
Table 2 shows the results of the co-transformation studies. It is apparent that the first 300 amino acids of BRCA1 do not to bind to any of three 091-21A31, Sequence ID No. 1 fragment constructs, nor to 091-21A31, Sequence ID No. 1. The BRCA1 construct containing amino acids 1-600 does bind to 091-21A31, Sequence ID No. 1, and to the construct containing 091-21A31, Sequence ID No. 1 amino acids 78-469, but not to the amino acid 091-21A31, Sequence ID No. 1 constructs 1-300 and 300-469. Also, the BRCA1 construct having amino acids 8-1293 also binds 091-21A31, Sequence ID No. 1, the 78-469 and 1-300 amino acid constructs, but not to the 091-21A31, Sequence ID No. 1 20 construct having amino acids 300-469.
O
0 o 38a TABLE 2 INTERACTION OF BRCA1 WITH 091-21 GAL4AD 21 21 21 21 GAL4BD A31 (78-469) (1-300) (300-469) BRCA1 (1-300) BRCA1 (1-600) BRCA1 (8-1293) vector or BCL2 Example 4 Expression and Purification of BRCA1 Modulators The BRCA1 Modulators were expressed in and purified from baculovirus SF9 infected cells. Methods for producing baculovirus, as well as growing SF9 cells are well known in the art, and detailed procedures can be found in M.
10 Summers and G. Smith in "A Manual of Methods for Baculovirus Vectors and S" Insect Cell Culture Procedures," Texas Agricultural Experiment Station, Bulletin No. 1555 (May, 1987 or in EPS 127,839 to G. E. Smith and M. D. Summers.
The following constructs were generated using pAcC13 (See, Rubinfeld, et al. Cell 65, 1033-1042 (1991)) or pAcOG, a derivative of pAcC13 in which 15 the polylinker T.oOC WO 98/10066 PCT/US97/13944 was replaced with a synthetic linker engineered to encode an initiating methionine, the Glu-Glu (See, Grussenmyer, et al. Proc. Natl. Acad. Sci. U.S.A. 82, 7952 (1985)) epitope tag, and a multiple cloning site containing several stop codons (See, Rubinfeld, et al. J. Biol. Chem, 270,5549-5555 (1995)).
The construct containing 091-21A31, Sequence ID No. 1 was generated by subcloning the Kpnl-Xbal 091-21A31, Sequence ID No. 1 fragment into pAcC13 at the Kpnl-Xbal site.
The construct containing 091-1F84, Sequence ID No. 3 was generated by subcloning the Ncol Xbal 091-1F84, Sequence ID No. 3 fragment into pAcOG1 at the Ncol-Xbal sites.
The construct containing 091-132Q20, Sequence ID No. 5 was generated by subcloning the Kpnl-Xbal fragment of 091-132Q20, Sequence ID No. 5 into pAcC13 at the Kpnl-Xbal site.
Baculovirus containing the appropriate BRCA1 Modulator was produced by transfecting the above described plasmids into SF9 cells, and isolating the corresponding baculovirus using essentially the methods described in Pharmingen's cat. no. 21100D, BaculoGoldtm/Baculovirus DNA. Virus was isolated from individual plaques, and used to infect Sf9 cells. The cells were grown for 4 days, isolated by centrifugation, and cell extracts made by solubilizing the cell pellet. Briefly, recombinant Sf9 cells were pelleted, lysed in 5 volumes of [20mM Tris (pH8.0), 1mM EDTA, 10g/ml each of leupeptin, pepstatin, pefabloc, 1mM aprotinin and ImM DTT] and incubated on ice for 10 minutes. NaCl was then added to a final concentration of 150mM, incubated at room temperature for 10 minutes and centrifuged. The resulting supernatant was loaded onto a 1-ml affinity column containing a mouse Glu-Glu monoclonal antibody covalently cross-linked to protein G-Sepharose. See, Grussenmyer, et al., Proc. Natl. Acad. Sci. U. S. A. vol. 82, pp. 7952-7954 (1985).
The column was washed with 10-15ml of lysis buffer and eluted with 100lg of Glu-Glu peptide (EYMPME) per ml in the same buffer. Fractions were collected and analyzed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), the peak fractions were pooled and based on purity subjected to further purification on HPLC columns which include Resource Q, Resource S and Resource Eth (Pharmacia). For WO 98/10066 PCT/US97/13944 purification of insoluble proteins, in particular 091-21A31, Sequence ID No. 1, recombinant Sf9 cells were pelleted, lysed in 5 volumes of [20mM Tris (pH8.0), 137mM NaC1, ImM EGTA, 1.5mM MgC 1 2%SDS, 10pg/ml each of leupeptin, pepstatin, pefabloc, 1mM aprotinin and 1mM DTT], incubated at room temperature for 20-30 minutes and ultra centifuged. The upper phase was removed, NaCl was adjusted to 400mM and recentrifuged. The clarified supernatent was then diluted 1:10 in 1 X TG buffer [20mM Tris (pH8.0), 137mM NaC1, ImM EGTA, 1.5mM MgCl 2 1% Triton X100, glycerol, 10g/ml each of leupeptin, pepstatin, pefabloc, ImM aprotinin and 1mM DTT], filtered through a 3uM Gelman Versapore filter and loaded onto a 1-ml anti-Glu- Glu affinity column. See, Rubinfeld, et al., Mol. Cell. Bio. 12, 4634-4642 (1992). The column was washed with 10-15ml of 1 X TG buffer with 400mM NaC and eluted in 1 X TG buffer with 1% SDS and 100g/ml Glu-Glu peptide. Fractions were analyzed by
SDS-PAGE.
Example Confirmation of BRCA1 Modulator Protein Binding to BRCA1 To confirm the results of the two-hybrid assays described in Example 1 and further establish the binding of each of the BRCA1 Modulators to BRCA1, two BRCA1 constructs were generated and tested for BRCA1 Modulator binding. The BRCA1 constructs were Glu-Glu tagged BRCA1 5' (1-1293), and BRCA1 3' (1293-1863). The Glu- Glu epitope tag facilitated immunoaffinity purification as described in the above examples. A control construct consisted of rapGAP. This construct was made as described by Rubinfeld, B. and Polakis "Purification of Baculovirus-Produced Rapl GTPase-activating Proteins". In: Methods and Enzymology, W.E. Balch, Channing J.
Der and Alan Hall, Eds., California: Academic Press, Inc., 255, 31-38. The BRCA1 constructs were generated as follows:. pAcO BRCA1 5' (1-1293) was generated by subcloning the NcoI-Nhel BRCA1 fragment into pAcO G1S NcoI-Nhel sites. pAcO BRCA1 3' (1293-1863) was generated by subcloning the Nhel blunted-Not1 BRCA1 fragment into pAcO G2 StuI-Notl sites. Using standard methods, the constructs were transfected into Sf9 cells. The BRCA1 constructs were purified using the immunoaffinity purification methods essentially as described in the preceding Examples.
WO 98/10066 PCT/US97/13944 For in vitro transcription/translation of the BRCA1 Modulators, the following constructs were subcloned into PCANmyc, a derivative of pCDNA3 (Invitrogen) in which the polylinker was replaced with a synthetic linker engineered to encode an initiating methionine, the Myc (See, Evan, et al. Mol. Cell. Biol. 5, 3610 (1985)) epitope tag, and a multiple cloning site (See, Rubinfeld, et al. Science, 272, 1023- 1026(1996)).
The plasmid containing the BRCA1 Modulator 091-1F84, Sequence ID No. 3, PCAN myc 091-1F84, Sequence ID No. 3, was generated by subcloning the Spel blunted Xhol 091-1F84, Sequence ID No. 3 fragment into PCAN myc3 EcoRV-Xhol sites. The plasmid containing the BRCA1 Modulator 091-21A31, Sequence ID No. 1, PCAN myc 091-21A31, Sequence ID No. 1, was generated by subcloning the BamH1- Xhol 091-21A31, Sequence ID No. 1 fragment into PCAN myc3 BamH1-Xhol sites.
Lastly, the plasmid containing the BRCA1 Modulator 091-132Q20, Sequence ID No. PCAN myc 091-132Q20, Sequence ID No. 5, was generated by subcloning-the EcoR1- Xhol 091-132Q20, Sequence ID No. 5 fragment into PCAN myc3 EcoR1-Xhol sites.
For in vitro binding analysis, the BRCA1 Modulator cDNAs (091-1F84, Sequence ID No. 3, 091-21A31, Sequence ID No. 1, 091-132Q20, Sequence ID No. 5) were transcribed and translated in vitro in the presence of 35 S]Methionine using the TNTcoupled wheat germ cell lysate system (Promega). Next, one-two ug of purified recombinant BRCA1 protein, either Glu-Glu tagged BRCA1 5' (1-1293), or BRCA1 3' (1293-1863) was added to 25pl of precleared lysate along with 10lO of anti-Glu Glu coupled protein G-Sepharose beads. Following a 2 hour incubation with rocking at 4 0
C,
the beads were washed three times with 1 ml each of ice cold buffer B (20mM tris pH 150mM NaC1, 0.5% Nonidet P-40), eluted with 2041 of SDS-PAGE sample buffer and subjected to SDS-PAGE and fluorography.
SDS-PAGE fluorography revealed that all three of the BRCA1 Modulators were affinity precipitated with the construct BRCA1 5' (1-1293) but not BRCA1 3' (1293-1863).
The rapGAP control also did not affinity precipitate any of the three BRCA1 Modulators. Taken together these results confirm and extend the results of the two hybrid assay, and establishes that the BRCA1 Modulator proteins interact with BRCA1.
WO 98/10066 PCT/US97/13944 Example 6 Preparation of Antibody to BRCA1 Modulators For antibody production, immunoaffinity purification of BRCA1 Modulators from baculovirus infected Sf9 insect cells was performed with immobilized anti-Glu- Glu antibody specific for the Glu-Glu epitope tag expressed on the recombinant soluble proteins (See, Rubinfeld, et al., Mol. Cell. Bio. 12, 4634-4642 (1992)). Briefly, recombinant Sf9 cells were pelleted, lysed in 5 volumes of [20mM Tris (pH8.0), 1mM EDTA, 10tg/ml each of leupeptin, pepstatin, pefabloc, 1mM aprotinin and 1mM DTT] and incubated on ice for 10 minutes. NaC was then added to a final concentration of 150mM, incubated at room temperature for 10 minutes and centrifuged. Then resulting supernatant was loaded onto a 1-ml affinity column containing the Glu-Glu antibody covalently cross-linked to protein G-Sepharose. The column was washed with 10-15ml of lysis buffer and eluted with 100pg of Glu-Glu peptide (EYMPME) per ml in the same buffer. Fractions were collected and analyzed by sodium dodecyl sulfatepolyacrylamide gel electrophoresis (SDS-PAGE), the peak fractions were pooled and based on purity subjected to further purification on HPLC columns which include Resource Q, Resource S and Resource Eth (Pharmacia). For purification of insoluble proteins, in particular 21, recombinant Sf9 cells were pelleted, lysed in 5 volumes of Tris (pH8.0), 137mM NaC1, ImM EGTA, 1.5mM MgC1, 2%SDS, 10ug/ml each of leupeptin, pepstatin, pefabloc, 1mM aprotinin and 1mM DTT], incubated at room temperature for 20-30 minutes and ultra centifuged. The upper phase was removed, NaCl was adjusted to 400mM and recentrifuged. The clarified supernatent was then diluted 1:10 in 1 X TG buffer [20mM Tris (pH8.0), 137mM NaC1, ImM EGTA, MgCl,, 1% Triton X100, 10% glycerol, 10ug/ml each of leupeptin, pepstatin, pefabloc, 1mM aprotinin and 1mM DTT], filtered through a 3um Gelman Versapore filter and loaded onto a 1-ml anti-Glu-Glu affinity column. The column was washed with 10-15ml of 1 X TG buffer with 400mM NaC1 and eluted in 1 X TG buffer with 1% SDS and 100gg/ml Glu-Glu peptide. Fractions were analyzed by SDS-PAGE, pooled and used to immunize rabbits.
To produce antisera containing antibodies directed against the BRCA1 Modulators the latter are used to immunize rabbits as follows. For the BRCA1 Modulator 091-21A31, Sequence ID No. 1, the immunization protocol generally 42 WO 98/10066 PCT/US97/13944 consisted of two immunizations; the first was a subcutaneous injection of 0.500mg in CFA, followed by a second intramuscular injection of 0.250 mg about four weeks later in ICFA. The rabbits were bled, antisera collected and antibody purified as setforth below.
BRCA1 Modulator antibodies are affinity purified using BRCA1 Modulator immunogens which have been coupled to a support matrix. Briefly, the BRCA1 Modulator 091-21A31, Sequence ID No. 1 is coupled to CNBr activated Sepharose 6MB (Pharmacia) as follows. One ml of matrix was activated according to manufacturer's instructions (ie. resuspended in 1mM H Cl, washed for 15 min. in ImM H Cl on a sintered glass filter). One mg of 091-21A31, Sequence ID No. 1 was dialyzed against coupling buffer [0.1M NaHCO, pH 8.3, 0.5M NaC1] overnight at 4°C with two changes of buffer. The dialyzed protein was then incubated with the CNBr activated Sepharose 6MB and incubated with rocking overnight at 4°C. The excess ligand was washed away with coupling buffer and any remaining active groups were blocked with 1M ethanolamine at room temperature for two hours. This material was then washed with three cycles of alternating pH each cycle consists of a wash with 0.1M acetate buffer, pH 4.0, 0.5M NaCl followed by a wash with 0.1M Tris, pH 8.0, 0.5M NaCI. The protein coupled gel matrix was then resuspended in PBS and incubated with 5ml of antibody serum with rocking overnight at 4°C. The mixture was poured into a column, allowed to drip through and washed 3 times with 15ml PBS per wash. Seven elutions with 800 1 of 0.2M glycine, pH 2.5, were collected and each elution was neutralized immediately with 200pl 1M K 2
HPO
4 Peak fractions were combined and dialyzed into PBS Azide for storage.
American Type Culture Collection Deposits The cDNA clones that encode 091-1F84, Sequence ID No. 3, 091-21A31, Sequence ID No. 1, and 091-132Q20, Sequence ID No. 5 were deposited with the American Type Culture Collection (ATCC) on August 14, 1996 under accession numbers 98141 (091- 1F84, Sequence ID No. 98142 (091-21A31, Sequence ID No. and 98143 (091- 132Q20, Sequence ID No. The deposits were made under the Budapest Treaty and shall be maintained at least 30 years after the date of depost and 5 years after the date of the most recent request for the deposit.
WO 98/10066 PCT/US97/13944 The invention now being fully described, it will be apparent to one of ordinary skill in the art that many changes and modifications can be made thereto without departing from the spirit or scope of the appended claims.
WO 98/10066 PCTIUS97/13944 SEQUENCE LISTING GENERAL INFORMATION: APPLICANT: Rubinfeld, Bonnee Polakis, Paul G.
Ligenfelter, Carol Vuong, Terilyn T.
(ii) TITLE OF INVENTION: MODULATORS OF BRCA1 ACTIVITY (iii) NUMBER OF SEQUENCES: 6 (iv) CORRESPONDENCE ADDRESS: ADDRESSEE: ONYX Pharmaceuticals, Inc.
STREET: 3031 Research Drive CITY: Richmond STATE: CA COUNTRY: USA ZIP: 94806 COMPUTER READABLE FORM: MEDIUM TYPE: Floppy disk COMPUTER: IBM PC compatible OPERATING SYSTEM: PC-DOS/MS-DOS SOFTWARE: PatentIn Release Version #1.30 (vi) CURRENT APPLICATION DATA: APPLICATION NUMBER: US Unknown FILING DATE: CLASSIFICATION: Utility (viii) ATTORNEY/AGENT
INFORMATION:
NAME: Giotta, Gregory REGISTRATION NUMBER: 32,028 REFERENCE/DOCKET NUMBER: ONYX1024 GG (ix) TELECOMMUNICATION INFORMATION: TELEPHONE: (510) 262-8710 TELEFAX: (510) 222-9758 INFORMATION FOR SEQ ID NO:1: SEQUENCE CHARACTERISTICS: LENGTH: 2065 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE: NO (ix) FEATURE: NAME/KEY: CDS LOCATION: 103..1512 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: WO 98/10066 PCT/US97/13944 GTGGATCCCC CGGGCTGCAG GAATTCGGCA CGAGCGGCAC AGCAGTTTCT TTGGCTGCCT GGGCCCCTTG AGTCCAGCCA GCT CTG TGC ACT Ala Leu Cys Thr GCC GCC ATC CAC Ala Ala Ile His TGG TTT GAG ACA Trp Phe Giu Thr GTT GGC AAA AGA Val Gly Lys Arg ATC TGC Ile Cys 10 TGC GGC Cys Gly TCC GAC TTC TTC Ser Asp Phe Phe CAC ACC TTC CAC His Thr Phe His AGT CGG ACC TGC Ser Arg Thr Cys GAGTACGAAG CCCGACCTGT TC ATG CCT ATC CGT Met Pro Ile Arg 1 CAC TCC CGC GAC GTG His Ser Arg Asp Vai CAG TGC CTA ATT CAG Gin Cys Leu Ile Gin 114
CCA
Pro CCA CAG Pro Gin ACC ATT ATC Thr Ile Ile
GAG
Glu
AAT
Asn 60
GCA
Ala CTC TTC TTT Leu Phe Phe TTC TTA AAG Phe Leu Lvs GAG GAG Glu Glu 70 AAT GTC Asn Val AAT GTC TTG Asn Val Leu
CAT
Asp 75
TCC
Ser
CAA
Glu
CAG
Gin AGA CCC CAG CTT Arg Ala Gin Leu 90 ATC ATC GAC ACT Ile Ile Asp Thr CAG AAA GAC Gin Lys Asp
GAG
Glu
GAA
Glu TGC CGA ATC CAC Cys Arg Ile Gin GAT CTT GCC CAG Asp Leu Ala Gin AAT GAA CTG GAC Asn Giu Leu Asp AAA CGA GAC AGC Lys Arg Asp Ser 100 GAA CGC AAT CCT Glu Arg Asn Ala 115 402
GTC
Val ACT GTG GTA TCT Thr Val Val Ser 120 105
CTG
Leu CTG CGG GAT ACG CTG Leu Arg Asp Thr Leu 110 CAG GCC TTC GGC AAG Gin Aia Leu Gly Lys
CAG
Gin TCC ACA CTC Ser Thr Leu 135 ACC AAA CAA Thr Lys Gin 150
AAA
Lys
GCA
Ala AAG CAG ATC AAG Lys Gin Met Lys 140 CAA GAG GAG GCC Gin Giu Giu Ala 125
TAC
Tyr GCC GAG ATG CTG TGC Ala Giu Met Leu Cys 130 CAC CAG CAG CAT GAG Gin Gin Gin Asp Glu TTA GAG Leu Glu
ACC
Thr 165
GAG
Glu ATG GAG CAG ATT GAG Met Giu Gin Ile Glu 170 GAG ATG ATC CGA GAC Glu Met Ile Arg Asp 185 GCT CTG TAC TGT GTG Ala Val Tyr Cys Val 200 155
CTT
Leu
ATG
Met CGC CGG CTC AGG Arg Arg Leu Arg 160 145
AGC
Ser AAG ATG AAC Lys Met Lys CTA CTC CAG AGC Leu Leu Gin Ser 175 GGT GTC GGA CAC Gly Val Gly Gin
CAG
Gin
TCA
Ser CGC CCT GAG GTG Arg Pro Giu Val 180 GCG GTG GAA CAG Ala Val Giu Gin
CTC
Leu TCT CTC AAG Ser Leu Lys 205 190
AAA
Lys GAG TAC GAG Glu Tyr Glu 195 CTA AAA Leu Lys
AAT
Asn 210 WO 98/10066 WO 9810066PCTIUS97/1 3944 GAG GCA COG AAG Glu Ala Arg Lys 215 TTG TTT TCC TCC Leu Phe Ser Ser GCC TCA GGG GAG GTG GCT GAC AAG CTG AGG AAG GAT Ala Ser Gly Glu 220
TTG
Leu Val Ala Asp Lys Leu 225
TCT
S er Arg Lys Asp GAA TTG OAT Glu Leu Asp AGA AOC Arg Ser 230
CAG
Gin 245 10 GAC Asp 0CC Al a AAG TTA GAA CTG Lys Leu Olu Leu 250
AAG
Lys 235
AAG
Lys
CTO
Leu CAG ACA OTC Gin Thr Val
TAC
Tyr 240
GAC
Asp 834 882 TCA 0CC CAG AAG Ser Ala Gin Lys 255 AAA AAO AAG CTA Lys Lys Lys Leu TTA CAG AOT Leu Gin Ser
OCT
Ala 260 AAG OAA ATC ATO Lys Olu Ile Met 265 TTO AAC CTG CCA Leu Asn Leu Pro
AOC
Ser ACO ATO CTG Thr Met Leu CAG GAA Gin Glu 275
ACC
Thr CCA OTO 0CC Pro Val Ala
AGT
Ser 285
OTO
Val1 ACT GTC GAC COC CTG OTT Thr Val Asp Arg Leu Val 290 CTG AAO CTC COC COG OCA Leu Lys Leu Arg Arg Pro TTA GAG AGC CCA Leu Olu Ser Pro 295 TCC TTC COT OAT Ser Phe Arg Asp 0CC CCT OTO Ala Pro Val OAT ATT OAT Asp Ile Asp 315 CCC TCC AOC Pro Ser Ser
GAG
Oiu 300
CTC
Leu AAT OCT ACC Asn Ala Thr
AAT
Asn 305
OAT
Asp OTO OAT ACT Val Asp Thr 310
CCC
Pro 325
TOO
Cys
CCA
Pro 0CC COO Ala Arg TCC CAG CAT Ser Gin His CTA GAG AAO TCA Leu Giu Lys Ser 345 AAA GOC CCC AGO Lys Gly Pro Arg 330
CAC
His
GT
Gly 335
OAT
Asp TAC OAA AAA CTT Tyr Giu Lys Leu 340 CCC AAO AAG ATA Pro Lys Lys Ile 1026 1074 1122 1170 1218 TCC CCA ATT Ser Pro Ile
OTC
Val
TOC
Cys AAO GAG TCC CAG Lys Giu Ser Gin 365 TCA CTG GOT Ser Leu Gly
GOC
Gly 370 355 CAG AOC Gin Ser 360 TOT OCA OGA Cys Ala Gly 375 GAO CCA OAT GAG Glu Pro Asp Oiu GTC COO Val Arg 390 GAG TCC Oiu Ser
AAT
Asn 0CC ATC CTA Ala Ile Leu
GOC
Gly 395
OAT
Asp
GAA
Glu 380
CAG
Gin
OTO
Val1 CTO OTT GOT 0CC Leu Val Gly Ala AAA CAG CCC AAO Lys Gln Pro Lys 400 OTA AGO ACA GOC Val Arg Thr Oly TTC CCT ATT TTT Phe Pro Ile Phe 385 AGO CCC AGO TCA Arg Pro Arg Ser 1266 1314 1362 1410 405
GOT
Gly TCT TOC AGC AAA Ser Cys Ser Lys 410 COG ACA AAA TTC Arg Thr Lys Phe TTC OAT 000 Phe Asp Gly
CTC
Leu 420
COC
Arg
GOC
Gly ATC CAG CCT Ile Gin Pro ACA GTC ATO Thr Val Met 425 WO 98/10066 PCTIUS97/13944 CCA TTG CCT GTT AAG CCC AAG ACC AAG GTT AAG CAG AGG GTG AGG GTG Pro Leu Pro Val Lys Pro Lys Thr Lys Val Lys Gin Arg Val Arg Val 440 445 450 AAG ACA GTG CCT TCT CTC TTC CAG GCC AAG CTG GAC ACC TTC CTG TGG Lys Thr Val Pro Ser Leu Phe Gin Ala Lys Leu Asp Thr Phe Leu Trp 455 460 465 TCG TGA GAACAGTGAG TCTGACCAAT GGCCAGACAC ATGCCTGCAA~
CTTGTAGGTC
Ser 1458 1506 1562 470
AAGGACTGTC
GTAAGGGCAG
CACCCTGCCC
GGTCCTGCTC
TCTGGGCCTG
ATCTCAGGCA
GGGCCAAGCA
TCATGTAAAA
AA~AAAAAAA
CAGGCAGGGG
ACAAACAGGT
CACTCCTACG
CTGTTGCCAG
GAGACCACGG
GCCTCAGCCC
GGGTGGGGAA
TAAAATTAAA
AAAAAAACTC
TTTTGTGGAC
GAGGGTGAGT
ACTGGGAGCT
GCTCCTGTTT
TCACTTGTTG
AAGCTTCTAC
TGGAGGATAG
GAG
AGAGCCCCAC
GTGACACCCA
GACATGACCA
ATAGCCATGA
ACTGTCTCTG
CTGCCTTTGA
CATOGGATGT
CAAAAAAAAA
TTTCGGGACC
GAGACTGCTC
GCCCACTGAT
TCAGATGTGG
TGGACCAGAG
CTTGCTTCTA
ATGGAGAGGA
AAAAZAAAA
AGCCTGAGGT
TTCCTGCCCT
CCTGTCAGCA
TCAGACTCTT
TGCTTGAGGC
GGCATAGCCT
TGGAAGATTT
1622 1682 1742 1802 1862 1922 1982 2042 2065 INFORMATION FOR SEQ ID NO:2: SEQUENCE CHARACTERISTICS: LENGTH: 470 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE (xi) SEQUENCE TYPE: protein DESCRIPTION: SEQ ID NO:2: Met 1 Ser Pro Ile Arg Ala Leu Cys Thr Ile 5 Ser Asp Phe Phe Asp His Arg Asp Val Leu Ile Gin Ala Ala Ile His Trp Phe Glu Thr Cys 25 Ala His Thr Phe Cys Pro Ser Arg 40 Arg Thr Lys His Leu Gin Cys Pro Gin Leu Phe Phe Cys Arg 50 Asp Leu Ile Gin Val Gly Lys Glu Thr Ile Ile Ala Gin Glu Asn Val Leu Asn Giu Phe Leu Lys Glu Leu Asp Lys Arg Asp Glu Arg Asn Ser 100 Ala Asn Gin Thr Val Arg Ala Gin Val Ile Ile Asp 105 Val Val Ser Leu 120 Leu 90 Thr Gin Lys Asp Lys Giu Leu Arg Asp Thr Leu Glu 110 Giy Lys Ala Gin Gin Ala Leu 125 WO 98/10066 PCT[US97/13944 Glu Met Leu Cys Ser Thr Leu Lys Lys Gin Met Lys Tyr Leu Glu Gin 130 135 140 Gin Gin Asp Glu Thr Lys Gin Ala Gin Glu Glu Ala Arg Arg Leu Arg 145 150 155 160 Ser Lys Met Lys Thr Met Glu Gin Ile Glu Leu Leu Leu Gin Ser Gin 165 170 175 Arg Pro Glu Val Glu Glu Met Ile Arg Asp Met Gly Val Gly Gin Ser 180 185 190 Ala Val Glu Gin Leu Ala Val Tyr Cys Val Ser Leu Lys Lys Glu Tyr 195 200 205 Glu Asn Leu Lys Glu Ala Arg Lys Ala Ser Gly Glu Val Ala Asp Lys 210 215 220 Leu Arg Lys Asp Leu Phe Ser Ser Arg Ser Lys Leu Gin Thr Val Tyr 225 230 235 240 Ser Glu Leu Asp Gin Ala Lys Leu Glu Leu Lys Ser Ala Gin Lys Asp 245 250 255 Leu Gin Ser Ala Asp Lys Glu Ile Met Ser Leu Lys Lys Lys Leu Thr 260 265 270 Met Leu Gin Glu Thr Leu Asn Leu Pro Pro Val Ala Ser Glu Thr Val 275 280 285 Asp Arg Leu Val Leu Glu Ser Pro Ala Pro Val Glu Val Asn Leu Lys 290 295 300 Leu Arg Arg Pro Ser Phe Arg Asp Asp Ile Asp Leu Asn Ala Thr Phe 305 310 315 320 Asp Val Asp Thr Pro Pro Ala Arg Pro Ser Ser Ser Gin His Gly Tyr 325 330 335 Tyr Glu Lys Leu Cys Leu Glu Lys Ser His Ser Pro Ile Gin Asp Val 340 345 350 Pro Lys Lys Ile Cys Lys Gly Pro Arg Lys Glu Ser Gin Leu Ser Leu 355 360 365 Gly Gly Gin Ser Cys Ala Gly Glu Pro Asp Glu Glu Leu Val Gly Ala 370 375 380 Phe Pro Ile Phe Val Arg Asn Ala Ile Leu Gly Gin Lys Gin Pro Lys 385 390 395 400 Arg Pro Arg Ser Glu Ser Ser Cys Ser Lys Asp Val Val Arg Thr Gly 405 410 415 Phe Asp Gly Leu Gly Gly Arg Thr Lys Phe Ile Gin Pro Thr Asp Thr 420 425 430 Val Met Ile Arg Pro Leu Pro Val Lys Pro Lys Thr Lys Val Lys Gin 435 440 445 Arg Val Arg Val Lys Thr Val Pro Ser Leu Phe Gin Ala Lys Leu Asp 450 455 460 Thr Phe Leu Trp Ser 465 470 WO 98/10066 PCT/US97/13944 INFORMATION FOR SEQ ID NO:3: SEQUENCE CHARACTERISTICS: LENGTH: 3256 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: NAME/KEY: CDS LOCATION: 34..2541 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: GAACTAGTGG ATCCCCCGGG CTGCAGGAAT TCG GCA CGA GAA AGC TTA TCC CTT Ala Arg Glu Ser Leu Ser Leu CCC TCG ATG Pro Ser Met CTT CGG GAT GCT Leu Arg Asp Ala
GCA
Ala 15
ACT
Thr ATT GGC ACT ACC Ile Gly Thr Thr CCT TCA GCA CCA Pro Ser Ala Pro TGC TCG Cys Ser ACA AAC Thr Asn
GTG
Val GGG ACT TGG Gly Thr Trp
TTT
Phe 30
GGC
Gly CCT TTC TCT ACT Pro Phe Ser Thr CAG GAA AAG AGT Gin Glu Lys Ser CAC AGT ACT TCT His Ser Thr Ser CTG ACT GCC TTG Leu Thr Ala Leu 102 150 198 ACA TCC CAG Thr Ser Gin 40
GAG
Glu ACA GAG CAG Thr Glu Gin
CTC
Leu
TTG
Leu
ACA
Thr 45
CTG
Leu
GAA
Glu CTG GTT GGC Leu Val Gly
AAG
Lys
GAT
Asp TGT GGC CGG Cys Gly Arg GAT AAC CTG Asp Asn Leu
CCT
Pro 65 TCT CGA CAT Ser Arg His GAG GTT CTC Glu Val Leu CCT CAC CCA Pro His Pro
GAC
Asp
TCC
Ser
GAA
Glu CGC CAG CTT CGG Arg Gin Leu Arg 95 ACC CAG GAC AGT Thr Gin Asp Ser 80
GAC
Asp
AGC
Ser CTG AGC TCT CTT GTC ATT CTG Leu Ser Ser Leu Val Ile Leu TGG AAG AGC CAG CTG GCT GTC Trp Lys Ser Gin Leu Ala Val 100 ACA CAG ACT GAC ACA TCT CAC Thr Gln Thr Asp Thr Ser His 105 AGT GGG Ser Gly ATA ACT AAT Ile Thr Asn 120
GGA
Gly
AAA
Lys 125
CAG
Gin 115 CAG CAT CTT AAG GAG Gin His Leu Lys Glu AGC CAT GAG Ser His Glu
ATG
Met 135
CTT
Leu CAG GCC CTA CAG Gln Ala Leu Gin 140 GCC AGA AAT GTC ATG CAA TCA TGG Ala Arg Asn Val Met Gin Ser Trp WO 98/10066 PCT/US97/13944 ATC TCT AAA GAG CTG ATA TCC TTG CTT CAC CTA TCC CTG TTG CAT TTA Ile Ser Lys Glu Leu Ile Ser Leu GAA GAA GAT Glu Glu Asp 170 TTG GTC TGT Leu Val Cys 155
AAG
Lys ACT ACT GTG Thr Thr Val
AGT
Ser 175
TTG
Leu Leu 160
CAG
Gin
CTG
Leu His Leu Ser Leu Leu His Leu 165 GAG TCT CGG CGT GCA GAA ACA Glu Ser Arg Arg Ala Glu Thr 180 AAG AAA TTG AGG GCA AAG CTC Lys Lys Leu Arg Ala Lys Leu 582
CAG
Gin 200
GCT
Ala 185
AGC
Ser TGC TGT TTT Cys Cys Phe AAA GCA GAA Lys Ala Glu
GAT
Asp 190
CTC
Leu CTC AGA GGC Leu Arg Gly
AAG
Lys 220
CAG
Gin 205
GAT
Asp
CGC
Arg AGG GAG GAG GCA Arg Glu Glu Ala GCG GCA GAG ATA Ala Ala Glu Ile 225 ATC AGC CAG CTG Ile Ser Gin Leu 195
CAC
His
TTG
Leu AGA GAG GAA ATG Arg Glu Glu Met 215 GAG GCT TTC TGT Glu Ala Phe Cys 230 GCA CAC GCC Ala His Ala 240 GAA CAG GAC CTA GCA TCC Glu Gin Asp Leu Ala Ser 245 GCC CAG ACC CAA CTG GTA Ala Gin Thr Gin Leu Val ATG CGG Met Arg GGG CTT Gly Leu 265 ACT TCT Thr Ser
GAA
Glu 250
CAT
His
ACC
Thr AGA GGC CTT CTG Arg Gly Leu Leu 255
AAG
Lys
CTG
Leu
GAT
Asp GCC AAG CAA GAA Ala Lys Gin Glu 270 TTG CAA CAA GAC Leu Gin Gin Asp
GAG
Glu 260 GTT CAG CAG ACA Val Gin Gin Thr GTG AGT CTT Val Ser Leu 275
CAA
Gin 280
ACA
Thr
ACA
Thr TGG ACA GCT Trp Thr Ala GTC AAG AGC Val Lys Ser
TTG
Leu 300 285
CTG
Leu TGG AGG TCC ATG Trp Arg Ser Met 290 CGG TCC CGA CAA Arg Ser Arg Gin CTG GAT TAT Leu Asp Tyr
ACA
Thr 295
AGT
Ser CTC ACA GAG Leu Thr Glu GAA AAG Glu Lys GAG GAG Glu Glu 345 CTA GCA Leu Ala 360 315
GAG
Glu
AAA
Lys CAG CAA GCC CTG Gin Gin Ala Leu GTT TCT AGG GTG Val Ser Arg Val 335 GGC CAA ACA GAA Gly Gin Thr Glu
CAG
Gin 320
CTG
Leu 305
GAA
Glu
GAA
Glu CGT GAT GTG GCA ATT GAG Arg Asp Val Ala Ile Glu 325 CAA GTC TCT GCC CAG TTA Gin Val Ser Ala Gin Leu AAA CTC Lys Leu 310 870 918 966 1014 1062 1110 1158
ACA
Thr 350 GAT CTC CGG GCT Asp Leu Arg Ala CAA CTG GAG TTG Gin Leu Glu Leu 355 TTG CAG ATT CTG Leu Gin Ile Leu AAC AGT CGT Asn Ser Arg
CAG
Gin GCC AAC ATG Ala Asn Met 365 WO 98/10066 PCT/US97/13944 AGC CAG CTA AAA Ser Gin Leu Lys CTG GCT ATG AAG Leu Ala Met Lys 395 GAG CAG GCT GCT Glu Gin Ala Ala GAG CTA CAG AGT CAG CAT ACC CAT TGT GCC CAG GAC Glu 380
GAT
Asp Leu Gin Ser Gin Thr His Cys Ala Gin Asp 390 GAG TTA TTC Glu Leu Phe
TGC
Cys 400
GAA
Glu CTT ACC CAG Leu Thr Gin AGC AAT GAG Ser Asn Glu 405 CAA TGG CAA Gin Trp Gin CAG GCA Gin Ala 425 GAC CTG Asp Leu 410
GAA
Glu
AAG
Lys 415
CAA
Gin GAG ATG GCA Glu Met Ala CTA AAA CAC ATG Leu Lys His Met 420 AAA GAG GTG CGG Lys Glu Val Arg CTG CAG CAG Leu Gin Gin
CAA
Gin 430
GAG
Glu GCT GTC CTG Ala Val Leu
GCC
Ala 435
GAG
Glu 440
CAC
His
GTG
Val AAA GAG ACC TTG Lys Glu Thr Leu 445 TTT GCA GAC Phe Ala Asp CTG GAG CTG GGT Leu Glu Leu Gly 460 CTC CGG GAG CGC Leu Arg Glu Arg CAG GTT GAG TGT Gin Val Glu Cys AGC TTG CAG TGT Ser Leu Gin Cys 480 AAA CTG GCC AGC Lys Leu Ala Ser
CAA
Gin 465
GAG
Glu
CAG
Gin 450
TTG
Leu
AAC
Asn AAT CAG GTT Asn Gin Val
GCT
Ala 455 AAA ACC ACA CTG GAA Lys Thr Thr Leu Glu 470 CTC AAG GAC ACT GTA Leu Lys Asp Thr Val 485 1206 1254 1302 1350 1398 1446 1494 1542 1590 1638 1686 1734 1782 1830 GAG AAC Glu Asn CAA GAT Gin Asp 505 ACT GAG Thr Glu
CTA
Leu 490
CTG
Leu
CAA
Gin
GCT
Ala ACC ATA GCA GAT AAC CAG GAG Thr Ile Ala Asp Asn Gin Glu 500 TCT CAA AAG CTA AGG CTG CTG Ser Gin Lys Leu Arg Leu Leu GAG AAA ACA CGG Glu Lys Thr Arg 510 CTA CAG AGC CTG Leu Gin Ser Leu
TAC
Tyr 515 520
GAG
Glu ACT CTC TTT CTA Thr Leu Phe Leu 530 CTT CTG CTG AGT Leu Leu Leu Ser
CAG
Gin ACA AAA CTA Thr Lys Leu
AAG
Lys 535 AAG ACT GAA Lys Thr Glu ACC CAG GAA Thr Gin Glu TTG ACA GCA Leu Thr Ala 570 CTT GGA AGT Leu Gly Ser 585
CAC
His 555
GTG
Val
CAA
Gin 540
CCT
Pro
GCA
Ala
ACC
Thr ACA GCC TGT CCT CCC Thr Ala Cys Pro Pro 550 TTC CTG GGA AGC ATC Phe Leu Gly Ser Ile CTG CCT AAT Leu Pro Asn GAT GAA GAG Asp Glu Glu
GAC
Asp 560
ACC
Thr 565
GTG
Val CCA GAA TCA ACT Pro Glu Ser Thr ACC CGA GTA GCA Thr Arg Val Ala 595
CCT
Pro 580
TCA
Ser CCC TTG Pro Leu GAC AAG AGT GCT Asp Lys Ser Ala 590 ATG GTT TCC Met Val Ser WO 98/10066 PCT/US97/13944 CTT CAG CCC Leu Gin Pro 600 AGT ATT ATG Ser Ile Met GCA GAG ACC CCA GGC ATG GAG GAG AGC CTG GCA GAA ATG Ala Glu Thr 605 ACT ACT GAG Thr Thr Glu Pro Gly Met Glu Ser Leu Ala Glu Met 615 CTT CAG AGT Leu Gin Ser 620
GCC
Ala
CTT
Leu 625
CAG
Gin TCC.CTG CTA Ser Leu Leu TCT AAA GAA Ser Lys Glu ATC AGG ACT Ile Arg Thr CAA GTT AGG CTG Gin Val Arg Leu 650 GCA AAA GAA GCA Ala Lys Glu Ala CAG GCC CAG Gin Ala Gin GAC ATA GAG Asp Ile Glu 670
GAA
Glu 655
AAG
Lys
CTG
Leu 640
GAA
Glu
CTG
Leu CGA AAA ATT TGT Arg Lys Ile Cys 645 CAA GAG Gin Glu 630 GAG CTG Glu Leu CAG AAG Gin Lys CAG CAT CAG GAA Gin His Gin Glu 660 AAC CAG GCC TTG Asn Gin Ala Leu
GTC
Val TGC TTG CGC Cys Leu Arg 665 TAC AAG Tyr Lys AAT GAA Asn Glu AAG GAG Lys Glu 680
AAG
Lys ATC CTA GAA CAG Ile Leu Glu Gin 700 GAG GTG ACC CAC Glu Val Thr His 685
ATA
Ile
CTT
Leu CTC CAG GAA GTG Leu Gin Glu Val GAC AAG AGT GGC Asp Lys Ser Gly 705 ACC CGC TCA CTT Thr Arg Ser Leu
ATA
Ile 690 675
CAG
Gin CAG CAG AAT Gin Gin Asn
GAG
Glu GAG CTC ATA AGC CTT AGA Glu Leu Ile Ser Leu Arg 710 CGG CGT GCG GAG ACA GAG Arg Arg Ala Glu Thr Glu 725 CAG CTG GAC TCC AAC TGC Gin Leu Asp Ser Asn Cvs 1878 1926 1974 2022 2070 2118 2166 2214 2262 2310 2358 2406 2454 2502 ACC AAA GTG Thr Lys Val 730 CAG CCT ATG Gin Pro Met 745 715
CTC
Leu
CAG
Gin 720 GAG GCC CTG GCA Glu Ala Leu Ala
GGC
Gly GCC ACC AAT Ala Thr Asn 735
ATC
Ile
CAG
Gin GAG AAA GTG Glu Lys Val
GAG
Glu 760
GAA
Glu GTG GAC AAA CTG AGA Val Asp Lys Leu Arg 765 AAA CTC ATG ATC AAG Lys Leu Met Ile Lys 780 CTT CGG CGC TCT GAC Leu Arg Arg Ser Asp ATG TTC CTG Met Phe Leu
GAG
Glu 770
AGA
Arg 755
ATG
Met
AAT
Asn 740 TGG CTC TCT CAG Trp Leu Ser Gin AAA AAT GAG AAG Lys Asn Glu Lys 775 ATC CTA GAG GAG Ile Leu Glu Glu TTC CAG AGC Phe Gin Ser AAG GAG TTA Lys Glu Leu
CAT
His 785
GAA
Glu
AAC
Asn
AAA
Lys 790 CTA GAT GAC ATT GTT Leu Asp Asp Ile Val CAG CAT ATT Gin His Ile 810 800 AAG ACC CTG CTC TCT Lys Thr Leu Leu Ser 815 ATT CCA GAG Ile Pro Glu
GTG
Val 820 805 GTG AGG GGA Val Arg Gly WO 98/10066 WO 9810066PCTIUS97/13944 TGC AGA GAA CTA CAG GGA TTG CTG GAA TTT CTG AGC TAA GAAACTGAAA Cys Arg Glu Leu Gin Gly Leu Leu Glu Phe Leu Ser 2551
GCCAGAATCT
ACTGGCATAG
GTGCCGAATT
CCTGAAAAGT
ATAGCCTTTG
AGGAALAGCAG
CGTGTGATCA
AGACAGA.AGG
AAGCTCCGTG
TTCTGGGGCC
GTGGGTGTGT
ACCAAAAAAA.
GCTTCACCTC
AGCCAACTGA
CGGCACGAGC
TCCAGCATAT
CCATCACTGC
ACATTGACCT
CCATTATGCA
ATGTAAAGGA
AAGACCTGGA
TTCGTGTCCG
CCAAGAAGAA
TTTTTACCTG
GATAAATGCT
GGCACGAGCG
TTTGCGAGTA
CATTAAGGGT
CACCAAGAGG
GAATCCACGC
TGGAAAATAC
GCGACTGAAG
AGGCCAGCAC
ATAA.GTCTGT
ACTCGAGCAT
CAATACCCCC
ATTTAAATAA
GCACGAGCTG
CTCAACACCA
GTGGGCCGAA
GCGGGAGAAC
CAGTACAAGA
AGCCAGGTCC
AAGATTCGGG
ACCAAGACCA
AGGCCTTGTC
GCATCTAGAG
TTACCCCAAT
AGTGTATTTA
CAGCCATGTC
ACATCGATGG
GATATGCTCA
TCACTGAGGA
TCCCAGACTG
TAGCCAATGG
CCCATAGAGG
CTGGCCGCCG
TGTTAATAAA
GGCCC
ACCAAGACCA
ATGAAAACTC
TCTAGTGATC
GCGGCGGAAA
TGTGGTGTTG
TGAGGTGGA\
GTTCTTGAAC
TCTGGACAAC
GCTGCGTCAC
TGGCCGCACC
TAGTTTATAT
2611 2671 2731 2791 2851 2911 2971 3031 3091 3151 3211 3256 INFORMATION FOR SEQ ID NO:4: SEQUENCE CHARACTERISTICS: LENGTH: 836 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: Ala 1 Gly Arg Glu Ser Thr Thr Pro Ala Pro Gin Leu 5 Phe Ser Leu Pro Ser Leu Arg Asp Ala Ala Ile Ser Thr Cys Ser 25 Asn Gly Thr Trp Phe Thr Pro Gly Leu Val Ser Glu Lys Ser Thr Ser Gin Gly Thr Lys His Ser Thr Ser 55 Leu Thr Giu Gin Leu Leu Cys Gly Arg Pro Leu Pro Asp Leu Thr Ala 70 Ile Ser Arg His Asp Ser Glu Asp Asn Ser Ser Leu Leu Glu Val Arg Gin Leu Arg Asp Ser Ser Trp Lys Ser Thr Gin Thr 115 Leu Lys Giu Gin 100 Asp Ala Val Pro His 105 Gly Giu Thr Gin Asp Thr Ser His Ile Thr Ser His Glu 130 Val Met Gin Ser Trp Val Met 135 Leu Gin Ala Leu Asn Lys 125 Gin Gin 140 Leu Ile 110 Leu Gin His Ala Arg Asn Ser Leu Leu 160 Ile Ser Lys Giu 155 150 WO 98/10066 WO 9810066PCTIUS97/13944 His Leu Ser Leu Leu His Leu Giu Giu Asp Lys Thr Thr Val Ser Gin 170 175 Giu Lys Ala Ile 225 Leu Asp Vai Ser Arg 305 Giu Giu Leu Gin His 385 Gin Giu Val Asp Gin 465 Glu Ser Lys Arg 210 Val1 Giu Ala Gin Met 290 Gin Arg Gin Giu Ile 370 Thr Leu Met Leu Gin 450 Leu Asn Arg Leu 195 His Leu Gin Gin Gin 275 Gin Leu Asp Val1 Leu 355 Leu His Thr Ala Ala 435 Giu Lys Leu Arg 180 Arg Arg Giu Asp Thr 260 Thr Leu Thr Val1 Ser 340 Giu Ala Cys Gin Leu 420 Lys Asn Thr Lys Ala Ala Glu Ala Leu 245 Gin Val1 Asp Glu Ala 325 Ala Asn Asn Ala Ser 405 Lys Giu Gin Thr Asp 485 Giu Lys Glu Phe 230 Ala Leu Ser Tyr Lys 310 Ile Gin Ser Met Gin 390 Asn His Val1 Val Leu 470 Thr Thr Leu Met 215 Cys Ser Val1 Leu Thr 295 Leu Giu Leu Arg Asp 375 Asp Giu Met Arg Ala 455 Giu Val1 Leu Gin 200 Ala Ala Met Gly Thr 280 Thr Thr Glu Giu Leu 360 Ser Leu Glu Gin Asp 440 His Val1 Glu Val 185 Ser Leu His Arg Leu 265 S er Trp Val Lys Giu 345 Ala Gin Ala Gin Ala 425 Leu Leu Leu Asn Cys Leu Arg Ala Giu 250 His Thr Thr Lys Gin 330 Cys Thr Leu Met Ala 410 Glu Lys Giu Arg Leu 490 Cys Lys Gly Ser 235 Phe Ala Leu Ala Ser 315 Giu Lys Asp Lys Lys 395 Ala Leu Glu Leu Giu 475 Thr Glu Cys Ala Lys 220 Gin Arg Lys Gin Leu 300 Gin Val Gly Leu Glu 380 Asp Gin Gin Thr Gly 460 Arg Ala Lys Phe Glu 205 Asp Arg Gly Gin Gin 285 Leu Gin Ser Gin Arg 365 Leu Giu Trp Gin Leu 445 Gin Ser Lys Thr Asp 190 Arg Ala Ile Leu Giu 270 Asp Ser Al a Arg Thr 350 Al a Gin Leu Gin Gin 430 Giu Val1 Leu Leu Arg L eu Glu Ala Ser Leu 255 Glu Trp Arg Leu Val 335 Glu Gin Ser Phe Lys 415 Gin Phe Giu Gin Ala 495 Gin Leu Giu Giu Gin 240 Lys Leu Arg Ser Gin 320 Leu Gin Leu Gin Cys 400 Giu Ala Ala Cys Cys 480 Ser Tyr Thr Ile Ala Asp Asn Gin Glu Gin Asp Leu 500 505 WO 98/10066 Ser Gin Lys 515 Phe Leu Gin 530 Leu Ser Thr 545 Arg Thr Phe Glu Ser Thr Arg Val Ala 595 Giu Glu Ser 610 Leu Cys Ser 625 Gin Arg Lys Gin His Gin Asn Gin Ala 675 Val Ile Gin 690 Giy Glu Leu 705 Leu Arg Arg Giy Gin Leu Giu Lys Vai 755 Leu Giu Met 770 His Arg Asn 785 Glu Lys Leu Ile Pro Giu Phe Leu Ser 835 PCT/US97/13944 Leu Leu Arg Leu Leu Thr Glu Gin Leu Gin Ser Leu Thr 525 Thr Ala Leu Pro 580 Ser L eu Leu Ile Giu 660 Leu Gin Ile Ala Asp 740 Trp Lys Ile Asp Val1 Lys Cys Gly 565 Val1 Met Al a Leu Cys 645 Val1 Cys Gin Ser Glu 725 Ser Leu Asn Leu Asp 805 Val1 Leu Pro 550 Ser Pro Val1 Giu Gin 630 Giu Gin Leu Asn Leu 710 Thr Asn S er Giu Giu 790 Ile Arg Lys 535 Pro Ile Leu Ser Met 615 Giu Leu Lys Arg Giu 695 Arg Giu Cys Gin Lys 775 Giu Val1 Gly Glu Thr Leu Leu Leu 600 Ser Ser Gin Ala Tyr 680 Lys Glu Thr Gin Glu 760 Giu Asn Gin Cys Lys Gin Thr Gly 585 Gin Ile Lys Vali Lys 665 Lys Ile Giu Lys Pro 745 Val1 Lys Leu His Arg Thr Giu Aia 570 Ser Pro Met Giu Arg 650 Glu Asn Leu Val Val1 730 Met Asp Leu Arg Ile 810 Glu Glu His 555 Val1 Asp Ala Thr Glu 635 Leu Aia Glu Glu Thr 715 Leu Al a Lys Met Arg 795 Tyr Leu Gin 540 Pro Ala Lys Glu Thr 620 Ala Gin Asp Lys Gin 700 His Gin Thr Leu Ile 780 Ser Lys Gin Giu Leu Asp Ser Thr 605 Giu Ile Aia Ile Giu 685 Ile Leu Glu Asn Arg 765 Lys Asp Thr Gly Thr Pro Glu Ala 590 Pro Leu Arg Gin Giu 670 Leu Asp Thr Ala Trp 750 Val1 Phe Lys Leu Leu Leu Asn Giu 575 Phe Gly Gin Thr Giu 655 Lys Gin Lys Arg Leu 735 Ile Met Gin Giu Leu 815 Leu Leu Asp 560 Pro Thr Met Ser Leu 640 Glu Leu Giu Ser Ser 720 Ala Gin Phe Ser Leu 800 Ser Glu WO 98/10066 PCT/US97/13944 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 1191 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: NAME/KEY: CDS LOCATION: 34..1191 (xi) SEQUENCE DESCRIPTION: SEQ ID GAACTAGTGG ATCCCCCGGG CTGCAGGAAT TCG GCA CGA GGC GGC GCC GAA GAG Ala Arg Gly Gly Ala Glu Glu GCG ACT GAG Ala Thr Glu TTT GAA ATT Phe Glu Ile GCC GGA CGG GGC Ala Gly Arg Gly GGC ACA ATG GAA Gly Thr Met Glu 30 ATG TTG TGT AAC Met Leu Cys Asn CGG CGA Arg Arg CGC AGC CCG Arg Ser Pro CGG CAG AAG Arg Gln Lys CTA GGG GTG Leu Gly Val AAA GCA Lys Ala GCT GGA ATT TGT Ala Gly Ile Cys CAA TCA AAT GAT Gln Ser Asn Asp
GGG
Gly
GAT
Asp
TCT
Ser
CAA
Gln 45 ATT CTT CAA CAT Ile Leu Gln His TCA TTG GAA GAG Ser Leu Glu Glu GGC TCA AAT TGT Gly Ser Asn Cys GAA GGC AGT GAC Glu Gly Ser Asp
GGT
Gly
TTT
Phe GGC ACA AGT AAC Gly Thr Ser Asn 65 ATA ACA GAG AAC Ile Thr Glu Asn
CAT
His
GAT
Asp GCA TAC TGC Ala Tyr Cys CGA ACA GAT Arg Thr Asp CAA GAA TCA AGA Gln Glu Ser Arg 95 CCT GAT GGT CAG Pro Asp Gly Gln
GAG
Glu AGG AAT TTG GTG Arg Asn Leu Val ATC CCT GGG GGA Ile Pro Gly Gly AGC CCA Ser Pro GAA GCT Glu Ala
GAA
Glu
CCC
Pro 105
AAA
Lys 120
AAC
Asn 105 GAA AAA ACT TTA GGA Glu Lys Thr Leu Gly 125 ACC CTT TCA ACC CCA Thr Leu Ser Thr Pro 110
AAA
Lys CAA GAT TCA Gln Asp Ser GTT TTA TTA Val Leu Leu
GAG
Glu 115 100
TGC
Cys AAC AGG AAC Asn Arg Asn
GAA
Glu CTG ATG CAA GCC CTA Leu Met Gln Ala Leu 135 GCT CTC TGT AAG AAA Ala Leu Cys Lys Lys GAG GAG AAG CTG Glu Glu Lys Leu WO 98/10066 PCT/US97/13944 TAT GCT GAT Tyr Ala Asp ATC CTG CAG Ile Leu Gin 170
CTT
Leu 155
AAG
Lys CTG GAG GAG AGC Leu Glu Glu Ser AGT GTT CAG AAG Ser Val Gin Lys
CAA
Gin 165
GTT
Val ATG AAG Met Lys CAC TTG His Leu AAG CAA GCC Lys Gin Ala
CAG
Gin 175
ATC
Ile GTG AAA GAG Val Lys Glu
AAA
Lys 180
AAG
Lys CAG AGT Gin Ser
CTT
Leu 200
ATG
Met GAA CAT AGC AAG Glu His Ser Lys AGA GAA CTT CAG Arg Glu Leu Gin 205 CAG GCA CGA GAG Gin Ala Arg Glu
GCT
Ala 190
CGT
Arg TTG GCA AGA Leu Ala Arg CAC AAT AAG His Asn Lys
ACG
Thr 210
AGC
Ser 195
TTA
Leu
ATA
Ile CTA GAA TCT Leu Glu Ser
CAG
Gin 220
ACC
Thr GAA GAA GAA CGA CGT Glu Glu Glu Arg Arg 225 AAT GAA ATT CAA GCC Asn Glu Ile Gin Ala AAG GAG GAA AAT Lys Glu Glu Asn 215 GAA GCA ACT GCA Glu Ala Thr Ala 230 726 CAT TTC CAG His Phe Gin GAC ATC CAC Asp Ile His 250 AAG CTA AAG Lys Leu Lys
ATT
Ile 235
AAC
Asn
TTA
Leu GCC AAA CTC CGA Ala Lys Leu Arg 255 240
CAG
Gin CAG CTG GAG CAG CAT Gin Leu Glu Gin His 245 ATT GAG CTG GGG GAG Ile Glu Leu Gly Glu GAA AAC Glu Asn 822
GAT
Asp 280
AAA
Lys 265
AAG
Lys
CTG
Leu AAG CTC ATC GAA Lys Leu Ile Glu 270 TTC AAA CAT AAG Phe Lys His Lys
CAG
Gin
GAA
Glu TAC GCA CTG AGG Tyr Ala Leu Arg 275 CTG CAA CAG CAG Leu Gin Gin Gin 260
GAA
Glu GAG CAC ATT Glu His Ile
GTG
Val 285 CAG CAA ACG ACA Gin Gin Thr Thr CAA CTG ATA Gin Leu Ile 290
GAA
Glu CTC GTG GAT GCC Leu Val Asp Ala 295 GAT GAA AAA CAT AsD Glu Lvs His
GCT
Ala 310 CAG AGA GAG Gin Arg Glu AAA TAC GAA Lys Tyr Glu 330 TCT CTT TAT Ser Leu Tyr
AGA
Arg 315
CAA
Gin
ATG
Met TTT TTA TTA Phe Leu Leu
AAA
Lys 320
GAA
Glu GCG ACA GAA TCG AGG CAC Ala Thr Glu Ser Arg His 325 CAA CTA AAA CAG CAG CTT Gin Leu Lys Gin Gin Leu 870 918 966 1014 1062 1110 1158 ATG AAA CAG CAA Met Lys Gin Gin 335 GAT AAG TTT GAA Asp Lys Phe Glu
GTA
Val GAA TTC CAG ACT Glu Phe Gin Thr 355 AGA CAG GAA ATG Arg Gin Glu Met 370 340
ACC
Thr ATG GCA AAA Met Ala Lys
AGC
Ser 360 GAA CTG TTT Glu Leu Phe
ACA
Thr 365
TTC
Phe GAA AAG ATG Glu Lys Met WO 98/10066 AAG AAA ATT AAA AAA AAA AAA AAA AAA CTC GAG Lys Lys Ile Lys Lys Lys Lys Lys Lys Leu Glu 380 385 INFORMATION FOR SEQ ID NO:6: SEQUENCE CHA RACTERISTICS: LENGTH: 386 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: PCTIUS97/13944 1191 Ala Arg Gly Gly Ala Glu Glu Ala Thr Giu 1 Arg Gly S er Asn Asn Glu Asp Leu Leu 145 Ser Val Ala Lys Arg 225 Gin Arg Ile Asn 50 Lys Arg Ile Ser Leu 130 Ala Val1 Lys Arg Thr 210 Arg Ala Ser Cys Asp His Asn Pro Glu 115 Leu Ala Gin Glu Ser 195 Leu Ile Gin Pro Gly Ile Ser Leu Gly 100 Cys Met Leu Lys Lys 180 Lys Lys Glu Leu 5 Arg Leu Leu Leu Val1 Gly Asn Gin Cys Gin 165 Val Leu Glu Ala Glu 245 Gin Gly Gin Glu 70 Ser Glu Arg Ala Lys 150 Met His Giu Glu Thr 230 Gin Lys Val His 55 Glu Pro Ala Asn Leu 135 Lys Lys Leu Ser Asn 215 Ala H is Phe Lys 40 Gin Asp Al a Arg Lys 120 Asn Tyr Ile Gin Leu 200 Met His Asp Giu 25 Ala Gly Glu Tyr Thr 105 Glu Thr Al a Leu Ser 185 Cys Gin Phe Ile 10 Ile Asp Ser Gly Cys 90 Asp Lys Leu Asp Gin 170 Glu Arg Gin Gin His Ala Gly Met Asn Ser 75 Thr Pro Thr Ser Leu 155 Lys His Giu Ala Ile 235 Gly Thr Leu Cys Asp Gin Pro Leu Thr 140 Leu Lys Ser Leu Arg 220 Thr Arg Met Cys Gly Phe Glu Asp Gly 125 Pro Giu Gin Lys Gin 205 Glu Leu Gly Glu Asn Gly Ile Ser Gly 110 Lys Glu Giu Ala Ala 190 Arg Giu Asn Gly Giu Ser Thr Thr Arg Gin Glu Glu Ser Gin 175 Ile His Glu Glu Arg Ala Gin S er Glu Giu Gin Val1 Lys Arg 160 Ile Leu Asn Glu Ile 240 Asn Ala Lys Leu Arg Gin WO 98/10066 Giu Asn Ile Giu 260 Ala Leu Arg Giu 275 Gin Gin Gin Leu 290 Lys Giu Ala Asp 305 Glu Aia Thr Giu Val Gin Leu Lys 340 Phe Gin Thr Thr 355 Gin Giu Met Glu 370 Leu Giu 385 Leu Giu Val Giu Ser 325 Gin Met Lys Gly His Asp Lys 310 Arg Gin Ala Met Giu Ile Ala 295 His His Leu Lys Thr 375 Lys Asp 280 Lys Gin Lys Ser Ser 360 Lys Leu 265 Lys Leu Arg Tyr Leu 345 Asn Lys Lys Vai Gin Giu Giu 330 Tyr Giu Ile Lys Phe Gin Arg 315 Gin Met Leu Lys Leu Lys Thr 300 Giu Met Asp Phe Lys 380 Ile His 285 Thr Phe Lys Lys Thr 365 Lys Giu 270 Lys Gin Leu Gin Phe 350 Thr Lys Gin Giu Leu Leu Gin 335 Giu Phe Lys PCTJUS97/13944 Tyr Leu Ile Lys 320 Giu Giu Arg Lys
Claims (7)
1. An isolated nucleic acid sequence that encodes a BRCA1 Modulator Protein wherein said sequence is selected from the group consisting of 091- 21A31, Sequence ID NO. 1, 091-1F84, Sequence ID No. 3 and 091-132Q20, Sequence ID No.
2. Isolated host cells comprising an isolated nucleic acid sequence of claim 1 that encodes a BRCA1 Modulator Protein.
3. Vectors that comprise an isolated nucleic acid sequence of claim 1 that encodes a BRCA1 Modulator Protein.
4. An isolated nucleic acid sequence that encodes a protein encoded by the 15 cDNA on deposit with the ATCC with accession no. 98141 (091-1F84, Sequence ID No. 3).
5. An isolated nucleic acid sequence that encodes a protein encoded by the cDNA on deposit with the ATCC with accession no. 98142 (091-21A31, 20 Sequence ID No. 1).
6. An isolated nucleic acid sequence that encodes a protein encoded by the cDNA on deposit with the ATCC with accession no. 98143 (091-132Q20, Sequence ID No.
7. An isolated process for producing a BRCA1 Modulator Protein comprising culturing a cell of claim 2 in a suitable culture medium and isolating said protein from said cell or said medium. DATED: 10 May, 2001 PHILLIPS ORMONDE FITZPATRICK Attorneys for: ONYX PHARMACEUTICALS, INC W:nanelle\speci38293a.doc
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US2560196P | 1996-09-04 | 1996-09-04 | |
| US60/025601 | 1996-09-04 | ||
| PCT/US1997/013944 WO1998010066A1 (en) | 1996-09-04 | 1997-08-06 | Modulators of brca1 activity |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| AU3829397A AU3829397A (en) | 1998-03-26 |
| AU735512B2 true AU735512B2 (en) | 2001-07-12 |
Family
ID=21827003
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AU38293/97A Ceased AU735512B2 (en) | 1996-09-04 | 1997-08-06 | Modulators of BRAC1 activity |
Country Status (6)
| Country | Link |
|---|---|
| EP (1) | EP0929668A1 (en) |
| JP (1) | JP2001502893A (en) |
| CN (1) | CN1228811A (en) |
| AU (1) | AU735512B2 (en) |
| CA (1) | CA2259959A1 (en) |
| WO (1) | WO1998010066A1 (en) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2001070982A2 (en) * | 2000-03-23 | 2001-09-27 | Immusol Incorporated | Brca-1 regulators and methods of use |
| CA2293489C (en) * | 1997-06-06 | 2009-09-29 | The Regents Of The University Of California | Inhibitors of dna immunostimulatory sequence activity |
| JP2002537327A (en) * | 1999-02-26 | 2002-11-05 | ナプロ バイオセラピューティクス,インコーポレイテッド | Prostate cancer treatment regimen |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0786523A3 (en) * | 1996-01-29 | 2000-01-12 | Eli Lilly And Company | Protein kinase and method of using |
-
1997
- 1997-08-06 CN CN97197589A patent/CN1228811A/en active Pending
- 1997-08-06 JP JP10512659A patent/JP2001502893A/en active Pending
- 1997-08-06 CA CA002259959A patent/CA2259959A1/en not_active Abandoned
- 1997-08-06 AU AU38293/97A patent/AU735512B2/en not_active Ceased
- 1997-08-06 EP EP97935333A patent/EP0929668A1/en not_active Withdrawn
- 1997-08-06 WO PCT/US1997/013944 patent/WO1998010066A1/en not_active Application Discontinuation
Also Published As
| Publication number | Publication date |
|---|---|
| JP2001502893A (en) | 2001-03-06 |
| CA2259959A1 (en) | 1998-03-12 |
| CN1228811A (en) | 1999-09-15 |
| EP0929668A1 (en) | 1999-07-21 |
| AU3829397A (en) | 1998-03-26 |
| WO1998010066A1 (en) | 1998-03-12 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US6043083A (en) | Inhibitors of the JNK signal transduction pathway and methods of use | |
| US7777005B2 (en) | Compositions, kits, and methods relating to the human FEZ1 gene, a novel tumor suppressor gene | |
| US6399328B1 (en) | Methods and compositions for diagnosis and treatment of breast cancer | |
| US5948643A (en) | Modulators of BRCA1 activity | |
| AU2001263952B2 (en) | Tumour suppressor and uses thereof | |
| AU2001263952A1 (en) | Tumour suppressor and uses thereof | |
| US20020176853A1 (en) | Card domain containing polypeptides, encoding nucleic acids, and methods of use | |
| US6720413B1 (en) | Methods and compositions for diagnosis and treatment of cancer | |
| AU735512B2 (en) | Modulators of BRAC1 activity | |
| US6180763B1 (en) | Cyclin D binding factor, and uses thereof | |
| AU734736B2 (en) | Nucleic acid sequences that encode a cell growth regulatory protein and uses thereof | |
| US20040047845A1 (en) | Methods and compositions for diagnosis and treatment of cancer based on the transcription factor ets2 | |
| WO1999041376A9 (en) | Retinoblastoma protein complexes and retinoblastoma interacting proteins | |
| US20040106148A1 (en) | Polypeptides | |
| JP2001500018A (en) | Novel human phosphorylase kinase γ subunit | |
| WO1997043415A9 (en) | Cyclin d binding factor, and uses thereof | |
| WO1997043415A1 (en) | Cyclin d binding factor, and uses thereof | |
| WO2002022676A1 (en) | A longevity guarantee protein and its encoding sequence and use | |
| AU3291799A (en) | Retinoblastoma protein complexes and retinoblastoma interacting proteins | |
| JPH11235187A (en) | CYP7 promoter binding factor |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FGA | Letters patent sealed or granted (standard patent) |