WO1997010843A1 - Modified p53 constructs and uses therefor - Google Patents
Modified p53 constructs and uses therefor Download PDFInfo
- Publication number
- WO1997010843A1 WO1997010843A1 PCT/US1996/015188 US9615188W WO9710843A1 WO 1997010843 A1 WO1997010843 A1 WO 1997010843A1 US 9615188 W US9615188 W US 9615188W WO 9710843 A1 WO9710843 A1 WO 9710843A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- pro
- ser
- leu
- glu
- arg
- Prior art date
Links
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 claims abstract description 178
- 230000004568 DNA-binding Effects 0.000 claims abstract description 35
- 239000013598 vector Substances 0.000 claims abstract description 19
- 125000000539 amino acid group Chemical group 0.000 claims abstract description 11
- 239000004475 Arginine Substances 0.000 claims abstract description 9
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims abstract description 9
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 5
- 239000012634 fragment Substances 0.000 claims abstract description 4
- 235000001014 amino acid Nutrition 0.000 claims description 75
- 150000001413 amino acids Chemical class 0.000 claims description 72
- 206010028980 Neoplasm Diseases 0.000 claims description 45
- 230000000694 effects Effects 0.000 claims description 35
- 150000007523 nucleic acids Chemical group 0.000 claims description 35
- 238000000034 method Methods 0.000 claims description 22
- 239000008194 pharmaceutical composition Substances 0.000 claims description 15
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 14
- 201000011510 cancer Diseases 0.000 claims description 14
- 238000012217 deletion Methods 0.000 claims description 10
- 230000037430 deletion Effects 0.000 claims description 10
- 210000004899 c-terminal region Anatomy 0.000 claims description 8
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 5
- 239000004473 Threonine Substances 0.000 claims description 5
- 239000003937 drug carrier Substances 0.000 claims description 5
- 235000018417 cysteine Nutrition 0.000 claims description 4
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 4
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 claims description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 4
- 239000002773 nucleotide Substances 0.000 claims description 3
- 125000003729 nucleotide group Chemical group 0.000 claims description 3
- 239000002253 acid Substances 0.000 claims description 2
- 230000002950 deficient Effects 0.000 claims 3
- 108090000623 proteins and genes Proteins 0.000 abstract description 56
- 102000004169 proteins and genes Human genes 0.000 abstract description 51
- 108090000765 processed proteins & peptides Proteins 0.000 abstract description 5
- 102000035118 modified proteins Human genes 0.000 abstract description 3
- 108091005573 modified proteins Proteins 0.000 abstract description 3
- 239000004472 Lysine Substances 0.000 abstract 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 abstract 1
- 108010087924 alanylproline Proteins 0.000 description 87
- 235000018102 proteins Nutrition 0.000 description 48
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 42
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 41
- 108020004414 DNA Proteins 0.000 description 38
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 35
- 108010068488 methionylphenylalanine Proteins 0.000 description 30
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 29
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 23
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 23
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 22
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 22
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 22
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 22
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 22
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 22
- 241000880493 Leptailurus serval Species 0.000 description 22
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 22
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 22
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 22
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 22
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 22
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 22
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 22
- 108010064997 VPY tripeptide Proteins 0.000 description 22
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 22
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 22
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 22
- 108010057821 leucylproline Proteins 0.000 description 22
- 108010029020 prolylglycine Proteins 0.000 description 22
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 21
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 21
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 21
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 21
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 21
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 21
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 21
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 21
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 21
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 21
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 21
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 21
- XXAOSEUPEMQJOF-KKUMJFAQSA-N Phe-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XXAOSEUPEMQJOF-KKUMJFAQSA-N 0.000 description 21
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 21
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 21
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 21
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 21
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 21
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 21
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 21
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 21
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 21
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 21
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 21
- 108010081551 glycylphenylalanine Proteins 0.000 description 21
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 21
- 108020004707 nucleic acids Proteins 0.000 description 21
- 102000039446 nucleic acids Human genes 0.000 description 21
- 108010051242 phenylalanylserine Proteins 0.000 description 21
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 20
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 20
- 108091034117 Oligonucleotide Proteins 0.000 description 20
- XZGWNSIRZIUHHP-SRVKXCTJSA-N Pro-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 XZGWNSIRZIUHHP-SRVKXCTJSA-N 0.000 description 20
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 20
- 108010068380 arginylarginine Proteins 0.000 description 20
- 108010047857 aspartylglycine Proteins 0.000 description 20
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 19
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 19
- WSEITRHJRVDTRX-QTKMDUPCSA-N His-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N)O WSEITRHJRVDTRX-QTKMDUPCSA-N 0.000 description 19
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 19
- 210000004027 cell Anatomy 0.000 description 19
- 108010005942 methionylglycine Proteins 0.000 description 19
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 19
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 18
- NAPULYCVEVVFRB-HEIBUPTGSA-N Cys-Thr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CS NAPULYCVEVVFRB-HEIBUPTGSA-N 0.000 description 18
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 18
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 18
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 18
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 18
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 18
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 18
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 18
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 17
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 17
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 17
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 17
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 17
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 17
- JUJGNDZIKKQMDJ-IHRRRGAJSA-N Pro-His-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O JUJGNDZIKKQMDJ-IHRRRGAJSA-N 0.000 description 17
- UCOYFSCEIWQYNL-FXQIFTODSA-N Ser-Cys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O UCOYFSCEIWQYNL-FXQIFTODSA-N 0.000 description 17
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 17
- 108010060199 cysteinylproline Proteins 0.000 description 17
- 108010089804 glycyl-threonine Proteins 0.000 description 17
- 108010077515 glycylproline Proteins 0.000 description 17
- 108010009298 lysylglutamic acid Proteins 0.000 description 17
- 108010064235 lysylglycine Proteins 0.000 description 17
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 16
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 16
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 description 16
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 16
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 16
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 16
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 16
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 16
- 238000006467 substitution reaction Methods 0.000 description 16
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 15
- CTGZVVQVIBSOBB-AVGNSLFASA-N His-His-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTGZVVQVIBSOBB-AVGNSLFASA-N 0.000 description 15
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 15
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 15
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 15
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 15
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 15
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 15
- 230000027455 binding Effects 0.000 description 15
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 15
- 108010054155 lysyllysine Proteins 0.000 description 15
- 108010077112 prolyl-proline Proteins 0.000 description 15
- NITLUESFANGEIW-BQBZGAKWSA-N Cys-Pro-Gly Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O NITLUESFANGEIW-BQBZGAKWSA-N 0.000 description 14
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 14
- VNDQNDYEPSXHLU-JUKXBJQTSA-N Ile-His-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N VNDQNDYEPSXHLU-JUKXBJQTSA-N 0.000 description 14
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 14
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 14
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 14
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 13
- NJPLPRFQLBZAMH-IHRRRGAJSA-N Asn-Tyr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O NJPLPRFQLBZAMH-IHRRRGAJSA-N 0.000 description 13
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 13
- 108020004705 Codon Proteins 0.000 description 13
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 13
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 13
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 13
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 12
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 12
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 12
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 12
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 12
- GFDBWMDLBKCLQH-IHRRRGAJSA-N Met-Phe-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N GFDBWMDLBKCLQH-IHRRRGAJSA-N 0.000 description 12
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 11
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 11
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 11
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 11
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 11
- 238000012986 modification Methods 0.000 description 11
- 230000004048 modification Effects 0.000 description 11
- 239000013612 plasmid Substances 0.000 description 11
- 108010026333 seryl-proline Proteins 0.000 description 11
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 10
- 239000000203 mixture Substances 0.000 description 10
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 9
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 239000000499 gel Substances 0.000 description 9
- 238000000338 in vitro Methods 0.000 description 9
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 8
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 8
- PPQRSMGDOHLTBE-UWVGGRQHSA-N Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PPQRSMGDOHLTBE-UWVGGRQHSA-N 0.000 description 8
- 108010040002 Tumor Suppressor Proteins Proteins 0.000 description 8
- 102000001742 Tumor Suppressor Proteins Human genes 0.000 description 8
- 235000009697 arginine Nutrition 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 108010090894 prolylleucine Proteins 0.000 description 8
- 230000002103 transcriptional effect Effects 0.000 description 8
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 7
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 7
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 7
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 7
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 7
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 7
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 7
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 7
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 7
- 239000013603 viral vector Substances 0.000 description 7
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 6
- 239000012623 DNA damaging agent Substances 0.000 description 6
- KTGFOCFYOZQVRJ-ZKWXMUAHSA-N Ile-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O KTGFOCFYOZQVRJ-ZKWXMUAHSA-N 0.000 description 6
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 6
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 6
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 6
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 6
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 108010053725 prolylvaline Proteins 0.000 description 6
- CXISPYVYMQWFLE-VKHMYHEASA-N Ala-Gly Chemical compound C[C@H]([NH3+])C(=O)NCC([O-])=O CXISPYVYMQWFLE-VKHMYHEASA-N 0.000 description 5
- 101100107610 Arabidopsis thaliana ABCF4 gene Proteins 0.000 description 5
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 5
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 5
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 5
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 5
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 5
- 101100068078 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GCN4 gene Proteins 0.000 description 5
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 5
- KYPMKDGKAYQCHO-RYUDHWBXSA-N Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KYPMKDGKAYQCHO-RYUDHWBXSA-N 0.000 description 5
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 5
- 230000004913 activation Effects 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 230000004663 cell proliferation Effects 0.000 description 5
- 238000001415 gene therapy Methods 0.000 description 5
- 108010070643 prolylglutamic acid Proteins 0.000 description 5
- 238000011282 treatment Methods 0.000 description 5
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 4
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 4
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 4
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 4
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 4
- SITLTJHOQZFJGG-XPUUQOCRSA-N Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 4
- HGCNKOLVKRAVHD-RYUDHWBXSA-N Met-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-RYUDHWBXSA-N 0.000 description 4
- 102000008300 Mutant Proteins Human genes 0.000 description 4
- 108010021466 Mutant Proteins Proteins 0.000 description 4
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 4
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 4
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 description 4
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 4
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 4
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 4
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108700023214 chimeric p53 Proteins 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 238000002560 therapeutic procedure Methods 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- VOGCFWDZYYTEOY-DCAQKATOSA-N Asn-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N VOGCFWDZYYTEOY-DCAQKATOSA-N 0.000 description 3
- SONUFGRSSMFHFN-IMJSIDKUSA-N Asn-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O SONUFGRSSMFHFN-IMJSIDKUSA-N 0.000 description 3
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 3
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 3
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 3
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 3
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 3
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 3
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 3
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 3
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 3
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 3
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 3
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 3
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 3
- MJTOYIHCKVQICL-ULQDDVLXSA-N Leu-Met-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MJTOYIHCKVQICL-ULQDDVLXSA-N 0.000 description 3
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 3
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 3
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 3
- 108010065395 Neuropep-1 Proteins 0.000 description 3
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 3
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 3
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 3
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 3
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 3
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 3
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 3
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 3
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 3
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 3
- WCRFXRIWBFRZBR-GGVZMXCHSA-N Thr-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WCRFXRIWBFRZBR-GGVZMXCHSA-N 0.000 description 3
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 3
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 230000003281 allosteric effect Effects 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 230000006907 apoptotic process Effects 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 230000005855 radiation Effects 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 230000001225 therapeutic effect Effects 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 230000014616 translation Effects 0.000 description 3
- 210000004881 tumor cell Anatomy 0.000 description 3
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 2
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 2
- NYDIVDKTULRINZ-AVGNSLFASA-N Arg-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NYDIVDKTULRINZ-AVGNSLFASA-N 0.000 description 2
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 2
- UWFOMGUWGPRVBW-GUBZILKMSA-N Asn-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N UWFOMGUWGPRVBW-GUBZILKMSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- DVUFTQLHHHJEMK-IMJSIDKUSA-N Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O DVUFTQLHHHJEMK-IMJSIDKUSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 2
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 2
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 2
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 2
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 2
- VHOLZZKNEBBHTH-YUMQZZPRSA-N His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 VHOLZZKNEBBHTH-YUMQZZPRSA-N 0.000 description 2
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 2
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 2
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 2
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- BQVUABVGYYSDCJ-ZFWWWQNUSA-N Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-ZFWWWQNUSA-N 0.000 description 2
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 2
- HGNRJCINZYHNOU-LURJTMIESA-N Lys-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(O)=O HGNRJCINZYHNOU-LURJTMIESA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 2
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 2
- CRIODIGWCUPXKU-AVGNSLFASA-N Lys-Pro-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O CRIODIGWCUPXKU-AVGNSLFASA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- SDTSLIMYROCDNS-FXQIFTODSA-N Met-Cys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O SDTSLIMYROCDNS-FXQIFTODSA-N 0.000 description 2
- 229930193140 Neomycin Natural products 0.000 description 2
- 108700020796 Oncogene Proteins 0.000 description 2
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 2
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 2
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 2
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 2
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 2
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 2
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 2
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- VPZKQTYZIVOJDV-LMVFSUKVSA-N Thr-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(O)=O VPZKQTYZIVOJDV-LMVFSUKVSA-N 0.000 description 2
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 2
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 2
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 2
- QMNWABHLJOHGDS-IHRRRGAJSA-N Tyr-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QMNWABHLJOHGDS-IHRRRGAJSA-N 0.000 description 2
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 125000000637 arginyl group Chemical class N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000005757 colony formation Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 235000014705 isoleucine Nutrition 0.000 description 2
- 150000002520 isoleucines Chemical group 0.000 description 2
- 235000005772 leucine Nutrition 0.000 description 2
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 229960004927 neomycin Drugs 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 238000011275 oncology therapy Methods 0.000 description 2
- 201000008968 osteosarcoma Diseases 0.000 description 2
- 238000007911 parenteral administration Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 230000004614 tumor growth Effects 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- SUQWGICKJIJKNO-IHRRRGAJSA-N (2s)-2-[[2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]acetyl]amino]pentanedioic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O SUQWGICKJIJKNO-IHRRRGAJSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- 125000000980 1H-indol-3-ylmethyl group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[*])C2=C1[H] 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- JSLGXODUIAFWCF-WDSKDSINSA-N Arg-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O JSLGXODUIAFWCF-WDSKDSINSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- YWENWUYXQUWRHQ-LPEHRKFASA-N Arg-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O YWENWUYXQUWRHQ-LPEHRKFASA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- PZVMBNFTBWQWQL-DCAQKATOSA-N Arg-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PZVMBNFTBWQWQL-DCAQKATOSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- ROWCTNFEMKOIFQ-YUMQZZPRSA-N Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N ROWCTNFEMKOIFQ-YUMQZZPRSA-N 0.000 description 1
- IJYZHIOOBGIINM-WDSKDSINSA-N Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N IJYZHIOOBGIINM-WDSKDSINSA-N 0.000 description 1
- 206010051113 Arterial restenosis Diseases 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- FLJVGAFLZVBBNG-BPUTZDHNSA-N Asn-Trp-Arg Chemical compound N[C@@H](CC(=O)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O FLJVGAFLZVBBNG-BPUTZDHNSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- 201000001320 Atherosclerosis Diseases 0.000 description 1
- 208000023275 Autoimmune disease Diseases 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 241001598984 Bromius obscurus Species 0.000 description 1
- 101100285688 Caenorhabditis elegans hrg-7 gene Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- XEKAJTCACGEBOK-KKUMJFAQSA-N Glu-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XEKAJTCACGEBOK-KKUMJFAQSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 1
- LLEUXCDZPQOJMY-AAEUAGOBSA-N Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 LLEUXCDZPQOJMY-AAEUAGOBSA-N 0.000 description 1
- SFKMXFWWDUGXRT-NWLDYVSISA-N Glu-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N)O SFKMXFWWDUGXRT-NWLDYVSISA-N 0.000 description 1
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- ZGGVHTQAPHVMKM-IHPCNDPISA-N Leu-Trp-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N ZGGVHTQAPHVMKM-IHPCNDPISA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- GHOIOYHDDKXIDX-SZMVWBNQSA-N Lys-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 GHOIOYHDDKXIDX-SZMVWBNQSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- SKUOQDYMJFUMOE-ULQDDVLXSA-N Lys-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N SKUOQDYMJFUMOE-ULQDDVLXSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- WSDRAZIPGVLSNP-UHFFFAOYSA-N O.P(=O)(O)(O)O.O.O.P(=O)(O)(O)O Chemical group O.P(=O)(O)(O)O.O.O.P(=O)(O)(O)O WSDRAZIPGVLSNP-UHFFFAOYSA-N 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- UAMFZRNCIFFMLE-FHWLQOOXSA-N Phe-Glu-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N UAMFZRNCIFFMLE-FHWLQOOXSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- 201000004681 Psoriasis Diseases 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 1
- KWQBJOUOSNJDRR-XAVMHZPKSA-N Thr-Cys-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N)O KWQBJOUOSNJDRR-XAVMHZPKSA-N 0.000 description 1
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- 206010052779 Transplant rejections Diseases 0.000 description 1
- PNHABSVRPFBUJY-UMPQAUOISA-N Trp-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PNHABSVRPFBUJY-UMPQAUOISA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 description 1
- IOUPEELXVYPCPG-UHFFFAOYSA-N Valylglycine Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000008186 active pharmaceutical agent Substances 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000002421 anti-septic effect Effects 0.000 description 1
- 239000004599 antimicrobial Substances 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 230000005907 cancer growth Effects 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 230000002032 cellular defenses Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 210000000038 chest Anatomy 0.000 description 1
- 238000010293 colony formation assay Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000009881 electrostatic interaction Effects 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 230000006058 immune tolerance Effects 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 206010025135 lupus erythematosus Diseases 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 238000000302 molecular modelling Methods 0.000 description 1
- 238000012900 molecular simulation Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 102000027450 oncoproteins Human genes 0.000 description 1
- 108091008819 oncoproteins Proteins 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 238000001959 radiotherapy Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 210000001995 reticulocyte Anatomy 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 102000023888 sequence-specific DNA binding proteins Human genes 0.000 description 1
- 108091008420 sequence-specific DNA binding proteins Proteins 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 238000012868 site-directed mutagenesis technique Methods 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 238000011287 therapeutic dose Methods 0.000 description 1
- 238000011285 therapeutic regimen Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 230000005748 tumor development Effects 0.000 description 1
- 230000005760 tumorsuppression Effects 0.000 description 1
- 238000009281 ultraviolet germicidal irradiation Methods 0.000 description 1
- 231100000402 unacceptable toxicity Toxicity 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 230000029812 viral genome replication Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4746—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used p53
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
Definitions
- the present invention relates generally to the field of oncoproteins, and more specifically to p53.
- Wild-type (wt) p53 is a sequence-specific DNA binding protein found in humans and other mammals, which has tumor suppressor function [See, e.g., Harris, Science. 262: 1980-1981 (1993)].
- the wild-type p53 protein functions to regulate cell proliferation and cell death (also known as apoptosis) . It also participates in the response of the cell to DNA damaging agents [Harris (1993), cited above].
- DNA damaging agents such as radiation and che otherapeutics commonly used for cancer treatment.
- Fig. 3 illustrates the effects of a number of amino acid substitutions on DNA binding of the tumor- derived p53His273 mutant. Binding was assayed using a high affinity DNA site (oligonucleotide BC) by gel retardation shift analysis on native electrophoretic gels. The amino acids are abbreviated using the single letter code: His, H; Arg, R; Lys, K.
- Fig. 4 illustrates the effect of the Thr284 to
- the amino acids are abbreviated using the single letter code: Arg, R; Cys, C; Gin, Q; His, H; Lys, K.
- Fig. 5 illustrates the results of an experiment relating to rescue of the transcriptional and tumor suppressor activities of tumor-derived p53 mutants.
- Transcriptional activities from a reporter plasmid containing a high affinity p53 DNA site are presented as means ⁇ SE.
- the activity of wild ⁇ type p53 was adjusted to 100%. No transcription was detected from a reporter lacking a p53 site.
- Tumor suppressor activities in Saos-2 osteosarcoma cells are presented as means ⁇ SE of the number of tumor cell colonies per plate.
- the amino acids are abbreviated using the single letter code: Arg, R; Cys, C; Gin, Q; His, H. threonine corresponding to amino acid residue 284 of the wild-type human p53 protein is changed to arginine.
- the invention provides a method of enhancing the DNA-binding ability of a p53 construct having a p53 DNA binding domain comprising the step of modifying the codon encoding amino acid 284 to a codon encoding arginine.
- the present invention provides a nucleic acid sequence encoding a protein of the invention.
- These nucleic acids may be inserted into an appropriate vector for delivery to patients for gene therapy.
- the nucleic acids may be inserted into a vector for in vitro expression of a protein of the invention, which is then introduced into patients.
- Other aspects and advantages of the present invention are described further in the following detailed description of the preferred embodiments thereof.
- Fig. 1 illustrates activation of DNA binding of common Class I mutants by antibody PAb421. Binding was assayed using a high affinity DNA site (oligonucleotide BC) by gel retardation shift analysis on native electrophoretic gels.
- the amino acids are abbreviated using the single letter code: Cys, C; Gin, Q; His, H; Ser, S; Trp, W.
- Fig. 2 illustrates activation of DNA binding of common Class I mutants by deletion of the p53 C-terminal 30 amino acids (residues 364-393) .
- Binding was assayed using a high affinity DNA site (oligonucleotide BC) by gel retardation shift analysis on native electrophoretic gels in the presence of specific (S - oligonucleotide BC) or non-specific (NS - oligonucleotide TF3) unlabeled excess competitor DNA.
- S - oligonucleotide BC specific - oligonucleotide BC
- NS - oligonucleotide TF3 non-specific
- the invention provides modified p53 protein constructs in which the amino acid residue corresponding to residue 284 of wild-type or native human p53 is modified from the native threonine to arginine.
- the native Thr residue at position 284 may be substituted with Lys (K284) .
- Lys284 was introduced into a p53His273 mutant, it was found to bind oligonucleotide BC somewhat better than the original p53His273 mutant, using the assay described in detail in Example 3. It will be understood that where reference is made to R284 in the following discussion, K284 may be substituted.
- the modified p53 constructs of the invention may be derived from full-length p53.
- p53Arg284 [SEQ ID NO: 3] and p53Lys284 [SEQ ID NO:4] are example ⁇ of such modified constructs.
- the modified p53 constructs of the invention may contain a C-terminal p53 deletion.
- a preferred deletion involves truncation of amino acid residues 364 - 393.
- One example of such a truncated construct is p53Arg284 ⁇ 364-393 [SEQ ID NO: 17].
- suitable deletions include truncation following amino acid residue 355, and deletions internal to this region (corresponding to residues 356-393 of SEQ ID N0:2) .
- p53 protein constructs encompasses full-length and truncated p53 proteins containing a p53 DNA binding domain. Included in this Detailed Description of the Invention
- the present invention provides modified p53 constructs containing arginine at the amino acid residue corresponding to residue 284 of wild-type human p53 [SEQ ID NO: 2].
- the inventor has found that such a modification results in an increase in the DNA binding avidity of the p53 and more efficient tumor suppression than the corresponding unmodified construct.
- the R284p53 [SEQ ID NO: 3] was found to bind DNA more avidly than wild-type p53 in vitro and to suppress colony growth of tumor cells about five- to six-fold more efficiently than wild-type p53 in tissue culture experiments.
- the inventor demonstrates herein that the tumor suppressor function of common Class I tumor-derived p53 mutants can be restored and provides the means for pharmacological rescue of p53 function in cancer patients.
- the inventor introduced a novel p53-DNA contact between a phosphate of the DNA backbone and p53. This was done by replacing Thr284 of wild-type human p53 with Arg. This substitution, in conjunction with the conformational switch that involves the C-terminus of p53 and allosterically regulates the activity of the p53 DNA binding domain, fully restored DNA binding of the tumor-derived p53 mutants. Furthermore, the transcriptional and tumor suppressing activities of these p53 mutants were also restored. 8
- chimeric p53 proteins include proteins containing the N-terminal portion of p53 fused, optionally via a suitable linker, to a heterologous tetramerization domain.
- a heterologous tetramerization domain includes any sequence of amino acids heterologous to p53 which forms stable homo-tetramers.
- One particularly desirable tetramerization domain includes the tetrameric variant of the GCN4 LZ [Harbury et al,
- GCN4 numbering follows Hinnenbusch et al, Proc. Natl. Acad. Sci. USA, 81:6442- 6446 (1984) and Ellenberger et al, Cell, 71: 1223-1237 (1992) .
- Wild-type GCN4 is provided in SEQ ID NOS: 5 and 6.
- the LZ variant has Ile at positions d of the coil and Leu at positions a [SEQ ID NO: 33], in contrast to the original zipper which has Leu and Val, respectively.
- Suitable chimera include (from N-terminus to C-terminus) :
- the above nucleotide sequences can be included within larger DNA or RNA fragments, or may be interrupted by introns.
- nucleic acids encoding the modified proteins of the invention are present in the context of vectors suitable for amplification in prokaryotic or eukaryotic cells or for expression in cell-free extracts or lysates or in prokaryotic or eukaryotic cells.
- vectors suitable for amplification in prokaryotic or eukaryotic cells or for expression in cell-free extracts or lysates or in prokaryotic or eukaryotic cells are known and many of these are commercially available.
- plasmids with bacterial or yeast replication origins allow amplification in bacteria or yeast, respectively.
- Such vectors allow the production of large quantities of nucleic acids encoding the proteins of the invention, which nucleic acids can be used for gene therapy or for expression of the modified p53 proteins of the invention.
- expression vectors are known.
- the vector pGEM4 (Promega, Madison, WI) is suitable for expression of the p53 proteins in cell-free lysates
- the vector pSV2 [Mulligan et al, Proc. Natl. Acad. Sci. USA. 18:2072-2076 (1981)] is suitable for expression in mammalian cells.
- Such vectors allow the production of the modified proteins of the invention in vitro for analysis of their functional properties or for delivery to patients.
- one of skill in the art may readily select or construct another suitable expression vector.
- nucleic acid sequences of the invention may be inserted into a vector capable of targeting and infecting a desired cell, either in vivo or ex vivo for which is fused via an Ile linker, to aa residues 352-393 of p53wt [SEQ ID NO: 2].
- mutants include p53 having glutamine at residue 248 (p53Q248) [SEQ ID NO: 11], p53 having histidine at residue 273 (p53H273) [SEQ ID NO: 12], and p53 having cysteine at residue 273 (p53C273) [SEQ ID NO: 13].
- Other p53 mutants which may be susceptible to this R284 mutation are known in the art.
- Modifying the p53 protein construct according to the method of the invention involves altering the residue corresponding to aa residue 284 of human p53wt or of a p53 mutant containing the native Thr284 to Arg. This modification can be achieved by mutating the 284 codon using conventional site-directed mutagenesis techniques [R. Higuchi et al, in M. A. Innis et al, (eds.) , PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, pp. 177-183
- the native codon 284 (ACA) is modified by site-directed mutagenesis to CGA or preferably CGT, which encodes Arg.
- CGA native codon 284
- CGT preferably CGT
- the present invention further provides nucleic acid sequences encoding the modified p53 protein constructs of this invention.
- the nucleic acid sequences of the invention 12 a pharmaceutically acceptable carrier, such as saline, and administered parenterally (or by other suitable means) in sufficient amounts to infect the desired cells and provide sufficient levels of p53 activity to arrest abnormal cellular proliferation.
- a pharmaceutically acceptable carrier such as saline
- Other pharmaceutically acceptable carriers are well known to those of skill in the art.
- a suitable amount of the vector containing the chimeric nucleic acid sequences is between about 10 6 to 10° infectious particles per mL carrier. The delivery of the vector may be repeated as needed to sustain satisfactory levels of p53 activity, as determined by monitoring clinical symptoms.
- this therapy may be combined with other therapies for the disease or condition being treated.
- therapy involving the administration of a vector capable of expressing a modified p53 protein construct of the invention is well suited for use in conjunction with conventional cancer therapies, including surgery, radiation and chemotherapy.
- nucleic acid sequences driving expression of a p53 protein of the invention may also be introduced as "naked DNA" by "carriers" other than viral vectors, such as liposomes, nucleic acid-coated gold beads or can simply be suspended in saline or the like and injected in situ [Fujiwara et al (1994) , cited above; Fynan et al , Proc. Natl. Acad. Sci.
- a suitable amount of nucleic acid is between about 10 ⁇ g to about 1 mg per mL carrier.
- a suitable amount of nucleic acid is between about 10 ⁇ g to about 1 mg per mL carrier.
- many such viral vectors are useful for this purpose, e.g., adenoviruses, retroviruses and adeno-associated viruses (AAV) [Schreiber et al,
- a recombinant viral vector e.g. an adenovirus
- a recombinant viral vector comprises DNA of at least that portion of the viral genome which is capable of infecting the target cells operatively linked to the nucleic acid sequences of the invention.
- infection is generally meant the process by which a virus transfers genetic material to its host or target cell.
- the virus used in the construction of a vector of the invention is rendered replication-defective to remove the effects of viral replication on the target cells.
- the replication-defective viral genome can be packaged by a helper virus in association with conventional techniques.
- the vector(s) containing the nucleic acids encoding a protein of the invention is suspended in 14 effective to treat the conditions referred to below.
- a preferred dose of a pharmaceutical composition containing a protein of this invention is generally effective above about 0.1 mg modified p53 protein, and preferably from about 1 mg to about 100 mg. Dosage units of such pharmaceutical compositions containing the proteins of this invention preferably contain about 1 mg to 5 g of the protein. These doses may be administered with a frequency necessary to achieve and maintain satisfactory p53 DNA binding and tumor suppressor activity levels. Although a preferred range has been described above, alternative doses for treatment of each type of tumor or other condition may be determined by those of skill in the art.
- nucleic acids and proteins of the invention can be introduced into human patients for therapeutic benefits in conditions characterized by insufficient wild-type p53 activity.
- the nucleic acids of the invention may be introduced into the patient in the form of a suitable viral vector (or by direct DNA delivery) to harness the patient's cellular machinery to express the proteins of the invention in vivo.
- the proteins of the invention may be introduced into the patient in appropriate pharmaceutical formulations as described above.
- compositions of thi ⁇ invention containing a protein of the invention or a nucleic acid or a viral vector which express a protein of the invention in vivo, may be employed to induce the cellular defense to DNA damaging agents.
- DNA damaging agents include sunlight, UV irradiation, as well as radiation and chemotherapeutics used for cancer treatment.
- UV irradiation examples include sunlight, UV irradiation, as well as radiation and chemotherapeutics used for cancer treatment.
- modified p53 protein constructs of this invention may also be formulated into pharmaceutical compositions and administered using a therapeutic regimen compatible with the particular formulation.
- compositions within the scope of the present invention include compositions containing a protein of the invention in an effective amount to have the desired physiological effect, e.g. to arrest the growth of cancer cells without causing unacceptable toxicity for the patient.
- Suitable carriers for parenteral administration include aqueous solutions of the active compounds in water-soluble or water-dispersible form, e.g. saline.
- suspensions of the active compounds may be administered in suitable conventional lipophilic carriers or in liposomes.
- compositions may be supplemented by active pharmaceutical ingredients, where desired.
- Optional antibacterial, antiseptic, and antioxidant agents in the compositions can perform their ordinary functions.
- the pharmaceutical compositions of the invention may further contain any of a number of suitable viscosity enhancers, stabilizers, excipients and auxiliaries which facilitate processing of the active compounds into preparations that can be used pharmaceutically.
- these preparations, as well as those preparations discussed below, are designed for parenteral administration.
- compositions designed for oral or rectal administration are also considered to fall within the scope of the present invention.
- suitable amount or “effective amount” means an amount which is 16
- Plasmids of the pGEM series were used to generate in vitro translated p53 proteins, as previously described [T. Halazonetis and A. Kandil, EMBO J.. 12:5057-5064 (1993a); T. Halazonetis and A. Kandil, EMBO J. , 12:1021-1028 (1993b) ; J. L. Waterman et al, EMBO J. , 14:512-519 (1995)].
- plasmid pGEMhump53wt (also termed pGEMhp53wtB) encodes full-length human wild-type p53.
- This plasmid was prepared by PCR using a human p53 cDNA, which is readily available to those practicing the art. The PCR procedure was designed to incorporate unique restriction sites within the coding sequence of human p53: Kpn I at codon 218, Sst I at codon 299, Sst II at codon 333, Bst BI at codon 338 and Sal I immediately following the termination codon. An Msc I site at codon 138 was eliminated.
- the proteins were derived from pGEMhump53wt by site-directed mutagenesis [Higuchi, in Innis et al, PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, pp. 177-183 (1990) ] of the codons indicated below.
- In vitro amount of a composition of this invention patients may tolerate higher doses of such DNA damaging agents.
- compositions of this invention are in inducing apoptosis of specific cells, such as proliferating lymphocytes.
- a suitable amount of an appropriate pharmaceutical composition of this invention is administered to a subject to enhance the development of immune tolerance.
- This method may employ both in vivo and ex vivo modes of administration.
- this therapy is useful as the sole treatment or as an accessory treatment to prevent transplant rejection, or to treat autoimmune diseases, e.g. , systemic lupus erythrematosis, rheumatoid arthritis and the like.
- the pharmaceutical compositions of this invention may also be employed to restore p53 function in tumor cells.
- a suitable amount of the composition of this invention is administered systemically, or locally to the site of the tumor with or without concurrent administration of conventional cancer therapy (i.e. DNA damaging agents) .
- compositions of this invention may be administered in methods to suppress cell proliferation in diseases other than cancers, which are characterized by aberrant cell proliferation.
- diseases include psoriasis, atherosclerosis and arterial restenosis.
- This method is conducted by administering a suitable amount of the selected composition systemically or locally to the patient.
- These examples illustrate the preferred method for preparing exemplary modified p53 constructs of the invention and the biological activity of the modified p53 constructs. These examples are illustrative only and do not limit the scope of the invention. 18
- the DNA binding activity of wild-type p53 is allosterically regulated by a basic region within the C- ter inal 30 amino acids of p53.
- Monoclonal antibodies that mask this regulatory region, such as PAb421, or deletion of this region stimulate binding to DNA [T. Halanonetis et al, EMBJO J. , 12:1021-1028 (1993) ; T.R. Hupp et al, Cell. 21:875-886 (1992) ; J.L.F. Waterman et al, EMBO J. , ⁇ :512-519 (1995)].
- some tumor-derived mutants have also been reported to bind DNA when allosterically activated by antibody PAb421 [T. Halazonetis and A.
- Proteins corresponding to a) to h) each containing a deletion of the C-terminal 30 amino acid of human p53 ( ⁇ 364-393) , were also generated [SEQ ID NOS: 17-24]. These deletions permit in vitro DNA binding.
- Plasmid pBC/TKseap has one copy of oligonucleotide BC [Halazonetis, EMBO J. , 12:1021-1028 (1993) cloned in the Eco RV site of pTKseap [Waterman, 1996] and expresses secreted alkaline phosphatase in a p53-responsive manner.
- an Arg side chain introduced at position 284 could form electrostatic interactions with the phosphate oxygen atoms of DNA closest to its ⁇ -carbon and without violating bond lengths and angles. Modeling was performed with Quanta 4.1 (Molecular Simulations Inc., Burlington, MA) .
- Example 1 All the proteins of Example 1 containing the 30 amino acid C-terminal deletion were expressed by in vitro translation and assayed for DNA binding using 0.2 ng 32 P- labeled DNA and, where indicated below, 100 ng unlabeled competitor DNA [J. L. F. Waterman et al, EMBO J. , 14: 512- 519 (1995)], The analysis was restricted to the C- terminally truncated proteins because full-length p53 translated in vitro is in a latent state and cannot bind DNA unless activated by a C-terminal truncation or by a monoclonal antibody (PAb421) that binds to the p53 C- terminus [Waterman et al, cited above].
- PAb421 monoclonal antibody
- Oligonucleotide BC which has the following sequence (top strand) is: [SEQ ID NO: 29] CC-GGGCA-TGTCC- GGGCA-TGTCC-GGGCATGT, and oligonucleotide structure of their DNA binding domain, have latent sequence-specific DNA binding activity, whereas Clas ⁇ II mutants, which have unfolded DNA binding domains [C. A. Finlay et al, Mol. Cell. Biol.
- residues of the DNA binding domain of p53His273 were replaced with basic amino acids.
- the substitutions targeted essentially all the residues close to the DNA backbone, except for those that already contact DNA or those that unequivocally stabilize the three-dimensional structure of p53 [Cho, cited above].
- the targeted residues were: Glyll7, Thrll ⁇ , Alall9, Asn247, Thr284, Glu285 and Glu287.
- Substitution of Thr284 with Arg enhanced binding of p53His273 to the high affinity DNA ⁇ ite, although binding wa ⁇ still dependent on allosteric activation by antibody PAb421 (Fig. 3) .
- Substitution of Thr284 with Lys also enhanced binding of p53His273 to the high affinity DNA ⁇ ite, but less than sub ⁇ titution of 22
- the Class I p53 mutants had either weak (p53Hi ⁇ 273) or no (p53Gln248 and p53Cys273) transcriptional activity. However, their transcriptional activity was enhanced to wild-type levels by the Thr284 to Arg substitution or, for p53Gln248, by combining the Thr284 to Arg substitution with C-terminal allosteric activation (Fig. 1) .
- Tumor suppre ⁇ ing activity was tested in a colony formation assay, by cotransfecting Saos-2 osteosarcoma cells with 5 ⁇ g of pSV2hp53 expres ⁇ ion pla ⁇ mid directing p53 expre ⁇ ion, 0.5 ⁇ g of pSV7neo, a plasmid that confers neomycin/G418 resistance [K. Zhang et al, Proc. Natl. Acad. Sci. USA. 82:6281-6285 (1990)] and 24 ⁇ g of pBC12/PLseap [T. D. Halazonetis, Anticancer Res. , 11:285-292 (1992)], a carrier plasmid.
- the transfected cell ⁇ were ⁇ elected for G418 re ⁇ i ⁇ tance, a neomycin relative. Two weeks later the colonies were stained with cry ⁇ tal violet and counted. High tumor suppressor activity corresponds to low colony formation.
- the proteins containing the Arg284 modification ⁇ uppre ⁇ ed tumor colony formation more efficiently than the corresponding proteins without the Arg284 modification (Table 1) .
- the magnitude of the effect is greater for the tumor-derived p53 mutants; however, even Ep21, which has the following sequence: [SEQ ID NO: 30] CCC-GAACA-TGTCC-CAACA-TGTTG-GGG, each contain a p53 binding site, which is underlined.
- the BC oligonucleotide has a high affinity p53-binding site, while oligonucleotide Ep21 contains a lower affinity site, which is present in the regulatory sequences of the p21 gene [W. S.
- Oligonucleotide Egadd45 has the sequence [SEQ ID NO: 31] ACA-GAACA-TGTCT-AAGCA-TGCTG-GGGA.
- Oligonucleotide TF3 which contains three tandem repeats of [SEQ ID NO: 32] ATCACGTGAT, is a non-specific DNA [Halazonetis et al, EMBO J. , 11:1021-1028 (1993)].
- Thr284 to Arg substitution enhanced binding of all p53 proteins examined (Fig. 4) .
- oligonucleotides BC and Ep21 for wild-type p53 the effect is evident with oligonucleotides BC and Ep21, for p53Gln248 it is evident with oligonucleotide BC, for p53His273 and p53Cys273 it is evident with all oligonucleotides tested (Fig. 4) .
- Example 3 Transcription and Tumor Suppres ⁇ ion A ⁇ ay ⁇
- the proteins of Example 1 were examined for their transcriptional activity and tumor suppres ⁇ or activity. Wild-type p53 activate ⁇ transcription of target genes and suppresses tumor growth, whereas tumor- derived mutants lack both these activities [S.E. Kern et al, Science. 256: 827-830 (1992) ; C. A. Finlay et al, Cell, 5_7:1083-1093 (1989)].
- the transcriptional activities of wild-type p53 and various p53 mutants were as ⁇ ayed with a p53-responsive reporter pla ⁇ mid in Saos-2 human o ⁇ teo ⁇ arcoma cells, which lack endogenous p53 [M.J.F.
- transcriptional activity was determined by transfecting Saos-2 cell ⁇ with 2.5 ⁇ g pSV2hp53 expre ⁇ ion plasmid and 27.5 ⁇ g pBC/TKseap or pTKseap reporter pla ⁇ ids [Waterman et al, Cancer Res. , 24
- MOLECULE TYPE DNA (genomic)
- GGT TCT AAA TCA ACC AAC GAA AAT GTA TCT GCT TCC ACT TCT 879 Gly Ser Lys Ser Thr Asn Glu Asn Val Ser Ala Ser Thr Ser
- MOLECULE TYPE DNA (genomic)
- AAAAATTTCC GACTTTAAAT ACGGAAGATA AATACTCCAA CCTTTTTTTC 100
- GAAAACTGTC AGTTTTTTGA AGAGTTATTT GTTTTGTTAC CAATTGCTAT 600
- Lys Gin Arg Ser lie Pro Leu Ser Pro Ile Val Pro Glu Ser Ser
- AAA CGT GCT AGA AAC ACT GAA GCC GCC AGG CGT TCT CGT GCG 1509 Lys Arg Ala Arg Asn Thr Glu Ala Ala Arg Arg Ser Arg Ala 235 240 245
- MOLECULE TYPE protein Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Gastroenterology & Hepatology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Toxicology (AREA)
- Peptides Or Proteins (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
A modified p53 protein or peptide having DNA binding in which amino acid residue 284 of a p53 protein or protein fragment is changed to Arginine or Lysine, is described. Also described are nucleotide sequences encoding the modified protein and vectors capable of expressing it.
Description
MODIFIED P53 CONSTRUCTS AND USES THEREFOR
Field of the Invention
The present invention relates generally to the field of oncoproteins, and more specifically to p53.
Background of the Invention
Wild-type (wt) p53 is a sequence-specific DNA binding protein found in humans and other mammals, which has tumor suppressor function [See, e.g., Harris, Science. 262: 1980-1981 (1993)]. The wild-type p53 protein functions to regulate cell proliferation and cell death (also known as apoptosis) . It also participates in the response of the cell to DNA damaging agents [Harris (1993), cited above]. In more than half of all human tumors p53 is inactivated by mutations and is therefore unable to arrest cell proliferation or induce apoptosis in response to DNA damaging agents, such as radiation and che otherapeutics commonly used for cancer treatment. The nucleotide and amino acid sequences of human p53 have been reported by Zakut-Houri et al, EMBO J. , 4.: 1251-1255 (1985) ; GenBank Code Hsp53]. The amino acid sequence of p53 is conserved across evolution [Soussi et al, Oncogene. 5_: 945-952 (1990) ], suggesting that its function is also conserved. The ability of p53 to bind DNA in a sequence-specific manner maps to amino acid residues 90-290 of human p53 [Halazonetis and Kandil, EMBO J.. 12: 5057-5064 (1993); Pavletich et al, Genes Dev.. 1_: 2556-2564 (1993) ; Wang et al, Genes Dev.. 1_: 2575-2586 (1993) ] and the tetramerization domain maps to amino acid residues 322-355 of human p53. Mutations of the p53 protein in most human tumors involve the sequence-specific DNA binding domain, so that the mutant proteins are unable to bind DNA [Bargonetti et al, Genes Dev. , 6 : 1886-1898 (1992)]. The loss of p53 function is critical for tumor development.
using the single letter code: Cys, C; Gin, Q; His, H; Ser, S; Trp, W.
Fig. 3 illustrates the effects of a number of amino acid substitutions on DNA binding of the tumor- derived p53His273 mutant. Binding was assayed using a high affinity DNA site (oligonucleotide BC) by gel retardation shift analysis on native electrophoretic gels. The amino acids are abbreviated using the single letter code: His, H; Arg, R; Lys, K. Fig. 4 illustrates the effect of the Thr284 to
Arg substitution on binding of wild-type p53 and tumor- derived p53 mutants to a high affinity DNA site, and to the sites in the p21c-^ and gadd45 genes (oligonucleotides BC, Ep21 and Egadd45, respectively) . Binding was assayed by gel retardation shift analysis on native electrophoretic gels. Only the region of the gel corresponding to the p53-DNA complexes is shown. The amino acids are abbreviated using the single letter code: Arg, R; Cys, C; Gin, Q; His, H; Lys, K. Fig. 5 illustrates the results of an experiment relating to rescue of the transcriptional and tumor suppressor activities of tumor-derived p53 mutants. Transcriptional activities from a reporter plasmid containing a high affinity p53 DNA site (oligonucleotide BC) are presented as means ±SE. The activity of wild¬ type p53 was adjusted to 100%. No transcription was detected from a reporter lacking a p53 site. Tumor suppressor activities in Saos-2 osteosarcoma cells are presented as means ± SE of the number of tumor cell colonies per plate. The amino acids are abbreviated using the single letter code: Arg, R; Cys, C; Gin, Q; His, H.
threonine corresponding to amino acid residue 284 of the wild-type human p53 protein is changed to arginine.
In another aspect, the invention provides a method of enhancing the DNA-binding ability of a p53 construct having a p53 DNA binding domain comprising the step of modifying the codon encoding amino acid 284 to a codon encoding arginine.
In yet another aspect, the present invention provides a nucleic acid sequence encoding a protein of the invention. These nucleic acids may be inserted into an appropriate vector for delivery to patients for gene therapy. Alternatively the nucleic acids may be inserted into a vector for in vitro expression of a protein of the invention, which is then introduced into patients. Other aspects and advantages of the present invention are described further in the following detailed description of the preferred embodiments thereof.
Brief Description of the Drawings
Fig. 1 illustrates activation of DNA binding of common Class I mutants by antibody PAb421. Binding was assayed using a high affinity DNA site (oligonucleotide BC) by gel retardation shift analysis on native electrophoretic gels. The amino acids are abbreviated using the single letter code: Cys, C; Gin, Q; His, H; Ser, S; Trp, W.
Fig. 2 illustrates activation of DNA binding of common Class I mutants by deletion of the p53 C-terminal 30 amino acids (residues 364-393) . Binding was assayed using a high affinity DNA site (oligonucleotide BC) by gel retardation shift analysis on native electrophoretic gels in the presence of specific (S - oligonucleotide BC) or non-specific (NS - oligonucleotide TF3) unlabeled excess competitor DNA. The amino acids are abbreviated
Thus, the inventor has demonstrated that the tumor suppressor function of common Class I tumor-derived p53 mutants can be restored and that p53 function can be rescued in cancer patients.
I. p53 Mutant Proteins
Thus, in one aspect the invention provides modified p53 protein constructs in which the amino acid residue corresponding to residue 284 of wild-type or native human p53 is modified from the native threonine to arginine. In an alternate, and currently lesε preferred, embodiment, the native Thr residue at position 284 may be substituted with Lys (K284) . When Lys284 was introduced into a p53His273 mutant, it was found to bind oligonucleotide BC somewhat better than the original p53His273 mutant, using the assay described in detail in Example 3. It will be understood that where reference is made to R284 in the following discussion, K284 may be substituted.
The modified p53 constructs of the invention may be derived from full-length p53. p53Arg284 [SEQ ID NO: 3] and p53Lys284 [SEQ ID NO:4] are exampleε of such modified constructs. Alternatively, the modified p53 constructs of the invention may contain a C-terminal p53 deletion. Currently a preferred deletion involves truncation of amino acid residues 364 - 393. One example of such a truncated construct is p53Arg284Δ364-393 [SEQ ID NO: 17]. However, suitable deletions include truncation following amino acid residue 355, and deletions internal to this region (corresponding to residues 356-393 of SEQ ID N0:2) .
As used herein, "p53 protein constructs" encompasses full-length and truncated p53 proteins containing a p53 DNA binding domain. Included in this
Detailed Description of the Invention
The present invention provides modified p53 constructs containing arginine at the amino acid residue corresponding to residue 284 of wild-type human p53 [SEQ ID NO: 2]. The inventor has found that such a modification results in an increase in the DNA binding avidity of the p53 and more efficient tumor suppression than the corresponding unmodified construct. For example, when wild-type p53 was so modified, the R284p53 [SEQ ID NO: 3] was found to bind DNA more avidly than wild-type p53 in vitro and to suppress colony growth of tumor cells about five- to six-fold more efficiently than wild-type p53 in tissue culture experiments. Particularly, the inventor demonstrates herein that the tumor suppressor function of common Class I tumor-derived p53 mutants can be restored and provides the means for pharmacological rescue of p53 function in cancer patients.
All references to human p53 residue numbers herein refer to the numbering scheme provided by
Zakut-Houri et al, (1985) [cited above], which is incorporated by reference, and reproduced in SEQ ID NOS: 1 and 2.
Without wishing to be bound by theory, to fully restore DNA binding to tumor-derived p53 mutants, such as Gln248, His273 and Cys273, the inventor introduced a novel p53-DNA contact between a phosphate of the DNA backbone and p53. This was done by replacing Thr284 of wild-type human p53 with Arg. This substitution, in conjunction with the conformational switch that involves the C-terminus of p53 and allosterically regulates the activity of the p53 DNA binding domain, fully restored DNA binding of the tumor-derived p53 mutants. Furthermore, the transcriptional and tumor suppressing activities of these p53 mutants were also restored.
8
(c) aa 1-325 of p53wt [SEQ ID NO: 2], fused via an Arg-Gly-Asn linker [SEQ ID NO: 8], to the heterologous sequence of (a) above;
(d) aa 1-325 of p53wt [SEQ ID NO: 2], fused via an Arg-Gly-Gly-Asn-Pro-Glu linker [SEQ ID NO: 9], to the heterologous sequence of (b) above;
(e) aa 1-323 of p53wt [SEQ ID NO: 2], fused via an Arg-Gly-Asn linker [SEQ ID NO: 8], to the heterologous sequence of (a) above; (f) aa 1-323 of p53wt [SEQ ID NO: 2], fused via an Arg-Gly-Gly-Asn-Pro-Glu linker [SEQ ID NO: 9], to the heterologous sequence of (b) above;
(g) aa 1-300 of p53wt [SEQ ID NO: 2], fused via a Gly-Gly-Asn-Gln-Ala linker [SEQ ID NO: 10], to the heterologous sequence of (b) above;
(h) aa 1-325 of p53wt [SEQ ID NO: 2], fused via an Arg-Gly-Asn linker [SEQ ID NO: 8], to the heterologous sequence of (a) above, fused via an Ile linker, to aa 352-393 of p53wt [SEQ ID NO: 2]; (i) aa 1-325 of p53wt [SEQ ID NO: 2], fused via an Arg-Gly-Gly-Asn-Pro-Glu linker [SEQ ID NO: 9], to the heterologous sequence of (b) above, which is fused via an Ile linker, to aa 352-393 of p53wt [SEQ ID NO: 2];
(j) aa 1-323 of p53wt [SEQ ID NO: 2], fused via an Arg-Gly-Asn linker [SEQ ID NO: 8], to the heterologous sequence of (a) above, which is fused via an Ile linker to aa 352-393 of p53wt [SEQ ID NO: 2] ;
(k) aa 1-325 of p53wt [SEQ ID NO: 2], fused via an Arg-Gly-Gly-Asn-Pro-Glu linker [SEQ ID NO: 9], to the heterologous sequence of (b) above, which is fused via an Ile linker, to aa residues 352-393 of p53wt [SEQ ID NO: 2] ; and
(1) aa 1-334 of p53wt [SEQ ID NO: 2], fused via an Asn linker, to the heterologous sequence of (a) above,
definition are chimeric and mutant p53 proteins. Such proteins are known in the art.
Exemplary chimeric p53 proteins are described in detail in International Publication No. W096/16989, published June 6, 1996 and co-pending U.S. Patent
Application No. 08/347,792 and co-pending U.S. Patent application No. 08/431,357, which are incorporated by reference. For example, chimeric p53 proteins include proteins containing the N-terminal portion of p53 fused, optionally via a suitable linker, to a heterologous tetramerization domain. A heterologous tetramerization domain includes any sequence of amino acids heterologous to p53 which forms stable homo-tetramers. One particularly desirable tetramerization domain includes the tetrameric variant of the GCN4 LZ [Harbury et al,
Science, 262: 1401-1407 (1993)]. GCN4 numbering follows Hinnenbusch et al, Proc. Natl. Acad. Sci. USA, 81:6442- 6446 (1984) and Ellenberger et al, Cell, 71: 1223-1237 (1992) . Wild-type GCN4 is provided in SEQ ID NOS: 5 and 6. The LZ variant has Ile at positions d of the coil and Leu at positions a [SEQ ID NO: 33], in contrast to the original zipper which has Leu and Val, respectively. Suitable chimera include (from N-terminus to C-terminus) :
(a) aa 1-334 of p53wt [SEQ ID NO: 2], fused via an Asn linker, to a heterologous sequence spanning residues 249-281 of GCN4 containing isoleucines at positions d of the coiled coil and leucines at positions a [SEQ ID NO: 33] ;
(b) aa 1-334 of p53wt [SEQ ID NO: 2], fused via a Gly-Asn-Pro-Glu linker [SEQ ID NO: 7], to a heterologous sequence spanning residues 250-281 of GCN4 containing isoleucines at positions d of the coiled coil and leucines at positions a [SEQ ID NO: 33];
10 include the complementary DNA sequence representing the non-coding strand, the messenger RNA sequence, the corresponding cDNA sequence and the RNA sequence complementary to the messenger RNA sequence. The above nucleotide sequences can be included within larger DNA or RNA fragments, or may be interrupted by introns.
In another embodiment the nucleic acids encoding the modified proteins of the invention are present in the context of vectors suitable for amplification in prokaryotic or eukaryotic cells or for expression in cell-free extracts or lysates or in prokaryotic or eukaryotic cells. Many such vectors are known and many of these are commercially available. For example, plasmids with bacterial or yeast replication origins allow amplification in bacteria or yeast, respectively. Such vectors allow the production of large quantities of nucleic acids encoding the proteins of the invention, which nucleic acids can be used for gene therapy or for expression of the modified p53 proteins of the invention. Similarly, expression vectors are known. For example, the vector pGEM4 (Promega, Madison, WI) is suitable for expression of the p53 proteins in cell-free lysates, while the vector pSV2 [Mulligan et al, Proc. Natl. Acad. Sci. USA. 18:2072-2076 (1981)] is suitable for expression in mammalian cells. Such vectors allow the production of the modified proteins of the invention in vitro for analysis of their functional properties or for delivery to patients. Alternatively, one of skill in the art may readily select or construct another suitable expression vector.
III. Gene Therapy
The nucleic acid sequences of the invention may be inserted into a vector capable of targeting and infecting a desired cell, either in vivo or ex vivo for
which is fused via an Ile linker, to aa residues 352-393 of p53wt [SEQ ID NO: 2].
Also encompassed within the definition of "p53 protein constructs" are both naturally occurring and engineered mutant proteins. Exemplary mutants include p53 having glutamine at residue 248 (p53Q248) [SEQ ID NO: 11], p53 having histidine at residue 273 (p53H273) [SEQ ID NO: 12], and p53 having cysteine at residue 273 (p53C273) [SEQ ID NO: 13]. Other p53 mutants which may be susceptible to this R284 mutation are known in the art.
Modifying the p53 protein construct according to the method of the invention, involves altering the residue corresponding to aa residue 284 of human p53wt or of a p53 mutant containing the native Thr284 to Arg. This modification can be achieved by mutating the 284 codon using conventional site-directed mutagenesis techniques [R. Higuchi et al, in M. A. Innis et al, (eds.) , PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, pp. 177-183
(1990)]. For example, preferably, the native codon 284 (ACA) is modified by site-directed mutagenesis to CGA or preferably CGT, which encodes Arg. However, one of skill in the art can readily make alternative modifications resulting in an Arg codon at position 284.
Alternatively, conventional chemical synthesis techniques may be used to generate a p53 sequence containing this modification.
II. Nucleic Acid Sequences Encoding Modified p53 Proteins of the Invention
The present invention further provides nucleic acid sequences encoding the modified p53 protein constructs of this invention. In addition to the coding strand, the nucleic acid sequences of the invention
12 a pharmaceutically acceptable carrier, such as saline, and administered parenterally (or by other suitable means) in sufficient amounts to infect the desired cells and provide sufficient levels of p53 activity to arrest abnormal cellular proliferation. Other pharmaceutically acceptable carriers are well known to those of skill in the art. A suitable amount of the vector containing the chimeric nucleic acid sequences is between about 106 to 10° infectious particles per mL carrier. The delivery of the vector may be repeated as needed to sustain satisfactory levels of p53 activity, as determined by monitoring clinical symptoms.
As desired, this therapy may be combined with other therapies for the disease or condition being treated. For example, therapy involving the administration of a vector capable of expressing a modified p53 protein construct of the invention is well suited for use in conjunction with conventional cancer therapies, including surgery, radiation and chemotherapy. Alternatively, nucleic acid sequences driving expression of a p53 protein of the invention may also be introduced as "naked DNA" by "carriers" other than viral vectors, such as liposomes, nucleic acid-coated gold beads or can simply be suspended in saline or the like and injected in situ [Fujiwara et al (1994) , cited above; Fynan et al , Proc. Natl. Acad. Sci. USA, 90: 11478-11482 (1993) ; Cohen, Science. 259: 1691-1692 (1993) ; Wolff et al, Biotechniques, 11: 474-485 (1991)]. A suitable amount of nucleic acid is between about 10 μg to about 1 mg per mL carrier. However, one of skill in the art may modify the therapeutic dose as desired.
gene therapy, and causing the encoded modified p53 protein construct of this invention to be expressed by that cell. Many such viral vectors are useful for this purpose, e.g., adenoviruses, retroviruses and adeno-associated viruses (AAV) [Schreiber et al,
Biotechniques. 14 : 818-823 (1993) ; Davidson et al, Nature Genetics, 3 : 219-223 (1993) ; Roessler et al, J. Clin. Invest. , 92: 1085-1092 (1993) ; S ythe et al, Ann. Thorac. Surg.. 5_7: 1395-1401 (1994) ; Kaplitt et al, Nature Genetics, 8 : 148-154 (1994)]. There has already been success using viral vectors driving expression of wild-type p53 [Fujiwara et al, Cancer Res. , 53 : 4129-4133 (1993) ; Fujiwara et al, Cancer Res. , 54 : 2287-2291 (1994) ; Friedmann, Cancer. 70(6 Suppl) : 1810-1817 (1992) ; Fujiwara et al, Curr. Opin. Oncol., 6 : 96-105 (1994)]. For use in gene therapy, these viral vectors containing nucleic acid sequences encoding a modified p53 protein construct of the invention, are prepared by one of skill in the art with resort to conventional techniques (see references mentioned above) . For example, a recombinant viral vector, e.g. an adenovirus, of the present invention comprises DNA of at least that portion of the viral genome which is capable of infecting the target cells operatively linked to the nucleic acid sequences of the invention. By "infection" is generally meant the process by which a virus transfers genetic material to its host or target cell. Preferably, the virus used in the construction of a vector of the invention is rendered replication-defective to remove the effects of viral replication on the target cells. In such cases, the replication-defective viral genome can be packaged by a helper virus in association with conventional techniques.
Briefly, the vector(s) containing the nucleic acids encoding a protein of the invention is suspended in
14 effective to treat the conditions referred to below. A preferred dose of a pharmaceutical composition containing a protein of this invention is generally effective above about 0.1 mg modified p53 protein, and preferably from about 1 mg to about 100 mg. Dosage units of such pharmaceutical compositions containing the proteins of this invention preferably contain about 1 mg to 5 g of the protein. These doses may be administered with a frequency necessary to achieve and maintain satisfactory p53 DNA binding and tumor suppressor activity levels. Although a preferred range has been described above, alternative doses for treatment of each type of tumor or other condition may be determined by those of skill in the art.
V. Therapeutic Indications
The nucleic acids and proteins of the invention can be introduced into human patients for therapeutic benefits in conditions characterized by insufficient wild-type p53 activity. As stated above, the nucleic acids of the invention may be introduced into the patient in the form of a suitable viral vector (or by direct DNA delivery) to harness the patient's cellular machinery to express the proteins of the invention in vivo. Alternatively, the proteins of the invention may be introduced into the patient in appropriate pharmaceutical formulations as described above.
As one example, the pharmaceutical compositions of thiε invention, containing a protein of the invention or a nucleic acid or a viral vector which express a protein of the invention in vivo, may be employed to induce the cellular defense to DNA damaging agents. Examples of DNA damaging agents include sunlight, UV irradiation, as well as radiation and chemotherapeutics used for cancer treatment. By administering a suitable
IV. Pharmaceutical Compositions
The modified p53 protein constructs of this invention may also be formulated into pharmaceutical compositions and administered using a therapeutic regimen compatible with the particular formulation.
Pharmaceutical compositions within the scope of the present invention include compositions containing a protein of the invention in an effective amount to have the desired physiological effect, e.g. to arrest the growth of cancer cells without causing unacceptable toxicity for the patient.
Suitable carriers for parenteral administration include aqueous solutions of the active compounds in water-soluble or water-dispersible form, e.g. saline. Alternatively, suspensions of the active compounds may be administered in suitable conventional lipophilic carriers or in liposomes.
The compositions may be supplemented by active pharmaceutical ingredients, where desired. Optional antibacterial, antiseptic, and antioxidant agents in the compositions can perform their ordinary functions. The pharmaceutical compositions of the invention may further contain any of a number of suitable viscosity enhancers, stabilizers, excipients and auxiliaries which facilitate processing of the active compounds into preparations that can be used pharmaceutically. Preferably, these preparations, as well as those preparations discussed below, are designed for parenteral administration. However, compositions designed for oral or rectal administration are also considered to fall within the scope of the present invention.
Those of skill in the pharmaceutical art should be able to derive suitable dosages and schedules of administration. As used herein, the terms "suitable amount" or "effective amount" means an amount which is
16
Example 1 - p53 Protein Production
Plasmids of the pGEM series were used to generate in vitro translated p53 proteins, as previously described [T. Halazonetis and A. Kandil, EMBO J.. 12:5057-5064 (1993a); T. Halazonetis and A. Kandil, EMBO J. , 12:1021-1028 (1993b) ; J. L. Waterman et al, EMBO J. , 14:512-519 (1995)].
More specifically, plasmid pGEMhump53wt (also termed pGEMhp53wtB) encodes full-length human wild-type p53. This plasmid was prepared by PCR using a human p53 cDNA, which is readily available to those practicing the art. The PCR procedure was designed to incorporate unique restriction sites within the coding sequence of human p53: Kpn I at codon 218, Sst I at codon 299, Sst II at codon 333, Bst BI at codon 338 and Sal I immediately following the termination codon. An Msc I site at codon 138 was eliminated. These changes did not alter the sequence of the encoded p53, and were only performed to expedite construction of mutant proteins bearing altered tetramerization domains or point mutations associated with human cancer. The PCR product of the human p53 cDNA was digested with Neo I and Sal I and cloned in the vector pGEM4 [Promega, Madison, WI], which was linearized with Eco RI and Sal I. Synthetic oligonucleotides were used to bridge the Eco RI site of the vector and the Neo I site at the initiation codon of p53. Plasmid, pGEMhump53wt, was used to generate all the p53 mutants and modified p53 protein constructs described below, as well as for expression of wild-type p53 by in vitro translation [J.L.F. Waterman et al, EMBO J. , 14.: 512-519 (1995)]. The proteins were derived from pGEMhump53wt by site-directed mutagenesis [Higuchi, in Innis et al, PCR Protocols: A Guide to Methods and Applications, Academic Press, San Diego, pp. 177-183 (1990) ] of the codons indicated below. In vitro
amount of a composition of this invention, patients may tolerate higher doses of such DNA damaging agents.
Another therapeutic use of the compositions of this invention is in inducing apoptosis of specific cells, such as proliferating lymphocytes. According to this method of use, a suitable amount of an appropriate pharmaceutical composition of this invention is administered to a subject to enhance the development of immune tolerance. This method may employ both in vivo and ex vivo modes of administration. Preferably, this therapy is useful as the sole treatment or as an accessory treatment to prevent transplant rejection, or to treat autoimmune diseases, e.g. , systemic lupus erythrematosis, rheumatoid arthritis and the like. The pharmaceutical compositions of this invention may also be employed to restore p53 function in tumor cells. Desirably, a suitable amount of the composition of this invention is administered systemically, or locally to the site of the tumor with or without concurrent administration of conventional cancer therapy (i.e. DNA damaging agents) .
Additionally, the compositions of this invention may be administered in methods to suppress cell proliferation in diseases other than cancers, which are characterized by aberrant cell proliferation. Among such diseases are included psoriasis, atherosclerosis and arterial restenosis. This method is conducted by administering a suitable amount of the selected composition systemically or locally to the patient. These examples illustrate the preferred method for preparing exemplary modified p53 constructs of the invention and the biological activity of the modified p53 constructs. These examples are illustrative only and do not limit the scope of the invention.
18
Example 2 - DNA Binding Activity
The DNA binding activity of wild-type p53 is allosterically regulated by a basic region within the C- ter inal 30 amino acids of p53. Monoclonal antibodies that mask this regulatory region, such as PAb421, or deletion of this region stimulate binding to DNA [T. Halanonetis et al, EMBJO J. , 12:1021-1028 (1993) ; T.R. Hupp et al, Cell. 21:875-886 (1992) ; J.L.F. Waterman et al, EMBO J. , ϋ:512-519 (1995)]. Interestingly, some tumor-derived mutants have also been reported to bind DNA when allosterically activated by antibody PAb421 [T. Halazonetis and A. Kandil, EMBO J. , 12.: 5057-5064 (1993) ; T. Hupp et al, Nucl. Acids. Res.. 21:3167-3174 (1993); D. Niewolik et al, Oncogene, 10:881-890 (1995)]. The seven most common tumor-derived mutants: p53Hiεl75 [SEQ ID NO: 25], Gln248 [SEQ ID NO: 11], Trp248 [SEQ ID NO: 26], Ser249 [SEQ ID NO: 27], His273 [SEQ ID NO: 12], Trp282 [SEQ ID NO: 28] and Cys273 [SEQ ID NO: 13] [M. Hollstein et al, Science, 253:49-53 (1991)] were examined. The substitutions in these mutants target arginines 248 or 273 that contact DNA (Class I mutants) or arginines 175, 249 or 282 that stabilize the structure of the DNA binding domain (Class II mutants) [Y. Cho et al, Science. 265:346-355 (1994)]. Of the seven tumor- derived mutants, four recognized a high affinity p53 DNA site in the presence of PAb421 (Fig. 1) or when their C- ter inal 30 amino acids were deleted (Fig. 2) . Significantly, the mutants bound DNA in the presence of excess unlabeled non-specific DNA suggesting that they retain sequence specificity. Except for p53Trp248, allosteric activation enhanced DNA binding of all Class I mutants examined. DNA binding of Clasε II mutantε waε not activated, except for p53Trp282, which, like wild¬ type p53, bound DNA in the abεence and presence of PAb421. Thus, Class I mutants, which retain a native
translated proteins were expressed using SP6 transcribed mRNA and rabbit reticulocyte lysates, as previously described [Halazonetis et al, Cell. 5_5:917-925 (1988)]. The following proteins were generated in this manner: a) Wild-type p53 (p53wt) [SEQ ID NO: 2] b) Wild-type p53 containing the Thr284 to Arg subεtitution (p53R284) [SEQ ID NO: 3] c) Tumor-derived mutant p53 glutamine 248 (p53Q248) [SEQ ID NO: 11] d) Tumor-derived mutant p53 glutamine 248 containing the Thr284 to Arg substitution (p53Q248R284) [SEQ ID NO: 14] e) Tumor-derived mutant p53 mutant histidine 273 (p53H273) [SEQ ID NO: 12] f) Tumor-derived mutant p53 histidine 273 containing the Thr 284 to Arg substitution (p53H273R284) [SEQ ID NO: 15] g) Tumor-derived mutant p53 cysteine 273 (p53C273) [SEQ ID NO: 13] h) Tumor-derived mutant p53 cysteine 273 containing the Thr 284 to Arg substitution (p53C273R284) [SEQ ID NO: 16] .
Proteins corresponding to a) to h) , each containing a deletion of the C-terminal 30 amino acid of human p53 (Δ364-393) , were also generated [SEQ ID NOS: 17-24]. These deletions permit in vitro DNA binding.
In addition, plasmid pSV2hp53wtB was used to express wild-type p53 in mammalian cells [M.J.F. Waterman et al, Cancer Res.. 56:158-163 (1996)]. Plasmid pBC/TKseap has one copy of oligonucleotide BC [Halazonetis, EMBO J. , 12:1021-1028 (1993) cloned in the Eco RV site of pTKseap [Waterman, 1996] and expresses secreted alkaline phosphatase in a p53-responsive manner.
20
Thr284 with Arg. All other substitutions either suppresεed or had no effect on p53His273 DNA binding (Fig. 3) .
The effects of the substitution of Thr284 with Arg can be rationalized using molecular modeling.
Specifically, using the coordinates of the wild-type p53 DNA binding domain bound to DNA [Cho, cited above] an Arg side chain introduced at position 284 could form electrostatic interactions with the phosphate oxygen atoms of DNA closest to its α-carbon and without violating bond lengths and angles. Modeling was performed with Quanta 4.1 (Molecular Simulations Inc., Burlington, MA) .
In the following experiments, the effect of the Thr284 to Arg substitution on binding to natural DNA sites was examined in the context of wild-type p53, of p53His273 and of the other Class I p53 mutants.
All the proteins of Example 1 containing the 30 amino acid C-terminal deletion were expressed by in vitro translation and assayed for DNA binding using 0.2 ng 32P- labeled DNA and, where indicated below, 100 ng unlabeled competitor DNA [J. L. F. Waterman et al, EMBO J. , 14: 512- 519 (1995)], The analysis was restricted to the C- terminally truncated proteins because full-length p53 translated in vitro is in a latent state and cannot bind DNA unless activated by a C-terminal truncation or by a monoclonal antibody (PAb421) that binds to the p53 C- terminus [Waterman et al, cited above].
For analysiε of DNA binding activity, these proteins were incubated with 32P-labeled oligonucleotides and subjected to electrophoresis as described [Halazonetis (1993a and 1993b) and Waterman (1995) , both cited above] . Oligonucleotide BC, which has the following sequence (top strand) is: [SEQ ID NO: 29] CC-GGGCA-TGTCC- GGGCA-TGTCC-GGGCATGT, and oligonucleotide
structure of their DNA binding domain, have latent sequence-specific DNA binding activity, whereas Clasε II mutants, which have unfolded DNA binding domains [C. A. Finlay et al, Mol. Cell. Biol. , 8:531-539 (1988)], do not. Regarding the exceptions, we speculate that the large tryptophan side chain at position 248 precludes the p53Trp248 mutant from binding DNA due to steric interference with the DNA site. The ability of p53Trp282 to bind DNA may indicate that a small fraction of this mutant adopts the native fold.
Allosterically activated Class I p53 mutants compared favorably with wild-type p53 for binding to a high affinity DNA site. However, further experiments indicated that the mutants failed to recognize efficiently natural p53 sites, such as those present in the p2lcιp1 and gadd45 genes [W. S. El-Deiry et al, Cell. 25:817-825 (1993) , M.B. Kastan et al, Cell. 71:587-597 (1992)]. Since Class I p53 mutants apparently retain DNA binding sequence specificity, in an attempt to increaεe the affinity of Class I p53 mutantε for DNA, novel protein-DNA backbone contacts were introduced. Towards this goal, residues of the DNA binding domain of p53His273 were replaced with basic amino acids. The substitutions targeted essentially all the residues close to the DNA backbone, except for those that already contact DNA or those that unequivocally stabilize the three-dimensional structure of p53 [Cho, cited above]. The targeted residues were: Glyll7, Thrllδ, Alall9, Asn247, Thr284, Glu285 and Glu287. Substitution of Thr284 with Arg enhanced binding of p53His273 to the high affinity DNA εite, although binding waε still dependent on allosteric activation by antibody PAb421 (Fig. 3) . Substitution of Thr284 with Lys also enhanced binding of p53His273 to the high affinity DNA εite, but less than subεtitution of
22
56 -.158-163 (1996)]. The Class I p53 mutants had either weak (p53Hiε273) or no (p53Gln248 and p53Cys273) transcriptional activity. However, their transcriptional activity was enhanced to wild-type levels by the Thr284 to Arg substitution or, for p53Gln248, by combining the Thr284 to Arg substitution with C-terminal allosteric activation (Fig. 1) .
Tumor suppreεεing activity was tested in a colony formation assay, by cotransfecting Saos-2 osteosarcoma cells with 5 μg of pSV2hp53 expresεion plaεmid directing p53 expreεεion, 0.5 μg of pSV7neo, a plasmid that confers neomycin/G418 resistance [K. Zhang et al, Proc. Natl. Acad. Sci. USA. 82:6281-6285 (1990)] and 24 μg of pBC12/PLseap [T. D. Halazonetis, Anticancer Res. , 11:285-292 (1992)], a carrier plasmid. The transfected cellε were εelected for G418 reεiεtance, a neomycin relative. Two weeks later the colonies were stained with cryεtal violet and counted. High tumor suppressor activity corresponds to low colony formation.
Table 1
Tumor Colonies
Expressed Protein SE0 ID NO: (mean ± 1 S.E.) Human wild-type p53 2 11.3 ± 3.3 Human p53Δ364-393 17 17.7 ± 4.8 Human p53Arg284 3 2.0 ± 1.0
Human p53Arg284Δ364-393 18 4.7 ± 0.7
As illustrated in Table 1 above and in Fig. 5, the proteins containing the Arg284 modification εuppreεεed tumor colony formation more efficiently than the corresponding proteins without the Arg284 modification (Table 1) . The magnitude of the effect is greater for the tumor-derived p53 mutants; however, even
Ep21, which has the following sequence: [SEQ ID NO: 30] CCC-GAACA-TGTCC-CAACA-TGTTG-GGG, each contain a p53 binding site, which is underlined. The BC oligonucleotide has a high affinity p53-binding site, while oligonucleotide Ep21 contains a lower affinity site, which is present in the regulatory sequences of the p21 gene [W. S. El-Deiry et al, Cell , 25:817-825 (1993)]. Oligonucleotide Egadd45 has the sequence [SEQ ID NO: 31] ACA-GAACA-TGTCT-AAGCA-TGCTG-GGGA. Oligonucleotide TF3 , which contains three tandem repeats of [SEQ ID NO: 32] ATCACGTGAT, is a non-specific DNA [Halazonetis et al, EMBO J. , 11:1021-1028 (1993)].
The Thr284 to Arg substitution enhanced binding of all p53 proteins examined (Fig. 4) . For wild-type p53 the effect is evident with oligonucleotides BC and Ep21, for p53Gln248 it is evident with oligonucleotide BC, for p53His273 and p53Cys273 it is evident with all oligonucleotides tested (Fig. 4) .
Example 3 - Transcription and Tumor Suppresεion Aεεayε The proteins of Example 1 were examined for their transcriptional activity and tumor suppresεor activity. Wild-type p53 activateε transcription of target genes and suppresses tumor growth, whereas tumor- derived mutants lack both these activities [S.E. Kern et al, Science. 256: 827-830 (1992) ; C. A. Finlay et al, Cell, 5_7:1083-1093 (1989)]. The transcriptional activities of wild-type p53 and various p53 mutants were asεayed with a p53-responsive reporter plaεmid in Saos-2 human oεteoεarcoma cells, which lack endogenous p53 [M.J.F. Waterman et al, Cancer Res. , 5_6:158-163 (1996)]. More particularly, transcriptional activity was determined by transfecting Saos-2 cellε with 2.5μg pSV2hp53 expreεεion plasmid and 27.5 μg pBC/TKseap or pTKseap reporter plaε ids [Waterman et al, Cancer Res. ,
24
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(i) APPLICANT: Wistar Institute of Anatomy & Biology
(ii) TITLE OF INVENTION: Modified p53 Constructs and Uses
Therefor
(iii) NUMBER OF SEQUENCES: 33
(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Howson and Howson
(B) STREET: Spring House Corporate Cntr. , PO Box 457
(C) CITY: Spring House
(D) STATE: Pennsylvania
(E) COUNTRY: USA
(F) ZIP: 19477
(V) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: Patentin Release #1.0, Version #1.30
(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: WO
(B) FILING DATE:
(C) CLASSIFICATION:
(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 60/004,802
(B) FILING DATE: 22-SEP-1995
(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Kodroff, Cathy A.
(B) REGISTRATION NUMBER: 33,980
(C) REFERENCE/DOCKET NUMBER: WST64APCT
(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 215-540-9206
(B) TELEFAX: 215-540-5818
(2) INFORMATION FOR SEQ ID NO:l:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1317 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
the tumor suppressor activity of wild-type p53 is enhanced by the Arg284 modification.
While the effect of the Arg284 modification seems to be greater for tumor-derived mutants, rather than wild-type p53, this is a reflection of the limitations of the asεayε uεed. In theεe aεsays, wild- type p53 demonstrates high activity. If the assays were adjusted so that wild-type p53 would have low activity, then the effect of the Arg284 modification would be as dramatic as observed with the tumor-derived p53 mutants.
Numerous modifications and variations of the present invention are included in the above-identified specification and are expected to be obvious to one of skill in the art. Such modifications and alterations to the compoεitions and processes of the preεent invention are believed to be encompaεεed in the scope of the claims appended hereto.
26
AAC AAG ATG TTT TGC CAA CTG GCC AAG ACC TGC CCT GTG CAG 267 Asn Lys Met Phe Cys Gin Leu Ala Lys Thr Cys Pro Val Gin
135 140
CTG TGG GTT GAT TCC ACA CCC CCG CCC GGC ACC CGC GTC CGC 609 Leu Trp Val Asp Ser Thr Pro Pro Pro Gly Thr Arg Val Arg 145 150 155
GCC ATG GCC ATC TAC AAG CAG TCA CAG CAC ATG ACG GAG GTT 651 Ala Met Ala Ile Tyr Lys Gin Ser Gin His Met Thr Glu Val 160 165 170
GTG AGG CGC TGC CCC CAC CAT GAG CGC TGC TCA GAT AGC GAT 693 Val Arg Arg Cys Pro His Hiε Glu Arg Cys Ser Asp Ser Asp 175 180 185
GGT CTG GCC CCT CCT CAG CAT CTT ATC CGA GTG GAA GGA AAT 735 Gly Leu Ala Pro Pro Gin His Leu Ile Arg Val Glu Gly Asn 190 195 200
TTG CGT GTG GAG TAT TTG GAT GAC AGA AAC ACT TTT CGA CAT 777 Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn Thr Phe Arg His
205 210
AGT GTG GTG GTG CCC TAT GAG CCG CCT GAG GTT GGC TCT GAC 819 Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val Gly Ser Asp 215 220 225
TGT ACC ACC ATC CAC TAC AAC TAC ATG TGT AAC AGT TCC TGC 861 Cys Thr Thr Ile His Tyr Aεn Tyr Met Cys Asn Ser Ser Cys 230 235 240
ATG GGC GGC ATG AAC CGG AGA CCC ATC CTC ACC ATC ATC ACA 903 Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile Thr 245 250 255
CTG GAA GAC TCC AGT GGT AAT CTA CTG GGA CGG AAC AGC TTT 945 Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe 260 265 270
GAG GTG CGT GTT TGT GCC TGT CCT GGG AGA GAC CGG CGC ACA 987 Glu Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr
275 280
GAG GAA GAG AAT CTC CGC AAG AAA GGG GAG CCT CAC CAC GAG 1029 Glu Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro Hiε Hiε Glu 285 290 295
CTG CCC CCA GGG AGC ACT AAG CGA GCA CTG CCC AAC AAC ACC 1071 Leu Pro Pro Gly Ser Thr Lys Arg Ala Leu Pro Aεn Aεn Thr 300 305 310
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 136..1314
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l:
GTCTAGAGCC ACCGTCCAGG GAGCAGGTAG CTGCTGGGCT CCGGGGACAC 50
TTTGCGTTCG GGCTGGGAGC GTGCTTTCCA CGACGGTGAC ACGCTTCCCT 100
GGATTGGCAG CCAGACTGCC TTCCGGGTCA CTGCC ATG GAG GAG CCG 147
Met Glu Glu Pro 1
CAG TCA GAT CCT AGC GTC GAG CCC CCT CTG AGT CAG GAA ACA 189 Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser Gin Glu Thr 5 10 15
TTT TCA GAC CTA TGG AAA CTA CTT CCT GAA AAC AAC GTT CTG 231 Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn Val Leu 20 25 30
TCC CCC TTG CCG TCC CAA GCA ATG GAT GAT TTG ATG CTG TCC 273 Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu Ser 35 40 45
CCG GAC GAT ATT GAA CAA TGG TTC ACT GAA GAC CCA GGT CCA 315 Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro 50 55 60
GAT GAA GCT CCC AGA ATG CCA GAG GCT GCT CCC CCC GTG GCC 357 Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala
65 70
CCT GCA CCA GCA GCT CCT ACA CCG GCG GCC CCT GCA CCA GCC 399 Pro Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala 75 80 85
CCC TCC TGG CCC CTG TCA TCT TCT GTC CCT TCC CAG AAA ACC 441 Pro Ser Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr 90 95 100
TAC CAG GGC AGC TAC GGT TTC CGT CTG GGC TTC TTG CAT TCT 483 Tyr Gin Gly Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser 105 110 115
GGG ACA GCC AAG TCT GTA ACT TGC ACG TAC TCC CCT GCC CTC 525 Gly Thr Ala Lys Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu 120 125 130
28
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Aεn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His Hiε Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Aεp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Glu Val Arg Val Cys Ala Cys Pro Gly Arg Aεp Arg Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lyε Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
AGC TCC TCT CCC CAG CCA AAG AAG AAA CCA CTG GAT GGA GAA 1113
Ser Ser Ser Pro Gin Pro Lys Lys Lyε Pro Leu Asp Gly Glu
315 320 325
TAT TTC ACC CTT CAG ATC CGT GGG CGT GAG CGC TTC GAG ATG 1155
Tyr Phe Thr Leu Gin Ile Arg Gly Arg Glu Arg Phe Glu Met
330 335 340
TTC CGA GAG CTG AAT GAG GCC TTG GAA CTC AAG GAT GCC CAG 1197
Phe Arg Glu Leu Asn Glu Ala Leu Glu Leu Lyε Aεp Ala Gin
345 350
GCT GGG AAG GAG CCA GGG GGG AGC AGG GCT CAC TCC AGC CAC 1239
Ala Gly Lyε Glu Pro Gly Gly Ser Arg Ala His Ser Ser His 355 360 365
CTG AAG TCC AAA AAG GGT CAG TCT ACC TCC CGC CAT AAA AAA 1281
Leu Lys Ser Lys Lys Gly Gin Ser Thr Ser Arg His Lys Lyε 370 375 380
CTC ATG TTC AAG ACA GAA GGG CCT GAC TCA GAC TGA 1317
Leu Met Phe Lys Thr Glu Gly Pro Asp Ser Asp
385 390
(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
Met Glu Glu Pro Gin Ser Aεp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Aεp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
30
Ser Val Thr Cyε Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Aεp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Aεn Ser Phe
260 265 270
Glu Val Arg Val Cyε Ala Cyε Pro Gly Arg Asp Arg Arg Arg Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lyε Lys Gly Gin
365 370 375
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg Ala His Ser Ser Hiε Leu Lys Ser Lys Lys Gly Gin
365 370 375
Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro
380 385 390
Asp Ser Asp
(2) INFORMATION FOR SEQ ID NO: 3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser
1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
32
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro Hiε His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin Hiε Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Aεp Arg Aεn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile Hiε Tyr Aεn Tyr Met Cyε Asn Ser
230 235 240
Ser Cyε Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Glu Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Lyε Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro Hiε His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Aεn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lyε Lys Gly Gin
365 370 375
Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro
380 385 390
Asp Ser Asp
Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro
380 385 390
Asp Ser Asp
(2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser
1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Aεp Pro Gly Pro
50 55 60
Aεp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lyε Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu Hiε Ser Gly Thr Ala Lyε
110 115 120
Ser Val Thr Cyε Thr Tyr Ser Pro Ala Leu Asn Lyε Met Phe Cyε
125 130 135
Gin Leu Ala Lyε Thr Cyε Pro Val Gin Leu Trp Val Aεp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
34
AGT TTA TTT GCT TTA AAT CCA ATG GGT TTC TCA CCA TTG GAT 837 Ser Leu Phe Ala Leu Asn Pro Met Gly Phe Ser Pro Leu Asp 10 15 20
GGT TCT AAA TCA ACC AAC GAA AAT GTA TCT GCT TCC ACT TCT 879 Gly Ser Lys Ser Thr Asn Glu Asn Val Ser Ala Ser Thr Ser
25 30
ACT GCC AAA CCA ATG GTT GGC CAA TTG ATT TTT GAT AAA TTC 921 Thr Ala Lys Pro Met Val Gly Gin Leu Ile Phe Asp Lys Phe 35 40 45
ATC AAG ACT GAA GAG GAT CCA ATT ATC AAA CAG GAT ACC CCT 963 Ile Lys Thr Glu Glu Asp Pro Ile Ile Lys Gin Asp Thr Pro 50 55 60
TCG AAC CTT GAT TTT GAT TTT GCT CTT CCA CAA ACG GCA ACT 1005 Ser Asn Leu Asp Phe Asp Phe Ala Leu Pro Gin Thr Ala Thr 65 70 75
GCA CCT GAT GCC AAG ACC GTT TTG CCA ATT CCG GAG CTA GAT 1047 Ala Pro Aεp Ala Lyε Thr Val Leu Pro Ile Pro Glu Leu Aεp 80 85 90
GAC GCT GTA GTG GAA TCT TTC TTT TCG TCA AGC ACT GAT TCA 1089 Asp Ala Val Val Glu Ser Phe Phe Ser Ser Ser Thr Asp Ser
95 100
ACT CCA ATG TTT GAG TAT GAA AAC CTA GAA GAC AAC TCT AAA 1131 Thr Pro Met Phe Glu Tyr Glu Asn Leu Glu Asp Asn Ser Lys 105 110 115
GAA TGG ACA TCC TTG TTT GAC AAT GAC ATT CCA GTT ACC ACT 1173 Glu Trp Thr Ser Leu Phe Asp Asn Asp Ile Pro Val Thr Thr 120 125 130
GAC GAT GTT TCA TTG GCT GAT AAG GCA ATT GAA TCC ACT GAA 1215 Asp Asp Val Ser Leu Ala Asp Lys Ala Ile Glu Ser Thr Glu 135 140 145
GAA GTT TCT CTG GTA CCA TCC AAT CTG GAA GTC TCG ACA ACT 1257 Glu Val Ser Leu Val Pro Ser Asn Leu Glu Val Ser Thr Thr 150 155 160
TCA TTC TTA CCC ACT CCT GTT CTA GAA GAT GCT AAA CTG ACT 1299 Ser Phe Leu Pro Thr Pro Val Leu Glu Asp Ala Lys Leu Thr 165 170 175
CAA ACA AGA AAG GTT AAG AAA CCA AAT TCA GTC GTT AAG AAG 1341 Gin Thr Arg Lys Val Lys Lys Pro Asn Ser Val Val Lys Lys
180 185
(2) INFORMATION FOR SEQ ID NO:5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1824 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic)
(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 778..1620
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
ATCTTCGGGG ATATAAAGTG CATGAGCATA CATCTTGAAA AAAAAAGATG 50
AAAAATTTCC GACTTTAAAT ACGGAAGATA AATACTCCAA CCTTTTTTTC 100
CAATTCCGAA ATTTTAGTCT TCTTTAAAGA AGTTTCGGCT CGCTGTCTTA 150
CCTTTTAAAA TCTTCTACTT CTTGACAGTA CTTATCTTCT TATATAATAG 200
ATATACAAAA CAAAACAAAA CAAAAACTCA CAACACAGGT TACTCTCCCC 250
CCTAAATTCA AATTTTTTTT GCCCATCAGT TTCACTAGCG AATTATACAA 300
CTCACCAGCC ACACAGCTCA CTCATCTACT TCGCAATCAA AACAAAATAT 350
TTTATTTTAG TTCAGTTTAT TAAGTTATTA TCAGTATCGT ATTAAAAAAT 400
TAAAGATCAT TGAAAAATGG CTTGCTAAAC CGATTATATT TTGTTTTTAA 450
AGTAGATTAT TATTAGAAAA TTATTAAGAG AATTATGTGT TAAATTTATT 500
GAAAGAGAAA ATTTATTTTC CCTTATTAAT TAAAGTCCTT TACTTTTTTT 550
GAAAACTGTC AGTTTTTTGA AGAGTTATTT GTTTTGTTAC CAATTGCTAT 600
CATGTACCCG TAGAATTTTA TTCAAGATGT TTCCGTAACG GTTACCTTTC 650
TGTCAAATTA TCCAGGTTTA CTCGCCAATA AAAATTTCCC TATACTATCA 700
TTAATTAAAT CATTATTATT ACTAAAGTTT TGTTTACCAA TTTGTCTGCT 750
CAAGAAAATA AATTAAATAC AAATAAA ATG TCC GAA TAT CAG CCA 795
Met Ser Glu Tyr Gin Pro 1 5
36
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
Met Ser Glu Tyr Gin Pro Ser Leu Phe Ala Leu Asn Pro Met Gly
1 5 10 15
Phe Ser Pro Leu Asp Gly Ser Lys Ser Thr Asn Glu Asn Val Ser
20 25 30
Ala Ser Thr Ser Thr Ala Lys Pro Met Val Gly Gin Leu Ile Phe
35 40 45
Asp Lys Phe Ile Lys Thr Glu Glu Asp Pro Ile Ile Lys Gin Asp
50 55 60
Thr Pro Ser Asn Leu Aεp Phe Asp Phe Ala Leu Pro Gin Thr Ala
65 70 75
Thr Ala Pro Asp Ala Lys Thr Val Leu Pro Ile Pro Glu Leu Asp
80 85 90
Asp Ala Val Val Glu Ser Phe Phe Ser Ser Ser Thr Asp Ser Thr
95 100 105
Pro Met Phe Glu Tyr Glu Asn Leu Glu Asp Asn Ser Lys Glu Trp
110 115 120
Thr Ser Leu Phe Asp Aεn Asp Ile Pro Val Thr Thr Aεp Aεp Val
125 130 135
Ser Leu Ala Asp Lys Ala Ile Glu Ser Thr Glu Glu Val Ser Leu
140 145 150
Val Pro Ser Asn Leu Glu Val Ser Thr Thr Ser Phe Leu Pro Thr
155 160 165
Pro Val Leu Glu Asp Ala Lys Leu Thr Gin Thr Arg Lys Val Lys
170 175 180
Lys Pro Asn Ser Val Val Lys Lyε Ser His His Val Gly Lys Asp
185 190 195
Asp Glu Ser Arg Leu Asp His Leu Gly Val Val Ala Tyr Asn Arg
200 205 210
Lys Gin Arg Ser lie Pro Leu Ser Pro Ile Val Pro Glu Ser Ser
215 220 225
Asp Pro Ala Ala Leu Lys Arg Ala Arg Asn Thr Glu Ala Ala Arg
230 235 240
TCA CAT CAT GTT GGA AAG GAT GAC GAA TCG AGA CTG GAT CAT 1383 Ser His His Val Gly Lys Asp Asp Glu Ser Arg Leu Asp His 190 195 200
CTA GGT GTT GTT GCT TAC AAC CGC AAA CAG CGT TCG ATT CCA 1425 Leu Gly Val Val Ala Tyr Asn Arg Lys Gin Arg Ser Ile Pro 205 210 215
CTT TCT CCA ATT GTG CCC GAA TCC AGT GAT CCT GCT GCT CTA 1467 Leu Ser Pro Ile Val Pro Glu Ser Ser Asp Pro Ala Ala Leu 220 225 230
AAA CGT GCT AGA AAC ACT GAA GCC GCC AGG CGT TCT CGT GCG 1509 Lys Arg Ala Arg Asn Thr Glu Ala Ala Arg Arg Ser Arg Ala 235 240 245
AGA AAG TTG CAA AGA ATG AAA CAA CTT GAA GAC AAG GTT GAA 1551 Arg Lys Leu Gin Arg Met Lys Gin Leu Glu Asp Lys Val Glu
250 255
GAA TTG CTT TCG AAA AAT TAT CAC TTG GAA AAT GAG GTT GCC 1593 Glu Leu Leu Ser Lys Asn Tyr His Leu Glu Asn Glu Val Ala 260 265 270
AGA TTA AAG AAA TTA GTT GGC GAA CGC TGATTTCATT 1630
Arg Leu Lys Lys Leu Val Gly Glu Arg 275 280
TACCTTTTAT TTTATATTTT TTATTTCATT CTCGTGTATA ACGAAATAGA 1680
TACATTCACT TAGATAAGAA TTTAATCTTT TTTATGCCAA TTTTCTTAAG 1730
TAGAATTTTA CACCACGCAT TTATAATCTG CCGTATGTTC TGGTATTTAC 1780
TGGTTAGGAA TAGATAAAAA AAACACTCAC GATGGGGGTC GAAC 1824
(2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 281 amino acids
(B) TYPE: amino acid (D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
38
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
Arg Gly Gly Asn Pro Glu 1 5
(2) INFORMATION FOR SEQ ID NO: 10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10
Gly Gly Asn Gin Ala
1 5
(2) INFORMATION FOR SEQ ID NO: 11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Arg Ser Arg Ala Arg Lys Leu Gin Arg Met Lys Gin Leu Glu Asp
245 250 255
Lys Val Glu Glu Leu Leu Ser Lys Asn Tyr His Leu Glu Asn Glu
260 265 270
Val Ala Arg Leu Lys Lys Leu Val Gly Glu Arg
275 280
(2) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
Gly Asn Pro Glu
1
(2) INFORMATION FOR SEQ ID NO:8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
Arg Gly Asn 1
(2) INFORMATION FOR SEQ ID NO:9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: peptide
40
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg Ala His Ser Ser His Leu Lyε Ser Lyε Lyε Gly Gin
365 370 375
Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro
380 385 390
Asp Ser Asp
(2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu Hiε Ser Gly Thr Ala Lys
110 115 120
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lyε
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Aεp Aεp Arg Aεn
200 205 210
Thr Phe Arg Hiε Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Aεp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Gin Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Aεn Leu Leu Gly Arg Aεn Ser Phe
260 265 270
Glu Val Arg Val Cyε Ala Cyε Pro Gly Arg Aεp Arg Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lyε Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lyε Lyε Pro Leu Aεp Gly Glu Tyr Phe Thr Leu
320 325 330
42
Ser Thr Ser Arg His Lys Lyε Leu Met Phe Lyε Thr Glu Gly Pro
380 385 390
Asp Ser Asp
(2) INFORMATION FOR SEQ ID NO: 13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp L«ΪU Met Leu
35 40 45
Ser Pro Aεp Aεp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Aεp Cyε Thr Thr Ile Hiε Tyr Aεn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Aεn Leu Leu Gly Arg Aεn Ser Phe
260 265 270
Glu Val Hiε Val Cyε Ala Cyε Pro Gly Arg Aεp Arg Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Aεn Aεn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lys Lys Gly Gin
365 370 375
44 (2) INFORMATION FOR SEQ ID NO: 14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Aεp Leu Trp Lyε Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lyε
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin Hiε Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Ser Gin His Met Thr Glu Val Val Arg Arg Cyε Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Aεn Leu Leu Gly Arg Aεn Ser Phe
260 265 270
Glu Val Cyε Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Aεn Aεn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lyε Lyε Lyε Pro Leu Aεp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Aεn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lys Lys Gly Gin
365 370 375
Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro
380 385 390
Asp Ser Asp
46
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser
1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Aεp Ile Glu Gin Trp Phe Thr Glu Aεp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Aεp Aεp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile Hiε Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile Hiε Tyr Aεn Tyr Met Cyε Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Gin Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Glu Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Arg Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lyε Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lys Lys Gly Gin
365 370 375
Ser Thr Ser Arg His Lyε Lyε Leu Met Phe Lyε Thr Glu Gly Pro
380 385 390
Asp Ser Asp
(2) INFORMATION FOR SEQ ID NO: 15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
48
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cyε Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Aεp Gly Leu Ala Pro Pro Gin Hiε Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Glu Val Cys Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Arg Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Glu Val His Val Cys Ala Cys Pro Gly Arg Aεp Arg Arg Arg Glu
275 280 285
Glu Glu Aεn Leu Arg Lyε Lyε Gly Glu Pro Hiε His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Aεn
335 340 345
Glu Ala Leu Glu Leu Lyε Aεp Ala Gin Ala Gly Lyε Glu Pro Gly
350 355 360
Gly Ser Arg Ala Hiε Ser Ser His Leu Lyε Ser Lyε Lyε Gly Gin
365 370 375
Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro
380 385 390
Asp Ser Asp
(2) INFORMATION FOR SEQ ID NO: 16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Aεp Leu Met Leu
35 40 45
50
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cyε Thr Tyr Ser Pro Ala Leu Aεn Lyε Met Phe Cyε
125 130 135
Gin Leu Ala Lyε Thr Cyε Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Aεn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Glu Val Arg Val Cys Ala Cyε Pro Gly Arg Asp Arg Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Aεn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lyε Lyε Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lyε Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lys Lys Gly Gin
365 370 375
Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro
380 385 390
Asp Ser Asp
(2) INFORMATION FOR SEQ ID NO: 17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 363 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Aεp Leu Trp Lyε Leu Leu Pro Glu Aεn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Aεp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
52
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Aεp Arg Asn
200 205 210
Thr Phe Arg Hiε Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Aεp Cyε Thr Thr Ile Hiε Tyr Aεn Tyr Met Cyε Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Glu Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Arg Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lyε Lys Lys Pro Leu Aεp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lyε Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg
(2) INFORMATION FOR SEQ ID NO: 19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 363 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg
(2) INFORMATION FOR SEQ ID NO: 18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 363 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser
1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cyε
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
54
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Aεn Ser Phe
260 265 270
Glu Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Aεn Aεn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lyε Lyε Lyε Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg
(2) INFORMATION FOR SEQ ID NO:20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 363 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Aεp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser
1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Aεp Leu Met Leu
35 40 45
Ser Pro Aεp Aεp Ile Glu Gin Trp Phe Thr Glu Aεp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cyε Pro Val Gin Leu Trp Val Aεp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lyε Gin
155 160 165
Ser Gin Hiε Met Thr Glu Val Val Arg Arg Cyε Pro Hiε His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Aεn Ser
230 235 240
Ser Cyε Met Gly Gly Met Asn Gin Arg Pro Ile Leu Thr Ile Ile
245 250 255
56
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg
(2) INFORMATION FOR SEQ ID NO: 21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 363 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Aεp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Aεp Pro Gly Pro
50 55 60
Aεp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lyε Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cyε Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cyε
125 130 135
Gin Leu Ala Lys Thr Cyε Pro Val Gin Leu Trp Val Aεp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Aεp Ser Aεp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Aεp Asp Arg Asn
200 205 210
Thr Phe Arg Hiε Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Aεn Gin Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Aεp Ser Ser Gly Aεn Leu Leu Gly Arg Aεn Ser Phe
260 265 270
Glu Val Arg Val Cyε Ala Cyε Pro Gly Arg Aεp Arg Arg Arg Glu
275 280 285
Glu Glu Aεn Leu Arg Lyε Lyε Gly Glu Pro His Hiε Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lyε Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lyε Lyε Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
58 (2) INFORMATION FOR SEQ ID NO:22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 363 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Aεp Leu Met Leu
35 40 45
Ser Pro Aεp Aεp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu Hiε Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lyε Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Aεp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cyε Thr Thr Ile Hiε Tyr Aεn Tyr Met Cyε Aεn Ser
230 235 240
Ser Cyε Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Glu Val Hiε Val Cyε Ala Cyε Pro Gly Arg Aεp Arg Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lyε Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg
60
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Aεn
200 205 210
Thr Phe Arg Hiε Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Aεp Aεp Arg Asn 200 205 210
Thr Phe Arg Hiε Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val 215 220 225
Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser 230 235 240
Ser Cys Met Gly Gly Met Aεn Arg Arg Pro Ile Leu Thr Ile Ile 245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe 260 265 270
Glu Val His Val Cys Ala Cyε Pro Gly Arg Asp Arg Arg Arg Glu 275 280 285
Glu Glu Asn Leu Arg Lyε Lyε Gly Glu Pro Hiε Hiε Glu Leu Pro 290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Aεn Thr Ser Ser Ser 305 310 315
Pro Gin Pro Lys Lyε Lyε Pro Leu Asp Gly Glu Tyr Phe Thr Leu 320 325 330
Gin lie Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn 335 340 345
Glu Ala Leu Glu Leu Lyε Aεp Ala Gin Ala Gly Lyε Glu Pro Gly 350 355 360
Gly Ser Arg
(2) INFORMATION FOR SEQ ID NO:23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 363 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
62
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cyε Pro Hiε Hiε Glu
170 175 180
Arg Cyε Ser Aεp Ser Aεp Gly Leu Ala Pro Pro Gin Hiε Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile Hiε Tyr Aεn Tyr Met Cyε Aεn Ser
230 235 340
Ser Cyε Met Gly Gly Met Aεn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Glu Val Cyε Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Arg Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Aεn Aεn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lyε Lyε Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Val Cys Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro Hiε Hiε Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lyε Arg Ala Leu Pro Aεn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lys Lyε Pro Leu Aεp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg
(2) INFORMATION FOR SEQ ID NO:24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 363 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lyε Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
64
Ser Gin His Met Thr Glu Val Val Arg His Cys Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Aεn Ser Phe
260 265 270
Glu Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lyε Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lys Lys Gly Gin
365 370 375
Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro
380 385 390
Asp Ser Asp
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg
(2) INFORMATION FOR SEQ ID NO: 25:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser
1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Aεp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lyε Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
66
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Aεp Cyε Thr Thr Ile Hiε Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Trp Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Glu Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lyε Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Aεn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lyε Lyε Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lyε Glu Pro Gly
350 355 360
Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lys Lyε Gly Gin
365 370 375
Ser Thr Ser Arg His Lyε Lys Leu Met Phe Lys Thr Glu Gly Pro
380 385 390
Asp Ser Asp
(2) INFORMATION FOR SEQ ID NO: 27:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(2) INFORMATION FOR SEQ ID NO: 26:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser
1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu
35 40 45
Ser Pro Asp Aεp Ile Glu Gin Trp Phe Thr Glu Aεp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lyε
110 115 120
Ser Val Thr Cyε Thr Tyr Ser Pro Ala Leu Aεn Lyε Met Phe Cyε
125 130 135
Gin Leu Ala Lyε Thr Cyε Pro Val Gin Leu Trp Val Aεp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lyε Gin
155 160 165
Ser Gin Hiε Met Thr Glu Val Val Arg Arg Cyε Pro His His Glu
170 175 180
Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
68
Thr Leu Glu Asp Ser Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe
260 265 270
Glu Val Arg Val Cys Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lyε Glu Pro Gly
350 355 360
Gly Ser Arg Ala Hiε Ser Ser His Leu Lyε Ser Lys Lys Gly Gin
365 370 375
Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro
380 385 390
Asp Ser Asp
(2) INFORMATION FOR SEQ ID NO: 28:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 393 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
( i) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Aεp Asp Leu Met Leu
35 40 45
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser 1 5 10 15
Gin Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn
20 25 30
Val Leu Ser Pro Leu Pro Ser Gin Ala Met Aεp Asp Leu Met Leu
35 40 45
Ser Pro Aεp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lys Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser Gly Thr Ala Lys
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Cys
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu
170 175 180
Arg Cyε Ser Aεp Ser Aεp Gly Leu Ala Pro Pro Gin His Leu Ile
185 190 195
Arg Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn
200 205 210
Thr Phe Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Asp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Ser Pro Ile Leu Thr Ile Ile
245 250 255
70
Pro Gly Ser Thr Lys Arg Ala Leu Pro Asn Asn Thr Ser Ser Ser
305 310 315
Pro Gin Pro Lys Lys Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu
320 325 330
Gin Ile Arg Gly Arg Glu Arg Phe Glu Met Phe Arg Glu Leu Asn
335 340 345
Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly Lys Glu Pro Gly
350 355 360
Gly Ser Arg Ala His Ser Ser His Leu Lyε Ser Lys Lys Gly Gin
365 370 375
Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu Gly Pro
380 385 390
Asp Ser Asp
(2) INFORMATION FOR SEQ ID NO:29:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: CCGGGCATGT CCGGGCATGT CCGGGCATGT 30
(2) INFORMATION FOR SEQ ID NO:30:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: CCCGAACATG TCCCAACATG TTGGGG 26
Ser Pro Asp Asp Ile Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro
50 55 60
Asp Glu Ala Pro Arg Met Pro Glu Ala Ala Pro Pro Val Ala Pro
65 70 75
Ala Pro Ala Ala Pro Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser
80 85 90
Trp Pro Leu Ser Ser Ser Val Pro Ser Gin Lyε Thr Tyr Gin Gly
95 100 105
Ser Tyr Gly Phe Arg Leu Gly Phe Leu Hiε Ser Gly Thr Ala Lyε
110 115 120
Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lyε Met Phe Cyε
125 130 135
Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp Ser Thr
140 145 150
Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala Ile Tyr Lys Gin
155 160 165
Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His Hiε Glu
170 175 180
Arg Cyε Ser Aεp Ser Aεp Gly Leu Ala Pro Pro Gin Hiε Leu Ile
185 190 195
Arg Val Glu Gly Aεn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn
200 205 210
Thr Phe Arg Hiε Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val
215 220 225
Gly Ser Aεp Cys Thr Thr Ile His Tyr Asn Tyr Met Cys Asn Ser
230 235 240
Ser Cys Met Gly Gly Met Asn Arg Arg Pro Ile Leu Thr Ile Ile
245 250 255
Thr Leu Glu Asp Ser Ser Gly Aεn Leu Leu Gly Arg Aεn Ser Phe
260 265 270
Glu Val Arg Val Cyε Ala Cyε Pro Gly Arg Aεp Trp Arg Thr Glu
275 280 285
Glu Glu Asn Leu Arg Lys Lys Gly Glu Pro His His Glu Leu Pro
290 295 300
72
Ala Ser Thr Ser Thr Ala Lys Pro Met Val Gly Gin Leu Ile Phe
35 40 45
Asp Lys Phe Ile Lyε Thr Glu Glu Asp Pro Ile Ile Lys Gin Asp
50 55 60
Thr Pro Ser Asn Leu Asp Phe Asp Phe Ala Leu Pro Gin Thr Ala
65 70 75
Thr Ala Pro Aεp Ala Lys Thr Val Leu Pro Ile Pro Glu Leu Asp
80 85 90
Asp Ala Val Val Glu Ser Phe Phe Ser Ser Ser Thr Asp Ser Thr
95 100 105
Pro Met Phe Glu Tyr Glu Asn Leu Glu Asp Asn Ser Lyε Glu Trp
110 115 120
Thr Ser Leu Phe Aεp Aεn Aεp Ile Pro Val Thr Thr Asp Asp Val
125 130 135
Ser Leu Ala Asp Lys Ala Ile Glu Ser Thr Glu Glu Val Ser Leu
140 145 150
Val Pro Ser Asn Leu Glu Val Ser Thr Thr Ser Phe Leu Pro Thr
155 160 165
Pro Val Leu Glu Aεp Ala Lys Leu Thr Gin Thr Arg Lys Val Lys
170 175 180
Lys Pro Asn Ser Val Val Lys Lys Ser Hiε Hiε Val Gly Lyε Aεp
185 190 195
Asp Glu Ser Arg Leu Aεp Hiε Leu Gly Val Val Ala Tyr Aεn Arg
200 205 210
Lyε Gin Arg Ser Ile Pro Leu Ser Pro Ile Val Pro Glu Ser Ser
215 220 225
Asp Pro Ala Ala Leu Lyε Arg Ala Arg Aεn Thr Glu Ala Ala Arg
230 235 240
Arg Ser Arg Ala Arg Lyε Leu Gin Arg Met Lyε Gin Ile Glu Aεp
245 250 255
Lyε Leu Glu Glu Ile Leu Ser Lys Leu Tyr Hiε Ile Glu Asn Glu
260 265 270
Leu Ala Arg Ile Lys Lys Leu Leu Gly Glu Arg
275 280
(2) INFORMATION FOR SEQ ID NO:31:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: ACAGAACATG TCTAAGCATG CTGGGGA 27
(2) INFORMATION FOR SEQ ID NO: 32:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: other nucleic acid (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: ATCACGTGAT 10
(2) INFORMATION FOR SEQ ID NO:33:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 281 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
Met Ser Glu Tyr Gin Pro Ser Leu Phe Ala Leu Asn Pro Met Gly 1 5 10 15
Phe Ser Pro Leu Asp Gly Ser Lys Ser Thr Aεn Glu Aεn Val Ser
20 25 30
74
7. The modified p53 construct according to claim 4 wherein the p53 amino acid sequence is a chimeric p53 protein.
8. The modified p53 construct according to claim 4 wherein the p53 amino acid sequence contains an engineered p53 DNA binding domain.
9. A pharmaceutical composition comprising a modified p53 protein construct according to claim 1 and a pharmaceutically acceptable carrier.
10. A method of enhancing the DNA-binding ability of a p53 construct having a p53 DNA binding domain comprising the step of: modifying the codon encoding the amino acid correεponding to residue 284 of wild-type p53 to a codon encoding arginine, whereby the resulting modified p53 construct is characterized by enhanced DNA-binding ability.
11. The method according to claim 10 wherein the p53 amino acid sequence iε a natural or engineered mutant p53.
12. The method according to claim 10 wherein the p53 amino acid εequence is a chimeric p53 protein.
13. A nucleotide sequence encoding a modified p53 protein construct having DNA binding activity comprising a p53 amino acid sequence in which the threonine corresponding to amino acid residue 284 of the wild-type p53 protein is changed to arginine.
Claims
What is claimed is:
1. A modified p53 protein construct having DNA binding ability comprising a p53 amino acid sequence in which the threonine corresponding to amino acid residue 284 of the wild-type human p53 protein is changed to arginine.
2. The modified p53 construct according to claim 1 wherein the p53 amino acid sequence iε full- length human wild-type human p53.
3. The modified p53 conεtruct according to claim 1 wherein the p53 amino acid εequence iε human wild-type p53 bearing a deletion of all or a fragment of the C-terminal reεidues 356 to 393.
4. The modified p53 construct according to claim 1 wherein the p53 amno acid sequence is a natural or engineered mutant p53 sequence.
5. The modified p53 construct according to claim 4 wherein the p53 amino acid sequence is a p53 mutant amino acid sequence selected from the group consisting of: a mutant p53 having glutamine at amino acid position 248, a mutant p53 having histidine at amino acid position 273, and a mutant p53 having cysteine at amino acid poεition 273.
6. The modified p53 conεtruct according to claim 5 wherein the p53 amino acid εequence iε deleted of all or a fragment of the C-terminal reεidues 356 to 393.
76
22. The method according to claim 19 wherein the condition is cancer.
14. A vector comprising a nucleotide sequence encoding a modified p53 protein construct having DNA binding activity comprising a p53 amino acid sequence in which the threonine corresponding to amino acid residue 284 of the wild-type p53 protein is changed to arginine.
15. A pharmaceutical composition comprising a nucleic acid sequence according to claim 13 and a pharmaceutically acceptable carrier.
16. A pharmaceutical composition comprising a vector according to claim 14 and a pharmaceutically acceptable carrier.
17. A method of treating a condition associated with deficient p53 activity comprising the step of administering a pharmaceutical composition according to claim 9.
18. A method of treating a condition asεociated with deficient p53 activity compriεing the step of adminiεtering a pharmaceutical compoεition according to claim 15.
19. A method of treating a condition associated with deficient p53 activity comprising the step of administering a pharmaceutical composition according to claim 16.
20. The method according to claim 17 wherein the condition is cancer.
21. The method according to claim 18 wherein the condition is cancer.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU72429/96A AU7242996A (en) | 1995-09-22 | 1996-09-20 | Modified p53 constructs and uses therefor |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US480295P | 1995-09-22 | 1995-09-22 | |
| US60/004,802 | 1995-09-22 | ||
| US08/697,221 US5847083A (en) | 1996-08-21 | 1996-08-21 | Modified p53 constructs which enhance DNA binding |
| US08/697,221 | 1996-08-21 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO1997010843A1 true WO1997010843A1 (en) | 1997-03-27 |
Family
ID=26673499
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US1996/015188 WO1997010843A1 (en) | 1995-09-22 | 1996-09-20 | Modified p53 constructs and uses therefor |
Country Status (2)
| Country | Link |
|---|---|
| AU (1) | AU7242996A (en) |
| WO (1) | WO1997010843A1 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2000022115A3 (en) * | 1998-10-13 | 2000-09-21 | Univ Texas | Assays for identifying functional alterations in the p53 tumor suppressor |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5362623A (en) * | 1991-06-14 | 1994-11-08 | The John Hopkins University | Sequence specific DNA binding by p53 |
| WO1995017213A1 (en) * | 1993-12-21 | 1995-06-29 | Sloan-Kettering Institute For Cancer Research | P53-based polypeptide fragments, nucleic acid molecules encoding same, and uses thereof |
| US5573925A (en) * | 1994-11-28 | 1996-11-12 | The Wistar Institute Of Anatomy And Biology | P53 proteins with altered tetramerization domains |
-
1996
- 1996-09-20 AU AU72429/96A patent/AU7242996A/en not_active Abandoned
- 1996-09-20 WO PCT/US1996/015188 patent/WO1997010843A1/en active Application Filing
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5362623A (en) * | 1991-06-14 | 1994-11-08 | The John Hopkins University | Sequence specific DNA binding by p53 |
| WO1995017213A1 (en) * | 1993-12-21 | 1995-06-29 | Sloan-Kettering Institute For Cancer Research | P53-based polypeptide fragments, nucleic acid molecules encoding same, and uses thereof |
| US5573925A (en) * | 1994-11-28 | 1996-11-12 | The Wistar Institute Of Anatomy And Biology | P53 proteins with altered tetramerization domains |
Non-Patent Citations (3)
| Title |
|---|
| CELL, 27 November 1992, Vol. 71, HUPP et al., "Regulation of the Specific DNA Binding Function of p53", pages 875-886. * |
| SCIENCE, 05 July 1991, Vol. 253, HOLLSTEIN et al., "p53 Mutations in Human Cancers", pages 49-53. * |
| THE EMBO JOURNAL, 1993, Vol. 12, No. 13, HALAZONETIS et al., "Conformational Shifts Propagate from the Oligomerization Domain of p53 to its Tetrameric DNA Binding Domain and Restore DNA Binding to Select p53 Mutants", pages 5057-5064. * |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2000022115A3 (en) * | 1998-10-13 | 2000-09-21 | Univ Texas | Assays for identifying functional alterations in the p53 tumor suppressor |
| US6429298B1 (en) | 1998-10-13 | 2002-08-06 | Board Of Regents, The University Of Texas System | Assays for identifying functional alterations in the p53 tumor suppressor |
Also Published As
| Publication number | Publication date |
|---|---|
| AU7242996A (en) | 1997-04-09 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Selivanova et al. | Restoration of the growth suppression function of mutant p53 by a synthetic peptide derived from the p53 C-terminal domain | |
| US6635623B1 (en) | Lipoproteins as nucleic acid vectors | |
| AU698437B2 (en) | Recombinant P53 adenovirus methods and compositions | |
| US5573925A (en) | P53 proteins with altered tetramerization domains | |
| US6140058A (en) | Activation of p53 protein | |
| CA2406233A1 (en) | Compositions for drug delivery | |
| KR19990021828A (en) | New Variants of Apolipoprotein A-I | |
| KR20020013473A (en) | Adenovirus-Mediated Gene Therapy | |
| US7772367B2 (en) | C-terminal p53 palindromic peptide that induces apoptosis of cells with aberrant p53 and uses thereof | |
| US20170065684A9 (en) | Phenotypic reversion of pancreatic carcinoma cells | |
| US5721340A (en) | p53 proteins with altered tetramerization domains | |
| EP0799243A1 (en) | p53 PROTEINS WITH ALTERED TETRAMERIZATION DOMAINS | |
| CA2277880A1 (en) | Use of pea3 in tumor suppression | |
| CA2517285A1 (en) | Transcriptional factor inducing apoptosis in cancer cell | |
| WO1996020207A9 (en) | MUTANTS OF THE Rb AND p53 GENES AND USES THEREOF | |
| WO1996020207A1 (en) | MUTANTS OF THE Rb AND p53 GENES AND USES THEREOF | |
| CA2343099C (en) | Role of human kis (hkis) as an inhibitory kinase of the cyclin-dependentkinase inhibitor p27. compositions, methods and uses thereof to control cell proliferation | |
| US5847083A (en) | Modified p53 constructs which enhance DNA binding | |
| WO1997010843A1 (en) | Modified p53 constructs and uses therefor | |
| US6770473B1 (en) | hKIS compositions and methods of use | |
| US6388062B1 (en) | Modified p53 tetramerization domains having hydrophobic amino acid substitutions | |
| KR20030009522A (en) | Biosynthetic oncolytic molecules and uses therefor | |
| PT1377667E (en) | Fusion proteins for specific treatment of cancer and auto-immune diseases | |
| US5965398A (en) | DNA sequence encoding a tumor suppressor gene | |
| KR100636017B1 (en) | Transcriptional activator |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A1 Designated state(s): AU CA JP |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
| 122 | Ep: pct application non-entry in european phase | ||
| NENP | Non-entry into the national phase |
Ref country code: CA |