US20030124685A1 - Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing l-arginine - Google Patents
Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing l-arginine Download PDFInfo
- Publication number
- US20030124685A1 US20030124685A1 US09/494,359 US49435900A US2003124685A1 US 20030124685 A1 US20030124685 A1 US 20030124685A1 US 49435900 A US49435900 A US 49435900A US 2003124685 A1 US2003124685 A1 US 2003124685A1
- Authority
- US
- United States
- Prior art keywords
- ala
- amino acid
- gly
- val
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 101000859568 Methanobrevibacter smithii (strain ATCC 35061 / DSM 861 / OCM 144 / PS) Carbamoyl-phosphate synthase Proteins 0.000 title claims abstract description 77
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 title claims description 47
- 238000004519 manufacturing process Methods 0.000 title claims description 10
- 241000186031 Corynebacteriaceae Species 0.000 title description 26
- 150000001413 amino acids Chemical class 0.000 claims abstract description 106
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 55
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 53
- 229920001184 polypeptide Polymers 0.000 claims abstract description 50
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 50
- 239000012634 fragment Substances 0.000 claims abstract description 39
- 230000000694 effects Effects 0.000 claims abstract description 35
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims abstract description 26
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 25
- 238000007792 addition Methods 0.000 claims abstract description 22
- 238000012217 deletion Methods 0.000 claims abstract description 22
- 230000037430 deletion Effects 0.000 claims abstract description 22
- 238000003780 insertion Methods 0.000 claims abstract description 22
- 230000037431 insertion Effects 0.000 claims abstract description 22
- 238000006467 substitution reaction Methods 0.000 claims abstract description 21
- 229930064664 L-arginine Natural products 0.000 claims description 45
- 235000014852 L-arginine Nutrition 0.000 claims description 45
- 235000001014 amino acid Nutrition 0.000 claims description 44
- 239000002773 nucleotide Substances 0.000 claims description 44
- 125000003729 nucleotide group Chemical group 0.000 claims description 44
- 244000005700 microbiome Species 0.000 claims description 25
- 235000018102 proteins Nutrition 0.000 claims description 23
- 241000186254 coryneform bacterium Species 0.000 claims description 14
- 230000003834 intracellular effect Effects 0.000 claims description 12
- -1 substitution Chemical compound 0.000 claims description 8
- 230000033228 biological regulation Effects 0.000 claims description 5
- 238000012258 culturing Methods 0.000 claims description 5
- 125000003275 alpha amino acid group Chemical group 0.000 abstract 6
- 108020004414 DNA Proteins 0.000 description 102
- 241000588724 Escherichia coli Species 0.000 description 35
- 210000004027 cell Anatomy 0.000 description 31
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 30
- 238000000034 method Methods 0.000 description 30
- 101150014229 carA gene Proteins 0.000 description 29
- 101150070764 carB gene Proteins 0.000 description 29
- 229940024606 amino acid Drugs 0.000 description 28
- 239000013612 plasmid Substances 0.000 description 27
- 239000002609 medium Substances 0.000 description 23
- 239000013598 vector Substances 0.000 description 22
- 241000894006 Bacteria Species 0.000 description 19
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 18
- 241000186226 Corynebacterium glutamicum Species 0.000 description 17
- 229940035893 uracil Drugs 0.000 description 15
- 108010049041 glutamylalanine Proteins 0.000 description 13
- 210000000349 chromosome Anatomy 0.000 description 12
- 239000004475 Arginine Substances 0.000 description 10
- 244000063299 Bacillus subtilis Species 0.000 description 10
- 235000014469 Bacillus subtilis Nutrition 0.000 description 10
- 108700026244 Open Reading Frames Proteins 0.000 description 10
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 10
- 235000009697 arginine Nutrition 0.000 description 10
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 10
- 230000035772 mutation Effects 0.000 description 10
- 108010050848 glycylleucine Proteins 0.000 description 9
- 229930027917 kanamycin Natural products 0.000 description 9
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 9
- 229960000318 kanamycin Drugs 0.000 description 9
- 229930182823 kanamycin A Natural products 0.000 description 9
- 108091008146 restriction endonucleases Proteins 0.000 description 9
- 239000011780 sodium chloride Substances 0.000 description 9
- 241000186146 Brevibacterium Species 0.000 description 8
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 8
- 241000186216 Corynebacterium Species 0.000 description 7
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 241000319304 [Brevibacterium] flavum Species 0.000 description 7
- 229940041514 candida albicans extract Drugs 0.000 description 7
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- 239000012138 yeast extract Substances 0.000 description 7
- 229920001817 Agar Polymers 0.000 description 6
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 6
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 6
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 6
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 6
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 6
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 6
- 239000008272 agar Substances 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 230000002950 deficient Effects 0.000 description 6
- 230000001747 exhibiting effect Effects 0.000 description 6
- 239000008103 glucose Substances 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 238000002360 preparation method Methods 0.000 description 6
- 238000011144 upstream manufacturing Methods 0.000 description 6
- YYAYLSOQGBAUHK-UHFFFAOYSA-N 2-(1,3-thiazol-2-ylamino)propanoic acid Chemical compound OC(=O)C(C)NC1=NC=CS1 YYAYLSOQGBAUHK-UHFFFAOYSA-N 0.000 description 5
- IXHTVNGQTIZAFS-BYPYZUCNSA-N L-arginine hydroxamate Chemical compound ONC(=O)[C@@H](N)CCCN=C(N)N IXHTVNGQTIZAFS-BYPYZUCNSA-N 0.000 description 5
- 108010079364 N-glycylalanine Proteins 0.000 description 5
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 5
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 5
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 5
- 108020004707 nucleic acids Proteins 0.000 description 5
- 150000007523 nucleic acids Chemical class 0.000 description 5
- 102000039446 nucleic acids Human genes 0.000 description 5
- 229960003104 ornithine Drugs 0.000 description 5
- 239000013600 plasmid vector Substances 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 241000588722 Escherichia Species 0.000 description 4
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 4
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 4
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 4
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 4
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 4
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 4
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- FFQKYPRQEYGKAF-UHFFFAOYSA-N carbamoyl phosphate Chemical compound NC(=O)OP(O)(O)=O FFQKYPRQEYGKAF-UHFFFAOYSA-N 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 4
- 229960005091 chloramphenicol Drugs 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 4
- 229960004452 methionine Drugs 0.000 description 4
- 108010056582 methionylglutamic acid Proteins 0.000 description 4
- 239000000523 sample Substances 0.000 description 4
- KDYFGRWQOYBRFD-UHFFFAOYSA-N succinic acid Chemical compound OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- UFAABHKZLQFLSG-YFKPBYRVSA-N 2-[(4s)-4-amino-5-hydroxypentyl]guanidine Chemical compound OC[C@@H](N)CCCN=C(N)N UFAABHKZLQFLSG-YFKPBYRVSA-N 0.000 description 3
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- SSPYSWLZOPCOLO-UHFFFAOYSA-N 6-azauracil Chemical compound O=C1C=NNC(=O)N1 SSPYSWLZOPCOLO-UHFFFAOYSA-N 0.000 description 3
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 3
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 3
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 3
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 3
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 3
- NLXLAEXVIDQMFP-UHFFFAOYSA-N Ammonia chloride Chemical compound [NH4+].[Cl-] NLXLAEXVIDQMFP-UHFFFAOYSA-N 0.000 description 3
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 3
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 3
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 3
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 3
- 241000193830 Bacillus <bacterium> Species 0.000 description 3
- 241000194032 Enterococcus faecalis Species 0.000 description 3
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 3
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 3
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 3
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 3
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 3
- FSBIGDSBMBYOPN-VKHMYHEASA-N L-canavanine Chemical compound OC(=O)[C@@H](N)CCONC(N)=N FSBIGDSBMBYOPN-VKHMYHEASA-N 0.000 description 3
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical class C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 3
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 3
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 3
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- FSBIGDSBMBYOPN-UHFFFAOYSA-N O-guanidino-DL-homoserine Natural products OC(=O)C(N)CCON=C(N)N FSBIGDSBMBYOPN-UHFFFAOYSA-N 0.000 description 3
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 3
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 3
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 3
- 239000004473 Threonine Substances 0.000 description 3
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 3
- AAOPYWQQBXHINJ-DZKIICNBSA-N Val-Gln-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AAOPYWQQBXHINJ-DZKIICNBSA-N 0.000 description 3
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 238000009395 breeding Methods 0.000 description 3
- 230000001488 breeding effect Effects 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000012869 ethanol precipitation Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 229960002885 histidine Drugs 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 229960002898 threonine Drugs 0.000 description 3
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 2
- 108010044087 AS-I toxin Proteins 0.000 description 2
- ZKHQWZAMYRWXGA-KQYNXXCUSA-N Adenosine triphosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-N 0.000 description 2
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 2
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 2
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 2
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 2
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 2
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 2
- HOIFSHOLNKQCSA-FXQIFTODSA-N Asn-Arg-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O HOIFSHOLNKQCSA-FXQIFTODSA-N 0.000 description 2
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 2
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 2
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 2
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 2
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 2
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 2
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 2
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 2
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 2
- 241000222178 Candida tropicalis Species 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 2
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 2
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 2
- 108010090461 DFG peptide Proteins 0.000 description 2
- VZCYOOQTPOCHFL-OWOJBTEDSA-N Fumaric acid Chemical compound OC(=O)\C=C\C(O)=O VZCYOOQTPOCHFL-OWOJBTEDSA-N 0.000 description 2
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 2
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 2
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 2
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 2
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- NZOAFWHVAFJERA-OALUTQOASA-N Gly-Phe-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NZOAFWHVAFJERA-OALUTQOASA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 2
- 244000068988 Glycine max Species 0.000 description 2
- 235000010469 Glycine max Nutrition 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 2
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 2
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 2
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 2
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 2
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 2
- 239000007836 KH2PO4 Substances 0.000 description 2
- FFEARJCKVFRZRR-UHFFFAOYSA-N L-Methionine Natural products CSCCC(N)C(O)=O FFEARJCKVFRZRR-UHFFFAOYSA-N 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- UKAUYVFTDYCKQA-VKHMYHEASA-N L-homoserine Chemical compound OC(=O)[C@@H](N)CCO UKAUYVFTDYCKQA-VKHMYHEASA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- 229930195722 L-methionine Natural products 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 2
- URBJRJKWSUFCKS-AVGNSLFASA-N Lys-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N URBJRJKWSUFCKS-AVGNSLFASA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 2
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 2
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 2
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 2
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 description 2
- PTLOFJZJADCNCD-DCAQKATOSA-N Pro-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 PTLOFJZJADCNCD-DCAQKATOSA-N 0.000 description 2
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 2
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 2
- 108020005091 Replication Origin Proteins 0.000 description 2
- 241000235070 Saccharomyces Species 0.000 description 2
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 2
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- 241000607720 Serratia Species 0.000 description 2
- 241000607715 Serratia marcescens Species 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 2
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 2
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 2
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 2
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 2
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 2
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 2
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 2
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 2
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 2
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 2
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 2
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 2
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 2
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 2
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 2
- 235000011130 ammonium sulphate Nutrition 0.000 description 2
- 101150072344 argA gene Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 210000003578 bacterial chromosome Anatomy 0.000 description 2
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 229960002685 biotin Drugs 0.000 description 2
- 235000020958 biotin Nutrition 0.000 description 2
- 239000011616 biotin Substances 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- QEWYKACRFQMRMB-UHFFFAOYSA-N fluoroacetic acid Chemical compound OC(=O)CF QEWYKACRFQMRMB-UHFFFAOYSA-N 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 2
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 235000019341 magnesium sulphate Nutrition 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 229960002429 proline Drugs 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 238000009790 rate-determining step (RDS) Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 239000001384 succinic acid Substances 0.000 description 2
- ZEMGGZBWXRYJHK-UHFFFAOYSA-N thiouracil Chemical compound O=C1C=CNC(=S)N1 ZEMGGZBWXRYJHK-UHFFFAOYSA-N 0.000 description 2
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- UKAUYVFTDYCKQA-UHFFFAOYSA-N -2-Amino-4-hydroxybutanoic acid Natural products OC(=O)C(N)CCO UKAUYVFTDYCKQA-UHFFFAOYSA-N 0.000 description 1
- NWUYHJFMYQTDRP-UHFFFAOYSA-N 1,2-bis(ethenyl)benzene;1-ethenyl-2-ethylbenzene;styrene Chemical compound C=CC1=CC=CC=C1.CCC1=CC=CC=C1C=C.C=CC1=CC=CC=C1C=C NWUYHJFMYQTDRP-UHFFFAOYSA-N 0.000 description 1
- GEWRKGDRYZIFNP-UHFFFAOYSA-N 1h-1,3,5-triazine-2,4-dione Chemical compound OC1=NC=NC(O)=N1 GEWRKGDRYZIFNP-UHFFFAOYSA-N 0.000 description 1
- BQQVEASFNMRTBA-UHFFFAOYSA-N 2-[4-(3-aminopropyl)piperazin-1-yl]ethanol Chemical compound NCCCN1CCN(CCO)CC1 BQQVEASFNMRTBA-UHFFFAOYSA-N 0.000 description 1
- RBCXEDQEZDUMHD-UHFFFAOYSA-N 2-fluoropropanedioic acid Chemical compound OC(=O)C(F)C(O)=O RBCXEDQEZDUMHD-UHFFFAOYSA-N 0.000 description 1
- XPYXSZDENRDLKD-UHFFFAOYSA-N 2-octylguanidine Chemical compound CCCCCCCCN=C(N)N XPYXSZDENRDLKD-UHFFFAOYSA-N 0.000 description 1
- LGVJIYCMHMKTPB-UHFFFAOYSA-N 3-hydroxynorvaline Chemical compound CCC(O)C(N)C(O)=O LGVJIYCMHMKTPB-UHFFFAOYSA-N 0.000 description 1
- MFEFTTYGMZOIKO-UHFFFAOYSA-N 5-azacytosine Chemical compound NC1=NC=NC(=O)N1 MFEFTTYGMZOIKO-UHFFFAOYSA-N 0.000 description 1
- LQLQRFGHAALLLE-UHFFFAOYSA-N 5-bromouracil Chemical compound BrC1=CNC(=O)NC1=O LQLQRFGHAALLLE-UHFFFAOYSA-N 0.000 description 1
- 241000186361 Actinobacteria <class> Species 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- HIIJOGIBQXHFKE-HHKYUTTNSA-N Ala-Thr-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O HIIJOGIBQXHFKE-HHKYUTTNSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 1
- 239000004254 Ammonium phosphate Substances 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- BMNVSPMWMICFRV-DCAQKATOSA-N Arg-His-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CN=CN1 BMNVSPMWMICFRV-DCAQKATOSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 1
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- VXLBDJWTONZHJN-YUMQZZPRSA-N Asn-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N VXLBDJWTONZHJN-YUMQZZPRSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- UWOPETAWXDZUJR-ACZMJKKPSA-N Asp-Cys-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O UWOPETAWXDZUJR-ACZMJKKPSA-N 0.000 description 1
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 1
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- SXLCDCZHNCLFGZ-BPUTZDHNSA-N Asp-Pro-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SXLCDCZHNCLFGZ-BPUTZDHNSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- 102100021277 Beta-secretase 2 Human genes 0.000 description 1
- 241001025270 Brevibacterium album Species 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241001517047 Corynebacterium acetoacidophilum Species 0.000 description 1
- 241000909293 Corynebacterium alkanolyticum Species 0.000 description 1
- 241000186248 Corynebacterium callunae Species 0.000 description 1
- 241000133018 Corynebacterium melassecola Species 0.000 description 1
- 241000337023 Corynebacterium thermoaminogenes Species 0.000 description 1
- DCXGXDGGXVZVMY-GHCJXIJMSA-N Cys-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CS DCXGXDGGXVZVMY-GHCJXIJMSA-N 0.000 description 1
- SBMGKDLRJLYZCU-BIIVOSGPSA-N Cys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N)C(=O)O SBMGKDLRJLYZCU-BIIVOSGPSA-N 0.000 description 1
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 1
- SSNJZBGOMNLSLA-CIUDSAMLSA-N Cys-Leu-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O SSNJZBGOMNLSLA-CIUDSAMLSA-N 0.000 description 1
- UOEYKPDDHSFMLI-DCAQKATOSA-N Cys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N UOEYKPDDHSFMLI-DCAQKATOSA-N 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- ODKSFYDXXFIFQN-SCSAIBSYSA-N D-arginine Chemical compound OC(=O)[C@H](N)CCCNC(N)=N ODKSFYDXXFIFQN-SCSAIBSYSA-N 0.000 description 1
- 229930028154 D-arginine Natural products 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 241001131785 Escherichia coli HB101 Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- GHASVSINZRGABV-UHFFFAOYSA-N Fluorouracil Chemical compound FC1=CNC(=O)NC1=O GHASVSINZRGABV-UHFFFAOYSA-N 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 1
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 1
- LLVXTGUTDYMJLY-GUBZILKMSA-N Gln-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LLVXTGUTDYMJLY-GUBZILKMSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 1
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 1
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- TVSPLSZTKTUYLV-ZPFDUUQYSA-N Ile-Glu-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O TVSPLSZTKTUYLV-ZPFDUUQYSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- QUOGESRFPZDMMT-UHFFFAOYSA-N L-Homoarginine Natural products OC(=O)C(N)CCCCNC(N)=N QUOGESRFPZDMMT-UHFFFAOYSA-N 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 1
- QUOGESRFPZDMMT-YFKPBYRVSA-N L-homoarginine Chemical compound OC(=O)[C@@H](N)CCCCNC(N)=N QUOGESRFPZDMMT-YFKPBYRVSA-N 0.000 description 1
- 229930182844 L-isoleucine Natural products 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- 229930182821 L-proline Natural products 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 1
- YFGWNAROEYWGNL-GUBZILKMSA-N Lys-Gln-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YFGWNAROEYWGNL-GUBZILKMSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- GFWLIJDQILOEPP-HSCHXYMDSA-N Lys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N GFWLIJDQILOEPP-HSCHXYMDSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 1
- CENKQZWVYMLRAX-ULQDDVLXSA-N Lys-Phe-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CENKQZWVYMLRAX-ULQDDVLXSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 1
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 1
- 241000144155 Microbacterium ammoniaphilum Species 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- KSMRODHGGIIXDV-YFKPBYRVSA-N N-acetyl-L-glutamine Chemical group CC(=O)N[C@H](C(O)=O)CCC(N)=O KSMRODHGGIIXDV-YFKPBYRVSA-N 0.000 description 1
- HLKXYZVTANABHZ-REOHCLBHSA-N N-carbamoyl-L-aspartic acid Chemical compound NC(=O)N[C@H](C(O)=O)CC(O)=O HLKXYZVTANABHZ-REOHCLBHSA-N 0.000 description 1
- PYUSHNKNPOHWEZ-YFKPBYRVSA-N N-formyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC=O PYUSHNKNPOHWEZ-YFKPBYRVSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- VZUNGTLZRAYYDE-UHFFFAOYSA-N N-methyl-N'-nitro-N-nitrosoguanidine Chemical compound O=NN(C)C(=N)N[N+]([O-])=O VZUNGTLZRAYYDE-UHFFFAOYSA-N 0.000 description 1
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 1
- IOVCWXUNBOPUCH-UHFFFAOYSA-N Nitrous acid Chemical compound ON=O IOVCWXUNBOPUCH-UHFFFAOYSA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 1
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- SHTKRJHDMNSKRM-ULQDDVLXSA-N Pro-Tyr-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O SHTKRJHDMNSKRM-ULQDDVLXSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108010009736 Protein Hydrolysates Proteins 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- 101000794822 Serratia marcescens Anthranilate synthase component 1 Proteins 0.000 description 1
- 101000847781 Serratia marcescens Anthranilate synthase component 2 Proteins 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- HOJPPPKZWFRTHJ-PJODQICGSA-N Trp-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HOJPPPKZWFRTHJ-PJODQICGSA-N 0.000 description 1
- DXDMNBJJEXYMLA-UBHSHLNASA-N Trp-Asn-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 DXDMNBJJEXYMLA-UBHSHLNASA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 1
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 1
- KEANSLVUGJADPN-LKTVYLICSA-N Tyr-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N KEANSLVUGJADPN-LKTVYLICSA-N 0.000 description 1
- ADECJAKCRKPSOR-ULQDDVLXSA-N Tyr-His-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ADECJAKCRKPSOR-ULQDDVLXSA-N 0.000 description 1
- OSXNCKRGMSHWSQ-ACRUOGEOSA-N Tyr-His-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSXNCKRGMSHWSQ-ACRUOGEOSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 1
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 1
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 description 1
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- DJJCXFVJDGTHFX-UHFFFAOYSA-N Uridinemonophosphate Natural products OC1C(O)C(COP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 DJJCXFVJDGTHFX-UHFFFAOYSA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- LABUITCFCAABSV-BPNCWPANSA-N Val-Ala-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-BPNCWPANSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- DCOOGDCRFXXQNW-ZKWXMUAHSA-N Val-Asn-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DCOOGDCRFXXQNW-ZKWXMUAHSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- RHYOAUJXSRWVJT-GVXVVHGQSA-N Val-His-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RHYOAUJXSRWVJT-GVXVVHGQSA-N 0.000 description 1
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 235000019270 ammonium chloride Nutrition 0.000 description 1
- 229910000148 ammonium phosphate Inorganic materials 0.000 description 1
- 235000019289 ammonium phosphates Nutrition 0.000 description 1
- 150000003863 ammonium salts Chemical class 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 101150094408 argI gene Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 239000002585 base Substances 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 125000003917 carbamoyl group Chemical group [H]N([H])C(*)=O 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-N carbonic acid Chemical compound OC(O)=O BVKZGUZCCUSVTD-UHFFFAOYSA-N 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 229960002173 citrulline Drugs 0.000 description 1
- 235000013477 citrulline Nutrition 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000008021 deposition Effects 0.000 description 1
- MNNHAPBLZZVQHP-UHFFFAOYSA-N diammonium hydrogen phosphate Chemical compound [NH4+].[NH4+].OP([O-])([O-])=O MNNHAPBLZZVQHP-UHFFFAOYSA-N 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 229910000397 disodium phosphate Inorganic materials 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 150000004665 fatty acids Chemical group 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 229960002949 fluorouracil Drugs 0.000 description 1
- 239000001530 fumaric acid Substances 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 150000002411 histidines Chemical class 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 229910001410 inorganic ion Inorganic materials 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 239000003456 ion exchange resin Substances 0.000 description 1
- 229920003303 ion-exchange polymer Polymers 0.000 description 1
- XEEYBQQBJWHFJM-UHFFFAOYSA-N iron Substances [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- BAUYGSIQEAFULO-UHFFFAOYSA-L iron(2+) sulfate (anhydrous) Chemical compound [Fe+2].[O-]S([O-])(=O)=O BAUYGSIQEAFULO-UHFFFAOYSA-L 0.000 description 1
- 229910000359 iron(II) sulfate Inorganic materials 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 1
- 229960003646 lysine Drugs 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 229910001437 manganese ion Inorganic materials 0.000 description 1
- SQQMAOCOWKFBNP-UHFFFAOYSA-L manganese(II) sulfate Chemical compound [Mn+2].[O-]S([O-])(=O)=O SQQMAOCOWKFBNP-UHFFFAOYSA-L 0.000 description 1
- 229910000357 manganese(II) sulfate Inorganic materials 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 125000001477 organic nitrogen group Chemical group 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 229910000160 potassium phosphate Inorganic materials 0.000 description 1
- 235000011009 potassium phosphates Nutrition 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- SEEPANYCNGTZFQ-UHFFFAOYSA-N sulfadiazine Chemical compound C1=CC(N)=CC=C1S(=O)(=O)NC1=NC=CC=N1 SEEPANYCNGTZFQ-UHFFFAOYSA-N 0.000 description 1
- 229960004306 sulfadiazine Drugs 0.000 description 1
- BRBKOPJOKNSWSG-UHFFFAOYSA-N sulfaguanidine Chemical compound NC(=N)NS(=O)(=O)C1=CC=C(N)C=C1 BRBKOPJOKNSWSG-UHFFFAOYSA-N 0.000 description 1
- 229960004257 sulfaguanidine Drugs 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- VZCYOOQTPOCHFL-UHFFFAOYSA-N trans-butenedioic acid Natural products OC(=O)C=CC(O)=O VZCYOOQTPOCHFL-UHFFFAOYSA-N 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 229960004799 tryptophan Drugs 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- DJJCXFVJDGTHFX-ZAKLUEHWSA-N uridine-5'-monophosphate Chemical compound O[C@@H]1[C@@H](O)[C@H](COP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 DJJCXFVJDGTHFX-ZAKLUEHWSA-N 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 239000011691 vitamin B1 Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
- C12P13/10—Citrulline; Arginine; Ornithine
Definitions
- the present invention relates to carbamoyl-phosphate synthetase of coryneform bacteria, and a gene therefor.
- the gene can be utilized for production of carbamoyl-phosphate synthetase and subunits thereof, breeding of L-arginine-producing bacteria and nucleic acid-producing bacteria and so forth.
- Carbamoyl-phosphate synthetase is an enzyme that catalyzes the reactions producing carbamoyl phosphate from carbonic acid, ATP and glutamine. Carbamoyl phosphate produced by these reactions serves as a source of carbamoyl group required for the reaction producing citrulline from ornithine in the L-arginine biosynthetic pathway. Furthermore, carbamoyl aspartate produced from aspartic acid and carbamoyl phosphate is one of the intermediates of the pyrimidine biosynthesis system including uridine 5′-monophosphate.
- Carbamoyl-phosphate synthetase consists of two subunits, and it has been known for bacteria belonging to the genus Escherichia or Bacillus that those subunits are encoded by carA and carB genes.
- ArgA N-acetylglutamine synthetase
- An object of the present invention is to provide carbamoyl-phosphate synthetase of coryneform bacteria, a gene coding for it, and a method for producing L-arginine with a microorganism utilizing the gene.
- the inventors of the present invention eagerly studied in order to achieve the aforementioned object. As a result, the inventors successfully obtained a DNA fragment containing the carA gene and the carB gene from a wild strain of Breveibacterium lactofermentum by utilizing a carB-deficient strain of Escherichia coli , and thus accomplished the present invention.
- the present invention provides the followings.
- a polypeptide which has an amino acid sequence comprises at least the amino acid numbers 50 to 393 of the amino acid sequence of SEQ ID NO: 2,
- a polypeptide which has an amino acid sequence comprises at least the amino acid numbers 50 to 393 of the amino acid sequence of SEQ ID NO: 2 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a large subunit of carbamoyl-phosphate synthetase comprising the amino acid sequence of SEQ ID NO: 3.
- a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a small subunit of carbamoyl-phosphate synthetase having an amino acid sequence comprises at least the amino acid numbers 50 to 393 of the amino acid sequence of SEQ ID NO: 2.
- polypeptide which has an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a large subunit of carbamoyl-phosphate synthetase comprising the amino acid sequence of SEQ ID NO: 3,
- a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a small subunit of carbamoyl-phosphate synthetase having an amino acid sequence comprising the amino acid numbers 50 to 393 in SEQ ID NO: 2.
- a protein which comprises a polypeptide defined in the following (a) or (b), and a polypeptide defined in the following (c) or (d):
- polypeptide which has an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a large subunit of carbamoyl-phosphate synthetase comprising the amino acid sequence of SEQ ID NO: 3,
- a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a small subunit of carbamoyl-phosphate synthetase having an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2.
- a method for producing of L-arginine comprising the steps of culturing a coryneform bacterium according to any one of (10) to (13) in a medium to produce and accumulate L-arginine in the medium, and collecting the L-arginine from the medium.
- the present invention provides genes coding for the subunits that constitute carbamoyl-phosphate synthetase.
- the gene can be utilized for production of carbamoyl-phosphate synthetase and subunits thereof, breeding of L-arginine-producing bacteria and nucleic acid-producing bacteria and so forth. Aditionally, L-arginine can be produced efficiently according to the present invention.
- FIG. 1 shows the structure of plasmid p19 containing the carA gene and carB gene.
- FIG. 2 shows a construction process of plasmid pK1.
- FIG. 3 shows a construction process of plasmid pSFK6.
- the DNA of the present invention can be obtained from a chromosome DNA library of coryneform bacteria prepared with vectors such as plasmids by selection of the DNA using a microorganism which is deficient in carA or carB, for example, Escherichia coli RC50 (carA50, tsx ⁇ 273, ⁇ ⁇ , rpsL135 (str R ), malT1 ( ⁇ R), xy1A7, thi ⁇ 1; Mol. Gen. Genet., 133, 299 (1974)), Escherichia coli JEF8 (thr ⁇ 31, ⁇ carB, relA ⁇ , metB1, Mol. Gen.
- Escherichia coli JEF8 thr ⁇ 31, ⁇ carB, relA ⁇ , metB1, Mol. Gen.
- a DNA fragment can be obtained by transforming such a microorganism with a chromosome DNA library, selecting clones in which the auxotrophy is complemented, and recovering a recombinant vector from the selected transformants.
- the coryneform bacteria used for preparing a chromosome DNA library are not particularly limited, and examples thereof include bacteria having been hitherto classified into the genus Brevibacterium but united into the genus Corynebacterium at present ( Int. J. Syst. Bacteriol., 41, 255 (1981)), and include bacteria belonging to the genus Brevibacterium closely relative to the genus Corynebacterium, more specifically, wild strains of Breveibacterium lactofermentum and so forth.
- Chromosome DNA of coryneform bacteria can be prepared by, for example, the method of Saito and Miura ( Biochem. Biophys. Acta., 72, 619, (1963)), the method of K. S. Kirby ( Biochem. J., 64, 405, (1956)) and so forth.
- a chromosome DNA library can be obtained by partially digesting chromosome DNA with suitable restriction enzymes, ligating each of the obtained DNA fragments to a vector DNA autonomously replicable in Escherichia coli cells to prepare a recombinant DNA, and introducing the DNA into Escherichia coli .
- the vector is not particularly limited so long as it is a vector usually used for genetic cloning, and plasmid vectors such as pUC19, pUC18, pUC118, and pUC119, phage vectors such as ⁇ phage DNA and so forth can be used. Further, a vector autonomously replicable in both of Escherichia coli cells and coryneform bacterium cells may also be used.
- Such a vector can be constructed by ligating a vector for Escherichia coli and pAM330, which is a cryptic plasmid of Breveibacterium lactofermentum (see Japanese Patent Laid-open No. 58-67699).
- Escherichia coli HB101 harboring pHK4 was designated as Escherichia coli AJ13136, and it was deposited at the National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology, Ministry of International Trade and Industry (postal code 305-8566, 1-3 Higashi 1-chome, Tsukuba-shi, Ibaraki-ken, Japan) on Aug. 1, 1995, and received an accession number of FERM BP-5186.
- the transformation of Escherichia coli cells can be performed by, for example, the method of D. A. Morrison (Methods in Enzymology, 68, 326, 1979), the method of treating recipient cells with calcium chloride so as to increase the permeability of DNA (Mandel, M. and Higa, A., J. Mol. Biol., 53, 159 (1970)) and so forth.
- methods for preparation of chromosome DNA library preparation of plasmid DNA, and digestion and ligation of DNA, as well as methods for PCR, preparation of oligonucleotides and hybridization mentioned hereinafter, conventional methods well known to those skilled in the art can be used. Such methods are described in Sambrook, J., Fritsch, E. F. and Maniatis, T., “Molecular Cloning, A Laboratory Manual, Second Edition”, Cold Spring Harbor Laboratory Press, (1989) and so forth.
- a nucleotide sequence of a DNA fragment containing carA and carB obtained as described above is represented as SEQ ID NO: 1 in Sequence Listing.
- This sequence contains two open reading frames (ORF, nucleotide numbers 283 to 1461 and nucleotide numbers 1756 to 4809).
- the upstream ORF is carA
- the downstream ORF is carB.
- the amino acid sequences encoded by these ORFs are shown in SEQ ID NOS: 2 and 3, respectively.
- a peptide encoded by carA is referred to as a small subunit
- a peptide encoded by carB is referred to as a large subunit.
- GTG of the nucleotide numbers 283 to 285 is indicated as the initiation codon in Sequence Listing.
- GTG of the nucleotide numbers 415 to 417 or ATG of the nucleotide numbers 430 to 432 may possibly be the initiation codon.
- an active small subunit can be obtained by using a longer open reading frame for the upstream region for the expression of carA.
- the amino acid corresponding to the GTG as the initiation codon is indicated as valine for each subunit, but it may be methionine, valine or formylmethionine.
- the small subunit of the carbamoyl-phosphate synthetase of the present invention is, for example, a polypeptide having the amino acid sequence of the amino acid numbers 50 to 393 in SEQ ID NO: 2, polypeptide having the amino acid sequence of the amino acid numbers 45 to 393 in SEQ ID NO: 2, polypeptide having the amino acid sequence of the amino acid numbers 1 to 393 in SEQ ID NO: 2 or the like.
- the large subunit of the carbamoyl-phosphate synthetase of the present invention is, for example, a polypeptide having the amino acid sequence shown as SEQ ID NO: 3.
- the DNA coding for the small subunit may be one coding for an amino acid sequence which contains the amino acid sequence of the amino acid numbers 50 to 393 in SEQ ID NO: 2 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, or one coding for a polypeptide which can constitute a protein having a carbamoyl-phosphate synthetase activity with the large subunit.
- the DNA coding for the large subunit may be one coding for an amino acid sequence which contains the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, or one coding for a polypeptide which can constitute a protein having a carbamoyl-phosphate synthetase activity with the small subunit.
- it may be one coding for a protein which has the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and has a carbamoyl-phosphate synthetase activity.
- a DNA that encodes carbamoyl-phosphate synthetase containing a mutation or mutations in the small subunit or the large subunit, or both of them also falls within the scope of the DNA of the present invention.
- severe amino acids preferably means 1 to 20 amino acids, more preferably 1 to 10 amino acids.
- DNA which encodes the substantially same peptide as the small subunit or the large subunit as described above, is obtained, for example, by modifying the nucleotide sequence of the DNA encoding the small subunit or the large subunit, for example, by means of the site-directed mutagenesis method so that one or more amino acid residues at a specified site of the gene involve substitution, deletion, insertion, addition, or inversion.
- DNA modified as described above may be obtained by the conventionally known mutation treatment.
- the mutation treatment includes a method for treating DNA coding for the small subunit or the large subunit in vitro, for example, with hydroxylamine, and a method for treating a microorganism, for example, a bacterium belonging to the genus Escherichia harboring DNA coding for the small subunit and the large subunit with ultraviolet irradiation or a mutating agent such as N-methyl-N′-nitro-N-nitrosoguanidine (NTG) and nitrous acid usually used for the mutation treatment.
- NTG N-methyl-N′-nitro-N-nitrosoguanidine
- substitution, deletion, insertion, addition, or inversion of nucleotide as described above also includes mutation (mutant or variant) which naturally occurs, for example, the difference in strains, species or genera of the microorganism having the small subunit and/or the large subunit.
- the DNA which encodes substantially the same protein as carbamoyl-phosphate synthetase, is obtained by expressing DNA having mutation as described above in an appropriate cell, and investigating the carbamoyl-phosphate synthetase activity of an expressed product.
- the carbamoyl-phosphate synthetase activity can be measured by the known method ( Journal of Genral Microbiology, 136, 1177-1183 (1990)).
- the DNA which encodes substantially the same protein as carbamoyl-phosphate synthetase, is also obtained by isolating DNA which is hybridizable with DNA having, for example, a nucleotide sequence corresponding to nucleotide numbers of 283 to 1461 or 1756 to 4809 of the nucleotide sequence of SEQ ID NO: 2, under a stringent condition, and which encodes a protein having the carbamoyl-phosphate synthetase activity, from DNA coding for carbamoyl-phosphate synthetase having mutation or from a cell harboring it.
- the “stringent condition” referred to herein is a condition under which so-called specific hybrid is formed, and non-specific hybrid is not formed.
- the stringent condition includes a condition under which DNA's having high homology, for example, DNA's having homology of not less than 70%, preferably not less than 80%, more preferably not less than 90% are hybridized with each other, and DNA's having homology lower than the above are not hybridized with each other.
- the stringent condition is exemplified by a condition under which DNA's are hybridized with each other at a salt concentration corresponding to an ordinary condition of washing in Southern hybridization, i.e., 60° C., 1 ⁇ SSC, 0.1% SDS, preferably 0.1 ⁇ SSC, 0.1% SDS.
- a partial sequence of the nucleotide sequence of SEQ ID NO: 1 can also be used.
- Such a probe may be prepared by PCR using oligonucleotides produced based on the nucleotide sequence of SEQ ID NO: 1 as primers, and a DNA fragment containing the nucleotide sequence of SEQ ID NO: 1 as a template.
- the conditions of washing for the hybridization consist of, for example, 50° C., 2 ⁇ SSC, and 0.1% SDS.
- the DNA of the present invention can be obtained by amplifying it from coryneform bacterial chromosome DNA through polymerase chain reaction (PCR: polymerase chain reaction; see White, T. J. et al., Trends Genet., 5, 185 (1989)) utilizing oligonucleotides prepared based on that nucleotide sequence as primers, or by selecting it from a coryneform bacterial chromosome DNA library by hybridization utilizing an oligonucleotide prepared based on that nucleotide sequence as a probe.
- PCR polymerase chain reaction
- a region upstream from the nucleotide number 283, preferably a region upstream from the nucleotide number 185 of SEQ ID NO: 1 can suitably be selected as the 5′ primer, and a region downstream from the nucleotide number 4809 of SEQ ID NO: 1 can suitably be selected as the 3′ primer.
- Examples of the host for the expression of the DNA of the present invention include various bacteria such as Escherichia coli and coryneform bacteria including Breveibacterium lactofermentum and Brevibacterium flavum, eukaryotic cells such as those of Saccharomyces cerevisiae and so forth.
- the host cells can be transformed with a recombinant vector obtained by inserting the DNA of the present invention into a vector selected according to the nature of the host in which the DNA is to be expressed. This procedure can be performed by a method well known to those skilled in the art.
- the method include the methods used for transformation of Escherichia coli mentioned above, the method in which competent cells are prepared from cells at the proliferating stage to introduce DNA, as reported for Bacillus subtilis (Duncan, C. H., Wilson, G. A. and Young, F. E., Gene, 1, 153 (1977)), the method in which DNA recipient cells are allowed to be in a state of protoplasts or spheroplasts capable of incorporating recombinant DNA with ease to introduce recombinant DNA into the DNA recipient cells, as known for Bacillus subtilis, actinomycetes, and yeasts (Chang, S. and Choen, S. N., Molec. Gen.
- the DNA to be introduced into the host such as those mentioned above may be DNA containing either carA or carB, or DNA containing both of them. Further, in order to attain efficient expression of these genes, a promoter functioning in the host cells such as lac, trp and P L may be ligated at a position upstream from carA or carB.
- Carbamoyl-phosphate synthetase or its subunits can be produced by culturing a transformant such as those mentioned above under a condition that allows the expression of carA or carB.
- the DNA of the present invention can also be utilized for breeding of L-arginine-producing bacteria or nucleic acid-producing bacteria such as uracil-producing bacteria. That is, a transformant introduced with the DNA of the present invention, in particular, one introduced with either carA or carB or both of them, should have increased carbamoyl-phosphate synthetase activity compared with non-transformants. Consequently, its productivity for L-arginine or nucleic acid such as uracil is improved.
- L-Arginine can efficiently be produced by culturing a microorganism that has enhanced intracellular carbamoyl-phosphate synthetase activity, and has L-arginine productivity in a medium to produce and accumulate L-arginine in the medium, and collecting the L-arginine from the medium.
- microorganism having L-arginine productivity include coryneform bacteria, bacteria belonging to the genera Bacillus, Serratia and Escherichia, yeast species belonging to the genus Saccharomyces or Candida. Of these, coryneform bacteria are preferred.
- Exemplary specific species include Bacillus subtilis as a bacterium belonging to the genus Bacillus, Serratia marcescens as a bacterium belonging to the genus Serratia, Escherichia coli as a bacterium belonging to the genus Escherichia, Saccharomyces cerevisiae as a yeast species belonging to the genus Saccharomyces, Candida tropicalis as a yeast species belonging to the genus Candida and so forth.
- Exemplary microorganisms having L-arginine productivity include Bacillus subtilis resistant to 5-azauracil, 6-azauracil, 2-thiouracil, 5-fluorouracil, 5-bromouracil, 5-azacytosine and so forth, Bacillus subtilis resistant to arginine hydroxamate and 2-thiouracil, Bacillus subtilis resistant to arginine hydroxamate and 6-azauracil (see Japanese Patent Laid-open No. 49-1268191),
- Escherichia coli introduced with the argA gene (see Japanese Patent Laid-open No. 57-5693),
- Coryneform bacteria include those bacteria having been hitherto classified into the genus Brevibacterium but united into the genus Corynebacterium at present ( Int. J. Syst. Bacteriol., 41, 255 (1981)), and include bacteria belonging to the genus Brevibacterium closely relative to the genus Corynebacterium. Examples of such coryneform bacteria are listed below.
- Corynebacterium lilium Corynebacterium glutamicum
- the coryneform bacteria that have the L-arginine productivity are not particularly limited so long as they have the L-arginine productivity. They include, for example, wild-type strains of coryneform bacteria; coryneform bacteria resistant to certain agents including sulfa drugs, 2-thiazolealanine, ⁇ -amino- ⁇ -hydroxyvaleric acid and the like; coryneform bacteria exhibiting L-histidine, L-proline, L-threonine, L-isoleucine, L-methionine, or L-tryptophan auxotrophy in addition to the resistance to 2-thiazolealanine (Japanese Patent Laid-open No.
- coryneform bacteria resistant to ketomalonic acid, fluoromalonic acid, or monofluoroacetic acid Japanese Patent Laid-open No. 57-18989
- coryneform bacteria resistant to argininol Japanese Patent Laid-open No. 62-24075
- coryneform bacteria resistant to X-guanidine X represents a derivative of fatty acid or aliphatic chain, Japanese Patent Laid-open No. 2-186995) and so forth.
- the AJ11169 strain and the AJ12092 strain are the 2-thiazolealanine resistant strains mentioned in Japanese Patent Laid-open No. 54-44096
- the AJ11336 strain is the strain having argininol resistance and sulfadiazine resistance mentioned in Japanese Patent Publication No. 62-24075
- the AJ11345 strain is the strain having argininol resistance, 2-thiazolealanine resistance, sulfaguanidine resistance, and exhibiting histidine auxotrophy mentioned in Japanese Patent Publication No. 62-24075
- the AJ12430 strain is the strain having octylguanidine resistance and 2-thiazolealanine resistance mentioned in Japanese Patent Laid-open No. 2-186995.
- the intracellular carbamoyl-phosphate synthetase activity of such microorganisms having the L-arginine productivity as mentioned above can be enhanced by, for example, increasing copy number of a gene coding for the carbamoyl-phosphate synthetase in the cells of the aforementioned microorganisms.
- the enhancement of the carbamoyl-phosphate synthetase activity can also be achieved by, in addition to the aforementioned gene amplification, modifying an expression regulation sequence for the DNA coding for carbamoyl-phosphate synthetase so that expression of the DNA gene coding for carbamoyl-phosphate synthetase should be enhanced.
- an expression regulation sequence such as a promoter for a gene coding for carbamoyl-phosphate synthetase on the chromosomal DNA or a plasmid can be replaced with a stronger one (see Japanese Patent Laid-open No. 1-215280).
- Strong promoters which function in cells of coryneform bacteria, include lac promoter, tac promoter, trp promoter, of Escherichia coli (Y. Morinaga, M. Tsuchiya, K. Miwa and K. Sano, J. Biotech., 5, 305-312 (1987)) and the like.
- trp promoter of Corynebacterium bacteria is also a preferable promoter (Japanese Patent Laid-open No. 62-195294).
- the modification of expression regulation sequence may be combined with the increasing of the copy number of DNA coding for carbamoyl-phosphate synthetase.
- the intracellular carbamoyl-phosphate synthetase activity can be enhanced by introducing one or more mutations into the enzyme protein of carbamoyl-phosphate synthetase so that the specific activity of the enzyme should be increased.
- Examples of the DNA coding for carbamoyl-phosphate synthetase include the aforementioned carA and carB genes of Breveibacterium lactofermentum and one containing both of them.
- Examples of the vector for introducing DNA coding for carbamoyl-phosphate synthetase into a microorganism include vectors autonomously replicable in cells of the microorganism. Specifically, the aforementioned vectors autonomously replicable in Escherichia coli cells, and the vectors autonomously replicable in both of Escherichia coli cells and coryneform bacterium cells.
- the medium used for culturing a microorganism having enhanced intracellular carbamoyl-phosphate synthetase activity and L-arginine productivity obtained as described above may be a well-known medium conventionally used for the production of amino acids by fermentation. That is, it is a usual medium that contains a carbon source, nitrogen source, inorganic ions, and other organic components as required.
- the carbon source it is possible to use sugars such as glucose, sucrose, lactose, galactose, fructose and starch hydrolysates; alcohols such as glycerol and sorbitol; or organic acids such as fumaric acid, citric acid and succinic acid and so forth.
- sugars such as glucose, sucrose, lactose, galactose, fructose and starch hydrolysates
- alcohols such as glycerol and sorbitol
- organic acids such as fumaric acid, citric acid and succinic acid and so forth.
- the nitrogen source it is possible to use inorganic ammonium salts such as ammonium sulfate, ammonium chloride and ammonium phosphate, organic nitrogen such as soybean hydrolysates, ammonia gas, aqueous ammonia and so forth.
- inorganic ammonium salts such as ammonium sulfate, ammonium chloride and ammonium phosphate
- organic nitrogen such as soybean hydrolysates, ammonia gas, aqueous ammonia and so forth.
- the medium preferably contains a suitable amount of required substance such as vitamin B 1 and L-homoserine, yeast extract and so forth as trace amount organic nutrients.
- a suitable amount of required substance such as vitamin B 1 and L-homoserine, yeast extract and so forth as trace amount organic nutrients.
- a small amount of potassium phosphate, magnesium sulfate, iron ions, manganese ions and so forth may be added to the medium.
- the cultivation is preferably performed under an aerobic condition for 1-7 days.
- Cultivation temperature is preferably 24-37° C.
- pH of the medium during the cultivation is preferably 5-9.
- Inorganic or organic acidic or alkaline substances, ammonia gas and so forth may be used for adjusting pH.
- L-Arginine can usually be recovered from the fermentation medium by a combination of known techniques such as ion exchange resin method.
- Brevibacterium lactofermentun ATCC13869 was inoculated to 100 ml of T-Y culture medium (1% of Bacto-Trypton (Difco), 0.5% of Bacto-Yeast Extract (Difco), 0.5% of NaCl (pH 7.2)), and cultured at a temperature of 31.5° C. for 8 hours to obtain a culture. The culture was centrifuged at 3,000 r.p.m. for 15 minutes to obtain 0.5 g of wet bacterial cells, and chromosome DNA was obtained from the bacterial cells according to the method of Saito and Miura ( Biochem. Biophys. Acta., 72, 619 (1963)).
- chromosome DNA and 3 units of restriction enzyme Sau3AI were each mixed in 10 mM Tris-HCl buffer (containing 50 mM NaCl, 10 MM MgSO 4 and 1 mM dithiothreitol (pH 7.4)), and allowed to react at a temperature of 37° C. for 30 minutes.
- the reaction mixture was subjected to phenol extraction and ethanol precipitation in a conventional manner to obtain 50 ⁇ g of chromosome DNA fragments of Brevibacterium lactofermentum ATCC13869 digested with Sau3AI.
- pSAC4 As a plasmid vector DNA autonomously replicable in both of Escherichia coli cells and coryneform bacterium cells, pSAC4 was used. pSAC4 was prepared as follows. In order to make a vector pHSG399 for Escherichia coli (Takara Shuzo) autonomously replicable in coryneform bacterium cells, a replication origin of the previously obtained plasmid pHM1519 autonomously replicable in coryneform bacterium cells (Miwa, K. et al., Agric. Biol. Chem., 48 (1984) 2901-2903) was introduced into the vector (Japanese Patent Laid-open No. 5-7491).
- pHM1519 was digested with restriction enzymes BamHI and KpnI to obtain a gene fragment containing the replication origin, and the obtained fragment was blunt-ended by using Blunting Lit produced by Takara Shuzo, and inserted into the SalI site of pHSG399 using a SalI linker (produced by Takara Shuzo) to obtain pSAC4.
- Escherichia coli DH5 was transformed with this DNA mixture in a conventional manner, and plated on an L agar medium containing 170 ⁇ g/ml of chloramphenicol to obtain about 20,000 colonies, which were used as a gene library.
- transformants were replicated on a minimum medium (5 g/L of glucose, 12.8 g/L of Na 2 HPO 4 , 3 g/L of KH 2 PO 4 , 0.5 g/L of NaCl, 1 g/L of NH 4 Cl, 40 ⁇ g/ml of L-threonine, 40 ⁇ g/ml of L-methionine) not containing arginine and uracil, and the minimum medium not containing L-arginine, but containing only 50 ⁇ g/ml of uracil, and screened for a strain in which arginine auxotrophy and uracil auxotrophy were restored, or a strain in which arginine auxotrophy was restored.
- a minimum medium 5 g/L of glucose, 12.8 g/L of Na 2 HPO 4 , 3 g/L of KH 2 PO 4 , 0.5 g/L of NaCl, 1 g/L of NH 4 Cl, 40 ⁇ g/ml of
- the Escherichia coli JEF8/p19 was designated as Escherichia coli AJ13574, and it was deposited at the National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology, Ministry of International Trade and Industry (postal code 305-8566, 1-3 Higashi 1-chome, Tsukuba-shi, Ibaraki-ken, Japan) on Jan. 28, 1999, and received an accession number of FERM P-17180, and transferred from the original deposit to international deposit based on Budapest Treaty on Jan. 6, 20000, and has been deposited as deposition number of FERM BP-6989.
- a plasmid was prepared from JEF8/p19 in a conventional manner, and used for re-transformation of the JEF8 strain.
- the obtained transformants could grow in the minimum culture medium not containing L-arginine and uracil, and its auxotrophy for both of L-arginine and uracil was restored. Therefore, it was found that that the plasmid contained a gene complementing the auxotrophy for both of L-arginine and uracil caused by deletion of carB in the Escherichia coli strain.
- this plasmid was introduced into the carA mutant of Escherichia coli , RC50 (carA50, tsx ⁇ 273, ⁇ ⁇ , rpsL135 (str R ), malT1 ( ⁇ R), xylA7, thi ⁇ 1; Mol. Gen. Genet., 133, 299 (1974)). Since the strain introduced with the plasmid was able to grow in the minimum culture medium not containing arginine and uracil, the plasmid was also found to have a gene complementing the auxotrophy for both of L-arginine and uracil caused by carA mutation of the Escherichia coli strain.
- nucleotide sequence of about 4.8 kb from the HindIII side of the multi-cloning site of the vector to the HindIII site contained in the insertion DNA fragment was determined.
- the nucleotide sequencing was performed by using Rohdamin Terminator Cycle Sequencing Kit (produced by ABI) according to the method of Sanger.
- the obtained nucleotide sequence is shown as SEQ ID NO: 1 in Sequence Listing. From analysis of a consensus sequence which located in the upstream region of this gene, it was estimated that two open reading frames (open reading frame from 283rd G to 1461st A and open reading frame from 1756th G to 4809th T) were contained in this sequence.
- the nucleotides of the 162nd (TGCATA) to 194th (TATAAT), the 185th (TGCATA) to 213rd (TAAACT), the 203rd (TTGAAT) 230th (TATCAA), or the 224th (TTATCA) to 251st (TAAAAA) can be estimated to be a promoter region for regulating the transcription.
- the amino acid sequences encoded by these open reading frames are represented with the nucleotide sequences.
- the amino acid sequences were also shown in SEQ ID NOS: 2 and 3.
- a protein database (GenBank CDS) was searched for sequences exhibiting homology with these amino acid sequences.
- the 5′ open reading frame showed high homology (about 40%) with carA gene products of Escherichia coli, Bacillus subtilis and so forth
- the 3′ open reading frame showed high homology with known carB gene products of Escherichia coli, Bacillus stearothermophilus and so forth (about 40 to 50%). Therefore, it was suggested that these open reading frames coded for carA and carB, respectively.
- p19 was introduced into the Brevibacterium flavum wild strain 2247 (AJ14067) by the electric pulse method (Japanese Patent Laid-open No. 2-207791).
- the transformants were selected as chloramphenicol resistant strains on a CM2G plate medium (containing 10 g of polypeptone, 10 g of yeast extract, 5 g of glucose, 5 g of NaCl, 15 g of agar in 1 L of pure water, pH 7.2) containing 5 ⁇ g/ml of chloramphenicol to obtain 2247/p19.
- a plasmid vector autonomously replicable in both of Escherichia coli cells and coryneform bacterium cells was newly produced as a plasmid used for introducing the carA and carB genes into coryneform bacteria.
- a vector containing a drug resistance gene of Streptococcus faecalis was constructed first.
- the kanamycin resistant gene of Streptococcus faecalis was amplified by PCR from a known plasmid containing that gene.
- the nucleotide sequence of the kanamycin resistant gene of Streptococcus faecalis has already been clarified (Trieu-Cuot, P. and Courvalin, P., Gene, 23(3), 331-341 (1983)).
- the primers shown as SEQ ID NOS: 4 and 5 were synthesized based on that sequence, and PCR was performed by using pDG783 (Anne-Marie Guerout-Fleury et al., Gene, 167, 335-337 (1995)) as a template to amplify a DNA fragment containing the kanamycin resistant gene and its promoter.
- pDG783 Anne-Marie Guerout-Fleury et al., Gene, 167, 335-337 (1995)
- the obtained DNA fragment was purified by SUPREC02 produced by the Takara Shuzo, then fully digested with restriction enzymes HindIII and HincII, and blunt-ended.
- the blunt-ending was attained by using Blunting Kit produced by Takara Shuzo.
- This DNA fragment was mixed with and ligated to a DNA fragment, which had been obtained by performing PCR using the primers shown as SEQ ID NOS: 6 and 7 and pHSG399 (see S. Takeshita et al., Gene, 61, 63-74 (1987)) as a template, purifying and blunt-ending the resulted amplification product.
- the ligation reaction was performed by DNA Ligation Kit ver. 2 produced by Takara Shuzo.
- Competent cells of Escherichia coli JM109 were transformed with the ligated DNA, plated on L madium (10 g/L of Bacto-trypton, 5 g/L of Bacto-yeast extract, 5 g/L of NaCl, 15 g/L of agar, pH 7.2) containing 10 ⁇ g/ml of IPTG (isopropyl- ⁇ -D-thiogalactopyranoside), 40 ⁇ g/ml of X-Gal (5-bromo-4-chloro-3-indolyl- ⁇ -D-galactoside) and 25 ⁇ g/ml of kanamycin, and cultured overnight. The emerged blue colonies were picked up, and separated into single colonies to obtain transformant strains.
- L madium 10 g/L of Bacto-trypton, 5 g/L of Bacto-yeast extract, 5 g/L of NaCl, 15 g/L of agar, pH 7.2
- IPTG isopropyl- ⁇ -
- Plasmids were prepared from the transformant strains by the alkali method (Text for Bioengineering Experiments, Edited by the Society for Bioscience and Bioengineering, Japan, p.105, Baifukan, 1992), and restriction maps were prepared.
- One having a restriction map equivalent to that of FIG. 2 was designated as pK1.
- This plasmid is stably retained in Escherichia coli , and imparts kanamycin resistance to a host.
- it contains the lacZ′ gene, it is suitably used as a cloning vector.
- the plasmid pAM330 extracted from Brevibacterium lactofermentum ATCC13869 was fully digested with a restriction enzyme HindIII, and blunt-ended. This fragment was ligated to a fragment obtained by fully digesting the aforementioned pK1 with a restriction enzyme BsaAI. Breveibacterium lactofermentum ATCC13869 was transformed with the ligated DNA. The transformation was performed by the electric pulse method (see Japanese Patent Laid-open No. 2-207791).
- Transformants were selected on a M-CM2B plate (10 g/L of polypeptone, 10 g/L of yeast extract, 5 g/L of NaCl, 10 ⁇ g/L of biotin, 15 g/L of agar, pH 7.2) containing 25 ⁇ g/ml of kanamycin. After cultivation for 2 days, colonies were picked up, and separated into single colonies to obtain the transformants. Plasmid DNA was prepared from the transformants, and restriction maps were prepared. One having the same restriction map as that of FIG. 3 was designated as pSFK6. This plasmid can autonomously replicate in both of Escherichia coli and coryneform bacteria, and imparts kanamycin resistance to a host.
- the aforementioned pSFK6 was digested with SmaI and HindIII.
- the product was ligated to carA and carB gene fragments, which had been obtained by digesting the plasmid p19 prepared from JEF8/p19F in a conventional manner with a restriction enzyme XbaI, blunt-ending the product by using Blunting Kit produced by Takara Shuzo, and further digesting the product with a restriction enzyme HindIII, to obtain a plasmid pcarAB, which contained the carA and carB genes and could autonomously replicate in coryneform bacteria.
- pcarAB was introduced into Brevibacterium flavum AJ11345 and AJ11336 by the electric pulse method (Japanese Patent Laid-open No. 2-207791). Transformants were selected on a M-CM2B plate (10 g/L of polypeptone, 10 g/L of yeast extract, 5 g/L of glucose, 5 g/L of NaCl, 15 g/L of agar, pH 7.2) containing 25 ⁇ g/ml of kanamycin as kanamycin resistant strains. As control, transformants were obtained by similarly introducing pSFK6 into AJ11345 and AJ11336.
- Each of the aforementioned transformants was plated on an agar medium containing 0.5 g/dl of glucose, 1 g/dl of polypeptone, 1 g of yeast extract, 0.5 g/dl of NaCl and 5 ⁇ g/l of chloramphenicol, and cultured at 31.5° C. for 20 hours.
- One inoculating loop of the obtained cells were inoculated to a medium containing 4 g/dl of glucose, 6.5 g/dL of ammonium sulfate, 0.1 g/dl of KH 2 PO 4 , 0.04 g/dl of MgSO 4 , 0.001 g/dl of FeSO 4 , 0.01 g/dl of MnSO 4 , 5 ⁇ g/dl of VB 1 , 5 ⁇ g/dl of biotin, 45 mg/dl of soybean hydrolysates (as an amount of N), and cultured in a flask at 31.5° C. for 50 hours with shaking.
- the amounts of L-arginine produced by each strain were shown in Table 1.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
A DNA fragment which encodes a polypeptide defined in the following (a) or (b), and a polypeptide defined in the following (c) or (d):
(a) a polypeptide which has at least the amino acid sequence of the amino acid numbers 50 to 393 in SEQ ID NO: 2 shown in Sequence Listing,
(b) a polypeptide which has at least the amino acid sequence of the amino acid numbers 50 to 393 in SEQ ID NO: 2 shown in Sequence Listing including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a large subunit of carbamoyl-phosphate synthetase having the amino acid sequence of SEQ ID NO: 3,
(c) a polypeptide which has the amino acid sequence of SEQ ID NO: 3 shown in Sequence Listing,
(d) a polypeptide which has the amino acid sequence of SEQ ID NO: 3 shown in Sequence Listing including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a small subunit of carbamoyl-phosphate synthetase having the amino acid sequence of the amino acid numbers 50 to 393 in SEQ ID NO: 2.
Description
- 1. Field of the Invention
- The present invention relates to carbamoyl-phosphate synthetase of coryneform bacteria, and a gene therefor. The gene can be utilized for production of carbamoyl-phosphate synthetase and subunits thereof, breeding of L-arginine-producing bacteria and nucleic acid-producing bacteria and so forth.
- 2. Description of the Related Art
- Carbamoyl-phosphate synthetase is an enzyme that catalyzes the reactions producing carbamoyl phosphate from carbonic acid, ATP and glutamine. Carbamoyl phosphate produced by these reactions serves as a source of carbamoyl group required for the reaction producing citrulline from ornithine in the L-arginine biosynthetic pathway. Furthermore, carbamoyl aspartate produced from aspartic acid and carbamoyl phosphate is one of the intermediates of the pyrimidine biosynthesis system including uridine 5′-monophosphate.
- Carbamoyl-phosphate synthetase consists of two subunits, and it has been known for bacteria belonging to the genus Escherichia or Bacillus that those subunits are encoded by carA and carB genes.
- However, as for coryneform bacteria, there have been no findings about the carbamoyl-phosphate synthetase activity and enzymes therefor, and any genes therefor have not been elucidated.
- Incidentally, it has been reported that when a transformant ofEscherichia coli to which introduced a plasmid harboring the genes carA, carB, argI and arg box was cultured in the medium added with glutamine which is substrate of carbamoyl-phosphate synthetase, the concentration of intracellular L-arginine was the same as that of a control strain to which only the vector was introduced. However, when the transformant was cultured in a medium added with glutamine accompanied with ornithine which is a substrate of ArgI together with carbamoyl phosphate, the concentration of intracellular L-arginine was higher than that of the control strain (Malamy M. et al., Applied Environmental Microbiology, 63(1), 33 (1997)). From these result, it was suggested that the rate-determining step of synthesis of L-arginine is supply of ornithine.
- There was thought to be a possibility that the rate-determining step of supply of ornithine is N-acetylglutamine synthetase (ArgA). ArgA suffers feedback inhibition by the final product, L-arginine, in the biosynthesis pathway ofEscherichia coli.
- As for the strain in which argA gene coding for feedback inhibition-desensitized ArgA was amplified by plasmid, the concentration of intracellular L-arginine was increased even in a medium added with only glutamine as well as in a medium added with both glutamine and ornithine. However, farther increase of concentration of intracellular L-arginine was not observed in the case that the strain was cultured with addition of glutamine, or glutamine and ornithin, also in the case that the both of carA and carB genes were further amplified in the strain (Malamy M. et al.,Applied Environmental Microbiology, 64(5), 1805 (1998)).
- On the other hand, any attempts have not been reported to enhance L-arginine productibity of microorganisms by utilizing a gene coding for carbamoyl-phosphate synthetase derived from coryneform bacterium.
- An object of the present invention is to provide carbamoyl-phosphate synthetase of coryneform bacteria, a gene coding for it, and a method for producing L-arginine with a microorganism utilizing the gene.
- The inventors of the present invention eagerly studied in order to achieve the aforementioned object. As a result, the inventors successfully obtained a DNA fragment containing the carA gene and the carB gene from a wild strain ofBreveibacterium lactofermentum by utilizing a carB-deficient strain of Escherichia coli, and thus accomplished the present invention.
- That is, the present invention provides the followings.
- (1) A DNA fragment which encodes a polypeptide defined in the following (A) or (B):
- (A) a polypeptide which has an amino acid sequence comprises at least the amino acid numbers 50 to 393 of the amino acid sequence of SEQ ID NO: 2,
- (B) a polypeptide which has an amino acid sequence comprises at least the amino acid numbers 50 to 393 of the amino acid sequence of SEQ ID NO: 2 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a large subunit of carbamoyl-phosphate synthetase comprising the amino acid sequence of SEQ ID NO: 3.
- (2) A DNA fragment which encodes a polypeptide defined in the following (C) or (D):
- (C) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3,
- (D) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a small subunit of carbamoyl-phosphate synthetase having an amino acid sequence comprises at least the amino acid numbers 50 to 393 of the amino acid sequence of SEQ ID NO: 2.
- (3) A DNA fragment encoding a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity.
- (4) A DNA fragment which encodes a polypeptide defined in the following (a) or (b), and a polypeptide defined in the following (c) or (d):
- (a) a polypeptide which has an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2,
- (b) a polypeptide which has an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a large subunit of carbamoyl-phosphate synthetase comprising the amino acid sequence of SEQ ID NO: 3,
- (c) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3,
- (d) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a small subunit of carbamoyl-phosphate synthetase having an amino acid sequence comprising the amino acid numbers 50 to 393 in SEQ ID NO: 2.
- (5) The DNA fragment according to (1), which has a nucleotide sequence comprising at least the nucleotide numbers 430 to 1461 in the nucleotide sequence of SEQ ID NO: 1.
- (6) The DNA fragment according to (2), which has a nucleotide sequence comprising at least the nucleotide numbers 1756 to 4809 in the nucleotide sequence of SEQ ID NO: 1.
- (7) The DNA fragment according to (3), which has a nucleotide sequence comprising at least the nucleotide numbers 430 to 4809 in the nucleotide sequence of SEQ ID NO: 1.
- (8) A protein which comprises a polypeptide defined in the following (a) or (b), and a polypeptide defined in the following (c) or (d):
- (a) a polypeptide which has an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2,
- (b) a polypeptide which has an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a large subunit of carbamoyl-phosphate synthetase comprising the amino acid sequence of SEQ ID NO: 3,
- (c) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3,
- (d) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a small subunit of carbamoyl-phosphate synthetase having an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2.
- (9) A coryneform bacterium which is transformed with a DNA fragment according to any one of (1) to (7).
- (10) A microorganism which has enhanced intracellular carbamoyl-phosphate synthetase activity, and has L-arginine productivity.
- (11) The microorganism according to (10), wherein the enhanced intracellular carbamoyl-phosphate synthetase activity is obtained by increasing copy number of DNA encoding carbamoyl-phosphate synthetase of the microorganism, or by modifying an expression regulation sequence so that expression of the gene encoding carbamoyl-phosphate synthetase in the cell should be enhanced.
- (12) The microorganism according to (11), wherein the DNA is a DNA fragment according to any one of (1) to (7).
- (13) The microorganism according to (12), which is a coryneform bacterium.
- (14) A method for producing of L-arginine, comprising the steps of culturing a coryneform bacterium according to any one of (10) to (13) in a medium to produce and accumulate L-arginine in the medium, and collecting the L-arginine from the medium.
- The present invention provides genes coding for the subunits that constitute carbamoyl-phosphate synthetase. The gene can be utilized for production of carbamoyl-phosphate synthetase and subunits thereof, breeding of L-arginine-producing bacteria and nucleic acid-producing bacteria and so forth. Aditionally, L-arginine can be produced efficiently according to the present invention.
- FIG. 1 shows the structure of plasmid p19 containing the carA gene and carB gene.
- FIG. 2 shows a construction process of plasmid pK1.
- FIG. 3 shows a construction process of plasmid pSFK6.
- Hereafter, the present invention will be explained in detail.
- <1> DNA of the Present Invention
- The DNA of the present invention can be obtained from a chromosome DNA library of coryneform bacteria prepared with vectors such as plasmids by selection of the DNA using a microorganism which is deficient in carA or carB, for example,Escherichia coli RC50 (carA50, tsx−273, λ−, rpsL135 (strR), malT1 (λR), xy1A7, thi−1; Mol. Gen. Genet., 133, 299 (1974)), Escherichia coli JEF8 (thr−31, ΔcarB, relA−, metB1, Mol. Gen. Genet., 133, 299 (1974)) and so forth. Because a microorganism which is deficient in carA or carB exhibits L-arginine and uracil auxotrophy, a DNA fragment can be obtained by transforming such a microorganism with a chromosome DNA library, selecting clones in which the auxotrophy is complemented, and recovering a recombinant vector from the selected transformants.
- The coryneform bacteria used for preparing a chromosome DNA library are not particularly limited, and examples thereof include bacteria having been hitherto classified into the genus Brevibacterium but united into the genus Corynebacterium at present (Int. J. Syst. Bacteriol., 41, 255 (1981)), and include bacteria belonging to the genus Brevibacterium closely relative to the genus Corynebacterium, more specifically, wild strains of Breveibacterium lactofermentum and so forth. Chromosome DNA of coryneform bacteria can be prepared by, for example, the method of Saito and Miura (Biochem. Biophys. Acta., 72, 619, (1963)), the method of K. S. Kirby (Biochem. J., 64, 405, (1956)) and so forth.
- A chromosome DNA library can be obtained by partially digesting chromosome DNA with suitable restriction enzymes, ligating each of the obtained DNA fragments to a vector DNA autonomously replicable inEscherichia coli cells to prepare a recombinant DNA, and introducing the DNA into Escherichia coli. The vector is not particularly limited so long as it is a vector usually used for genetic cloning, and plasmid vectors such as pUC19, pUC18, pUC118, and pUC119, phage vectors such as λ phage DNA and so forth can be used. Further, a vector autonomously replicable in both of Escherichia coli cells and coryneform bacterium cells may also be used. Such a vector can be constructed by ligating a vector for Escherichia coli and pAM330, which is a cryptic plasmid of Breveibacterium lactofermentum (see Japanese Patent Laid-open No. 58-67699).
- Specific examples of the vector autonomously replicable within both ofEscherichia coli and coryneform bacterium cells include pSAC4 (see the examples mentioned below), pHK4 (see Japanese Patent Laid-open No. 5-7491) and so forth. Escherichia coli HB101 harboring pHK4 was designated as Escherichia coli AJ13136, and it was deposited at the National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology, Ministry of International Trade and Industry (postal code 305-8566, 1-3 Higashi 1-chome, Tsukuba-shi, Ibaraki-ken, Japan) on Aug. 1, 1995, and received an accession number of FERM BP-5186.
- The transformation ofEscherichia coli cells can be performed by, for example, the method of D. A. Morrison (Methods in Enzymology, 68, 326, 1979), the method of treating recipient cells with calcium chloride so as to increase the permeability of DNA (Mandel, M. and Higa, A., J. Mol. Biol., 53, 159 (1970)) and so forth. As for methods for preparation of chromosome DNA library, preparation of plasmid DNA, and digestion and ligation of DNA, as well as methods for PCR, preparation of oligonucleotides and hybridization mentioned hereinafter, conventional methods well known to those skilled in the art can be used. Such methods are described in Sambrook, J., Fritsch, E. F. and Maniatis, T., “Molecular Cloning, A Laboratory Manual, Second Edition”, Cold Spring Harbor Laboratory Press, (1989) and so forth.
- A nucleotide sequence of a DNA fragment containing carA and carB obtained as described above is represented as SEQ ID NO: 1 in Sequence Listing. This sequence contains two open reading frames (ORF, nucleotide numbers 283 to 1461 and nucleotide numbers 1756 to 4809). The upstream ORF is carA, and the downstream ORF is carB. The amino acid sequences encoded by these ORFs are shown in SEQ ID NOS: 2 and 3, respectively. According to the present invention, a peptide encoded by carA is referred to as a small subunit, and a peptide encoded by carB is referred to as a large subunit. As for the coding region of carA, GTG of the nucleotide numbers 283 to 285 is indicated as the initiation codon in Sequence Listing. However, GTG of the nucleotide numbers 415 to 417 or ATG of the nucleotide numbers 430 to 432 may possibly be the initiation codon. In any case, an active small subunit can be obtained by using a longer open reading frame for the upstream region for the expression of carA. The amino acid corresponding to the GTG as the initiation codon is indicated as valine for each subunit, but it may be methionine, valine or formylmethionine.
- The small subunit of the carbamoyl-phosphate synthetase of the present invention is, for example, a polypeptide having the amino acid sequence of the amino acid numbers 50 to 393 in SEQ ID NO: 2, polypeptide having the amino acid sequence of the amino acid numbers 45 to 393 in SEQ ID NO: 2, polypeptide having the amino acid sequence of the amino acid numbers 1 to 393 in SEQ ID NO: 2 or the like. The large subunit of the carbamoyl-phosphate synthetase of the present invention is, for example, a polypeptide having the amino acid sequence shown as SEQ ID NO: 3.
- According to the present invention, the DNA coding for the small subunit may be one coding for an amino acid sequence which contains the amino acid sequence of the amino acid numbers 50 to 393 in SEQ ID NO: 2 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, or one coding for a polypeptide which can constitute a protein having a carbamoyl-phosphate synthetase activity with the large subunit.
- According to the present invention, the DNA coding for the large subunit may be one coding for an amino acid sequence which contains the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, or one coding for a polypeptide which can constitute a protein having a carbamoyl-phosphate synthetase activity with the small subunit. Alternatively, it may be one coding for a protein which has the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and has a carbamoyl-phosphate synthetase activity.
- Furthermore, a DNA that encodes carbamoyl-phosphate synthetase containing a mutation or mutations in the small subunit or the large subunit, or both of them also falls within the scope of the DNA of the present invention.
- The term “several amino acids” preferably means 1 to 20 amino acids, more preferably 1 to 10 amino acids.
- DNA, which encodes the substantially same peptide as the small subunit or the large subunit as described above, is obtained, for example, by modifying the nucleotide sequence of the DNA encoding the small subunit or the large subunit, for example, by means of the site-directed mutagenesis method so that one or more amino acid residues at a specified site of the gene involve substitution, deletion, insertion, addition, or inversion. DNA modified as described above may be obtained by the conventionally known mutation treatment. The mutation treatment includes a method for treating DNA coding for the small subunit or the large subunit in vitro, for example, with hydroxylamine, and a method for treating a microorganism, for example, a bacterium belonging to the genus Escherichia harboring DNA coding for the small subunit and the large subunit with ultraviolet irradiation or a mutating agent such as N-methyl-N′-nitro-N-nitrosoguanidine (NTG) and nitrous acid usually used for the mutation treatment.
- The substitution, deletion, insertion, addition, or inversion of nucleotide as described above also includes mutation (mutant or variant) which naturally occurs, for example, the difference in strains, species or genera of the microorganism having the small subunit and/or the large subunit.
- The DNA, which encodes substantially the same protein as carbamoyl-phosphate synthetase, is obtained by expressing DNA having mutation as described above in an appropriate cell, and investigating the carbamoyl-phosphate synthetase activity of an expressed product. The carbamoyl-phosphate synthetase activity can be measured by the known method (Journal of Genral Microbiology, 136, 1177-1183 (1990)). The DNA, which encodes substantially the same protein as carbamoyl-phosphate synthetase, is also obtained by isolating DNA which is hybridizable with DNA having, for example, a nucleotide sequence corresponding to nucleotide numbers of 283 to 1461 or 1756 to 4809 of the nucleotide sequence of SEQ ID NO: 2, under a stringent condition, and which encodes a protein having the carbamoyl-phosphate synthetase activity, from DNA coding for carbamoyl-phosphate synthetase having mutation or from a cell harboring it. The “stringent condition” referred to herein is a condition under which so-called specific hybrid is formed, and non-specific hybrid is not formed. It is difficult to clearly express this condition by using any numerical value. However, for example, the stringent condition includes a condition under which DNA's having high homology, for example, DNA's having homology of not less than 70%, preferably not less than 80%, more preferably not less than 90% are hybridized with each other, and DNA's having homology lower than the above are not hybridized with each other. Alternatively, the stringent condition is exemplified by a condition under which DNA's are hybridized with each other at a salt concentration corresponding to an ordinary condition of washing in Southern hybridization, i.e., 60° C., 1×SSC, 0.1% SDS, preferably 0.1×SSC, 0.1% SDS.
- As a probe, a partial sequence of the nucleotide sequence of SEQ ID NO: 1 can also be used. Such a probe may be prepared by PCR using oligonucleotides produced based on the nucleotide sequence of SEQ ID NO: 1 as primers, and a DNA fragment containing the nucleotide sequence of SEQ ID NO: 1 as a template. When a DNA fragment in a length of about 300 bp is used as the probe, the conditions of washing for the hybridization consist of, for example, 50° C., 2×SSC, and 0.1% SDS.
- Because the nucleotide sequence of the DNA of the present invention has been elucidated, the DNA of the present invention can be obtained by amplifying it from coryneform bacterial chromosome DNA through polymerase chain reaction (PCR: polymerase chain reaction; see White, T. J. et al.,Trends Genet., 5, 185 (1989)) utilizing oligonucleotides prepared based on that nucleotide sequence as primers, or by selecting it from a coryneform bacterial chromosome DNA library by hybridization utilizing an oligonucleotide prepared based on that nucleotide sequence as a probe. As nucleotide sequences of the primers used for PCR, a region upstream from the nucleotide number 283, preferably a region upstream from the nucleotide number 185 of SEQ ID NO: 1 can suitably be selected as the 5′ primer, and a region downstream from the nucleotide number 4809 of SEQ ID NO: 1 can suitably be selected as the 3′ primer.
- Examples of the host for the expression of the DNA of the present invention include various bacteria such asEscherichia coli and coryneform bacteria including Breveibacterium lactofermentum and Brevibacterium flavum, eukaryotic cells such as those of Saccharomyces cerevisiae and so forth. In order to introduce the DNA of the present invention into these hosts, the host cells can be transformed with a recombinant vector obtained by inserting the DNA of the present invention into a vector selected according to the nature of the host in which the DNA is to be expressed. This procedure can be performed by a method well known to those skilled in the art. Specific examples of the method include the methods used for transformation of Escherichia coli mentioned above, the method in which competent cells are prepared from cells at the proliferating stage to introduce DNA, as reported for Bacillus subtilis (Duncan, C. H., Wilson, G. A. and Young, F. E., Gene, 1, 153 (1977)), the method in which DNA recipient cells are allowed to be in a state of protoplasts or spheroplasts capable of incorporating recombinant DNA with ease to introduce recombinant DNA into the DNA recipient cells, as known for Bacillus subtilis, actinomycetes, and yeasts (Chang, S. and Choen, S. N., Molec. Gen. Genet., 168, 111 (1979); Bibb, M. J., Ward, J. M. and Hopwood, O. A., Nature, 274, 398 (1978); Hinnen, A., Hicks, J. B. and Fink, G. R., Proc. Natl. Acad. Sci. USA, 75, 1929 (1978)), the electric pulse method useful for cryneform bacteria (refer to Japanese Patent Publication Laid-Open No. 2-207791) and so forth.
- The DNA to be introduced into the host such as those mentioned above may be DNA containing either carA or carB, or DNA containing both of them. Further, in order to attain efficient expression of these genes, a promoter functioning in the host cells such as lac, trp and PL may be ligated at a position upstream from carA or carB.
- Carbamoyl-phosphate synthetase or its subunits can be produced by culturing a transformant such as those mentioned above under a condition that allows the expression of carA or carB. The DNA of the present invention can also be utilized for breeding of L-arginine-producing bacteria or nucleic acid-producing bacteria such as uracil-producing bacteria. That is, a transformant introduced with the DNA of the present invention, in particular, one introduced with either carA or carB or both of them, should have increased carbamoyl-phosphate synthetase activity compared with non-transformants. Consequently, its productivity for L-arginine or nucleic acid such as uracil is improved.
- <2> Method for Producing L-Arginine According to the Present Invention
- L-Arginine can efficiently be produced by culturing a microorganism that has enhanced intracellular carbamoyl-phosphate synthetase activity, and has L-arginine productivity in a medium to produce and accumulate L-arginine in the medium, and collecting the L-arginine from the medium.
- Specific examples of the microorganism having L-arginine productivity include coryneform bacteria, bacteria belonging to the genera Bacillus, Serratia and Escherichia, yeast species belonging to the genus Saccharomyces or Candida. Of these, coryneform bacteria are preferred.
- Exemplary specific species includeBacillus subtilis as a bacterium belonging to the genus Bacillus, Serratia marcescens as a bacterium belonging to the genus Serratia, Escherichia coli as a bacterium belonging to the genus Escherichia, Saccharomyces cerevisiae as a yeast species belonging to the genus Saccharomyces, Candida tropicalis as a yeast species belonging to the genus Candida and so forth.
- Exemplary microorganisms having L-arginine productivity includeBacillus subtilis resistant to 5-azauracil, 6-azauracil, 2-thiouracil, 5-fluorouracil, 5-bromouracil, 5-azacytosine and so forth, Bacillus subtilis resistant to arginine hydroxamate and 2-thiouracil, Bacillus subtilis resistant to arginine hydroxamate and 6-azauracil (see Japanese Patent Laid-open No. 49-1268191),
-
- a mutant ofBacillus subtilis exhibiting auxotrophy for at least one of methionine, histidine, threonine, proline, isoleucine, lysine, adenine, guanine and uracil (or uracil precursor) (see Japanese Patent Laid-open No. 52-99289),
-
-
-
-
-
- Coryneform bacteria include those bacteria having been hitherto classified into the genus Brevibacterium but united into the genus Corynebacterium at present (Int. J. Syst. Bacteriol., 41, 255 (1981)), and include bacteria belonging to the genus Brevibacterium closely relative to the genus Corynebacterium. Examples of such coryneform bacteria are listed below.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
- The coryneform bacteria that have the L-arginine productivity are not particularly limited so long as they have the L-arginine productivity. They include, for example, wild-type strains of coryneform bacteria; coryneform bacteria resistant to certain agents including sulfa drugs, 2-thiazolealanine, α-amino-β-hydroxyvaleric acid and the like; coryneform bacteria exhibiting L-histidine, L-proline, L-threonine, L-isoleucine, L-methionine, or L-tryptophan auxotrophy in addition to the resistance to 2-thiazolealanine (Japanese Patent Laid-open No. 54-44096); coryneform bacteria resistant to ketomalonic acid, fluoromalonic acid, or monofluoroacetic acid (Japanese Patent Laid-open No. 57-18989); coryneform bacteria resistant to argininol (Japanese Patent Laid-open No. 62-24075); coryneform bacteria resistant to X-guanidine (X represents a derivative of fatty acid or aliphatic chain, Japanese Patent Laid-open No. 2-186995) and so forth.
- Specifically, the following bacterial strains can be exemplified.
-
-
-
-
-
- The AJ11169 strain and the AJ12092 strain are the 2-thiazolealanine resistant strains mentioned in Japanese Patent Laid-open No. 54-44096, the AJ11336 strain is the strain having argininol resistance and sulfadiazine resistance mentioned in Japanese Patent Publication No. 62-24075, the AJ11345 strain is the strain having argininol resistance, 2-thiazolealanine resistance, sulfaguanidine resistance, and exhibiting histidine auxotrophy mentioned in Japanese Patent Publication No. 62-24075, and the AJ12430 strain is the strain having octylguanidine resistance and 2-thiazolealanine resistance mentioned in Japanese Patent Laid-open No. 2-186995.
- The intracellular carbamoyl-phosphate synthetase activity of such microorganisms having the L-arginine productivity as mentioned above can be enhanced by, for example, increasing copy number of a gene coding for the carbamoyl-phosphate synthetase in the cells of the aforementioned microorganisms. The enhancement of the carbamoyl-phosphate synthetase activity can also be achieved by, in addition to the aforementioned gene amplification, modifying an expression regulation sequence for the DNA coding for carbamoyl-phosphate synthetase so that expression of the DNA gene coding for carbamoyl-phosphate synthetase should be enhanced. Specifically, an expression regulation sequence such as a promoter for a gene coding for carbamoyl-phosphate synthetase on the chromosomal DNA or a plasmid can be replaced with a stronger one (see Japanese Patent Laid-open No. 1-215280). Strong promoters, which function in cells of coryneform bacteria, include lac promoter, tac promoter, trp promoter, ofEscherichia coli (Y. Morinaga, M. Tsuchiya, K. Miwa and K. Sano, J. Biotech., 5, 305-312 (1987)) and the like. In addition, trp promoter of Corynebacterium bacteria is also a preferable promoter (Japanese Patent Laid-open No. 62-195294). By the replacement with these promoters the carbamoyl-phosphate synthetase activity is enhanced. The modification of expression regulation sequence may be combined with the increasing of the copy number of DNA coding for carbamoyl-phosphate synthetase. Further, the intracellular carbamoyl-phosphate synthetase activity can be enhanced by introducing one or more mutations into the enzyme protein of carbamoyl-phosphate synthetase so that the specific activity of the enzyme should be increased.
- Examples of the DNA coding for carbamoyl-phosphate synthetase include the aforementioned carA and carB genes ofBreveibacterium lactofermentum and one containing both of them.
- Examples of the vector for introducing DNA coding for carbamoyl-phosphate synthetase into a microorganism include vectors autonomously replicable in cells of the microorganism. Specifically, the aforementioned vectors autonomously replicable inEscherichia coli cells, and the vectors autonomously replicable in both of Escherichia coli cells and coryneform bacterium cells.
- The medium used for culturing a microorganism having enhanced intracellular carbamoyl-phosphate synthetase activity and L-arginine productivity obtained as described above may be a well-known medium conventionally used for the production of amino acids by fermentation. That is, it is a usual medium that contains a carbon source, nitrogen source, inorganic ions, and other organic components as required.
- As the carbon source, it is possible to use sugars such as glucose, sucrose, lactose, galactose, fructose and starch hydrolysates; alcohols such as glycerol and sorbitol; or organic acids such as fumaric acid, citric acid and succinic acid and so forth.
- As the nitrogen source, it is possible to use inorganic ammonium salts such as ammonium sulfate, ammonium chloride and ammonium phosphate, organic nitrogen such as soybean hydrolysates, ammonia gas, aqueous ammonia and so forth.
- The medium preferably contains a suitable amount of required substance such as vitamin B1 and L-homoserine, yeast extract and so forth as trace amount organic nutrients. Other than those substances, a small amount of potassium phosphate, magnesium sulfate, iron ions, manganese ions and so forth may be added to the medium.
- The cultivation is preferably performed under an aerobic condition for 1-7 days. Cultivation temperature is preferably 24-37° C., and pH of the medium during the cultivation is preferably 5-9. Inorganic or organic acidic or alkaline substances, ammonia gas and so forth may be used for adjusting pH. L-Arginine can usually be recovered from the fermentation medium by a combination of known techniques such as ion exchange resin method.
- <1> Preparation of Chromosome DNA ofBrevibacterium lactofermentum ATCC13869
-
- <2> Preparation of Gene Library ofBrevibacterium lactofermentum ATCC13869 using Plasmid Vector DNA
- As a plasmid vector DNA autonomously replicable in both ofEscherichia coli cells and coryneform bacterium cells, pSAC4 was used. pSAC4 was prepared as follows. In order to make a vector pHSG399 for Escherichia coli (Takara Shuzo) autonomously replicable in coryneform bacterium cells, a replication origin of the previously obtained plasmid pHM1519 autonomously replicable in coryneform bacterium cells (Miwa, K. et al., Agric. Biol. Chem., 48 (1984) 2901-2903) was introduced into the vector (Japanese Patent Laid-open No. 5-7491). Specifically, pHM1519 was digested with restriction enzymes BamHI and KpnI to obtain a gene fragment containing the replication origin, and the obtained fragment was blunt-ended by using Blunting Lit produced by Takara Shuzo, and inserted into the SalI site of pHSG399 using a SalI linker (produced by Takara Shuzo) to obtain pSAC4.
- In 50 mM Tris-HCl buffer (containing 100 mM NaCl and 10 mM magnesium sulfate (pH 7.4)), 20 μg of pSAC4 and 200 units of a restriction enzyme BamHI were mixed, and allowed to react at a temperature of 37° C. for 2 hours to obtain a digestion solution. This solution was subjected to phenol extraction and ethanol precipitation in a conventional manner. Then, in order to inhibit re-ligation of the DNA fragments derived from the plasmid vector, the DNA fragments were dephosphorylated with bacterial alkaline phosphatase according to the method described in Molecular Cloning, 2nd Edition (J. Sambrook, E. F. Fritsch and T. Maniatis, Cold Spring Harbor Laboratory Press, p1.56 (1989)), and subjected to phenol extraction and ethanol precipitation in a conventional manner.
- To 66 mM Tris-HCl buffer (pH 7.5) containing 66 mM magnesium chloride, 10 mM dithiothreitol and 10 mM ATP, 1 μg of the pSAC4 digested with BamHI, 1 μg of the chromosome DNA fragments ofBrevibacterium lactofermentum ATCC13869 digested with Sau3AI obtained in Example 1, and 2 units of T4 DNA ligase (produced by Takara Shuzo) were added, and allowed to react at a temperature of 16° C. for 16 hours to ligate the DNA. Then, Escherichia coli DH5 was transformed with this DNA mixture in a conventional manner, and plated on an L agar medium containing 170 μg/ml of chloramphenicol to obtain about 20,000 colonies, which were used as a gene library.
- <3> Transformation of carB-deficient Strain ofEscherichia coli (JEF8)
- The carB-deficient strain ofEscherichia coli, JEF8 (thr31 31, ΔcarB, relA−, metB1; Mol. Gen. Genet., 133, 299 (1974)) was transformed with a recombinant DNA mixture of the aforementioned gene library in a conventional manner. Transformants of about 15000 strains were obtained as Cm resistant strains. These transformants were replicated on a minimum medium (5 g/L of glucose, 12.8 g/L of Na2HPO4, 3 g/L of KH2PO4, 0.5 g/L of NaCl, 1 g/L of NH4Cl, 40 μg/ml of L-threonine, 40 μg/ml of L-methionine) not containing arginine and uracil, and the minimum medium not containing L-arginine, but containing only 50 μg/ml of uracil, and screened for a strain in which arginine auxotrophy and uracil auxotrophy were restored, or a strain in which arginine auxotrophy was restored. Strains in which arginine auxotrophy was restored recovered both of arginine auxotrophy and uracil auxotrophy. A plasmid harbored in one of such strains was designated as p19, and the strain harboring it was designated as JEF8/p19. The structure of p19 is shown in FIG. 1.
- TheEscherichia coli JEF8/p19 was designated as Escherichia coli AJ13574, and it was deposited at the National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology, Ministry of International Trade and Industry (postal code 305-8566, 1-3 Higashi 1-chome, Tsukuba-shi, Ibaraki-ken, Japan) on Jan. 28, 1999, and received an accession number of FERM P-17180, and transferred from the original deposit to international deposit based on Budapest Treaty on Jan. 6, 20000, and has been deposited as deposition number of FERM BP-6989.
- <4> Acquisition of Plasmid Complementing Arginine and Uracil Auxotrophy
- A plasmid was prepared from JEF8/p19 in a conventional manner, and used for re-transformation of the JEF8 strain. The obtained transformants could grow in the minimum culture medium not containing L-arginine and uracil, and its auxotrophy for both of L-arginine and uracil was restored. Therefore, it was found that that the plasmid contained a gene complementing the auxotrophy for both of L-arginine and uracil caused by deletion of carB in theEscherichia coli strain.
- Further, this plasmid was introduced into the carA mutant ofEscherichia coli, RC50 (carA50, tsx−273, λ−, rpsL135 (strR), malT1 (λR), xylA7, thi−1; Mol. Gen. Genet., 133, 299 (1974)). Since the strain introduced with the plasmid was able to grow in the minimum culture medium not containing arginine and uracil, the plasmid was also found to have a gene complementing the auxotrophy for both of L-arginine and uracil caused by carA mutation of the Escherichia coli strain.
- <5> Nucleotide Sequence Analysis of p19
- Among the DNA sequence of p19, the nucleotide sequence of about 4.8 kb from the HindIII side of the multi-cloning site of the vector to the HindIII site contained in the insertion DNA fragment was determined. The nucleotide sequencing was performed by using Rohdamin Terminator Cycle Sequencing Kit (produced by ABI) according to the method of Sanger. The obtained nucleotide sequence is shown as SEQ ID NO: 1 in Sequence Listing. From analysis of a consensus sequence which located in the upstream region of this gene, it was estimated that two open reading frames (open reading frame from 283rd G to 1461st A and open reading frame from 1756th G to 4809th T) were contained in this sequence. The nucleotides of the 162nd (TGCATA) to 194th (TATAAT), the 185th (TGCATA) to 213rd (TAAACT), the 203rd (TTGAAT) 230th (TATCAA), or the 224th (TTATCA) to 251st (TAAAAA) can be estimated to be a promoter region for regulating the transcription.
- The amino acid sequences encoded by these open reading frames are represented with the nucleotide sequences. The amino acid sequences were also shown in SEQ ID NOS: 2 and 3. A protein database (GenBank CDS) was searched for sequences exhibiting homology with these amino acid sequences. As a result, it was found that the 5′ open reading frame showed high homology (about 40%) with carA gene products ofEscherichia coli, Bacillus subtilis and so forth, and the 3′ open reading frame showed high homology with known carB gene products of Escherichia coli, Bacillus stearothermophilus and so forth (about 40 to 50%). Therefore, it was suggested that these open reading frames coded for carA and carB, respectively.
- <6> Introduction of carA and carB into Wild-Type Strain of Coryneform Bacteria
- p19 was introduced into theBrevibacterium flavum wild strain 2247 (AJ14067) by the electric pulse method (Japanese Patent Laid-open No. 2-207791). The transformants were selected as chloramphenicol resistant strains on a CM2G plate medium (containing 10 g of polypeptone, 10 g of yeast extract, 5 g of glucose, 5 g of NaCl, 15 g of agar in 1 L of pure water, pH 7.2) containing 5 μg/ml of chloramphenicol to obtain 2247/p19.
- <1> Preparation of Shuttle Vector
- First, a plasmid vector autonomously replicable in both ofEscherichia coli cells and coryneform bacterium cells was newly produced as a plasmid used for introducing the carA and carB genes into coryneform bacteria.
- A vector containing a drug resistance gene ofStreptococcus faecalis was constructed first. The kanamycin resistant gene of Streptococcus faecalis was amplified by PCR from a known plasmid containing that gene. The nucleotide sequence of the kanamycin resistant gene of Streptococcus faecalis has already been clarified (Trieu-Cuot, P. and Courvalin, P., Gene, 23(3), 331-341 (1983)). The primers shown as SEQ ID NOS: 4 and 5 were synthesized based on that sequence, and PCR was performed by using pDG783 (Anne-Marie Guerout-Fleury et al., Gene, 167, 335-337 (1995)) as a template to amplify a DNA fragment containing the kanamycin resistant gene and its promoter.
- The obtained DNA fragment was purified by SUPREC02 produced by the Takara Shuzo, then fully digested with restriction enzymes HindIII and HincII, and blunt-ended. The blunt-ending was attained by using Blunting Kit produced by Takara Shuzo. This DNA fragment was mixed with and ligated to a DNA fragment, which had been obtained by performing PCR using the primers shown as SEQ ID NOS: 6 and 7 and pHSG399 (see S. Takeshita et al.,Gene, 61, 63-74 (1987)) as a template, purifying and blunt-ending the resulted amplification product. The ligation reaction was performed by DNA Ligation Kit ver. 2 produced by Takara Shuzo. Competent cells of Escherichia coli JM109 (produced by Takara Shuzo) were transformed with the ligated DNA, plated on L madium (10 g/L of Bacto-trypton, 5 g/L of Bacto-yeast extract, 5 g/L of NaCl, 15 g/L of agar, pH 7.2) containing 10 μg/ml of IPTG (isopropyl-β-D-thiogalactopyranoside), 40 μg/ml of X-Gal (5-bromo-4-chloro-3-indolyl-β-D-galactoside) and 25 μg/ml of kanamycin, and cultured overnight. The emerged blue colonies were picked up, and separated into single colonies to obtain transformant strains.
- Plasmids were prepared from the transformant strains by the alkali method (Text for Bioengineering Experiments, Edited by the Society for Bioscience and Bioengineering, Japan, p.105, Baifukan, 1992), and restriction maps were prepared. One having a restriction map equivalent to that of FIG. 2 was designated as pK1. This plasmid is stably retained inEscherichia coli, and imparts kanamycin resistance to a host. Moreover, since it contains the lacZ′ gene, it is suitably used as a cloning vector.
- The plasmid pAM330 extracted fromBrevibacterium lactofermentum ATCC13869 (see Japanese Patent Laid-open No. 58-67699) was fully digested with a restriction enzyme HindIII, and blunt-ended. This fragment was ligated to a fragment obtained by fully digesting the aforementioned pK1 with a restriction enzyme BsaAI. Breveibacterium lactofermentum ATCC13869 was transformed with the ligated DNA. The transformation was performed by the electric pulse method (see Japanese Patent Laid-open No. 2-207791). Transformants were selected on a M-CM2B plate (10 g/L of polypeptone, 10 g/L of yeast extract, 5 g/L of NaCl, 10 μg/L of biotin, 15 g/L of agar, pH 7.2) containing 25 μg/ml of kanamycin. After cultivation for 2 days, colonies were picked up, and separated into single colonies to obtain the transformants. Plasmid DNA was prepared from the transformants, and restriction maps were prepared. One having the same restriction map as that of FIG. 3 was designated as pSFK6. This plasmid can autonomously replicate in both of Escherichia coli and coryneform bacteria, and imparts kanamycin resistance to a host.
- <2> Introduction of carA and carB Genes into Coryneform Bacteria and Production of L-arginine
- The aforementioned pSFK6 was digested with SmaI and HindIII. The product was ligated to carA and carB gene fragments, which had been obtained by digesting the plasmid p19 prepared from JEF8/p19F in a conventional manner with a restriction enzyme XbaI, blunt-ending the product by using Blunting Kit produced by Takara Shuzo, and further digesting the product with a restriction enzyme HindIII, to obtain a plasmid pcarAB, which contained the carA and carB genes and could autonomously replicate in coryneform bacteria.
- pcarAB was introduced intoBrevibacterium flavum AJ11345 and AJ11336 by the electric pulse method (Japanese Patent Laid-open No. 2-207791). Transformants were selected on a M-CM2B plate (10 g/L of polypeptone, 10 g/L of yeast extract, 5 g/L of glucose, 5 g/L of NaCl, 15 g/L of agar, pH 7.2) containing 25 μg/ml of kanamycin as kanamycin resistant strains. As control, transformants were obtained by similarly introducing pSFK6 into AJ11345 and AJ11336.
- Each of the aforementioned transformants was plated on an agar medium containing 0.5 g/dl of glucose, 1 g/dl of polypeptone, 1 g of yeast extract, 0.5 g/dl of NaCl and 5 μg/l of chloramphenicol, and cultured at 31.5° C. for 20 hours. One inoculating loop of the obtained cells were inoculated to a medium containing 4 g/dl of glucose, 6.5 g/dL of ammonium sulfate, 0.1 g/dl of KH2PO4, 0.04 g/dl of MgSO4, 0.001 g/dl of FeSO4, 0.01 g/dl of MnSO4, 5 μg/dl of VB1, 5 μg/dl of biotin, 45 mg/dl of soybean hydrolysates (as an amount of N), and cultured in a flask at 31.5° C. for 50 hours with shaking. The amounts of L-arginine produced by each strain were shown in Table 1.
- The strains introduced with the carA and carB gene showed improved L-arginine productivity compared with the strains introduced only with the vector.
TABLE 1 Strain/plasmid L-arginine (g/dl) AJ11345/pSFK6 1.33 AJ11345/pcarAB 1.39 AJ11336/pSFK6 0.71 AJ11336/pcarAB 0.79 -
-
0 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 7 <210> SEQ ID NO 1 <211> LENGTH: 4838 <212> TYPE: DNA <213> ORGANISM: Brevibacterium lactofermentum <220> FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (283)..(1461) <221> NAME/KEY: CDS <222> LOCATION: (1756)..(4809) <400> SEQUENCE: 1 gatccaggaa aaacctggac agcatccggt gcagactttg cgtccaaggc tgaaaacacc 60 ccatttgagg gccaggaatt cagcgctaag gtcacacaca ccgtgcttcg tggcaaggtg 120 acttgtgcag acggagttgc gcaagacgct taacgggtgg gtgcatagta tgcacgcgcc 180 gcattgcata taatgcaatg aattgaataa actacattca gggttatcaa ccagccaatt 240 tcttttaaaa agacagacac acgaaaggcg acaacagtca cc gtg agt aaa gac 294 Val Ser Lys Asp 1 acc acc acc tac cag gga gtc acc gag atc gga tcc gtt ccg gca tac 342 Thr Thr Thr Tyr Gln Gly Val Thr Glu Ile Gly Ser Val Pro Ala Tyr 5 10 15 20 ctg gtt ctt gca gac gga cgt acc ttc acc gga ttt ggc ttt gga gct 390 Leu Val Leu Ala Asp Gly Arg Thr Phe Thr Gly Phe Gly Phe Gly Ala 25 30 35 atc ggc acc acc ctt ggt gag gca gtg ttc acc acc gcc atg acc ggt 438 Ile Gly Thr Thr Leu Gly Glu Ala Val Phe Thr Thr Ala Met Thr Gly 40 45 50 tac caa gaa acc atg acc gat cct tcc tat cac cgc cag att gtt gtg 486 Tyr Gln Glu Thr Met Thr Asp Pro Ser Tyr His Arg Gln Ile Val Val 55 60 65 gct acc gca cca cag atc ggt aac acc ggc tgg aac gat gag gac aac 534 Ala Thr Ala Pro Gln Ile Gly Asn Thr Gly Trp Asn Asp Glu Asp Asn 70 75 80 gag tcc cgc gac ggc aag att tgg gtt gca ggc ctt gtt atc cgc gac 582 Glu Ser Arg Asp Gly Lys Ile Trp Val Ala Gly Leu Val Ile Arg Asp 85 90 95 100 ctc gca gca cgt gtg tcc aac tgg cgc gcc acc acc tcc ttg cag cag 630 Leu Ala Ala Arg Val Ser Asn Trp Arg Ala Thr Thr Ser Leu Gln Gln 105 110 115 gaa atg gca gac caa ggc atc gtc ggc atc ggc gga atc gac acc cgc 678 Glu Met Ala Asp Gln Gly Ile Val Gly Ile Gly Gly Ile Asp Thr Arg 120 125 130 gca ctg gtt cgc cac ctg cgc aac gaa ggt tcc atc gca gcg ggc atc 726 Ala Leu Val Arg His Leu Arg Asn Glu Gly Ser Ile Ala Ala Gly Ile 135 140 145 ttc tcc ggc gct gac gca cag cgc cca gtt gaa gaa ctc gta gag atc 774 Phe Ser Gly Ala Asp Ala Gln Arg Pro Val Glu Glu Leu Val Glu Ile 150 155 160 gtc aag aat cag cca gca atg acc ggc gca aac ctc tcc gtt gag gtc 822 Val Lys Asn Gln Pro Ala Met Thr Gly Ala Asn Leu Ser Val Glu Val 165 170 175 180 tct gct gat gaa acc tac gtc atc gaa gct gag ggc gaa gag cgc cac 870 Ser Ala Asp Glu Thr Tyr Val Ile Glu Ala Glu Gly Glu Glu Arg His 185 190 195 acc gtc gtg gcc tac gac ctg ggc att aag caa aac acc cca cgt cgt 918 Thr Val Val Ala Tyr Asp Leu Gly Ile Lys Gln Asn Thr Pro Arg Arg 200 205 210 ttc tct gca cgc ggt gtt cgc acc gtc atc gtg cct gct gaa acc cca 966 Phe Ser Ala Arg Gly Val Arg Thr Val Ile Val Pro Ala Glu Thr Pro 215 220 225 ttg gag gac atc aag cag tac aac cca tca ggc gtg ttt atc tcc aat 1014 Leu Glu Asp Ile Lys Gln Tyr Asn Pro Ser Gly Val Phe Ile Ser Asn 230 235 240 ggc cct ggc gac cct gca gca gca gac gtc atg gtt gat atc gtc cgc 1062 Gly Pro Gly Asp Pro Ala Ala Ala Asp Val Met Val Asp Ile Val Arg 245 250 255 260 gaa gtt ctg gaa gcc gac att cca ttc ttt ggc atc tgc ttc ggc aac 1110 Glu Val Leu Glu Ala Asp Ile Pro Phe Phe Gly Ile Cys Phe Gly Asn 265 270 275 cag atc ctc ggc cgc gca ttc ggc atg gag acc tac aag ctg aag ttc 1158 Gln Ile Leu Gly Arg Ala Phe Gly Met Glu Thr Tyr Lys Leu Lys Phe 280 285 290 ggc cac cgc ggc atc aac gtt cca gtg aag aac cac atc acc ggc aag 1206 Gly His Arg Gly Ile Asn Val Pro Val Lys Asn His Ile Thr Gly Lys 295 300 305 atc gac atc acc gcc cag aac cac ggc ttc gca ctc aag ggt gaa gca 1254 Ile Asp Ile Thr Ala Gln Asn His Gly Phe Ala Leu Lys Gly Glu Ala 310 315 320 ggc cag gaa ttc gag aca gat ttc ggc act gcg att gtc acc cac acc 1302 Gly Gln Glu Phe Glu Thr Asp Phe Gly Thr Ala Ile Val Thr His Thr 325 330 335 340 tgc ctt aac gac ggc gtc gtt gaa ggt gtt gcg ctg aag tcc gga cgc 1350 Cys Leu Asn Asp Gly Val Val Glu Gly Val Ala Leu Lys Ser Gly Arg 345 350 355 gca tac tcc gtt cag tac cac cca gag gcc gct gcc ggc cca aat gat 1398 Ala Tyr Ser Val Gln Tyr His Pro Glu Ala Ala Ala Gly Pro Asn Asp 360 365 370 gca agc ccc ctg ttt gac cag ttt gtt gag ctg atg gat gca gac gct 1446 Ala Ser Pro Leu Phe Asp Gln Phe Val Glu Leu Met Asp Ala Asp Ala 375 380 385 cag aag aaa ggc gca taaataacat gccaaagcgt tcagatatta accacgtcct 1501 Gln Lys Lys Gly Ala 390 cgtcatcggt tccggcccca tcgtcattgg ccaggcatgt gaattcgact actccggcac 1561 ccaggcttgc cgcgtgctga aggaagaggg actgcgcgtc accctcatca actccaaccc 1621 agcaacgatc atgaccgacc cagaaatggc tgaccacacc tacgtggagc caatcgagcc 1681 ggaatacatc gacaagattt tcgctaagga gatcgagcag ggccacccaa tcgacgccgt 1741 cctggcaacc cttg gtg gcc aga ctg cac tta acg cag cta tcc agc tgg 1791 Val Ala Arg Leu His Leu Thr Gln Leu Ser Ser Trp 395 400 405 atc gcc ttc ggc atc ctg gaa aag tac ggc gtt gaa ctc atc ggt gca 1839 Ile Ala Phe Gly Ile Leu Glu Lys Tyr Gly Val Glu Leu Ile Gly Ala 410 415 420 gac atc gat gcc att gag cgc ggc gaa gat cgc cag aag ttc aag gat 1887 Asp Ile Asp Ala Ile Glu Arg Gly Glu Asp Arg Gln Lys Phe Lys Asp 425 430 435 att gtc acc acc atc ggt ggc gaa tcc gcg cgt tcc cgc gtc tgc cac 1935 Ile Val Thr Thr Ile Gly Gly Glu Ser Ala Arg Ser Arg Val Cys His 440 445 450 aac atg gac gaa gtc cat gag act gtc gca gaa ctt ggc ctt cca gta 1983 Asn Met Asp Glu Val His Glu Thr Val Ala Glu Leu Gly Leu Pro Val 455 460 465 gtc gtg cgt cca tcc ttc act atg ggt ggc ctg ggc tcc ggt ctt gca 2031 Val Val Arg Pro Ser Phe Thr Met Gly Gly Leu Gly Ser Gly Leu Ala 470 475 480 485 tac aac acc gaa gac ctt gag cgc atc gca ggt ggc gga ctt gct gca 2079 Tyr Asn Thr Glu Asp Leu Glu Arg Ile Ala Gly Gly Gly Leu Ala Ala 490 495 500 tct cct gaa gca aac gtc ttg atc gaa gaa tcc atc ctt ggt tgg aag 2127 Ser Pro Glu Ala Asn Val Leu Ile Glu Glu Ser Ile Leu Gly Trp Lys 505 510 515 gaa ttc gag ctc gag ctc atg cgc gat acc gca gac aac gtt gtg gtt 2175 Glu Phe Glu Leu Glu Leu Met Arg Asp Thr Ala Asp Asn Val Val Val 520 525 530 atc tgc tcc att gaa aac gtc gac gca ctg ggc gtg cac acc ggc gac 2223 Ile Cys Ser Ile Glu Asn Val Asp Ala Leu Gly Val His Thr Gly Asp 535 540 545 tct gtc acc gtg gca cct gcc ctg acc ctg act gac cgt gaa ttc cag 2271 Ser Val Thr Val Ala Pro Ala Leu Thr Leu Thr Asp Arg Glu Phe Gln 550 555 560 565 aag atg cgc gat cag ggt atc gcc atc atc cgc gag gtc ggc gtg gac 2319 Lys Met Arg Asp Gln Gly Ile Ala Ile Ile Arg Glu Val Gly Val Asp 570 575 580 acc ggt gga tgt aac atc cag ttc gct atc aac cca gtt gat ggc cgc 2367 Thr Gly Gly Cys Asn Ile Gln Phe Ala Ile Asn Pro Val Asp Gly Arg 585 590 595 atc atc acc att gag atg aac cca cgt gtg tct cgt tcc tcc gcg ctg 2415 Ile Ile Thr Ile Glu Met Asn Pro Arg Val Ser Arg Ser Ser Ala Leu 600 605 610 gca tcc aag gca acg ggc ttc cca att gcc aag atg gct gcc aag ctg 2463 Ala Ser Lys Ala Thr Gly Phe Pro Ile Ala Lys Met Ala Ala Lys Leu 615 620 625 gct atc gga tac acc ctg gat gag atc acc aac gac atc act ggt gaa 2511 Ala Ile Gly Tyr Thr Leu Asp Glu Ile Thr Asn Asp Ile Thr Gly Glu 630 635 640 645 acc cca gct gcg ttt gag ccc acc atc gac tac gtc gtg gtc aag gcc 2559 Thr Pro Ala Ala Phe Glu Pro Thr Ile Asp Tyr Val Val Val Lys Ala 650 655 660 cca cgc ttt gct ttc gag aag ttt gtc ggc gct gat gac act ttg acc 2607 Pro Arg Phe Ala Phe Glu Lys Phe Val Gly Ala Asp Asp Thr Leu Thr 665 670 675 acc acc atg aag tcc gtc ggt gag gtc atg tcc ctg ggc cgt aac tac 2655 Thr Thr Met Lys Ser Val Gly Glu Val Met Ser Leu Gly Arg Asn Tyr 680 685 690 att gca gca ctg aac aag gca ctg cgt tcc ctg gaa acc aag cag cag 2703 Ile Ala Ala Leu Asn Lys Ala Leu Arg Ser Leu Glu Thr Lys Gln Gln 695 700 705 ggt ttc tgg acc aag cct gat gag ttc ttc gca ggg gag cgc gct acc 2751 Gly Phe Trp Thr Lys Pro Asp Glu Phe Phe Ala Gly Glu Arg Ala Thr 710 715 720 725 gat aag gca gct gtt ctg gaa gat ctc aag cgc cca acc gaa ggc cgc 2799 Asp Lys Ala Ala Val Leu Glu Asp Leu Lys Arg Pro Thr Glu Gly Arg 730 735 740 ctc tac gac gtt gag ctg gca atg cgc ctt ggc gca agc gtg gaa gaa 2847 Leu Tyr Asp Val Glu Leu Ala Met Arg Leu Gly Ala Ser Val Glu Glu 745 750 755 ctc tac gaa gca tct tct att gat cct tgg ttc ctc gcc gag ctt gaa 2895 Leu Tyr Glu Ala Ser Ser Ile Asp Pro Trp Phe Leu Ala Glu Leu Glu 760 765 770 gct ctc gtg cag ttc cgc cag aag ctc gtt gac gca cca ttc ctc aac 2943 Ala Leu Val Gln Phe Arg Gln Lys Leu Val Asp Ala Pro Phe Leu Asn 775 780 785 gaa gat ctc ctg cgc gaa gca aag ttc atg ggt ctg tcc gac ctg cag 2991 Glu Asp Leu Leu Arg Glu Ala Lys Phe Met Gly Leu Ser Asp Leu Gln 790 795 800 805 atc gca gcc ctt cgc cca gag ttc gct ggc gaa gac ggc gta cgc acc 3039 Ile Ala Ala Leu Arg Pro Glu Phe Ala Gly Glu Asp Gly Val Arg Thr 810 815 820 ttg cgt ctg tcc cta ggc atc cgc cca gta ttc aag act gtg gat acc 3087 Leu Arg Leu Ser Leu Gly Ile Arg Pro Val Phe Lys Thr Val Asp Thr 825 830 835 tgt gca gca gag ttt gaa gct aag act ccg tac cac tac tcc gca tac 3135 Cys Ala Ala Glu Phe Glu Ala Lys Thr Pro Tyr His Tyr Ser Ala Tyr 840 845 850 gag ctg gat cca gca gct gag tct gag gtc gca cca cag act gag cgt 3183 Glu Leu Asp Pro Ala Ala Glu Ser Glu Val Ala Pro Gln Thr Glu Arg 855 860 865 gaa aag gtc ctg atc ttg ggc tcc ggt cca aac cgc atc ggc cag ggc 3231 Glu Lys Val Leu Ile Leu Gly Ser Gly Pro Asn Arg Ile Gly Gln Gly 870 875 880 885 atc gag ttc gac tat tcc tgt gtt cac gca gct ctt gag ctc tcc cgc 3279 Ile Glu Phe Asp Tyr Ser Cys Val His Ala Ala Leu Glu Leu Ser Arg 890 895 900 gtc ggc tac gaa act gtc atg gtc aac tgc aac cca gag acc gtg tcc 3327 Val Gly Tyr Glu Thr Val Met Val Asn Cys Asn Pro Glu Thr Val Ser 905 910 915 acc gac tac gac acc gct gac cgc ctg tac ttc gag cca ctg acc ttc 3375 Thr Asp Tyr Asp Thr Ala Asp Arg Leu Tyr Phe Glu Pro Leu Thr Phe 920 925 930 gaa gac gtc atg gag gtc tac cac gct gag gcg cag tcc ggc acc gtc 3423 Glu Asp Val Met Glu Val Tyr His Ala Glu Ala Gln Ser Gly Thr Val 935 940 945 gca ggt gtt atc gtc cag ctt ggt ggc cag act cct ctg ggc ttg gca 3471 Ala Gly Val Ile Val Gln Leu Gly Gly Gln Thr Pro Leu Gly Leu Ala 950 955 960 965 gat cgt ttg aag aag gct ggc gtc cct gtc att ggt acc tcc cca gag 3519 Asp Arg Leu Lys Lys Ala Gly Val Pro Val Ile Gly Thr Ser Pro Glu 970 975 980 gca atc gac atg gct gag gac cgt ggc gag ttc ggt gca ctg ctg aac 3567 Ala Ile Asp Met Ala Glu Asp Arg Gly Glu Phe Gly Ala Leu Leu Asn 985 990 995 cgc gag cag ctt cct gct cca gca ttc ggc acc gca acc tct ttc 3612 Arg Glu Gln Leu Pro Ala Pro Ala Phe Gly Thr Ala Thr Ser Phe 1000 1005 1010 gaa gag gct cgc aca gta gcc gat gag atc agc tac cca gtg ctg 3657 Glu Glu Ala Arg Thr Val Ala Asp Glu Ile Ser Tyr Pro Val Leu 1015 1020 1025 gtt cgc cct tcc tac gtc ttg ggt ggc cgt ggc atg gag att gtc 3702 Val Arg Pro Ser Tyr Val Leu Gly Gly Arg Gly Met Glu Ile Val 1030 1035 1040 tac gat gag gct tcc ctc gag gat tac atc aac cgc gca act gag 3747 Tyr Asp Glu Ala Ser Leu Glu Asp Tyr Ile Asn Arg Ala Thr Glu 1045 1050 1055 ttg tct tct gac cac cca gtg ctg gtt gac cgc ttc ctg gac aac 3792 Leu Ser Ser Asp His Pro Val Leu Val Asp Arg Phe Leu Asp Asn 1060 1065 1070 gct att gag atc gac gtc gac gca ctg tgc gac ggc gac gaa gtc 3837 Ala Ile Glu Ile Asp Val Asp Ala Leu Cys Asp Gly Asp Glu Val 1075 1080 1085 tac ctg gcg ggc gtc atg gaa cac atc gag gaa gcc ggc att cac 3882 Tyr Leu Ala Gly Val Met Glu His Ile Glu Glu Ala Gly Ile His 1090 1095 1100 tcc ggt gac tcc gca tgt gca ctt cct cca atg act ttg ggc gca 3927 Ser Gly Asp Ser Ala Cys Ala Leu Pro Pro Met Thr Leu Gly Ala 1105 1110 1115 cag gac atc gag aag gtc cgc gaa gca acc aag aag ctg gct ctg 3972 Gln Asp Ile Glu Lys Val Arg Glu Ala Thr Lys Lys Leu Ala Leu 1120 1125 1130 ggc atc ggc gta cag ggc ctg atg aac gtc cag tac gca ctc aag 4017 Gly Ile Gly Val Gln Gly Leu Met Asn Val Gln Tyr Ala Leu Lys 1135 1140 1145 gac gac atc ctc tac gtc atc gag gca aac cca cgt gca tcc cgc 4062 Asp Asp Ile Leu Tyr Val Ile Glu Ala Asn Pro Arg Ala Ser Arg 1150 1155 1160 acc gtg ccg ttc gtc tcc aag gca acg ggc gtc aac ctg gcc aag 4107 Thr Val Pro Phe Val Ser Lys Ala Thr Gly Val Asn Leu Ala Lys 1165 1170 1175 gca gca tcc cgt atc gca gtg ggc gcc acc atc aag gat ctc caa 4152 Ala Ala Ser Arg Ile Ala Val Gly Ala Thr Ile Lys Asp Leu Gln 1180 1185 1190 gat gag ggc atg att cct acc gag tac gac ggc ggc tcc ttg cca 4197 Asp Glu Gly Met Ile Pro Thr Glu Tyr Asp Gly Gly Ser Leu Pro 1195 1200 1205 ctg gac gct cca atc gct gtg aag gaa gca gtg ttg ccg ttc aac 4242 Leu Asp Ala Pro Ile Ala Val Lys Glu Ala Val Leu Pro Phe Asn 1210 1215 1220 cgc ttc cgt cgc cca gat gga aag acc ctg gac acc ctg ctt tcc 4287 Arg Phe Arg Arg Pro Asp Gly Lys Thr Leu Asp Thr Leu Leu Ser 1225 1230 1235 cca gag atg aag tcc act ggc gag gtc atg ggc ttg gcc aac aac 4332 Pro Glu Met Lys Ser Thr Gly Glu Val Met Gly Leu Ala Asn Asn 1240 1245 1250 ttc ggc gct gca tat gca aag gct gaa gct ggc gcg ttt ggt gca 4377 Phe Gly Ala Ala Tyr Ala Lys Ala Glu Ala Gly Ala Phe Gly Ala 1255 1260 1265 ttg cca acc gaa ggc acc gtc ttc gtg acc gtg gct aac cgc gac 4422 Leu Pro Thr Glu Gly Thr Val Phe Val Thr Val Ala Asn Arg Asp 1270 1275 1280 aag cgc acc ctg atc ctg cca atc cag cgc ctg gcg tcg atg ggc 4467 Lys Arg Thr Leu Ile Leu Pro Ile Gln Arg Leu Ala Ser Met Gly 1285 1290 1295 tac aag atc ctc gcc acc gaa ggc acc gca ggc atg ctg cgc cgc 4512 Tyr Lys Ile Leu Ala Thr Glu Gly Thr Ala Gly Met Leu Arg Arg 1300 1305 1310 aac ggc att gat tgt gaa gtt gtg ctc aag gct tcc gac atc cgc 4557 Asn Gly Ile Asp Cys Glu Val Val Leu Lys Ala Ser Asp Ile Arg 1315 1320 1325 gaa ggt gta gag ggc aag tcc atc gtg gat cgt atc cgc gaa ggc 4602 Glu Gly Val Glu Gly Lys Ser Ile Val Asp Arg Ile Arg Glu Gly 1330 1335 1340 gaa gtt gac ctc atc ctc aac acc cca gct ggt tct gct ggc gct 4647 Glu Val Asp Leu Ile Leu Asn Thr Pro Ala Gly Ser Ala Gly Ala 1345 1350 1355 cgc cac gat ggc tac gat atc cgc gca gca gca gtg acc gtg ggt 4692 Arg His Asp Gly Tyr Asp Ile Arg Ala Ala Ala Val Thr Val Gly 1360 1365 1370 gtt cca ctg atc acc act gtc cag ggt gtc acc gca gct gtc cag 4737 Val Pro Leu Ile Thr Thr Val Gln Gly Val Thr Ala Ala Val Gln 1375 1380 1385 ggc att gag gcc ctg cgt gag ggc gtt gtc agc gtc cgc gcg ctg 4782 Gly Ile Glu Ala Leu Arg Glu Gly Val Val Ser Val Arg Ala Leu 1390 1395 1400 cag gaa ctc gac cac gca gtc aag gct taagccctat gacattcggc 4829 Gln Glu Leu Asp His Ala Val Lys Ala 1405 1410 gagaagctt 4838 <210> SEQ ID NO 2 <211> LENGTH: 393 <212> TYPE: PRT <213> ORGANISM: Brevibacterium lactofermentum <400> SEQUENCE: 2 Val Ser Lys Asp Thr Thr Thr Tyr Gln Gly Val Thr Glu Ile Gly Ser 1 5 10 15 Val Pro Ala Tyr Leu Val Leu Ala Asp Gly Arg Thr Phe Thr Gly Phe 20 25 30 Gly Phe Gly Ala Ile Gly Thr Thr Leu Gly Glu Ala Val Phe Thr Thr 35 40 45 Ala Met Thr Gly Tyr Gln Glu Thr Met Thr Asp Pro Ser Tyr His Arg 50 55 60 Gln Ile Val Val Ala Thr Ala Pro Gln Ile Gly Asn Thr Gly Trp Asn 65 70 75 80 Asp Glu Asp Asn Glu Ser Arg Asp Gly Lys Ile Trp Val Ala Gly Leu 85 90 95 Val Ile Arg Asp Leu Ala Ala Arg Val Ser Asn Trp Arg Ala Thr Thr 100 105 110 Ser Leu Gln Gln Glu Met Ala Asp Gln Gly Ile Val Gly Ile Gly Gly 115 120 125 Ile Asp Thr Arg Ala Leu Val Arg His Leu Arg Asn Glu Gly Ser Ile 130 135 140 Ala Ala Gly Ile Phe Ser Gly Ala Asp Ala Gln Arg Pro Val Glu Glu 145 150 155 160 Leu Val Glu Ile Val Lys Asn Gln Pro Ala Met Thr Gly Ala Asn Leu 165 170 175 Ser Val Glu Val Ser Ala Asp Glu Thr Tyr Val Ile Glu Ala Glu Gly 180 185 190 Glu Glu Arg His Thr Val Val Ala Tyr Asp Leu Gly Ile Lys Gln Asn 195 200 205 Thr Pro Arg Arg Phe Ser Ala Arg Gly Val Arg Thr Val Ile Val Pro 210 215 220 Ala Glu Thr Pro Leu Glu Asp Ile Lys Gln Tyr Asn Pro Ser Gly Val 225 230 235 240 Phe Ile Ser Asn Gly Pro Gly Asp Pro Ala Ala Ala Asp Val Met Val 245 250 255 Asp Ile Val Arg Glu Val Leu Glu Ala Asp Ile Pro Phe Phe Gly Ile 260 265 270 Cys Phe Gly Asn Gln Ile Leu Gly Arg Ala Phe Gly Met Glu Thr Tyr 275 280 285 Lys Leu Lys Phe Gly His Arg Gly Ile Asn Val Pro Val Lys Asn His 290 295 300 Ile Thr Gly Lys Ile Asp Ile Thr Ala Gln Asn His Gly Phe Ala Leu 305 310 315 320 Lys Gly Glu Ala Gly Gln Glu Phe Glu Thr Asp Phe Gly Thr Ala Ile 325 330 335 Val Thr His Thr Cys Leu Asn Asp Gly Val Val Glu Gly Val Ala Leu 340 345 350 Lys Ser Gly Arg Ala Tyr Ser Val Gln Tyr His Pro Glu Ala Ala Ala 355 360 365 Gly Pro Asn Asp Ala Ser Pro Leu Phe Asp Gln Phe Val Glu Leu Met 370 375 380 Asp Ala Asp Ala Gln Lys Lys Gly Ala 385 390 <210> SEQ ID NO 3 <211> LENGTH: 1018 <212> TYPE: PRT <213> ORGANISM: Brevibacterium lactofermentum <400> SEQUENCE: 3 Val Ala Arg Leu His Leu Thr Gln Leu Ser Ser Trp Ile Ala Phe Gly 1 5 10 15 Ile Leu Glu Lys Tyr Gly Val Glu Leu Ile Gly Ala Asp Ile Asp Ala 20 25 30 Ile Glu Arg Gly Glu Asp Arg Gln Lys Phe Lys Asp Ile Val Thr Thr 35 40 45 Ile Gly Gly Glu Ser Ala Arg Ser Arg Val Cys His Asn Met Asp Glu 50 55 60 Val His Glu Thr Val Ala Glu Leu Gly Leu Pro Val Val Val Arg Pro 65 70 75 80 Ser Phe Thr Met Gly Gly Leu Gly Ser Gly Leu Ala Tyr Asn Thr Glu 85 90 95 Asp Leu Glu Arg Ile Ala Gly Gly Gly Leu Ala Ala Ser Pro Glu Ala 100 105 110 Asn Val Leu Ile Glu Glu Ser Ile Leu Gly Trp Lys Glu Phe Glu Leu 115 120 125 Glu Leu Met Arg Asp Thr Ala Asp Asn Val Val Val Ile Cys Ser Ile 130 135 140 Glu Asn Val Asp Ala Leu Gly Val His Thr Gly Asp Ser Val Thr Val 145 150 155 160 Ala Pro Ala Leu Thr Leu Thr Asp Arg Glu Phe Gln Lys Met Arg Asp 165 170 175 Gln Gly Ile Ala Ile Ile Arg Glu Val Gly Val Asp Thr Gly Gly Cys 180 185 190 Asn Ile Gln Phe Ala Ile Asn Pro Val Asp Gly Arg Ile Ile Thr Ile 195 200 205 Glu Met Asn Pro Arg Val Ser Arg Ser Ser Ala Leu Ala Ser Lys Ala 210 215 220 Thr Gly Phe Pro Ile Ala Lys Met Ala Ala Lys Leu Ala Ile Gly Tyr 225 230 235 240 Thr Leu Asp Glu Ile Thr Asn Asp Ile Thr Gly Glu Thr Pro Ala Ala 245 250 255 Phe Glu Pro Thr Ile Asp Tyr Val Val Val Lys Ala Pro Arg Phe Ala 260 265 270 Phe Glu Lys Phe Val Gly Ala Asp Asp Thr Leu Thr Thr Thr Met Lys 275 280 285 Ser Val Gly Glu Val Met Ser Leu Gly Arg Asn Tyr Ile Ala Ala Leu 290 295 300 Asn Lys Ala Leu Arg Ser Leu Glu Thr Lys Gln Gln Gly Phe Trp Thr 305 310 315 320 Lys Pro Asp Glu Phe Phe Ala Gly Glu Arg Ala Thr Asp Lys Ala Ala 325 330 335 Val Leu Glu Asp Leu Lys Arg Pro Thr Glu Gly Arg Leu Tyr Asp Val 340 345 350 Glu Leu Ala Met Arg Leu Gly Ala Ser Val Glu Glu Leu Tyr Glu Ala 355 360 365 Ser Ser Ile Asp Pro Trp Phe Leu Ala Glu Leu Glu Ala Leu Val Gln 370 375 380 Phe Arg Gln Lys Leu Val Asp Ala Pro Phe Leu Asn Glu Asp Leu Leu 385 390 395 400 Arg Glu Ala Lys Phe Met Gly Leu Ser Asp Leu Gln Ile Ala Ala Leu 405 410 415 Arg Pro Glu Phe Ala Gly Glu Asp Gly Val Arg Thr Leu Arg Leu Ser 420 425 430 Leu Gly Ile Arg Pro Val Phe Lys Thr Val Asp Thr Cys Ala Ala Glu 435 440 445 Phe Glu Ala Lys Thr Pro Tyr His Tyr Ser Ala Tyr Glu Leu Asp Pro 450 455 460 Ala Ala Glu Ser Glu Val Ala Pro Gln Thr Glu Arg Glu Lys Val Leu 465 470 475 480 Ile Leu Gly Ser Gly Pro Asn Arg Ile Gly Gln Gly Ile Glu Phe Asp 485 490 495 Tyr Ser Cys Val His Ala Ala Leu Glu Leu Ser Arg Val Gly Tyr Glu 500 505 510 Thr Val Met Val Asn Cys Asn Pro Glu Thr Val Ser Thr Asp Tyr Asp 515 520 525 Thr Ala Asp Arg Leu Tyr Phe Glu Pro Leu Thr Phe Glu Asp Val Met 530 535 540 Glu Val Tyr His Ala Glu Ala Gln Ser Gly Thr Val Ala Gly Val Ile 545 550 555 560 Val Gln Leu Gly Gly Gln Thr Pro Leu Gly Leu Ala Asp Arg Leu Lys 565 570 575 Lys Ala Gly Val Pro Val Ile Gly Thr Ser Pro Glu Ala Ile Asp Met 580 585 590 Ala Glu Asp Arg Gly Glu Phe Gly Ala Leu Leu Asn Arg Glu Gln Leu 595 600 605 Pro Ala Pro Ala Phe Gly Thr Ala Thr Ser Phe Glu Glu Ala Arg Thr 610 615 620 Val Ala Asp Glu Ile Ser Tyr Pro Val Leu Val Arg Pro Ser Tyr Val 625 630 635 640 Leu Gly Gly Arg Gly Met Glu Ile Val Tyr Asp Glu Ala Ser Leu Glu 645 650 655 Asp Tyr Ile Asn Arg Ala Thr Glu Leu Ser Ser Asp His Pro Val Leu 660 665 670 Val Asp Arg Phe Leu Asp Asn Ala Ile Glu Ile Asp Val Asp Ala Leu 675 680 685 Cys Asp Gly Asp Glu Val Tyr Leu Ala Gly Val Met Glu His Ile Glu 690 695 700 Glu Ala Gly Ile His Ser Gly Asp Ser Ala Cys Ala Leu Pro Pro Met 705 710 715 720 Thr Leu Gly Ala Gln Asp Ile Glu Lys Val Arg Glu Ala Thr Lys Lys 725 730 735 Leu Ala Leu Gly Ile Gly Val Gln Gly Leu Met Asn Val Gln Tyr Ala 740 745 750 Leu Lys Asp Asp Ile Leu Tyr Val Ile Glu Ala Asn Pro Arg Ala Ser 755 760 765 Arg Thr Val Pro Phe Val Ser Lys Ala Thr Gly Val Asn Leu Ala Lys 770 775 780 Ala Ala Ser Arg Ile Ala Val Gly Ala Thr Ile Lys Asp Leu Gln Asp 785 790 795 800 Glu Gly Met Ile Pro Thr Glu Tyr Asp Gly Gly Ser Leu Pro Leu Asp 805 810 815 Ala Pro Ile Ala Val Lys Glu Ala Val Leu Pro Phe Asn Arg Phe Arg 820 825 830 Arg Pro Asp Gly Lys Thr Leu Asp Thr Leu Leu Ser Pro Glu Met Lys 835 840 845 Ser Thr Gly Glu Val Met Gly Leu Ala Asn Asn Phe Gly Ala Ala Tyr 850 855 860 Ala Lys Ala Glu Ala Gly Ala Phe Gly Ala Leu Pro Thr Glu Gly Thr 865 870 875 880 Val Phe Val Thr Val Ala Asn Arg Asp Lys Arg Thr Leu Ile Leu Pro 885 890 895 Ile Gln Arg Leu Ala Ser Met Gly Tyr Lys Ile Leu Ala Thr Glu Gly 900 905 910 Thr Ala Gly Met Leu Arg Arg Asn Gly Ile Asp Cys Glu Val Val Leu 915 920 925 Lys Ala Ser Asp Ile Arg Glu Gly Val Glu Gly Lys Ser Ile Val Asp 930 935 940 Arg Ile Arg Glu Gly Glu Val Asp Leu Ile Leu Asn Thr Pro Ala Gly 945 950 955 960 Ser Ala Gly Ala Arg His Asp Gly Tyr Asp Ile Arg Ala Ala Ala Val 965 970 975 Thr Val Gly Val Pro Leu Ile Thr Thr Val Gln Gly Val Thr Ala Ala 980 985 990 Val Gln Gly Ile Glu Ala Leu Arg Glu Gly Val Val Ser Val Arg Ala 995 1000 1005 Leu Gln Glu Leu Asp His Ala Val Lys Ala 1010 1015 <210> SEQ ID NO 4 <211> LENGTH: 32 <212> TYPE: DNA <213> ORGANISM: Artificial/Unknown <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: ()..() <223> OTHER INFORMATION: synthetic DNA <400> SEQUENCE: 4 cccgttaact gcttgaaacc caggacaata ac 32 <210> SEQ ID NO 5 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial/Unknown <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: ()..() <223> OTHER INFORMATION: synthetic DNA <400> SEQUENCE: 5 cccgttaaca tgtacttcag aaaagattag 30 <210> SEQ ID NO 6 <211> LENGTH: 26 <212> TYPE: DNA <213> ORGANISM: Artificial/Unknown <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: ()..() <223> OTHER INFORMATION: synthetic DNA <400> SEQUENCE: 6 gatatctacg tgccgatcaa cgtctc 26 <210> SEQ ID NO 7 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial/Unknown <220> FEATURE: <221> NAME/KEY: misc_feature <222> LOCATION: ()..() <223> OTHER INFORMATION: synthetic DNA <400> SEQUENCE: 7 aggccttttt ttaaggcagt tattg 25
Claims (14)
1. A DNA fragment which encodes a polypeptide defined in the following (A) or (B):
(A) a polypeptide which has an amino acid sequence comprises at least the amino acid numbers 50 to 393 of the amino acid sequence of SEQ ID NO: 2,
(B) a polypeptide which has an amino acid sequence comprises at least the amino acid numbers 50 to 393 of the amino acid sequence of SEQ ID NO: 2 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a large subunit of carbamoyl-phosphate synthetase comprising the amino acid sequence of SEQ ID NO: 3.
2. A DNA fragment which encodes a polypeptide defined in the following (C) or (D):
(C) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3,
(D) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a small subunit of carbamoyl-phosphate synthetase having an amino acid sequence comprises at least the amino acid numbers 50 to 393 of the amino acid sequence of SEQ ID NO: 2.
3. A DNA fragment encoding a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity.
4. A DNA fragment which encodes a polypeptide defined in the following (a) or (b), and a polypeptide defined in the following (c) or (d):
(a) a polypeptide which has an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2,
(b) a polypeptide which has an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a large subunit of carbamoyl-phosphate synthetase comprising the amino acid sequence of SEQ ID NO: 3,
(c) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3,
(d) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a small subunit of carbamoyl-phosphate synthetase having an amino acid sequence comprising the amino acid numbers 50 to 393 in SEQ ID NO: 2.
5. The DNA fragment according to claim 1 , which has a nucleotide sequence comprising at least the nucleotide numbers 430 to 1461 in the nucleotide sequence of SEQ ID NO: 1.
6. The DNA fragment according to claim 2 , which has a nucleotide sequence comprising at least the nucleotide numbers 1756 to 4809 in the nucleotide sequence of SEQ ID NO: 1.
7. The DNA fragment according to claim 3 , which has a nucleotide sequence comprising at least the nucleotide numbers 430 to 4809 in the nucleotide sequence of SEQ ID NO: 1.
8. A protein which comprises a polypeptide defined in the following (a) or (b), and a polypeptide defined in the following (c) or (d):
(a) a polypeptide which has an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2,
(b) a polypeptide which has an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a large subunit of carbamoyl-phosphate synthetase comprising the amino acid sequence of SEQ ID NO: 3,
(c) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3,
(d) a polypeptide which comprises the amino acid sequence of SEQ ID NO: 3 including substitution, deletion, insertion, addition, or inversion of one or several amino acids, and can constitute a protein having a carbamoyl-phosphate synthetase activity with a small subunit of carbamoyl-phosphate synthetase having an amino acid sequence comprising at least the amino acid numbers 50 to 393 in SEQ ID NO: 2.
9. A coryneform bacterium which is transformed with a DNA fragment according to any one of claims 1 to 7 .
10. A microorganism which has enhanced intracellular carbamoyl-phosphate synthetase activity, and has L-arginine productivity.
11. The microorganism according to claim 10 , wherein the enhanced intracellular carbamoyl-phosphate synthetase activity is obtained by increasing copy number of DNA encoding carbamoyl-phosphate synthetase of the microorganism, or by modifying an expression regulation sequence so that expression of the gene encoding carbamoyl-phosphate synthetase in the cell should be enhanced.
12. The microorganism according to claim 11 , wherein the DNA is a DNA fragment according to any one of claims 1 to 7 .
13. The microorganism according to claim 12 , which is a coryneform bacterium.
14. A method for producing of L-arginine, comprising the steps of culturing a coryneform bacterium according to any one of claims 10 to 13 in a medium to produce and accumulate L-arginine in the medium, and collecting the L-arginine from the medium.
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/629,616 US6255086B1 (en) | 1999-02-01 | 2000-07-31 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
US09/836,470 US6881566B2 (en) | 1999-02-01 | 2001-04-18 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
US10/284,334 US6908754B2 (en) | 1999-02-01 | 2002-10-31 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
US10/284,138 US20030082774A1 (en) | 1999-02-01 | 2002-10-31 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
US11/011,701 US7297521B2 (en) | 1999-02-01 | 2004-12-15 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2414999 | 1999-02-01 | ||
JP11-24149 | 1999-02-01 |
Related Child Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/629,616 Continuation-In-Part US6255086B1 (en) | 1999-02-01 | 2000-07-31 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
US10/284,138 Division US20030082774A1 (en) | 1999-02-01 | 2002-10-31 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
US10/284,334 Division US6908754B2 (en) | 1999-02-01 | 2002-10-31 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
Publications (1)
Publication Number | Publication Date |
---|---|
US20030124685A1 true US20030124685A1 (en) | 2003-07-03 |
Family
ID=12130290
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/494,359 Abandoned US20030124685A1 (en) | 1999-02-01 | 2000-01-31 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing l-arginine |
US10/284,334 Expired - Fee Related US6908754B2 (en) | 1999-02-01 | 2002-10-31 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
US10/284,138 Abandoned US20030082774A1 (en) | 1999-02-01 | 2002-10-31 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
US11/011,701 Expired - Lifetime US7297521B2 (en) | 1999-02-01 | 2004-12-15 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
Family Applications After (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/284,334 Expired - Fee Related US6908754B2 (en) | 1999-02-01 | 2002-10-31 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
US10/284,138 Abandoned US20030082774A1 (en) | 1999-02-01 | 2002-10-31 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
US11/011,701 Expired - Lifetime US7297521B2 (en) | 1999-02-01 | 2004-12-15 | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
Country Status (3)
Country | Link |
---|---|
US (4) | US20030124685A1 (en) |
EP (1) | EP1026247A1 (en) |
CN (1) | CN1204255C (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100062497A1 (en) * | 2007-03-14 | 2010-03-11 | Seizaburo Shiraga | Microorganism producing an amino acid of the l-glutamic acid family and a method for producing the amino acid |
CN107893089A (en) * | 2016-10-03 | 2018-04-10 | 味之素株式会社 | Method for producing L amino acid |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6703971B2 (en) * | 2001-02-21 | 2004-03-09 | Sirf Technologies, Inc. | Mode determination for mobile GPS terminals |
CN1316015C (en) * | 2001-08-03 | 2007-05-16 | 味之素株式会社 | New-mutant carbamyl-phosphate synthesized enzyme and method for producing compound derivated from carbamyl-phosphate |
CN101250541B (en) * | 2008-04-08 | 2010-06-02 | 上海师范大学 | Salvia miltiorrhiza 1-deoxyxylulose-5-phosphate synthase gene 1 and its encoded protein and application |
DE102012016716A1 (en) | 2012-08-22 | 2014-02-27 | Forschungszentrum Jülich GmbH | A process for the preparation of vectors comprising a gene coding for an enzyme which has been reduced or eliminated in its feedback inhibition and the use thereof for the production of amino acids and nucleotides |
CN110964683B (en) * | 2019-12-02 | 2021-08-13 | 天津科技大学 | Genetically engineered bacteria producing L-arginine and its construction method and application |
CN114438000B (en) * | 2020-11-05 | 2024-02-27 | 万华化学(四川)有限公司 | Pseudomonas aeruginosa strain and construction method and application thereof |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5730476A (en) | 1980-07-29 | 1982-02-18 | Pioneer Video Corp | Recording and reproduction system for video disk |
JPS5867699A (en) * | 1981-10-16 | 1983-04-22 | Ajinomoto Co Inc | Plasmid |
JPH0728749B2 (en) * | 1986-09-22 | 1995-04-05 | 協和醗酵工業株式会社 | Method for producing L-arginine |
US5807723A (en) * | 1987-03-02 | 1998-09-15 | Whitehead Institute For Biomedical Research | Homologously recombinant slow growing mycobacteria and uses therefor |
US6255086B1 (en) | 1999-02-01 | 2001-07-03 | Ajinomoto Co., Inc. | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine |
-
2000
- 2000-01-31 US US09/494,359 patent/US20030124685A1/en not_active Abandoned
- 2000-02-01 CN CNB001046977A patent/CN1204255C/en not_active Expired - Lifetime
- 2000-02-01 EP EP00101648A patent/EP1026247A1/en not_active Withdrawn
-
2002
- 2002-10-31 US US10/284,334 patent/US6908754B2/en not_active Expired - Fee Related
- 2002-10-31 US US10/284,138 patent/US20030082774A1/en not_active Abandoned
-
2004
- 2004-12-15 US US11/011,701 patent/US7297521B2/en not_active Expired - Lifetime
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100062497A1 (en) * | 2007-03-14 | 2010-03-11 | Seizaburo Shiraga | Microorganism producing an amino acid of the l-glutamic acid family and a method for producing the amino acid |
US8080396B2 (en) | 2007-03-14 | 2011-12-20 | Ajinomoto Co., Inc. | Microorganism producing an amino acid of the L-glutamic acid family and a method for producing the amino acid |
CN107893089A (en) * | 2016-10-03 | 2018-04-10 | 味之素株式会社 | Method for producing L amino acid |
Also Published As
Publication number | Publication date |
---|---|
US6908754B2 (en) | 2005-06-21 |
CN1204255C (en) | 2005-06-01 |
US20050095680A1 (en) | 2005-05-05 |
CN1270223A (en) | 2000-10-18 |
US20030082775A1 (en) | 2003-05-01 |
EP1026247A1 (en) | 2000-08-09 |
US7297521B2 (en) | 2007-11-20 |
US20030082774A1 (en) | 2003-05-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4560998B2 (en) | Method for producing L-glutamine by fermentation and L-glutamine producing bacteria | |
US5766925A (en) | Method of producing L-lysine | |
KR101186128B1 (en) | L-amino acid-producing bacterium and a method for producing L-amino acid | |
EP1010755B1 (en) | Method for producing L-Glutamic acid by fermentation | |
EP1460128B1 (en) | Method for producing L-arginine or L-lysine by fermentation | |
HU223764B1 (en) | Method for producing l-lysin | |
EP1424397B1 (en) | Method for producing L-glutamine and L-glutamine producing bacterium | |
EP1004671B1 (en) | Process for producing l-glutamic acid by fermentation method | |
US7252978B2 (en) | Method for producing L-arginine | |
US6908754B2 (en) | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine | |
EP1057893B1 (en) | Method for producing L-arginine | |
US6255086B1 (en) | Carbamoyl-phosphate synthetase gene of coryneform bacteria and method for producing L-arginine | |
EP0999267A1 (en) | Method for producing L-arginine | |
JP4178720B2 (en) | Method for producing L-arginine | |
JP4284840B2 (en) | Method for producing carbamoylphosphate synthetase gene and L-arginine of coryneform bacterium | |
US6593117B2 (en) | GMP synthetase and gene coding for the same | |
KR101687474B1 (en) | Microorganism of the genus corynebacterium with enhanced l-arginine productivity and method for producing l-arginine using the same | |
JP4576850B2 (en) | Method for producing L-arginine or L-lysine by fermentation | |
JP4239334B2 (en) | Method for producing L-glutamic acid by fermentation | |
JP2000287693A (en) | Carbamoyl phosphoric acid synthetase gene of coryneform bacterium and method for producing l- arginine |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: AJINOMOTO CO., INC., JAPAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KUWABARA, YOKO;HASHIGUCHI, KENICHI;NAKAMATSU, TSUYOSHI;AND OTHERS;REEL/FRAME:010929/0264 Effective date: 20000301 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE |