CN106967702A - 一种纤维素酶及其编码基因与应用 - Google Patents
一种纤维素酶及其编码基因与应用 Download PDFInfo
- Publication number
- CN106967702A CN106967702A CN201710283553.6A CN201710283553A CN106967702A CN 106967702 A CN106967702 A CN 106967702A CN 201710283553 A CN201710283553 A CN 201710283553A CN 106967702 A CN106967702 A CN 106967702A
- Authority
- CN
- China
- Prior art keywords
- cellulase
- cel6
- amino acid
- cracking
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108010059892 Cellulase Proteins 0.000 title claims abstract description 67
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 66
- 229940106157 cellulase Drugs 0.000 title claims abstract description 56
- 238000005336 cracking Methods 0.000 claims abstract description 36
- 125000000539 amino acid group Chemical group 0.000 claims abstract description 14
- 230000001105 regulatory effect Effects 0.000 claims abstract description 12
- 238000006467 substitution reaction Methods 0.000 claims abstract description 3
- 241000196324 Embryophyta Species 0.000 claims description 41
- 241000219194 Arabidopsis Species 0.000 claims description 32
- 230000014509 gene expression Effects 0.000 claims description 19
- 239000002773 nucleotide Substances 0.000 claims description 7
- 125000003729 nucleotide group Chemical group 0.000 claims description 7
- 102000040430 polynucleotide Human genes 0.000 claims description 6
- 108091033319 polynucleotide Proteins 0.000 claims description 6
- 239000002157 polynucleotide Substances 0.000 claims description 6
- 239000013598 vector Substances 0.000 claims description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 5
- 230000000694 effects Effects 0.000 claims description 5
- 150000001413 amino acids Chemical class 0.000 claims description 4
- 230000033228 biological regulation Effects 0.000 claims description 4
- 244000005700 microbiome Species 0.000 claims description 3
- 230000009261 transgenic effect Effects 0.000 claims description 3
- 241000209510 Liliopsida Species 0.000 claims 1
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 230000001276 controlling effect Effects 0.000 claims 1
- 241001233957 eudicotyledons Species 0.000 claims 1
- 238000009396 hybridization Methods 0.000 claims 1
- 238000004519 manufacturing process Methods 0.000 abstract description 6
- 230000003111 delayed effect Effects 0.000 abstract description 3
- 230000007613 environmental effect Effects 0.000 abstract description 2
- 230000009467 reduction Effects 0.000 abstract description 2
- 238000007792 addition Methods 0.000 abstract 1
- 230000037430 deletion Effects 0.000 abstract 1
- 238000012217 deletion Methods 0.000 abstract 1
- 235000013399 edible fruits Nutrition 0.000 description 46
- 230000023753 dehiscence Effects 0.000 description 34
- 108020004414 DNA Proteins 0.000 description 29
- 238000000034 method Methods 0.000 description 22
- 238000011144 upstream manufacturing Methods 0.000 description 18
- 238000011161 development Methods 0.000 description 16
- 230000018109 developmental process Effects 0.000 description 16
- 239000000463 material Substances 0.000 description 16
- 241000219195 Arabidopsis thaliana Species 0.000 description 14
- 238000006243 chemical reaction Methods 0.000 description 14
- 240000002791 Brassica napus Species 0.000 description 13
- 210000004027 cell Anatomy 0.000 description 13
- 238000003780 insertion Methods 0.000 description 11
- 230000037431 insertion Effects 0.000 description 11
- 108091023040 Transcription factor Proteins 0.000 description 10
- 102000040945 Transcription factor Human genes 0.000 description 10
- 230000002779 inactivation Effects 0.000 description 10
- 230000017260 vegetative to reproductive phase transition of meristem Effects 0.000 description 8
- 238000003753 real-time PCR Methods 0.000 description 7
- 235000011293 Brassica napus Nutrition 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 5
- 108010059820 Polygalacturonase Proteins 0.000 description 5
- 238000004090 dissolution Methods 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 238000000926 separation method Methods 0.000 description 5
- 238000004383 yellowing Methods 0.000 description 5
- 101710197649 Actin-8 Proteins 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000035772 mutation Effects 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 238000011529 RT qPCR Methods 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 229940088598 enzyme Drugs 0.000 description 3
- 108010093305 exopolygalacturonase Proteins 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 230000004028 fruit dehiscence Effects 0.000 description 3
- 108010002430 hemicellulase Proteins 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 230000005070 ripening Effects 0.000 description 3
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- 108020005544 Antisense RNA Proteins 0.000 description 2
- 101100108243 Arabidopsis thaliana ADPG2 gene Proteins 0.000 description 2
- 235000006008 Brassica napus var napus Nutrition 0.000 description 2
- 108091033409 CRISPR Proteins 0.000 description 2
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 2
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 2
- 238000012271 agricultural production Methods 0.000 description 2
- 101150038500 cas9 gene Proteins 0.000 description 2
- 210000002421 cell wall Anatomy 0.000 description 2
- 230000008618 cell wall macromolecule catabolic process Effects 0.000 description 2
- 239000003086 colorant Substances 0.000 description 2
- 239000003184 complementary RNA Substances 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000005014 ectopic expression Effects 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 230000005078 fruit development Effects 0.000 description 2
- 230000009368 gene silencing by RNA Effects 0.000 description 2
- 238000010362 genome editing Methods 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 229940059442 hemicellulase Drugs 0.000 description 2
- 238000002743 insertional mutagenesis Methods 0.000 description 2
- 238000002493 microarray Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 2
- 102000004169 proteins and genes Human genes 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000009758 senescence Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- 108700019978 Arabidopsis INDEHISCENT Proteins 0.000 description 1
- 101100438273 Arabidopsis thaliana CAN1 gene Proteins 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 1
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 1
- KAZKWIKPEPABOO-IHRRRGAJSA-N Asn-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N KAZKWIKPEPABOO-IHRRRGAJSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 1
- JOCQXVJCTCEFAZ-CIUDSAMLSA-N Asp-His-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O JOCQXVJCTCEFAZ-CIUDSAMLSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 235000011331 Brassica Nutrition 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 244000178993 Brassica juncea Species 0.000 description 1
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 description 1
- 240000003259 Brassica oleracea var. botrytis Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- CNAMJJOZGXPDHW-IHRRRGAJSA-N Cys-Pro-Phe Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O CNAMJJOZGXPDHW-IHRRRGAJSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- WKKKNGNJDGATNS-QEJZJMRPSA-N Cys-Trp-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKKKNGNJDGATNS-QEJZJMRPSA-N 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 1
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 1
- 101000779672 Homo sapiens Probable inactive allantoicase Proteins 0.000 description 1
- 101000617285 Homo sapiens Tyrosine-protein phosphatase non-receptor type 6 Proteins 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- XLTSAUGGDYRFLS-UMPQAUOISA-N Met-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCSC)N)O XLTSAUGGDYRFLS-UMPQAUOISA-N 0.000 description 1
- VOAKKHOIAFKOQZ-JYJNAYRXSA-N Met-Tyr-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 VOAKKHOIAFKOQZ-JYJNAYRXSA-N 0.000 description 1
- OOLVTRHJJBCJKB-IHRRRGAJSA-N Met-Tyr-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OOLVTRHJJBCJKB-IHRRRGAJSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- 101710089297 Polygalacturonase 1 Proteins 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- VPVHXWGPALPDGP-GUBZILKMSA-N Pro-Asn-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPVHXWGPALPDGP-GUBZILKMSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- 102100033794 Probable inactive allantoicase Human genes 0.000 description 1
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 1
- 108091030071 RNAI Proteins 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- TVPQRPNBYCRRLL-IHRRRGAJSA-N Ser-Phe-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O TVPQRPNBYCRRLL-IHRRRGAJSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 101100072621 Streptomyces griseus ind7 gene Proteins 0.000 description 1
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 1
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- VUSAEKOXGNEYNE-PBCZWWQYSA-N Thr-His-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VUSAEKOXGNEYNE-PBCZWWQYSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- PZSDPRBZINDEJV-HTUGSXCWSA-N Thr-Phe-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PZSDPRBZINDEJV-HTUGSXCWSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- 101710116403 Transcription factor IND Proteins 0.000 description 1
- IQGJAHMZWBTRIF-UBHSHLNASA-N Trp-Asp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IQGJAHMZWBTRIF-UBHSHLNASA-N 0.000 description 1
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 1
- XKDOQXAXKFQWQJ-SRVKXCTJSA-N Tyr-Cys-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O XKDOQXAXKFQWQJ-SRVKXCTJSA-N 0.000 description 1
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 1
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- 102100033019 Tyrosine-protein phosphatase non-receptor type 11 Human genes 0.000 description 1
- 101710116241 Tyrosine-protein phosphatase non-receptor type 11 Proteins 0.000 description 1
- 102100021657 Tyrosine-protein phosphatase non-receptor type 6 Human genes 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- 230000006578 abscission Effects 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 230000032683 aging Effects 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 210000003484 anatomy Anatomy 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000005422 blasting Methods 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 230000008124 floral development Effects 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 101150110946 gatC gene Proteins 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000003301 hydrolyzing effect Effects 0.000 description 1
- 239000007943 implant Substances 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 210000002826 placenta Anatomy 0.000 description 1
- 230000037039 plant physiology Effects 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 230000002028 premature Effects 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000037425 regulation of transcription Effects 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000007226 seed germination Effects 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2405—Glucanases
- C12N9/2434—Glucanases acting on beta-1,4-glucosidic bonds
- C12N9/2437—Cellulases (3.2.1.4; 3.2.1.74; 3.2.1.91; 3.2.1.150)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/6895—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for plants, fungi or algae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01004—Cellulase (3.2.1.4), i.e. endo-1,4-beta-glucanase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/13—Plant traits
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Botany (AREA)
- Mycology (AREA)
- Biomedical Technology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Immunology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明公开了一种纤维素酶及其编码基因与应用。该纤维素酶的氨基酸残基序列如(a)或(b)所示:(a)如SEQ ID No:1所示的氨基酸残基序列;(b)对如SEQ ID No:1所示的氨基酸残基序列经过1个或几个氨基酸残基的取代和/或缺失和/或添加且对植物角果开裂具有调控作用的氨基酸序列。本发明所述的纤维素酶基因被突变或被抑制表达后可显著推迟角果的开裂时间,同时降低植物角果的开裂率,从而减少由于环境因素导致相关作物角果提前开裂导致种子减产和造成土地污染的损失,进而保证作物种子的高产出,具有较高的实际应用价值,应用前景广阔。
Description
技术领域
本发明属于基因工程技术领域,特别涉及一种纤维素酶及其编码基因与应用。
背景技术
在植物体的生命过程中,细胞的分离担任着非常重要的角色:从种子萌发,到塑造植物体的形态,从花药开裂传播花粉,到果实开裂种子散发,都跟细胞在微观和宏观程度上“分离”密切相关。许多种类的果实(例如:角果、蒴果、蓇葖果、荚果等)在其发育的后期阶段,都会发生特殊类型的细胞的分化以及一系列与其紧密相关的分子和生化事件,最终使得成熟的果实开裂,并释放出种子,从而有利于扩大后代植株的生长范围,为繁荣种族以及提高植物的适应性提供了条件和保证。
果实的开裂伴随着种子的传播,是植物完成生命的延续和种族的繁衍的重要途径。然而,在农业生产上,农作物因气候变化而导致果荚过早开裂往往直接影响作物的收成,造成大量浪费。例如油料作物欧洲油菜的年产量,由于果实开裂而造成的损失平均每年可达20%,在不利的天气条件下损失可达50%(Child R.D.et al.,1998,Ethylenebiosynthesis in oilseed rape pods in relation to pod shatter.Journal ofExperimental Botany 49:829–838)。同时,果实开裂也会限制油料作物种子的进一步发育,影响种子的含油量(Jenkins E.et al.,1999,Dehiscence-related expression of anArabidopsis thaliana gene encoding a polygalacturonase in transgenic plantsof Brassica napus.Plant,Cell and Environment 22:159-167.)。另外,因果实开裂而散发的种子还会对其他农作物和环境造成污染(Spence J.et al.,1996,‘Pod shatter’inArabidopsis thaliana,Brassica napus and B.juncea.Journal of Microscope 181:195-203)。由此可见,研究果实开裂对农业生产有着直接而深远的影响。
拟南芥是植物学研究常用的模式植物,与传统油料作物油菜等同科,其果荚发育与开裂与传统油料作物具有相似的结构特点。与传统油料作物相比具有生长周期短(2个月可以完成一次生命周期),基因已全部测序完毕(基因序列清楚),各种相关研究资料非常丰富等优势,很多油料作物品种的改良的方法和思路都来源于拟南芥的相关研究。如调控拟南芥果荚开裂的一个转录因子FRUITFUL(FUL),有研究表明,通过用35S启动子替换编码该转录因子基因的启动子,使该基因异位表达可导致拟南芥的果荚成熟后不开裂(FerrándizC.et al.,2000,Redundant regulation of meristem identity and plantarchitecture by FRUITFULL,APETALA1,and CAULIFLOWER.Development 127:725-734)。而后期在欧洲油菜的研究中,科学家通过同源比对的方法获得了欧洲油菜的FUL同源基因,并通过相同的方法使该基因异位表达,从而筛选出欧洲油菜果实不开裂品系(QstergaardL.et al.,2005,Pod shatter-resistant Brassica fruit produced by ectopicexpression of the FRUITFULL gene.Plant Biotechnology Journal 4:45-51)。由此可见研究拟南芥的果荚开裂的机理,可为其他植物(特别是油料作物)研究提供有力指导和强大的理论依据。
纤维素酶、半纤维素酶和果胶酶是降解细胞壁的主要的三种酶(Patterson S.,2001,Cutting loose,Abscission and dehiscence in Arabidopsis.PlantPhysiology126:494-500;Roberts J.A.et al.,2000,Cell separation processes inplants:models,mechanisms and manipulation.Annals of Botany 86:223-235)。而植物器官的脱落和分离的过程往往伴随着开裂区细胞的细胞壁降解。有研究表明在果荚成熟衰老过程中往往伴随着在开裂区细胞的纤维素酶活性的增强且有十分明显的特异性(Meakinand Roberts,1990,Dehiscence of fruit in oilseed rape(Brassica napus L.).I.Anatomy of pod dehiscence.Journal of Experimental Botany 41:1003-1011);果胶酶伴随的果荚成熟衰老的活性变化虽然不大且无特异性,但是突变编码果胶酶的基因(ADPG1和ADPG2)却导致果荚无法开裂(Ogawa M.et al.,2009,RABIDOPSIS DEHISCENCEZONE POLYGALACTURONASE1(ADPG1),ADPG2,and QUARTET2Are PolygalacturonasesRequired for Cell Separation during Reproductive Development inArabidopsis.Plant cell 21:216-233)。然而参与果荚开裂的纤维素酶基因和半纤维素酶基因仍然未知,鉴定出拟南芥果荚开裂相关的纤维素酶和半纤维素酶将丰富对果荚开裂过程中细胞壁降解过程的理解,并有机会发现新的影响果实开裂的基因和表型,进而对油菜新品种的选育提供新的研究方向和思路,从而增加油菜的产量。与通过调控转录因子来获得不开裂的油菜品系相比,通过调控下游酶类所获得不开裂品系更有具有生产应用价值,因为转录因子是通过调控多个基因的表达来影响整个果荚开裂区的形成,因而调控转录因子所产生的品系一般都会导致果实完全不裂,在生产上反而会因为增加工作量而增加成本,与开裂带来的减产所造成的损失相比得不偿失。而调控下游细胞壁水解酶所产生的品系只会影响一个基因表达情况,并不会对整个果荚开裂区形成造成巨大的影响,因此一般只会导致果实开裂的推迟和开裂率降低,并不会导致果实完全不裂。这样既可以减少开裂造成的损失,又不会增加生产的成本,因此更具有现实应用意义和前景。
发明内容
本发明的首要目的在于克服现有技术的缺点与不足,提供一种纤维素酶,该纤维素酶对植物角果开裂具有调控作用。
本发明的另一目的在于提供所述纤维素酶的编码基因。
本发明的又一目的在于提供所述纤维素酶的应用。
本发明的目的通过下述技术方案实现:一种纤维素酶,名称为CEL6,来源于拟南芥(Arabidopsis thaliana),其氨基酸残基序列如(a)或(b)所示:
(a)如SEQ ID No:1所示的氨基酸残基序列;
(b)对如SEQ ID No:1所示的氨基酸残基序列经过1个或几个氨基酸残基的取代和/或缺失和/或添加且对植物角果开裂具有调控作用的氨基酸序列。
其中,序列表中的SEQ ID No:1由497个氨基酸残基组成,其分子量为55kDa。
上述(a)和(b)中的纤维素酶可人工合成,也可先合成其编码基因,再进行生物表达得到;(b)中CEL6的编码基因可通过将序列SEQ ID No:1中的DNA序列中缺失一个或几个氨基酸残基的密码子,和/或进行一个或几个碱基对的错义突变得到。
所述的纤维素酶的编码基因,其核苷酸序列如(a)、(b)、(c)或(d)所示:
(a)如SEQ ID No:2所示的多核苷酸;
(b)编码如SEQ ID No:1所示的氨基酸的多核苷酸;
(c)与SEQ ID No:2所示的多核苷酸具有80%以上的同源性且对植物角果开裂具有调控作用的核苷酸序列;
(d)在高严谨条件下与SEQ ID No:2限定的DNA序列杂交的核苷酸序列。
其中,序列表中的SEQ ID No:2由3282个碱基组成,其编码区域为82-504,1457-1756,1927-2088,2310-2585,2690-3022五个区域。
上述高严谨条件可为在0.1×SSPE(或0.1×SSC),0.1%SDS的溶液中,在65℃下杂交并洗膜。
含有上述任一所述编码基因的重组载体、重组微生物、转基因细胞系或表达盒也属于本发明的保护范围。
所述重组载体为将上述任一所述编码基因插入出发载体pCAMBIA1305.1的多克隆位点得到的重组表达载体。
所述的重组微生物为将所述重组载体转化细菌得到的重组微生物。
所述的纤维素酶在调控植物角果开裂基因中的应用。
所述植物包括单子叶植物和双子叶植物,优选为拟南芥或油菜品种。
所述的纤维素酶在调控植物角果开裂基因中的应用,是根据所述的纤维素酶基因序列,使用RNA干扰(RNA intrference,RNAi)、反义RNA(anti-sense RNA)、过表达负显性(dominant negative,DN)、crispr/cas9基因编辑技术使正常植物中纤维素酶编码基因CEL6的同源基因的功能丧失或表达量下降,从而实现推迟植物角果开裂和降低植物角果开裂率。
本发明相对于现有技术具有如下的优点及效果:
1、(1)本发明通过现有反向遗传学理论基础,分析目前数据库所含纤维素酶基因的相关信息数据,阐明了纤维素酶基因参与调控角果发育和开裂的机理,并加以qPCR测定拟南芥角果不同发育时期的表达量变化,获的具体参与角果发育和开裂过程的纤维素酶编码基因CEL6;(2)本发明通过突变株角果开裂区细胞超微结构分析和相关统计分析验证了该基因的功能,提供的纤维素酶基因具有重要的应用价值,是根据所述基因序列信息利用RNA干扰(RNAi)、反义(RNA anti-sencse RNA)、过表达负显性(dominant negative,DN)或Crispr/cas9基因编辑技术使正常植物(如拟南芥或油菜品种)中纤维素酶编码基因CEL6的同源基因的功能丧失或表达量下降,从而实现推迟植物角果开裂和降低植物角果开裂率的方法。
2、本发明中的纤维素酶在培育植物角果延缓开裂中的应用,所述的植物角果可以是不开裂的角果品系。
3、本发明的基因被突变或被抑制表达后可显著推迟角果的开裂时间,同时降低植物角果的开裂率,从而减少由于环境因素导致相关作物角果提前开裂导致种子减产和造成土地污染的损失,进而保证作物种子的高产出,具有较高的实际应用价值,应用前景广阔。
附图说明
图1是实施例2中的溶解曲线图,其中,图a表示溶解峰,图b表示溶解曲线。
图2是实施例2中纤维素酶基因CEL6在不同发育时期的果荚的定量PCR表达结果统计图,其中,6D、8D、10D、12D表示开花后的天数;18A表示半黄果荚;18B表示全黄的果荚。
图3是实施例3中纤维素酶基因CEL6的基因示意图以及两种纤维素酶突变株的材料的插入位点的确立的电泳结果图;其中,Col-0表示野生型,cel6-1和cel6-2表示突变型。
图4是实施例4中纤维素酶基因CEL6的突变株cel6-1和cel6-2纯合子材料的t-DNA插入位点的示意图。
图5是实施例5中纤维素酶基因CEL6的突变株cel6-1和cel6-2纯合子材料的成熟果荚中CEL6基因的表达与野生型的对比结果图,其中,Col-0表示野生型。
图6是实施例6中野生型和纤维素酶CEL6突变株果荚从开花到变黄所需天数的统计比较图,其中,Col-0表示野生型,cel6-1和cel6-2表示突变型。
图7是实施例7中野生型和纤维素酶突变株开裂率统计结果图,其中,Col-0表示野生型,cel6-1和cel6-2表示突变型,18期表示拟南芥黄果荚,19期表示果实发育至刚变褐的果荚。
图8是实施例8中纤维素酶突变株cel6-1和cel6-2纯合子材料在果荚成熟、变黄、开裂等阶段的显微结构变化图。其中,Col-0表示野生型,cel6-1和cel6-2表示突变型;18A表示半黄果荚;18B表示全黄的果荚;19A表示干褐的果荚。
具体实施方式
下面结合实施例对本发明作进一步详细的描述,但本发明的实施方式不限于此。
实施例1纤维素酶CEL6基因的克隆
拟南芥纤维素酶的基因全长DNA序列的获得,包括以下步骤:
1.在TAIR数据库(http://www.arabidopsis.org)中以cellulase(纤维素酶)作为关键字进行搜索,并将搜索所得基因的cDNA序列分别在TAIR数据库中进行Blast比对,取比对结果中E-Value小于0.01的所有的基因作为候选基因(共39个)。
2.将所得的候选基因分别在TAIR数据库中的基因表达芯片(microarrayexpression)数据库进行搜索,将有在花发育15期心皮表达的基因保留,其余的剔除(剩下22个)。
3.将剩下的候选基因输入ATTED-II数据库(http://atted.jp)中进行相关共表达基因查询。在查询所得的300个相关基因中寻找跟果荚发育与开裂相关的转录因子(如SHP1,SHP2,ALC等),能与果荚发育与开裂相关的转录因子有共表达结果的候选基因留下,其余的剔除(剩余6个)。
4.以不开裂的转录因子突变株(果荚不开裂转录因子突变体ind-7)和WT(野生型Col-0)为材料,进行定量PCR检测剩余的候选基因是否在两种材料中出现差异,其中WT中表达量比不开裂突变株高的保留,其余的剔除(剩余两个)。其中,果荚不开裂转录因子突变体ind-7通过如下方法获得:
根据文献(Liljegren,S.J.,Roeder,A.H.,Kempin,S.A.,Gremski,K.,L.,Guimil,S.,Reyes,D.K.,and Yanofsky,M.F.2004.Control of fruitpatterning in Arabidopsis by INDEHISCENT.Cell.116:843-853;Wu,H.,Mori,A.,Wang,Y.X.,and Yang,M.2006.The INDEHECIENT protein regulates unequal cell divisionin Arabidopsis fruits.Planta.224:971-979)已知拟南芥转录因子IND突变后果荚将无法开裂。通过下述方法筛选拟南芥IND失活纯合突变体:从ABRC(Arabidopsis BiologicalResource Center)获得IND(AT401200)位点的T-DNA插入的突变体株系的种子(种子编号:SALK_058083),并取名ind-7(Alonso J.M.,Stepanova A.N.,Leisse T.J.,Kim C.J.,ChenH.,Shinn P.,Stevenson D.K.,Zimmerman J.,Barajas P.,Cheuk R.,etal.2003.Genome-wide insertional mutagenesis of Arabidopsis thaliana.Science301:653-657.),并根据网站提供的资料设计所需引物,引物序列如下:
IND鉴定上游引物:5’-CACCCATACAATCTTGGATCG-3’;
IND鉴定下游引物:5’-TGTTCTGGTTGGGTTCAAAAG-3’;
SALK_058083插入引物:5’-TGGTTCACGTAGTGGGCCATCG-3’;
PCR反应体系如下:
PCR反应程序为:
利用上述PCR反应体系和反应程序,以提取的拟南芥的DNA(CTAB法)为模板,使用IND鉴定上游引物和IND鉴定下游引物可在野生型(Col-0)的拟南芥DNA中扩增出一条1198bp的条带,而在ind-7纯合子突变体中均无法扩增出对应条带。而使用SALK_058083插入引物和IND鉴定下游引物,可以在ind-7纯合子突变体中扩增出长度分别为756bp的条带,而无法再野生型的拟南芥的DNA中得到对应条带。从而说明我们通过PCR鉴定,获得了两种不同的T-DNA插入位点的IND失活纯合子突变体,命名为ind7(SALK_058083)。
5.根据这个数据库中这个基因的相关资料,在该基因序列的5’和3’末端序列设计一对引物,引物序列如下:
CEL6克隆上游引物:
5’-GGACTCTTGACCATGGTAGTTGCAAACACCGATC-3’;
CEL6克隆下游引物:
5’-ATTTACCCTCAGATCTACCATAGCAACCATTATATATATGTATA-3’;
提取拟南芥的总DNA(CTAB法),在CEL6克隆上游引物和CEL6克隆下游引物的引导下,以拟南芥的总DNA为模板,PCR扩增拟南芥纤维素酶基因全长DNA序列,50ul PCR反应体系为:总DNA模板2μl,Premix Taq(Ex Taq Version2.0;Takara公司)25μl,CEL6克隆上游引物(2mM)5μl,CEL6克隆下游引物(2mM)5μl,水(ddH2O)13μl。PCR的反应条件为:先94℃变性2分钟;然后94℃变性30秒,58℃复性1分钟,72摄氏度延伸4分钟,共35个循环。反应结束后,对PCR产物进行0.8%琼脂糖凝胶电泳检测,回收并纯化长度为3200bp的扩增片段,使用clotech的infusion-clone技术将其克隆到pCAMBIA1305.1载体中,得到含有回收片段的重组质粒,使用M13引物对(M13上游引物:5’-GAGCGGATAACAATTTCACACAGG-3’;M13下游引物:5’-CGCCAGGGTTTTCCCAGTCACGAC-3’)其进行测序,测序结果表明拟南芥纤维素酶CEL6基因全长具有序列表中SEQ ID No:2的核苷酸序列,由3282个碱基组成,通过序列比对,与拟南芥纤维素酶家族成员之间的同源率在80%以上,将该基因序列归为纤维素酶家族,命名为纤维素酶CEL6(简称:CEL6);其编码区域为82-504,1457-1756,1927-2088,2310-2585,2690-3022这五个区域,编码具有序列表中SEQ ID No.1的氨基酸序列的蛋白质,其编码蛋白命名为纤维素酶CEL6(简称:CEL6),其分子量为55kDa。
实施例2:纤维素酶CEL6在果荚发育中的表达情况变化
在野生型拟南芥材料(Col-0)开花时在花柄上用不同颜色的绳子做记号,记录对应的开花时的时间,然后根据开花后的天数(6D、8D、10D、12D)以及果荚的颜色变化(18A:半黄果荚;18B:全黄的果荚)进行取材,将所得的材料作为样品使用QIAGEN公司的plant Mini Kit提取RNA,并反转录成cDNA,做三组样品平行。然后使用进行定量PCR检测CEL6基因在不同发育时期果荚的表达量,使用的定量引物如下:
CEL6定量PCR上游引物:5’-TTTCCCGATGGCTTTCACCA-3’;
CEL6定量PCR下游引物:5’-GAGAGCCACGAGAGAGTTACG-3’;
内参actin8上游引物:5’-TAAGGTCGTGGCACCACCCG-3’;
内参actin8下游引物:5’-ATCCGAGTTTGAAGAGCTACAA-3’。
所用的反应体系为:
反应程序为:
溶解程序,获得溶解曲线(见图1)。
使用Bio-rad的CFX96TouchTM荧光定量PCR检测系统运行定量PCR反应。结果如图2所示,其中,时期17表示果荚的一个发育阶段,指的是从花器官(花药,萼片等)脱离果荚到果荚变黄(18期)这一阶段的果荚。从图中可以看出纤维素酶基因CEL6在果荚发育和延伸的时期(6D、8D、10D、12D)的表达量要比果荚衰老变黄的时期(18A和18B)表达量高,表达量最高的时期是开花后8D(天)时候的果荚。该结果表明,该基因跟果荚的生长发育相关,在果实开裂区刚形成的时候就已经开始起作用,并影响着果荚开裂区的形态构建。
实施例3:纤维素酶CEL6突变株的筛选
用下述方法筛选拟南芥CEL6失活纯合突变体:从ABRC(ArabidopsisBiologicalResource Center)获得CEL6(AT4G39010)位点两种不同的T-DNA插入的突变体株系的种子(种子编号:SALK_060505和WiscDsLox485-488K15)(Alonso J.M.,Stepanova A.N.,LeisseT.J.,Kim C.J.,Chen H.,Shinn P.,StevensonD.K.,Zimmerman J.,Barajas P.,CheukR.,et al.2003.Genome-wide insertionalmutagenesis of Arabidopsisthaliana.Science 301:653-657.),并根据网站提供的资料设计所需引物,引物序列如下:
CEL6鉴定上游引物:5’-CGTATTATCCAAGCATAAATCGC-3’;
CEL6鉴定下游引物:5’-TAGCTGTGAAACGGATTTGAGG-3’;
SALK_060505插入片段引物:5’-TGGTTCACGTAGTGGGCCATCG-3’;
WiscDsLox485-488K15插入片段引物:
5’-AACGTCCGCAATGTGTTATTAAGTTGTC-3’。
以拟南芥的DNA为模板(实施例1提取的拟南芥DNA),用上、下游引物进行扩增,PCR反应体系如下:
PCR反应程序为:
通过PCR进行鉴定,获得了两种不同的T-DNA插入位点的CEL6失活纯合子突变体,分别命名为cel6-1(SALK_060505)和cel6-2(WiscDsLox485-488K15),具体过程为:
利用上述PCR反应体系和反应程序,使用CEL6鉴定上游引物和CEL6鉴定下游引物可在野生型(Col-0)的拟南芥DNA中扩增出一条620bp的条带,而在cel6-1和cel6-2纯合子突变体中均无法扩增出对应条带(图3A和3B)。而使用SALK_060505插入片段引物和CEL6鉴定下游引物,WiscDsLox485-488K15插入片段引物和CEL6鉴定上游引物可以分别在cel6-1和cel6-2纯合子突变体中扩增出长度分别为838bp和488bp的条带,而无法再野生型的拟南芥的DNA中得到对应条带(图3C和3D)。
实施例4:CEL6失活纯合子突变体的t-DNA插入位点的鉴定
将使用SALK_060505插入片段引物和CEL6鉴定下游引物、以cel6-1的DNA为模板的PCR扩增产物和使用WiscDsLox485-488K15插入片段引物和CEL6鉴定上游引物、以cel6-2的DNA为模板的PCR扩增产物分别测序(cel6-1的DNA和cel6-2的DNA通过实施例3中838bp和488bp的条带回收得到)。根据所得的测序结果在拟南芥数据库中Blast以确定cel6-1和cel6-2的T-DNA的插入位点。结果表明,cel6-1的插入位点位于CEL6基因的5’-Utr前面7个碱基处,属于CEL6启动子区域;而cel6-2的插入位点位于CEL6基因的第一个外显子上(见图4)。
实施例5:验证CEL6失活纯合子突变体中CEL6基因的表达下调
分别以cel6-1和cel6-2的开花后12天的果荚为材料提取RNA,并反转录成cDNA。以cDNA为模板,哥伦比亚型(Col-0)即野生型为对照,使用定量引物与实施例2一致。结果如图5所示,从图中可以看出,所得的两种CEL6失活纯合子突变体中CEL6基因的表达量只有野生型的14%(cel6-1)和6%(cel6-2)。
实施例6:纤维素酶突变体果荚成熟推迟
将实施例2中纤维素酶CEL6的纯合子突变体cel6-1和cel6-2以及对应野生型(哥伦比亚型)的拟南芥材料的花通过绑不同颜色绳子进行标记来记录每朵花开花的日期,然后在果实变黄时通过观察绳子颜色可计算出从花到变黄果荚所需的发育时间,每种材料统计至少50个果荚。结果如图6所示,从图中可以看出,野生型拟南芥(Col-0)从开花到果荚变黄平均需要14.25天,而纤维素酶纯合子突变体cel6-1和cel6-2则分别需要17.41天和17.56天,均比野生型推迟3天左右。说明纤维素酶缺失能造成拟南芥果实发育延迟。
实施例7:纤维素酶突变体果荚开裂率下降
将实施例2中纤维素酶CEL6的纯合子突变体cel6-1和cel6-2以及对应野生型哥伦比亚型(Col-0)的拟南芥材料的果实发育至刚变褐(19期)时,取该果实和这个果荚对应的上一个全黄的果荚(18期)放置于解剖镜下观察是否有裂口。每种材料(cel6-1、cel6-2、Col-0)从6株以上的植物中选取:每棵植物统计15个18期和15个19期的果荚(一共统计90个以上果荚),计算每种材料每种时期的开裂率(开裂率为有裂口的果荚占果荚总数的百分比),然后每种材料用6株植物作为样品重复(每种材料每种时期会得到6个开裂率),再统计获得平均开裂率,在将突变株的不同时期的开裂率和对应的野生型数据进行比较并使用P值校验他们的差异性。
结果如图7所示,从图中可以看出,野生型拟南芥黄果荚(18期)的开裂率为76±3.6%,而纤维素酶纯合子突变体cel6-1和cel6-2黄果荚(18期)的开裂率为10±4.2%和20±4.4%;在变褐果荚中,野生型的开裂率为100%,而突变株的开裂率为42±6.5%和44±10%。突变株相对野生型的P值均远低于0.05,表明突变株和野生型在开裂率之间存在显著的统计学差异。说明纤维素酶缺失后会造成拟南芥果实开裂率下降。
实施例8:纤维素酶突变体果荚对果荚发育过程中结构的影响
取野生型(Col-0),纤维素酶CEL6失活纯合子突变株cel6-1和cel6-2的半黄果荚(18A),全黄的果荚(18B)以及干褐的果荚(19A)在体式显微镜进行检测。结果如图8所示:在果荚半黄的时期(18A),野生型以及纤维素酶CEL6失活纯合子突变株cel6-1和cel6-2的果荚均未开裂。在果荚全黄的时期(18B),野生型的果瓣和胎座框已经分离,果荚已经开始开裂,而纤维素酶CEL6失活纯合子突变株cel6-1和cel6-2的果荚并未裂开。在干褐果荚(19A)中,野生型的果荚已经完全裂开,而纤维素酶CEL6失活纯合子突变株cel6-1和cel6-2的果荚的整个开裂区仍处于未开裂状态。说明纤维素酶突变后对果荚的开裂造成巨大的影响。
上述实施例为本发明较佳的实施方式,但本发明的实施方式并不受上述实施例的限制,其他的任何未背离本发明的精神实质与原理下所作的改变、修饰、替代、组合、简化,均应为等效的置换方式,都包含在本发明的保护范围之内。
SEQUENCE LISTING
<110> 华南农业大学
<120> 一种纤维素酶及其编码基因与应用
<130> 1
<160> 17
<170> PatentIn version 3.5
<210> 1
<211> 497
<212> CEL6
<213> 拟南芥属拟南芥(Arabidopsis thaliana)
<400> 1
Met Lys Leu His Leu His Pro Leu Ser Leu Val Phe Phe Phe Phe Phe
1 5 10 15
Phe Phe Phe Arg Pro Thr Met Ser Ser Asn Gln His Asp Tyr Ser Asp
20 25 30
Ala Leu Ser Lys Ser Ile Leu Phe Phe Glu Gly Gln Arg Ser Gly Tyr
35 40 45
Leu Pro Asn Asp Gln Arg Met Thr Trp Arg Arg Asn Ser Gly Leu Ser
50 55 60
Asp Gly Trp Thr His Asn Ile Asp Leu Thr Gly Gly Tyr Tyr Asp Ala
65 70 75 80
Gly Asp Asn Val Lys Phe Asn Phe Pro Met Ala Phe Thr Thr Thr Met
85 90 95
Leu Ala Trp Ser Val Ile Glu Phe Gly Glu Phe Met Pro Ser Ser Glu
100 105 110
Leu Arg Asn Ser Leu Val Ala Leu Arg Trp Ser Ser Asn Tyr Leu Leu
115 120 125
Lys Ser Val Ser Gln Leu Pro Asn Arg Ile Phe Val Gln Val Gly Asp
130 135 140
Pro Ile Ala Asp His Asn Cys Trp Glu Arg Pro Glu Asp Met Asp Thr
145 150 155 160
Pro Arg Thr Ala Tyr Ala Val Asn Ala Pro Asn Pro Ala Ser Glu Val
165 170 175
Ala Gly Glu Thr Thr Ala Ala Leu Ser Ala Ala Ser Ile Ala Phe Arg
180 185 190
Ser Ser Asp Pro Gly Tyr Ser Gln Thr Leu Leu Gln Asn Ala Val Lys
195 200 205
Thr Phe Gln Phe Ala Asp Met Tyr Arg Gly Ala Tyr Ser Ser Asn Asp
210 215 220
Asp Ile Lys Asn Asp Val Cys Pro Phe Tyr Cys Asp Phe Asn Gly Phe
225 230 235 240
Gln Asp Glu Leu Leu Trp Gly Ala Ala Trp Leu Arg Lys Ala Thr Gly
245 250 255
Asp Glu Ser Tyr Leu Asn Tyr Ile Glu Ser Asn Arg Glu Pro Phe Gly
260 265 270
Ala Asn Asp Asn Val Asp Glu Phe Gly Trp Asp Asn Lys Val Gly Gly
275 280 285
Leu Asn Val Leu Val Ser Lys Glu Val Ile Glu Gly Asn Met Tyr Asn
290 295 300
Leu Glu Ala Tyr Lys Ala Ser Ala Glu Ser Phe Met Cys Ser Leu Val
305 310 315 320
Pro Glu Ser Ser Gly Pro His Val Glu Tyr Thr Ser Ala Gly Leu Leu
325 330 335
Tyr Lys Pro Gly Gly Ser Gln Leu Gln His Ala Thr Thr Ile Ser Phe
340 345 350
Leu Leu Leu Val Tyr Ala Gln Tyr Leu Ser Arg Ser Ser Leu Ser Leu
355 360 365
Asn Cys Gly Thr Leu Thr Val Pro Pro Asp Tyr Leu Arg Arg Leu Ala
370 375 380
Lys Lys Gln Val Asp Tyr Ile Leu Gly Asn Asn Pro Met Gly Leu Ser
385 390 395 400
Tyr Met Val Gly Tyr Gly Glu Arg Tyr Pro Lys Arg Ile His His Arg
405 410 415
Gly Ser Ser Leu Pro Ser Ile Val Asp His Pro Glu Ala Ile Arg Cys
420 425 430
Lys Asp Gly Ser Val Tyr Phe Asn Ser Thr Glu Pro Asn Pro Asn Val
435 440 445
Leu Ile Gly Ala Val Val Gly Gly Pro Gly Glu Asp Asp Met Tyr Asp
450 455 460
Asp Asp Arg Ser Asp Phe Arg Lys Ser Glu Pro Thr Thr Tyr Ile Asn
465 470 475 480
Ala Pro Phe Val Gly Val Leu Ala Tyr Phe Ala Ala Asn Pro Gly Ser
485 490 495
Ser
<210> 2
<211> 3282
<212> DNA
<213> 拟南芥属拟南芥(Arabidopsis thaliana)
<400> 2
gtagttgcaa acaccgatca gtcacagaca gtagcctctc accatcacca ccacggcggc 60
cgtcaccgac gccgatacac catgaaatta catttacacc ctctctcact tgtcttcttc 120
tttttcttct tcttcttccg accgacaatg tcgtcgaacc agcacgatta ctccgacgca 180
ctctcaaaat caattctctt cttcgaaggc cagcgctccg gttacctccc aaacgaccag 240
cgtatgacct ggcgccgtaa ctccggtctg agcgacggtt ggacacacaa catcgattta 300
accggcggtt attacgacgc cggagacaac gtcaaattca atttcccgat ggctttcacc 360
accacaatgc tcgcctggag cgtaattgag ttcggcgaat tcatgccttc gtcggagcta 420
cgtaactctc tcgtggctct ccggtggagc tctaattacc tcctcaaatc cgtttcacag 480
ctacctaacc ggatcttcgt ccaggttagt gtggttttaa tttgccaatt caatttcgta 540
actgtttttt tttatgccgg tgagtggcat ttttagtgaa tacgaaacta tggggaaggg 600
taaattgggg aattaaaata acacagagat tgaggaaact gaatcgtttg agttgccaag 660
agatgaaacg aacccgtgaa gtgggtgttt gtttttggac tctcaagagt ctctctgcat 720
ttaccttttt tgtcgttgcg tcacactttt tttggccctt tctattttcc tatatattct 780
tttctttcta cgaaattatc aatacacatc aatatcaatt taattagtta ttgttaatga 840
caaagtccag tacgtaacat acgtataaca tcgaatgaat catttaggaa ataacaattg 900
actaggaact ttataagttg gcaacgtaaa tgtcaattta attttggtct tattttgtgt 960
gtgtcccgta ctttgtagta aatacaaaga tgaccaacca ttagttttag ttgcagggtt 1020
ttttcgtttg tttttatgtt tattttttta aaaaaactac aacaataaat tttgttgact 1080
taaattcagc aataaatgga taatgataat tcaatagtac acctctaaaa taaattagtg 1140
aatgggttat gaaattatca acattgattc aatggtcgtt tgcatcataa ttaataaatt 1200
atgatcattt aataattatt agagttgaga tacgatgata gagacttctt tctttcagaa 1260
tcgttcaatt ctctttgttc caatattaga aaaatcaaac atcagatatg atgttgatgt 1320
tgttaggtgc atttcttatt ttattcattg ataatttcat atcttgattt atatattgtt 1380
ttatttagag ggcttgaata tttaatattt ttgaaaacta actaaggtag gaaaaaaata 1440
aacgaacttg tgataggtag gtgatccgat agcagatcat aattgctggg aaagaccaga 1500
agacatggat acaccacgga ctgcttacgc cgttaatgcc ccgaatccgg cttctgaagt 1560
tgccggagaa actacggcgg ccctctcagc tgcttcgatt gctttccgat catcggatcc 1620
aggatactct caaacattgc tccaaaatgc tgtcaaaact tttcagttcg ctgatatgta 1680
tcgcggtgct tatagcagca atgacgatat caaaaatgat gtgtgtcctt tctactgcga 1740
tttcaacgga tttcaggtaa acatcaatcc caacatttgt gtgactaaag aagttatcaa 1800
atatttaaca ggatcattag aacaagctaa atatctatac attcttggta aatattatat 1860
tttgtacatt ttcttggtga tgattttctt taatgttgtt ttttgtaaaa tttttgttat 1920
gggtaggatg agttattgtg gggagcagct tggctaagaa aagcgactgg tgatgaaagc 1980
taccttaatt acatagagag taaccgtgaa ccgtttggtg ctaacgataa cgtagacgaa 2040
tttggctggg acaacaaagt tggtggactt aatgttctcg tttcgaaggt tagttcaata 2100
cttccaattt tattaacatc aaatttaggg gatgaaatcg aaattaactt taaatacttt 2160
agcaaaagtt tagtaaacac gaggctttca cgtgcatatc aaagttatgt tcttttcttt 2220
ccgtgtacgt gaatccaaac tgacccggtt aagaaccaaa ccggtaccat aactgataaa 2280
cttggtggtt tgatttaatt tcggtacagg aagttattga aggaaatatg tacaacttag 2340
aggcgtacaa ggcatcagca gagagcttca tgtgtagtct ggtcccggaa tcttcgggac 2400
cgcatgttga gtatacttcc gctggtctac tttacaagcc cggcggtagc cagctccaac 2460
atgcgaccac catatctttc ctcctcctcg tatacgctca atatctttca agaagctctc 2520
tctccctcaa ttgcggcacc cttaccgtcc ctcctgatta tctccgccgt cttgccaaga 2580
aacaggttag accggtttga gttggtttgg tttaagttta taagtgaata acatataatt 2640
tgtttttggc atccagcaaa actaatcatg tatttattgg aaaatgaagg tggactatat 2700
attggggaat aatccgatgg gattatcgta catggtcgga tacggagaga gatatccaaa 2760
gaggatccat caccgtggct catcattgcc atcgattgtg gaccatccag aagccatcag 2820
atgcaaagac gggtcggttt atttcaattc taccgaaccg aacccgaacg ttttgattgg 2880
agcggtggtg ggtgggcccg gagaggacga tatgtacgat gatgatcggt ccgatttccg 2940
caagtccgag cccactactt acatcaatgc tcctttcgtt ggcgttttgg cgtactttgc 3000
agccaatcct ggttctagct aagctgacga catgggagta aaattggaaa acattattct 3060
taagttccaa gagtatggga gttctttttt ttcttacaag tctctcatct tcttgcccaa 3120
gtaaagaagg aaaaagttct tacagttaaa atagtttttc ttttacgcaa aattaatctg 3180
aataacttat tcggattatt cacctgttga tcatttgata cttctttttt tatcaaatgt 3240
aattaatagt ctattcatat atacatatat ataatggttg ct 3282
<210> 3
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> IND鉴定上游引物
<400> 3
cacccataca atcttggatc g 21
<210> 4
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> IND鉴定下游引物
<400> 4
tgttctggtt gggttcaaaa g 21
<210> 5
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> SALK_058083插入引物
<400> 5
tggttcacgt agtgggccat cg 22
<210> 6
<211> 34
<212> DNA
<213> Artificial Sequence
<220>
<223> CEL6克隆上游引物
<400> 6
ggactcttga ccatggtagt tgcaaacacc gatc 34
<210> 7
<211> 44
<212> DNA
<213> Artificial Sequence
<220>
<223> CEL6克隆下游引物
<400> 7
atttaccctc agatctacca tagcaaccat tatatatatg tata 44
<210> 8
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> M13上游引物
<400> 8
gagcggataa caatttcaca cagg 24
<210> 9
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> M13下游引物
<400> 9
cgccagggtt ttcccagtca cgac 24
<210> 10
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> CEL6定量PCR上游引物
<400> 10
tttcccgatg gctttcacca 20
<210> 11
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> CEL6定量PCR下游引物
<400> 11
gagagccacg agagagttac g 21
<210> 12
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> 内参actin8上游引物
<400> 12
taaggtcgtg gcaccacccg 20
<210> 13
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> 内参actin8下游引物
<400> 13
atccgagttt gaagagctac aa 22
<210> 14
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> CEL6鉴定上游引物
<400> 14
cgtattatcc aagcataaat cgc 23
<210> 15
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> CEL6鉴定下游引物
<400> 15
tagctgtgaa acggatttga gg 22
<210> 16
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> SALK_060505插入片段引物
<400> 16
tggttcacgt agtgggccat cg 22
<210> 17
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> WiscDsLox485-488K15插入片段引物
<400> 17
aacgtccgca atgtgttatt aagttgtc 28
Claims (10)
1.一种纤维素酶,其特征在于,其氨基酸残基序列如(a)或(b)所示:
(a)如SEQ ID No:1所示的氨基酸残基序列;
(b)对如SEQ ID No:1所示的氨基酸残基序列经过1个或几个氨基酸残基的取代和/或缺失和/或添加且对植物角果开裂具有调控作用的氨基酸序列。
2.权利要求1所述的纤维素酶的编码基因。
3.根据权利要求2所述的纤维素酶的编码基因,其特征在于,其核苷酸序列如(a)、(b)、(c)或(d)所示:
(a)如SEQ ID No:2所示的多核苷酸;
(b)编码如SEQ ID No:1所示的氨基酸的多核苷酸;
(c)与SEQ ID No:2所示的多核苷酸具有80%以上的同源性且对植物角果开裂具有调控作用的核苷酸序列;
(d)在高严谨条件下与SEQ ID No:2限定的DNA序列杂交的核苷酸序列。
4.含有权利要求2或3所述编码基因的重组载体。
5.含有权利要求2或3所述编码基因的重组微生物。
6.含有权利要求2或3所述编码基因的转基因细胞系。
7.含有权利要求2或3所述编码基因的表达盒。
8.权利要求1所述的纤维素酶在调控植物角果开裂基因中的应用。
9.根据权利要求8所述的纤维素酶在调控植物角果开裂基因中的应用,其特征在于:所述植物包括单子叶植物和双子叶植物。
10.根据权利要求8所述的纤维素酶在调控植物角果开裂基因中的应用,其特征在于:所述植物为拟南芥或油菜品种。
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201710283553.6A CN106967702A (zh) | 2017-04-26 | 2017-04-26 | 一种纤维素酶及其编码基因与应用 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201710283553.6A CN106967702A (zh) | 2017-04-26 | 2017-04-26 | 一种纤维素酶及其编码基因与应用 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN106967702A true CN106967702A (zh) | 2017-07-21 |
Family
ID=59333360
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201710283553.6A Pending CN106967702A (zh) | 2017-04-26 | 2017-04-26 | 一种纤维素酶及其编码基因与应用 |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN106967702A (zh) |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1211282A (zh) * | 1995-10-06 | 1999-03-17 | 植物遗传系统公司 | 种子散失 |
-
2017
- 2017-04-26 CN CN201710283553.6A patent/CN106967702A/zh active Pending
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1211282A (zh) * | 1995-10-06 | 1999-03-17 | 植物遗传系统公司 | 种子散失 |
Non-Patent Citations (5)
| Title |
|---|
| CRISTINA FERRÁNDIZ: "Regulation of fruit dehiscence in Arabidopsis", 《JOURNAL OF EXPERIMENTAL BOTANY》 * |
| PAUL J. MEAKIN等: "Anatomical and Biochemical Changes Associated with the Induction of Oilseed Rape (Brassica napus) Pod Dehiscence by Dasineura brassicae (Winn.)", 《ANNALS OF BOTANY》 * |
| 张的: "纤维素酶和半纤维素酶在拟南芥果荚开裂中的作用机制研究", 《豆丁网》 * |
| 登录号:CP002687.1: "Arabidopsis thaliana chromosome 4 sequence", 《GENBANK》 * |
| 登录号:Q93YQ7: "Endoglucanase 24", 《UNIPROTKB》 * |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11795466B2 (en) | Wheat-male sterility gene WMS and its anther-specific expression promoter and uses thereof | |
| CN108603197B (zh) | 提高植物氮利用效率的方法 | |
| US20130160158A1 (en) | Manipulation of glutamine synthetases (gs) to improve nitrogen use efficiency and grain yield in higher plants | |
| EP2173882A2 (en) | Secondary wall forming genes from maize and uses thereof | |
| US9518266B2 (en) | Maize genes for controlling plant growth and organ size and their use in improving crop plants | |
| WO2014164014A1 (en) | Genes for improving nutrient uptake and abiotic stress tolerance in plants | |
| US20080222753A1 (en) | Manipulation of Ammonium Transporters (AMTS) to Improve Nitrogen Use Efficiency in Higher Plants | |
| EP3390641B1 (en) | Compositions and methods for manipulating the development of plants | |
| US20220396804A1 (en) | Methods of improving seed size and quality | |
| WO2020156367A1 (zh) | 提高水稻产量和/或稻瘟病抗性的方法及其所用蛋白质 | |
| JP2020511960A (ja) | 光呼吸効率が増加した植物 | |
| CN108291234A (zh) | 倍数孢子体形成基因 | |
| US20120084881A1 (en) | Drought tolerant plants | |
| CN115992153A (zh) | 小麦茎基腐病抗性基因TaP5CS、其编码蛋白及应用 | |
| WO2015007241A1 (en) | Molecular marker | |
| CN107022533A (zh) | 一种半纤维素酶及其编码基因与应用 | |
| CN104109682B (zh) | 一种果胶裂解酶BnPL基因及其启动子和应用 | |
| CA2768571A1 (en) | The use of dimerization domain component stacks to modulate plant architecture | |
| CN103172714B (zh) | 水稻叶片卷曲相关蛋白OsMYB103L及其编码基因与应用 | |
| CN110092819A (zh) | 玉米苞叶宽度调控蛋白arf2及其编码基因与应用 | |
| US20120227132A1 (en) | Cell number polynucleotides and polypeptides and methos of use thereof | |
| CN106967702A (zh) | 一种纤维素酶及其编码基因与应用 | |
| CA2957378A1 (en) | Methods and materials for producing fruit of altered size | |
| CN119529047A (zh) | 蛋白SlRD1及其编码基因与应用 | |
| CN119351449A (zh) | 玉米转录因子基因ZmWRKY76、其编码蛋白及生物材料在耐高温胁迫中的应用 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170721 |
|
| RJ01 | Rejection of invention patent application after publication |