US20090030185A1 - Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins - Google Patents
Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins Download PDFInfo
- Publication number
- US20090030185A1 US20090030185A1 US12/121,140 US12114008A US2009030185A1 US 20090030185 A1 US20090030185 A1 US 20090030185A1 US 12114008 A US12114008 A US 12114008A US 2009030185 A1 US2009030185 A1 US 2009030185A1
- Authority
- US
- United States
- Prior art keywords
- hyp
- sequence
- pro
- gene
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 102000003886 Glycoproteins Human genes 0.000 title claims abstract description 29
- 108090000288 Glycoproteins Proteins 0.000 title claims abstract description 29
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 title abstract description 39
- 108700005078 Synthetic Genes Proteins 0.000 title abstract description 29
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 title abstract description 12
- 229960002591 hydroxyproline Drugs 0.000 title abstract description 12
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 title abstract description 12
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 67
- 229920000084 Gum arabic Polymers 0.000 claims abstract description 35
- 235000010489 acacia gum Nutrition 0.000 claims abstract description 35
- 239000000205 acacia gum Substances 0.000 claims abstract description 35
- 108010054251 arabinogalactan proteins Proteins 0.000 claims abstract description 19
- 241000978776 Senegalia senegal Species 0.000 claims abstract 3
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 50
- 229920001184 polypeptide Polymers 0.000 claims description 41
- 102000040430 polynucleotide Human genes 0.000 claims description 22
- 108091033319 polynucleotide Proteins 0.000 claims description 22
- 239000002157 polynucleotide Substances 0.000 claims description 22
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 claims 10
- 108010026333 seryl-proline Proteins 0.000 claims 10
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 claims 8
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 claims 8
- 108010087924 alanylproline Proteins 0.000 claims 8
- 241000196324 Embryophyta Species 0.000 abstract description 86
- 230000003252 repetitive effect Effects 0.000 abstract description 25
- 238000004519 manufacturing process Methods 0.000 abstract description 19
- 238000013459 approach Methods 0.000 abstract description 10
- 102100037084 C4b-binding protein alpha chain Human genes 0.000 abstract description 3
- 101710136733 Proline-rich protein Proteins 0.000 abstract description 3
- 108090000623 proteins and genes Proteins 0.000 description 97
- 210000004027 cell Anatomy 0.000 description 67
- 150000007523 nucleic acids Chemical group 0.000 description 55
- 108091028043 Nucleic acid sequence Proteins 0.000 description 43
- 102000004169 proteins and genes Human genes 0.000 description 42
- 102000053602 DNA Human genes 0.000 description 41
- 235000018102 proteins Nutrition 0.000 description 41
- 108020004414 DNA Proteins 0.000 description 38
- 244000215068 Acacia senegal Species 0.000 description 35
- 238000000034 method Methods 0.000 description 30
- 239000005090 green fluorescent protein Substances 0.000 description 24
- 101710129170 Extensin Proteins 0.000 description 23
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 23
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 23
- 108010076504 Protein Sorting Signals Proteins 0.000 description 23
- 150000001413 amino acids Chemical class 0.000 description 23
- 241000227653 Lycopersicon Species 0.000 description 22
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 22
- 241000589158 Agrobacterium Species 0.000 description 21
- 239000013612 plasmid Substances 0.000 description 21
- 239000013598 vector Substances 0.000 description 20
- 108091034117 Oligonucleotide Proteins 0.000 description 18
- 239000012634 fragment Substances 0.000 description 18
- 230000000295 complement effect Effects 0.000 description 17
- 239000013604 expression vector Substances 0.000 description 17
- 239000002245 particle Substances 0.000 description 17
- 210000001519 tissue Anatomy 0.000 description 17
- 102000039446 nucleic acids Human genes 0.000 description 15
- 108020004707 nucleic acids Proteins 0.000 description 15
- 108020001507 fusion proteins Proteins 0.000 description 14
- 102000037865 fusion proteins Human genes 0.000 description 14
- 239000000523 sample Substances 0.000 description 14
- 108700008625 Reporter Genes Proteins 0.000 description 13
- 108010048367 enhanced green fluorescent protein Proteins 0.000 description 13
- 238000009396 hybridization Methods 0.000 description 13
- 239000000047 product Substances 0.000 description 12
- 230000009466 transformation Effects 0.000 description 12
- 239000000243 solution Substances 0.000 description 11
- 244000061176 Nicotiana tabacum Species 0.000 description 10
- 235000013305 food Nutrition 0.000 description 10
- 230000004927 fusion Effects 0.000 description 10
- 108091035707 Consensus sequence Proteins 0.000 description 9
- 101150061611 HRGP gene Proteins 0.000 description 9
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 9
- 108700019146 Transgenes Proteins 0.000 description 9
- 238000010276 construction Methods 0.000 description 9
- 238000003780 insertion Methods 0.000 description 9
- 230000037431 insertion Effects 0.000 description 9
- 239000000203 mixture Substances 0.000 description 9
- 108010058731 nopaline synthase Proteins 0.000 description 9
- 239000013615 primer Substances 0.000 description 9
- -1 strain LBA4301 Chemical compound 0.000 description 9
- 230000009261 transgenic effect Effects 0.000 description 9
- 102000053187 Glucuronidase Human genes 0.000 description 8
- 108010060309 Glucuronidase Proteins 0.000 description 8
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 8
- 238000006206 glycosylation reaction Methods 0.000 description 8
- 238000012546 transfer Methods 0.000 description 8
- 230000027455 binding Effects 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 238000001514 detection method Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 229940088598 enzyme Drugs 0.000 description 7
- 230000013595 glycosylation Effects 0.000 description 7
- 239000003550 marker Substances 0.000 description 7
- 229920002477 rna polymer Polymers 0.000 description 7
- 238000004114 suspension culture Methods 0.000 description 7
- 230000008685 targeting Effects 0.000 description 7
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 6
- 102000004190 Enzymes Human genes 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 6
- 239000002299 complementary DNA Substances 0.000 description 6
- 238000013461 design Methods 0.000 description 6
- 230000001404 mediated effect Effects 0.000 description 6
- 239000002773 nucleotide Substances 0.000 description 6
- 125000003729 nucleotide group Chemical group 0.000 description 6
- 229920001282 polysaccharide Polymers 0.000 description 6
- 239000005017 polysaccharide Substances 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 241000701489 Cauliflower mosaic virus Species 0.000 description 5
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- 240000008042 Zea mays Species 0.000 description 5
- 125000003275 alpha amino acid group Chemical group 0.000 description 5
- 230000003115 biocidal effect Effects 0.000 description 5
- 230000006801 homologous recombination Effects 0.000 description 5
- 238000002744 homologous recombination Methods 0.000 description 5
- 230000036961 partial effect Effects 0.000 description 5
- 230000008488 polyadenylation Effects 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- LWTDZKXXJRRKDG-KXBFYZLASA-N (-)-phaseollin Chemical compound C1OC2=CC(O)=CC=C2[C@H]2[C@@H]1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-KXBFYZLASA-N 0.000 description 4
- LMKYZBGVKHTLTN-NKWVEPMBSA-N D-nopaline Chemical compound NC(=N)NCCC[C@@H](C(O)=O)N[C@@H](C(O)=O)CCC(O)=O LMKYZBGVKHTLTN-NKWVEPMBSA-N 0.000 description 4
- 101150066002 GFP gene Proteins 0.000 description 4
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 4
- 108010059712 Pronase Proteins 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 4
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 4
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 4
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 230000001419 dependent effect Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 241001233957 eudicotyledons Species 0.000 description 4
- 210000000416 exudates and transudate Anatomy 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- 229930027917 kanamycin Natural products 0.000 description 4
- 229960000318 kanamycin Drugs 0.000 description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 4
- 229930182823 kanamycin A Natural products 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 238000007899 nucleic acid hybridization Methods 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- DQJCDTNMLBYVAY-ZXXIYAEKSA-N (2S,5R,10R,13R)-16-{[(2R,3S,4R,5R)-3-{[(2S,3R,4R,5S,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy}-5-(ethylamino)-6-hydroxy-2-(hydroxymethyl)oxan-4-yl]oxy}-5-(4-aminobutyl)-10-carbamoyl-2,13-dimethyl-4,7,12,15-tetraoxo-3,6,11,14-tetraazaheptadecan-1-oic acid Chemical compound NCCCC[C@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@@H](C)NC(=O)C(C)O[C@@H]1[C@@H](NCC)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 DQJCDTNMLBYVAY-ZXXIYAEKSA-N 0.000 description 3
- 241000220479 Acacia Species 0.000 description 3
- 235000006491 Acacia senegal Nutrition 0.000 description 3
- 229920000189 Arabinogalactan Polymers 0.000 description 3
- 239000001904 Arabinogalactan Substances 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 108090000317 Chymotrypsin Proteins 0.000 description 3
- IMXSCCDUAFEIOE-UHFFFAOYSA-N D-Octopin Natural products OC(=O)C(C)NC(C(O)=O)CCCN=C(N)N IMXSCCDUAFEIOE-UHFFFAOYSA-N 0.000 description 3
- IMXSCCDUAFEIOE-RITPCOANSA-N D-octopine Chemical compound [O-]C(=O)[C@@H](C)[NH2+][C@H](C([O-])=O)CCCNC(N)=[NH2+] IMXSCCDUAFEIOE-RITPCOANSA-N 0.000 description 3
- 238000001712 DNA sequencing Methods 0.000 description 3
- 102220566524 GDNF family receptor alpha-1_F99S_mutation Human genes 0.000 description 3
- 235000010469 Glycine max Nutrition 0.000 description 3
- 244000068988 Glycine max Species 0.000 description 3
- 102000002068 Glycopeptides Human genes 0.000 description 3
- 108010015899 Glycopeptides Proteins 0.000 description 3
- 235000010643 Leucaena leucocephala Nutrition 0.000 description 3
- 241000209510 Liliopsida Species 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 241000208133 Nicotiana plumbaginifolia Species 0.000 description 3
- 108700001094 Plant Genes Proteins 0.000 description 3
- 238000012300 Sequence Analysis Methods 0.000 description 3
- 206010052428 Wound Diseases 0.000 description 3
- 208000027418 Wounds and injury Diseases 0.000 description 3
- 239000000654 additive Substances 0.000 description 3
- 238000000137 annealing Methods 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 239000012472 biological sample Substances 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 229960002376 chymotrypsin Drugs 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 230000007613 environmental effect Effects 0.000 description 3
- 239000007789 gas Substances 0.000 description 3
- 150000004676 glycans Chemical class 0.000 description 3
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 3
- 229910052737 gold Inorganic materials 0.000 description 3
- 239000010931 gold Substances 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 235000021374 legumes Nutrition 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000009871 nonspecific binding Effects 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 230000004481 post-translational protein modification Effects 0.000 description 3
- 108010030511 potato lectin Proteins 0.000 description 3
- 230000008929 regeneration Effects 0.000 description 3
- 238000011069 regeneration method Methods 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 230000032258 transport Effects 0.000 description 3
- HCWLJSDMOMMDRF-SZWOQXJISA-N 3-[(3s,6r)-2-oxo-6-[(1s,2r,3r)-1,2,3,4-tetrahydroxybutyl]morpholin-3-yl]propanamide Chemical compound NC(=O)CC[C@@H]1NC[C@H]([C@@H](O)[C@H](O)[C@H](O)CO)OC1=O HCWLJSDMOMMDRF-SZWOQXJISA-N 0.000 description 2
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 2
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 2
- ILQOASSPNGAIBC-UHFFFAOYSA-N Agropine Natural products OC(O)C(O)CC(O)C1CN2C(CCC2=O)C(=O)O1 ILQOASSPNGAIBC-UHFFFAOYSA-N 0.000 description 2
- 241000972773 Aulopiformes Species 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- 241000195493 Cryptophyta Species 0.000 description 2
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 102220566469 GDNF family receptor alpha-1_S65T_mutation Human genes 0.000 description 2
- 102220566451 GDNF family receptor alpha-1_Y66H_mutation Human genes 0.000 description 2
- 206010020649 Hyperkeratosis Diseases 0.000 description 2
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 2
- 108060001084 Luciferase Proteins 0.000 description 2
- 239000005089 Luciferase Substances 0.000 description 2
- 239000006137 Luria-Bertani broth Substances 0.000 description 2
- NWBJYWHLCVSVIJ-UHFFFAOYSA-N N-benzyladenine Chemical compound N=1C=NC=2NC=NC=2C=1NCC1=CC=CC=C1 NWBJYWHLCVSVIJ-UHFFFAOYSA-N 0.000 description 2
- 229930193140 Neomycin Natural products 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 240000007594 Oryza sativa Species 0.000 description 2
- 235000007164 Oryza sativa Nutrition 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 101710163504 Phaseolin Proteins 0.000 description 2
- 108010089814 Plant Lectins Proteins 0.000 description 2
- 241000209504 Poaceae Species 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 2
- 241000208292 Solanaceae Species 0.000 description 2
- 244000061456 Solanum tuberosum Species 0.000 description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- 235000021307 Triticum Nutrition 0.000 description 2
- 244000098338 Triticum aestivum Species 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 2
- 229960004150 aciclovir Drugs 0.000 description 2
- MKUXAQIIEYXACX-UHFFFAOYSA-N aciclovir Chemical compound N1C(N)=NC(=O)C2=C1N(COCCO)C=N2 MKUXAQIIEYXACX-UHFFFAOYSA-N 0.000 description 2
- 230000002411 adverse Effects 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 235000019312 arabinogalactan Nutrition 0.000 description 2
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 2
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 2
- 229960003669 carbenicillin Drugs 0.000 description 2
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 238000001816 cooling Methods 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 239000002537 cosmetic Substances 0.000 description 2
- 210000004748 cultured cell Anatomy 0.000 description 2
- 231100000433 cytotoxic Toxicity 0.000 description 2
- 230000001472 cytotoxic effect Effects 0.000 description 2
- 230000022811 deglycosylation Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 239000002158 endotoxin Substances 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 239000000796 flavoring agent Substances 0.000 description 2
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 2
- 229960002963 ganciclovir Drugs 0.000 description 2
- IRSCQMHQWWYFCW-UHFFFAOYSA-N ganciclovir Chemical compound O=C1NC(N)=NC2=C1N=CN2COC(CO)CO IRSCQMHQWWYFCW-UHFFFAOYSA-N 0.000 description 2
- 101150054900 gus gene Proteins 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 235000009973 maize Nutrition 0.000 description 2
- 210000001161 mammalian embryo Anatomy 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000000386 microscopy Methods 0.000 description 2
- 229960004927 neomycin Drugs 0.000 description 2
- 229920001542 oligosaccharide Polymers 0.000 description 2
- 230000001590 oxidative effect Effects 0.000 description 2
- LWTDZKXXJRRKDG-UHFFFAOYSA-N phaseollin Natural products C1OC2=CC(O)=CC=C2C2C1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-UHFFFAOYSA-N 0.000 description 2
- 239000002953 phosphate buffered saline Substances 0.000 description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 230000004853 protein function Effects 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 235000009566 rice Nutrition 0.000 description 2
- 235000019515 salmon Nutrition 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 239000006152 selective media Substances 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 2
- 229910000162 sodium phosphate Inorganic materials 0.000 description 2
- 235000014214 soft drink Nutrition 0.000 description 2
- 125000006850 spacer group Chemical group 0.000 description 2
- 239000013589 supplement Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- VDIQZIVYOLXOTG-VCGPICOLSA-N (2s)-3-hydroxy-2-[[(3r,4s,5r,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]amino]propanoic acid Chemical group OC[C@@H](C(O)=O)NC1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O VDIQZIVYOLXOTG-VCGPICOLSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical compound NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- JXCKZXHCJOVIAV-UHFFFAOYSA-N 6-[(5-bromo-4-chloro-1h-indol-3-yl)oxy]-3,4,5-trihydroxyoxane-2-carboxylic acid;cyclohexanamine Chemical compound [NH3+]C1CCCCC1.O1C(C([O-])=O)C(O)C(O)C(O)C1OC1=CNC2=CC=C(Br)C(Cl)=C12 JXCKZXHCJOVIAV-UHFFFAOYSA-N 0.000 description 1
- 241000208140 Acer Species 0.000 description 1
- 235000004422 Acer negundo Nutrition 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 244000144730 Amygdalus persica Species 0.000 description 1
- 235000011514 Anogeissus latifolia Nutrition 0.000 description 1
- 244000106483 Anogeissus latifolia Species 0.000 description 1
- 241000233788 Arecaceae Species 0.000 description 1
- 241000416162 Astragalus gummifer Species 0.000 description 1
- 229920002799 BoPET Polymers 0.000 description 1
- 241000167854 Bourreria succulenta Species 0.000 description 1
- 235000002566 Capsicum Nutrition 0.000 description 1
- 102000012286 Chitinases Human genes 0.000 description 1
- 108010022172 Chitinases Proteins 0.000 description 1
- 241000195628 Chlorophyta Species 0.000 description 1
- 239000005496 Chlorsulfuron Substances 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 235000009849 Cucumis sativus Nutrition 0.000 description 1
- 240000008067 Cucumis sativus Species 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- 101150074155 DHFR gene Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 238000007900 DNA-DNA hybridization Methods 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- 240000005717 Dioscorea alata Species 0.000 description 1
- 235000002723 Dioscorea alata Nutrition 0.000 description 1
- 235000007056 Dioscorea composita Nutrition 0.000 description 1
- 235000009723 Dioscorea convolvulacea Nutrition 0.000 description 1
- 235000005362 Dioscorea floribunda Nutrition 0.000 description 1
- 235000004868 Dioscorea macrostachya Nutrition 0.000 description 1
- 235000005361 Dioscorea nummularia Nutrition 0.000 description 1
- 235000005360 Dioscorea spiculiflora Nutrition 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 240000006890 Erythroxylum coca Species 0.000 description 1
- 244000165918 Eucalyptus papuana Species 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Fivefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- KRHYYFGTRYWZRS-UHFFFAOYSA-N Fluorane Chemical compound F KRHYYFGTRYWZRS-UHFFFAOYSA-N 0.000 description 1
- 102000030902 Galactosyltransferase Human genes 0.000 description 1
- 108060003306 Galactosyltransferase Proteins 0.000 description 1
- 102100034062 Glutathione hydrolase 5 proenzyme Human genes 0.000 description 1
- 101710143566 Glutathione hydrolase 5 proenzyme Proteins 0.000 description 1
- 108700037728 Glycine max beta-conglycinin Proteins 0.000 description 1
- 108700023372 Glycosyltransferases Proteins 0.000 description 1
- 102000051366 Glycosyltransferases Human genes 0.000 description 1
- 239000001922 Gum ghatti Substances 0.000 description 1
- 229920000569 Gum karaya Polymers 0.000 description 1
- 244000020551 Helianthus annuus Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 101000610640 Homo sapiens U4/U6 small nuclear ribonucleoprotein Prp3 Proteins 0.000 description 1
- 240000005979 Hordeum vulgare Species 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- 235000006350 Ipomoea batatas var. batatas Nutrition 0.000 description 1
- 108010025815 Kanamycin Kinase Proteins 0.000 description 1
- 125000003412 L-alanyl group Chemical group [H]N([H])[C@@](C([H])([H])[H])(C(=O)[*])[H] 0.000 description 1
- HMFHBZSHGGEWLO-HWQSCIPKSA-N L-arabinofuranose Chemical compound OC[C@@H]1OC(O)[C@H](O)[C@H]1O HMFHBZSHGGEWLO-HWQSCIPKSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 241000234435 Lilium Species 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 241000218922 Magnoliophyta Species 0.000 description 1
- 235000011430 Malus pumila Nutrition 0.000 description 1
- 244000070406 Malus silvestris Species 0.000 description 1
- 235000015103 Malus silvestris Nutrition 0.000 description 1
- VPRLICVDSGMIKO-UHFFFAOYSA-N Mannopine Natural products NC(=O)CCC(C(O)=O)NCC(O)C(O)C(O)C(O)CO VPRLICVDSGMIKO-UHFFFAOYSA-N 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 239000005041 Mylar™ Substances 0.000 description 1
- 241000237536 Mytilus edulis Species 0.000 description 1
- 101710202365 Napin Proteins 0.000 description 1
- 101001068640 Nicotiana tabacum Basic form of pathogenesis-related protein 1 Proteins 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 241000233855 Orchidaceae Species 0.000 description 1
- 108090000417 Oxygenases Proteins 0.000 description 1
- 102000004020 Oxygenases Human genes 0.000 description 1
- 238000010222 PCR analysis Methods 0.000 description 1
- 239000006002 Pepper Substances 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 1
- 244000046052 Phaseolus vulgaris Species 0.000 description 1
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 1
- 235000011613 Pinus brutia Nutrition 0.000 description 1
- 240000007320 Pinus strobus Species 0.000 description 1
- 235000016761 Piper aduncum Nutrition 0.000 description 1
- 240000003889 Piper guineense Species 0.000 description 1
- 235000017804 Piper guineense Nutrition 0.000 description 1
- 235000008184 Piper nigrum Nutrition 0.000 description 1
- 235000010582 Pisum sativum Nutrition 0.000 description 1
- 240000004713 Pisum sativum Species 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 102000004079 Prolyl Hydroxylases Human genes 0.000 description 1
- 108010043005 Prolyl Hydroxylases Proteins 0.000 description 1
- 235000006040 Prunus persica var persica Nutrition 0.000 description 1
- 241001495449 Robinia pseudoacacia Species 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 241001199840 Senegalia laeta Species 0.000 description 1
- 239000012506 Sephacryl® Substances 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241001291279 Solanum galapagense Species 0.000 description 1
- 102100025749 Sphingosine 1-phosphate receptor 2 Human genes 0.000 description 1
- 101710155462 Sphingosine 1-phosphate receptor 2 Proteins 0.000 description 1
- 102100025747 Sphingosine 1-phosphate receptor 3 Human genes 0.000 description 1
- 101710155457 Sphingosine 1-phosphate receptor 3 Proteins 0.000 description 1
- 102100029803 Sphingosine 1-phosphate receptor 4 Human genes 0.000 description 1
- 101710155458 Sphingosine 1-phosphate receptor 4 Proteins 0.000 description 1
- 102100029802 Sphingosine 1-phosphate receptor 5 Human genes 0.000 description 1
- 101710155451 Sphingosine 1-phosphate receptor 5 Proteins 0.000 description 1
- 229920001872 Spider silk Polymers 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 101000870420 Streptococcus gordonii UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase GtfA subunit Proteins 0.000 description 1
- 229920001615 Tragacanth Polymers 0.000 description 1
- 108700029229 Transcriptional Regulatory Elements Proteins 0.000 description 1
- 239000007983 Tris buffer Substances 0.000 description 1
- 240000000260 Typha latifolia Species 0.000 description 1
- AOLHUMAVONBBEZ-STQMWFEESA-N Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AOLHUMAVONBBEZ-STQMWFEESA-N 0.000 description 1
- 102100040374 U4/U6 small nuclear ribonucleoprotein Prp3 Human genes 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 241001199633 Vachellia drepanolobium Species 0.000 description 1
- 241000978782 Vachellia seyal Species 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 108010046377 Whey Proteins Proteins 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- AQIBTEMVIOJQFD-NQXXGFSBSA-N [(2R,3R)-1,2-dihydroxy-4-oxo-5-phosphonooxypentan-3-yl] dihydrogen phosphate Chemical compound OC[C@@H](O)[C@@H](OP(O)(O)=O)C(=O)COP(O)(O)=O AQIBTEMVIOJQFD-NQXXGFSBSA-N 0.000 description 1
- NREIOERVEJDBJP-KFDLCVIWSA-N [3)-beta-D-ribosyl-(1->1)-D-ribitol-5-P-(O->]3 Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1OC[C@H](O)[C@H](O)[C@H](O)COP(O)(=O)O[C@@H]1[C@@H](CO)O[C@@H](OC[C@H](O)[C@H](O)[C@H](O)COP(O)(=O)O[C@@H]2[C@H](O[C@@H](OC[C@H](O)[C@H](O)[C@H](O)COP(O)(O)=O)[C@@H]2O)CO)[C@@H]1O NREIOERVEJDBJP-KFDLCVIWSA-N 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 229940126575 aminoglycoside Drugs 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 125000000328 arabinofuranosyl group Chemical group C1([C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 150000008209 arabinosides Chemical class 0.000 description 1
- 125000000089 arabinosyl group Chemical group C1([C@@H](O)[C@H](O)[C@H](O)CO1)* 0.000 description 1
- 108010039311 arabinosyltransferase Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 239000000227 bioadhesive Substances 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 210000003850 cellular structure Anatomy 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 235000019693 cherries Nutrition 0.000 description 1
- VJYIFXVZLXQVHO-UHFFFAOYSA-N chlorsulfuron Chemical compound COC1=NC(C)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)Cl)=N1 VJYIFXVZLXQVHO-UHFFFAOYSA-N 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 235000008957 cocaer Nutrition 0.000 description 1
- ZPUCINDJVBIVPJ-LJISPDSOSA-N cocaine Chemical compound O([C@H]1C[C@@H]2CC[C@@H](N2C)[C@H]1C(=O)OC)C(=O)C1=CC=CC=C1 ZPUCINDJVBIVPJ-LJISPDSOSA-N 0.000 description 1
- 239000000084 colloidal system Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 238000000151 deposition Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 229960000633 dextran sulfate Drugs 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 235000004879 dioscorea Nutrition 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 230000001804 emulsifying effect Effects 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 210000002744 extracellular matrix Anatomy 0.000 description 1
- 239000000834 fixative Substances 0.000 description 1
- 235000019634 flavors Nutrition 0.000 description 1
- 108010006205 fluorescein isothiocyanate bovine serum albumin Proteins 0.000 description 1
- 235000013373 food additive Nutrition 0.000 description 1
- 239000002778 food additive Substances 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 235000021022 fresh fruits Nutrition 0.000 description 1
- 230000004345 fruit ripening Effects 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000005227 gel permeation chromatography Methods 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 235000021474 generally recognized As safe (food) Nutrition 0.000 description 1
- 235000021473 generally recognized as safe (food ingredients) Nutrition 0.000 description 1
- 108010083391 glycinin Proteins 0.000 description 1
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 description 1
- 235000019314 gum ghatti Nutrition 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 239000001307 helium Substances 0.000 description 1
- 229910052734 helium Inorganic materials 0.000 description 1
- SWQJXJOGLNCZEY-UHFFFAOYSA-N helium atom Chemical compound [He] SWQJXJOGLNCZEY-UHFFFAOYSA-N 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 235000008216 herbs Nutrition 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 239000000416 hydrocolloid Substances 0.000 description 1
- 229910000040 hydrogen fluoride Inorganic materials 0.000 description 1
- 108010002685 hygromycin-B kinase Proteins 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 210000000554 iris Anatomy 0.000 description 1
- 235000010494 karaya gum Nutrition 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 238000002803 maceration Methods 0.000 description 1
- VPRLICVDSGMIKO-SZWOQXJISA-N mannopine Chemical compound NC(=O)CC[C@@H](C(O)=O)NC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO VPRLICVDSGMIKO-SZWOQXJISA-N 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000013011 mating Effects 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 239000002923 metal particle Substances 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 230000001483 mobilizing effect Effects 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 150000002482 oligosaccharides Chemical class 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 238000012510 peptide mapping method Methods 0.000 description 1
- 230000008121 plant development Effects 0.000 description 1
- 230000008635 plant growth Effects 0.000 description 1
- 239000003726 plant lectin Substances 0.000 description 1
- 229910052697 platinum Inorganic materials 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 150000004804 polysaccharides Polymers 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000004080 punching Methods 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000005070 ripening Effects 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 239000000565 sealant Substances 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 238000005382 thermal cycling Methods 0.000 description 1
- 239000002562 thickening agent Substances 0.000 description 1
- 238000012876 topography Methods 0.000 description 1
- 125000003508 trans-4-hydroxy-L-proline group Chemical group 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- WFKWXMTUELFFGS-UHFFFAOYSA-N tungsten Chemical compound [W] WFKWXMTUELFFGS-UHFFFAOYSA-N 0.000 description 1
- 229910052721 tungsten Inorganic materials 0.000 description 1
- 239000010937 tungsten Substances 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 235000021119 whey protein Nutrition 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Definitions
- the present invention relates generally to the field of plant gums and other hydroxyproline-rich glycoproteins, and in particular, to the expression of synthetic genes designed from repetitive peptide sequences.
- Gummosis is a common wound response that results in the exudation of a gum sealant at the site of cracks in bark.
- A. M. Stephen et al. “Exudate Gums”, Methods Plant Biochem . (1990).
- the exudate is a composite of polysaccharides and glycoproteins structurally related to cell wall components such as galactans [G.O. Aspinall, “ Plant Gums”, The Carbohydrates 2B:522536 (1970)] and hydroxyproline-rich glycoproteins [Anderson and McDougall, “The chemical characterization of the gum exudates from eight Australian Acacia species of the series Phyllodineae.” Food Hydrocolloids, 2: 329 (1988)].
- Gum arabic is probably the best characterized of these exudates (although it has been largely refractory to chemical analysis). It is a natural plant exudate secreted by various species of Acacia trees. Acacia Senegal accounts for approximately 80% of the production of gum arabic with Acacia seyal, Acacia laeta, Acacia camplylacantha , and Acacia drepanolobium supplying the remaining 20%. The gum is gathered by hand in Africa. It is a tedious process involving piercing and stripping the bark of the trees, then returning later to gather the dried tear drop shaped, spherical balls that form.
- gum arabic The exact chemical nature of gum arabic has not been elucidated. It is believed to consist of two major components, a microheterogeneous glucurono-arabinorhamogalactan polysaccharide and a higher molecular weight hydroxyproline-rich glycoprotein. Osman et al., “Characteriztion of Gum Arabic Fractions Obtained By Anion-Exchange Chromatography” Phytochemistry 38:409 (1984) and Qi et al., “Gum Arabic Glycoprotein Is A Twisted Hairy Rope” Plant Physiol. 96:848 (1991). While the amino composition of the protein portion has been examined, little is known with regard to the precise amino acid sequence.
- gum arabic While the precise chemical nature of gum arabic is elusive, the gum is nonetheless particularly useful due to its high solubility and low viscosity compared to other gums.
- whey proteins can be used to increase the functionality of gum arabic.
- A. Prakash et al. “The effects of added proteins on the functionality of gum arabic in soft drink emulsion systems,” Food Hydrocolloids 4:177 (1990).
- this approach has limitations. Only low concentrations of such additives can be used without producing off-flavors in the final food product.
- the present invention involves a new approach in the field of plant gums and presents a new solution to the production of hydroxyproline(Hyp)-rich glycoproteins (HRGPs), repetitive proline-rich proteins (RPRPs) and arabino-galactan proteins (AGPs).
- HRGPs hydroxyproline(Hyp)-rich glycoproteins
- RPRPs repetitive proline-rich proteins
- AGPs arabino-galactan proteins
- the present invention contemplates the expression of synthetic genes designed from repetitive peptide sequences of such glycoproteins, including the peptide sequences of gum arabic glycoprotein (GAGP).
- the present invention contemplates a substantially purified polypeptide comprising at least a portion of the amino acid sequence Ser-Hyp-Hyp-Hyp-[Hyp/Thr]-Leu-Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Hyp-Gly-Pro-His (SEQ ID NO:1) or variants thereof.
- variants it is meant that the sequence need not comprise the exact sequence; up to five (5) amino acid substitutions are contemplated.
- a Leu or Hyp may be substituted for the Gly; Leu may also be substituted for Ser and one or more Hyp.
- variants it is also meant that the sequence need not be the entire nineteen (19) amino acids. Illustrative variants are shown in Table 2.
- the present invention be limited by the precise length of the purified polypeptide.
- the peptide comprises more than twelve (12) amino acids from the nineteen (19) amino acids of the sequence.
- a portion of the nineteen (19) amino acids is utilized as a repetitive sequence.
- all nineteen (19) amino acids are utilized as a repetitive sequence.
- the sequence (i.e. SEQ ID NO:1) or variants thereof may be used as a repeating sequence between one (1) and up to fifty (50) times, more preferably between ten (10) and up to thirty (30) times, and most preferably approximately twenty (20) times.
- the sequence (i.e. SEQ ID NO:1) or variants thereof may be used as contiguous repeats or may be used as non-contiguous repeats (with other amino acids, or amino acid analogues, placed between the repeating sequences).
- the present invention specifically contemplates fusion proteins comprising a non-gum arabic protein or glycoprotein sequence and a portion of the gum arabic glycoprotein sequence (SEQ ID NO:1). It is not intended that the present invention be limited by the nature of the non-gum arabic glycoprotein sequence.
- the non-gum arabic glycoprotein sequence is a green fluorescent protein.
- the present invention contemplates synthetic genes encoding such peptides.
- synthetic genes it is meant that the nucleic acid sequence is derived using the peptide sequence of interest (in contrast to using the nucleic acid sequence from cDNA).
- the present invention contemplates an isolated polynucleotide sequence encoding a polypeptide comprising at least a portion of the polypeptide of SEQ ID NO:1 or variants thereof.
- the present invention specifically contemplates a polynucleotide sequence comprising a nucleotide sequence encoding a polypeptide comprising one or more repeats of SEQ ID NO:1 or variants thereof.
- the present invention contemplates synthetic genes encoding portions of HRGPs, wherein the encoded peptides contain one or more of the highly conserved Ser-Hyp 4 motif(s).
- the present invention also contemplates synthetic genes encoding portions of RPRPs, wherein the encoded peptides contain one or more of the pentapeptide motif: Pro-Hyp-Val-Tyr-Lys and variants of this sequence such as X-Hyp-Val-Tyr-Lys and Pro-Hyp-Val-X-Lys and Pro-Pro-X Tyr-Lys and Pro-Pro-X-Tyr-X, where “X” can be Thr, Glu, Hyp, Pro, His and Ile.
- the present invention also contemplates synthetic genes encoding portions of AGPs, wherein the encoded peptides contain one or more Xaa-Hyp-Xaa-Hyp repeats. Such peptides can be expressed in a variety of forms, including but not limited to fusion proteins.
- the present invention contemplates a polynucleotide sequence comprising the sequence: 5′-CCA CCA CCT TCA CCT CCA CCC CCA TCT CCA-3′ (SEQ ID NO:2).
- motifs for AGPs contemplates a polynucleotide sequence comprising the sequence: 5′-TCA CCA TCA CCA TCT CCT TCG COCA TCA CCC-3′ (SEQ ID NO:3).
- the present invention also contemplates sequences that are complementary (including sequences that are only partially complementary) sequences to the sequences of SEQ ID NOS: 2 and 3.
- Such complementary sequences include sequences that will hybridize to the sequences of SEQ ID NOS: 2 and 3 under low stringency conditions as well as high stringency conditions (see Definitions below).
- the present invention also contemplates the mixing of motifs (i.e. modules) which are not found in wild-type sequences. For example, one might add GAGP modules to extensin and RPRP crosslinking modules to AGP-like molecules.
- the present invention contemplates using the polynucleotides of the present invention for expression of the polypeptides in vitro and in vivo. Therefore, the present invention contemplates polynucleotide sequences encoding two or more repeats of the sequence of SEQ ID NO: 1 or variants thereof, wherein said polynucleotide sequence is contained on a recombinant expression vector. It is also contemplated that such vectors will be introduced into a variety of host cells, both eukaryotic and prokaryotic (e.g. bacteria such as E. coli ).
- the vector further comprises a promoter. It is not intended that the present invention be limited to a particular promoter. Any promoter sequence which is capable of directing expression of an operably linked nucleic acid sequence encoding a portion of a plant gum polypeptide (or other hydroxyproline-rich polypeptide of interest as described above) is contemplated to be within the scope of the invention. Promoters include, but are not limited to, promoter sequences of bacterial, viral and plant origins. Promoters of bacterial origin include, but are not limited to, the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from native Ti plasmids.
- Viral promoters include, but are not limited to, the 35S and 19S RNA promoters of cauliflower mosaic virus (CaMV), and T-DNA promoters from Agrobacterium .
- Plant promoters include, but are not limited to, the ribulose-1,3-bisphosphate carboxylase small subunit promoter, maize ubiquitin promoters, the phaseolin promoter, the E8 promoter, and the Tob7 promoter.
- the invention is not limited to the number of promoters used to control expression of a nucleic acid sequence of interest. Any number of promoters may be used so long as expression of the nucleic acid sequence of interest is controlled in a desired manner. Furthermore, the selection of a promoter may be governed by the desirability that expression be over the whole plant, or localized to selected tissues of the plant, e.g., root, leaves, fruit, etc. For example, promoters active in flowers are known (Benfy et al. (1990) Plant Cell 2:849-856).
- the promoter activity of any nucleic acid sequence in host cells may be determined (i.e., measured or assessed) using methods well known in the art and exemplified herein.
- a candidate promoter sequence may be tested by ligating it in-frame to a reporter gene sequence to generate a reporter construct, introducing the reporter construct into host cells (e.g. tomato or potato cells) using methods described herein, and detecting the expression of the reporter gene (e.g., detecting the presence of encoded mRNA or encoded protein, or the activity of a protein encoded by the reporter gene).
- the reporter gene may confer antibiotic or herbicide resistance.
- reporter genes include, but are not limited to, dhfr which confers resistance to methotrexate [Wigler M et al., (1980) Proc Natl Acad Sci 77:3567-70]; npt, which confers resistance to the aminoglycosides neomycin and G-418 [Colbere-Garapin F et al., (1981) J. Mol. Biol. 150:1-14] and als or pat, which confer resistance to chlorsulfuron and phosphinotricin acetyl transferase, respectively.
- the expression construct preferably contains a transcription termination sequence downstream of the nucleic acid sequence of interest to provide for efficient termination.
- the termination sequence is the nopaline synthase (NOS) sequence.
- the termination region comprises different fragments of sugarcane ribulose-1,5-biphosphate carboxylase/oxygenase (rubisco) small subunit (scrbcs) gene.
- the termination sequences of the expression constructs are not critical to the invention. The termination sequence may be obtained from the same gene as the promoter sequence or may be obtained form different genes.
- polyadenylation sequences are also commonly added to the expression construct.
- the polyadenylation sequences include, but are not limited to, the Agrobacterium octopine synthase signal, or the nopaline synthase signal.
- the invention is not limited to constructs which express a single nucleic acid sequence of interest. Constructs which contain a plurality of (i.e., two or more) nucleic acid sequences under the transcriptional control of the same promoter sequence are expressly contemplated to be within the scope of the invention. Also included within the scope of this invention are constructs which contain the same or different nucleic acid sequences under the transcriptional control of different promoters. Such constructs may be desirable to, for example, target expression of the same or different nucleic acid sequences of interest to selected plant tissues.
- the present invention contemplates using the polynucleotides of the present invention for expression of a portion of plant gum polypeptides in vitro and in vivo. Where expression takes place in vivo, the present invention contemplates transgenic plants.
- the transgenic plants of the invention are not limited to plants in which each and every cell expresses the nucleic acid sequence of interest. Included within the scope of this invention is any plant (e.g. tobacco, tomato, maize, algae, etc.) which contains at least one cell which expresses the nucleic acid sequence of interest. It is preferred, though not necessary, that the transgenic plant express the nucleic acid sequence of interest in more than one cell, and more preferably in one or more tissue. It is particularly preferred that expression be followed by proper glycosylation of the plant gum polypeptide fragment or variant thereof, such that the host cell produces functional (e.g. in terms of use in the food or cosmetic industry) plant gum polypeptide.
- transformation of plant cells has taken place with the nucleic acid sequence of interest may be determined using any number of methods known in the art. Such methods include, but are not limited to, restriction mapping of genomic DNA, PCR analysis, DNA-DNA hybridization, DNA-RNA hybridization, and DNA sequence analysis.
- Expressed polypeptides can be immobilized (covalently or non-covalently) on solid supports or resins for use in isolating HRGP-binding molecules from a variety of sources (e.g. algae, plants, animals, microorganisms). Such polypeptides can also be used to make antibodies.
- sources e.g. algae, plants, animals, microorganisms.
- FIG. 1 shows the nucleic acid sequence of one embodiment of a synthetic gene of the present invention.
- FIG. 2 shows one embodiment of a synthetic gene in one embodiment of an expression vector.
- gene refers to a DNA sequence that comprises control and coding sequences necessary for the production of a polypeptide or its precursor.
- the polypeptide can be encoded by a full length coding sequence or by any portion of the coding sequence.
- nucleic acid sequence of interest refers to any nucleic acid sequence the manipulation of which may be deemed desirable for any reason by one of ordinary skill in the art (e.g., confer improved qualities).
- wild-type when made in reference to a gene refers to a gene which has the characteristics of a gene isolated from a naturally occurring source
- wild-type when made in reference to a gene product refers to a gene product which has the characteristics of a gene product isolated from a naturally occurring source.
- a wild-type gene is that which is most frequently observed in a population and is thus arbitrarily designated the “normal” or “wild-type” form of the gene.
- modified or “mutant” when made in reference to a gene or to a gene product refers, respectively, to a gene or to a gene product which displays modifications in sequence and or functional properties (i.e., altered characteristics) when compared to the wildtype gene or gene product. It is noted that naturally-occurring mutants can be isolated; these are identified by the fact that they have altered characteristics when compared to the wild-type gene or gene product.
- recombinant when made in reference to a DNA molecule refers to a DNA molecule which is comprised of segments of DNA joined together by means of molecular biological techniques.
- recombinant when made in reference to a protein or a polypeptide refers to a protein molecule which is expressed using a recombinant DNA molecule.
- vector and “vehicle” are used interchangeably in reference to nucleic acid molecules that transfer DNA segment(s) from one cell to another.
- expression vector or “expression cassette” as used herein refers to a recombinant DNA molecule containing a desired coding sequence and appropriate nucleic acid sequences necessary for the expression of the operably linked coding sequence in a particular host organism
- Nucleic acid sequences necessary for expression in prokaryotes usually include a promoter, an operator (optional), and a ribosome binding site, often along with other sequences.
- Eukaryotic cells are known to utilize promoters, enhancers, and termination and polyadenylation signals.
- targeting vector or “targeting construct” refer to oligonucleotide sequences comprising a gene of interest flanked on either side by a recognition sequence which is capable of homologous recombination of the DNA sequence located between the flanking recognition sequences.
- operable combination refers to the linkage of nucleic acid sequences in such a manner that a nucleic acid molecule capable of directing the transcription of a given gene and/or the synthesis of a desired protein molecule is produced.
- operable order refers to the linkage of amino acid sequences in such a manner so that a functional protein is produced.
- transformation refers to the introduction of foreign DNA into cells. Transformation of a plant cell may be accomplished by a variety of means known in the art including particle mediated gene transfer (see, e.g., U.S. Pat. No. 5,584,807 hereby incorporated by reference); infection with an Agrobacterium strain containing the foreign DNA for random integration (U.S. Pat. No. 4,940,838 hereby incorporated by reference) or targeted integration (U.S. Pat. No. 5,501,967 hereby incorporated by reference) of the foreign DNA into the plant cell genome; electroinjection (Nan et al. (1995) In “Biotechnology in Agriculture and Forestry,” Ed. Y. P. S.
- infectious and “infection” with a bacterium refer to co-incubation of a target biological sample, (e.g., cell, tissue, etc.) with the bacterium under conditions such that nucleic acid sequences contained within the bacterium are introduced into one or more cells of the target biological sample.
- a target biological sample e.g., cell, tissue, etc.
- Agrobacterium refers to a soil-borne, Gram-negative, rod-shaped phytopathogenic bacterium which causes crown gall.
- Agrobacterium includes, but is not limited to, the strains Agrobacterium tumefaciens , (which typically causes crown gall in infected plants), and Agrobacterium rhizogens (which causes hairy root disease in infected host plants). Infection of a plant cell with Agrobacterium generally results in the production of opines (e.g., nopaline, agropine, octopine etc.) by the infected cell.
- opines e.g., nopaline, agropine, octopine etc.
- Agrobacterium strains which cause production of nopaline are referred to as “nopaline-type” Agrobacteria
- Agrobacterium strains which cause production of octopine e.g.,′ strain LBA4404, Ach5, B6
- octopinc-type e.g., octopinc-type Agrobacteria
- Agrobacterium strains which cause production of agropine e.g., strain EHA105, EHA101, A281 are referred to as “agropine-type” Agrobacteria.
- biolistic bombardment refers to the process of accelerating particles towards a target biological sample (e.g., cell, tissue, etc.) to effect wounding of the cell membrane of a cell in the target biological sample and/or entry of the particles into the target biological sample.
- a target biological sample e.g., cell, tissue, etc.
- Methods for biolistic bombardment are known in the art (e.g., U.S. Pat. No. 5,584,807, the contents of which are herein incorporated by reference), and are commercially available (e.g., the helium gas-driven microprojectile accelerator (PDS-1000/He) (BioRad).
- microwounding when made in reference to plant tissue refers to the introduction of microscopic wounds in that tissue. Microwounding may be achieved by, for example, particle or biolistic bombardment.
- transgenic when used in reference to a plant cell refers to a plant cell which comprises a transgene, or whose genome has been altered by the introduction of a transgene.
- transgenic when used in reference to a plant refers to a plant which comprises one or more cells which contain a transgene, or whose genome has been altered by the introduction of a transgene.
- These transgenic cells and transgenic plants may be produced by several methods including the introduction of a “transgene” comprising nucleic acid (usually DNA) into a target cell or integration into a chromosome of a target cell by way of human intervention, such as by the methods described herein.
- transgene refers to any nucleic acid sequence which is introduced into the genome of a plant cell by experimental manipulations.
- a transgene may be an “endogenous DNA sequence,” or a “heterologous DNA sequence” (i.e., “foreign DNA”).
- endogenous DNA sequence refers to a nucleotide sequence which is naturally found in the cell into which it is introduced so long as it does not contain some modification (e.g., a point mutation, the presence of a selectable marker gene, etc.) relative to the naturally-occurring sequence.
- heterologous DNA sequence refers to a nucleotide sequence which is ligated to, or is manipulated to become ligated to, a nucleic acid sequence to which it is not ligated in nature, or to which it is ligated at a different location in nature.
- Heterologous DNA is not endogenous to the cell into which it is introduced, but has been obtained from another cell.
- Heterologous DNA also includes an endogenous DNA sequence which contains some modification.
- heterologous DNA encodes RNA and proteins that are not normally produced by the cell into which it is expressed. Examples of heterologous DNA include reporter genes, transcriptional and translational regulatory sequences, selectable marker proteins (e.g., proteins which confer drug resistance), etc.
- the term “probe” when made in reference to an oligonucleotide refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, recombinantly or by PCR amplification, which is capable of hybridizing to another oligonucleotide of interest.
- a probe may be single-stranded or double-stranded. Probes are useful in the detection, identification and isolation of particular gene sequences. Oligonucleotide probes may be labelled with a “reporter molecule,” so that the probe is detectable using a detection system. Detection systems include, but are not limited to, enzyme, fluorescent, radioactive, and luminescent systems.
- selectable marker refers to a gene which encodes an enzyme having an activity that confers resistance to an antibiotic or drug upon the cell in which the selectable marker is expressed.
- Selectable markers may be “positive” or “negative.” Examples of positive selectable markers include the neomycin phosphotrasferase (NPTII) gene which confers resistance to G418 and to kanamycin, and the bacterial hygromycin phosphotransferase gene (hyg), which confers resistance to the antibiotic hygromycin.
- Negative selectable markers encode an enzymatic activity whose expression is cytotoxic to the cell when grown in an appropriate selective medium. For example, the HSV-tk gene is commonly used as a negative selectable marker.
- HSV-tk gene expression of the HSV-tk gene in cells grown in the presence of gancyclovir or acyclovir is cytotoxic; thus, growth of cells in selective medium containing gancyclovir or acyclovir selects against cells capable of expressing a functional HSV TK enzyme.
- promoter element refers to a DNA sequence that is located at the 5′ end (i.e. precedes) the protein coding region of a DNA polymer. The location of most promoters known in nature precedes the transcribed region. The promoter functions as a switch, activating the expression of a gene. If the gene is activated, it is said to be transcribed, or participating in transcription. Transcription involves the synthesis of mRNA from the gene. The promoter, therefore, serves as a transcriptional regulatory element and also provides a site for initiation of transcription of the gene into mRNA.
- PCR polymerase chain reaction
- This process for amplifying the target sequence consists of introducing a large excess of two oligonucleotide primers to the DNA mixture containing the desired target sequence, followed by a precise sequence of thermal cycling in the presence of a DNA polymerase.
- the two primers are complementary to their respective strands of the double stranded target sequence.
- the mixture is denatured and the primers then annealed to their complementary sequences within the target molecule.
- the primers are extended with a polymerase so as to form a new pair of complementary strands.
- the steps of denaturation, primer annealing and polymerase extension can be repeated many times (i.e., denaturation, annealing and extension constitute one “cycle”; there can be numerous “cycles”) to obtain a high concentration of an amplified segment of the desired target sequence.
- the length of the amplified segment of the desired target sequence is determined by the relative positions of the primers with respect to each other, and therefore, this length is a controllable parameter.
- the method is referred to as the “polymerase chain reaction” (hereinafter “PCR”). Because the desired amplified segments of the target sequence become the predominant sequences (in terms of concentration) in the mixture, they are said to be “PCR amplified.”
- PCR it is possible to amplify a single copy of a specific target sequence in genomic DNA to a level detectable by several different methodologies (e.g., hybridization with a labeled probe; incorporation of biotinylated primers followed by avidin-enzyme conjugate detection; and/or incorporation of 32 P-labeled deoxyribonucleotide triphosphates, such as dCTP or dATP, into the amplified segment).
- any oligonucleotide sequence can be amplified with the appropriate set of primer molecules.
- the amplified segments created by the PCR process itself are, themselves, efficient templates for subsequent PCR amplifications.
- Amplified target sequences may be used to obtain segments of DNA (e.g., genes) for the construction of targeting vectors, transgenes, etc.
- the present invention contemplates using amplification techniques such as PCR to obtain the cDNA (or portions thereof) of plant genes encoding plant gums and other hydroxyproline-rich polypeptides.
- primers are designed using the synthetic gene sequences (e.g. containing sequences encoding particular motifs) described herein and PCR is carried out (using genomic DNA or other source of nucleic acid from any plant capable of producing a gum exudate) under conditions of low stringency.
- PCR is carried out under high stringency.
- the amplified products can be run out on a gel and isolated from the gel.
- hybridization refers to any process by which a strand of nucleic acid joins with a complementary strand through base pairing [Coombs J (1994) Dictionary of Biotechnology , Stockton Press, New York N.Y.].
- the terms “complementary” or “complementarity” when used in reference to polynucleotides refer to polynucleotides which are related by the base-pairing rules. For example, for the sequence 5′-AGT-3′ is complementary to the sequence 5′-ACT-3′. Complementarity may be “partial,” in which only some of the nucleic acids' bases are matched according to the base pairing rules. Or, there may be “complete” or “total” complementarity between the nucleic acids. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands. This is of particular importance in amplification reactions, as well as detection methods which depend upon binding between nucleic acids.
- the term “homology” when used in relation to nucleic acids refers to a degree of complementarity. There may be partial homology or complete homology (i.e., identity).
- a partially complementary sequence is one that at least partially inhibits a completely complementary sequence from hybridizing to a target nucleic acid is referred to using the functional term “substantially homologous.”
- the inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or Northern blot, solution hybridization and the like) under conditions of low stringency.
- a substantially homologous sequence or probe will compete for and inhibit the binding (i.e., the hybridization) of a sequence which is completely homologous to a target under conditions of low stringency.
- low stringency conditions are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction.
- the absence of non-specific binding may be tested by the use of a second target which lacks even a partial degree of complementarity (e.g., less than about 30% identity); in the absence of non-specific binding the probe will not hybridize to the second non-complementary target.
- Low stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5 ⁇ SSPE (43.8 g/l NaCl, 6.9 g/l NaH 2 PO 4 .H 2 O and 1.85 g/I EDTA, pH adjusted to 7.4 with NaOH), 0.1% SDS, 5 ⁇ Denhardt's reagent [50 ⁇ Denhardt's contains per 500 ml: 5 g Ficoll (Type 400, Pharmacia), 5 g BSA (Fraction V; Sigma)] and 100 ⁇ g/ml denatured salmon sperm DNA followed by washing in a solution comprising 5 ⁇ SSPE, 0.1% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
- 5 ⁇ SSPE 43.8 g/l NaCl, 6.9 g/l NaH 2 PO 4 .H 2 O and 1.85 g/I EDTA, pH adjusted to 7.4 with NaOH
- High stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5 ⁇ SSPE (43.8 g/l NaCl, 6.9 g/l NaH 2 PO 4 .H 2 O and 1.85 g/I EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5 ⁇ Denhardt's reagent and 100 ⁇ g/ml denatured salmon sperm DNA followed by washing in a solution comprising 0.1 ⁇ SSPE, 1.0% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
- the art knows well that numerous equivalent conditions may be employed to comprise either low or high stringency conditions; factors such as the length and nature (DNA, RNA, base composition) of the probe and nature of the target (DNA, RNA, base composition, present in solution or immobilized, etc.) and the concentration of the salts and other components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene glycol) are considered and the hybridization solution may be varied to generate conditions of either low or high stringency hybridization different from, but equivalent to, the above listed conditions.
- factors such as the length and nature (DNA, RNA, base composition) of the probe and nature of the target (DNA, RNA, base composition, present in solution or immobilized, etc.) and the concentration of the salts and other components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene glycol) are considered and the hybridization solution may be varied to generate conditions of either low or high stringency hybridization different from, but equivalent to, the above listed conditions
- “Stringency” when used in reference to nucleic acid hybridization typically occurs in a range from about T m -5° C. (5° C. below the T m of the probe) to about 20° C. to 25° C. below T m .
- a stringent hybridization can be used to identify or detect identical polynucleotide sequences or to identify or detect similar or related polynucleotide sequences.
- stringent conditions a nucleic acid sequence of interest will hybridize to its exact complement and closely related sequences.
- fusion protein refers to a chimeric protein containing the protein of interest (i.e., GAGP and fragments thereof) joined to an exogenous protein fragment (the fusion partner which consists of a non-GAGP sequence).
- the fusion partner may provide a detectable moiety, may provide an affinity tag to allow purification of the recombinant fusion protein from the host cell, or both.
- the fusion protein may be removed from the protein of interest (i.e. GAGP protein or fragments thereof) by a variety of enzymatic or chemical means known to the art.
- non-gum arabic glycoprotein or “non-gum arabic glycoprotein sequence” refers to that portion of a fusion protein which comprises a protein or protein sequence which is not derived from a gum arabic glycoprotein.
- protein of interest refers to the protein whose expression is desired within the fusion protein.
- the protein of interest e.g., GAGP
- another protein or protein domain e.g., GFP
- purified refers to the removal of contaminants from a sample.
- recombinant HRGP polypeptides including HRGP-GFP fusion proteins are purified by the removal of host cell components such as nucleic acids, lipopolysaccharide (e.g., endotoxin).
- recombinant DNA molecule refers to a DNA molecule which is comprised of segments of DNA joined together by means of molecular biological techniques.
- recombinant protein or “recombinant polypeptide” as used herein refers to a protein molecule which is expressed from a recombinant DNA molecule.
- portion when in reference to a protein (as in “a portion of a given protein”) refers to fragments of that protein.
- the fragments may range in size from four amino acid residues to the entire amino acid sequence minus one amino acid.
- the present invention relates generally to the field of plant gums and other hydroxyproline-rich glycoproteins, and in particular, to the expression of synthetic genes designed from repetitive peptide sequences.
- the hydroxyproline-rich glycoprotein (HRGP) superfamily is ubiquitous in the primary cell wall or extracellular matrix throughout the plant kingdom. Family members are diverse in structure and implicated in all aspects of plant growth and development. This includes plant responses to stress imposed by pathogenesis and mechanical wounding.
- Plant HRGPs have no known animal homologues. Furthermore, hydroxyproline residues are O-glycosylated in plant glycoproteins but never in animals. At the molecular level the function of these unique plant glycoproteins remains largely unexplored.
- HRGPS are, to a lesser or greater extent, extended, repetitive, modular proteins.
- the modules are small (generally 4-6 residue motifs), usually glycosylated, with most HRGPs being made up of more than one type of repetitive module.
- it is useful to view the glycosylated polypeptide modules not merely as peptides or oligosaccharides but as small functional units.
- the description of the invention involves A) the design of the polypeptide of interest, B) the production of synthetic genes encoding the polypeptide of interest, C) the construction of the expression vectors, D) selection of the host cells, and E) introduction of the expression construct into a particular cell (whether in vitro or in vivo).
- the present invention contemplates polypeptides that are fragments of hydroxyproline-rich glycoproteins (HRGPs), repetitive proline-rich proteins (RPRPs) and arabino-galactan proteins (AGPs).
- HRGPs hydroxyproline-rich glycoproteins
- RPRPs repetitive proline-rich proteins
- AGPs arabino-galactan proteins
- the present invention contemplates portions of HRGPs comprising one or more of the highly conserved Ser-Hyp 4 motif(s).
- the present invention also contemplates portions of RPRPs comprising one or more of the pentapeptide motif: Pro-Hyp-Val-Tyr-Lys.
- AGPs comprising one or more Xaa-Hyp-Xaa-Hyp repeats.
- Extensins occupy an intermediate position in the glycosylation continuum, containing about 50% carbohydrate which occurs mainly as Hyp-arabinosides (1-4 Ara residues), but not as Hyp-arabinogalactan polysaccharide.
- Extensins contain the repetitive, highly arabinosylated, diagnostic Ser-Hyp 4 glycopeptide module. The precise function of this module is unknown, but earlier work indicates that these blocks of arabinosylated Hyp help stabilize the extended polyproline-II helix of the extensins. Monogalactose also occurs on the Ser residues.
- the classical Ser-Hyp 4 glycopeptide module is of special interest.
- a tetra-L-arabinofuranosyl oligosaccharide is attached to each Hyp residue in the block.
- Three uniquely b-linked arabinofuranosyl residues and an a-linked nonreducing terminus comprise the tetraarabinooligosaccharide. While an understanding of the natural mechanism of glycosylation is not required for the successful operation of the present invention, it is believed that the arabinosylated Hyp residues together with the single galactosyl-serine residue undoubtedly form a unique molecular surface topography which interacts with and is recognized by other wall components, possibly including itself. Shorter blocks of Hyp, namely Hyp 3 and Hyp 2 , lack the fourth (a-linked) arabinose residue, again suggesting that the fourth Ara unique to the Hyp 4 block, has a special role and is presented for recognition or cleavage.
- the arabinogalactan-proteins (AGPs) and the related gum arabic glycoprotein (GAGP) are uniquely glycosylated with arabinogalactan polysaccharides.
- GAGP and all AGPs so far characterized by Hyp-glycoside profiles contain Hyp-linked arabinosides assigned to contiguous Hyp residues by the Hyp contiguity hypothesis.
- these glycoproteins also uniquely contain (Xaa-Hyp-Xaa-Hyp) repeats. These repeats are putative polysaccharide attachment sites.
- the present invention contemplates in particular fragments of gum arabic glycoprotein (GAGP).
- GAGP gum arabic glycoprotein
- the largest peptide obtained and sequenced from gum arabic was a peptide of twelve (12) amino acids having the sequence Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Hyp-Gly-Pro. C. L. Delonnay, “Determination of the Protein Constituent Of Gum Arabic” Master of Science Thesis (1993).
- the present invention contemplates using this Delonnay sequence as well as (heretofore undescribed) larger peptide fragments of GAGP (and variants thereof) for the design of synthetic genes. In this manner, “designer plant gums” can be produced (“designer extensins” are also contemplated).
- the present invention contemplates a substantially purified polypeptide comprising at least a portion of the amino acid consensus sequence Ser-Hyp-Hyp-Hyp-[Hyp/Thr]-Leu-Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Hyp-Gly-Pro-His (SEQ ID NO:1) or variants thereof.
- this GAGP 19-residue consensus repeat (which contains both contiguous Hyp and non-contiguous Hyp repeats) is glycosylated in native GAGP with both Hyp-arabinosides and Hyp-polysaccharide in molar ratios. It is further believed that the high molecular weight protein component of gum arabic (i.e. GAGP) is responsible for the remarkable emulsifying and stabilizing activity exploited by the food and soft drink industries.
- the present invention contemplates involves the use of synthetic genes engineered for the expression of repetitive glycopeptide modules in cells, including but not limited to callus and suspension cultures. It is not intended that the present invention be limited by the precise number of repeats.
- the present invention contemplates the nucleic acid sequences encoding the consensus sequence for GAGP (i.e. SEQ ID NO:1) or variants thereof may be used as a repeating sequence between two (2) and up to fifty (50) times, more preferably between ten (10) and up to thirty (30) times, and most preferably approximately twenty (20) times.
- the nucleic acid sequence encoding the consensus sequence (i.e. SEQ ID NO:1) or variants thereof may be used as contiguous repeats or may be used as non-contiguous repeats.
- Non-palindromic ends are used for the monomers and end linkers to assure proper “head-to-tail” polymerization.
- the constructs contain no internal restriction enzyme recognition sites for the restriction enzymes employed for the insertion of these sequences into expression vectors or during subsequent manipulations of such vectors.
- the 5′ linker contains a XmaI site downstream of the BamHI site used for cloning into the cloning vector (e.g., pbluescript).
- the XmaI site is used for insertion of the HRGP gene cassette into the expression vector (e.g., pBI121-Sig-EGFP).
- the 3′ linker contains a AgeI site upstream of the EcoRI site used for cloning into the cloning vector (e.g., pbluescript).
- the AgeI site is used for insertion of the HRGP gene cassette into the expression vector.
- plasmid pBI121-Sig which does not contain GFP for the fusion protein—the same signal sequence is used, but the 3′ linkers contain an Sst I restriction site for insertion as an Xma I/Sst I fragment behind the signal sequence and before the NOS terminator.
- the oligonucleotides used are high quality (e.g., from GibcoBRL, Operon) and have been purified away from unwanted products of the synthesis.
- T M of correctly aligned oligomers is greater than the T M of possible dimers, hairpins or crossdimers.
- a variety of vectors are contemplated.
- two plant transformation vectors are prepared, both derived from pBI121 (Clontech). Both contain an extensin signal sequence for transport of the constructs through the ER/Golgi for posttranslational modification.
- a first plasmid construct contained Green Fluorescent Protein (GFP) as a reporter protein instead of GUS.
- GFP Green Fluorescent Protein
- a second plasmid does not contain GFP.
- pBI121 is the Jefferson vector in which the BamHI and SstI sites can be used to insert foreign DNA between the 35S CaMV promoter and the termination/polyadenylation signal from the nopaline synthase gene (NOS-ter) of the Agrobacterium Ti plasmid); it also contains an RK2 origin of replication, a kanamycin resistance gene, and the GUS reporter gene.
- GUS sequence is replaced (via BamHI/SstI) with a synthetic DNA sequence encoding a peptide signal sequence based on the extensin signal sequences of Nicotiana plumbaginifolia and N. tabacum
- the DNA sequence also contains 15 bp of the 5′ untranslated region, and restriction sites for Bam HI in its 5′terminus and Sst I in its extreme 3′ terminus for insertion into pBI121 in place of GUS.
- An XmaI restriction site occurs 16 bp upstream from the Sst I site to allow subsequent insertion of EGFP into the plasmid as a Xma I/Sst I fragment.
- the sequence underlined above is known to target N. plumbaginifolia extensin fusion proteins through the ER and Golgi for post-translational modifications, and finally to the wall.
- the signal sequence proposed also involves transport of extensins and extensin modules in the same plant family ( Solanaceae ). Alternatively, one can use the signal sequence from tomato P1 extensin itself.
- GFP MUTANTs WAVELENGTH (nm) MUTANT Excitation Emitting mGFPX10; F99S, M153T, V163A Excites at 395 mGFPX10-5 Excites at 489 Emits at 508 GFPA2; I167T Excites at 471 GFPB7; Y66H Excites at 382 Emits at 440 (blue fluorescence) GFPX10-C7; F99S, M153T, Excites at 395 V163A, I167T, S175G and 473 GFPX10-D3; F99S, M153T, Excites at 382 Emits at 440 V163A, Y66H Addition of GFP.
- the repetitive HRGP-modules can be expressed as GFP fusion products rather than GUS fusions, and can also be expressed as modules without GFP.
- Fusion with a green fluorescent protein reporter gene appropriately red-shifted for plant use e.g. EGFP (an S65T variant recommended for plants by Clontech) or other suitable mutants (see Table 1 above) allows the detection of ⁇ 700 GFP molecules at the cell surface.
- GFP requires aerobic conditions for oxidative formation of the fluorophore. It works well at the lower temperatures used for plant cell cultures and normally it does not adversely affect protein function although it may allow the regeneration of plants only when targeted to the ER. Promoters. As noted above, it is not intended that the present invention be limited by the nature of the promoter(s) used in the expression constructs.
- the CaMV35S promoter is preferred, although it is not entirely constitutive and expression is “moderate”. In some embodiments, higher expression of the constructs is desired to enhance the yield of HRGP modules; in such cases a plasmid with “double” CaMV35S promoters is employed.
- host cells are contemplated (both eukaryotic and prokaryotic). It is not intended that the present invention be limited by the host cells used for expression of the synthetic genes of the present invention. Plant host cells are preferred, including but not limited to legumes (e.g. soy beans) and solanaceous plants (e.g. tobacco).
- legumes e.g. soy beans
- solanaceous plants e.g. tobacco
- the present invention is not limited by the nature of the plant cells. All sources of plant tissue are contemplated, including but not limited to seeds. Seeds of flowering plants consist of an embryo, a seed coat, and stored food. When fully formed, the embryo consists basically of a hypocotyl-root axis bearing either one or two cotyledons and an apical meristem at the shoot apex and at the root apex. The cotyledons of most dicots are fleshy and contain the stored food of the seed. In other dicots and most monocots, food is stored in the endosperm and the cotyledons function to absorb the simpler compounds resulting from the digestion of the food.
- Monoctyledons include grasses, lilies, irises, orchids, cattails, palms.
- Dicotyledons include almost all the familiar trees and shrubs (other than confers) and many of the herbs (non-woody plants).
- Tomato cultures are the ideal recipients for repetitive HRGP modules to be hydroxylated and glycosylated: Tomato is readily transformed.
- the cultures produce cell surface HRGPs in high yields easily eluted from the cell surface of intact cells and they possess the required posttranslational enzymes unique to plants—HRGP prolyl hydroxylases, hydroxyproline O-glycosyltransferases and other specific glycosyltransferases for building complex polysaccharide side chains.
- tomato genetics, and tomato leaf disc transformation/plantlet regeneration are well worked out.
- Expression constructs of the present invention may be introduced into host cells (e.g. plant cells) using methods known in the art.
- the expression constructs are introduced into plant cells by particle mediated gene transfer.
- Particle mediated gene transfer methods are known in the art, are commercially available, and include, but are not limited to, the gas driven gene delivery instrument descried in McCabe, U.S. Pat. No. 5,584,807, the entire contents of which are herein incorporated by reference. This method involves coating the nucleic acid sequence of interest onto heavy metal particles, and accelerating the coated particles under the pressure of compressed gas for delivery to the target tissue.
- Other particle bombardment methods are also available for the introduction of heterologous nucleic acid sequences into plant cells.
- these methods involve depositing the nucleic acid sequence of interest upon the surface of small, dense particles of a material such as gold, platinum, or tungsten.
- the coated particles are themselves then coated onto either a rigid surface, such as a metal plate, or onto a carrier sheet made of a fragile material such as mylar.
- the coated sheet is then accelerated toward the target biological tissue.
- the use of the flat sheet generates a uniform spread of accelerated particles which maximizes the number of cells receiving particles under uniform conditions, resulting in the introduction of the nucleic acid sample into the target tissue.
- an expression construct may be inserted into the genome of plant cells by infecting them with a bacterium, including but not limited to an Agrobacterium strain previously transformed with the nucleic acid sequence of interest.
- a bacterium including but not limited to an Agrobacterium strain previously transformed with the nucleic acid sequence of interest.
- disarmed Agrobacterium cells are transformed with recombinant Ti plasmids of Agrobacterium tumefaciens or Ri plasmids of Agrobacterium rhizogenes (such as those described in U.S. Pat. No. 4,940,838, the entire contents of which are herein incorporated by reference) which are constructed to contain the nucleic acid sequence of interest using methods well known in the art (Sambrook, J. et al., (1989) supra).
- the nucleic acid sequence of interest is then stably integrated into the plant genome by infection with the transformed Agrobacterium strain.
- heterologous nucleic acid sequences have been introduced into plant tissues using the natural DNA transfer system of Agrobacterium tumefaciens and Agrobacterium rhizogenes bacteria (for review, see Klee et al. (1987) Ann. Rev. Plant Phys. 38:467-486).
- Agrobacterium may be enhanced by using a number of methods known in the art. For example, the inclusion of a natural wound response molecule such as acetosyringone (AS) to the Agrobacterium culture has been shown to enhance transformation efficiency with Agrobacterium tumefaciens [Shahla et al. (1987) Plant Molec. Biol. 8:291-298].
- transformation efficiency may be enhanced by wounding the target tissue to be transformed. Wounding of plant tissue may be achieved, for example, by punching, maceration, bombardment with microprojectiles, etc. [see, e.g., Bidney et al. (1992) Plant Molec. Biol. 18:301-313].
- nucleic acid sequence of interest may be desirable to target the nucleic acid sequence of interest to a particular locus on the plant genome.
- Site-directed integration of the nucleic acid sequence of interest into the plant cell genome may be achieved by, for example, homologous recombination using Agrobacterium -derived sequences.
- plant cells are incubated with a strain of Agrobacterium which contains a targeting vector in which sequences that are homologous to a DNA sequence inside the target locus are flanked by Agrobacterium transfer-DNA (T-DNA) sequences, as previously described (Offring a et al., (1996), U.S. Pat. No. 5,501,967, the entire contents of which are herein incorporated by reference).
- T-DNA Agrobacterium transfer-DNA
- homologous recombination may be achieved using targeting vectors which contain sequences that are homologous to any part of the targeted plant gene, whether belonging to the regulatory elements of the gene, or the coding regions of the gene. Homologous recombination may be achieved at any region of a plant gene so long as the nucleic acid sequence of regions flanking the site to be targeted is known.
- the targeting vector used may be of the replacement- or insertion-type (Offring a et al. (1996), supra).
- Replacement-type vectors generally contain two regions which are homologous with the targeted genomic sequence and which flank a heterologous nucleic acid sequence, e.g., a selectable marker gene sequence.
- Replacement type vectors result in the insertion of the selectable marker gene which thereby disrupts the targeted gene.
- Insertion-type vectors contain a single region of homology with the targeted gene and result in the insertion of the entire targeting vector into the targeted gene.
- the present invention contemplates introducing nucleic acid via the leaf disc transformation method.
- DNA deoxyribonucleic acid
- cDNA complementary DNA
- RNA ribonucleic acid
- mRNA messenger ribonucleic acid
- X-gal 5-bromo-4-chloro-3-indolyl- ⁇ -D-galactopyranoside
- LB Lia Broth
- PAGE polyacrylamide gel electrophoresis
- NAA ⁇ -naphtaleneacetic acid
- BAP 6-benzyl aminopurine
- Tris tris(hydroxymethyl)-aminomethane
- PBS phosphate buffered saline
- 2 ⁇ SSC 0.3 M NaCl, 0.03 M Na 3 citrate, pH 7.0
- GAGP was isolated and (by using chymotrypsin) the deglycosylated polypeptide backbone was prepared.
- GAGP does not contain the usual chymotryptic cleavage sites, it does contain leucyl and histidyl residues which are occasionally cleaved. Chymotrypsin cleaved sufficient of these “occasionally cleaved” sites to produce a peptide map of closely related peptides.
- GAGP was isolated via preparative Superose-6 gel filtration. Anhydrous hydrogen fluoride deglycosylated it (20 mg powder/mL HF at 4° C., repeating the procedure twice to ensure complete deglycosylation), yielding dGAGP which gave a single symmetrical peak (data not shown) after rechromatography on Superose-6. Further purification of dGAGP by reverse phase chromatography also gave a single major peak, showing a highly biased but constant amino acid composition in fractions sampled across the peak. These data indicated that dGAGP was a single polypeptide component sufficiently pure for sequence analysis. Sequence Analysis.
- Synthetic gene cassettes encoding contiguous and noncontiguous Hyp modules are constructed using partially overlapping sets consisting of oligonucleotide pairs, “internal repeat pairs” and “external 3′- and 5′-linker pairs” respectively, all with complementary “sticky” ends.
- the design strategy for the repetitive HRGP modules combines proven approaches described earlier for the production in E. coli of novel repetitive polypeptide polymers (McGrath et al. [1990] Biotechnol. Prog. 6:188), of a repetitious synthetic analog of the bioadhesive precursor protein of the mussel Mytilus edulis , of a repetitive spider silk protein (Lewis et al. [1996] Protein Express. Purif.
- a synthetic gene encoding the extensin-like Ser-Hyp 4 module is constructed using the following partially overlapping sets of oligonucleotide pairs.
- 5′-Linker Amino Acid: A G S S T R A S P (P P P) 5′-GCT GGA TCC TCA ACC CGG GCC TCA CCA CGA CCT AGG AGT TGG GCC CCG AGT GGT GGT GGT GGA-5′ 3′ Linker (for pBI121-Sig-EGFP): Amino Acid: P P P S P V A R N S P 5′-CCA CCA CCT TCA CCG GTC GCC CGG AAT TCA CCA CCC AGT GGC CAG CGG GCC TTA AGT GGT GGG-5′ 3′ Linker (for pBI121-Sig: Amino Acid: 5′-CCA CCA CCT TAA TAG AGC TCC CCC ATT ATC TCG AGG GGG-5′ Internal Repeat Amino Acid: P P P P P P P P P S P 5′-CCA CCA CCT TCA CCT CCC CCA TCT CCA AGT GGA GGT GGT AGA G
- a synthetic gene cassette encoding the GAGP consensus sequence is generated as described above using the following 5′ linker, internal repeat and 3′ linker duplexes.
- Conversion of the “internal” AGP-like motif and 5′ & 3′ “external” gene cassettes to long duplex DNA is accomplished using the steps described in section a) above.
- Up to fifty (50) repeats of the internal repeat duplex are desirable (more preferrably up to thirty (30) repeats, and more preferrably approximately twenty (20) repeats) (i.e., the wild-type protein contains 20 of these repeats).
- the above GAGP internal repeat is a consensus sequence
- the variant sequences are likely to be glycosylated in a slightly different manner, which may confer different properties (e.g. more soluble etc.).
- Other constructs are shown for other illustrative modules in Table 3.
- P1 extensin signal sequence i.e., signal peptide
- P1 extensin cDNA clones were isolated using oligonucleotides designed after the P1-unique protein sequence: Val-Lys-Pro-Tyr-His-Pro-Thr-Hyp-Val-Tyr-Lys.
- the P1 extensin signal sequence directs the nascent peptide chain to the ER.
- pBI121 is an expression vector which permits the high level expression and secretion of inserted genes in plant cells (e.g., tomato, tobacco, members of the genus Solanace , members of the family Leguminoseae, non-graminaceous monocots).
- pBI121 contains the 35S CaMV promoter, the tobbaco ( Nicotiana plumbaginifolia ) extensin signal sequence, a EGFP gene, the termination/polyadenylation signal from the nopaline synthetase gene (NOS-ter), a kanamycin-resistance gene (nptII) and the right and left borders of T-DNA to permit transfer into plants by Agrobacterium -mediated transformation.
- the P3-Type Extensin Palindromic Module P3-Type Extensin Palindromic Internal Repeat Oligo's: 5′-CCA CCA CCT TCA CCC TCT CCA CCT CCA CCA TCT CCG TCA CCA AGT GGG AGA GGT GGA GGT GGT AGA GGC AGT GGT GGT GGT GGA-5′ P3-Type Extensin Palindromic External Linker Oligo's: Use the [SPPP] n linkers (SEE ABOVE) e.
- the Potato Lectin HRGP Palindromic Module Potato Lectin HRGP Palindromic Internal Repeat Oligo's: 5′-CCA CCA CCT TCA CCC CCA TCT CCA CCT CCA CCA TCT CA CCG TCA CCA AGT GGG GGT AGA GGT GGA GGT GGT AGA GGT GGC AGT GGT GGT GGA-5′ Potato Lectin HRGP Palindromic External Linker Oligo's: Use the [SPPP] n linkers (SEE ABOVE) f.
- P1-Extensin-Like Modules i.
- the SPPPPTPVYK Module SPPPPTPVYK Internal Repeat Oligo's: 5′-CCA CCA CCT ACT CCC GTT TAC AAA TCA CCA CCA CCA CCT ACT CCC GTT TAC AAA TCA CCA TGA GGG CAA ATG TTT AGT GGT GGT GGT GGA TGA GGG CAA ATG TTT AGT GGT GGT GGT GGA-5′ SPPPPTPVYK External Linker Oligo's: Use the [SPPP] n linkers (SEE ABOVE) ii.
- the SPPPPVKPYHPTPVFL Module SPPPPVKPYHPTPVFL Internal Repeat Oligo's: 5′-CCA CCA CCT GTC AAG CCT TAC CAC CCC ACT CCC GTT TTT CTT TCA CCA CAG TTC GGA ATG GTG GGG TGA GGG CAA AAA GAA AGT GGT GGT GGT GGA-5′ SPPPPVKPYHPTPVFL External Linker Oligo's: Use the [SPPP] n linkers (SEE ABOVE) iii.
- SPPPPVLPFHPTPVYK Module SPPPPVLPFHPTPVYK Internal Repeat Oligo's: 5′-CCA CCA CCT GTC TTA CCT TTC CAC CCC ACT CCC GTT TAC AAA TCA CCA CAG AAT GGA AAG GTG GGG TGA GGG CAA ATG TTT AGT GGT GGT GGT GGA-5′
- SPPPPVLPFHPTPVYK External Linker Oligo's Use the [SPPP] n linkers (SEE ABOVE) EGFP 3′ Linker Oligo's needed to insert EGFP into pBI121-Sig-EGF 5′-GGC CGC GAG CTC CAG CAC GGG CG CTC GAG GTC GTG CCC-5′
- the presence of the extensin signal sequence at the N-terminus of proteins encoded by genes inserted into the pBI121 expression vector e.g., HRGPs encoded by synthetic gene constructs.
- the tobacco signal sequence was demonstrated to target extensin fusion proteins through the ER and Golgi for posttranslational modifications, and finally to the wall.
- the targeted expression of recombinant HRGPs is not dependent upon the use of the tobacco extensin signal sequence.
- Signal sequences involved in the transport of extensins and extensin modules in the same plant family ( Solanaceae ) as tobacco may be employed; alternatively, the signal sequence from tomato P1 extensin may be employed.
- the EGFP gene encodes a green fluorescent protein (GFP) appropriately red-shifted for plant use (the EGFP gene encodes a S65T variant optimized for use in plants and is available from Clontech). Other suitable mutants may be employed (see Table 1). These modified GFPs allow the detection of less than 700 GFP molecules at the cell surface.
- the use of a GFP gene provides a reporter gene and permits the formation of fusion proteins comprising repetitive HRGP modules. GFPs require aerobic conditions for oxidative formation of the fluorophore. It is functional at the lower temperatures used for plant cell cultures, normally it does not adversely affect protein function.
- Plasmids pBI121-Sig and pBI121-Sig-EGFP are constructed as follows.
- the GUS gene present in pBI121 (Clontech) is deleted by digestion with BamHI and SstI and a pair of partially complementary oligonucleotides encoding the tobacco extensin signal sequence is annealed to the BamHI and SstI ends.
- the oligonucleotides encoding the 21 amino acid extensin signal sequence have the following sequence.
- this pair of oligonucleotides when inserted into the digested pBI121 vector, provides a BamHI site (5′ end) and XmaI and SstI sites (3′ end). The XmaI and SstI sites allow the insertion of the GFP gene.
- the modified pBI121 vector lacking the GUS gene and containing the synthetic extensin signal sequence is termed pBI121-Sig. Proper construction of pBI121 is confirmed by DNA sequencing.
- the GFP gene (e.g., the EGFP gene) is inserted into pBI121-Sig to make pBI121-Sig-EGFP as follows.
- the EGFP gene is excised from pEGFP (Clontech) as a 1.48 kb XmaI/NotI fragment (base pairs 270 to 1010 in pEGFP). This 1.48 kb XmaI/NotI fragment is then annealed and ligated to a synthetic 3′ linker (see above).
- the EGFP-3′ linker is then digested with SstI to produce an XmaI/SstI EGFP fragment which in inserted into the XmaI/SstI site of pBI121-Sig to create pBI121-Sig-EGFP.
- the AgeI discussed below
- XmaI and SstI sites provide unique restriction enzyme sites. Proper construction of the plasmids is confirmed by DNA sequencing.
- the EGFP sequences in pBI121-Sig-EGFP contain an AgeI site directly before the translation start codon (i.e., ATG) of EGFP.
- Synthetic HRGP gene cassettes are inserted into the plasmid between the signal sequence and the EGFP gene sequences as XmaI/AgeI fragments; the HRGP gene cassettes are excised as XmaI/AgeI fragments from the pbluescript constructs described in Ex.2. Proper construction of HRGP-containing expression vectors is confirmed by DNA sequencing and/or restriction enzyme digestion.
- Expression of the synthetic HRGP gene cassettes is not dependent upon the use of the pBI121-Sig and pBI121-Sig-EGFP gene cassette.
- Analogous expression vectors containing other promoter elements functional in plant cells may be employed (e.g., the CaMV region IV promoter, ribulose-1,6-biphosphate (RUBP) carboxylase small subunit (ssu) promoter, the nopaline promoter, octopine promoter, mannopine promoter, the ⁇ -conglycinin promoter, the ADH promoter, heat shock promoters, tissue-specific promoters, e.g., promoters associated with fruit ripening, promoters regulated during seed ripening (e.g., promoters from the napin, phaseolin and glycinin genes).
- promoter elements functional in plant cells may be employed (e.g., the CaMV region IV promoter, ribulose-1,6-biphosphate (RUBP) carboxylase small sub
- expression vectors containing a promoter that directs high level expression of inserted gene sequences in the seeds of plants may be employed.
- Expression may also be carried out in green algae.
- reporter genes may be employed in place of the GFP gene.
- Suitable reporter genes include ⁇ -glucuronidase (GUS), neomycin phosphotransferase II gene (nptII), alkaline phosphatase, luciferase, CAT (Chloramphenicol AcetylTransferase).
- GUS ⁇ -glucuronidase
- nptII neomycin phosphotransferase II gene
- alkaline phosphatase luciferase
- CAT Chloramphenicol AcetylTransferase
- Preferred reporter genes lack Hyp residues.
- the proteins encoded by the synthetic HRGP genes need not be expressed as fusion proteins. This is readily accomplished using the pBI121-Sig vector.
- the present invention contemplates recombinant HRGPs encoded by expression vectors comprising synthetic HRGP gene modules are expressed in tomato cell suspension cultures.
- the expression of recombinant HRGPs in tomato cell suspension cultures is illustrated by the discussion provided below for recombinant GAGP expression.
- An expression vector containing the synthetic GAGP gene cassette (capable of being expressed as a fusion with GFP or without GFP sequences) is introduced into tomato cell suspension cultures.
- a variety of means are known to the art for the transfer of DNA into tomato cell suspension cultures, including Agrobacterium -mediated transfer and biolistic transformation.
- Agrobacterium -mediated transformation contemplates transforming both suspension cultured cells (Bonnie Best cultures) and tomato leaf discs by mobilizing the above-described plasmid constructions (and others) from E. coli into Agrobacterium tumefaciens strain LBA4404 via triparental mating. Positive colonies are used to infect tomato cultures or leaf discs ( Lysopersicon esculentum ). Transformed cells/plants are selected on MSO medium containing 500 mg/mL carbenicillin and 100 mg/mL kanamycin. Expression of GFP fusion products are conveniently monitored by fluoresence microscopy using a high Q FITC filter set (Chroma Technology Corp.). FITC conjugates (e.g.
- FITC-BSA can be used along with purified recombinant GFP as controls for microscopy set-up. Cultured tomato cells show only very weak autofluorescence. Thus, one can readily verify the spatiotemporal expression of GFP-Hyp module fusion products.
- Transgenic cells/plants can be examined for transgene copy number and construct fidelity genomic Southern blotting and for the HRGP construct mRNA by northern blotting, using the internal repeat oligonucleotides as probes.
- Controls include tissue/plants which are untransformed, transformed with the pBI121 alone, pBI121 containing only GFP, and pBI121 having the signal sequence and GFP but no HRGP synthetic gene.
- Microprojectile bombardment 1.6 M gold particles are coated with each appropriate plasmid construct DNA for use in a Biolistic particle delivery system to transform the tomato suspension cultures/callus or other tissue. Controls include: particles without DNA, particles which contain PBI121 only, and particles which contain PBI121 and GFP.
- HRGPs include, but are not limited to, RPRps, extensins, AGPs and other plant gums (e.g. gum Karaya, gum Tragacanth, gum Ghatti, etc.).
- HRGP chimeras include but are not limited to HRGP plant lectins, including the solanaceous lectins, plant chitinases, and proteins in which the HRGP portion serves as a spacer (such as in sunflower).
- the present invention specifically contemplates using the HRGP modules (described above) as spacers to link non-HRGP proteins (e.g. enzymes) together.
- the present invention provides a new approach and solution to the problem of producing plant gums.
- the approach is not dependent on environmental factors and greatly simplifies production of a variety of naturally-occurring gums, as well as designer gums.
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Plant Pathology (AREA)
- Microbiology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Medicinal Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Botany (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
A new approach in the field of plant gums is described which presents a new solution to the production of hydroxyproline(Hyp)-rich glycoproteins (HRGPs), repetitive proline-rich proteins (RPRPs) and arabino-galactan proteins (AGPs). The expression of synthetic genes designed from repetitive peptide sequences of such glycoproteins, including the peptide sequences of gum arabic glycoprotein (GAGP), is taught in host cells, including plant host cells.
Description
- The present invention relates generally to the field of plant gums and other hydroxyproline-rich glycoproteins, and in particular, to the expression of synthetic genes designed from repetitive peptide sequences.
- Gummosis is a common wound response that results in the exudation of a gum sealant at the site of cracks in bark. A. M. Stephen et al., “Exudate Gums”, Methods Plant Biochem. (1990). Generally the exudate is a composite of polysaccharides and glycoproteins structurally related to cell wall components such as galactans [G.O. Aspinall, “Plant Gums”, The Carbohydrates 2B:522536 (1970)] and hydroxyproline-rich glycoproteins [Anderson and McDougall, “The chemical characterization of the gum exudates from eight Australian Acacia species of the series Phyllodineae.” Food Hydrocolloids, 2: 329 (1988)].
- Gum arabic is probably the best characterized of these exudates (although it has been largely refractory to chemical analysis). It is a natural plant exudate secreted by various species of Acacia trees. Acacia Senegal accounts for approximately 80% of the production of gum arabic with Acacia seyal, Acacia laeta, Acacia camplylacantha, and Acacia drepanolobium supplying the remaining 20%. The gum is gathered by hand in Africa. It is a tedious process involving piercing and stripping the bark of the trees, then returning later to gather the dried tear drop shaped, spherical balls that form.
- The exact chemical nature of gum arabic has not been elucidated. It is believed to consist of two major components, a microheterogeneous glucurono-arabinorhamogalactan polysaccharide and a higher molecular weight hydroxyproline-rich glycoprotein. Osman et al., “Characteriztion of Gum Arabic Fractions Obtained By Anion-Exchange Chromatography” Phytochemistry 38:409 (1984) and Qi et al., “Gum Arabic Glycoprotein Is A Twisted Hairy Rope” Plant Physiol. 96:848 (1991). While the amino composition of the protein portion has been examined, little is known with regard to the precise amino acid sequence.
- While the precise chemical nature of gum arabic is elusive, the gum is nonetheless particularly useful due to its high solubility and low viscosity compared to other gums. The FDA declared the gum to be a GRAS food additive. Consequently, it is widely used in the food industry as a thickener, emulsifier, stabilizer, surfactant, protective colloid, and flavor fixative or preservative. J. Dziezak, “A Focus on Gums” Food Technology (March 1991). It is also used extensively in the cosmetics industry.
- Normally, the world production of gum arabic is over 100,000 tons per year. However, this production depends on the environmental and political stability of the region producing the gum. In the early 1970s, for example, a severe drought reduced gum production to 30,00 tons. Again in 1985, drought brought about shortages of the gum, resulting in a 600% price increase.
- Three approaches have been used to deal with the somewhat precarious supply problem of gum arabic. First, other gums have been sought out in other regions of the world. Second, additives have been investigated to supplement inferior gum arabic. Third, production has been investigated in cultured cells.
- The effort to find other gums in other regions of the world has met with some limited success. However, the solubility of gum arabic from Acacia is superior to other gums because it dissolves well in either hot or cold water. Moreover, while other exudates are limited to a 5% solution because of their excessive viscosity, gum arabic can be dissolved readily to make 55% solutions.
- Some additives have been identified to supplement gum arabic. For example, whey proteins can be used to increase the functionality of gum arabic. A. Prakash et al., “The effects of added proteins on the functionality of gum arabic in soft drink emulsion systems,” Food Hydrocolloids 4:177 (1990). However, this approach has limitations. Only low concentrations of such additives can be used without producing off-flavors in the final food product.
- Attempts to produce gum arabic in cultured Acacia senegal cells has been explored. Unfortunately, conditions have not been found which lead to the expression of gum arabic in culture. A. Mollard and J-P. Joseleau, “Acacia senegal cells cultured in suspension secrete a hydroxyproline-deficient rabinogalactan-protein” Plant Physiol. Biochem. 32:703 (1994).
- Clearly, new approaches to improve gum arabic production are needed. Such approaches should not be dependent on environmental or political factors. Ideally, such approaches should simplify production and be relatively inexpensive.
- The present invention involves a new approach in the field of plant gums and presents a new solution to the production of hydroxyproline(Hyp)-rich glycoproteins (HRGPs), repetitive proline-rich proteins (RPRPs) and arabino-galactan proteins (AGPs). The present invention contemplates the expression of synthetic genes designed from repetitive peptide sequences of such glycoproteins, including the peptide sequences of gum arabic glycoprotein (GAGP).
- With respect to GAGP, the present invention contemplates a substantially purified polypeptide comprising at least a portion of the amino acid sequence Ser-Hyp-Hyp-Hyp-[Hyp/Thr]-Leu-Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Hyp-Gly-Pro-His (SEQ ID NO:1) or variants thereof. By “variants” it is meant that the sequence need not comprise the exact sequence; up to five (5) amino acid substitutions are contemplated. For example, a Leu or Hyp may be substituted for the Gly; Leu may also be substituted for Ser and one or more Hyp. By “variants” it is also meant that the sequence need not be the entire nineteen (19) amino acids. Illustrative variants are shown in Table 2.
- Indeed, it is not intended that the present invention be limited by the precise length of the purified polypeptide. In one embodiment, the peptide comprises more than twelve (12) amino acids from the nineteen (19) amino acids of the sequence. In another embodiment, a portion of the nineteen (19) amino acids (see SEQ ID NO: 1) is utilized as a repetitive sequence. In yet another embodiment, all nineteen (19) amino acids (see SEQ ID NO:1 with or without amino acid substitutions) are utilized as a repetitive sequence.
- It is not intended that the present invention be limited by the precise number of repeats. The sequence (i.e. SEQ ID NO:1) or variants thereof may be used as a repeating sequence between one (1) and up to fifty (50) times, more preferably between ten (10) and up to thirty (30) times, and most preferably approximately twenty (20) times. The sequence (i.e. SEQ ID NO:1) or variants thereof may be used as contiguous repeats or may be used as non-contiguous repeats (with other amino acids, or amino acid analogues, placed between the repeating sequences).
- The present invention specifically contemplates fusion proteins comprising a non-gum arabic protein or glycoprotein sequence and a portion of the gum arabic glycoprotein sequence (SEQ ID NO:1). It is not intended that the present invention be limited by the nature of the non-gum arabic glycoprotein sequence. In one embodiment, the non-gum arabic glycoprotein sequence is a green fluorescent protein.
- As noted above, the present invention contemplates synthetic genes encoding such peptides. By “synthetic genes” it is meant that the nucleic acid sequence is derived using the peptide sequence of interest (in contrast to using the nucleic acid sequence from cDNA). In one embodiment, the present invention contemplates an isolated polynucleotide sequence encoding a polypeptide comprising at least a portion of the polypeptide of SEQ ID NO:1 or variants thereof. The present invention specifically contemplates a polynucleotide sequence comprising a nucleotide sequence encoding a polypeptide comprising one or more repeats of SEQ ID NO:1 or variants thereof. Importantly, it is not intended that the present invention be limited to the precise nucleic acid sequence encoding the polypeptide of interest.
- The present invention contemplates synthetic genes encoding portions of HRGPs, wherein the encoded peptides contain one or more of the highly conserved Ser-Hyp4 motif(s). The present invention also contemplates synthetic genes encoding portions of RPRPs, wherein the encoded peptides contain one or more of the pentapeptide motif: Pro-Hyp-Val-Tyr-Lys and variants of this sequence such as X-Hyp-Val-Tyr-Lys and Pro-Hyp-Val-X-Lys and Pro-Pro-X Tyr-Lys and Pro-Pro-X-Tyr-X, where “X” can be Thr, Glu, Hyp, Pro, His and Ile. The present invention also contemplates synthetic genes encoding portions of AGPs, wherein the encoded peptides contain one or more Xaa-Hyp-Xaa-Hyp repeats. Such peptides can be expressed in a variety of forms, including but not limited to fusion proteins.
- With regard to motifs for HRGPs, the present invention contemplates a polynucleotide sequence comprising the sequence: 5′-CCA CCA CCT TCA CCT CCA CCC CCA TCT CCA-3′ (SEQ ID NO:2). With regard to motifs for AGPs, the present invention contemplates a polynucleotide sequence comprising the sequence: 5′-TCA CCA TCA CCA TCT CCT TCG COCA TCA CCC-3′ (SEQ ID NO:3). Of course, it is not intended that the present invention be limited by the particular sequence. Indeed, the present invention specifically contemplates sequences homologous to the sequences of SEQ ID NOS: 2 and 3. The present invention also contemplates sequences that are complementary (including sequences that are only partially complementary) sequences to the sequences of SEQ ID NOS: 2 and 3. Such complementary sequences include sequences that will hybridize to the sequences of SEQ ID NOS: 2 and 3 under low stringency conditions as well as high stringency conditions (see Definitions below).
- The present invention also contemplates the mixing of motifs (i.e. modules) which are not found in wild-type sequences. For example, one might add GAGP modules to extensin and RPRP crosslinking modules to AGP-like molecules.
- The present invention contemplates using the polynucleotides of the present invention for expression of the polypeptides in vitro and in vivo. Therefore, the present invention contemplates polynucleotide sequences encoding two or more repeats of the sequence of SEQ ID NO: 1 or variants thereof, wherein said polynucleotide sequence is contained on a recombinant expression vector. It is also contemplated that such vectors will be introduced into a variety of host cells, both eukaryotic and prokaryotic (e.g. bacteria such as E. coli).
- In one embodiment, the vector further comprises a promoter. It is not intended that the present invention be limited to a particular promoter. Any promoter sequence which is capable of directing expression of an operably linked nucleic acid sequence encoding a portion of a plant gum polypeptide (or other hydroxyproline-rich polypeptide of interest as described above) is contemplated to be within the scope of the invention. Promoters include, but are not limited to, promoter sequences of bacterial, viral and plant origins. Promoters of bacterial origin include, but are not limited to, the octopine synthase promoter, the nopaline synthase promoter and other promoters derived from native Ti plasmids. Viral promoters include, but are not limited to, the 35S and 19S RNA promoters of cauliflower mosaic virus (CaMV), and T-DNA promoters from Agrobacterium. Plant promoters include, but are not limited to, the ribulose-1,3-bisphosphate carboxylase small subunit promoter, maize ubiquitin promoters, the phaseolin promoter, the E8 promoter, and the Tob7 promoter.
- The invention is not limited to the number of promoters used to control expression of a nucleic acid sequence of interest. Any number of promoters may be used so long as expression of the nucleic acid sequence of interest is controlled in a desired manner. Furthermore, the selection of a promoter may be governed by the desirability that expression be over the whole plant, or localized to selected tissues of the plant, e.g., root, leaves, fruit, etc. For example, promoters active in flowers are known (Benfy et al. (1990) Plant Cell 2:849-856).
- The promoter activity of any nucleic acid sequence in host cells may be determined (i.e., measured or assessed) using methods well known in the art and exemplified herein. For example, a candidate promoter sequence may be tested by ligating it in-frame to a reporter gene sequence to generate a reporter construct, introducing the reporter construct into host cells (e.g. tomato or potato cells) using methods described herein, and detecting the expression of the reporter gene (e.g., detecting the presence of encoded mRNA or encoded protein, or the activity of a protein encoded by the reporter gene). The reporter gene may confer antibiotic or herbicide resistance. Examples of reporter genes include, but are not limited to, dhfr which confers resistance to methotrexate [Wigler M et al., (1980) Proc Natl Acad Sci 77:3567-70]; npt, which confers resistance to the aminoglycosides neomycin and G-418 [Colbere-Garapin F et al., (1981) J. Mol. Biol. 150:1-14] and als or pat, which confer resistance to chlorsulfuron and phosphinotricin acetyl transferase, respectively. Recently, the use of a reporter gene system which expresses visible markers has gained popularity with such markers as β-glucuronidase and its substrate (X-Gluc), luciferase and its substrate (luciferin), and β-galactosidase and its substrate (%-Gal) being widely used not only to identify transformants, but also to quantify the amount of transient or stable protein expression attributable to a specific vector system [Rhodes C A et al. (1995) Methods Mol Biol 55:121-131].
- In addition to a promoter sequence, the expression construct preferably contains a transcription termination sequence downstream of the nucleic acid sequence of interest to provide for efficient termination. In one embodiment, the termination sequence is the nopaline synthase (NOS) sequence. In another embodiment the termination region comprises different fragments of sugarcane ribulose-1,5-biphosphate carboxylase/oxygenase (rubisco) small subunit (scrbcs) gene. The termination sequences of the expression constructs are not critical to the invention. The termination sequence may be obtained from the same gene as the promoter sequence or may be obtained form different genes.
- If the mRNA encoded by the nucleic acid sequence of interest is to be efficiently translated, polyadenylation sequences are also commonly added to the expression construct. Examples of the polyadenylation sequences include, but are not limited to, the Agrobacterium octopine synthase signal, or the nopaline synthase signal.
- The invention is not limited to constructs which express a single nucleic acid sequence of interest. Constructs which contain a plurality of (i.e., two or more) nucleic acid sequences under the transcriptional control of the same promoter sequence are expressly contemplated to be within the scope of the invention. Also included within the scope of this invention are constructs which contain the same or different nucleic acid sequences under the transcriptional control of different promoters. Such constructs may be desirable to, for example, target expression of the same or different nucleic acid sequences of interest to selected plant tissues.
- As noted above, the present invention contemplates using the polynucleotides of the present invention for expression of a portion of plant gum polypeptides in vitro and in vivo. Where expression takes place in vivo, the present invention contemplates transgenic plants. The transgenic plants of the invention are not limited to plants in which each and every cell expresses the nucleic acid sequence of interest. Included within the scope of this invention is any plant (e.g. tobacco, tomato, maize, algae, etc.) which contains at least one cell which expresses the nucleic acid sequence of interest. It is preferred, though not necessary, that the transgenic plant express the nucleic acid sequence of interest in more than one cell, and more preferably in one or more tissue. It is particularly preferred that expression be followed by proper glycosylation of the plant gum polypeptide fragment or variant thereof, such that the host cell produces functional (e.g. in terms of use in the food or cosmetic industry) plant gum polypeptide.
- The fact that transformation of plant cells has taken place with the nucleic acid sequence of interest may be determined using any number of methods known in the art. Such methods include, but are not limited to, restriction mapping of genomic DNA, PCR analysis, DNA-DNA hybridization, DNA-RNA hybridization, and DNA sequence analysis.
- Expressed polypeptides (or fragments thereof) can be immobilized (covalently or non-covalently) on solid supports or resins for use in isolating HRGP-binding molecules from a variety of sources (e.g. algae, plants, animals, microorganisms). Such polypeptides can also be used to make antibodies.
-
FIG. 1 shows the nucleic acid sequence of one embodiment of a synthetic gene of the present invention. -
FIG. 2 shows one embodiment of a synthetic gene in one embodiment of an expression vector. - The term “gene” refers to a DNA sequence that comprises control and coding sequences necessary for the production of a polypeptide or its precursor. The polypeptide can be encoded by a full length coding sequence or by any portion of the coding sequence.
- The term “nucleic acid sequence of interest” refers to any nucleic acid sequence the manipulation of which may be deemed desirable for any reason by one of ordinary skill in the art (e.g., confer improved qualities).
- The term “wild-type” when made in reference to a gene refers to a gene which has the characteristics of a gene isolated from a naturally occurring source, The term “wild-type” when made in reference to a gene product refers to a gene product which has the characteristics of a gene product isolated from a naturally occurring source. A wild-type gene is that which is most frequently observed in a population and is thus arbitrarily designated the “normal” or “wild-type” form of the gene. In contrast, the term “modified” or “mutant” when made in reference to a gene or to a gene product refers, respectively, to a gene or to a gene product which displays modifications in sequence and or functional properties (i.e., altered characteristics) when compared to the wildtype gene or gene product. It is noted that naturally-occurring mutants can be isolated; these are identified by the fact that they have altered characteristics when compared to the wild-type gene or gene product.
- The term “recombinant” when made in reference to a DNA molecule refers to a DNA molecule which is comprised of segments of DNA joined together by means of molecular biological techniques. The term “recombinant” when made in reference to a protein or a polypeptide refers to a protein molecule which is expressed using a recombinant DNA molecule.
- As used herein, the terms “vector” and “vehicle” are used interchangeably in reference to nucleic acid molecules that transfer DNA segment(s) from one cell to another.
- The term “expression vector” or “expression cassette” as used herein refers to a recombinant DNA molecule containing a desired coding sequence and appropriate nucleic acid sequences necessary for the expression of the operably linked coding sequence in a particular host organism, Nucleic acid sequences necessary for expression in prokaryotes usually include a promoter, an operator (optional), and a ribosome binding site, often along with other sequences. Eukaryotic cells are known to utilize promoters, enhancers, and termination and polyadenylation signals.
- The terms “targeting vector” or “targeting construct” refer to oligonucleotide sequences comprising a gene of interest flanked on either side by a recognition sequence which is capable of homologous recombination of the DNA sequence located between the flanking recognition sequences.
- The terms “in operable combination”, “in operable order” and “operably linked” as used herein refer to the linkage of nucleic acid sequences in such a manner that a nucleic acid molecule capable of directing the transcription of a given gene and/or the synthesis of a desired protein molecule is produced. The term also refers to the linkage of amino acid sequences in such a manner so that a functional protein is produced.
- The term “transformation” as used herein refers to the introduction of foreign DNA into cells. Transformation of a plant cell may be accomplished by a variety of means known in the art including particle mediated gene transfer (see, e.g., U.S. Pat. No. 5,584,807 hereby incorporated by reference); infection with an Agrobacterium strain containing the foreign DNA for random integration (U.S. Pat. No. 4,940,838 hereby incorporated by reference) or targeted integration (U.S. Pat. No. 5,501,967 hereby incorporated by reference) of the foreign DNA into the plant cell genome; electroinjection (Nan et al. (1995) In “Biotechnology in Agriculture and Forestry,” Ed. Y. P. S. Bajaj, Springer-Verlag Berlin Heidelberg, Vol 34:145-155; Griesbach (1992) HortScience 27:620); fusion with liposomes, lysosomes, cells, minicells or other fusible lipid-surfaced bodies (Fraley et al. (1982) Proc. Natl. Acad. Sci. USA 79:1859-1863; polyethylene glycol (Krens et al. (1982) Nature 296:72-74); chemicals that increase free DNA uptake; transformation using virus, and the like.
- The terms “infecting” and “infection” with a bacterium refer to co-incubation of a target biological sample, (e.g., cell, tissue, etc.) with the bacterium under conditions such that nucleic acid sequences contained within the bacterium are introduced into one or more cells of the target biological sample.
- The term “Agrobacterium” refers to a soil-borne, Gram-negative, rod-shaped phytopathogenic bacterium which causes crown gall. The term “Agrobacterium” includes, but is not limited to, the strains Agrobacterium tumefaciens, (which typically causes crown gall in infected plants), and Agrobacterium rhizogens (which causes hairy root disease in infected host plants). Infection of a plant cell with Agrobacterium generally results in the production of opines (e.g., nopaline, agropine, octopine etc.) by the infected cell. Thus, Agrobacterium strains which cause production of nopaline (e.g., strain LBA4301, C58, A208) are referred to as “nopaline-type” Agrobacteria; Agrobacterium strains which cause production of octopine (e.g.,′ strain LBA4404, Ach5, B6) are referred to as “octopinc-type” Agrobacteria; and Agrobacterium strains which cause production of agropine (e.g., strain EHA105, EHA101, A281) are referred to as “agropine-type” Agrobacteria.
- The terms “bombarding, “bombardment,” and “biolistic bombardment” refer to the process of accelerating particles towards a target biological sample (e.g., cell, tissue, etc.) to effect wounding of the cell membrane of a cell in the target biological sample and/or entry of the particles into the target biological sample. Methods for biolistic bombardment are known in the art (e.g., U.S. Pat. No. 5,584,807, the contents of which are herein incorporated by reference), and are commercially available (e.g., the helium gas-driven microprojectile accelerator (PDS-1000/He) (BioRad).
- The term “microwounding” when made in reference to plant tissue refers to the introduction of microscopic wounds in that tissue. Microwounding may be achieved by, for example, particle or biolistic bombardment.
- The term “transgenic” when used in reference to a plant cell refers to a plant cell which comprises a transgene, or whose genome has been altered by the introduction of a transgene. The term “transgenic” when used in reference to a plant refers to a plant which comprises one or more cells which contain a transgene, or whose genome has been altered by the introduction of a transgene. These transgenic cells and transgenic plants may be produced by several methods including the introduction of a “transgene” comprising nucleic acid (usually DNA) into a target cell or integration into a chromosome of a target cell by way of human intervention, such as by the methods described herein.
- The term “transgene” as used herein refers to any nucleic acid sequence which is introduced into the genome of a plant cell by experimental manipulations. A transgene may be an “endogenous DNA sequence,” or a “heterologous DNA sequence” (i.e., “foreign DNA”). The term “endogenous DNA sequence” refers to a nucleotide sequence which is naturally found in the cell into which it is introduced so long as it does not contain some modification (e.g., a point mutation, the presence of a selectable marker gene, etc.) relative to the naturally-occurring sequence. The term “heterologous DNA sequence” refers to a nucleotide sequence which is ligated to, or is manipulated to become ligated to, a nucleic acid sequence to which it is not ligated in nature, or to which it is ligated at a different location in nature. Heterologous DNA is not endogenous to the cell into which it is introduced, but has been obtained from another cell. Heterologous DNA also includes an endogenous DNA sequence which contains some modification. Generally, although not necessarily, heterologous DNA encodes RNA and proteins that are not normally produced by the cell into which it is expressed. Examples of heterologous DNA include reporter genes, transcriptional and translational regulatory sequences, selectable marker proteins (e.g., proteins which confer drug resistance), etc.
- As used herein, the term “probe” when made in reference to an oligonucleotide (i.e., a sequence of nucleotides) refers to an oligonucleotide, whether occurring naturally as in a purified restriction digest or produced synthetically, recombinantly or by PCR amplification, which is capable of hybridizing to another oligonucleotide of interest. A probe may be single-stranded or double-stranded. Probes are useful in the detection, identification and isolation of particular gene sequences. Oligonucleotide probes may be labelled with a “reporter molecule,” so that the probe is detectable using a detection system. Detection systems include, but are not limited to, enzyme, fluorescent, radioactive, and luminescent systems.
- The term “selectable marker” as used herein, refer to a gene which encodes an enzyme having an activity that confers resistance to an antibiotic or drug upon the cell in which the selectable marker is expressed. Selectable markers may be “positive” or “negative.” Examples of positive selectable markers include the neomycin phosphotrasferase (NPTII) gene which confers resistance to G418 and to kanamycin, and the bacterial hygromycin phosphotransferase gene (hyg), which confers resistance to the antibiotic hygromycin. Negative selectable markers encode an enzymatic activity whose expression is cytotoxic to the cell when grown in an appropriate selective medium. For example, the HSV-tk gene is commonly used as a negative selectable marker. Expression of the HSV-tk gene in cells grown in the presence of gancyclovir or acyclovir is cytotoxic; thus, growth of cells in selective medium containing gancyclovir or acyclovir selects against cells capable of expressing a functional HSV TK enzyme.
- The terms “promoter element,” “promoter,” or “promoter sequence” as used herein, refer to a DNA sequence that is located at the 5′ end (i.e. precedes) the protein coding region of a DNA polymer. The location of most promoters known in nature precedes the transcribed region. The promoter functions as a switch, activating the expression of a gene. If the gene is activated, it is said to be transcribed, or participating in transcription. Transcription involves the synthesis of mRNA from the gene. The promoter, therefore, serves as a transcriptional regulatory element and also provides a site for initiation of transcription of the gene into mRNA.
- The term “amplification” is defined as the production of additional copies of a nucleic acid sequence and is generally carried out using polymerase chain reaction technologies well known in the art [Dieffenbach C W and G S Dveksler (1995) PCR Primer, a Laboratory Manual, Cold Spring Harbor Press, Plainview N.Y.]. As used herein, the term “polymerase chain reaction” (“PCR”) refers to the method of K. B. Mullis disclosed in U.S. Pat. Nos. 4,683,195, 4,683,202 and 4,965,188, all of which are hereby incorporated by reference, which describe a method for increasing the concentration of a segment of a target sequence in a mixture of genomic DNA without cloning or purification. This process for amplifying the target sequence consists of introducing a large excess of two oligonucleotide primers to the DNA mixture containing the desired target sequence, followed by a precise sequence of thermal cycling in the presence of a DNA polymerase. The two primers are complementary to their respective strands of the double stranded target sequence. To effect amplification, the mixture is denatured and the primers then annealed to their complementary sequences within the target molecule. Following annealing, the primers are extended with a polymerase so as to form a new pair of complementary strands. The steps of denaturation, primer annealing and polymerase extension can be repeated many times (i.e., denaturation, annealing and extension constitute one “cycle”; there can be numerous “cycles”) to obtain a high concentration of an amplified segment of the desired target sequence. The length of the amplified segment of the desired target sequence is determined by the relative positions of the primers with respect to each other, and therefore, this length is a controllable parameter. By virtue of the repeating aspect of the process, the method is referred to as the “polymerase chain reaction” (hereinafter “PCR”). Because the desired amplified segments of the target sequence become the predominant sequences (in terms of concentration) in the mixture, they are said to be “PCR amplified.”
- With PCR, it is possible to amplify a single copy of a specific target sequence in genomic DNA to a level detectable by several different methodologies (e.g., hybridization with a labeled probe; incorporation of biotinylated primers followed by avidin-enzyme conjugate detection; and/or incorporation of 32P-labeled deoxyribonucleotide triphosphates, such as dCTP or dATP, into the amplified segment). In addition to genomic DNA, any oligonucleotide sequence can be amplified with the appropriate set of primer molecules. In particular, the amplified segments created by the PCR process itself are, themselves, efficient templates for subsequent PCR amplifications. Amplified target sequences may be used to obtain segments of DNA (e.g., genes) for the construction of targeting vectors, transgenes, etc.
- The present invention contemplates using amplification techniques such as PCR to obtain the cDNA (or portions thereof) of plant genes encoding plant gums and other hydroxyproline-rich polypeptides. In one embodiment, primers are designed using the synthetic gene sequences (e.g. containing sequences encoding particular motifs) described herein and PCR is carried out (using genomic DNA or other source of nucleic acid from any plant capable of producing a gum exudate) under conditions of low stringency. In another embodiment, PCR is carried out under high stringency. The amplified products can be run out on a gel and isolated from the gel.
- The term “hybridization” as used herein refers to any process by which a strand of nucleic acid joins with a complementary strand through base pairing [Coombs J (1994) Dictionary of Biotechnology, Stockton Press, New York N.Y.].
- As used herein, the terms “complementary” or “complementarity” when used in reference to polynucleotides refer to polynucleotides which are related by the base-pairing rules. For example, for the sequence 5′-AGT-3′ is complementary to the sequence 5′-ACT-3′. Complementarity may be “partial,” in which only some of the nucleic acids' bases are matched according to the base pairing rules. Or, there may be “complete” or “total” complementarity between the nucleic acids. The degree of complementarity between nucleic acid strands has significant effects on the efficiency and strength of hybridization between nucleic acid strands. This is of particular importance in amplification reactions, as well as detection methods which depend upon binding between nucleic acids.
- The term “homology” when used in relation to nucleic acids refers to a degree of complementarity. There may be partial homology or complete homology (i.e., identity). A partially complementary sequence is one that at least partially inhibits a completely complementary sequence from hybridizing to a target nucleic acid is referred to using the functional term “substantially homologous.” The inhibition of hybridization of the completely complementary sequence to the target sequence may be examined using a hybridization assay (Southern or Northern blot, solution hybridization and the like) under conditions of low stringency. A substantially homologous sequence or probe will compete for and inhibit the binding (i.e., the hybridization) of a sequence which is completely homologous to a target under conditions of low stringency. This is not to say that conditions of low stringency are such that non-specific binding is permitted; low stringency conditions require that the binding of two sequences to one another be a specific (i.e., selective) interaction. The absence of non-specific binding may be tested by the use of a second target which lacks even a partial degree of complementarity (e.g., less than about 30% identity); in the absence of non-specific binding the probe will not hybridize to the second non-complementary target.
- Low stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5×SSPE (43.8 g/l NaCl, 6.9 g/l NaH2PO4.H2O and 1.85 g/I EDTA, pH adjusted to 7.4 with NaOH), 0.1% SDS, 5×Denhardt's reagent [50×Denhardt's contains per 500 ml: 5 g Ficoll (Type 400, Pharmacia), 5 g BSA (Fraction V; Sigma)] and 100 μg/ml denatured salmon sperm DNA followed by washing in a solution comprising 5×SSPE, 0.1% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
- High stringency conditions when used in reference to nucleic acid hybridization comprise conditions equivalent to binding or hybridization at 42° C. in a solution consisting of 5×SSPE (43.8 g/l NaCl, 6.9 g/l NaH2PO4.H2O and 1.85 g/I EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5×Denhardt's reagent and 100 μg/ml denatured salmon sperm DNA followed by washing in a solution comprising 0.1×SSPE, 1.0% SDS at 42° C. when a probe of about 500 nucleotides in length is employed.
- When used in reference to nucleic acid hybridization the art knows well that numerous equivalent conditions may be employed to comprise either low or high stringency conditions; factors such as the length and nature (DNA, RNA, base composition) of the probe and nature of the target (DNA, RNA, base composition, present in solution or immobilized, etc.) and the concentration of the salts and other components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene glycol) are considered and the hybridization solution may be varied to generate conditions of either low or high stringency hybridization different from, but equivalent to, the above listed conditions.
- “Stringency” when used in reference to nucleic acid hybridization typically occurs in a range from about Tm-5° C. (5° C. below the Tm of the probe) to about 20° C. to 25° C. below Tm. As will be understood by those of skill in the art, a stringent hybridization can be used to identify or detect identical polynucleotide sequences or to identify or detect similar or related polynucleotide sequences. Under “stringent conditions” a nucleic acid sequence of interest will hybridize to its exact complement and closely related sequences.
- As used herein, the term “fusion protein” refers to a chimeric protein containing the protein of interest (i.e., GAGP and fragments thereof) joined to an exogenous protein fragment (the fusion partner which consists of a non-GAGP sequence). The fusion partner may provide a detectable moiety, may provide an affinity tag to allow purification of the recombinant fusion protein from the host cell, or both. If desired, the fusion protein may be removed from the protein of interest (i.e. GAGP protein or fragments thereof) by a variety of enzymatic or chemical means known to the art.
- As used herein the term “non-gum arabic glycoprotein” or “non-gum arabic glycoprotein sequence” refers to that portion of a fusion protein which comprises a protein or protein sequence which is not derived from a gum arabic glycoprotein.
- The term “protein of interest” as used herein refers to the protein whose expression is desired within the fusion protein. In a fusion protein the protein of interest (e.g., GAGP) will be joined or fused with another protein or protein domain (e.g., GFP), the fusion partner, to allow for enhanced stability of the protein of interest and/or ease of purification of the fusion protein.
- As used herein, the term “purified” or “to purify” refers to the removal of contaminants from a sample. For example, recombinant HRGP polypeptides, including HRGP-GFP fusion proteins are purified by the removal of host cell components such as nucleic acids, lipopolysaccharide (e.g., endotoxin).
- The term “recombinant DNA molecule” as used herein refers to a DNA molecule which is comprised of segments of DNA joined together by means of molecular biological techniques.
- The term “recombinant protein” or “recombinant polypeptide” as used herein refers to a protein molecule which is expressed from a recombinant DNA molecule.
- As used herein the term “portion” when in reference to a protein (as in “a portion of a given protein”) refers to fragments of that protein. The fragments may range in size from four amino acid residues to the entire amino acid sequence minus one amino acid.
- The present invention relates generally to the field of plant gums and other hydroxyproline-rich glycoproteins, and in particular, to the expression of synthetic genes designed from repetitive peptide sequences. The hydroxyproline-rich glycoprotein (HRGP) superfamily is ubiquitous in the primary cell wall or extracellular matrix throughout the plant kingdom. Family members are diverse in structure and implicated in all aspects of plant growth and development. This includes plant responses to stress imposed by pathogenesis and mechanical wounding.
- Plant HRGPs have no known animal homologues. Furthermore, hydroxyproline residues are O-glycosylated in plant glycoproteins but never in animals. At the molecular level the function of these unique plant glycoproteins remains largely unexplored.
- HRGPS are, to a lesser or greater extent, extended, repetitive, modular proteins. The modules are small (generally 4-6 residue motifs), usually glycosylated, with most HRGPs being made up of more than one type of repetitive module. For purposes of constructing the synthetic genes of the present invention, it is useful to view the glycosylated polypeptide modules not merely as peptides or oligosaccharides but as small functional units.
- The description of the invention involves A) the design of the polypeptide of interest, B) the production of synthetic genes encoding the polypeptide of interest, C) the construction of the expression vectors, D) selection of the host cells, and E) introduction of the expression construct into a particular cell (whether in vitro or in vivo).
- The present invention contemplates polypeptides that are fragments of hydroxyproline-rich glycoproteins (HRGPs), repetitive proline-rich proteins (RPRPs) and arabino-galactan proteins (AGPs). The present invention contemplates portions of HRGPs comprising one or more of the highly conserved Ser-Hyp4 motif(s). The present invention also contemplates portions of RPRPs comprising one or more of the pentapeptide motif: Pro-Hyp-Val-Tyr-Lys. The present invention also contemplates portions of AGPs comprising one or more Xaa-Hyp-Xaa-Hyp repeats.
- While an understanding of the natural mechanism of glycosylation is not required for the successful operation of the present invention, it is believed that in GAGP and other HRGPs, repetitive Xaa-Hyp blocks constitute a Hyp-glycosylation code where Hyp occurring in contiguous blocks (Xaa-Hyp-Hyp) and Hyp occurring in non-contiguous Hyp repeats is recognized by different enzymes: arabinosyltransferases and galactosyltransferases, respectively.
- The RPRPs (and some nodulins) consist of short repetitive blocks (e.g. Soybean RPRP1: [POVYK]n where O=Hyp) containing the least amount of contiguous Hyp. They also exemplify the low end of the glycosylation range with relatively few Hyp residues arabinosylated and no arabinogalactan polysaccharide. For example, in soybean RPRP1, L-arabinofuranose is attached to perhaps only a single Hyp residue in the molecule.
- The Extensins occupy an intermediate position in the glycosylation continuum, containing about 50% carbohydrate which occurs mainly as Hyp-arabinosides (1-4 Ara residues), but not as Hyp-arabinogalactan polysaccharide. Extensins contain the repetitive, highly arabinosylated, diagnostic Ser-Hyp4 glycopeptide module. The precise function of this module is unknown, but earlier work indicates that these blocks of arabinosylated Hyp help stabilize the extended polyproline-II helix of the extensins. Monogalactose also occurs on the Ser residues.
- The classical Ser-Hyp4 glycopeptide module is of special interest. A tetra-L-arabinofuranosyl oligosaccharide is attached to each Hyp residue in the block. Three uniquely b-linked arabinofuranosyl residues and an a-linked nonreducing terminus comprise the tetraarabinooligosaccharide. While an understanding of the natural mechanism of glycosylation is not required for the successful operation of the present invention, it is believed that the arabinosylated Hyp residues together with the single galactosyl-serine residue undoubtedly form a unique molecular surface topography which interacts with and is recognized by other wall components, possibly including itself. Shorter blocks of Hyp, namely Hyp3 and Hyp2, lack the fourth (a-linked) arabinose residue, again suggesting that the fourth Ara unique to the Hyp4 block, has a special role and is presented for recognition or cleavage.
- At the high end of the glycosylation range (˜90% sugar), the arabinogalactan-proteins (AGPs) and the related gum arabic glycoprotein (GAGP) are uniquely glycosylated with arabinogalactan polysaccharides. GAGP and all AGPs so far characterized by Hyp-glycoside profiles contain Hyp-linked arabinosides assigned to contiguous Hyp residues by the Hyp contiguity hypothesis. However these glycoproteins also uniquely contain (Xaa-Hyp-Xaa-Hyp) repeats. These repeats are putative polysaccharide attachment sites.
- The present invention contemplates in particular fragments of gum arabic glycoprotein (GAGP). As noted above, GAGP has been largely refractory to chemical analysis. The largest peptide obtained and sequenced from gum arabic was a peptide of twelve (12) amino acids having the sequence Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Hyp-Gly-Pro. C. L. Delonnay, “Determination of the Protein Constituent Of Gum Arabic” Master of Science Thesis (1993). The present invention contemplates using this Delonnay sequence as well as (heretofore undescribed) larger peptide fragments of GAGP (and variants thereof) for the design of synthetic genes. In this manner, “designer plant gums” can be produced (“designer extensins” are also contemplated).
- In one embodiment, the present invention contemplates a substantially purified polypeptide comprising at least a portion of the amino acid consensus sequence Ser-Hyp-Hyp-Hyp-[Hyp/Thr]-Leu-Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Hyp-Gly-Pro-His (SEQ ID NO:1) or variants thereof. While an understanding of the natural mechanism of glycosylation is not required for the successful operation of the present invention, it is believed that this GAGP 19-residue consensus repeat (which contains both contiguous Hyp and non-contiguous Hyp repeats) is glycosylated in native GAGP with both Hyp-arabinosides and Hyp-polysaccharide in molar ratios. It is further believed that the high molecular weight protein component of gum arabic (i.e. GAGP) is responsible for the remarkable emulsifying and stabilizing activity exploited by the food and soft drink industries.
- The present invention contemplates involves the use of synthetic genes engineered for the expression of repetitive glycopeptide modules in cells, including but not limited to callus and suspension cultures. It is not intended that the present invention be limited by the precise number of repeats.
- In one embodiment, the present invention contemplates the nucleic acid sequences encoding the consensus sequence for GAGP (i.e. SEQ ID NO:1) or variants thereof may be used as a repeating sequence between two (2) and up to fifty (50) times, more preferably between ten (10) and up to thirty (30) times, and most preferably approximately twenty (20) times. The nucleic acid sequence encoding the consensus sequence (i.e. SEQ ID NO:1) or variants thereof may be used as contiguous repeats or may be used as non-contiguous repeats.
- In designing any HRGP gene cassette the following guidelines are employed. Cassette design reflects the following:
- 1) Minimization of the repetitive nature of the coding sequence while still taking into account the HRGP codon bias of the host plant (e.g., when tomato is the host plant, the codon usage bias of the tomato which favors CCA and CCT [but not CCG] for Pro residues, and TCA and TCC for Ser residues is employed). Zea mays (such as corn) and perhaps other graminaceous monocots (e.g. rice barley, wheat and all grasses) prefer CCG and CCC for proline; GTC and CTT for valine; and AAG for lysine. Dicots (including legumes) prefer CCA and CCT for proline and TCA and TCT for serine.
- 2) Minimization of strict sequence periodicity.
- 3) Non-palindromic ends are used for the monomers and end linkers to assure proper “head-to-tail” polymerization.
- 4) The constructs contain no internal restriction enzyme recognition sites for the restriction enzymes employed for the insertion of these sequences into expression vectors or during subsequent manipulations of such vectors. Typically, the 5′ linker contains a XmaI site downstream of the BamHI site used for cloning into the cloning vector (e.g., pbluescript). The XmaI site is used for insertion of the HRGP gene cassette into the expression vector (e.g., pBI121-Sig-EGFP). Typically, the 3′ linker contains a AgeI site upstream of the EcoRI site used for cloning into the cloning vector (e.g., pbluescript). The AgeI site is used for insertion of the HRGP gene cassette into the expression vector. (For plasmid pBI121-Sig—which does not contain GFP for the fusion protein—the same signal sequence is used, but the 3′ linkers contain an Sst I restriction site for insertion as an Xma I/Sst I fragment behind the signal sequence and before the NOS terminator.
- 5) The oligonucleotides used are high quality (e.g., from GibcoBRL, Operon) and have been purified away from unwanted products of the synthesis.
- 6) The TM of correctly aligned oligomers is greater than the TM of possible dimers, hairpins or crossdimers.
- It is not intended that the present invention be limited by the nature of the expression vector. A variety of vectors are contemplated. In one embodiment, two plant transformation vectors are prepared, both derived from pBI121 (Clontech). Both contain an extensin signal sequence for transport of the constructs through the ER/Golgi for posttranslational modification. A first plasmid construct contained Green Fluorescent Protein (GFP) as a reporter protein instead of GUS. A second plasmid does not contain GFP.
- pBI121 is the Jefferson vector in which the BamHI and SstI sites can be used to insert foreign DNA between the 35S CaMV promoter and the termination/polyadenylation signal from the nopaline synthase gene (NOS-ter) of the Agrobacterium Ti plasmid); it also contains an RK2 origin of replication, a kanamycin resistance gene, and the GUS reporter gene.
- Signal Sequences. As noted above, the GUS sequence is replaced (via BamHI/SstI) with a synthetic DNA sequence encoding a peptide signal sequence based on the extensin signal sequences of Nicotiana plumbaginifolia and N. tabacum
- MGKMASLFATFLVVLVSLSLAQTTRVVPVASSAP
- The DNA sequence also contains 15 bp of the 5′ untranslated region, and restriction sites for Bam HI in its 5′terminus and Sst I in its extreme 3′ terminus for insertion into pBI121 in place of GUS. An XmaI restriction site occurs 16 bp upstream from the Sst I site to allow subsequent insertion of EGFP into the plasmid as a Xma I/Sst I fragment.
- The sequence underlined above is known to target N. plumbaginifolia extensin fusion proteins through the ER and Golgi for post-translational modifications, and finally to the wall. The signal sequence proposed also involves transport of extensins and extensin modules in the same plant family (Solanaceae). Alternatively, one can use the signal sequence from tomato P1 extensin itself.
-
TABLE 1 GFP MUTANTs WAVELENGTH (nm) MUTANT Excitation Emitting mGFPX10; F99S, M153T, V163A Excites at 395 mGFPX10-5 Excites at 489 Emits at 508 GFPA2; I167T Excites at 471 GFPB7; Y66H Excites at 382 Emits at 440 (blue fluorescence) GFPX10-C7; F99S, M153T, Excites at 395 V163A, I167T, S175G and 473 GFPX10-D3; F99S, M153T, Excites at 382 Emits at 440 V163A, Y66H
Addition of GFP. The repetitive HRGP-modules can be expressed as GFP fusion products rather than GUS fusions, and can also be expressed as modules without GFP. Fusion with a green fluorescent protein reporter gene appropriately red-shifted for plant use, e.g. EGFP (an S65T variant recommended for plants by Clontech) or other suitable mutants (see Table 1 above) allows the detection of <700 GFP molecules at the cell surface. GFP requires aerobic conditions for oxidative formation of the fluorophore. It works well at the lower temperatures used for plant cell cultures and normally it does not adversely affect protein function although it may allow the regeneration of plants only when targeted to the ER.
Promoters. As noted above, it is not intended that the present invention be limited by the nature of the promoter(s) used in the expression constructs. The CaMV35S promoter is preferred, although it is not entirely constitutive and expression is “moderate”. In some embodiments, higher expression of the constructs is desired to enhance the yield of HRGP modules; in such cases a plasmid with “double” CaMV35S promoters is employed. - A variety of host cells are contemplated (both eukaryotic and prokaryotic). It is not intended that the present invention be limited by the host cells used for expression of the synthetic genes of the present invention. Plant host cells are preferred, including but not limited to legumes (e.g. soy beans) and solanaceous plants (e.g. tobacco).
- The present invention is not limited by the nature of the plant cells. All sources of plant tissue are contemplated, including but not limited to seeds. Seeds of flowering plants consist of an embryo, a seed coat, and stored food. When fully formed, the embryo consists basically of a hypocotyl-root axis bearing either one or two cotyledons and an apical meristem at the shoot apex and at the root apex. The cotyledons of most dicots are fleshy and contain the stored food of the seed. In other dicots and most monocots, food is stored in the endosperm and the cotyledons function to absorb the simpler compounds resulting from the digestion of the food.
- It is also not intended that the present invention be limited to only certain types of plants. Both monoctyledons and disctyledons are contemplated. Monoctyledons include grasses, lilies, irises, orchids, cattails, palms. Dicotyledons include almost all the familiar trees and shrubs (other than confers) and many of the herbs (non-woody plants).
- Tomato cultures are the ideal recipients for repetitive HRGP modules to be hydroxylated and glycosylated: Tomato is readily transformed. The cultures produce cell surface HRGPs in high yields easily eluted from the cell surface of intact cells and they possess the required posttranslational enzymes unique to plants—HRGP prolyl hydroxylases, hydroxyproline O-glycosyltransferases and other specific glycosyltransferases for building complex polysaccharide side chains. Furthermore, tomato genetics, and tomato leaf disc transformation/plantlet regeneration are well worked out.
- Expression constructs of the present invention may be introduced into host cells (e.g. plant cells) using methods known in the art. In one embodiment, the expression constructs are introduced into plant cells by particle mediated gene transfer. Particle mediated gene transfer methods are known in the art, are commercially available, and include, but are not limited to, the gas driven gene delivery instrument descried in McCabe, U.S. Pat. No. 5,584,807, the entire contents of which are herein incorporated by reference. This method involves coating the nucleic acid sequence of interest onto heavy metal particles, and accelerating the coated particles under the pressure of compressed gas for delivery to the target tissue.
- Other particle bombardment methods are also available for the introduction of heterologous nucleic acid sequences into plant cells. Generally, these methods involve depositing the nucleic acid sequence of interest upon the surface of small, dense particles of a material such as gold, platinum, or tungsten. The coated particles are themselves then coated onto either a rigid surface, such as a metal plate, or onto a carrier sheet made of a fragile material such as mylar. The coated sheet is then accelerated toward the target biological tissue. The use of the flat sheet generates a uniform spread of accelerated particles which maximizes the number of cells receiving particles under uniform conditions, resulting in the introduction of the nucleic acid sample into the target tissue.
- Alternatively, an expression construct may be inserted into the genome of plant cells by infecting them with a bacterium, including but not limited to an Agrobacterium strain previously transformed with the nucleic acid sequence of interest. Generally, disarmed Agrobacterium cells are transformed with recombinant Ti plasmids of Agrobacterium tumefaciens or Ri plasmids of Agrobacterium rhizogenes (such as those described in U.S. Pat. No. 4,940,838, the entire contents of which are herein incorporated by reference) which are constructed to contain the nucleic acid sequence of interest using methods well known in the art (Sambrook, J. et al., (1989) supra). The nucleic acid sequence of interest is then stably integrated into the plant genome by infection with the transformed Agrobacterium strain. For example, heterologous nucleic acid sequences have been introduced into plant tissues using the natural DNA transfer system of Agrobacterium tumefaciens and Agrobacterium rhizogenes bacteria (for review, see Klee et al. (1987) Ann. Rev. Plant Phys. 38:467-486).
- One of skill in the art knows that the efficiency of transformation by Agrobacterium may be enhanced by using a number of methods known in the art. For example, the inclusion of a natural wound response molecule such as acetosyringone (AS) to the Agrobacterium culture has been shown to enhance transformation efficiency with Agrobacterium tumefaciens [Shahla et al. (1987) Plant Molec. Biol. 8:291-298]. Alternatively, transformation efficiency may be enhanced by wounding the target tissue to be transformed. Wounding of plant tissue may be achieved, for example, by punching, maceration, bombardment with microprojectiles, etc. [see, e.g., Bidney et al. (1992) Plant Molec. Biol. 18:301-313].
- It may be desirable to target the nucleic acid sequence of interest to a particular locus on the plant genome. Site-directed integration of the nucleic acid sequence of interest into the plant cell genome may be achieved by, for example, homologous recombination using Agrobacterium-derived sequences. Generally, plant cells are incubated with a strain of Agrobacterium which contains a targeting vector in which sequences that are homologous to a DNA sequence inside the target locus are flanked by Agrobacterium transfer-DNA (T-DNA) sequences, as previously described (Offring a et al., (1996), U.S. Pat. No. 5,501,967, the entire contents of which are herein incorporated by reference). One of skill in the art knows that homologous recombination may be achieved using targeting vectors which contain sequences that are homologous to any part of the targeted plant gene, whether belonging to the regulatory elements of the gene, or the coding regions of the gene. Homologous recombination may be achieved at any region of a plant gene so long as the nucleic acid sequence of regions flanking the site to be targeted is known.
- Where homologous recombination is desired, the targeting vector used may be of the replacement- or insertion-type (Offring a et al. (1996), supra). Replacement-type vectors generally contain two regions which are homologous with the targeted genomic sequence and which flank a heterologous nucleic acid sequence, e.g., a selectable marker gene sequence. Replacement type vectors result in the insertion of the selectable marker gene which thereby disrupts the targeted gene. Insertion-type vectors contain a single region of homology with the targeted gene and result in the insertion of the entire targeting vector into the targeted gene.
- Other methods are also available for the introduction of expression constructs into plant tissue, e.g., electroinjection (Nan et al. (1995) In “Biotechnology in Agriculture and Forestry,” Ed. Y. P. S. Bajaj, Springer-Verlag Berlin Heidelberg, Vol 34:145-155; Griesbach (1992) HortScience 27:620); fusion with liposomes, lysosomes, cells, minicells or other fusible lipid-surfaced bodies (Fraley et al. (1982) Proc. Natl. Acad. Sci. USA 79:1859-1863; polyethylene glycol (Krens et al. (1982) Nature 296:72-74); chemicals that increase free DNA uptake; transformation using virus, and the like.
- In one embodiment, the present invention contemplates introducing nucleic acid via the leaf disc transformation method. Horsch et al. Science 227:1229-1231 (1985). Briefly, disks are punched from the surface of sterilized leaves and submerged with gentle shaking into a culture of A. tumefaciens that had been grown overnight in luria broth at 28° C. The disks are then blotted dry and placed upside-down onto nurse culture plates to induce the regeneration of shoots. Following 2-3 days, the leaf disks are transferred to petri plates containing the same media without feeder cells or filter papers, but in the presence of carbenicillin (500 μg/ml) and kanamycin (300 μg/ml) to select for antibiotic resistance. 2-4 weeks later, the shoots that developed are removed from calli and placed into root-inducing media with the appropriate antibiotic. These shoots were then further transplanted into soil following the presence of root formation.
- The following examples serve to illustrate certain preferred embodiments and aspects of the present invention and are not to be construed as limiting the scope thereof.
- In the experimental disclosure which follows, the following abbreviations apply: g (gram); mg (milligrams); μg (microgram); M (molar); mM (milliMolar); μM (microMolar); nm (nanometers); L (liter); ml (milliliter); μl (microliters); ° C. (degrees Centigrade); m (meter); sec. (second); DNA (deoxyribonucleic acid); cDNA (complementary DNA); RNA (ribonucleic acid); mRNA (messenger ribonucleic acid); X-gal (5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside); LB (Luria Broth), PAGE (polacrylamide gel electrophoresis); NAA (α-naphtaleneacetic acid); BAP (6-benzyl aminopurine); Tris (tris(hydroxymethyl)-aminomethane); PBS (phosphate buffered saline); 2×SSC (0.3 M NaCl, 0.03 M Na3citrate, pH 7.0); Agri-Bio Inc. North Miami, Fla.); Analytical Scientific Instruments (Alameda, Calif.); BioRad (Richmond, Calif.); Clontech (Palo Alto Calif.); Delmonte Fresh Produce (Kunia, Hawaii); Difco Laboratories (Detroit, Mich.); Dole Fresh Fruit (Wahiawa, Hawaii); Dynatech Laboratory Inc. (Chantilly Va.); Gibco BRL (Gaithersburg, Md.); Gold fio Technology, Inc. (St. Louis, Mo.); GTE Corp. (Danvers, Mass.); MSI Corp. (Micron Separations, Inc., Westboro, Mass.); Operon (Operon Technolies, Alameda, Calif.); Pioneer Hi-Bred International, Inc. (Johnston, Iowa); 5 Prime 3 Prime (Boulder, Colo.); Sigma (St. Louis, Mo.); Promega (Promega Corp., Madison, Wis.); Stratagene (Stratagene Cloning Systems, La Jolla, Calif.); USB (U.S. Biochemical, Cleveland, Ohio).
- In this example, GAGP was isolated and (by using chymotrypsin) the deglycosylated polypeptide backbone was prepared. Although GAGP does not contain the usual chymotryptic cleavage sites, it does contain leucyl and histidyl residues which are occasionally cleaved. Chymotrypsin cleaved sufficient of these “occasionally cleaved” sites to produce a peptide map of closely related peptides.
- Purification and Deglycosylation of GAGP. GAGP was isolated via preparative Superose-6 gel filtration. Anhydrous hydrogen fluoride deglycosylated it (20 mg powder/mL HF at 4° C., repeating the procedure twice to ensure complete deglycosylation), yielding dGAGP which gave a single symmetrical peak (data not shown) after rechromatography on Superose-6. Further purification of dGAGP by reverse phase chromatography also gave a single major peak, showing a highly biased but constant amino acid composition in fractions sampled across the peak. These data indicated that dGAGP was a single polypeptide component sufficiently pure for sequence analysis.
Sequence Analysis. An incomplete pronase digest gave a large peptide PRP3 which yielded a partial sequence (Table 2) containing all the amino acids present in the suggested dGAGP repeat motif. In view of the limitations of pronase, for further peptide mapping and to obtain more definitive sequence information, dGAGP was digested with chymotrypsin, followed by a two-stage HPLC fractionation scheme. Initial separation of the chymotryptides on a PolySULFOETHYL A™ (designated PSA, PolyLC, Inc. Ellicott City, Md.) cation exchanger yielded three major fractions: S1 and S2 increased with digestion time while S3 showed a concomitant decrease. Further chromatography on PRP-1 resolved PSA fractions S1 and S2 into several peptides. -
TABLE 2 AMINO ACID SEQUENCES OF THE GUM ARABIC GLYCOPROTEIN POLYPEPTIDE BACKBONE Peptide Sequence S1P5 Ser-Hyp-Hyp-Hyp-Hyp-Leu-Ser-Hyp-Ser-Leu-Thr-Hyp-Thr-Hyp-Hyp-Leu-Gly-Pro-(Pro) S1P3 Ser-Hyp-Hyp-Hyp-Hyp-Leu-Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Leu-Gly-Pro-(Pro) S3 Ser-Hyp-Hyp-Hyp-Thr-Leu-Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Hyp-Gly-Pro-His-Ser- Hyp-Hyp-Hyp-(Hyp) S1P2 Ser-Hyp-Hyp-Hyp-Ser-Leu-Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Thr-Gly-Pro-His S2P1 Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Hyp-Gly-Pro-His S2P2a Ser-Hyp-Ser-Hyp-Ala-Hyp-Thr-Hyp-Hyp-Leu-Gly-Pro-His S2P2b Ser-Hyp-Leu-Pro-Thr-Hyp-Thr-Hyp-Hyp-Leu-Gly-Pro-His S2P3a Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Leu-Gly-Pro-His S2P4 Ser-Hyp-Hyp-Leu-Thr-Hyp-Thr-Hyp-Hyp-Leu-Leu-Pro-His S1P4 Ser-Hyp-Leu-Pro-Thr-Leu-Ser-Hyp-Leu-Pro-Ala/Thr-Hyp-Thr-Hyp-Hyp-Hyp-Gly-Pro-His Consensus: Ser-Hyp-Hyp-Hyp-Thr/Hyp-Leu-Ser-Hyp-Ser-Hyp-Thr-Hyp-Thr-Hyp-Hyp-Leu-Gly-Pro-His ↑ ↑ ↑ ↑ ↑ ↑ ↑ ↑ (Leu)(Pro)(Ser) (Leu)(Leu)(Ala) (Hyp) (Pro)
Edman degradation showed that these chymotryptides were closely related to each other, to the partial sequence of the large pronase peptide (Table 2), and to the major pronase peptide of GAGP isolated earlier by Delonnay (see above). Indeed, all can be related to a single 19-residue consensus sequence with minor variation in some positions (Table 2). These peptides also reflect the overall amino acid composition and are therefore evidence of a highly repetitive polypeptide backbone with minor variations in the repetitive motif; these include occasional substitution of Leu for Hyp and Ser. Remarkably, fifteen residues of the consensus sequence are “quasi-palindromic” i.e. the side chain sequence is almost the same whether read from the N-terminus or C-terminus. - Synthetic gene cassettes encoding contiguous and noncontiguous Hyp modules are constructed using partially overlapping sets consisting of oligonucleotide pairs, “internal repeat pairs” and “external 3′- and 5′-linker pairs” respectively, all with complementary “sticky” ends. The design strategy for the repetitive HRGP modules combines proven approaches described earlier for the production in E. coli of novel repetitive polypeptide polymers (McGrath et al. [1990] Biotechnol. Prog. 6:188), of a repetitious synthetic analog of the bioadhesive precursor protein of the mussel Mytilus edulis, of a repetitive spider silk protein (Lewis et al. [1996] Protein Express. Purif. 7:400), and of a highly repetitive elastin-like polymer in tobacco [Zhang, X., Urry, D. W., and Daniell, H. “Expression of an environmentally friendly synthetic protein-based polymer gene in transgenic tobacco plants,” Plant Cell Reports, 16: 174 (1996)].
- The basic design strategy for synthetic HRGP gene cassettes is illustrate by the following illustrative constructs.
- a) Ser-Hyp4 Gene Cassette
- A synthetic gene encoding the extensin-like Ser-Hyp4 module is constructed using the following partially overlapping sets of oligonucleotide pairs.
-
5′-Linker: Amino Acid: A G S S T R A S P (P P P) 5′-GCT GGA TCC TCA ACC CGG GCC TCA CCA CGA CCT AGG AGT TGG GCC CCG AGT GGT GGT GGT GGA-5′ 3′ Linker (for pBI121-Sig-EGFP): Amino Acid: P P P S P V A R N S P P 5′-CCA CCA CCT TCA CCG GTC GCC CGG AAT TCA CCA CCC AGT GGC CAG CGG GCC TTA AGT GGT GGG-5′ 3′ Linker (for pBI121-Sig: Amino Acid: 5′-CCA CCA CCT TAA TAG AGC TCC CCC ATT ATC TCG AGG GGG-5′ Internal Repeat Amino Acid: P P P S P P P P S P 5′-CCA CCA CCT TCA CCT CCA CCC CCA TCT CCA AGT GGA GGT GGG GGT AGA GGT GGT GGT GGA-5′
Conversion of the “internal” and 5′ & 3′ “external” gene cassettes to long duplex DNA is accomplished using the following steps: -
- 1. Heat each pair of complementary oligonucleotides to 90° and then anneal by cooling slowly to 60° thereby forming short duplex internal and external DNAs.
- 2. Combine the 5′ external linker duplex with the internal repeat duplexes in an approximately 1:20 molar ratio and anneal by further cooling to yield long duplex DNA capped by the 5′ linker. The 5′ linker is covalently joined to the internal repeat duplex by ligation using T4 DNA ligase. (Preferrably up to 50, more preferrably up to 30, repeats of the internal repeat duplex can be used).
- 3. In molar excess, combine the 3′ external linker duplex with the above 5′ linker-internal repeat duplex, anneal and ligate as described above.
- 4. Digest the 5′ linker-internal repeat-3′ linker duplex with BamHI (cuts within the 5′-linker) and EcoRI (cuts within the 3′-linker).
- 5. Size fractionate the reaction products using Sephacryl gel permeation chromatography to select constructs greater than 90 bp.
- 6. Insert the sized, digested synthetic gene cassette into a plasmid having a polylinker containing BamHI and EcoRI sites (e.g., pbluescript SK+ or KS+ [Stratagene]).
- 7. Transform E. coli cells (e.g., by electroporation or the use of competent cells) with the plasmid into which the synthetic gene construct has been ligated.
- 8. Following E. coli transformation, the internal repeat oligonucleotides are used to screen and identify Ampicillin-resistant colonies carrying the synthetic gene construct.
- 9. The insert contained on the plasmids within the Ampicillin-resistant colonies are sequenced to confirm the fidelity of the synthetic gene construct.
- b) GAGP Consensus Sequence Cassette
- A synthetic gene cassette encoding the GAGP consensus sequence is generated as described above using the following 5′ linker, internal repeat and 3′ linker duplexes.
-
5′-Linker Amino Acid: A A G S S T R A (S P S) 5′-GCT GCC GGA TCC TCA ACC CGG GCC-3′ 3′-CGA CGG CCT AGG AGT TGG GCC CGG AGT GGC AGT-5′ 3′-Linker (for pBI121-Sig-EGFP Amino Acid: S P S P V A R N S PP 5′-TCA CCC TCA CCG GTC GCC CGG AAT TCA CCA CCC-3′ 3′GGC CAG CGG GCC TTA AGT GGT GGG-5′ 3′-Linker (for pBI121-Sig) Amino Acid: 5′-TCA CCC TCA TAA TAG AGC TCC CCC-3′ 3′ATT ATC TCG AGG GGG-5′ Internal Repeat Amino Acid: S P S P T P T P P P G P H S P P P T L 5′-TCA CCC TCA CCA ACT CCT ACC CCA CCA CCT GGT CCA CAC TCA CCA CCA CCA ACA TTG-3′ 3′-GGT TGA GGA TGG GGT GGT GGA CCA GGT GTG AGT GGT GGT TGT AAC AGT GGG AGT-5′ - Conversion of the “internal” AGP-like motif and 5′ & 3′ “external” gene cassettes to long duplex DNA is accomplished using the steps described in section a) above. Up to fifty (50) repeats of the internal repeat duplex are desirable (more preferrably up to thirty (30) repeats, and more preferrably approximately twenty (20) repeats) (i.e., the wild-type protein contains 20 of these repeats).
- Since the above GAGP internal repeat is a consensus sequence, it is also desirable to have repeats that comprise a repeat sequence that varies from the consensus sequence (see e.g. Table 2 above). In this regard, the variant sequences are likely to be glycosylated in a slightly different manner, which may confer different properties (e.g. more soluble etc.). Other constructs are shown for other illustrative modules in Table 3.
- In order to obtain the tomato P1 extensin signal sequence (i.e., signal peptide), P1 extensin cDNA clones were isolated using oligonucleotides designed after the P1-unique protein sequence: Val-Lys-Pro-Tyr-His-Pro-Thr-Hyp-Val-Tyr-Lys. When present at the N-terminus of a protein sequence, the P1 extensin signal sequence directs the nascent peptide chain to the ER.
- pBI121 is an expression vector which permits the high level expression and secretion of inserted genes in plant cells (e.g., tomato, tobacco, members of the genus Solanace, members of the family Leguminoseae, non-graminaceous monocots). pBI121 contains the 35S CaMV promoter, the tobbaco (Nicotiana plumbaginifolia) extensin signal sequence, a EGFP gene, the termination/polyadenylation signal from the nopaline synthetase gene (NOS-ter), a kanamycin-resistance gene (nptII) and the right and left borders of T-DNA to permit transfer into plants by Agrobacterium-mediated transformation.
-
TABLE 3 ILLUSTRATIVE HRGP SYNTHETIC GENE MODULES 1. MODULES FOR AGE-LIKE SEQUENCES a. The [SP]n Module [SP]n Internal Repeat Oligo's: 5′-TCA CCC TCA CCA TCT CCT TCG CCA TCA CCC GGT AGA GGA AGC GGT AGT GGG AGT GGG AGT-5′ The [SP]n 3′ & 5′ External Linkers for both plasmids are the same as for the GAGP module. b. The [AP]n Module [AP]n Internal Repeat Oligo's: 5′-GCT CCA GCA CCT GCC CCA GCC CCT GCA CCA-3′ GGA CGG GGT CGG GGA CGT GGT-5′ [AP]n External Linker Oligo's for plasmid pBI121-Sig-EGFP 5′-Linker: 5′-GCT GCC GGA TCC TCA ACC CGG 3′-CGA CGG CCT AGG AGT TGG GCC CGA GGT CGT-5′ 3′-Linker: 5′-GCT CCA GCA CCG GTC GCC CGG AAT TCA CCA CCC-3′ 3′- GGC CAG CGG GCC TTA AGT GGT GGG-5′ [AP]n External 3′ Linker Oligos for plasmid pBI121-Sig 5′-GCT CCA GCA TAA TAG AGC TCC CCC ATT ATC TCG AGG GGG-5′ c. The [TP]n Module [TP]n Internal Repeat Oligo's: 5′-ACA CCA ACC CCT ACT CCC ACG CCA ACA CCT ACA CCC ACT CCA GGA TGA GGG TGC GGT TGT GGA TCT GGG TGA GGT TGT GGT TGG-5′ [TP]n External Linker Oligo's for pBI121-Sig-EGFP: 5′Linker: 5′-GCT GCC GGA TCC TCA ACC CGG 3′-CGA CGG CCT AGG AGT TGG GCC TGT GGT TGG-5′ 3′Linker: 5′-ACA CCA ACC CCG GTC GCC CGG AAT TCA CCA CCC-3′ GGC CAG CGG GCC TTA AGT GGT GGG-5′ [TP]n External 3′ Linker Oligos for pBI121-Sig 5′-ACA CCA ACC TAA TAG AGC TCC CCC ATT ATC TCG AGG GGG-5′ 2. MODULES FOR EXTENSIN-LIKE SEQUENCES a. The [SPP]n Module [SPP]n Internal Repeat Oligo's: 5′-CCA CCA TCA CCA CCC TCT CCT CCA TCA CCC CCA TCC CCA CCA TCA GGT GGG AGA GGA GGT AGT GGG GGT AGG GGT GGT AGT GGT GGT AGT-5′ [SPP]n External Linkers for pBE121-Sig-EGFP: 5′ Linker: 5′-GCT GCC GGA TCC TCA ACC CGG GCC 3′-CGA CGG CCT AGG AGT TGG GCC CGG GGT GGT AGT-5′ 3′ Linker: 5′-CCA CCA TCA CCG GTC GCC CGG AAT TCA CCA CCC-3′ GGC CAG CGG GCC TTA AGT GGT GGG-5′ [SPP]n External 3′ Linker for pBE121-Sig: 5′-CCA CCA TCA TAA TAG AGC TCC CCC ATT ATC TCG AGG GGG-5′ b. The [SPPP]n Module [SPPP]n Internal Repeat Oligo's: 5′-CCA CCA CCT TCA CCA CCT CCA TCT CCC CCA CCT TCC CCT CCA CCA TCA AGT GGT GGA GGT AGA GGG GGT GGA AGG GGA GGT GGT AGT GGT GGT GGA-5′ [SPPP]n External Linker Oligo's for pBI121-Sig-EGFP: 5′-Linker: 5′-GCT GGA TCC TCA ACC CGG GCC TCA 3′-CGA CCT AGG AGT TGG GCC CGG AGT GGT GGT GGA-5′ 3′-Linker: 5′-CCA CCA CCT TCA CCG GTC GCC CGG AAT TCA CCA CCC-3′ AGT GGC CAG CGG GCC TTA AGT GGT GGG-5′ [SPPP]n External 3′ Linker Oligos for pBI121-Sig: 5′-CCA CCA CCT TAA TAG AGC TCC CCC ATT ATC TCG AGG GGG-5′ d. The P3-Type Extensin Palindromic Module: P3-Type Extensin Palindromic Internal Repeat Oligo's: 5′-CCA CCA CCT TCA CCC TCT CCA CCT CCA CCA TCT CCG TCA CCA AGT GGG AGA GGT GGA GGT GGT AGA GGC AGT GGT GGT GGT GGA-5′ P3-Type Extensin Palindromic External Linker Oligo's: Use the [SPPP]n linkers (SEE ABOVE) e. The Potato Lectin HRGP Palindromic Module: Potato Lectin HRGP Palindromic Internal Repeat Oligo's: 5′-CCA CCA CCT TCA CCC CCA TCT CCA CCT CCA CCA TCT CA CCG TCA CCA AGT GGG GGT AGA GGT GGA GGT GGT AGA GGT GGC AGT GGT GGT GGT GGA-5′ Potato Lectin HRGP Palindromic External Linker Oligo's: Use the [SPPP]n linkers (SEE ABOVE) f. P1-Extensin-Like Modules: i. The SPPPPTPVYK Module: SPPPPTPVYK Internal Repeat Oligo's: 5′-CCA CCA CCT ACT CCC GTT TAC AAA TCA CCA CCA CCA CCT ACT CCC GTT TAC AAA TCA CCA TGA GGG CAA ATG TTT AGT GGT GGT GGT GGA TGA GGG CAA ATG TTT AGT GGT GGT GGT GGA-5′ SPPPPTPVYK External Linker Oligo's: Use the [SPPP]n linkers (SEE ABOVE) ii. The SPPPPVKPYHPTPVFL Module: SPPPPVKPYHPTPVFL Internal Repeat Oligo's: 5′-CCA CCA CCT GTC AAG CCT TAC CAC CCC ACT CCC GTT TTT CTT TCA CCA CAG TTC GGA ATG GTG GGG TGA GGG CAA AAA GAA AGT GGT GGT GGT GGA-5′ SPPPPVKPYHPTPVFL External Linker Oligo's: Use the [SPPP]n linkers (SEE ABOVE) iii. The SPPPPVLPFHPTPVYK Module: SPPPPVLPFHPTPVYK Internal Repeat Oligo's: 5′-CCA CCA CCT GTC TTA CCT TTC CAC CCC ACT CCC GTT TAC AAA TCA CCA CAG AAT GGA AAG GTG GGG TGA GGG CAA ATG TTT AGT GGT GGT GGT GGA-5′ SPPPPVLPFHPTPVYK External Linker Oligo's: Use the [SPPP]n linkers (SEE ABOVE) EGFP 3′ Linker Oligo's needed to insert EGFP into pBI121-Sig-EGF 5′-GGC CGC GAG CTC CAG CAC GGG CG CTC GAG GTC GTG CCC-5′ - The presence of the extensin signal sequence at the N-terminus of proteins encoded by genes inserted into the pBI121 expression vector (e.g., HRGPs encoded by synthetic gene constructs). The tobacco signal sequence was demonstrated to target extensin fusion proteins through the ER and Golgi for posttranslational modifications, and finally to the wall. The targeted expression of recombinant HRGPs is not dependent upon the use of the tobacco extensin signal sequence. Signal sequences involved in the transport of extensins and extensin modules in the same plant family (Solanaceae) as tobacco may be employed; alternatively, the signal sequence from tomato P1 extensin may be employed.
- The EGFP gene encodes a green fluorescent protein (GFP) appropriately red-shifted for plant use (the EGFP gene encodes a S65T variant optimized for use in plants and is available from Clontech). Other suitable mutants may be employed (see Table 1). These modified GFPs allow the detection of less than 700 GFP molecules at the cell surface. The use of a GFP gene provides a reporter gene and permits the formation of fusion proteins comprising repetitive HRGP modules. GFPs require aerobic conditions for oxidative formation of the fluorophore. It is functional at the lower temperatures used for plant cell cultures, normally it does not adversely affect protein function.
- Plasmids pBI121-Sig and pBI121-Sig-EGFP are constructed as follows. For both plasmids, the GUS gene present in pBI121 (Clontech) is deleted by digestion with BamHI and SstI and a pair of partially complementary oligonucleotides encoding the tobacco extensin signal sequence is annealed to the BamHI and SstI ends. The oligonucleotides encoding the 21 amino acid extensin signal sequence have the following sequence. 5′-GA TCC GCA ATG GGA AAA ATG GCT TCT CTA TTT GCC ACA TTT TTA GTG GTT TTA GTG TCA CTT AGC TTA GCA CAA ACA ACC CGG GTA CCG GTC GCC ACC ATG GTG TAA AGC GGC CGC GAG CT-3′ (SEQ ID NO:) and 5′-C GCG GCC GCT TTA CAC CAT GGT GGC GAC CGG TAC CCG GGT TGT TTG TGC TAA GCT AAG TGA CAC TAA AAC CAC TAA AAA TGT GGC AAA TAG AGA AGC CAT TTT TCC CAT TGC G-3′ (SEQ ID NO:4). In addition to encoding the extensin signal sequence, this pair of oligonucleotides, when inserted into the digested pBI121 vector, provides a BamHI site (5′ end) and XmaI and SstI sites (3′ end). The XmaI and SstI sites allow the insertion of the GFP gene. The modified pBI121 vector lacking the GUS gene and containing the synthetic extensin signal sequence is termed pBI121-Sig. Proper construction of pBI121 is confirmed by DNA sequencing.
- The GFP gene (e.g., the EGFP gene) is inserted into pBI121-Sig to make pBI121-Sig-EGFP as follows. The EGFP gene is excised from pEGFP (Clontech) as a 1.48 kb XmaI/NotI fragment (base pairs 270 to 1010 in pEGFP). This 1.48 kb XmaI/NotI fragment is then annealed and ligated to a synthetic 3′ linker (see above). The EGFP-3′ linker is then digested with SstI to produce an XmaI/SstI EGFP fragment which in inserted into the XmaI/SstI site of pBI121-Sig to create pBI121-Sig-EGFP. The AgeI (discussed below), XmaI and SstI sites provide unique restriction enzyme sites. Proper construction of the plasmids is confirmed by DNA sequencing.
- The EGFP sequences in pBI121-Sig-EGFP contain an AgeI site directly before the translation start codon (i.e., ATG) of EGFP. Synthetic HRGP gene cassettes are inserted into the plasmid between the signal sequence and the EGFP gene sequences as XmaI/AgeI fragments; the HRGP gene cassettes are excised as XmaI/AgeI fragments from the pbluescript constructs described in Ex.2. Proper construction of HRGP-containing expression vectors is confirmed by DNA sequencing and/or restriction enzyme digestion.
- Expression of the synthetic HRGP gene cassettes is not dependent upon the use of the pBI121-Sig and pBI121-Sig-EGFP gene cassette. Analogous expression vectors containing other promoter elements functional in plant cells may be employed (e.g., the CaMV region IV promoter, ribulose-1,6-biphosphate (RUBP) carboxylase small subunit (ssu) promoter, the nopaline promoter, octopine promoter, mannopine promoter, the β-conglycinin promoter, the ADH promoter, heat shock promoters, tissue-specific promoters, e.g., promoters associated with fruit ripening, promoters regulated during seed ripening (e.g., promoters from the napin, phaseolin and glycinin genes). For example, expression vectors containing a promoter that directs high level expression of inserted gene sequences in the seeds of plants (e.g., fruits, legumes and cereals, including but not limited to corn, wheat, rice, tomato, potato, yam, pepper, sequash cucumbers, beans, peas, apple, cherry, peach, black locust, pine and maple trees) may be employed. Expression may also be carried out in green algae.
- In addition, alternative reporter genes may be employed in place of the GFP gene. Suitable reporter genes include β-glucuronidase (GUS), neomycin phosphotransferase II gene (nptII), alkaline phosphatase, luciferase, CAT (Chloramphenicol AcetylTransferase). Preferred reporter genes lack Hyp residues. Further, the proteins encoded by the synthetic HRGP genes need not be expressed as fusion proteins. This is readily accomplished using the pBI121-Sig vector.
- The present invention contemplates recombinant HRGPs encoded by expression vectors comprising synthetic HRGP gene modules are expressed in tomato cell suspension cultures. The expression of recombinant HRGPs in tomato cell suspension cultures is illustrated by the discussion provided below for recombinant GAGP expression.
- a) Expression of Recombinant GAGP
- An expression vector containing the synthetic GAGP gene cassette (capable of being expressed as a fusion with GFP or without GFP sequences) is introduced into tomato cell suspension cultures. A variety of means are known to the art for the transfer of DNA into tomato cell suspension cultures, including Agrobacterium-mediated transfer and biolistic transformation.
- Agrobacterium-mediated transformation: The present invention contemplates transforming both suspension cultured cells (Bonnie Best cultures) and tomato leaf discs by mobilizing the above-described plasmid constructions (and others) from E. coli into Agrobacterium tumefaciens strain LBA4404 via triparental mating. Positive colonies are used to infect tomato cultures or leaf discs (Lysopersicon esculentum). Transformed cells/plants are selected on MSO medium containing 500 mg/mL carbenicillin and 100 mg/mL kanamycin. Expression of GFP fusion products are conveniently monitored by fluoresence microscopy using a high Q FITC filter set (Chroma Technology Corp.). FITC conjugates (e.g. FITC-BSA) can be used along with purified recombinant GFP as controls for microscopy set-up. Cultured tomato cells show only very weak autofluorescence. Thus, one can readily verify the spatiotemporal expression of GFP-Hyp module fusion products.
- Transgenic cells/plants can be examined for transgene copy number and construct fidelity genomic Southern blotting and for the HRGP construct mRNA by northern blotting, using the internal repeat oligonucleotides as probes. Controls include tissue/plants which are untransformed, transformed with the pBI121 alone, pBI121 containing only GFP, and pBI121 having the signal sequence and GFP but no HRGP synthetic gene.
- Microprojectile bombardment: 1.6 M gold particles are coated with each appropriate plasmid construct DNA for use in a Biolistic particle delivery system to transform the tomato suspension cultures/callus or other tissue. Controls include: particles without DNA, particles which contain PBI121 only, and particles which contain PBI121 and GFP.
- b) Expression of Other HRGPs of Interest
- As noted above, the present invention contemplates expressing a variety of HRGPs, fragments and variants. Such HRGPs include, but are not limited to, RPRps, extensins, AGPs and other plant gums (e.g. gum Karaya, gum Tragacanth, gum Ghatti, etc.). HRGP chimeras include but are not limited to HRGP plant lectins, including the solanaceous lectins, plant chitinases, and proteins in which the HRGP portion serves as a spacer (such as in sunflower). The present invention specifically contemplates using the HRGP modules (described above) as spacers to link non-HRGP proteins (e.g. enzymes) together.
- From the above, it should be clear that the present invention provides a new approach and solution to the problem of producing plant gums. The approach is not dependent on environmental factors and greatly simplifies production of a variety of naturally-occurring gums, as well as designer gums.
Claims (21)
1-26. (canceled)
27. An isolated polynucleotide encoding an arabinogalactan protein.
28. The polynucleotide according to claim 27 , wherein the arabinogalactan protein comprises repeating modules.
29. The polynucleotide according to claim 28 , wherein the repeating modules are selected from the group consisting of (Ser-Pro)n (SEQ ID NO: 85), (Ala-Pro)n (SEQ ID NO: 86), and (Thr-Pro)n (SEQ ID NO:87), and wherein n is from 7 to 50.
30. The polynucleotide according to claim 29 , wherein the repeating modules are selected from the group consisting of (Ser-Pro)n (SEQ ID NO: 88), (Ala-Pro)n (SEQ ID NO: 89), and (Thr-Pro)n (SEQ ID NO: 90), and wherein n is 7, 20, 30, or 50.
31. The polynucleotide according to claim 30 , wherein the repeating module is (Ser-Pro)n (SEQ ID NO: 88), and wherein n is 7, 20, 30, or 50.
32. The polynucleotide according to claim 30 , wherein the repeating module is (Ala-Pro)n (SEQ ID NO: 89), and wherein n is 7, 20, 30, or 50.
33. The polynucleotide according to claim 30 , wherein the repeating module is (Thr-Pro)n (SEQ ID NO: 90), and wherein n is 7, 20, 30, or 50.
34. The polynucleotide according to claim 27 , further encoding a non-gum arabic glycoprotein.
35. The polynucleotide according to claim 34 , wherein the arabinogalactan protein comprises repeating modules selected from the group consisting of (Ser-Pro)n (SEQ ID NO: 85), (Ala-Pro)n (SEQ ID NO: 86), and (Thr-Pro)n (SEQ ID NO: 87), and wherein n is from 7 to 50.
36. The polynucleotide according to claim 35 , wherein the repeating module is (Ser-Pro)n (SEQ ID NO: 88), and wherein n is 7, 20, 30, or 50.
37. An isolated polypeptide comprising an arabinogalactan protein.
38. The polypeptide according to claim 37 , wherein the arabinogalactan protein comprises repeating modules.
39. The polypeptide according to claim 38 , wherein the repeating modules are selected from the group consisting of (Ser-Pro)n (SEQ ID NO: 85), (Ala-Pro)n (SEQ ID NO: 86), and (Thr-Pro)n (SEQ ID NO: 87), and wherein n is from 7 to 50.
40. The polypeptide according to claim 39 , wherein the repeating modules are selected from the group consisting of (Ser-Pro)n (SEQ ID NO: 88), (Ala-Pro)n (SEQ ID NO: 89), and (Thr-Pro)n (SEQ ID NO: 90), and wherein n is 7, 20, 30, or 50.
41. The polypeptide according to claim 40 , wherein the repeating module is (Ser-Pro)n (SEQ ID NO: 88), and wherein n is 7, 20, 30, or 50.
42. The polypeptide according to claim 40 , wherein the repeating module is (Ala-Pro)n (SEQ ID NO: 89), and wherein n is 7, 20, 30, or 50.
43. The polypeptide according to claim 40 , wherein the repeating module is (Thr-Pro)n (SEQ ID NO: 90), and wherein n is 7, 20, 30, or 50.
44. The polypeptide according to claim 37 , further comprising a non-gum arabic glycoprotein.
45. The polypeptide according to claim 44 , wherein the arabinogalactan protein comprises repeating modules selected from the group consisting of (Ser-Pro)n (SEQ ID NO: 85), (Ala-Pro)n (SEQ ID NO: 86), and (Thr-Pro)n (SEQ ID NO: 87), and wherein n is from 7 to 50.
46. The polypeptide according to claim 45 , wherein the repeating module is (Ser-Pro)n (SEQ ID NO: 88), and wherein n is 7, 20, 30, or 50.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/121,140 US20090030185A1 (en) | 1997-07-21 | 2008-05-15 | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US12/820,943 US8563687B2 (en) | 1997-07-21 | 2010-06-22 | Synthetic genes for plant gums and other hydroxyproline rich glycoproteins |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/897,556 US6570062B1 (en) | 1997-07-21 | 1997-07-21 | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US09/547,693 US6639050B1 (en) | 1997-07-21 | 2000-04-12 | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US10/418,032 US7378506B2 (en) | 1997-07-21 | 2003-04-16 | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US12/121,140 US20090030185A1 (en) | 1997-07-21 | 2008-05-15 | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
Related Parent Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/418,032 Division US7378506B2 (en) | 1997-07-21 | 2003-04-16 | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US10/418,032 Continuation US7378506B2 (en) | 1997-07-21 | 2003-04-16 | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/820,943 Continuation US8563687B2 (en) | 1997-07-21 | 2010-06-22 | Synthetic genes for plant gums and other hydroxyproline rich glycoproteins |
Publications (1)
Publication Number | Publication Date |
---|---|
US20090030185A1 true US20090030185A1 (en) | 2009-01-29 |
Family
ID=33309519
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/418,032 Expired - Fee Related US7378506B2 (en) | 1997-07-21 | 2003-04-16 | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US12/121,140 Abandoned US20090030185A1 (en) | 1997-07-21 | 2008-05-15 | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US12/820,943 Expired - Fee Related US8563687B2 (en) | 1997-07-21 | 2010-06-22 | Synthetic genes for plant gums and other hydroxyproline rich glycoproteins |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/418,032 Expired - Fee Related US7378506B2 (en) | 1997-07-21 | 2003-04-16 | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/820,943 Expired - Fee Related US8563687B2 (en) | 1997-07-21 | 2010-06-22 | Synthetic genes for plant gums and other hydroxyproline rich glycoproteins |
Country Status (4)
Country | Link |
---|---|
US (3) | US7378506B2 (en) |
EP (1) | EP1622635A4 (en) |
CA (1) | CA2522904A1 (en) |
WO (1) | WO2004094590A2 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080242834A1 (en) * | 2005-07-08 | 2008-10-02 | Ohio University | Methods of Predicting Hyp-Glycosylation Sites For Proteins Expressed and Secreted in Plant Cells, and Related Methods and Products |
US20080262198A1 (en) * | 2004-04-19 | 2008-10-23 | Ohio University | Cross-Linkable Glycoproteins and Methods of Making the Same |
US20100261874A1 (en) * | 1997-07-21 | 2010-10-14 | Ohio University | Synthetic genes for plant gums and other hydroxyproline rich glycoproteins |
US20110217766A1 (en) * | 2004-01-14 | 2011-09-08 | Ohio University | Methods of Producing Peptides in Plants and Peptides Produced Thereby |
US8871468B2 (en) | 1997-07-21 | 2014-10-28 | Ohio University | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6570062B1 (en) * | 1997-07-21 | 2003-05-27 | Ohio University | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US20060252120A1 (en) * | 2003-05-09 | 2006-11-09 | Kieliszewski Marcia J | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
DK1984344T3 (en) | 2005-12-29 | 2013-01-14 | Lexicon Pharmaceuticals Inc | Multicyclic amino acid derivatives and methods for their use |
WO2008097342A2 (en) * | 2006-07-31 | 2008-08-14 | Sigma-Aldrich Co. | Compositions and methods for isolation of biological molecules |
MX2018007680A (en) | 2015-12-22 | 2018-11-14 | Xl Protein Gmbh | Nucleic acids encoding repetitive amino acid sequences rich in proline and alanine residues that have low repetitive nucleotide sequences. |
EP3790977A1 (en) * | 2018-05-09 | 2021-03-17 | Gat Biosciences, S.L. | Gycomodule motifs and uses thereof |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5830747A (en) * | 1993-12-03 | 1998-11-03 | Cooperative Research Centre For Industrial Plant Biopolymers | Plant arabinogalactan protein (AGP) genes |
US6570062B1 (en) * | 1997-07-21 | 2003-05-27 | Ohio University | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US20040009555A1 (en) * | 1997-07-21 | 2004-01-15 | Ohio University, Technology Transfer Office, Technology And Enterprise Building | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
Family Cites Families (76)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US635776A (en) * | 1898-04-30 | 1899-10-31 | Steel Ball Company | Rotary steel-ball-forging machine. |
US3664925A (en) | 1970-02-20 | 1972-05-23 | Martin Sonenberg | Clinically active bovine growth hormone fraction |
US4056520A (en) | 1972-03-31 | 1977-11-01 | Research Corporation | Clinically active bovine growth hormone fraction |
IL58849A (en) | 1978-12-11 | 1983-03-31 | Merck & Co Inc | Carboxyalkyl dipeptides and derivatives thereof,their preparation and pharmaceutical compositions containing them |
US5034322A (en) | 1983-01-17 | 1991-07-23 | Monsanto Company | Chimeric genes suitable for expression in plant cells |
US5352605A (en) | 1983-01-17 | 1994-10-04 | Monsanto Company | Chimeric genes for transforming plant cells using viral promoters |
NL8300698A (en) | 1983-02-24 | 1984-09-17 | Univ Leiden | METHOD FOR BUILDING FOREIGN DNA INTO THE NAME OF DIABIC LOBAL PLANTS; AGROBACTERIUM TUMEFACIENS BACTERIA AND METHOD FOR PRODUCTION THEREOF; PLANTS AND PLANT CELLS WITH CHANGED GENETIC PROPERTIES; PROCESS FOR PREPARING CHEMICAL AND / OR PHARMACEUTICAL PRODUCTS. |
US4478827A (en) | 1983-05-09 | 1984-10-23 | The General Hospital Corporation | Renin inhibitors |
US4649191A (en) * | 1984-05-10 | 1987-03-10 | Gibson-Stephens Neuropharmaceuticals, Inc. | Conformationally constrained alpha-melanotropin analogs with specific central nervous system activity |
US4683202A (en) | 1985-03-28 | 1987-07-28 | Cetus Corporation | Process for amplifying nucleic acid sequences |
US4965188A (en) | 1986-08-22 | 1990-10-23 | Cetus Corporation | Process for amplifying, detecting, and/or cloning nucleic acid sequences using a thermostable enzyme |
US4683195A (en) | 1986-01-30 | 1987-07-28 | Cetus Corporation | Process for amplifying, detecting, and/or-cloning nucleic acid sequences |
US6774283B1 (en) | 1985-07-29 | 2004-08-10 | Calgene Llc | Molecular farming |
US4956282A (en) | 1985-07-29 | 1990-09-11 | Calgene, Inc. | Mammalian peptide expression in plant cells |
US6018030A (en) | 1986-11-04 | 2000-01-25 | Protein Polymer Technologies, Inc. | Peptides comprising repetitive units of amino acids and DNA sequences encoding the same |
US5763394A (en) | 1988-04-15 | 1998-06-09 | Genentech, Inc. | Human growth hormone aqueous formulation |
US6680426B2 (en) | 1991-01-07 | 2004-01-20 | Auburn University | Genetic engineering of plant chloroplasts |
CA2001774C (en) | 1988-10-28 | 2001-10-16 | James A. Wells | Method for identifying active domains and amino acid residues in polypeptides and hormone variants |
US5534617A (en) | 1988-10-28 | 1996-07-09 | Genentech, Inc. | Human growth hormone variants having greater affinity for human growth hormone receptor at site 1 |
NL8901932A (en) | 1989-07-26 | 1991-02-18 | Mogen Int | PRODUCTION OF heterologous PROTEINS IN PLANTS OR PLANTS. |
US5501967A (en) | 1989-07-26 | 1996-03-26 | Mogen International, N.V./Rijksuniversiteit Te Leiden | Process for the site-directed integration of DNA into the genome of plants |
US6583115B1 (en) | 1989-10-12 | 2003-06-24 | Ohio University/Edison Biotechnology Institute | Methods for treating acromegaly and giantism with growth hormone antagonists |
US6787336B1 (en) | 1989-10-12 | 2004-09-07 | Ohio University/Edison Biotechnology Institute | DNA encoding growth hormone antagonists |
US5350836A (en) | 1989-10-12 | 1994-09-27 | Ohio University | Growth hormone antagonists |
US5958879A (en) | 1989-10-12 | 1999-09-28 | Ohio University/Edison Biotechnology Institute | Growth hormone receptor antagonists and methods of reducing growth hormone activity in a mammal |
US5989894A (en) | 1990-04-20 | 1999-11-23 | University Of Wyoming | Isolated DNA coding for spider silk protein, a replicable vector and a transformed cell containing the DNA |
EP0550629A1 (en) | 1990-08-17 | 1993-07-14 | Genentech, Inc. | Metal ion mediated receptor binding of polypeptide hormones |
US5780279A (en) | 1990-12-03 | 1998-07-14 | Genentech, Inc. | Method of selection of proteolytic cleavage sites by directed evolution and phagemid display |
DE69129154T2 (en) | 1990-12-03 | 1998-08-20 | Genentech, Inc., South San Francisco, Calif. | METHOD FOR ENRICHING PROTEIN VARIANTS WITH CHANGED BINDING PROPERTIES |
EP0586549B1 (en) | 1991-05-10 | 2000-09-20 | Genentech, Inc. | Selecting ligand agonists and antagonists |
WO1992020713A1 (en) | 1991-05-15 | 1992-11-26 | The University Of Melbourne | Proline rich protein from nicotiana alata |
WO1993000109A1 (en) | 1991-06-28 | 1993-01-07 | Genentech, Inc. | Method of stimulating immune response using growth hormone |
US5641670A (en) | 1991-11-05 | 1997-06-24 | Transkaryotic Therapies, Inc. | Protein production and protein delivery |
US5474925A (en) | 1991-12-19 | 1995-12-12 | Agracetus, Inc. | Immobilized proteins in cotton fiber |
US6225080B1 (en) | 1992-03-23 | 2001-05-01 | George R. Uhl | Mu-subtype opioid receptor |
US5352596A (en) | 1992-09-11 | 1994-10-04 | The United States Of America As Represented By The Secretary Of Agriculture | Pseudorabies virus deletion mutants involving the EPO and LLT genes |
CA2154882A1 (en) | 1993-01-28 | 1994-08-04 | Robert Tjian | Tata-binding protein associated factors, nucleic acids encoding tafs, and methods of use |
FR2701952B1 (en) * | 1993-02-22 | 1995-03-31 | Adir | New cyclopeptide derivatives of angiopeptin, their preparation process and the pharmaceutical compositions containing them. |
ATE300609T1 (en) | 1994-01-21 | 2005-08-15 | Powderject Vaccines Inc | GAS ACTUATED ELEMENT FOR DISCHARGING GENETIC MATERIAL |
US5733771A (en) | 1994-03-14 | 1998-03-31 | University Of Wyoming | cDNAs encoding minor ampullate spider silk proteins |
US6080560A (en) | 1994-07-25 | 2000-06-27 | Monsanto Company | Method for producing antibodies in plant cells |
US5695971A (en) | 1995-04-07 | 1997-12-09 | Amresco | Phage-cosmid hybrid vector, open cos DNA fragments, their method of use, and process of production |
US5723755A (en) | 1995-05-16 | 1998-03-03 | Francis E. Lefaivre | Large scale production of human or animal proteins using plant bioreactors |
US6020169A (en) | 1995-07-20 | 2000-02-01 | Washington State University Research Foundation | Production of secreted foreign polypeptides in plant cell culture |
CA2658039A1 (en) | 1995-09-21 | 1997-03-27 | Genentech, Inc. | Human growth hormone variants |
AR006928A1 (en) | 1996-05-01 | 1999-09-29 | Pioneer Hi Bred Int | AN ISOLATED DNA MOLECULA CODING A GREEN FLUORESCENT PROTEIN AS A TRACEABLE MARKER FOR TRANSFORMATION OF PLANTS, A METHOD FOR THE PRODUCTION OF TRANSGENIC PLANTS, A VECTOR OF EXPRESSION, A TRANSGENIC PLANT AND CELLS OF SUCH PLANTS. |
US5821089A (en) | 1996-06-03 | 1998-10-13 | Gruskin; Elliott A. | Amino acid modified polypeptides |
US6548642B1 (en) | 1997-07-21 | 2003-04-15 | Ohio University | Synthetic genes for plant gums |
US7378506B2 (en) | 1997-07-21 | 2008-05-27 | Ohio University | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US20030204864A1 (en) | 2001-02-28 | 2003-10-30 | Henry Daniell | Pharmaceutical proteins, human therapeutics, human serum albumin, insulin, native cholera toxic b submitted on transgenic plastids |
US5994099A (en) | 1997-12-31 | 1999-11-30 | The University Of Wyoming | Extremely elastic spider silk protein and DNA coding therefor |
US6037456A (en) | 1998-03-10 | 2000-03-14 | Biosource Technologies, Inc. | Process for isolating and purifying viruses, soluble proteins and peptides from plant sources |
US20030167531A1 (en) | 1998-07-10 | 2003-09-04 | Russell Douglas A. | Expression and purification of bioactive, authentic polypeptides from plants |
WO2000026354A1 (en) | 1998-10-30 | 2000-05-11 | Novozymes A/S | Glycosylated proteins having reduced allergenicity |
PT1137789E (en) | 1998-12-09 | 2010-10-21 | Phyton Holdings Llc | A method for manufacturing glycoproteins having human-type glycosylation |
US6210950B1 (en) | 1999-05-25 | 2001-04-03 | University Of Medicine And Dentistry Of New Jersey | Methods for diagnosing, preventing, and treating developmental disorders due to a combination of genetic and environmental factors |
WO2001016339A1 (en) | 1999-08-27 | 2001-03-08 | University Of Guelph | Use of arabinogalactan protein fusion constructs in a method of expressing proteins and peptides in plants |
EP1246915A2 (en) | 1999-12-30 | 2002-10-09 | Maxygen Aps | Improved lysosomal enzymes and lysosomal enzyme activators |
US20020127652A1 (en) | 2000-02-11 | 2002-09-12 | Schambye Hans Thalsgard | Follicle stimulating hormones |
US20020174453A1 (en) | 2001-04-18 | 2002-11-21 | Henry Daniell | Production of antibodies in transgenic plastids |
US20020162135A1 (en) | 2001-04-18 | 2002-10-31 | Henry Daniell | Expression of antimicrobial peptide via the plastid genome to control phytopathogenic bacteria |
US20030041353A1 (en) | 2001-04-18 | 2003-02-27 | Henry Daniell | Mutiple gene expression for engineering novel pathways and hyperexpression of foreign proteins in plants |
WO2001075132A2 (en) | 2000-04-03 | 2001-10-11 | Monsanto Technology Llc | Method for producing authentic cytokines in plants |
DE60035337T2 (en) | 2000-05-12 | 2008-02-28 | Gpc Biotech Ag | Human peptides / proteins that cause or cause the killing of cells, including lymphoid tumor cells |
US20030036181A1 (en) | 2000-06-30 | 2003-02-20 | Okkels Jens Sigurd | Peptide extended glycosylated polypeptides |
CA2431035A1 (en) | 2000-11-06 | 2002-05-10 | Thrasos, Inc. | Computer method and apparatus for classifying objects |
US6987172B2 (en) | 2001-03-05 | 2006-01-17 | Washington University In St. Louis | Multifunctional single chain glycoprotein hormones comprising three or more β subunits |
US7173113B2 (en) | 2002-01-31 | 2007-02-06 | The Trustees Of Columbia University In The City Of New York | Long-acting hormone and growth factor compositions and uses thereof |
AU2004240553A1 (en) | 2003-05-09 | 2004-12-02 | Neose Technologies, Inc. | Compositions and methods for the preparation of human growth hormone glycosylation mutants |
US20060252120A1 (en) | 2003-05-09 | 2006-11-09 | Kieliszewski Marcia J | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
JP2005087172A (en) | 2003-09-19 | 2005-04-07 | Institute Of Physical & Chemical Research | Arabinogalactan-like O-linked glycosylation motif |
MXPA06008126A (en) | 2004-01-14 | 2008-02-14 | Univ Ohio | Methods of producing peptides/proteins in plants and peptides/proteins produced thereby. |
EP1751177A4 (en) | 2004-04-19 | 2008-07-16 | Univ Ohio | Cross-linkable glycoproteins and methods of making the same |
EP2357241B1 (en) | 2004-09-29 | 2015-03-04 | Collplant Ltd. | Collagen producing plants and methods of generating and using same |
WO2007008708A2 (en) | 2005-07-08 | 2007-01-18 | Ohio University | Methods of predicting hyp-glycosylation sites for proteins expressed and secreted in plant cells, and related methods and products |
WO2008008766A2 (en) | 2006-07-10 | 2008-01-17 | Ohio University | Co-expression of proline hydroxylases to facilitate hyp-glycosylation of proteins expressed and secreted in plant cells |
-
2003
- 2003-04-16 US US10/418,032 patent/US7378506B2/en not_active Expired - Fee Related
-
2004
- 2004-04-13 CA CA002522904A patent/CA2522904A1/en not_active Abandoned
- 2004-04-13 WO PCT/US2004/011174 patent/WO2004094590A2/en active Application Filing
- 2004-04-13 EP EP04759826A patent/EP1622635A4/en not_active Withdrawn
-
2008
- 2008-05-15 US US12/121,140 patent/US20090030185A1/en not_active Abandoned
-
2010
- 2010-06-22 US US12/820,943 patent/US8563687B2/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5830747A (en) * | 1993-12-03 | 1998-11-03 | Cooperative Research Centre For Industrial Plant Biopolymers | Plant arabinogalactan protein (AGP) genes |
US6570062B1 (en) * | 1997-07-21 | 2003-05-27 | Ohio University | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US20040009555A1 (en) * | 1997-07-21 | 2004-01-15 | Ohio University, Technology Transfer Office, Technology And Enterprise Building | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100261874A1 (en) * | 1997-07-21 | 2010-10-14 | Ohio University | Synthetic genes for plant gums and other hydroxyproline rich glycoproteins |
US8563687B2 (en) | 1997-07-21 | 2013-10-22 | Ohio University | Synthetic genes for plant gums and other hydroxyproline rich glycoproteins |
US8871468B2 (en) | 1997-07-21 | 2014-10-28 | Ohio University | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins |
US20110217766A1 (en) * | 2004-01-14 | 2011-09-08 | Ohio University | Methods of Producing Peptides in Plants and Peptides Produced Thereby |
US20110230404A1 (en) * | 2004-01-14 | 2011-09-22 | Ohio University | Glycoproteins Produced in Plants and Methods of Their Use |
US9006410B2 (en) | 2004-01-14 | 2015-04-14 | Ohio University | Nucleic acid for plant expression of a fusion protein comprising hydroxyproline O-glycosylation glycomodule |
US20080262198A1 (en) * | 2004-04-19 | 2008-10-23 | Ohio University | Cross-Linkable Glycoproteins and Methods of Making the Same |
US8623812B2 (en) | 2004-04-19 | 2014-01-07 | Ohio University | Cross-linkable glycoproteins and methods of making the same |
US20080242834A1 (en) * | 2005-07-08 | 2008-10-02 | Ohio University | Methods of Predicting Hyp-Glycosylation Sites For Proteins Expressed and Secreted in Plant Cells, and Related Methods and Products |
Also Published As
Publication number | Publication date |
---|---|
EP1622635A2 (en) | 2006-02-08 |
US8563687B2 (en) | 2013-10-22 |
WO2004094590A2 (en) | 2004-11-04 |
US7378506B2 (en) | 2008-05-27 |
WO2004094590A3 (en) | 2005-05-19 |
EP1622635A4 (en) | 2008-01-23 |
CA2522904A1 (en) | 2004-11-04 |
US20050074838A1 (en) | 2005-04-07 |
US20100261874A1 (en) | 2010-10-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8563687B2 (en) | Synthetic genes for plant gums and other hydroxyproline rich glycoproteins | |
US8871468B2 (en) | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins | |
US20030167533A1 (en) | Intein-mediated protein splicing | |
AU7446796A (en) | Modified bacillus thuringiensis gene for lepidopteran control in plants | |
US6548642B1 (en) | Synthetic genes for plant gums | |
US6570062B1 (en) | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins | |
US20030194809A1 (en) | Method of controlling site-specific recombination | |
CA2226889C (en) | An expression control sequence for general and effective expression of genes in plants | |
US20060252120A1 (en) | Synthetic genes for plant gums and other hydroxyproline-rich glycoproteins | |
US6214578B1 (en) | Method for the expressing foreign genes and vectors therefor | |
US7557264B2 (en) | Gossypium hirsutum tissue-specific promoters and their use | |
AU2002301020B2 (en) | Novel Synthetic Genes for Plant Gums | |
US20040117874A1 (en) | Methods for accumulating translocated proteins | |
US20040172688A1 (en) | Intein-mediated protein splicing | |
CA2324520A1 (en) | Plant promoter sequences and methods of use thereof | |
US7485772B2 (en) | Methods of suppressing flowering in transgenic plants | |
US20050081266A1 (en) | Modulation of storage organs |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |