CA2504010A1 - Sequence specific dna recombination in eukaryotic cells - Google Patents
Sequence specific dna recombination in eukaryotic cells Download PDFInfo
- Publication number
- CA2504010A1 CA2504010A1 CA002504010A CA2504010A CA2504010A1 CA 2504010 A1 CA2504010 A1 CA 2504010A1 CA 002504010 A CA002504010 A CA 002504010A CA 2504010 A CA2504010 A CA 2504010A CA 2504010 A1 CA2504010 A1 CA 2504010A1
- Authority
- CA
- Canada
- Prior art keywords
- sequence
- int
- recombination
- dna
- cell
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 210000003527 eukaryotic cell Anatomy 0.000 title claims abstract description 30
- 238000012270 DNA recombination Methods 0.000 title description 4
- 230000006798 recombination Effects 0.000 claims abstract description 303
- 238000005215 recombination Methods 0.000 claims abstract description 303
- 210000004027 cell Anatomy 0.000 claims abstract description 224
- 238000000034 method Methods 0.000 claims abstract description 111
- 108010061833 Integrases Proteins 0.000 claims abstract description 63
- 102100034343 Integrase Human genes 0.000 claims abstract description 56
- 241000701959 Escherichia virus Lambda Species 0.000 claims abstract description 33
- 239000002773 nucleotide Substances 0.000 claims abstract description 29
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 29
- 108090000623 proteins and genes Proteins 0.000 claims description 169
- 108020004414 DNA Proteins 0.000 claims description 115
- 101000607560 Homo sapiens Ubiquitin-conjugating enzyme E2 variant 3 Proteins 0.000 claims description 109
- 102100039936 Ubiquitin-conjugating enzyme E2 variant 3 Human genes 0.000 claims description 109
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 60
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 43
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 40
- 229920001184 polypeptide Polymers 0.000 claims description 39
- 101150031239 xis gene Proteins 0.000 claims description 37
- 238000006467 substitution reaction Methods 0.000 claims description 24
- 150000007523 nucleic acids Chemical class 0.000 claims description 22
- 102000039446 nucleic acids Human genes 0.000 claims description 10
- 108020004707 nucleic acids Proteins 0.000 claims description 10
- 210000004962 mammalian cell Anatomy 0.000 claims description 9
- 230000001404 mediated effect Effects 0.000 claims description 9
- 230000002441 reversible effect Effects 0.000 claims description 5
- 241000699800 Cricetinae Species 0.000 claims description 4
- 230000000295 complement effect Effects 0.000 claims description 4
- 239000003102 growth factor Substances 0.000 claims description 3
- 229940088597 hormone Drugs 0.000 claims description 3
- 239000005556 hormone Substances 0.000 claims description 3
- 206010035226 Plasma cell myeloma Diseases 0.000 claims description 2
- 101100109426 Rhodococcus fascians argJ gene Proteins 0.000 claims description 2
- 241000283984 Rodentia Species 0.000 claims description 2
- 201000000050 myeloid neoplasm Diseases 0.000 claims description 2
- 101150062334 int gene Proteins 0.000 claims 2
- 239000006143 cell culture medium Substances 0.000 claims 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 claims 1
- 101150046810 fis gene Proteins 0.000 claims 1
- 230000014509 gene expression Effects 0.000 description 56
- 102000004169 proteins and genes Human genes 0.000 description 56
- 235000018102 proteins Nutrition 0.000 description 54
- 238000006243 chemical reaction Methods 0.000 description 42
- 239000000758 substrate Substances 0.000 description 37
- 239000013598 vector Substances 0.000 description 36
- 230000010354 integration Effects 0.000 description 34
- 102000018120 Recombinases Human genes 0.000 description 25
- 108010091086 Recombinases Proteins 0.000 description 25
- 239000012634 fragment Substances 0.000 description 23
- 239000000047 product Substances 0.000 description 23
- 239000003550 marker Substances 0.000 description 22
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 21
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 21
- 238000004519 manufacturing process Methods 0.000 description 20
- 238000001890 transfection Methods 0.000 description 20
- 239000013612 plasmid Substances 0.000 description 17
- 239000013604 expression vector Substances 0.000 description 16
- 239000002609 medium Substances 0.000 description 14
- 230000000694 effects Effects 0.000 description 13
- 230000035897 transcription Effects 0.000 description 12
- 238000013518 transcription Methods 0.000 description 12
- 241000588724 Escherichia coli Species 0.000 description 11
- 238000012217 deletion Methods 0.000 description 11
- 230000037430 deletion Effects 0.000 description 11
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 10
- 238000003556 assay Methods 0.000 description 10
- 102000034287 fluorescent proteins Human genes 0.000 description 10
- 108091006047 fluorescent proteins Proteins 0.000 description 10
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 10
- 102000053602 DNA Human genes 0.000 description 9
- 238000004113 cell culture Methods 0.000 description 9
- 230000004927 fusion Effects 0.000 description 9
- 238000002744 homologous recombination Methods 0.000 description 9
- 230000006801 homologous recombination Effects 0.000 description 9
- 230000001105 regulatory effect Effects 0.000 description 9
- 230000002103 transcriptional effect Effects 0.000 description 9
- 108010051219 Cre recombinase Proteins 0.000 description 8
- 235000001014 amino acid Nutrition 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 238000002474 experimental method Methods 0.000 description 8
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 8
- 108020004999 messenger RNA Proteins 0.000 description 8
- 238000002965 ELISA Methods 0.000 description 7
- 102000014150 Interferons Human genes 0.000 description 7
- 108010050904 Interferons Proteins 0.000 description 7
- 108010022394 Threonine synthase Proteins 0.000 description 7
- 229940024606 amino acid Drugs 0.000 description 7
- 150000001413 amino acids Chemical class 0.000 description 7
- 230000003115 biocidal effect Effects 0.000 description 7
- 230000001965 increasing effect Effects 0.000 description 7
- 229940079322 interferon Drugs 0.000 description 7
- 108091026890 Coding region Proteins 0.000 description 6
- 108010046276 FLP recombinase Proteins 0.000 description 6
- 108010025815 Kanamycin Kinase Proteins 0.000 description 6
- 102000004419 dihydrofolate reductase Human genes 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 5
- 101710155857 C-C motif chemokine 2 Proteins 0.000 description 5
- 102000000018 Chemokine CCL2 Human genes 0.000 description 5
- 241000282414 Homo sapiens Species 0.000 description 5
- 108010015268 Integration Host Factors Proteins 0.000 description 5
- 241001465754 Metazoa Species 0.000 description 5
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 5
- 125000003275 alpha amino acid group Chemical group 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 5
- 229960000074 biopharmaceutical Drugs 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- 238000012761 co-transfection Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 239000001963 growth medium Substances 0.000 description 5
- 230000001976 improved effect Effects 0.000 description 5
- 230000002123 temporal effect Effects 0.000 description 5
- 229940104230 thymidine Drugs 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- 102000003390 tumor necrosis factor Human genes 0.000 description 5
- 241001515965 unidentified phage Species 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 101150074155 DHFR gene Proteins 0.000 description 4
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 4
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 102000006646 aminoglycoside phosphotransferase Human genes 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000004520 electroporation Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 239000003623 enhancer Substances 0.000 description 4
- 238000010353 genetic engineering Methods 0.000 description 4
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Natural products O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 4
- 238000002703 mutagenesis Methods 0.000 description 4
- 231100000350 mutagenesis Toxicity 0.000 description 4
- 102000040430 polynucleotide Human genes 0.000 description 4
- 108091033319 polynucleotide Proteins 0.000 description 4
- 239000002157 polynucleotide Substances 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 238000012552 review Methods 0.000 description 4
- 230000001225 therapeutic effect Effects 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- 230000005758 transcription activity Effects 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 3
- 238000001712 DNA sequencing Methods 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 101150066002 GFP gene Proteins 0.000 description 3
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 3
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 3
- 102000012330 Integrases Human genes 0.000 description 3
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 3
- 241000699666 Mus <mouse, genus> Species 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- 229930193140 Neomycin Natural products 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- 238000002105 Southern blotting Methods 0.000 description 3
- 102000006601 Thymidine Kinase Human genes 0.000 description 3
- 108020004440 Thymidine kinase Proteins 0.000 description 3
- 239000003242 anti bacterial agent Substances 0.000 description 3
- 238000007845 assembly PCR Methods 0.000 description 3
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 210000000349 chromosome Anatomy 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 229940088598 enzyme Drugs 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 238000001114 immunoprecipitation Methods 0.000 description 3
- 238000012744 immunostaining Methods 0.000 description 3
- 230000002779 inactivation Effects 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 235000011073 invertase Nutrition 0.000 description 3
- 229960000485 methotrexate Drugs 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 229960004927 neomycin Drugs 0.000 description 3
- 210000001938 protoplast Anatomy 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 238000003127 radioimmunoassay Methods 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 230000000638 stimulation Effects 0.000 description 3
- 238000001262 western blot Methods 0.000 description 3
- 108091033380 Coding strand Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 108091035707 Consensus sequence Proteins 0.000 description 2
- 230000004544 DNA amplification Effects 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- 238000012413 Fluorescence activated cell sorting analysis Methods 0.000 description 2
- 101000897480 Homo sapiens C-C motif chemokine 2 Proteins 0.000 description 2
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 2
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 2
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 2
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 2
- 102000004877 Insulin Human genes 0.000 description 2
- 108090001061 Insulin Proteins 0.000 description 2
- 108090000723 Insulin-Like Growth Factor I Proteins 0.000 description 2
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 238000000636 Northern blotting Methods 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 239000012980 RPMI-1640 medium Substances 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 102000006382 Ribonucleases Human genes 0.000 description 2
- 108010083644 Ribonucleases Proteins 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 102000013275 Somatomedins Human genes 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 210000004507 artificial chromosome Anatomy 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 108091092328 cellular RNA Proteins 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 230000004186 co-expression Effects 0.000 description 2
- 239000000356 contaminant Substances 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 239000012228 culture supernatant Substances 0.000 description 2
- 238000000684 flow cytometry Methods 0.000 description 2
- 230000005714 functional activity Effects 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-L glutamate group Chemical group N[C@@H](CCC(=O)[O-])C(=O)[O-] WHUUTDBJXJRKMK-VKHMYHEASA-L 0.000 description 2
- 102000046768 human CCL2 Human genes 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- 108010002685 hygromycin-B kinase Proteins 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000007901 in situ hybridization Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 229940125396 insulin Drugs 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 239000001573 invertase Substances 0.000 description 2
- 238000001638 lipofection Methods 0.000 description 2
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 238000000520 microinjection Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 210000000287 oocyte Anatomy 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 229920000642 polymer Polymers 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 230000004936 stimulating effect Effects 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 239000013589 supplement Substances 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- NWUYHJFMYQTDRP-UHFFFAOYSA-N 1,2-bis(ethenyl)benzene;1-ethenyl-2-ethylbenzene;styrene Chemical compound C=CC1=CC=CC=C1.CCC1=CC=CC=C1C=C.C=CC1=CC=CC=C1C=C NWUYHJFMYQTDRP-UHFFFAOYSA-N 0.000 description 1
- QRBLKGHRWFGINE-UGWAGOLRSA-N 2-[2-[2-[[2-[[4-[[2-[[6-amino-2-[3-amino-1-[(2,3-diamino-3-oxopropyl)amino]-3-oxopropyl]-5-methylpyrimidine-4-carbonyl]amino]-3-[(2r,3s,4s,5s,6s)-3-[(2s,3r,4r,5s)-4-carbamoyl-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-4,5-dihydroxy-6-(hydroxymethyl)- Chemical compound N=1C(C=2SC=C(N=2)C(N)=O)CSC=1CCNC(=O)C(C(C)=O)NC(=O)C(C)C(O)C(C)NC(=O)C(C(O[C@H]1[C@@]([C@@H](O)[C@H](O)[C@H](CO)O1)(C)O[C@H]1[C@@H]([C@](O)([C@@H](O)C(CO)O1)C(N)=O)O)C=1NC=NC=1)NC(=O)C1=NC(C(CC(N)=O)NCC(N)C(N)=O)=NC(N)=C1C QRBLKGHRWFGINE-UGWAGOLRSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- UZOVYGYOLBIAJR-UHFFFAOYSA-N 4-isocyanato-4'-methyldiphenylmethane Chemical compound C1=CC(C)=CC=C1CC1=CC=C(N=C=O)C=C1 UZOVYGYOLBIAJR-UHFFFAOYSA-N 0.000 description 1
- YXHLJMWYDTXDHS-IRFLANFNSA-N 7-aminoactinomycin D Chemical compound C[C@H]1OC(=O)[C@H](C(C)C)N(C)C(=O)CN(C)C(=O)[C@@H]2CCCN2C(=O)[C@@H](C(C)C)NC(=O)[C@H]1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=C(N)C=C3C(=O)N[C@@H]4C(=O)N[C@@H](C(N5CCC[C@H]5C(=O)N(C)CC(=O)N(C)[C@@H](C(C)C)C(=O)O[C@@H]4C)=O)C(C)C)=C3N=C21 YXHLJMWYDTXDHS-IRFLANFNSA-N 0.000 description 1
- 108700012813 7-aminoactinomycin D Proteins 0.000 description 1
- 241000242763 Anemonia Species 0.000 description 1
- 244000105975 Antidesma platyphyllum Species 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 102100023927 Asparagine synthetase [glutamine-hydrolyzing] Human genes 0.000 description 1
- 108010070255 Aspartate-ammonia ligase Proteins 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 241000006720 Clavularia sp. Species 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 241000006271 Discosoma sp. Species 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 102400001368 Epidermal growth factor Human genes 0.000 description 1
- 101800003838 Epidermal growth factor Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 102000003951 Erythropoietin Human genes 0.000 description 1
- 108090000394 Erythropoietin Proteins 0.000 description 1
- 108010022894 Euchromatin Proteins 0.000 description 1
- 108091092566 Extrachromosomal DNA Proteins 0.000 description 1
- 108010008177 Fd immunoglobulins Proteins 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 208000034951 Genetic Translocation Diseases 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 1
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 1
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 108010034791 Heterochromatin Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 1
- -1 IL-14 Proteins 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 102000012745 Immunoglobulin Subunits Human genes 0.000 description 1
- 108010079585 Immunoglobulin Subunits Proteins 0.000 description 1
- 102000015271 Intercellular Adhesion Molecule-1 Human genes 0.000 description 1
- 108010064593 Intercellular Adhesion Molecule-1 Proteins 0.000 description 1
- 102000000589 Interleukin-1 Human genes 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 102000003814 Interleukin-10 Human genes 0.000 description 1
- 108090000174 Interleukin-10 Proteins 0.000 description 1
- 108090000177 Interleukin-11 Proteins 0.000 description 1
- 102000003815 Interleukin-11 Human genes 0.000 description 1
- 102000013462 Interleukin-12 Human genes 0.000 description 1
- 108010065805 Interleukin-12 Proteins 0.000 description 1
- 102000003816 Interleukin-13 Human genes 0.000 description 1
- 108090000176 Interleukin-13 Proteins 0.000 description 1
- 102000003812 Interleukin-15 Human genes 0.000 description 1
- 108090000172 Interleukin-15 Proteins 0.000 description 1
- 102000049772 Interleukin-16 Human genes 0.000 description 1
- 101800003050 Interleukin-16 Proteins 0.000 description 1
- 108050003558 Interleukin-17 Proteins 0.000 description 1
- 102000013691 Interleukin-17 Human genes 0.000 description 1
- 102000003810 Interleukin-18 Human genes 0.000 description 1
- 108090000171 Interleukin-18 Proteins 0.000 description 1
- 102000000588 Interleukin-2 Human genes 0.000 description 1
- 108010002350 Interleukin-2 Proteins 0.000 description 1
- 102000000646 Interleukin-3 Human genes 0.000 description 1
- 108010002386 Interleukin-3 Proteins 0.000 description 1
- 102000004388 Interleukin-4 Human genes 0.000 description 1
- 108090000978 Interleukin-4 Proteins 0.000 description 1
- 102000000743 Interleukin-5 Human genes 0.000 description 1
- 108010002616 Interleukin-5 Proteins 0.000 description 1
- 102000004889 Interleukin-6 Human genes 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 102000000704 Interleukin-7 Human genes 0.000 description 1
- 108010002586 Interleukin-7 Proteins 0.000 description 1
- 108090001007 Interleukin-8 Proteins 0.000 description 1
- 102000004890 Interleukin-8 Human genes 0.000 description 1
- 102000000585 Interleukin-9 Human genes 0.000 description 1
- 108010002335 Interleukin-9 Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 239000007760 Iscove's Modified Dulbecco's Medium Substances 0.000 description 1
- ZQISRDCJNBUVMM-UHFFFAOYSA-N L-Histidinol Natural products OCC(N)CC1=CN=CN1 ZQISRDCJNBUVMM-UHFFFAOYSA-N 0.000 description 1
- 229930182816 L-glutamine Natural products 0.000 description 1
- ZQISRDCJNBUVMM-YFKPBYRVSA-N L-histidinol Chemical compound OC[C@@H](N)CC1=CNC=N1 ZQISRDCJNBUVMM-YFKPBYRVSA-N 0.000 description 1
- 108010046938 Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102000007651 Macrophage Colony-Stimulating Factor Human genes 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 108091007491 NSP3 Papain-like protease domains Proteins 0.000 description 1
- 102000011931 Nucleoproteins Human genes 0.000 description 1
- 108010061100 Nucleoproteins Proteins 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 238000010222 PCR analysis Methods 0.000 description 1
- 108090000526 Papain Proteins 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- BELBBZDIHDAJOR-UHFFFAOYSA-N Phenolsulfonephthalein Chemical compound C1=CC(O)=CC=C1C1(C=2C=CC(O)=CC=2)C2=CC=CC=C2S(=O)(=O)O1 BELBBZDIHDAJOR-UHFFFAOYSA-N 0.000 description 1
- LTQCLFMNABRKSH-UHFFFAOYSA-N Phleomycin Natural products N=1C(C=2SC=C(N=2)C(N)=O)CSC=1CCNC(=O)C(C(O)C)NC(=O)C(C)C(O)C(C)NC(=O)C(C(OC1C(C(O)C(O)C(CO)O1)OC1C(C(OC(N)=O)C(O)C(CO)O1)O)C=1NC=NC=1)NC(=O)C1=NC(C(CC(N)=O)NCC(N)C(N)=O)=NC(N)=C1C LTQCLFMNABRKSH-UHFFFAOYSA-N 0.000 description 1
- 108010035235 Phleomycins Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000242743 Renilla reniformis Species 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 238000010266 Sephadex chromatography Methods 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 108090000901 Transferrin Proteins 0.000 description 1
- 102000004338 Transferrin Human genes 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 1
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 1
- 108010067390 Viral Proteins Proteins 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- 101100082060 Xenopus laevis pou5f1.1 gene Proteins 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 229960004150 aciclovir Drugs 0.000 description 1
- MKUXAQIIEYXACX-UHFFFAOYSA-N aciclovir Chemical compound N1C(N)=NC(=O)C2=C1N(COCCO)C=N2 MKUXAQIIEYXACX-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 239000000556 agonist Substances 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 239000005557 antagonist Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 239000012298 atmosphere Substances 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 238000002869 basic local alignment search tool Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 230000001588 bifunctional effect Effects 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 230000036760 body temperature Effects 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 230000001925 catabolic effect Effects 0.000 description 1
- 239000003729 cation exchange resin Substances 0.000 description 1
- 230000007910 cell fusion Effects 0.000 description 1
- 230000010261 cell growth Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 239000002458 cell surface marker Substances 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000013377 clone selection method Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 125000000151 cysteine group Chemical class N[C@@H](CS)C(=O)* 0.000 description 1
- 238000004163 cytometry Methods 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- KVEAILYLMGOETO-UHFFFAOYSA-H dicalcium magnesium diphosphate Chemical compound P(=O)([O-])([O-])[O-].[Mg+2].[Ca+2].[Ca+2].P(=O)([O-])([O-])[O-] KVEAILYLMGOETO-UHFFFAOYSA-H 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000009585 enzyme analysis Methods 0.000 description 1
- 229940116977 epidermal growth factor Drugs 0.000 description 1
- 229940105423 erythropoietin Drugs 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 210000000632 euchromatin Anatomy 0.000 description 1
- 108010055246 excisionase Proteins 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 239000012894 fetal calf serum Substances 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 239000012737 fresh medium Substances 0.000 description 1
- 230000005021 gait Effects 0.000 description 1
- 229960002963 ganciclovir Drugs 0.000 description 1
- IRSCQMHQWWYFCW-UHFFFAOYSA-N ganciclovir Chemical compound O=C1NC(N)=NC2=C1N=CN2COC(CO)CO IRSCQMHQWWYFCW-UHFFFAOYSA-N 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108020002326 glutamine synthetase Proteins 0.000 description 1
- 102000005396 glutamine synthetase Human genes 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 235000009424 haa Nutrition 0.000 description 1
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 1
- 210000004458 heterochromatin Anatomy 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000009776 industrial production Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 210000002450 kidney nerve cell Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 201000009240 nasopharyngitis Diseases 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 229940055729 papain Drugs 0.000 description 1
- 235000019834 papain Nutrition 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- 229960003531 phenolsulfonphthalein Drugs 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 235000019419 proteases Nutrition 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 230000020978 protein processing Effects 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 229950010131 puromycin Drugs 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 108010054624 red fluorescent protein Proteins 0.000 description 1
- 238000004007 reversed phase HPLC Methods 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000012679 serum free medium Substances 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- 239000000377 silicon dioxide Substances 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 238000003153 stable transfection Methods 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000001502 supplementing effect Effects 0.000 description 1
- 230000008093 supporting effect Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 239000011573 trace mineral Substances 0.000 description 1
- 235000013619 trace mineral Nutrition 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 238000006276 transfer reaction Methods 0.000 description 1
- 239000012581 transferrin Substances 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 239000013638 trimer Substances 0.000 description 1
- 230000005740 tumor formation Effects 0.000 description 1
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
Landscapes
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The present invention relates to a method of sequence-specific recombination of DNA in eukaryotic cells, comprising the introduction of a first DNA comprising a nucleotide sequence containing at least one recombination sequence into a cell, introducing a second DNA comprising a nucleotide sequence containing at least one further recombination sequence into a cell, and performing the sequence specific recombination by a bacteriophage lambda integrase Int.
  Description
 Sequence specific DNA recombination in eukaryotic cells The present invention relates to a method of sequence-specific recombination of DNA in eukaryotic cells, comprising the introduction of a first DNA comprising a nucleotide sequence containing at least one recombination sequence into a cell, introducing a second DNA
comprising a nucleotide sequence containing at least one further recombination sequence into a cell, and performing the sequence specific recombination by a bacteriophage lambda integrase Int.
The controlled manipulation of eukaryotic genomes and the expression of recombinant proteins from episomal vectors are important methods for analyzing the fimction(s) of specific genes in living organisms. Moreover, said manipulations play a role in gene therapeutic methods in medicine. In this context the generation of transgenic animals, the change of genes or gene segments (so-called "gene targeting") and the targeted integration for foreign DNA into the genome of higher eukaryotes are of particular importance. Recently these technologies could be improved by means of characterization and application of sequence specific recombination systems.
Furthermore, sequence-specific integration of expression cassettes, encoding and expressing a desired polypeptide/product, into the genome of biotechnological relevant host cells also gets more significance for the production of biopharmaceuticals. Expression level for a desired polypeptide in a stable transformed cell lines depends on the site of integration. By sequence specific integration, sites could be preferably used having a high transcription activity. The conventional method for generating production cell lines expressing a desired polypeptide/product is based on the random integration of the recombinant expression vector into the genome of the host cell. Variations in the expression level of the integrated genes) of interest in stable transformed cell lines are attributed mainly to differences in chromosomal locations and copy numbers. Random integration in the proximity of heterochromatin results in variable levels of transgene expression. Chromosome locations promoting the expression of the integrated genes) of interest are thought to be transcriptionally active regions of euchromatin.
This randomness of integration causes a large diversity in recombinant cells robustness, productivity and quality, necessitating an elaborate screening process to identify and isolate a suitable cell clone expressing the desired polypeptide at high level. In addition, the heterogeneity also means that for each clone an optimized production process has to be developed, making the development of a suitable production cell line a time consuming, labor intensive and costly process.
Conservative sequence specific DNA recombinases have been divided into two families.
Members of the first family, the so-called "integrase" family, catalyze the cleavage and rejoining of DNA strands between two defined nucleotide sequences, . which will be named as ~o recombination sequences in the following. The recombination sequences may be either on two different or on one DNA molecule, resulting in inter- or intramolecular recombination, respectively. For intramolecular recombination, the result of the reaction depends on the respective orientation of the recombination sequences to each other. In the case of an inverted, i.e. opposite orientation of the recombination sequences, inversion of the DNA
segments lying is between the recombination sequences occurs. In the case of direct, i.e.
tandem repeats of the recombination sequences on a DNA substrate, a deletion occurs. In case of the intermolecular recombination, i.e. if both recombination sequences are located on two .different DNA
molecules, a fusion of the two DNA molecules may occur. While members of the integrase family usually catalyze both intra- as well as intermolecular recombination, the recombinases of zo the second family of the so-called "invertases/resolvases" are only able to catalyze the intramolecular recombination.
At present, the recombinases which are used for the manipulation of eukaryotic genomes belong to the integrase family. Said recombinases are the Cre recombinase of the bacteriophage PI and zs the Flp recombinase from yeast (Miiller, U. (1999) Mech. Develop., 82, pp.
3). The recombination sequences to which the Cre recombinase binds are named loxP.
LoxP is a 34 by long nucleotide sequence consisting of two 13 by long inverted nucleotide sequences and an 8 by long spacer lying between the inverted sequences (Hoess, R. et al. (1985) J.
Mol. Biol., 181, pp.
351). The FRT named binding sequences for Flp are build up similarly., However, they differ so from loxP (Kilby, J. et al. (1993) Trends Genet., 9, pp. 413). Therefore, the recombination sequences may not be replaced by each other, i.e. Cre is not able to recombine FRT sequences and FLP is not able to recombine loxP sequences. Both recombination systems are active over long distances, i.e. the DNA segment to be inverted or deleted and flanked by two loxP or FRT
sequences may be several 10 000 base pairs long.
      comprising a nucleotide sequence containing at least one further recombination sequence into a cell, and performing the sequence specific recombination by a bacteriophage lambda integrase Int.
The controlled manipulation of eukaryotic genomes and the expression of recombinant proteins from episomal vectors are important methods for analyzing the fimction(s) of specific genes in living organisms. Moreover, said manipulations play a role in gene therapeutic methods in medicine. In this context the generation of transgenic animals, the change of genes or gene segments (so-called "gene targeting") and the targeted integration for foreign DNA into the genome of higher eukaryotes are of particular importance. Recently these technologies could be improved by means of characterization and application of sequence specific recombination systems.
Furthermore, sequence-specific integration of expression cassettes, encoding and expressing a desired polypeptide/product, into the genome of biotechnological relevant host cells also gets more significance for the production of biopharmaceuticals. Expression level for a desired polypeptide in a stable transformed cell lines depends on the site of integration. By sequence specific integration, sites could be preferably used having a high transcription activity. The conventional method for generating production cell lines expressing a desired polypeptide/product is based on the random integration of the recombinant expression vector into the genome of the host cell. Variations in the expression level of the integrated genes) of interest in stable transformed cell lines are attributed mainly to differences in chromosomal locations and copy numbers. Random integration in the proximity of heterochromatin results in variable levels of transgene expression. Chromosome locations promoting the expression of the integrated genes) of interest are thought to be transcriptionally active regions of euchromatin.
This randomness of integration causes a large diversity in recombinant cells robustness, productivity and quality, necessitating an elaborate screening process to identify and isolate a suitable cell clone expressing the desired polypeptide at high level. In addition, the heterogeneity also means that for each clone an optimized production process has to be developed, making the development of a suitable production cell line a time consuming, labor intensive and costly process.
Conservative sequence specific DNA recombinases have been divided into two families.
Members of the first family, the so-called "integrase" family, catalyze the cleavage and rejoining of DNA strands between two defined nucleotide sequences, . which will be named as ~o recombination sequences in the following. The recombination sequences may be either on two different or on one DNA molecule, resulting in inter- or intramolecular recombination, respectively. For intramolecular recombination, the result of the reaction depends on the respective orientation of the recombination sequences to each other. In the case of an inverted, i.e. opposite orientation of the recombination sequences, inversion of the DNA
segments lying is between the recombination sequences occurs. In the case of direct, i.e.
tandem repeats of the recombination sequences on a DNA substrate, a deletion occurs. In case of the intermolecular recombination, i.e. if both recombination sequences are located on two .different DNA
molecules, a fusion of the two DNA molecules may occur. While members of the integrase family usually catalyze both intra- as well as intermolecular recombination, the recombinases of zo the second family of the so-called "invertases/resolvases" are only able to catalyze the intramolecular recombination.
At present, the recombinases which are used for the manipulation of eukaryotic genomes belong to the integrase family. Said recombinases are the Cre recombinase of the bacteriophage PI and zs the Flp recombinase from yeast (Miiller, U. (1999) Mech. Develop., 82, pp.
3). The recombination sequences to which the Cre recombinase binds are named loxP.
LoxP is a 34 by long nucleotide sequence consisting of two 13 by long inverted nucleotide sequences and an 8 by long spacer lying between the inverted sequences (Hoess, R. et al. (1985) J.
Mol. Biol., 181, pp.
351). The FRT named binding sequences for Flp are build up similarly., However, they differ so from loxP (Kilby, J. et al. (1993) Trends Genet., 9, pp. 413). Therefore, the recombination sequences may not be replaced by each other, i.e. Cre is not able to recombine FRT sequences and FLP is not able to recombine loxP sequences. Both recombination systems are active over long distances, i.e. the DNA segment to be inverted or deleted and flanked by two loxP or FRT
sequences may be several 10 000 base pairs long.
 For example, a tissue specific recombination in a mouse system, a chromosomal translocation in plants and animals, and a controlled induction of the gene expression was achieved with said rive systems; review article of Miiller, U. (1999) Mech. Develop., 82, pp. 3. The DNA polymerase (3 s was deleted in particular tissues of mice in this way; Gu, H. et al. (1994) Science, 265, pp. 103.
A further example is the specific activation of the DNA tumor virus SV40 oncogene in the mouse lenses leading to tumor formation exclusively in these tissues. The Cre-IoxP strategy was used also in connection with inducible promoters. For example, the expression of the recombinase was regulated with an interferon-inducible promoter wleading to the deletion of a io specific gene in the liver and not - or only to a low extent - in other tissues; Kiihn, R. et al.
(1990 Science, 269, pp.1427.
So far three members of the invertase/resolvase family have been used for the manipulation of eukaryotic genomes. A mutant of the bacteriophage Mzi invertase Gin can catalyze the inversion ~s of a DNA fragment in plant protoplasts without cofactors. However, it has been discovered that this mutant is hyper-recombinogenic, i.e. it catalyzes DNA strand cleavages also at other than its naturally recombination sequences. This leads to unintended partially lethal recombination events in plant protoplast genomes. The (3-recombinase from Streptococcus pyogenes catalyses the recombination in mouse cell cultures between two recombination sequences as direct repeats zo leading to the excision of the segment. However, simultaneously with deletion also inversion has been detected which renders the controlled use of the system for manipulation of eukaryotic genomes unsuitable. Mutants of the y8 resolvase from E.coli have been shown to be active on episomal and artificially introduced genomic recombination sequences, but the efficiency of the latter reaction is still rather poor.
zs The manipulation of eukaryotic genomes with the Cre and Flp recombinase, respectively, shows significant disadvantages. In case of deletion, i.e. the recombination of two tandem repeated IoxP
or FRT recombination sequences in a genome there is an irreversibly loss of the DNA segment lying betW een the tandem repeats. Thus, a gene located on this DNA. segment will be lost 3o permanently for the cell and the organism. Therefore, the reconstruction of the original state for a new analyses of the gene function, e.g. in a later developmental stage of the organism, is impossible. The irreversible loss of the DNA segment caused by deletion may be avoided by an inversion of the respective DNA segment. A gene may be inactivated by an inversion without being lost and may be switched on again at a later developmental stage or in the adult animal by means of a timely regulated expression of the recombinase via back recombination. However, the use of both Cre and Flp recombinases in this modified method has the disadvantage that the inversion cannot be regulated as the recombination sequences will not be altered as a result of the recombination event. Thus, repeated recombination events occur causing the inactivation of s the respective gene due to the inversion of the respective DNA segment only in some, at best in 50% of the target cells at equilibrium of the reaction. There have been efforts to solve this problem, at least in part, by constructing mutated IoxP sequences which cannot be used for further reaction after a single recombination. However, the disadvantage is the uniqueness of the reaction, i.e. there is no subsequent activation by back recombination after inactivation of the ~o gene by inversion.
A further disadvantage of the Flp recombinase is its reduced heat stability at 37°C thus limiting the efficiency of the recombination reaction in higher eukaryotes significantly, e.g. in mice with a body temperature of about 39°C. Therefore, Flp mutants have been generated which exhibit a i s higher heat stability as the wild-type recombinase. However, even these mutant Flp enzymes still exhibit a lower recombination efficiency than the Cre recombinase.
A further use of sequence specific recombinases resides in the medical field, e.g. in gene therapy, where the recombinases integrate a desired DNA segment into the genome of a respective human zo target cell in a stable and controlled way. Both Cre and Flp may catalyze intermolecular recombination. Both recombinases recombine a plasmid DNA which carnes a copy of its respective recombination sequence with a corresponding recombination sequence which has been inserted before into the eukaryotic genome via homologous recombination.
However, it is desirable that this reaction includes a "naturally" occurring recombination sequence in the zs eukaryotic genome. Because loxP and FRT are 34 and 54 nucleotides long, respectively, occurrence of exact matches of these recombination sequences as part of the genome is statistically unlikely. Even if a recombination sequence would be present, the disadvantage of the aforementioned back reaction still exists, i.e. both Cre and Flp recombinase may excise the inserted DNA segment after successful integration by intramolecular recombination.
Thus, one problem of the present invention is to provide a simple and controllable recombination system, and the required working means. A further problem of the present invention is the provision of a recombination system and the required working means, which may carry out a stable and targeted integration of a desired DNA sequence. A further problem of the present invention is the provision of methods which allows the generation of an improved protein expression system on the basis of one of those recombination systems.
Said problems are solved by the subject matter characterized in the claims.
s The invention is explained in more detail with the following illustrations.
Figure 1 shows a schematic presentation of the recombination reactions namely integration and excision catalyzed by the wild-type integrase Int. A superhelical plasmid DNA
(top) carrying a io copy of the recombination sequence attP is shown. AttP consists of five so-called arm binding sites for Int (Pl, P2, P1', P2', P3'), two core Int binding sites (C and C';
marked with black arrows), three binding sites for IHF (Hl, H2, H'), two binding sites for Xis (X1, X2) and the so-called overlap region (open rectangle) where the actual DNA strand exchange takes place. The natural partner sequence for attP, attB, is shown on a linear DNA segment beneath and consists is of two core binding sites for Int (B and B'; marked with open arrows) and the overlap region. For the recombination between attB and attP, Int and IHF are necessary leading to the integration of the plasmid into the DNA segment carrying attB. Thereby, two new hybrid recombination sequences, attL and attR, are formed which serve ~as target sequences for the excision. The latter reaction requires in the wild-type situation Int and IHF, and a further cofactor XIS encoded by zo the phage lambda.
Figure 2 shows intra- and intermolecular recombination reactions. (A) Intramolecular integrative (attB x attP) recombination. (B) Intermolecular integrative (attB x attP) recombination. (C) Intramolecular excisive (attL x attR) recombination. (D) Intermolecular excisive (attL x attR) zs recombination. Substrate vectors and expected recombination products are schematized at the top of each panel. The fraction of GFP-expressing cells was determined by FACS
at three time points after co-transfection of substrate and expression vectors. We show mean values of three assays with standard deviations indicated by vertical lines.
3o Figure 3 shows that the presence of Int arm-binding DNA sequences in att sites stimulates intermolecular recombination. (A) Pairs of substrate vectors for intermolecular recombination contain either attB or nttP in different combinations and yield products that express GFP driven by the CMV promoter. (B) Various combinations of substrate vectors were co-transfected with expression vectors for wild-type Int, mutant Int-h, or Int-h1218. At 48 hrs, cells were analyzed by FACS and the ratio of GFP-expressing cells was determined fox two pairs of substrates.
Recombination between attP and attP served as reference, as indicated. We show mean values of three assays with standard deviations indicated by vertical lines. The actual mean values of GFP-expression cells (%) for Int were 0.08 (B x B), 1.24 (P x P), and 0.81 (P x B). Those for Int-h s were 1.15 (B xB), 8.07 (P x P), and 9.90 (P x B). Those for Int-h/218 were 4.01 (B x B), 17.62 (P
x P), and 16.45 (P x B).
Figure 4 shows that purified IHF protein stimulates intra- and intermolecular integrative recombination by wild-type Int. (A) Schematic representation of substrate vectors which were io incubated with or without IHF before trarisfection into HeLa cells that transiently expressed either wild-type Int or Int-h. (B) At 48 hrs after transfection, the fractions of GFP-expressing cells were analyzed by FRCS. The ratio of these fractions was plotted as activation of recombination by IHF. The graph shows mean values of three assays with standard deviations indicated by vertical lines. The actual mean values of GFP-expressing cells (%) in the is presence and absence of IHF, respectively, were for Int (7.93/1.26) and Int-h (17.57/13.14) in the case of intramolecular recombination, and for Int (13.94/3.47) and Int-h (20.33/16.83) analyzing intermolecular recombination.
Figure 5 schematically shows exemplary expression vector designs for the sequence specific zo DNA recombination in CHO-DG44 cells. "P/E" means a composite unit that contains both enhancer and promoter element, "P" a promoter element and "T" a transcription termination site required for polyadenylation of transcribed messenger RNA. "GOI" refers to a gene of interest, "dhfr" to the amplifiable selectable marker dihydrofolate reductase, "FP" to a fluorescent protein such as ZsGreen and "npt" to the selectable marker neomycin is phosphotransferase. An arrow indicates the site of transcription initiation within a transcription unit. The sequence specific recombination between the recombination site attP
or attB located on the first DNA and the recombination site attP or attB
located on the second DNA is depicted with a cross and is mediated by the bacteriophage lambda integrase. "att"
refers to the attachment sites resulting from the exemplarily shown recombination between 3o attP and cattP, attP and attB, attB and attP, or attB and attB located on the first and second DNA, respectively.
The term "transformation" or "to transform" , "transfection" or "to transfect"
as used herein means any introduction of a nucleic acid sequence into a cell, resulting in genetically modified, recombinant, transformed or transgenic cells. The introduction can be performed by any method well known in the art and described, e.g. in Sambrook, J. et al. (1989) Molecular Cloning: A
Laboratory Manual Cold Spring Harbor Laboratory, Cold Spring Harbor, New York or Ausubel, F.M. et al. (1994 updated) Current Protocols in Molecular Biology, New York:
Greene s Publishing Associates and Wiley-Interscience. Methods include but are not limited to lipofection, electroporation, polycation (such as DEAF-dextran)-mediated transfection, protoplast fusion, viral infections and microinjection or may be carried Ollt by means of the calcium method, electroshock method, intravenous/intramusuclar injection, aerosol inhalation or an oocyte injection. The transformation may result in a transient or stable transformation of the ~o host cells. The term "transformation" or "to transform" also means the introduction of a viral nucleic acid sequence in a way which is for the respective vims the naturally one. The viral nucleic acid sequence needs not to be present as a naked nucleic acid sequence belt may be packaged in a viral protein envelope. Thus, the term relates not only to the method which is usually known under the term "transformation" or "to transform". Transfection methods that ~s provide optimal transfection frequency and expression of the introduced nucleic acid are favored.
Suitable methods can be determined by routine procedures. For stable transfectants the constructs are either integrated into the host cell's genome or an artificial chromosome/mini-chromosome or located episomally so as to be stably maintained within the host cell.
?o The term "recombination sequences" as used herein relates to ~ttB, attP, attL and attR sequences and the derivatives thereof. An example for an attB sequence is specified in SEQ ID N0:13, an example for an attP sequence is specified in SEQ ID N0:14, an example for an attL sequence is specified in SEQ ID NO:15, and an example for an attR sequence is specified in SEQ ID N0:16.
zs The term "derivative" as used herein relates to attB, attP, attL and attR
sequences having one or more substitutions, preferably seven, more preferably two, three, four, five or six in the overlap region and/or core region in contrast to naturally occurring attB, attP, czttL
and c~ttR sequences.
The term "derivative" also relates to at least one core Int binding site of attB, attP, attL or attR.
The term "derivative" also relates to at least one core Int binding site of attP, attL or attR plus 30 one or more copies of the arm-binding sites for Int. The term "derivative"
also relates to at least one core Int binding site of attP, attL or attR plus one or more copies of the IHF, FIS or XIS
factor binding sites. The term "derivative" also relates to a combination of these features. The term "derivative" moreover relates to any functional fragments thereof and to endogenous nucleotide sequences in eukaryotic cells supporting sequence-specific recombination, e.g. attH
 
identified in the human genome (see e.g. WO 01/16345). The term "derivative"
in general includes attB, attP, attL or attR sequences suitable for realizing the intended use of the present invention, which means that the sequences mediate sequence-specific recombinantion events driven by an integrase (wild-type or modified) of the bacteriophage lambda.
s The term "functional fragment" relates to attB, attP, attL and attR sequences having substitutions, deletions, and/or insertions (including presence or absence of wild-type or modified protein binding sites), which do not significantly affect the use of said sequences in recombination events driven by an wild-type or modified integrase of the bacteriophage lambda.
io Functionality is not significantly affected, when recombination frequency is at least about 70%, preferably at least about 80%, more preferably about 90%, further more preferably at least about 95%, and most preferably more than about 100% in comparison to the corresponding naturally occurring recombination sequences, using the same recombinase under the same conditions (e.g.
in vitro ~or in vivo use, identical host cell type, identical transfection conditions, presence or is absence of the same host factors, the same buffer conditions, identical temperature etc.).
Alternatively, substitutions, deletions, and/or insertions in attB, attP, attL
and/or attR sequences confer at least an enhancement of the recombination events driven by a wild-type or modified integrase of the bacteriophage lccmbda, whereby said enhancement may consist for example of (i) increasing the efficiency of recombination events (integration and/or excision), (ii) increasing the zo specificity of recombination, (iii) favoring excisive recombination events, (iv) favoring integrative recombination events, (v) relieving the requirements for some or all host factors, in comparison to the corresponding naturally occurnng recombination sequences using the same recombinase under the same conditions (see above).
zs The functionality of modified recombination sites or of modified integrase can be demonstrated in ways that depend on the desired particular characteristic and are known in the art. For example, a co-transfection assay as described in the present invention (see Results 5.1 or Example 3 of WO 01/16345) may be used to characterize integrase-mediated recombination of extrachromosomal DNA in a variety of cell lines. Briefly, cells are co-transfected with an 30 expression vector encoding the integrase protein and a substrate vector that is a substrate for the recombinase, encoding a functional/non-functional reporter gene (e.g.
fluorescent protein like GFP) and containing at least one recombination sequence therein. Upon expression of the integrase by the expression vector, the function of the reporter gene will be rendered non-functional/functional. Thus, the recombination activity can be assayed either by recovering the recombined substrate vector and looking for evidence of recombination at the DNA level (for example by performing a PCR, sequence analysis of the recombined region, restriction enzyme analysis, Southern blot analysis) or by looking for evidence of the recombination at the protein level (e.g. ELISA, Western Blotting, radioimmunoassay, immunoprecipitation, immunostaining, s FACS-analysis of fluorescent proteins).
The term "overlap region" as used herein defines the sequence of the recombination sequences where the DNA strand exchange, including strand cleavage and religation, takes place and relates to the consensus DNA sequence S'-TTTATAC-3' in wild-type att sites or said sequence ~o having functional nucleotide substitutions. The only prerequisite is, that the sequence of the overlap region is identical between recombining partner sequences.
The term "core binding sites" relates to two imperfectly repeated copies in inverted orientation, separated by the overlap region, in each set of wild-type att sites. The core binding sites are ~s essential for the recombination by binding the integrase at low affinity.
Each core binding site consists of nine contiguous base pairs and relates to DNA sequences consisting for the B-sequence of the nucleotide sequence 5'-CTGCTTTTT-3', for the B'-sequence of the nucleotide sequence 5'-CAAGTTAGT-3' (reverse complementary strand), for the C-sequence of the nucleotide sequence 5'-CAGCTTTTT-3', and for the C'-sequence of the nucleotide sequence zo 5 ~-CAACTTAGT-3' (reverse complementary strand) in wild-type att sites or said sequences having functional nucleotide substitutions.
The term "arm-binding site for Int" or "arm-binding sites" as used herein relates to the consensus sequence S'-C/AAGTCACTAT-3' or said sequence having functional nucleotide substitutions.
zs The arm-binding site for Int may be positioned at various distances upstream and/or downstream of the core Int binding site(s).
The term "homologue" or "homologous" or "similar" as used herein with regard to recombination sequences, arm-binding sites, and host factor binding sites relates to a nucleic acid ~o sequence being identical for about 70%, preferably for about 80%, more preferably for about 85%, further more preferably for about 90%, further more preferably for about 95%, and most preferably for about 99% to naturally occurring recombination sequences, arm-binding sites, and host factor binding sites. As homologous or similar are considered sequences, which e.g. using standard parameters in the similarity algorithm BLAST of NCBI (Basic Local Alignment Search Tool, Altschul et al., Journal of Molecular Biology 215, 403-410 (1990)) showing a probability of P < 10-s when compared to the recombination sequences.
The term "vector" as used herein relates to naturally occurring or synthetically generated s constructs for uptake, proliferation, expression or transmission of nucleic acids in a cell, e.g.
plasmids, phagemids, cosmids, artificial chromosomes/mini-chromosomes, bacteriophages, viruses or retro vimses. Methods used to construct vectors are well known to a person skilled in the art and described in various publications. In particular techniques for constnicting suitable vectors, including a description of the functional and regulatory components such as promoters, io enhancers, termination and polyadenylation signals, selection markers, origins of replication, and splicing signals, are reviewed in considerable details in Sambrook, J. et al.
(1989), supra, and references cited therein. The eukaryotic expression vectors will typically contain also prokaryotic sequences that facilitate the propagation of the vector in bacteria such as an origin of replication and antibiotic resistance genes for selection in bacteria. A
variety of eukaryotic ~s expression vectors, containing a cloning site into which a polynucleotide can be operatively linked, are well known in the art and some are commercially available from companies such as Stratagene, La Jolla, CA; Invitrogen, Carlsbad, CA; Promega, Madison, WI or BD
Biosciences Clontech, Palo Alto, CA.
Zo The terms "gene of interest", "desired sequence", or "desired gene" as used herein have the same meaning and refer to a polynucleotide sequence of any length that encodes a product of interest.
The selected sequence can be full length or a tnmcated gene, a fusion or tagged gene, and can be a cDNA, a genomic DNA, or a DNA fragment, preferably, a cDNA. It can be the native sequence, i.e. naturally occurnng form(s), or can be mutated or otherwise modified as desired.
2s These modifications include codon optimizations to optimize codon usage in the selected host cell, humanization or tagging. The selected sequence can encode a secreted, cytoplasmic, nuclear, membrane bound or cell surface polypeptide. The "product of interest"
includes proteins, polypeptides, fragments thereof, peptides, antisense RNA all of which can be expressed in the selected host cell.
The term "nucleic acid sequence", "nucleotide sequence", or "DNA sequence" as used herein refers to an oligonucleotide, nucleotide or polynucleotide and fragments and portions thereof and to DNA or RNA of genomic or synthetic origin, which may be single or double stranded and represent the sense or antisense strand. The sequence may be a non-coding sequence, a coding to sequence or a mixture of both . The polynucleotides of the invention include nucleic acid regions wherein one or more codons have been replaced by their synonyms.
The nucleic acid sequences of the present invention can be prepared using standard techniques s well known to one of skill in the art. The term "encoding" or "coding"
refers to the inherent property of specific sequences of nucleotides in a nucleic acid, such as a gene in chromosome or an mRNA, to serve as templates for synthesis of other polymers and macromolecules in biological processes having a defined sequence of nucleotides (i.e. rRNA, tRNA, other RNA
molecules) or amino acids and the biological properties resulting therefrom.
Thus a gene encodes io a protein, if transcription and translation of mRNA produced by that gene produces the protein in a cell or other biological system. Both the coding strand, the nucleotide sequence of which is identical to the mRNA sequence and is usually provided in sequence listings, and non-coding strand, used as the template for the transcription, of a gene or cDNA can be referred to as encoding the protein or other product of that gene or cDNA. A nucleic acid that encodes a is protein includes any nucleic acids that have different nucleotide sequences but encode the same amino acid sequence of the protein due to the degeneracy of the genetic code.
Nucleic acids and nucleotide sequences that encode proteins may include introns.
The term "polypeptide" is used interchangeably with amino acid residue sequences or protein zo and refers to polymers of amino acids of any length. These terms also include proteins that are post-translationally modified through reactions that include, but are not limited to, glycosylation, acetylation, phosphorylation or protein processing. Modifications and changes, for example fusions to other proteins, amino acid sequence substitutions, deletions or insertions, can be made in the structure of a polypeptide while the molecule maintains its biological functional activity.
zs For example certain amino acid sequence substitutions can be made in a polypeptide or its underlying nucleic acid coding sequence and a protein can be obtained with like properties.
Amino acid modifications can be prepared for example by performing site-specific mutagenesis or polymerase chain reaction mediated mutagenesis on its underlying nucleic acid sequence.
3o The term "expression" as used herein refers to transcription and/or translation of a heterologous nucleic acid sequence within a host cell. The level of expression of a desired product in a host cell may be determined on the basis of either the amount of corresponding mRNA
that is present in the cell, or the amount of the desired polypeptide encoded by the selected sequence. For example, mRNA transcribed from a selected sequence can be quantitated by Northern blot hybridization, ribonuclease RNA protection, in situ hybridization to cellular RNA or by PCR
(see Sambrook, J. et al. (1989), supra; Ausubel, F.M. et al. (1994 updated), supra). Proteins encoded by a selected sequence can be quantitated by various methods, e.g. by ELISA, by Western blotting, by radioimmunoassays, by immunoprecipitation, by assaying for the biological s activity of the protein, or by immunostaining of the protein followed by FAGS analysis PCR (see Sambrook, J. et al. (1989), supra; Ausubel, F.M. et al. (1994 updated), supra).
An "expression cassette" defines a region within a construct that contains one or more genes to be transcribed, wherein the genes contained within the segment are operatively linked to each ~o other and transcribed from a single promoter, and as result, the different genes are at least transcriptionally linked. More than one protein or product can be transcribed and expressed from each transcription unit. Each transcription unit will comprise the regulatory elements necessary for the transcription and translation of any of the selected sequence that are contained within the unit.
is The term "operatively linked" means that two or more nucleic acid sequences or sequence elements are positioned in a way that permits them to function in their intended manner. For example, a promoter and/or enhancer is operatively linked to a coding sequence if it acts in cis to control or modulate the transcription of the linked sequence. Generally, but not necessarily, the zo DNA sequences that are operatively linked are contiguous and, where necessary to join two protein coding regions or in the case of a secretory leader, contiguous and in reading frame.
The term "selection marker gene" refers to a gene that only allows cells carrying the gene to be specifically selected fox or against in the presence of a corresponding selection agent. By way of zs illustration, an antibiotic resistance gene can be used as a positive selectable marker gene that allows the host cell transformed with the gene to be positively selected for in the presence of the corresponding antibiotic; a non-transformed host cell would not be capable of growth or survival under the selection culture conditions. Selectable markers can be positive, negative or bifimctional. Positive selectable markers allow selection for cells carrying the marker by 3o conferring resistance to a dnig or compensate for a metabolic or catabolic defect in the host cell.
In contrast, negative selection markers allow cells carrying the marker to be selectively eliminated. For example, using the HSV-tk gene as a marker will make the cells sensitive to agents such as acyclovir and gancyclovir. The selectable marker genes used herein, including the amplifiable selectable genes, will include recombinantly engineered mutants and variants, fragments, functional equivalents, derivatives, homologs and fusions of the native selectable marker gene so long as the encoded product retains the selectable property.
Useful derivatives generally have substantial sequence similarity (at the amino acid level) in regions or domains of the selectable marker associated with the selectable property. A variety of marker genes have s been described, including bifunctional (i.e. positivelnegative) markers (see e.g. WO 92/08796 and WO 94/28143), incorporated by reference herein. For example, selectable genes commonly used with eukaryotic cells include the genes for aminoglycoside phosphotransferase (APH), hygromycin phosphotransferase (HYG), dihydrofolate reductase (DHFR), thymidine kinase (TK), glutamine synthetase, asparagine synthetase, and genes encoding resistance to neomycin ~ o (G418), puromycin, histidinol D, bleomycin and phleomycin.
Selection may also be made by fluorescence activated cell sorting (FACS) using for example a cell surface marker, bacterial (3-galactosidase or fluorescent proteins (e.g.
green fluorescent proteins (GFP) and their variants from Aeqzcorea victoria and Renilla reniformis or other species;
is red fluorescent proteins, fluorescent proteins and their variants from non-bioluminescent species (e.g. Discosoma sp., Anemonia sp., Clavularia sp., Zoanthzcs sp.) to select for recombinant cells.
The term "selection agent" refers to a substance that interferes with the growth or survival of a host cell that is deficient in a particular selectable gene. For example, to select for the presence of zo an antibiotic resistance gene like APH (aminoglycoside phosphotransferase) in a transfected cell the antibiotic Geneticin (G418) is used.
The integrase (usually and designated herein as "Int") of the bacteriophage lambda belongs like Cre and Flp to the integrase family of the sequence specific conservative DNA
recombinases. In zs its natural function Int catalyses the integrative recombination between two different recombination sequences namely attB and attP. AttB comprises 21 nucleotides and was originally isolated from the E. coli genome; Mizuuchi, M. and Mizuuchi, K.
(1980) Proc. Natl.
Acad. Sci. USA, 77, pp. 3220. On the other hand attP having 243 nucleotides is much longer and occurs naturally in the genome of the bacteriophage lambda; Landy, A., and Ross, W. (1977) 3o Science, 197, pp. 1147. The Int recombinase has seven binding sites altogether in attP and two in attB. The biological function of Int is the sequence specific integration of the circular phage genome into the locus attB on the E. coli chromosome. Int needs a protein co-factor, the so-called integration host factor (usually and designated herein as "IHF") for the integrative recombination; Kikuchi, Y. and Nash, H. (1978) J. Biol. Chem., 253, 7149. IHF
is needed for the assembly of a functional recombination complex with attP. A second co-factor for the integration reaction is the DNA negative supercoiling of attP. Finally, the recombination between attB and attP leads to the formation of two new recombination sequences, namely attL
and attR, which serve as substrate and recognition sequence for a further recombination reaction, s the excision reaction. A comprehensive summary of the bacteriophage lambda integration is given e.g. in Landy, A. (1989) Annu. Rev. Biochem., 58, pp. 913.
The excision of the phage genome out of the bacterial genome is catalyzed by the Int recombinase also. For this, a further co-factor is needed in addition to Int and IHF, which is io encoded by the bacteriophage lambda. This is the excisionase (usually and designated herein as "XIS") having two binding sites in attR; Gottesman, M. and Weisberg, R. (1971) The Bacteriophage Lambda, Cold Spring Harbor Laboratory, pp.113. In contrast to the integrative recombination, DNA negative supercoiling of the recombination sequences is not necessary for the excisive recombination. However, DNA negative supercoiling increases the efficiency of the is recombination reaction: A further improvement of the efficiency of the excision reaction may be achieved with a second co-factor namely FIS (factor for inversion stimulation), which acts in conjunction with XIS; Landy, A. (1989) Annu. Rev. Biochem., 58, pp.913. The excision is genetically the exact reverse reaction of the integration, i.e. attB and attP
are generated again. A
comprehensive summary of the bacteriophage lambda excision is given e.g. in Landy, A. (1989) ~o Annu. Rev. Biochem., 58, pp. 913.
One aspect of the present invention relates to a method of sequence specific recombination of DNA in a eukaryotic cell, comprising a) introducing a first attB, attP, attL or attR sequence or a derivative thereof into a cell, zs b) introducing a second attB, attP, attL or czttR sequence or a derivative thereof into a cell, wherein if said first DNA sequence comprises an attB sequence or a derivative thereof said second sequence comprises an attB, attL or attR sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attP sequence or a derivative thereof said second sequence comprises an czttP, attL or attR sequence or a derivative thereof, or wherein if said first 3o DNA sequence comprises an attL sequence or a derivative thereof said second sequence comprises an attB, attP or attL sequence or a derivative thereof, or wherein if said first DNA
sequence comprises an attR sequence or a derivative thereof said second sequence comprises an attB, attP or attR sequence or a derivative thereof, c) performing the sequence-specific recombination by a bacteriophage lambda integrase Int.
 
Preferred is the method wherein in step c) the sequence-specific recombination is performed by Int or by Int and XIS, FIS, and/or IHF. Most preferred is the method wherein in step c) the sequence-specific recombination is performed by Int or by Int and a XIS
factor, or by Int and s IHF, or by Int and XIS and IHF. Further preferred is the method wherein in step c) the sequence-specific recombination is performed by a modified Int, preferably the Int-h or Int-h/218. In this context, use of a modified Int together with XIS, FIS and/or IHF is also within the meaning of the present invention.
io In a more preferred embodiment of this method, sequence specific recombination of DNA in a eukaryotic cells will be performed between identically or nearly identically recombination sites.
Therefore, the present invention relates a method of sequence specific recombination as described above, wherein if said first DNA sequence comprises an attB sequence or a derivative thereof said second sequence comprises also attB sequence or a derivative thereof, or wherein if is said first DNA sequence comprises an attP sequence or a derivative thereof said second sequence comprises an attP sequence or a derivative thereof, or wherein if said first DNA
sequence comprises an attL sequence or a derivative thereof said second sequence comprises an attL sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attR
sequence or a derivative thereof said second sequence comprises an attR
sequence or a zo derivative thereof.
The method of the present invention may be carned out not only with the naturally occuring attB, attP, attL, and/or attR sequences but also with modified e.g.
substituted attB, attP, attL, and/or attR sequences. For example an integrative recombination of the bacteriophage lambda Zs and E. coli between attP and attB homologous sequences (mutants of the wild-type sequences) have been observed which have one or more substitutions in attB (Nash, H.
(1981) Annu. Rev.
Genet., 15, pp. 143; Nussinov, R. and Weisberg, R. (1986) J. Biomol. Stntct.
Dynamics, 3, pp 1134) and/or in attP (Hash, H. (1981) Annu. Rev. Genet., 15, pp.143).
3o Thus, the present invention relates to a method wherein the used attB, attP, attL, and/or attR
sequences have one or more substitutions in comparison to the naturally occuring attB, attP, attL, and/or attR sequences. Preferred is a method wherein the attB, attP, attL, and/or attR
sequences have one, two, three, four, five, six, seven or more substitutions.
The substitutions may occur both in the overlap region and in the core region. The complete overlap region comprising seven nucleotides may be substituted also. More preferred is a method wherein substitutions are introduced into the attB, attP, attL, and/or attR sequences either in the core region or in the overlap region. Preferred is the introduction of a substitution in the overlap region and the simultaneous introduction of one or two substitutions in the core region. The s present invention also relates to a method wherein the used attB, attP, attL, and/or attR
sequences are derivatives, including functional fragments thereof, of said recombination sites in comparison to the naturally occurnng attB, attP, attL, and/or attR sequences.
A modification in the form of one or more substitutions) into recombination sequences is to be io chosen such that the recombination can be carned out in spite of the modification(s). Examples for such substitutions are listed e.g. in the publications of Nash, H. (1981), supra and Nussinov, R. and Weisberg, R. (1986), sz~pra and are not considered to be limiting.
Further modifications may be easily introduced e.g. by mutagenesis methods (a number of these are described in Ausubel, F.M. et al. (1994 updated), supra) and and may be tested for their use by test is recombinations as described e.g. in the examples of the present invention (Examples 1 and 2, results 5.1 ).
Furthermore, the present invention relates to a method wherein the used attB, attP, cattL, and/or attR sequences comprise only of one of the respective core Int binding sites, however, more than zo two core Int binding sites are also preferred. In a preferred embodiment, the present invention relates to a method wherein the used attB, attP, attL, and/or attR sequences consist only of one of the respective core Int binding sites. In a further embodiment the used attB, attP, attL, and/or attR sequences consist of two or more core Int binding sites.
zs The present invention relates further to a method wherein the used attP, attL, and/or attR
sequences comprise in addition to the core Int binding site one or more, preferably two, three, four, five or more than five, copies of the arm-binding site for Int. Said binding site comprises a consensus motive having the sequence 5'-C/AAGTCACTAT-3' (SEQ ID NO:1) or a modified sequence thereof having nucleotide substitutions and being functional with regard to the Int 3o binding. The arm-binding sites) for Int may be positioned at various distances upstream and/or downstream of the core Int binding site(s).
In order to perform the method of the present invention the first recombination sequence may comprise further DNA sequences which allow the integration into a desired target locus, e.g. in the genome of the eukaryotic cell or an artificial-/minichromosome. This recombination occurs e.g. via the homologous recombination which is mediated by internal cellular recombination mechanisms. For said recombination, the further DNA sequences have to be homologous to the DNA of the target locus and located both 3' and 5' of the attB, attL, ccttP, or attR sequences or s derivatives thereof, respectively. The person skilled in the art knows how great the degree of the homology and how long the respective 3' and 5' sequences have to be such that the homologous recombination occurs with a sufficient probability; see review of Capecchi, M.
(1989) Science, 244, pp. 1288.
~o However, it is also possible to integrate the first recombination sequence by any other mechanism into the genome of the eukaryotic cell, or any artificial-/minichromosome, e.g. via random integration which is also mediated by internal cellular recombination events. Integration of said first recombination site via sequence-specific recombination using sites different from those being integrated, e.g. by using IoxPlFRT sequences, is also conceivable.
is The second recombination sequence may also comprise DNA sequences which are necessary for an integration into a desired target locus via homologous recombination. For the method of the present invention both the first and/or the second recombination sequence may comprise the further DNA sequences. Preferred is a method wherein both DNA sequences comprise the zo further DNA sequences.
Introduction of the first and second recombination sequence with or without further DNA
sequences may be performed both consecutively and in a co-transformation wherein the recombination sequences are present on two different DNA molecules. Preferred is a method, Zs wherein the first and second recombination sequence with or without further DNA sequences are present and introduced into the eukaryotic cells on a single DNA molecule.
Furthermore, the first recombination sequence may be introduced into a cell and the second recombination sequence may be introduced into another cell wherein the cells are fused subsequently.
The term fusion means crossing of organisms as well as cell fusion in the widest sense.
The method of the present invention may be used e.g. to invert a DNA segment lying between the indirectly orientated recombination sequences in an intramolecular recombination.
Furthermore, the method of the present invention may be used to delete the DNA
segment lying between the directly orientated recombination sequences in an intramolecular recombination. If the recombination sequences are each incorporated in 5'-3' or in 3'-5' orientation they are present in direct orientation. The recombination sequences are in indirect orientation if e.g. the attB
sequence is integrated in S'-3' and the attP sequence is integrated in 3'-5' orientation. If the recombination sequences are each incorporated e.g. via homologous recombination into intron s sequences 5' and 3' of an exon and the recombination is performed by an integrase, the exon would be inverted in case of indirectly orientated recombination sequences and deleted in case of directly orientated recombination sequences, respectively. With this procedure the polypeptide encoded by the respective gene may lose its activity or function or the transcription may be stopped by the inversion or deletion such that no (complete) transcript is generated. In this way io e.g. the biological function of the encoded polypeptide may be investigated. Moreover, inversion or deletion reactions may be used to activate the expression of a gene encoding a desired polypeptide, e.g. by functional linkage of the open reading frame of the encoded polypeptide with regulatory elements which allow transcription and/or translation of the encoded polypeptide. Those regulatory elements include but are not limited to a promotor and or i ~ promotor/enhancer elements, which are well knoiyn in the art for various eukaryotic expression systems.
However, the first and/or second recombination sequence may comprise further nucleic acid sequences encoding one or more polypeptides/products of interest. For example a structural zo protein, an enzymatic or a regulatory protein may be introduced via the recombination sequences into the genome being transiently or stably expressed after intramolecular recombination. The introduced polypeptide/product may be an endogenous or exogenous one.
Furthermore, a marker protein or biopharmaceutically relevant therapeutic polypeptides may be introduced. The person skilled in the art knows that this listing of applications of the method according to the present ?s invention is only exemplary and not limiting. Examples of applications according to the present invention performed with the so far used Cre and Flp recombinases may be found e.g. in the review of Kilby, N. et al., (1993), Trends Genet., 9, pp.413.
Furthermore, the method of the present invention may be used to delete or,invert DNA segments 30 on vectors by an intramolecular recombination on episomal substrates. A
deletion reaction may be used e.g. to delete packaging sequences from so-called helper viruses. This method has a broad application in the industrial production of viral vectors for gene therapeutic applications;
Hardy, S. et al., (1997), 3. Virol., 71, pp.1842.
 
The intermolecular recombination leads to the fusion of two DNA molecules each having a copy of attB, attP, attL, or attR or various combinations of att sequences or of their derivates. For example, attB or a derivative thereof may be introduced first via homologous recombination in a known, well characterized genomic locus of a cell or an artificial-Iminchromosome.
s Subsequently an ccttB, attP, attL, or attR carrying vector or DNA-segment may be integrated into said genomic attB sequence via intermolecular recombination. Preferred in this method is the co-expression of the mutant integrase, e.g. Int-h or Int-h/218 within the eukaryotic cell, wherein the recombination occurs. Most preferred is the co-expression of the mutant integrase Int-h/218.
Genes encoding for any of those mutant integrases may be located on a second DNA vector io being transfected, preferably co-transfected, or on the vector or DNA-segment carrying the attP, attL, attR or also an czttB sequence or an derivative thereof. Further sequences may be located on the attB, attP, attL, or attR carrying vector or DNA-segment, e.g. a gene for a particular marker protein flanked by loxPlFRT sequences. With this approach it may be achieved that, e.g. in comparative expression analyses of different genes in a cell type, said genes are not influenced is by positive or negative influences of the respective genomic integration locus. Furthermore, the method of the present invention may be used to fuse DNA segments on vectors by an intermolecular recombination on episomal substrates. A fusion reaction may be used e.g. to express recombinant proteins or relevant domains in order to screen for phenotypes. This method may be used in the high throughput analysis of protein functions in eukaryotic cells and is thus of zo considerable interest.
As mentioned above, intermolecular recombination may be used to introduce one or more genes) of interest encoding one or more desired polypeptide(s)/product(s) into, e.g. episomal substrates, artificial-/minichromosomes, or various host cell genomes containing a first zs recombination sequence. In this context a second DNA comprises beside at least one recombination sequence, e.g. attP, attB, attL, attR or any derivative thereof, one or more expression cassettes) for the expression of one or more desired protein(s)/product(s). That expression cassette may be introduced into a desired target locus via the recombination sequences which allows sequence-specific recombination between the DNA
comprising the 3o second recombination sequence and the expression cassette, and the first recombination sequence being introduced before into said episomal substrate, artificial-/minichromosome, or host cell genome. This embodiment may be of high interest for establishing high expression cell lines which are suitable for the production of biopharmaceutical products.
 
In this context, a first DNA comprising at least one recombination sequence has to be introduced, e.g. by random integration, into the genome of the host cell, an artificial-Jminichromosomes or episomal substrates contained within the host cell. Alternatively, host cell may be transformed with an artificial-/minichromosome or episomal substrate comprising a corresponding at least s one recombination site(s). Another way to integrate recombination sequences) into a desired target locus, recognized by a bacteriophage lambda integrase Int, is to use homologous recombination techniques as mentioned above.
To facilitate selection for stable transfectants which have introduced recombination sequences) io into a desired target locus, a selection marker gene is co-introduced into the same target locus at the same time. This may be achieved, for example, if the recombination sequences) and a selection marker gene are co-located on the same vector or DNA segment, which is introduced into the target locus, e.g. by any method mentioned above (homologous recombination, random integration, etc.). As the expression level of the selection marker gene correlates with the is transcription activity at the integration site, cells showing a high expression level at site of integration, cell robustness, and good growth characteristics, e.g. in a bioreactor, can be identified very effectively. The level of expression of the selection marker gene can be determined by methods well known in the art, e.g. on the basis of either the amount of corresponding mRNA that is present in the cell, or the amount of polypeptide encoded by the ?o gene. For example, mRNA transcribed from the introduced gene sequence can be quantified by Northern blot hybridization, ribonuclease RNA protection, in situ hybridization to cellular RNA
or by PCR (see Sambrook et al., 1989; Ausubel et al., 1994, supra). Proteins encoded by a selected sequence can be quantified by various methods, e.g. by ELISA, by Western blotting, by radioimmunoassays, by immunoprecipitation, by assaying for the biological activity of the as protein, by immunostaining of the protein followed by FACS analysis, or by measuring the fluorescence signals of a fluorescent protein (see Sambrook et al., 1989;
Ausubel et al., 1994 updated, sicpra). By such a method excellent candidates of a production cell line for producing biopharmaceuticals may be obtained.
3o The integrated recombination sequences) (first recombination sequence(s)) allow integration of a further DNA molecule, e.g. a vector or DNA segment carrying at least one further recombination sequence (second recombination sequence) via sequence-specific recombination by a bacteriophage lambda integrase Int into a transcriptional active locus.
Preferably, that further DNA molecule comprising at least one second recombination sequence further comprises an expression cassette for the expression of at least one biopharmaceutically relevant gene of interest. Fox this, host cells, which comprise the first integrated recombination sequence, preferably integrated into the host cell genome at a transcriptional active locus, are tranfected with a DNA molecule comprising the second recombination sequence for a bacteriophage s lambda integrase Int, and are cultivated under conditions that allow sequence-specific recombination between the first and the second recombination sequence, preferably the integration of the DNA molecule comprising the second recombination sequence into the host cell genome comprising the first recombination sequence. First and second recombination sequences can be either attP, attB, attL, attR or any derivative thereof, which allows sequence-to specific recombination by a bacteriophage lambda integrase Int or any functional mutant thereof.
For example, if the first recombination sequence comprises attP or a derivative thereof second may comprises attP, attB, attL, attR or any derivative thereof.
Preferred is the method wherein the sequence-specific recombination is performed by Int, or by is Int and XIS, FIS and/or IHF. Most preferred is the method wherein the sequence-specific recombination is performed by Int or by Int and a XIS factor, or by Int and IHF, or by Int and XIS and IHF. Further preferred is the method wherein the sequence-specific recombination is performed by a modified Int, preferably the Int-h or Int-h/218. In this context, use of a modified Int together with XIS and/or IHF is also within the meaning of the present invention.
?o By this approach any DNA sequence(s), comprising a second recombination sequence for the bacteriophage lambda integrase Int is/are integrated into a known, well characterized and defined locus of the host cell. To select for cells where a sequence-specific recombination has occurred one can introduce, for example, a non-functional expression cassette comprising the selection ~s marker gene, e.g. without a promoter or promoter/enhancer or only part of the coding region of the gene. Only if sequence-specific recombination has occurred, a complete and functional expression cassette with efficient expression of the selection marker gene will be generated, thus allowing for the selection of cells having integrated the gene of interest via sequence specific integration.
,o By the method of the present invention production cell lines are obtainable differ from the host cell merely by the identity of DNA sequences integrated at a defined site of integration, e.g. into a genomic locus. Due to less genetic variation between different cell clones a more generic process for the development of production cell lines can be used, thus reducing time and capacity for clone selection and development of an optimized production process. The production cell lines may be used for the manufacturing of the desired polypeptide(s).
A further aspect of the present invention therefore relates to a method of expressing at least one s gene of interest encoding one or more desired polypeptide(s)/products(s) in a eukaroytic cell, comprising a) . introducing a first DNA comprising an attB, attP, attL or attR sequence or a derivative thereof into a cell;
b) introducing a second DNA comprising an attB, attP, attL or attR sequence or a derivative ~o thereof, and at least one gene of interest into a cell, c) contacting said cell with a bacteriophage lambda integrase Int;
d) performing the sequence-specific recombination by a bacteriophage lambda integrase Int, wherein the second DNA is integrated into the first DNA; and e) cultivating said cell under conditions, wherein the genes) of interest is/are being i s expressed.
Preferred is that method, wherein if said first DNA sequence comprises an attB
sequence or a derivative thereof said second sequence comprises an attB, attL or attR
sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attP sequence or a derivative thereof ?o said second sequence comprises an attP, attL or attR sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attL sequence or a derivative thereof said second sequence comprises an attB, attP or attL sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attR sequence or a derivative thereof said second sequence comprises an attB, attP or attR sequence or a derivative thereof.
zs In a more preferred embodiment of that method, the first DNA has been integrated into the genome, an artificial-/minichromosome or an episomal element of a host cell, preferably at sites showing high transcription activity, before said second DNA is introduced into said cell.
3o The present invention also relates to a method of expressing at least one or more genes of interest in a host cell, wherein said host cell comprises one attB, attP, attL or attR
sequence or a derivative thereof integrated into the genome of said host cell, comprising a) introducing a DNA comprising an attB, attP, attL or attR sequence or a derivative thereof, and at least one gene of interest into said cell, b) contacting said cell with a bacteriophage lambda integrase Int;
c) performing the sequence-specific recombination by a bacteriophage lambda integrase Int, wherein the second DNA is integrated into the first DNA;
d) cultivating said cell under conditions, wherein the genes) of interest is/are being expressed.
The method may be carried out not only with an attB, attP, attL or attR
sequence or a derivative thereof being integrated into a host cell genome by genetic engineering of said cell, but also with naturally occurring recombination sequence of the genome, e.g. the attH-site described in 5 (5'-GAAATTCTTTTTGATACTAACTTGTGT-3'; SEQ ID N0:17) or any other Io recombination sequence, which allows sequence-specific recombination mediated by an Int or any functional mutant thereof.
Those methods are preferred, wherein said sequence-specific recombination is performed by Int or by Int and a XIS factor, or by Int and IHF, or by Int and XIS and IHF.
Further preferred is the is method wherein the sequence-specific recombination is performed by a modified Int, preferably the Int-h or Int-h/218. In this context, use of a modified Int together with XIS and/or IHF is also within the meaning of the present invention. Int, Int-h or Int-h/218, XIS, and/or IHF may be added to the cell in purified form or being co-expressed by said host cell, wherein the sequence-specific recombination is being performed.
zo A further embodiment of the above mentioned methods relates to a method, wherein the polypeptide(s)/product(s) which is/are encoded by the genes) of interest and being expressed in said host cell, is/are isolated from the cells or the cell culture supernatant, if secreted into the culture medium.
zs Said production cells are cultivated preferentially in semm-free medium and in suspension culture under conditions which are favorable for the expression of the desired genes) and isolating the protein of interest from the cells and/or the cell culture supernatant. Preferably the protein of interest is recovered from the culture medium as a secreted polypeptide, or it can be 3o recovered from host cell lysates if expressed without a secretory signal.
It is necessary to purifiy the protein of interest from other recombinant proteins, host cell proteins and contaminants in a way that substantially homogenous preparations of the protein of interest are obtained. As a first step often cells and/or particulate cell debris are removed from the culture medium or lysate. The product of interest thereafter is purified from contaminant soluble proteins, polypeptides and nucleic acids, for example, by fractionation on immunoaffinity or ion-exchange columns, ethanol precipitation, reverse phase HPLC, Sephadex chromatography on silica or on a cation exchange resin such as DEAE. In general, methods teaching a skilled persion how to purify a heterologous protein expressed by host cells, are well known in the art. Such methods are for example s described by Harris et al. (1995) Protein Purification: A Practical Approach, Pickwood and Hames, eds., IRL Press and Scopes, R. (1988) Protein Purification, Springer Verlag. Therefore, the aforementioned method of expressing at least one gene of interest may be added by an additional purification step, wherein the desired polypeptide is purified from the host cells or from cell culture if secreted into the culture medium.
~o The method of the present invention may be performed in all eukaryotic cells.
Cells and cell lines may be present e.g. in a cell culture and include but are not limited to eukaryotic cells, such as yeast, plant, insect or mammalian cells. For example, the cells may be oocytes, embryonic stem cells, hematopoietic stem cells or any type of differentiated cells. A
method is preferred ~s wherein the eukaryotic cell is a mammalian cell. More preferred is a method wherein the mammalian cell is a human, simian, marine, rat, rabbit, hamster, goat, bovine, sheep or pig cell.
Preferred cell lines or "host cells" for the production of biopharmaceuticals are human, mice, rat, monkey, or rodent cell lines. More preferred are hamster cells, preferably BHK21, BHK TK , CHO, CHO-K1, CHO-DUKX, CHO-DUKX B1, and CHO-DG44 cells or the zo derivatives/progenies of any of such cell lines. Particularly preferred are CHO-DG44, CHO-DLTKX, CHO-K1 and BHK21, and even more preferred CHO-DG44 and CHO-D>JKX cells.
Furthermore, marine myeloma cells, preferably NSO and Sp2/0 cells or the derivatives/progenies of any of such cell lines are also known as production cell lines.
zs Host cells are most preferred, when being established, adapted, and completely cultivated under semm free conditions, and optionally in media which are free of any protein/peptide of animal origin. Commercially available media such as Ham's F12 (Sigma, Deisenhofen, Germany), RPMI-1640 (Sigma), Dulbecco's Modified Eagle's Medium (DMEM; Sigma), Minimal Essential Medium (MEM; Sigma), Iscove's Modified Dulbecco's Medium (IMDM;
Sigma), CD-3o CHO (Invitrogen, Carlsbad, CA), CHO-S-SFMII (Invtirogen), serum-free CHO
Medium (Sigma), and protein-free CHO Medium (Sigma) are exemplary appropriate nutrient solutions.
Any of the media may be supplemented as necessary with a variety of compounds examples of which are hormones and/or other growth factors (such as insulin, transferrin, epidermal growth factor, insulin like growth factor), salts (such as sodium chloride, calcium, magnesium, phosphate), buffers (such as HEPES), nucleosides (such as adenosine, thymidine), glutamine, glucose or other equivalent energy sources, antibiotics, trace elements. Any other necessary supplements may also be included at appropriate concentrations that would be known to those skilled in the art. 1n the present invention the use of semm-free medium is preferred, but media s supplemented with a suitable amount of serum can also be used for the cultivation of host cells.
For the growth and selection of genetically modified cells expressing a selectable gene a suitable selection agent is added to the culture medium.
"Desired proteins/polypeptides" or "proteins/polypeptides of interest" of the invention are for io example, but not limited to insulin, insulin-like growth factor, hGH, tPA, cytokines, such as interleukines (IL), e.g. IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IL-16, IL-17, IL-18, interferon (IFN) alpha, IFN beta, IFN gamma, IFN
omega or IFN tau, tumor necrosisfactor (TNF), such as TNF alpha and TNF beta, TNF gamma, TRAIL; G-CSF, GM-CSF, M-CSF, MCP-1 and VEGF. Also included is the production of is erythropoietin or any other hormone growth factors and any other polypeptides that can serve as agonists or antagonists and/or have therapeutic or diagnostic use. The method according to the invention can also be advantageously used for production of antibodies, such as monoclonal, polyclonal, multispecific and single chain antibodies, or fragments thereof, e.g. Fab, Fab', F(ab')2, Fc and Fc'-fragments, heavy and light immunoglobulin chains and their constant, zo variable or hypervariable region as well as Fv- and Fd-fragments (Chamov, S.M. et al. (1999) Antibody Fusion Proteins, Wiley-Liss Inc.) Fab fragments (Fragment antigen-binding = Fab) consist of the variable regions of both chains which are held together by the adjacent constant region. These may be formed by protease zs digestion, e.g. with papain, from conventional antibodies, but similar Fab fragments may also be produced in the mean time by genetic engineering. Further antibody fragments include F(ab')2 fragments, which may be prepared by proteolytic cleaving with pepsin.
Using genetic engineering methods it is possible to produce shortened antibody fragments which 3o consist only of the variable regions of the heavy (VH) and of the light chain (VL). These are referred to as Fv fragments (Fragment variable = fragment of the variable part). Since these Fv-fragments lack the covalent bonding of the two chains by the cysteines of the constant chains, the Fv fragments are often stabilised. It is advantageous to link the variable regions of the heavy and of the light chain by a short peptide fragment, e.g. of 10 to 30 amino acids, preferably 1 S amino acids. In this way a single peptide strand is obtained consisting of VH and VL, linked by a peptide linker. An antibody protein of this kind is known as a single-chain-Fv (scFv). Examples of scFv-antibody proteins of this kind known from the prior art are described in Huston C. et al.
(1988) Proc. Natl. Acad. Sci. USA, 16, pp. 5879.
s In recent years, various strategies have been developed for preparing scFv as a multimeric derivative. This is intended to lead, in particular, to recombinant antibodies with improved pharmacokinetic and biodistribution properties as well as with increased binding avidity. In order to achieve multimerisation of the scFv, scFv were prepared as fusion proteins with io multimerisation domains. The multimerisation domains may be, e.g. the CH3 region of an IgG or coiled coil stmcture (helix structures) such as Leucin-zipper domains.
However, there are also strategies in which the interaction between the VH/VL regions of the scFv are used for the.
multimerisation (e.g. dia-, tri- and pentabodies). By diabody the skilled person means a bivalent homodimeric scFv derivative. The shortening of the Linker in an scFv molecule to 5- 10 amino is acids leads to the formation of homodimers in which an inter-chain VH/VL-superimposition takes place. Diabodies may additionally be stabilised by the incorporation of disulphide bridges.
Examples of diabody-antibody proteins from the prior art can be found in Perisic, O. et al. (1994) Structure, 2, pp. 1217.
Zo By minibody the skilled person means a bivalent, homodimeric scFv derivative. It consists of a fusion protein which contains the CH3 region of an immunoglobulin, preferably IgG, most preferably IgGl as the dimerisation region which is connected to the scFv via a Hinge region (e.g. also from IgGl) and a Linker region. Examples of minibody-antibody proteins from the prior art can be found in Hu, S. et al. (1996) Cancer Res., 56, pp. 3055.
Zs By triabody the skilled person means a: trivalent homotrimeric scFv derivative (Kortt A.A. et al.
(1997) Protein Engineering, l0,pp. 423). ScFv derivatives wherein VH-VL are fused directly without a linker sequence lead to the formation of trimers.
~o The skilled person will also be familiar with so-called miniantibodies which have a bi-, tri- or tetravalent structure and are derived from scFv. The multimerisation is carried out by di-, tri- or tetrameric coiled coil structures (Pack, P. et al. (1993) Biotechnology, 11, pp. 1271; Lovejoy, B.
et al. (1993) Science,. 259, pp. 1288; Pack, P. et al. (1995) J. Mol. Biol., 246, pp. 28). In a preferred embodiment of the present invention, the gene of interest is encoded for any of those desired polypeptides mentioned above, preferably for a monoclonal antibody, a derivative or fragment thereof.
In order to perform any embodiment of the present invention, an integrase has to act on the s recombination sequences. The integrase or the integrase gene and/or a co-factor or a co-factor gene, e.g. the XIS factor or the XIS factor gene and/or IHF or the IHF gene may be present in the eukaryotic cell already before introducing the first and second recombination sequence. They may also be introduced between the introduction of the first and second recombination sequence or after the introduction of the first and second recombination sequence.
Purification of ~o recombinase and host factor proteins has been described in the art (Hash, H.A. (1983) Methods of Enzymology, 100, pp. 210; Filutowicz, M. et al. (1994) Gene, 147, pp.149).
In cases when they are not known, cell extracts can be used or the enzymes can be partially purified using procedures described for example for Int or Cre recombinase. The purified proteins can be introduced into a cell by standard techniques, for example by means of injection or is microinjection or by means of a lipofection as described in example 2 of the present invention for IHF. The integrase used for the sequence-specific recombination is preferably expressed in the cell in which the reaction is earned out. For that purpose a third DNA
sequence comprising an integrase gene is introduced into the cells. If the sequence specific recombination is earned OLIt e.g. with ccttLlc~ttR a XIS factor gene (fourth DNA sequence) may be introduced into the cells Zo in addition. Most preferred is a method wherein the third and/or fourth DNA
sequence is integrated into the eukaryotic genome of the cell or an artificial-/minichromosome via homologous recombination or randomly. Further preferred is a method wherein the third and/or fourth DNA sequence comprises regulatory sequences resulting in a spatial and/or temporal expression of the integrase gene and/or XIS factor gene.
Zs In this case a spatial expression means that the Int recombinase, the XIS
factor, and/or the IHF
factor, respectively, is expressed only in a particular cell type by use of cell type specific promotors and catalyzes the recombination only in these cells, e.g. in liver cells, kidney cells, nerve cells or cells of the immune system. In the regulation of the integrase/XIS factor/IHF
3o expression a temporal expression may be achieved by means of promotors being active from or in a particular developmental stage or at a particular point of time in an adult organism.
Furthermore, the temporal expression may be achieved by use of inducible promotors, e.g. by interferon or tetracycline depended promotors; see review of Miiller, U.
(1999) Mech.
Develop.,82, pp. 3.
 
The integrase used in the method of the present invention may be both the wild-type and the modified (mutated) integrase of the bacteriophage lambda. As the wild-type integrase is only able to perform the recombination reaction at a high efficiency with a co-factor, namely IHF, it is s preferred to use a modified integrase in the method of the present invention. If the wild-type integrase is used in the method of the present invention, IHF may be needed in addition to achieve a stimulation of the recombination reaction. The modified integrase is modified such that said integrase may carry out the recombination reaction without IHF or other host factors such as XIS and FIS. For example, a recombination reaction between attL~and attR
sequences may be io preformed by a modified Int without the addition of a host factor (see results 5.1 and Figure 2C
and 2D).
The generation of modified polypeptides and screening for the desired activity is state of the art and may be performed easily; Erlich, H. (1989) PCR Technology. Stockton Press.
For example, is a nucleic acid sequence encoding for a modified integrase is intended to include any nucleic acid sequence that will be transcribed and translated into an integrase either in vitro on upon introduction of the encoding sequence into bacteria or eukaryotic cells. The modified integrase protein encoding sequences can be naturally occurring (by spontaneous mutation) or recombinantly engineered mutants and variants, tnmcated versions and fragments, functional Zo equivalents, derivatives, homologs and fusions of the naturally occurnng or wild-type proteins as long as the biological functional activity, meaning the recombinase activity, of the encoded polypeptide is maintained. Recombinase activity is maintained, when the modified recombinase has at least 50%, preferably at least 70%, more preferred at least 90%, most preferred at least 100% of the activity of the wild-type integrase Int, measured in a co-transfection assay with Zs substrate vectors and expression vectors as described in results 5.1 of the present invention or in Example 3 of WO 01/16345. Certain amino acid sequence substitutions can be made in an integrase or its underlying nucleic acid coding sequence and a protein can be obtained with like properties. Amino acid substitutions that provide functionally equivalent integrase polypeptides by use of the hydropathic index of amino acids (Kyte, J. et al. (1982) J. Mol.
Biol., 157, pp. 105) 3o can be prepared by performing site-specific mutagenesis or polymerase chain reaction mediated mutagenesis on its underlying nucleic acid sequence. In the present invention mutants or modified integrases are preferred, which show in comparison to a wild-type protein improved recombinase activity/recombination efficiency or an recombination activity independent of one or more host factors. "Wild-type protein" means a complete, non truncated, non modified, naturally occurring gene of the encoding polypeptide. Two Int mutants preferred are bacteriophage lambda integrases designated as Int-h and Int-h/218; Miller et al. (1980) Cell, 20, pp. 721; Christ, N. and Droge, P. (1999) J. Mol. Biol., 288, pp. 825. Int-h includes a lysine residue instead of a glutamate residue at position 174 in comparison to wild-type Int. Int-h/218 s includes a further lysine residue instead of a glutamate residue at position 218 and was generated by PCR mutagenesis of the Int-h gene. Said mutants may catalyze the recombination between c~ttBlattB, attPlattP, attLlattL or attRlattR and all other possible combinations, e.g. attPlattR, ccttLlattP, attLlattB, or attRlattB or the derivatives thereof without the co-factors IHF, XIS, and/or FIS and negative supercoiling in E. coli, in eukaryotic cells, and in vitro, i.e. with purified ~o substrates in a reaction tube. An improvement of the efficiency of the recombination may be achieved with a co-factor, e.g. FIS. The mutant Int-h/218 is preferred, because this mutant catalyze the recombination reaction with increased efficiency.
If the first reaction leads to an excision and the used two recombination sequences are identical, ~s e.g, attPlP, the resulting recombination sequences after the recombination will be identical to those on the substrate, e.g. here two attP sequences. If however, the two partner sequences are different, e.g. attPlR, the recombination reaction will generate hybrid recombination sequences which comprise one functional half from one sequence (e.g. attP) and one half from the other (ccttR). A functional half recombination site can be defined as the sequence either 5' or 3' form zo the overlap, whereby the overlap is considered, in each case, as a part of a funtional half site. If the respective overlap region of the used recombination sequences is identical the excision reaction may be performed with any recombination sequence according to the invention.
Additionally, the overlap region designates the orientation of the recombination sequences to each other also, i.e. inverted or direct. The reaction may be performed with wilt-type Int with zs low efficiency only, however, the addition of IHF or in the absence of IHF
the presence of arm binding sites) in addition to the core binding site stimulates and increases the efficiency. The reaction may be performed without any cofactor by a modified Int.
Furthermore, a method is preferred wherein a further DNA sequence comprising a Xis factor 3o gene is introduced into the cells. Most preferred is a method wherein the further DNA sequence further comprises a regulatory DNA sequence giving rise to a spatial and/or temporal expression of the Xis factor gene.
For example, after successful integrative intramolecular recombination (inversion) by means of Int leading to the activation/inactivation of a gene in a particular cell type said gene may be inactivated or activated at a later point of time again by means of the induced spatial and/or temporal expression of XIS with the simultaneously expression of Int.
s Furthermore, the invention relates to the use of any recombination sequences or the derivative thereof, e.g. to the derivative of attP as specified in SEQ ID NO: 2 in a sequence specific recombination of DNA in eukaryotic cells. The eukaryotic cell may be present in a cell aggregate of an organism, e.g. a mammal, having no integrase or Xis factor in its cells.
Said organism may be used for breeding with other organisms having in their cells the integrase or the Xis factor so io that off springs are generated wherein the sequence specific recombination is performed in cells of said off springs. Thus, the invention relates also to the use of an integrase or an integrase gene and a Xis factor or a Xis factor gene and an IHF factor or an IHF factor gene in a sequence .
specific recombination in eukaryotic cells. Furthermore, the present invention relates to eukaryotic cells and cell lines in which the method of the present invention was performed, is wherein said cells or cell lines are obtained after performing the method of the present invention.
The practice of the present invention will employ, unless otherwise indicated, conventional techniques of cell biology, molecular biology, cell culture, immunology and the like which are in the skill of one in the art. These techniques are fully disclosed in the current literature. See e.g.
zo Sambrook et al., Molecular Cloning: A Laboratory Manual, 2°'~ Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989); Ausubel et al., Current Protocols in Molecular Biology (1987, updated); Brown ed., Essential Molecular Biology, IRL
Press (1991);
Goeddel ed., Gene Expression Technology, Academic Press (1991); Bothwell et al. eds., Methods for Cloning and Analysis of Eukaryotic Genes, Bartlett Publ. (1990);
Wu et al., eds., zs Recombinant DNA Methodology, Academic Press (1989); Kriegler, Gene Transfer and Expression, Stockton Press (1990); McPherson et al., PCR: A Practical Approach, IRL Press at Oxford University Press (1991); Gait ed., Oligonucleotide Synthesis (1984);
Miller & Calos eds., Gene Transfer Vectors for Mammalian Cells (1987); Butler ed., Mammalian Cell Biotechnology ( 1991 ); Pollard et al., eds., Animal Cell Culture, Humana Press ( 1990);
Freshney et al., eds., 3o Culture of Animal Cells, Alan R. Liss (1987); Studzinski, ed., Cell Growth and Apoptosis, A
Practical Approach, IRL Press at Oxford University Presss (1995); Melamed et al., eds., Flow Cytometry and Sorting, Wiley-Liss (1990); Current Protocols in Cytometry, John Wiley & Sons, Inc. (updated); Wirth & Hauser, Genetic Engineering of Animals Cells, in:
Biotechnology Vol.
 
2, Piihler ed., VCH, Weinheim 663-744; the series Methods of Enzymology (Academic Press, Inc.), and Harlow et al., eds., Antibodies: A Laboratory Manual (1987).
All publications and patent applications mentioned in this specification are indicative of the level s of skill of those skilled in the art to which this invention pertains. All publications and patent applications cited herein are hereby incorporated by reference in their entirety in order to more fully describe the state of the art to which this invention pertains. The invention generally described above will be more readily understood by reference to the following examples, which are hereby included merely for the purpose of illustration of certain embodiments of the present io invention and are not intended to limit the invention in any way.
Examples Methods 1. Production of expression and substrate vectors ~s The construction of mock and Int expression vectors pCMV, pCMVSSInt, pCMVSSInt-h, and pCMVSSInt-h/218 have been described; Lorbach, E. et al. (2000) J. Mol.
Biol, 296, pp.1175. Int expression is driven by the human cytomegalovirus promoter.
Substrate vectors used in intramolecular recombination assays, containing attBlattP (p~,IR) or Zo attLlattR (p7~ER) as direct repeats, are derivatives of pGEM'~4Z (Promega).
p~,IR was constricted by inserting attB as double-stranded oligonucleotide into CIaI/EcoRI-cleaved pPGKneo. This vectors is a derivative of pPGKSSInt-h, in which the Int-h gene was replaced by a neomycin gene (neo) using PstIlXbaI. The CMV promoter plus a hybrid intron was generated by PCR using pCMVSSInt as template and cloned into the KpnI/CIaI-cleaved, zs ~zttB-containing pPGKneo vector. This CMV-attB-neo-expression cassette was then cloned by PCR into BamHI-cleaved pGEM~4Z. The attP site, containing an A-to-C
substitution in the P'-arm which deletes a translational stop signal, was generated by assembly PCR using primers (attP01) 5'-GTCACTATCAGTCAAAATACAATCA-3', (SEQ ID NO: 3).
30 (attP02) 5'-TGATTGTATTTTGACTGATAGTGAC-3', (SEQ ID NO: 4) (PFP-NsiI) 5'-CCAATGCATCCTCTGTTACAGGTCACTAATAC-3', (SEQ ID NO: S) and (P'RP-EcoRV-NotI) 5'-ATAAGAATGCGGCCGCAGATATCAGG
GAGTGGGACAAA.ATTGAA-3' (SEQ )D NO: 6).
31;1 pGFPattBlattP was used as template (Lorbach, E. et al. (2000) , supra). The PCR fragment was cleaved with NsiI and NotI and ligated to the 3'-end of a BamHI/PstI-fragment containing a transcriptional stop cassette, which was generated from pBS302 (Gibco/BRL).
The GFP
gene and the polyA signal was cloned by PCR using pCMVSSGFP (a derivative of s pCMVSSInt-h, in which the Int-h gene is replaced by eGFP using PstIlXbaI).
The GFP-containing PCR fragment was cleaved with NotI and XbaI and was then ligated together with the BamHIlNotI-cleaved transcriptional stop/attP fragment into the BamHIlXbaI-cleaved vector already containing the CMV promoter, attB, and the neo expression cassette. p7~ER
was constricted as p7~IR, except that attL was generated by PCR using pGFPattLlattR
to (Lorbach, E. et al. (2000), supra) as template, and was cloned into the CIaI/EcoRI-cleaved pPGKneo. The attR site was generated by PCR using pGFPattLlattR as template, and the product was cleaved with NsiI and NotI.
Substrate vectors for intermolecular recombination assays which contain the CMV promoter ~ s in front of different attachment sites: pCMVattPmut contains three G-to-C
substitutions in the P-arm. These changes were necessary to eliminate ATG start codons that would prevent GFP
expression after recombination. The substitutions are outside of protein binding sites in c~ttP
and were introduced by assembly PCR. First, two overlapping PCR products were generated, one with primer pair c~ttP-ATC-1/attP-2 and one with nttP-ATC-3/czttP-4.
pGFPattBlattP was zo used as template. PCR products were gel-purified and used as templates for PCR with primers attP-PstI and attP-XbaI. The resulting product was digested with PstI and XbuI, and cloned into pCMVSSInt: The primer sequences for assembly PCR are:
(attP-ATC-1) 5'-TTTGGATAAAAAACAGACTAGATAATACTGTAAAACA
CAAGATATGCAGTCACTA-3', (SEQ ID NO: 7) zs (attP-2) 5'-TAACGCTTACAATTTACGCGT-3', (SEQ ID NO: 8) (attP-ATC-3) 5'-CTGCATATCTTGTGTTTTACAGTATTATCTAGTCTG
TTTTTTATCCAAAATCTAA-3', (SEQ ID NO: 9) (attP-4) 5'CTGGACGTAGCCTTCGGGCATGGC-3', (SEQ ID NO: 10) (attP-PstI) 5'-GACTGCTGCAGCTCTGTTACAGGTCAC-3', (SEQ ID NO: 11) 30 (attP-XbaI) S'-GACTGTCTAGAGAAATCAAATAATGAT-3' (SEQ ID NO: 12).
pCMVattB was generated by inserting attB as double-stranded oligonucleotide into PstI/XbaI-cleaved pCMVattPmut. pCMVattL was generated by PCR using p7~ER as template for attL, which was introduced into PstI/XbaI-cleaved pCMVattPmut.
 
Vectors which contain a transcriptional stop signal and an att site placed in front of a promoterless GFP gene were constructed as follows: pWSattBGFP was generated by first deleting a part of the hygromycin gene from pTKHyg (Clontech) using AvaI and NdeI. The s vector backbone was ligated after the sticky ends were made blunt by Klenow polymerise. An attB-GFP fragment, generated by PCR, was cloned into MfeI and HindIII sites, thereby creating a new NheI site 5' of attB. Finally, the transcriptional stop sequence was inserted through.
restriction with EcoRI and NheI. pWSc~ttRGFP was generated by isolating the BczmHIlNotI
transcriptional stop-attR fragment from p~,ER, which was inserted into pWSattBGFP cleaved io with the same enzymes. pWSattPGFP was generated by PCR of the ~ttP site using pGFPattPlattB as template, which was inserted into pWSattBGFP cleaved with EcoRIlNotI thus replacing attB. Plasmids were isolated from E. coli strain XL1-Blue using affinity chromatography (Qiagen). The nucleotide composition of relevant genetic elements was verified by DNA sequencing using the fluorescence-based 373A system (Applied Biosystems).
is 2. Cell culture, recombination assays, and flow cytomery HeLa cells were cultured in Dulbecco's modified eagle medium (DMEM) supplemented with 10% fetal calf serum, streptomycin [0,1 mg/ml] and penicillin [100 U/ml].
Cells were passaged twice before transfection.
zo Typical recombination assays were performed as follows. Cells were harvested, washed with PBS and resuspended in RPMI 1640 without L-glutamine and phenol red (Life Technologies).
A total of 60 pg of expression and substrate vectors at a molar ratio of 1:1 were then introduced into approximately 1 x 107 cells at 300V and 960pF using a Gene pulser (Bio-Rad). After zs electroporation, cells were plated in an appropriate dilution on 10 cm dishes. A single-cell suspension was prepared at 24, 48, and 72 hrs after transfection. Dead cells were excluded from the analysis by staining with 7-amino-actinomycin D (Sigma), and cells were analyzed by FACScalibur (Becton Dickinson). FACS data were analyzed with CellQuestT~'' software. The transfection efficiencies for intermolecular recombination assays were determined for each 3o experiment by co-transfecting 40 pg pCMV with 20 pg pEGFP-C1 (Clontech);
those for intramolecular recombination were determined with 30 pg pCMV and 30 pg pEGFP-C
1.
Experiments involving purified IHF were performed by introducing first 30 pg of Int expression vectors to approximately 6 x 106 cells via electroporation as described above. After 3 to 4 hrs, about 1 x 105 cells were transfected with 2 pg of substrate vectors for intramolecular recombination, or with a total of 2 pg of substrate vectors at a molar ratio of l:l for intermolecular recombination. Substrates were pre-incubated at room temperature with 2 ~g purified IHF (Lange-Gustafson BJ, Nash HA., Purification and properties of Int-h, a variant s protein involved in site-specific recombination of bacteriophage lambda., J
Biol Chem. 1984 Oct 25;259(20):12724-32) in a low salt buffer (50 mM NaCI, 10 mM Tris-HC1, pH
8.0, 1 mM
EDTA) for at least 30 minutes. Transfection of IHF-DNA complexes was achieved with FuGene (Boehringer Mannheim) and the efficiencies were always in the range of 80%. Cells were analyzed by flow cytometry after additional 48 hrs as described above.
~o CHO-DG44/dhfr ~~ cells (Urlaub, G. et al., (1983), Cell, 33, pp. 405), grown permanently in suspension in the serum-free medium CHO-S-SFMII (Invitrogen, Carlsbad, CA) supplemented with hypoxanthine and thymidine (Invitrogen, Carlsbad, CA), are incubated in cell culture flasks at 37°C in a humidified atmosphere containing 5% COz. Cells are seeded at a concentration of 1-is 3x105 cells/mL in fresh medium every two to three days.
Stable transfections of CHO-DG44 cells are conducted using Lipofectamine Plus reagent (Invitrogen, Carlsbad, CA). Per transfection 6x105 exponentially growing cells in 0,8 mL
hypoxanthine/thymidine (HT)-supplemented CHO-S-SFMII medium are seeded in a well of a 6-well chamber. A total of 1 pg plasmid DNA , 4 ~uL Lipofectamine and 6 pL Plus reagent in a zo volume of 200 yL is used for each transfection and added to the cells, following the protocol of the manufacturer. After incubation for 3 hours 2 mL of HT-supplemented CHO-S-SFMII
medium is added. In the case of neomycin phosphotransferase-based selection the medium is replaced 2 days after transfection with CHO-S-SFMII medium, supplemented with HT and 400 yg/mL 6418 (Invitrogen), and the mixed cell populations are selected for 2 to 3 weeks with z> medium changes every 3 to 4 days. For the DHFR-based selection of stable transfected CHO-DG44 cells CHO-S-SFMII medium without hypoxanthine/thymidine is used. DHFR-based gene amplification is achieved by adding 5 - 2000 nM methotrexate (Sigma, Deisenhofen, Germany) as amplifying selection agent to the medium.
30 3. sICAM and MCP-1 ELISA
sICAM titers in supernatants of stable transfected CHO-DG44 cells are quantified by ELISA
with standard protocols (Ausubel, F.M. et al., (1994, updated) Current protocols in molecular biology. New York: Greene Publishing Associated and Wiley-Interscience) using two in house developed sICAM specific monoclonal antibodies (as described for example in US
patents No.
5,284,931 and 5,475,091), whereby one of the antibodies is a HRPO-conjugated antibody.
Purified sICAM protein is used as a standard. Samples are analyzes using a Spectra Fluor Plus reader (TECAN, Crailsheim, Germany).
s MCP-1 titers in supernatants of stable transfected CHO-DG44 cells are quantified by ELISA
using the OptEIA human MCP-1 set according to the manufacturer's protocol (BD
Biosciences Pharmingen, Heidelberg, Germany).
Example 1: Kinetics of intra- and intermolecular recombination reactions io We showed in our previous studies that mutant Int catalyzed intramolecular integrative and excisive recombination reactions in the absence of natural accessory factors in E. coli and in human cells (Christ, N. et al. (1999), sz~pra; Lorbach, E. et al. (2000), szcpra). However, an interesting question with respect to interactions of episomal DNA segments inside mammalian cells concerns the ability of mutant Int to perform intermolecular recombination, i.e. when two is recombination sites are located on different DNA molecules in traps. We compared therefore first intra- and intermolecular integrative recombination reactions.
Intramolecular recombination was tested with a substrate that contains attB
and attP as direct repeats flanking a transcriptional stop signal. This recombination cassette, in turn, is flanked by zo a CMV promoter and the coding region for GFP. Recombination between attB
and attP
generates hybrid sites attL and attR, and leads to excision of the stop signal. Subsequent expression of the GFP gene thus serves as reporter of recombination (Figure 2A, top).
Expression vectors for either Int, Int-h, or Int-h/218 were co-transfected with the substrate ~s vector into HeLa cells. The expression vector backbone (mock) was used as negative control.
Transfection efficiencies independently determined for each experiment were in the range of 95 to 98% (data not shown). FAGS analyses from 3 experiments show that both mutant Int efficiently catalyzed recombination, leading in some experiments to about 30%
GFP-expressing cells (Figure 2A, bottom). The nucleotide sequence of recombination products, determined 3o indirectly by DNA sequencing of PCR fragments, confirmed that the strand-transfer-reactions catalyzed by mutant Int generated the expected hybrid att sites (data not shown).
It is apparent that the double mutant Int-h/218 was more active than Int-h, whereas wild-type Int was almost inactive. The fraction of GFP-expressing cells increased during 48 hrs after transfection and remained steady for the next 24 hrs. The time course of the reactions also indicates that a majority of recombination events must have occurred within the first 24 hrs.
This correlates well with the time course of Int-h/218 expression in HeLa cells (data not shown).
Although we cannot exclude the possibility that a fraction of GFP-expressing cells resulted from s inter- instead of intramolecular integrative recombination, the data set can be used as a reference for our analysis of intermolecular recombination.
We analyzed intermolecular integrative recombination by placing attB and attP
on separate plasmids. Recombination translocates the CMV promoter to a position upstream of the GFP
~o gene (Figure 2B, top). Hence, only intermolecular recombination between attB and attP will generate GFP-expressing cells. FACS analyses after co-transfection of the two substrate vectors with Int expression vectors yielded results which are comparable to those generated with substrates for intramolecular recombination (Figure 2B, bottom). Again, the majority of recombination events must have occurred within the first 24 hrs after transfection and Int-h/218 ~s was more active than Int-h. Wild-type Int generated only a very small fraction of GFP-expressing cells. These results demonstrate that over a time course of 24 to 72 hrs, intermolecular integrative recombination by mutant Int is at least as efficient as the corresponding intramolecular reaction.
Zo The same experimental strategy was then employed to compare intra- and intermolecular excisive (attL x attR) recombination pathways. The results revealed again that intermolecular recombination by mutant Int was as efficient as intramolecular recombination (Figure 2C and D). The efficiency of excisive recombination reactions, however, was slightly reduced compared to integrative recombination. Recombination by wild-type Int was again barely zs detectable.
Example 2: DNA arm-binding sites in att are not required, but stimulate recombination The results so far show that mutant Int catalyzed integrative and excisive recombination on episomal substrates in a significant number of transfected cells. In contrast, recombination 3o activities of wild-type Int was barely detectable above background. Since excisive recombination by wild-type Int depends on the presence of protein co-factors IHF and XIS, but does not require negative DNA supercoiling, this result demonstrates that eukaryotic counterparts of these co-factors are lacking in human cells. Further, it is known that episomal substrates are topologically relaxed soon after transfection (Schwikardi et al. (2000) FEBS
 
Letters, 471, pp. 147). It appears, therefore, that mutant Int perform recombination without the formation of defined nucleoprotein complexes, such as the intasome assembled at attP. This raises the question of the functional role of DNA arm-binding sites in recombination. They were present in at least one of the partner att sites employed so far.
s In order to investigate this question, we used intermolecular recombination with pairs of substrate vectors containing attB or attP in various combinations (Figure 3A).
The fraction of GFP-expressing cells that results from recombination was determined by FAGS at 48 hrs after co-transfection with Int expression vectors. Transfection efficiencies were always above 90%
~o (data not shown). The results from 3 experiments show that intermolecular recombination between pairs of attP was as efficient as recombination between attB and attP
(Figure 3B).
However, only Int-h/218 utilized pairs of attB sites as substrate to a significant extent. The efficiency of this reaction was, on average, about four-fold reduced compared to reactions between attP and attP or attB and attP (Figure 3B) Hence, the fraction of GFP-expressing cells ~s that results from recombination between two attB sites dropped to a level of 4 to 5%. These results demonstrate that the presence of arm-type sequences in att sites is not required for recombination by Int-h/218, but significantly stimulates the reaction. This stimulatory effect is even more pronounced (about eight-fold) when Int-h was used. Farther, the residual recombination activity observed with wild-type Int appears highly dependent on the presence of zo arm binding sites.
Example 3: Recombination by wild-type Int is stimulated by transfected IHF
protein Efficient integrative recombination catalyzed by wild-type Int in vitro and in E. coli requires the protein co-factor IHF and supercoiling of attP. The apparent lack of either co-factor in as mammalian cells thus led us to investigate whether the residual recombination activity of wild type Int is augmented if purified IHF, pre-incubated with a supercoiled substrate, is co-introduced into HeLa cells. To test this possibility, we introduced first expression vectors for either wild-type Int or Int-h. At 3 to 4 hrs after electroporation, substrates for intra- or intermolecular recombination were incubated either with or without purified IHF. Protein-DNA
~o mixtures as well as protein-free control samples were then transfected using Fugene (Figure 4A). The fractions of GFP-expressing cells were compared after additional 48 hrs.
The results from three experiments show that intramolecular recombination by wild-type Int was stimulated, on average, up to five-fold due to the presence of IHF. The fraction of GFP-positive cells increased, for example, in one experiment from about 1% in the absence of IHF to 6% in its presence. The stimulatory effect on intermolecular recombination was also significant, bLlt less pronounced (about three-fold). At 48 hrs after transfection, the stimulation was specific for wild-type Int since the activity of Int-h was not affected. Importantly, controls showed that s transfection efficiencies were also not affected by the presence of IHF
protein (data not shown).
Example 4: Improved protein expression system based on sequence-specific recombination of gene of interest CHO-DG44 cells are stably transfected with a linearized first plasmid DNA
expressing the ~o fluorescent protein ZsGreenl from Zoc~nthus sp. (Clontech Laboratories Inc., Palo Alto, CA, U.S.A.) and the antibiotic resistance gene neomycin phosphotransferase (Figure S). In addition, either an attB or an attP recombination sequence (natural or modified sequence or derivative thereof) is placed between the gene for the fluorescent protein and its promoter. The first plasmid DNA, linearized by using a restriction enzyme with a single restriction site outside the is transcription units for both selection markers, is introduced by random integration into the genome of CHO-DG44. Cells with a successful stable random integration of the first plasmid DNA are positively selected for by cultivation in the presence of the antibiotic 6418. Within the heterogeneous pool of stable transfectants cells with a high transcription activity at the integration site of the first plasmid DNA can be isolated simply by fluorescence activated cell zo sorting (FACS) based on the expression level of the introduced fluorescent protein ZsGreenl.
Cells with the highest ZsGreenl fluorescence are sorted and placed as single cells into the wells of a 96 well plate. The resulting cell subclones are expanded and tested by restriction endoncuclease mapping in Southern blot analysis for integration of a single plasmid sequence in a single chromosomal site. For the latter genomic DNA of the cell subclones is digested with zs restriction enzymes with no, one and multiple restriction sites within the introduced first plasmid DNA, respectively, electrophoresed on a 0.8% agarose gel and transferred to positvely charged nylon membrane (Amersham Biosciences, Freiburg, Germany).
Hybridization is performed overnight at 65°C in a hybridization oven with a random-primed FITC-dUTP labeled probe consisting of the ZsGreenl gene according to the protocol of the Gene Images random 3o prime labelling module (Amersham Biosciences). Candidate subclones with a single copy insert are subsequently tested in small scale bioreactors for their performance in a production-mimicking fedbatch process. Besides high expression levels during the complete production phase, monitored by measuring the ZsGreenl fluorescence, other important parameters such as high viability at high cell density, metabolism and reproducible performance are taken into account. This way a suitable host cell with an integrated first att recombination sequence is identified. To generate a production cell line producing a biopharmaceutical by sequence-specific recombination this host cell is transfected with a second plasmid DNA
(see Figure 5) containing a promoterless dihydrofolate reductase gene preceded by either an attB or an attP
s recombination sequence (natural or modified sequence or derivative thereof) and a complete transcription unit for the expression of the gene of interest, for example the common cold therapeutic sICAM (soluble intercellular adhesion molecule 1) or the human monocyte chemoattractant protein-1 (MCP-1). In addition the vector pCMVSSInt-h/218 expressing the mutated (modified) bacteriophage lambda integrase is co-transfected. After transfection, ~o transient expression of Int-h/218 is sufficient to perform the sequence-specific intermolecular recombination between the first att recombination site (either attP or attB) located at a preferred transcriptional active locus within the host cell genome and the second att recombination site (either attP or attB) on the introduced second DNA plasmid. To select for cells where a sequence-specific recombination between attP and attP, attP and attB or attB
and attB has ~s occurred, depending on the choice of the recombination sequence on the first and second DNA
plasmid, transfected cells are transferred and cultivated in CHO-S-SFMII
medium without the supplements hypoxanthin and thymidine. Only correct targeting results in cells surviving the selection by placing via recombination the promoterless dhfr-marker gene with an upstream att recombination site on the second DNA plasmid under the control of the promoter sequence of zo the ZsGreenl gene with a downstream att recombination site, thus allowing for the efficient expression of the dhfr selection marker gene. At the same time the functional expression cassette of the ZsGreenl gene is interrupted leaving behind a promoterless ZsGreenl gene. Thus cells do not express a fluorescent protein any longer. The non-fluorescing cells are identified and isolated by FACS providing a means to detect the cells producing the protein of interest. In zs addition, sequence-specific integration is verified by Southern Blot and PCR analysis with primers located in the sequences flanking the att sites before and after site-specific recombination followed by subsequent DNA sequencing. Expression of the protein of interest, sICAM or MCP-1, is assayed by ELISA.
The use of dhfr as marker gene for the generation of production cell lines offers not only the 3o advantage of positive selection but also the possibility to increase the productivity of the cell by methotrexate-induced DHFR-based gene amplification even further. This is achieved by supplementing the hypoxanthin/thymidine-free cultivation medium CHO-S-SFMII
with increasing amounts of methotrexate.
 
SEQUENCE LISTING
<110> BOEHRINGER INGELHEIM PHARMA GmbH & Co. KG
Droge, Peter <120> Sequence specific DNA recombination in eukaryotic cells <130> DRO-003 PCT
<140> unknown <141> 2003-11-28 <150> CA 2,413,175 <151> 2002-11-28 <150> US 10/310,695 <151> 2002-12-05 <160> 17 <170> PatentIn version 3.1 <210> 1 <211> 10 <212> DNA
<213> Artificial Sequence <220>
<223> Consensus sequence for Int binding-site <220>
<221> misc feature <222> (1)..(1) <223> c or a <400> 1 magtcactat 10 <210> 2 <211> 243 <212> DNA
<213> Artificial Sequence <220>
<223> attP derivative <400>
 
tctgttacaggtcactaataccatctaagtagttgattcatagtgactgcatatcttgtg 60 ttttacagtattatctagtctgttttttatccaaaatctaatttaatatattgatattta 120 tatcattttacgtttctcgttcagcttttttatactaagttggcattataaaaaagcatt 180 gcttatcaatttgttgcaacgaacaggtcactatcagtcaaaataaaatcattatttgat 240 ttc 243 <210> 3 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 3 gtcactatca gtcaaaatac aatca 25 <210> 4 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 4 tgattgtatt ttgactgata gtgac 25 <210> 5 <211> 32 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 5 ccaatgcatc ctctgttaca ggtcactaat ac 32 <210> 6 <211> 44 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 6 ataagaatgc ggccgcagat atcagggagt gggacaaaat tgaa 44 <210> 7 <211> 55 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 7 tttggataaa aaacagacta gataatactg taaaacacaa gatatgcagt cacta 55 <210> 8 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 8 taacgcttac aatttacgcg t 21 <210> 9 <211> 55 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 9 ctgcatatct tgtgttttac agtattatct agtctgtttt ttatccaaaa tctaa 55 <210> 10 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 10 ctggacgtag ccttcgggca tggc 24 <210> 11 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 11 gactgctgca gctctgttac aggtcac 27 <210> 12 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 12 gactgtctag agaaatcaaa taatgat 27 <210> 13 <211> 21 <212> DNA
<213> Escherichia coli <400> 13 ctgctttttt atactaactt g 21 <210> 14 <211> 243 <212> DNA
<213> Bacteriophage lambda <400>
 
tctgttacaggtcactaataccatctaagtagttgattcatagtgactgcatatgttgtg60 ttttacagtattatgtagtctgttttttatgcaaaatctaatttaatatattgatattta120 tatcattttacgtttctcgttcagcttttttatactaagttggcattata-~aaaaagcatt180 gcttatcaatttgttgcaacgaacaggtcactatcagtcaaaataaaatcattatttgat240 ttc 243 <210> 15 <211> 102 <212> DNA
<213> Escherichia coli <400> 15 ctgctttttt atactaagtt ggcattataa aaaagcattg cttatcaatt tgttgcaacg 60 aacaggtcac tatcagtcaa aataaaatca ttatttgatt tc 102 <210> 16 <211> 162 <212> DNA
<213> Escherichia coli <400> 16 tctgttacag gtcactaata ccatctaagt agttgattca tagtgactgc atatgttgtg 60 ttttacagta ttatgtagtc tgttttttat gcaaaatcta atttaatata ttgatattta 120 tatcatttta cgtttctcgt tcagcttttt tatactaact tg 162 <210> 17 <211> 27 <212> DNA
<213> Homo sapiens <400> 17 gaaattcttt ttgatactaa cttgtgt 27
    A further example is the specific activation of the DNA tumor virus SV40 oncogene in the mouse lenses leading to tumor formation exclusively in these tissues. The Cre-IoxP strategy was used also in connection with inducible promoters. For example, the expression of the recombinase was regulated with an interferon-inducible promoter wleading to the deletion of a io specific gene in the liver and not - or only to a low extent - in other tissues; Kiihn, R. et al.
(1990 Science, 269, pp.1427.
So far three members of the invertase/resolvase family have been used for the manipulation of eukaryotic genomes. A mutant of the bacteriophage Mzi invertase Gin can catalyze the inversion ~s of a DNA fragment in plant protoplasts without cofactors. However, it has been discovered that this mutant is hyper-recombinogenic, i.e. it catalyzes DNA strand cleavages also at other than its naturally recombination sequences. This leads to unintended partially lethal recombination events in plant protoplast genomes. The (3-recombinase from Streptococcus pyogenes catalyses the recombination in mouse cell cultures between two recombination sequences as direct repeats zo leading to the excision of the segment. However, simultaneously with deletion also inversion has been detected which renders the controlled use of the system for manipulation of eukaryotic genomes unsuitable. Mutants of the y8 resolvase from E.coli have been shown to be active on episomal and artificially introduced genomic recombination sequences, but the efficiency of the latter reaction is still rather poor.
zs The manipulation of eukaryotic genomes with the Cre and Flp recombinase, respectively, shows significant disadvantages. In case of deletion, i.e. the recombination of two tandem repeated IoxP
or FRT recombination sequences in a genome there is an irreversibly loss of the DNA segment lying betW een the tandem repeats. Thus, a gene located on this DNA. segment will be lost 3o permanently for the cell and the organism. Therefore, the reconstruction of the original state for a new analyses of the gene function, e.g. in a later developmental stage of the organism, is impossible. The irreversible loss of the DNA segment caused by deletion may be avoided by an inversion of the respective DNA segment. A gene may be inactivated by an inversion without being lost and may be switched on again at a later developmental stage or in the adult animal by means of a timely regulated expression of the recombinase via back recombination. However, the use of both Cre and Flp recombinases in this modified method has the disadvantage that the inversion cannot be regulated as the recombination sequences will not be altered as a result of the recombination event. Thus, repeated recombination events occur causing the inactivation of s the respective gene due to the inversion of the respective DNA segment only in some, at best in 50% of the target cells at equilibrium of the reaction. There have been efforts to solve this problem, at least in part, by constructing mutated IoxP sequences which cannot be used for further reaction after a single recombination. However, the disadvantage is the uniqueness of the reaction, i.e. there is no subsequent activation by back recombination after inactivation of the ~o gene by inversion.
A further disadvantage of the Flp recombinase is its reduced heat stability at 37°C thus limiting the efficiency of the recombination reaction in higher eukaryotes significantly, e.g. in mice with a body temperature of about 39°C. Therefore, Flp mutants have been generated which exhibit a i s higher heat stability as the wild-type recombinase. However, even these mutant Flp enzymes still exhibit a lower recombination efficiency than the Cre recombinase.
A further use of sequence specific recombinases resides in the medical field, e.g. in gene therapy, where the recombinases integrate a desired DNA segment into the genome of a respective human zo target cell in a stable and controlled way. Both Cre and Flp may catalyze intermolecular recombination. Both recombinases recombine a plasmid DNA which carnes a copy of its respective recombination sequence with a corresponding recombination sequence which has been inserted before into the eukaryotic genome via homologous recombination.
However, it is desirable that this reaction includes a "naturally" occurring recombination sequence in the zs eukaryotic genome. Because loxP and FRT are 34 and 54 nucleotides long, respectively, occurrence of exact matches of these recombination sequences as part of the genome is statistically unlikely. Even if a recombination sequence would be present, the disadvantage of the aforementioned back reaction still exists, i.e. both Cre and Flp recombinase may excise the inserted DNA segment after successful integration by intramolecular recombination.
Thus, one problem of the present invention is to provide a simple and controllable recombination system, and the required working means. A further problem of the present invention is the provision of a recombination system and the required working means, which may carry out a stable and targeted integration of a desired DNA sequence. A further problem of the present invention is the provision of methods which allows the generation of an improved protein expression system on the basis of one of those recombination systems.
Said problems are solved by the subject matter characterized in the claims.
s The invention is explained in more detail with the following illustrations.
Figure 1 shows a schematic presentation of the recombination reactions namely integration and excision catalyzed by the wild-type integrase Int. A superhelical plasmid DNA
(top) carrying a io copy of the recombination sequence attP is shown. AttP consists of five so-called arm binding sites for Int (Pl, P2, P1', P2', P3'), two core Int binding sites (C and C';
marked with black arrows), three binding sites for IHF (Hl, H2, H'), two binding sites for Xis (X1, X2) and the so-called overlap region (open rectangle) where the actual DNA strand exchange takes place. The natural partner sequence for attP, attB, is shown on a linear DNA segment beneath and consists is of two core binding sites for Int (B and B'; marked with open arrows) and the overlap region. For the recombination between attB and attP, Int and IHF are necessary leading to the integration of the plasmid into the DNA segment carrying attB. Thereby, two new hybrid recombination sequences, attL and attR, are formed which serve ~as target sequences for the excision. The latter reaction requires in the wild-type situation Int and IHF, and a further cofactor XIS encoded by zo the phage lambda.
Figure 2 shows intra- and intermolecular recombination reactions. (A) Intramolecular integrative (attB x attP) recombination. (B) Intermolecular integrative (attB x attP) recombination. (C) Intramolecular excisive (attL x attR) recombination. (D) Intermolecular excisive (attL x attR) zs recombination. Substrate vectors and expected recombination products are schematized at the top of each panel. The fraction of GFP-expressing cells was determined by FACS
at three time points after co-transfection of substrate and expression vectors. We show mean values of three assays with standard deviations indicated by vertical lines.
3o Figure 3 shows that the presence of Int arm-binding DNA sequences in att sites stimulates intermolecular recombination. (A) Pairs of substrate vectors for intermolecular recombination contain either attB or nttP in different combinations and yield products that express GFP driven by the CMV promoter. (B) Various combinations of substrate vectors were co-transfected with expression vectors for wild-type Int, mutant Int-h, or Int-h1218. At 48 hrs, cells were analyzed by FACS and the ratio of GFP-expressing cells was determined fox two pairs of substrates.
Recombination between attP and attP served as reference, as indicated. We show mean values of three assays with standard deviations indicated by vertical lines. The actual mean values of GFP-expression cells (%) for Int were 0.08 (B x B), 1.24 (P x P), and 0.81 (P x B). Those for Int-h s were 1.15 (B xB), 8.07 (P x P), and 9.90 (P x B). Those for Int-h/218 were 4.01 (B x B), 17.62 (P
x P), and 16.45 (P x B).
Figure 4 shows that purified IHF protein stimulates intra- and intermolecular integrative recombination by wild-type Int. (A) Schematic representation of substrate vectors which were io incubated with or without IHF before trarisfection into HeLa cells that transiently expressed either wild-type Int or Int-h. (B) At 48 hrs after transfection, the fractions of GFP-expressing cells were analyzed by FRCS. The ratio of these fractions was plotted as activation of recombination by IHF. The graph shows mean values of three assays with standard deviations indicated by vertical lines. The actual mean values of GFP-expressing cells (%) in the is presence and absence of IHF, respectively, were for Int (7.93/1.26) and Int-h (17.57/13.14) in the case of intramolecular recombination, and for Int (13.94/3.47) and Int-h (20.33/16.83) analyzing intermolecular recombination.
Figure 5 schematically shows exemplary expression vector designs for the sequence specific zo DNA recombination in CHO-DG44 cells. "P/E" means a composite unit that contains both enhancer and promoter element, "P" a promoter element and "T" a transcription termination site required for polyadenylation of transcribed messenger RNA. "GOI" refers to a gene of interest, "dhfr" to the amplifiable selectable marker dihydrofolate reductase, "FP" to a fluorescent protein such as ZsGreen and "npt" to the selectable marker neomycin is phosphotransferase. An arrow indicates the site of transcription initiation within a transcription unit. The sequence specific recombination between the recombination site attP
or attB located on the first DNA and the recombination site attP or attB
located on the second DNA is depicted with a cross and is mediated by the bacteriophage lambda integrase. "att"
refers to the attachment sites resulting from the exemplarily shown recombination between 3o attP and cattP, attP and attB, attB and attP, or attB and attB located on the first and second DNA, respectively.
The term "transformation" or "to transform" , "transfection" or "to transfect"
as used herein means any introduction of a nucleic acid sequence into a cell, resulting in genetically modified, recombinant, transformed or transgenic cells. The introduction can be performed by any method well known in the art and described, e.g. in Sambrook, J. et al. (1989) Molecular Cloning: A
Laboratory Manual Cold Spring Harbor Laboratory, Cold Spring Harbor, New York or Ausubel, F.M. et al. (1994 updated) Current Protocols in Molecular Biology, New York:
Greene s Publishing Associates and Wiley-Interscience. Methods include but are not limited to lipofection, electroporation, polycation (such as DEAF-dextran)-mediated transfection, protoplast fusion, viral infections and microinjection or may be carried Ollt by means of the calcium method, electroshock method, intravenous/intramusuclar injection, aerosol inhalation or an oocyte injection. The transformation may result in a transient or stable transformation of the ~o host cells. The term "transformation" or "to transform" also means the introduction of a viral nucleic acid sequence in a way which is for the respective vims the naturally one. The viral nucleic acid sequence needs not to be present as a naked nucleic acid sequence belt may be packaged in a viral protein envelope. Thus, the term relates not only to the method which is usually known under the term "transformation" or "to transform". Transfection methods that ~s provide optimal transfection frequency and expression of the introduced nucleic acid are favored.
Suitable methods can be determined by routine procedures. For stable transfectants the constructs are either integrated into the host cell's genome or an artificial chromosome/mini-chromosome or located episomally so as to be stably maintained within the host cell.
?o The term "recombination sequences" as used herein relates to ~ttB, attP, attL and attR sequences and the derivatives thereof. An example for an attB sequence is specified in SEQ ID N0:13, an example for an attP sequence is specified in SEQ ID N0:14, an example for an attL sequence is specified in SEQ ID NO:15, and an example for an attR sequence is specified in SEQ ID N0:16.
zs The term "derivative" as used herein relates to attB, attP, attL and attR
sequences having one or more substitutions, preferably seven, more preferably two, three, four, five or six in the overlap region and/or core region in contrast to naturally occurring attB, attP, czttL
and c~ttR sequences.
The term "derivative" also relates to at least one core Int binding site of attB, attP, attL or attR.
The term "derivative" also relates to at least one core Int binding site of attP, attL or attR plus 30 one or more copies of the arm-binding sites for Int. The term "derivative"
also relates to at least one core Int binding site of attP, attL or attR plus one or more copies of the IHF, FIS or XIS
factor binding sites. The term "derivative" also relates to a combination of these features. The term "derivative" moreover relates to any functional fragments thereof and to endogenous nucleotide sequences in eukaryotic cells supporting sequence-specific recombination, e.g. attH
identified in the human genome (see e.g. WO 01/16345). The term "derivative"
in general includes attB, attP, attL or attR sequences suitable for realizing the intended use of the present invention, which means that the sequences mediate sequence-specific recombinantion events driven by an integrase (wild-type or modified) of the bacteriophage lambda.
s The term "functional fragment" relates to attB, attP, attL and attR sequences having substitutions, deletions, and/or insertions (including presence or absence of wild-type or modified protein binding sites), which do not significantly affect the use of said sequences in recombination events driven by an wild-type or modified integrase of the bacteriophage lambda.
io Functionality is not significantly affected, when recombination frequency is at least about 70%, preferably at least about 80%, more preferably about 90%, further more preferably at least about 95%, and most preferably more than about 100% in comparison to the corresponding naturally occurring recombination sequences, using the same recombinase under the same conditions (e.g.
in vitro ~or in vivo use, identical host cell type, identical transfection conditions, presence or is absence of the same host factors, the same buffer conditions, identical temperature etc.).
Alternatively, substitutions, deletions, and/or insertions in attB, attP, attL
and/or attR sequences confer at least an enhancement of the recombination events driven by a wild-type or modified integrase of the bacteriophage lccmbda, whereby said enhancement may consist for example of (i) increasing the efficiency of recombination events (integration and/or excision), (ii) increasing the zo specificity of recombination, (iii) favoring excisive recombination events, (iv) favoring integrative recombination events, (v) relieving the requirements for some or all host factors, in comparison to the corresponding naturally occurnng recombination sequences using the same recombinase under the same conditions (see above).
zs The functionality of modified recombination sites or of modified integrase can be demonstrated in ways that depend on the desired particular characteristic and are known in the art. For example, a co-transfection assay as described in the present invention (see Results 5.1 or Example 3 of WO 01/16345) may be used to characterize integrase-mediated recombination of extrachromosomal DNA in a variety of cell lines. Briefly, cells are co-transfected with an 30 expression vector encoding the integrase protein and a substrate vector that is a substrate for the recombinase, encoding a functional/non-functional reporter gene (e.g.
fluorescent protein like GFP) and containing at least one recombination sequence therein. Upon expression of the integrase by the expression vector, the function of the reporter gene will be rendered non-functional/functional. Thus, the recombination activity can be assayed either by recovering the recombined substrate vector and looking for evidence of recombination at the DNA level (for example by performing a PCR, sequence analysis of the recombined region, restriction enzyme analysis, Southern blot analysis) or by looking for evidence of the recombination at the protein level (e.g. ELISA, Western Blotting, radioimmunoassay, immunoprecipitation, immunostaining, s FACS-analysis of fluorescent proteins).
The term "overlap region" as used herein defines the sequence of the recombination sequences where the DNA strand exchange, including strand cleavage and religation, takes place and relates to the consensus DNA sequence S'-TTTATAC-3' in wild-type att sites or said sequence ~o having functional nucleotide substitutions. The only prerequisite is, that the sequence of the overlap region is identical between recombining partner sequences.
The term "core binding sites" relates to two imperfectly repeated copies in inverted orientation, separated by the overlap region, in each set of wild-type att sites. The core binding sites are ~s essential for the recombination by binding the integrase at low affinity.
Each core binding site consists of nine contiguous base pairs and relates to DNA sequences consisting for the B-sequence of the nucleotide sequence 5'-CTGCTTTTT-3', for the B'-sequence of the nucleotide sequence 5'-CAAGTTAGT-3' (reverse complementary strand), for the C-sequence of the nucleotide sequence 5'-CAGCTTTTT-3', and for the C'-sequence of the nucleotide sequence zo 5 ~-CAACTTAGT-3' (reverse complementary strand) in wild-type att sites or said sequences having functional nucleotide substitutions.
The term "arm-binding site for Int" or "arm-binding sites" as used herein relates to the consensus sequence S'-C/AAGTCACTAT-3' or said sequence having functional nucleotide substitutions.
zs The arm-binding site for Int may be positioned at various distances upstream and/or downstream of the core Int binding site(s).
The term "homologue" or "homologous" or "similar" as used herein with regard to recombination sequences, arm-binding sites, and host factor binding sites relates to a nucleic acid ~o sequence being identical for about 70%, preferably for about 80%, more preferably for about 85%, further more preferably for about 90%, further more preferably for about 95%, and most preferably for about 99% to naturally occurring recombination sequences, arm-binding sites, and host factor binding sites. As homologous or similar are considered sequences, which e.g. using standard parameters in the similarity algorithm BLAST of NCBI (Basic Local Alignment Search Tool, Altschul et al., Journal of Molecular Biology 215, 403-410 (1990)) showing a probability of P < 10-s when compared to the recombination sequences.
The term "vector" as used herein relates to naturally occurring or synthetically generated s constructs for uptake, proliferation, expression or transmission of nucleic acids in a cell, e.g.
plasmids, phagemids, cosmids, artificial chromosomes/mini-chromosomes, bacteriophages, viruses or retro vimses. Methods used to construct vectors are well known to a person skilled in the art and described in various publications. In particular techniques for constnicting suitable vectors, including a description of the functional and regulatory components such as promoters, io enhancers, termination and polyadenylation signals, selection markers, origins of replication, and splicing signals, are reviewed in considerable details in Sambrook, J. et al.
(1989), supra, and references cited therein. The eukaryotic expression vectors will typically contain also prokaryotic sequences that facilitate the propagation of the vector in bacteria such as an origin of replication and antibiotic resistance genes for selection in bacteria. A
variety of eukaryotic ~s expression vectors, containing a cloning site into which a polynucleotide can be operatively linked, are well known in the art and some are commercially available from companies such as Stratagene, La Jolla, CA; Invitrogen, Carlsbad, CA; Promega, Madison, WI or BD
Biosciences Clontech, Palo Alto, CA.
Zo The terms "gene of interest", "desired sequence", or "desired gene" as used herein have the same meaning and refer to a polynucleotide sequence of any length that encodes a product of interest.
The selected sequence can be full length or a tnmcated gene, a fusion or tagged gene, and can be a cDNA, a genomic DNA, or a DNA fragment, preferably, a cDNA. It can be the native sequence, i.e. naturally occurnng form(s), or can be mutated or otherwise modified as desired.
2s These modifications include codon optimizations to optimize codon usage in the selected host cell, humanization or tagging. The selected sequence can encode a secreted, cytoplasmic, nuclear, membrane bound or cell surface polypeptide. The "product of interest"
includes proteins, polypeptides, fragments thereof, peptides, antisense RNA all of which can be expressed in the selected host cell.
The term "nucleic acid sequence", "nucleotide sequence", or "DNA sequence" as used herein refers to an oligonucleotide, nucleotide or polynucleotide and fragments and portions thereof and to DNA or RNA of genomic or synthetic origin, which may be single or double stranded and represent the sense or antisense strand. The sequence may be a non-coding sequence, a coding to sequence or a mixture of both . The polynucleotides of the invention include nucleic acid regions wherein one or more codons have been replaced by their synonyms.
The nucleic acid sequences of the present invention can be prepared using standard techniques s well known to one of skill in the art. The term "encoding" or "coding"
refers to the inherent property of specific sequences of nucleotides in a nucleic acid, such as a gene in chromosome or an mRNA, to serve as templates for synthesis of other polymers and macromolecules in biological processes having a defined sequence of nucleotides (i.e. rRNA, tRNA, other RNA
molecules) or amino acids and the biological properties resulting therefrom.
Thus a gene encodes io a protein, if transcription and translation of mRNA produced by that gene produces the protein in a cell or other biological system. Both the coding strand, the nucleotide sequence of which is identical to the mRNA sequence and is usually provided in sequence listings, and non-coding strand, used as the template for the transcription, of a gene or cDNA can be referred to as encoding the protein or other product of that gene or cDNA. A nucleic acid that encodes a is protein includes any nucleic acids that have different nucleotide sequences but encode the same amino acid sequence of the protein due to the degeneracy of the genetic code.
Nucleic acids and nucleotide sequences that encode proteins may include introns.
The term "polypeptide" is used interchangeably with amino acid residue sequences or protein zo and refers to polymers of amino acids of any length. These terms also include proteins that are post-translationally modified through reactions that include, but are not limited to, glycosylation, acetylation, phosphorylation or protein processing. Modifications and changes, for example fusions to other proteins, amino acid sequence substitutions, deletions or insertions, can be made in the structure of a polypeptide while the molecule maintains its biological functional activity.
zs For example certain amino acid sequence substitutions can be made in a polypeptide or its underlying nucleic acid coding sequence and a protein can be obtained with like properties.
Amino acid modifications can be prepared for example by performing site-specific mutagenesis or polymerase chain reaction mediated mutagenesis on its underlying nucleic acid sequence.
3o The term "expression" as used herein refers to transcription and/or translation of a heterologous nucleic acid sequence within a host cell. The level of expression of a desired product in a host cell may be determined on the basis of either the amount of corresponding mRNA
that is present in the cell, or the amount of the desired polypeptide encoded by the selected sequence. For example, mRNA transcribed from a selected sequence can be quantitated by Northern blot hybridization, ribonuclease RNA protection, in situ hybridization to cellular RNA or by PCR
(see Sambrook, J. et al. (1989), supra; Ausubel, F.M. et al. (1994 updated), supra). Proteins encoded by a selected sequence can be quantitated by various methods, e.g. by ELISA, by Western blotting, by radioimmunoassays, by immunoprecipitation, by assaying for the biological s activity of the protein, or by immunostaining of the protein followed by FAGS analysis PCR (see Sambrook, J. et al. (1989), supra; Ausubel, F.M. et al. (1994 updated), supra).
An "expression cassette" defines a region within a construct that contains one or more genes to be transcribed, wherein the genes contained within the segment are operatively linked to each ~o other and transcribed from a single promoter, and as result, the different genes are at least transcriptionally linked. More than one protein or product can be transcribed and expressed from each transcription unit. Each transcription unit will comprise the regulatory elements necessary for the transcription and translation of any of the selected sequence that are contained within the unit.
is The term "operatively linked" means that two or more nucleic acid sequences or sequence elements are positioned in a way that permits them to function in their intended manner. For example, a promoter and/or enhancer is operatively linked to a coding sequence if it acts in cis to control or modulate the transcription of the linked sequence. Generally, but not necessarily, the zo DNA sequences that are operatively linked are contiguous and, where necessary to join two protein coding regions or in the case of a secretory leader, contiguous and in reading frame.
The term "selection marker gene" refers to a gene that only allows cells carrying the gene to be specifically selected fox or against in the presence of a corresponding selection agent. By way of zs illustration, an antibiotic resistance gene can be used as a positive selectable marker gene that allows the host cell transformed with the gene to be positively selected for in the presence of the corresponding antibiotic; a non-transformed host cell would not be capable of growth or survival under the selection culture conditions. Selectable markers can be positive, negative or bifimctional. Positive selectable markers allow selection for cells carrying the marker by 3o conferring resistance to a dnig or compensate for a metabolic or catabolic defect in the host cell.
In contrast, negative selection markers allow cells carrying the marker to be selectively eliminated. For example, using the HSV-tk gene as a marker will make the cells sensitive to agents such as acyclovir and gancyclovir. The selectable marker genes used herein, including the amplifiable selectable genes, will include recombinantly engineered mutants and variants, fragments, functional equivalents, derivatives, homologs and fusions of the native selectable marker gene so long as the encoded product retains the selectable property.
Useful derivatives generally have substantial sequence similarity (at the amino acid level) in regions or domains of the selectable marker associated with the selectable property. A variety of marker genes have s been described, including bifunctional (i.e. positivelnegative) markers (see e.g. WO 92/08796 and WO 94/28143), incorporated by reference herein. For example, selectable genes commonly used with eukaryotic cells include the genes for aminoglycoside phosphotransferase (APH), hygromycin phosphotransferase (HYG), dihydrofolate reductase (DHFR), thymidine kinase (TK), glutamine synthetase, asparagine synthetase, and genes encoding resistance to neomycin ~ o (G418), puromycin, histidinol D, bleomycin and phleomycin.
Selection may also be made by fluorescence activated cell sorting (FACS) using for example a cell surface marker, bacterial (3-galactosidase or fluorescent proteins (e.g.
green fluorescent proteins (GFP) and their variants from Aeqzcorea victoria and Renilla reniformis or other species;
is red fluorescent proteins, fluorescent proteins and their variants from non-bioluminescent species (e.g. Discosoma sp., Anemonia sp., Clavularia sp., Zoanthzcs sp.) to select for recombinant cells.
The term "selection agent" refers to a substance that interferes with the growth or survival of a host cell that is deficient in a particular selectable gene. For example, to select for the presence of zo an antibiotic resistance gene like APH (aminoglycoside phosphotransferase) in a transfected cell the antibiotic Geneticin (G418) is used.
The integrase (usually and designated herein as "Int") of the bacteriophage lambda belongs like Cre and Flp to the integrase family of the sequence specific conservative DNA
recombinases. In zs its natural function Int catalyses the integrative recombination between two different recombination sequences namely attB and attP. AttB comprises 21 nucleotides and was originally isolated from the E. coli genome; Mizuuchi, M. and Mizuuchi, K.
(1980) Proc. Natl.
Acad. Sci. USA, 77, pp. 3220. On the other hand attP having 243 nucleotides is much longer and occurs naturally in the genome of the bacteriophage lambda; Landy, A., and Ross, W. (1977) 3o Science, 197, pp. 1147. The Int recombinase has seven binding sites altogether in attP and two in attB. The biological function of Int is the sequence specific integration of the circular phage genome into the locus attB on the E. coli chromosome. Int needs a protein co-factor, the so-called integration host factor (usually and designated herein as "IHF") for the integrative recombination; Kikuchi, Y. and Nash, H. (1978) J. Biol. Chem., 253, 7149. IHF
is needed for the assembly of a functional recombination complex with attP. A second co-factor for the integration reaction is the DNA negative supercoiling of attP. Finally, the recombination between attB and attP leads to the formation of two new recombination sequences, namely attL
and attR, which serve as substrate and recognition sequence for a further recombination reaction, s the excision reaction. A comprehensive summary of the bacteriophage lambda integration is given e.g. in Landy, A. (1989) Annu. Rev. Biochem., 58, pp. 913.
The excision of the phage genome out of the bacterial genome is catalyzed by the Int recombinase also. For this, a further co-factor is needed in addition to Int and IHF, which is io encoded by the bacteriophage lambda. This is the excisionase (usually and designated herein as "XIS") having two binding sites in attR; Gottesman, M. and Weisberg, R. (1971) The Bacteriophage Lambda, Cold Spring Harbor Laboratory, pp.113. In contrast to the integrative recombination, DNA negative supercoiling of the recombination sequences is not necessary for the excisive recombination. However, DNA negative supercoiling increases the efficiency of the is recombination reaction: A further improvement of the efficiency of the excision reaction may be achieved with a second co-factor namely FIS (factor for inversion stimulation), which acts in conjunction with XIS; Landy, A. (1989) Annu. Rev. Biochem., 58, pp.913. The excision is genetically the exact reverse reaction of the integration, i.e. attB and attP
are generated again. A
comprehensive summary of the bacteriophage lambda excision is given e.g. in Landy, A. (1989) ~o Annu. Rev. Biochem., 58, pp. 913.
One aspect of the present invention relates to a method of sequence specific recombination of DNA in a eukaryotic cell, comprising a) introducing a first attB, attP, attL or attR sequence or a derivative thereof into a cell, zs b) introducing a second attB, attP, attL or czttR sequence or a derivative thereof into a cell, wherein if said first DNA sequence comprises an attB sequence or a derivative thereof said second sequence comprises an attB, attL or attR sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attP sequence or a derivative thereof said second sequence comprises an czttP, attL or attR sequence or a derivative thereof, or wherein if said first 3o DNA sequence comprises an attL sequence or a derivative thereof said second sequence comprises an attB, attP or attL sequence or a derivative thereof, or wherein if said first DNA
sequence comprises an attR sequence or a derivative thereof said second sequence comprises an attB, attP or attR sequence or a derivative thereof, c) performing the sequence-specific recombination by a bacteriophage lambda integrase Int.
Preferred is the method wherein in step c) the sequence-specific recombination is performed by Int or by Int and XIS, FIS, and/or IHF. Most preferred is the method wherein in step c) the sequence-specific recombination is performed by Int or by Int and a XIS
factor, or by Int and s IHF, or by Int and XIS and IHF. Further preferred is the method wherein in step c) the sequence-specific recombination is performed by a modified Int, preferably the Int-h or Int-h/218. In this context, use of a modified Int together with XIS, FIS and/or IHF is also within the meaning of the present invention.
io In a more preferred embodiment of this method, sequence specific recombination of DNA in a eukaryotic cells will be performed between identically or nearly identically recombination sites.
Therefore, the present invention relates a method of sequence specific recombination as described above, wherein if said first DNA sequence comprises an attB sequence or a derivative thereof said second sequence comprises also attB sequence or a derivative thereof, or wherein if is said first DNA sequence comprises an attP sequence or a derivative thereof said second sequence comprises an attP sequence or a derivative thereof, or wherein if said first DNA
sequence comprises an attL sequence or a derivative thereof said second sequence comprises an attL sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attR
sequence or a derivative thereof said second sequence comprises an attR
sequence or a zo derivative thereof.
The method of the present invention may be carned out not only with the naturally occuring attB, attP, attL, and/or attR sequences but also with modified e.g.
substituted attB, attP, attL, and/or attR sequences. For example an integrative recombination of the bacteriophage lambda Zs and E. coli between attP and attB homologous sequences (mutants of the wild-type sequences) have been observed which have one or more substitutions in attB (Nash, H.
(1981) Annu. Rev.
Genet., 15, pp. 143; Nussinov, R. and Weisberg, R. (1986) J. Biomol. Stntct.
Dynamics, 3, pp 1134) and/or in attP (Hash, H. (1981) Annu. Rev. Genet., 15, pp.143).
3o Thus, the present invention relates to a method wherein the used attB, attP, attL, and/or attR
sequences have one or more substitutions in comparison to the naturally occuring attB, attP, attL, and/or attR sequences. Preferred is a method wherein the attB, attP, attL, and/or attR
sequences have one, two, three, four, five, six, seven or more substitutions.
The substitutions may occur both in the overlap region and in the core region. The complete overlap region comprising seven nucleotides may be substituted also. More preferred is a method wherein substitutions are introduced into the attB, attP, attL, and/or attR sequences either in the core region or in the overlap region. Preferred is the introduction of a substitution in the overlap region and the simultaneous introduction of one or two substitutions in the core region. The s present invention also relates to a method wherein the used attB, attP, attL, and/or attR
sequences are derivatives, including functional fragments thereof, of said recombination sites in comparison to the naturally occurnng attB, attP, attL, and/or attR sequences.
A modification in the form of one or more substitutions) into recombination sequences is to be io chosen such that the recombination can be carned out in spite of the modification(s). Examples for such substitutions are listed e.g. in the publications of Nash, H. (1981), supra and Nussinov, R. and Weisberg, R. (1986), sz~pra and are not considered to be limiting.
Further modifications may be easily introduced e.g. by mutagenesis methods (a number of these are described in Ausubel, F.M. et al. (1994 updated), supra) and and may be tested for their use by test is recombinations as described e.g. in the examples of the present invention (Examples 1 and 2, results 5.1 ).
Furthermore, the present invention relates to a method wherein the used attB, attP, cattL, and/or attR sequences comprise only of one of the respective core Int binding sites, however, more than zo two core Int binding sites are also preferred. In a preferred embodiment, the present invention relates to a method wherein the used attB, attP, attL, and/or attR sequences consist only of one of the respective core Int binding sites. In a further embodiment the used attB, attP, attL, and/or attR sequences consist of two or more core Int binding sites.
zs The present invention relates further to a method wherein the used attP, attL, and/or attR
sequences comprise in addition to the core Int binding site one or more, preferably two, three, four, five or more than five, copies of the arm-binding site for Int. Said binding site comprises a consensus motive having the sequence 5'-C/AAGTCACTAT-3' (SEQ ID NO:1) or a modified sequence thereof having nucleotide substitutions and being functional with regard to the Int 3o binding. The arm-binding sites) for Int may be positioned at various distances upstream and/or downstream of the core Int binding site(s).
In order to perform the method of the present invention the first recombination sequence may comprise further DNA sequences which allow the integration into a desired target locus, e.g. in the genome of the eukaryotic cell or an artificial-/minichromosome. This recombination occurs e.g. via the homologous recombination which is mediated by internal cellular recombination mechanisms. For said recombination, the further DNA sequences have to be homologous to the DNA of the target locus and located both 3' and 5' of the attB, attL, ccttP, or attR sequences or s derivatives thereof, respectively. The person skilled in the art knows how great the degree of the homology and how long the respective 3' and 5' sequences have to be such that the homologous recombination occurs with a sufficient probability; see review of Capecchi, M.
(1989) Science, 244, pp. 1288.
~o However, it is also possible to integrate the first recombination sequence by any other mechanism into the genome of the eukaryotic cell, or any artificial-/minichromosome, e.g. via random integration which is also mediated by internal cellular recombination events. Integration of said first recombination site via sequence-specific recombination using sites different from those being integrated, e.g. by using IoxPlFRT sequences, is also conceivable.
is The second recombination sequence may also comprise DNA sequences which are necessary for an integration into a desired target locus via homologous recombination. For the method of the present invention both the first and/or the second recombination sequence may comprise the further DNA sequences. Preferred is a method wherein both DNA sequences comprise the zo further DNA sequences.
Introduction of the first and second recombination sequence with or without further DNA
sequences may be performed both consecutively and in a co-transformation wherein the recombination sequences are present on two different DNA molecules. Preferred is a method, Zs wherein the first and second recombination sequence with or without further DNA sequences are present and introduced into the eukaryotic cells on a single DNA molecule.
Furthermore, the first recombination sequence may be introduced into a cell and the second recombination sequence may be introduced into another cell wherein the cells are fused subsequently.
The term fusion means crossing of organisms as well as cell fusion in the widest sense.
The method of the present invention may be used e.g. to invert a DNA segment lying between the indirectly orientated recombination sequences in an intramolecular recombination.
Furthermore, the method of the present invention may be used to delete the DNA
segment lying between the directly orientated recombination sequences in an intramolecular recombination. If the recombination sequences are each incorporated in 5'-3' or in 3'-5' orientation they are present in direct orientation. The recombination sequences are in indirect orientation if e.g. the attB
sequence is integrated in S'-3' and the attP sequence is integrated in 3'-5' orientation. If the recombination sequences are each incorporated e.g. via homologous recombination into intron s sequences 5' and 3' of an exon and the recombination is performed by an integrase, the exon would be inverted in case of indirectly orientated recombination sequences and deleted in case of directly orientated recombination sequences, respectively. With this procedure the polypeptide encoded by the respective gene may lose its activity or function or the transcription may be stopped by the inversion or deletion such that no (complete) transcript is generated. In this way io e.g. the biological function of the encoded polypeptide may be investigated. Moreover, inversion or deletion reactions may be used to activate the expression of a gene encoding a desired polypeptide, e.g. by functional linkage of the open reading frame of the encoded polypeptide with regulatory elements which allow transcription and/or translation of the encoded polypeptide. Those regulatory elements include but are not limited to a promotor and or i ~ promotor/enhancer elements, which are well knoiyn in the art for various eukaryotic expression systems.
However, the first and/or second recombination sequence may comprise further nucleic acid sequences encoding one or more polypeptides/products of interest. For example a structural zo protein, an enzymatic or a regulatory protein may be introduced via the recombination sequences into the genome being transiently or stably expressed after intramolecular recombination. The introduced polypeptide/product may be an endogenous or exogenous one.
Furthermore, a marker protein or biopharmaceutically relevant therapeutic polypeptides may be introduced. The person skilled in the art knows that this listing of applications of the method according to the present ?s invention is only exemplary and not limiting. Examples of applications according to the present invention performed with the so far used Cre and Flp recombinases may be found e.g. in the review of Kilby, N. et al., (1993), Trends Genet., 9, pp.413.
Furthermore, the method of the present invention may be used to delete or,invert DNA segments 30 on vectors by an intramolecular recombination on episomal substrates. A
deletion reaction may be used e.g. to delete packaging sequences from so-called helper viruses. This method has a broad application in the industrial production of viral vectors for gene therapeutic applications;
Hardy, S. et al., (1997), 3. Virol., 71, pp.1842.
The intermolecular recombination leads to the fusion of two DNA molecules each having a copy of attB, attP, attL, or attR or various combinations of att sequences or of their derivates. For example, attB or a derivative thereof may be introduced first via homologous recombination in a known, well characterized genomic locus of a cell or an artificial-Iminchromosome.
s Subsequently an ccttB, attP, attL, or attR carrying vector or DNA-segment may be integrated into said genomic attB sequence via intermolecular recombination. Preferred in this method is the co-expression of the mutant integrase, e.g. Int-h or Int-h/218 within the eukaryotic cell, wherein the recombination occurs. Most preferred is the co-expression of the mutant integrase Int-h/218.
Genes encoding for any of those mutant integrases may be located on a second DNA vector io being transfected, preferably co-transfected, or on the vector or DNA-segment carrying the attP, attL, attR or also an czttB sequence or an derivative thereof. Further sequences may be located on the attB, attP, attL, or attR carrying vector or DNA-segment, e.g. a gene for a particular marker protein flanked by loxPlFRT sequences. With this approach it may be achieved that, e.g. in comparative expression analyses of different genes in a cell type, said genes are not influenced is by positive or negative influences of the respective genomic integration locus. Furthermore, the method of the present invention may be used to fuse DNA segments on vectors by an intermolecular recombination on episomal substrates. A fusion reaction may be used e.g. to express recombinant proteins or relevant domains in order to screen for phenotypes. This method may be used in the high throughput analysis of protein functions in eukaryotic cells and is thus of zo considerable interest.
As mentioned above, intermolecular recombination may be used to introduce one or more genes) of interest encoding one or more desired polypeptide(s)/product(s) into, e.g. episomal substrates, artificial-/minichromosomes, or various host cell genomes containing a first zs recombination sequence. In this context a second DNA comprises beside at least one recombination sequence, e.g. attP, attB, attL, attR or any derivative thereof, one or more expression cassettes) for the expression of one or more desired protein(s)/product(s). That expression cassette may be introduced into a desired target locus via the recombination sequences which allows sequence-specific recombination between the DNA
comprising the 3o second recombination sequence and the expression cassette, and the first recombination sequence being introduced before into said episomal substrate, artificial-/minichromosome, or host cell genome. This embodiment may be of high interest for establishing high expression cell lines which are suitable for the production of biopharmaceutical products.
In this context, a first DNA comprising at least one recombination sequence has to be introduced, e.g. by random integration, into the genome of the host cell, an artificial-Jminichromosomes or episomal substrates contained within the host cell. Alternatively, host cell may be transformed with an artificial-/minichromosome or episomal substrate comprising a corresponding at least s one recombination site(s). Another way to integrate recombination sequences) into a desired target locus, recognized by a bacteriophage lambda integrase Int, is to use homologous recombination techniques as mentioned above.
To facilitate selection for stable transfectants which have introduced recombination sequences) io into a desired target locus, a selection marker gene is co-introduced into the same target locus at the same time. This may be achieved, for example, if the recombination sequences) and a selection marker gene are co-located on the same vector or DNA segment, which is introduced into the target locus, e.g. by any method mentioned above (homologous recombination, random integration, etc.). As the expression level of the selection marker gene correlates with the is transcription activity at the integration site, cells showing a high expression level at site of integration, cell robustness, and good growth characteristics, e.g. in a bioreactor, can be identified very effectively. The level of expression of the selection marker gene can be determined by methods well known in the art, e.g. on the basis of either the amount of corresponding mRNA that is present in the cell, or the amount of polypeptide encoded by the ?o gene. For example, mRNA transcribed from the introduced gene sequence can be quantified by Northern blot hybridization, ribonuclease RNA protection, in situ hybridization to cellular RNA
or by PCR (see Sambrook et al., 1989; Ausubel et al., 1994, supra). Proteins encoded by a selected sequence can be quantified by various methods, e.g. by ELISA, by Western blotting, by radioimmunoassays, by immunoprecipitation, by assaying for the biological activity of the as protein, by immunostaining of the protein followed by FACS analysis, or by measuring the fluorescence signals of a fluorescent protein (see Sambrook et al., 1989;
Ausubel et al., 1994 updated, sicpra). By such a method excellent candidates of a production cell line for producing biopharmaceuticals may be obtained.
3o The integrated recombination sequences) (first recombination sequence(s)) allow integration of a further DNA molecule, e.g. a vector or DNA segment carrying at least one further recombination sequence (second recombination sequence) via sequence-specific recombination by a bacteriophage lambda integrase Int into a transcriptional active locus.
Preferably, that further DNA molecule comprising at least one second recombination sequence further comprises an expression cassette for the expression of at least one biopharmaceutically relevant gene of interest. Fox this, host cells, which comprise the first integrated recombination sequence, preferably integrated into the host cell genome at a transcriptional active locus, are tranfected with a DNA molecule comprising the second recombination sequence for a bacteriophage s lambda integrase Int, and are cultivated under conditions that allow sequence-specific recombination between the first and the second recombination sequence, preferably the integration of the DNA molecule comprising the second recombination sequence into the host cell genome comprising the first recombination sequence. First and second recombination sequences can be either attP, attB, attL, attR or any derivative thereof, which allows sequence-to specific recombination by a bacteriophage lambda integrase Int or any functional mutant thereof.
For example, if the first recombination sequence comprises attP or a derivative thereof second may comprises attP, attB, attL, attR or any derivative thereof.
Preferred is the method wherein the sequence-specific recombination is performed by Int, or by is Int and XIS, FIS and/or IHF. Most preferred is the method wherein the sequence-specific recombination is performed by Int or by Int and a XIS factor, or by Int and IHF, or by Int and XIS and IHF. Further preferred is the method wherein the sequence-specific recombination is performed by a modified Int, preferably the Int-h or Int-h/218. In this context, use of a modified Int together with XIS and/or IHF is also within the meaning of the present invention.
?o By this approach any DNA sequence(s), comprising a second recombination sequence for the bacteriophage lambda integrase Int is/are integrated into a known, well characterized and defined locus of the host cell. To select for cells where a sequence-specific recombination has occurred one can introduce, for example, a non-functional expression cassette comprising the selection ~s marker gene, e.g. without a promoter or promoter/enhancer or only part of the coding region of the gene. Only if sequence-specific recombination has occurred, a complete and functional expression cassette with efficient expression of the selection marker gene will be generated, thus allowing for the selection of cells having integrated the gene of interest via sequence specific integration.
,o By the method of the present invention production cell lines are obtainable differ from the host cell merely by the identity of DNA sequences integrated at a defined site of integration, e.g. into a genomic locus. Due to less genetic variation between different cell clones a more generic process for the development of production cell lines can be used, thus reducing time and capacity for clone selection and development of an optimized production process. The production cell lines may be used for the manufacturing of the desired polypeptide(s).
A further aspect of the present invention therefore relates to a method of expressing at least one s gene of interest encoding one or more desired polypeptide(s)/products(s) in a eukaroytic cell, comprising a) . introducing a first DNA comprising an attB, attP, attL or attR sequence or a derivative thereof into a cell;
b) introducing a second DNA comprising an attB, attP, attL or attR sequence or a derivative ~o thereof, and at least one gene of interest into a cell, c) contacting said cell with a bacteriophage lambda integrase Int;
d) performing the sequence-specific recombination by a bacteriophage lambda integrase Int, wherein the second DNA is integrated into the first DNA; and e) cultivating said cell under conditions, wherein the genes) of interest is/are being i s expressed.
Preferred is that method, wherein if said first DNA sequence comprises an attB
sequence or a derivative thereof said second sequence comprises an attB, attL or attR
sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attP sequence or a derivative thereof ?o said second sequence comprises an attP, attL or attR sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attL sequence or a derivative thereof said second sequence comprises an attB, attP or attL sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attR sequence or a derivative thereof said second sequence comprises an attB, attP or attR sequence or a derivative thereof.
zs In a more preferred embodiment of that method, the first DNA has been integrated into the genome, an artificial-/minichromosome or an episomal element of a host cell, preferably at sites showing high transcription activity, before said second DNA is introduced into said cell.
3o The present invention also relates to a method of expressing at least one or more genes of interest in a host cell, wherein said host cell comprises one attB, attP, attL or attR
sequence or a derivative thereof integrated into the genome of said host cell, comprising a) introducing a DNA comprising an attB, attP, attL or attR sequence or a derivative thereof, and at least one gene of interest into said cell, b) contacting said cell with a bacteriophage lambda integrase Int;
c) performing the sequence-specific recombination by a bacteriophage lambda integrase Int, wherein the second DNA is integrated into the first DNA;
d) cultivating said cell under conditions, wherein the genes) of interest is/are being expressed.
The method may be carried out not only with an attB, attP, attL or attR
sequence or a derivative thereof being integrated into a host cell genome by genetic engineering of said cell, but also with naturally occurring recombination sequence of the genome, e.g. the attH-site described in 5 (5'-GAAATTCTTTTTGATACTAACTTGTGT-3'; SEQ ID N0:17) or any other Io recombination sequence, which allows sequence-specific recombination mediated by an Int or any functional mutant thereof.
Those methods are preferred, wherein said sequence-specific recombination is performed by Int or by Int and a XIS factor, or by Int and IHF, or by Int and XIS and IHF.
Further preferred is the is method wherein the sequence-specific recombination is performed by a modified Int, preferably the Int-h or Int-h/218. In this context, use of a modified Int together with XIS and/or IHF is also within the meaning of the present invention. Int, Int-h or Int-h/218, XIS, and/or IHF may be added to the cell in purified form or being co-expressed by said host cell, wherein the sequence-specific recombination is being performed.
zo A further embodiment of the above mentioned methods relates to a method, wherein the polypeptide(s)/product(s) which is/are encoded by the genes) of interest and being expressed in said host cell, is/are isolated from the cells or the cell culture supernatant, if secreted into the culture medium.
zs Said production cells are cultivated preferentially in semm-free medium and in suspension culture under conditions which are favorable for the expression of the desired genes) and isolating the protein of interest from the cells and/or the cell culture supernatant. Preferably the protein of interest is recovered from the culture medium as a secreted polypeptide, or it can be 3o recovered from host cell lysates if expressed without a secretory signal.
It is necessary to purifiy the protein of interest from other recombinant proteins, host cell proteins and contaminants in a way that substantially homogenous preparations of the protein of interest are obtained. As a first step often cells and/or particulate cell debris are removed from the culture medium or lysate. The product of interest thereafter is purified from contaminant soluble proteins, polypeptides and nucleic acids, for example, by fractionation on immunoaffinity or ion-exchange columns, ethanol precipitation, reverse phase HPLC, Sephadex chromatography on silica or on a cation exchange resin such as DEAE. In general, methods teaching a skilled persion how to purify a heterologous protein expressed by host cells, are well known in the art. Such methods are for example s described by Harris et al. (1995) Protein Purification: A Practical Approach, Pickwood and Hames, eds., IRL Press and Scopes, R. (1988) Protein Purification, Springer Verlag. Therefore, the aforementioned method of expressing at least one gene of interest may be added by an additional purification step, wherein the desired polypeptide is purified from the host cells or from cell culture if secreted into the culture medium.
~o The method of the present invention may be performed in all eukaryotic cells.
Cells and cell lines may be present e.g. in a cell culture and include but are not limited to eukaryotic cells, such as yeast, plant, insect or mammalian cells. For example, the cells may be oocytes, embryonic stem cells, hematopoietic stem cells or any type of differentiated cells. A
method is preferred ~s wherein the eukaryotic cell is a mammalian cell. More preferred is a method wherein the mammalian cell is a human, simian, marine, rat, rabbit, hamster, goat, bovine, sheep or pig cell.
Preferred cell lines or "host cells" for the production of biopharmaceuticals are human, mice, rat, monkey, or rodent cell lines. More preferred are hamster cells, preferably BHK21, BHK TK , CHO, CHO-K1, CHO-DUKX, CHO-DUKX B1, and CHO-DG44 cells or the zo derivatives/progenies of any of such cell lines. Particularly preferred are CHO-DG44, CHO-DLTKX, CHO-K1 and BHK21, and even more preferred CHO-DG44 and CHO-D>JKX cells.
Furthermore, marine myeloma cells, preferably NSO and Sp2/0 cells or the derivatives/progenies of any of such cell lines are also known as production cell lines.
zs Host cells are most preferred, when being established, adapted, and completely cultivated under semm free conditions, and optionally in media which are free of any protein/peptide of animal origin. Commercially available media such as Ham's F12 (Sigma, Deisenhofen, Germany), RPMI-1640 (Sigma), Dulbecco's Modified Eagle's Medium (DMEM; Sigma), Minimal Essential Medium (MEM; Sigma), Iscove's Modified Dulbecco's Medium (IMDM;
Sigma), CD-3o CHO (Invitrogen, Carlsbad, CA), CHO-S-SFMII (Invtirogen), serum-free CHO
Medium (Sigma), and protein-free CHO Medium (Sigma) are exemplary appropriate nutrient solutions.
Any of the media may be supplemented as necessary with a variety of compounds examples of which are hormones and/or other growth factors (such as insulin, transferrin, epidermal growth factor, insulin like growth factor), salts (such as sodium chloride, calcium, magnesium, phosphate), buffers (such as HEPES), nucleosides (such as adenosine, thymidine), glutamine, glucose or other equivalent energy sources, antibiotics, trace elements. Any other necessary supplements may also be included at appropriate concentrations that would be known to those skilled in the art. 1n the present invention the use of semm-free medium is preferred, but media s supplemented with a suitable amount of serum can also be used for the cultivation of host cells.
For the growth and selection of genetically modified cells expressing a selectable gene a suitable selection agent is added to the culture medium.
"Desired proteins/polypeptides" or "proteins/polypeptides of interest" of the invention are for io example, but not limited to insulin, insulin-like growth factor, hGH, tPA, cytokines, such as interleukines (IL), e.g. IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IL-16, IL-17, IL-18, interferon (IFN) alpha, IFN beta, IFN gamma, IFN
omega or IFN tau, tumor necrosisfactor (TNF), such as TNF alpha and TNF beta, TNF gamma, TRAIL; G-CSF, GM-CSF, M-CSF, MCP-1 and VEGF. Also included is the production of is erythropoietin or any other hormone growth factors and any other polypeptides that can serve as agonists or antagonists and/or have therapeutic or diagnostic use. The method according to the invention can also be advantageously used for production of antibodies, such as monoclonal, polyclonal, multispecific and single chain antibodies, or fragments thereof, e.g. Fab, Fab', F(ab')2, Fc and Fc'-fragments, heavy and light immunoglobulin chains and their constant, zo variable or hypervariable region as well as Fv- and Fd-fragments (Chamov, S.M. et al. (1999) Antibody Fusion Proteins, Wiley-Liss Inc.) Fab fragments (Fragment antigen-binding = Fab) consist of the variable regions of both chains which are held together by the adjacent constant region. These may be formed by protease zs digestion, e.g. with papain, from conventional antibodies, but similar Fab fragments may also be produced in the mean time by genetic engineering. Further antibody fragments include F(ab')2 fragments, which may be prepared by proteolytic cleaving with pepsin.
Using genetic engineering methods it is possible to produce shortened antibody fragments which 3o consist only of the variable regions of the heavy (VH) and of the light chain (VL). These are referred to as Fv fragments (Fragment variable = fragment of the variable part). Since these Fv-fragments lack the covalent bonding of the two chains by the cysteines of the constant chains, the Fv fragments are often stabilised. It is advantageous to link the variable regions of the heavy and of the light chain by a short peptide fragment, e.g. of 10 to 30 amino acids, preferably 1 S amino acids. In this way a single peptide strand is obtained consisting of VH and VL, linked by a peptide linker. An antibody protein of this kind is known as a single-chain-Fv (scFv). Examples of scFv-antibody proteins of this kind known from the prior art are described in Huston C. et al.
(1988) Proc. Natl. Acad. Sci. USA, 16, pp. 5879.
s In recent years, various strategies have been developed for preparing scFv as a multimeric derivative. This is intended to lead, in particular, to recombinant antibodies with improved pharmacokinetic and biodistribution properties as well as with increased binding avidity. In order to achieve multimerisation of the scFv, scFv were prepared as fusion proteins with io multimerisation domains. The multimerisation domains may be, e.g. the CH3 region of an IgG or coiled coil stmcture (helix structures) such as Leucin-zipper domains.
However, there are also strategies in which the interaction between the VH/VL regions of the scFv are used for the.
multimerisation (e.g. dia-, tri- and pentabodies). By diabody the skilled person means a bivalent homodimeric scFv derivative. The shortening of the Linker in an scFv molecule to 5- 10 amino is acids leads to the formation of homodimers in which an inter-chain VH/VL-superimposition takes place. Diabodies may additionally be stabilised by the incorporation of disulphide bridges.
Examples of diabody-antibody proteins from the prior art can be found in Perisic, O. et al. (1994) Structure, 2, pp. 1217.
Zo By minibody the skilled person means a bivalent, homodimeric scFv derivative. It consists of a fusion protein which contains the CH3 region of an immunoglobulin, preferably IgG, most preferably IgGl as the dimerisation region which is connected to the scFv via a Hinge region (e.g. also from IgGl) and a Linker region. Examples of minibody-antibody proteins from the prior art can be found in Hu, S. et al. (1996) Cancer Res., 56, pp. 3055.
Zs By triabody the skilled person means a: trivalent homotrimeric scFv derivative (Kortt A.A. et al.
(1997) Protein Engineering, l0,pp. 423). ScFv derivatives wherein VH-VL are fused directly without a linker sequence lead to the formation of trimers.
~o The skilled person will also be familiar with so-called miniantibodies which have a bi-, tri- or tetravalent structure and are derived from scFv. The multimerisation is carried out by di-, tri- or tetrameric coiled coil structures (Pack, P. et al. (1993) Biotechnology, 11, pp. 1271; Lovejoy, B.
et al. (1993) Science,. 259, pp. 1288; Pack, P. et al. (1995) J. Mol. Biol., 246, pp. 28). In a preferred embodiment of the present invention, the gene of interest is encoded for any of those desired polypeptides mentioned above, preferably for a monoclonal antibody, a derivative or fragment thereof.
In order to perform any embodiment of the present invention, an integrase has to act on the s recombination sequences. The integrase or the integrase gene and/or a co-factor or a co-factor gene, e.g. the XIS factor or the XIS factor gene and/or IHF or the IHF gene may be present in the eukaryotic cell already before introducing the first and second recombination sequence. They may also be introduced between the introduction of the first and second recombination sequence or after the introduction of the first and second recombination sequence.
Purification of ~o recombinase and host factor proteins has been described in the art (Hash, H.A. (1983) Methods of Enzymology, 100, pp. 210; Filutowicz, M. et al. (1994) Gene, 147, pp.149).
In cases when they are not known, cell extracts can be used or the enzymes can be partially purified using procedures described for example for Int or Cre recombinase. The purified proteins can be introduced into a cell by standard techniques, for example by means of injection or is microinjection or by means of a lipofection as described in example 2 of the present invention for IHF. The integrase used for the sequence-specific recombination is preferably expressed in the cell in which the reaction is earned out. For that purpose a third DNA
sequence comprising an integrase gene is introduced into the cells. If the sequence specific recombination is earned OLIt e.g. with ccttLlc~ttR a XIS factor gene (fourth DNA sequence) may be introduced into the cells Zo in addition. Most preferred is a method wherein the third and/or fourth DNA
sequence is integrated into the eukaryotic genome of the cell or an artificial-/minichromosome via homologous recombination or randomly. Further preferred is a method wherein the third and/or fourth DNA sequence comprises regulatory sequences resulting in a spatial and/or temporal expression of the integrase gene and/or XIS factor gene.
Zs In this case a spatial expression means that the Int recombinase, the XIS
factor, and/or the IHF
factor, respectively, is expressed only in a particular cell type by use of cell type specific promotors and catalyzes the recombination only in these cells, e.g. in liver cells, kidney cells, nerve cells or cells of the immune system. In the regulation of the integrase/XIS factor/IHF
3o expression a temporal expression may be achieved by means of promotors being active from or in a particular developmental stage or at a particular point of time in an adult organism.
Furthermore, the temporal expression may be achieved by use of inducible promotors, e.g. by interferon or tetracycline depended promotors; see review of Miiller, U.
(1999) Mech.
Develop.,82, pp. 3.
The integrase used in the method of the present invention may be both the wild-type and the modified (mutated) integrase of the bacteriophage lambda. As the wild-type integrase is only able to perform the recombination reaction at a high efficiency with a co-factor, namely IHF, it is s preferred to use a modified integrase in the method of the present invention. If the wild-type integrase is used in the method of the present invention, IHF may be needed in addition to achieve a stimulation of the recombination reaction. The modified integrase is modified such that said integrase may carry out the recombination reaction without IHF or other host factors such as XIS and FIS. For example, a recombination reaction between attL~and attR
sequences may be io preformed by a modified Int without the addition of a host factor (see results 5.1 and Figure 2C
and 2D).
The generation of modified polypeptides and screening for the desired activity is state of the art and may be performed easily; Erlich, H. (1989) PCR Technology. Stockton Press.
For example, is a nucleic acid sequence encoding for a modified integrase is intended to include any nucleic acid sequence that will be transcribed and translated into an integrase either in vitro on upon introduction of the encoding sequence into bacteria or eukaryotic cells. The modified integrase protein encoding sequences can be naturally occurring (by spontaneous mutation) or recombinantly engineered mutants and variants, tnmcated versions and fragments, functional Zo equivalents, derivatives, homologs and fusions of the naturally occurnng or wild-type proteins as long as the biological functional activity, meaning the recombinase activity, of the encoded polypeptide is maintained. Recombinase activity is maintained, when the modified recombinase has at least 50%, preferably at least 70%, more preferred at least 90%, most preferred at least 100% of the activity of the wild-type integrase Int, measured in a co-transfection assay with Zs substrate vectors and expression vectors as described in results 5.1 of the present invention or in Example 3 of WO 01/16345. Certain amino acid sequence substitutions can be made in an integrase or its underlying nucleic acid coding sequence and a protein can be obtained with like properties. Amino acid substitutions that provide functionally equivalent integrase polypeptides by use of the hydropathic index of amino acids (Kyte, J. et al. (1982) J. Mol.
Biol., 157, pp. 105) 3o can be prepared by performing site-specific mutagenesis or polymerase chain reaction mediated mutagenesis on its underlying nucleic acid sequence. In the present invention mutants or modified integrases are preferred, which show in comparison to a wild-type protein improved recombinase activity/recombination efficiency or an recombination activity independent of one or more host factors. "Wild-type protein" means a complete, non truncated, non modified, naturally occurring gene of the encoding polypeptide. Two Int mutants preferred are bacteriophage lambda integrases designated as Int-h and Int-h/218; Miller et al. (1980) Cell, 20, pp. 721; Christ, N. and Droge, P. (1999) J. Mol. Biol., 288, pp. 825. Int-h includes a lysine residue instead of a glutamate residue at position 174 in comparison to wild-type Int. Int-h/218 s includes a further lysine residue instead of a glutamate residue at position 218 and was generated by PCR mutagenesis of the Int-h gene. Said mutants may catalyze the recombination between c~ttBlattB, attPlattP, attLlattL or attRlattR and all other possible combinations, e.g. attPlattR, ccttLlattP, attLlattB, or attRlattB or the derivatives thereof without the co-factors IHF, XIS, and/or FIS and negative supercoiling in E. coli, in eukaryotic cells, and in vitro, i.e. with purified ~o substrates in a reaction tube. An improvement of the efficiency of the recombination may be achieved with a co-factor, e.g. FIS. The mutant Int-h/218 is preferred, because this mutant catalyze the recombination reaction with increased efficiency.
If the first reaction leads to an excision and the used two recombination sequences are identical, ~s e.g, attPlP, the resulting recombination sequences after the recombination will be identical to those on the substrate, e.g. here two attP sequences. If however, the two partner sequences are different, e.g. attPlR, the recombination reaction will generate hybrid recombination sequences which comprise one functional half from one sequence (e.g. attP) and one half from the other (ccttR). A functional half recombination site can be defined as the sequence either 5' or 3' form zo the overlap, whereby the overlap is considered, in each case, as a part of a funtional half site. If the respective overlap region of the used recombination sequences is identical the excision reaction may be performed with any recombination sequence according to the invention.
Additionally, the overlap region designates the orientation of the recombination sequences to each other also, i.e. inverted or direct. The reaction may be performed with wilt-type Int with zs low efficiency only, however, the addition of IHF or in the absence of IHF
the presence of arm binding sites) in addition to the core binding site stimulates and increases the efficiency. The reaction may be performed without any cofactor by a modified Int.
Furthermore, a method is preferred wherein a further DNA sequence comprising a Xis factor 3o gene is introduced into the cells. Most preferred is a method wherein the further DNA sequence further comprises a regulatory DNA sequence giving rise to a spatial and/or temporal expression of the Xis factor gene.
For example, after successful integrative intramolecular recombination (inversion) by means of Int leading to the activation/inactivation of a gene in a particular cell type said gene may be inactivated or activated at a later point of time again by means of the induced spatial and/or temporal expression of XIS with the simultaneously expression of Int.
s Furthermore, the invention relates to the use of any recombination sequences or the derivative thereof, e.g. to the derivative of attP as specified in SEQ ID NO: 2 in a sequence specific recombination of DNA in eukaryotic cells. The eukaryotic cell may be present in a cell aggregate of an organism, e.g. a mammal, having no integrase or Xis factor in its cells.
Said organism may be used for breeding with other organisms having in their cells the integrase or the Xis factor so io that off springs are generated wherein the sequence specific recombination is performed in cells of said off springs. Thus, the invention relates also to the use of an integrase or an integrase gene and a Xis factor or a Xis factor gene and an IHF factor or an IHF factor gene in a sequence .
specific recombination in eukaryotic cells. Furthermore, the present invention relates to eukaryotic cells and cell lines in which the method of the present invention was performed, is wherein said cells or cell lines are obtained after performing the method of the present invention.
The practice of the present invention will employ, unless otherwise indicated, conventional techniques of cell biology, molecular biology, cell culture, immunology and the like which are in the skill of one in the art. These techniques are fully disclosed in the current literature. See e.g.
zo Sambrook et al., Molecular Cloning: A Laboratory Manual, 2°'~ Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989); Ausubel et al., Current Protocols in Molecular Biology (1987, updated); Brown ed., Essential Molecular Biology, IRL
Press (1991);
Goeddel ed., Gene Expression Technology, Academic Press (1991); Bothwell et al. eds., Methods for Cloning and Analysis of Eukaryotic Genes, Bartlett Publ. (1990);
Wu et al., eds., zs Recombinant DNA Methodology, Academic Press (1989); Kriegler, Gene Transfer and Expression, Stockton Press (1990); McPherson et al., PCR: A Practical Approach, IRL Press at Oxford University Press (1991); Gait ed., Oligonucleotide Synthesis (1984);
Miller & Calos eds., Gene Transfer Vectors for Mammalian Cells (1987); Butler ed., Mammalian Cell Biotechnology ( 1991 ); Pollard et al., eds., Animal Cell Culture, Humana Press ( 1990);
Freshney et al., eds., 3o Culture of Animal Cells, Alan R. Liss (1987); Studzinski, ed., Cell Growth and Apoptosis, A
Practical Approach, IRL Press at Oxford University Presss (1995); Melamed et al., eds., Flow Cytometry and Sorting, Wiley-Liss (1990); Current Protocols in Cytometry, John Wiley & Sons, Inc. (updated); Wirth & Hauser, Genetic Engineering of Animals Cells, in:
Biotechnology Vol.
2, Piihler ed., VCH, Weinheim 663-744; the series Methods of Enzymology (Academic Press, Inc.), and Harlow et al., eds., Antibodies: A Laboratory Manual (1987).
All publications and patent applications mentioned in this specification are indicative of the level s of skill of those skilled in the art to which this invention pertains. All publications and patent applications cited herein are hereby incorporated by reference in their entirety in order to more fully describe the state of the art to which this invention pertains. The invention generally described above will be more readily understood by reference to the following examples, which are hereby included merely for the purpose of illustration of certain embodiments of the present io invention and are not intended to limit the invention in any way.
Examples Methods 1. Production of expression and substrate vectors ~s The construction of mock and Int expression vectors pCMV, pCMVSSInt, pCMVSSInt-h, and pCMVSSInt-h/218 have been described; Lorbach, E. et al. (2000) J. Mol.
Biol, 296, pp.1175. Int expression is driven by the human cytomegalovirus promoter.
Substrate vectors used in intramolecular recombination assays, containing attBlattP (p~,IR) or Zo attLlattR (p7~ER) as direct repeats, are derivatives of pGEM'~4Z (Promega).
p~,IR was constricted by inserting attB as double-stranded oligonucleotide into CIaI/EcoRI-cleaved pPGKneo. This vectors is a derivative of pPGKSSInt-h, in which the Int-h gene was replaced by a neomycin gene (neo) using PstIlXbaI. The CMV promoter plus a hybrid intron was generated by PCR using pCMVSSInt as template and cloned into the KpnI/CIaI-cleaved, zs ~zttB-containing pPGKneo vector. This CMV-attB-neo-expression cassette was then cloned by PCR into BamHI-cleaved pGEM~4Z. The attP site, containing an A-to-C
substitution in the P'-arm which deletes a translational stop signal, was generated by assembly PCR using primers (attP01) 5'-GTCACTATCAGTCAAAATACAATCA-3', (SEQ ID NO: 3).
30 (attP02) 5'-TGATTGTATTTTGACTGATAGTGAC-3', (SEQ ID NO: 4) (PFP-NsiI) 5'-CCAATGCATCCTCTGTTACAGGTCACTAATAC-3', (SEQ ID NO: S) and (P'RP-EcoRV-NotI) 5'-ATAAGAATGCGGCCGCAGATATCAGG
GAGTGGGACAAA.ATTGAA-3' (SEQ )D NO: 6).
31;1 pGFPattBlattP was used as template (Lorbach, E. et al. (2000) , supra). The PCR fragment was cleaved with NsiI and NotI and ligated to the 3'-end of a BamHI/PstI-fragment containing a transcriptional stop cassette, which was generated from pBS302 (Gibco/BRL).
The GFP
gene and the polyA signal was cloned by PCR using pCMVSSGFP (a derivative of s pCMVSSInt-h, in which the Int-h gene is replaced by eGFP using PstIlXbaI).
The GFP-containing PCR fragment was cleaved with NotI and XbaI and was then ligated together with the BamHIlNotI-cleaved transcriptional stop/attP fragment into the BamHIlXbaI-cleaved vector already containing the CMV promoter, attB, and the neo expression cassette. p7~ER
was constricted as p7~IR, except that attL was generated by PCR using pGFPattLlattR
to (Lorbach, E. et al. (2000), supra) as template, and was cloned into the CIaI/EcoRI-cleaved pPGKneo. The attR site was generated by PCR using pGFPattLlattR as template, and the product was cleaved with NsiI and NotI.
Substrate vectors for intermolecular recombination assays which contain the CMV promoter ~ s in front of different attachment sites: pCMVattPmut contains three G-to-C
substitutions in the P-arm. These changes were necessary to eliminate ATG start codons that would prevent GFP
expression after recombination. The substitutions are outside of protein binding sites in c~ttP
and were introduced by assembly PCR. First, two overlapping PCR products were generated, one with primer pair c~ttP-ATC-1/attP-2 and one with nttP-ATC-3/czttP-4.
pGFPattBlattP was zo used as template. PCR products were gel-purified and used as templates for PCR with primers attP-PstI and attP-XbaI. The resulting product was digested with PstI and XbuI, and cloned into pCMVSSInt: The primer sequences for assembly PCR are:
(attP-ATC-1) 5'-TTTGGATAAAAAACAGACTAGATAATACTGTAAAACA
CAAGATATGCAGTCACTA-3', (SEQ ID NO: 7) zs (attP-2) 5'-TAACGCTTACAATTTACGCGT-3', (SEQ ID NO: 8) (attP-ATC-3) 5'-CTGCATATCTTGTGTTTTACAGTATTATCTAGTCTG
TTTTTTATCCAAAATCTAA-3', (SEQ ID NO: 9) (attP-4) 5'CTGGACGTAGCCTTCGGGCATGGC-3', (SEQ ID NO: 10) (attP-PstI) 5'-GACTGCTGCAGCTCTGTTACAGGTCAC-3', (SEQ ID NO: 11) 30 (attP-XbaI) S'-GACTGTCTAGAGAAATCAAATAATGAT-3' (SEQ ID NO: 12).
pCMVattB was generated by inserting attB as double-stranded oligonucleotide into PstI/XbaI-cleaved pCMVattPmut. pCMVattL was generated by PCR using p7~ER as template for attL, which was introduced into PstI/XbaI-cleaved pCMVattPmut.
Vectors which contain a transcriptional stop signal and an att site placed in front of a promoterless GFP gene were constructed as follows: pWSattBGFP was generated by first deleting a part of the hygromycin gene from pTKHyg (Clontech) using AvaI and NdeI. The s vector backbone was ligated after the sticky ends were made blunt by Klenow polymerise. An attB-GFP fragment, generated by PCR, was cloned into MfeI and HindIII sites, thereby creating a new NheI site 5' of attB. Finally, the transcriptional stop sequence was inserted through.
restriction with EcoRI and NheI. pWSc~ttRGFP was generated by isolating the BczmHIlNotI
transcriptional stop-attR fragment from p~,ER, which was inserted into pWSattBGFP cleaved io with the same enzymes. pWSattPGFP was generated by PCR of the ~ttP site using pGFPattPlattB as template, which was inserted into pWSattBGFP cleaved with EcoRIlNotI thus replacing attB. Plasmids were isolated from E. coli strain XL1-Blue using affinity chromatography (Qiagen). The nucleotide composition of relevant genetic elements was verified by DNA sequencing using the fluorescence-based 373A system (Applied Biosystems).
is 2. Cell culture, recombination assays, and flow cytomery HeLa cells were cultured in Dulbecco's modified eagle medium (DMEM) supplemented with 10% fetal calf serum, streptomycin [0,1 mg/ml] and penicillin [100 U/ml].
Cells were passaged twice before transfection.
zo Typical recombination assays were performed as follows. Cells were harvested, washed with PBS and resuspended in RPMI 1640 without L-glutamine and phenol red (Life Technologies).
A total of 60 pg of expression and substrate vectors at a molar ratio of 1:1 were then introduced into approximately 1 x 107 cells at 300V and 960pF using a Gene pulser (Bio-Rad). After zs electroporation, cells were plated in an appropriate dilution on 10 cm dishes. A single-cell suspension was prepared at 24, 48, and 72 hrs after transfection. Dead cells were excluded from the analysis by staining with 7-amino-actinomycin D (Sigma), and cells were analyzed by FACScalibur (Becton Dickinson). FACS data were analyzed with CellQuestT~'' software. The transfection efficiencies for intermolecular recombination assays were determined for each 3o experiment by co-transfecting 40 pg pCMV with 20 pg pEGFP-C1 (Clontech);
those for intramolecular recombination were determined with 30 pg pCMV and 30 pg pEGFP-C
1.
Experiments involving purified IHF were performed by introducing first 30 pg of Int expression vectors to approximately 6 x 106 cells via electroporation as described above. After 3 to 4 hrs, about 1 x 105 cells were transfected with 2 pg of substrate vectors for intramolecular recombination, or with a total of 2 pg of substrate vectors at a molar ratio of l:l for intermolecular recombination. Substrates were pre-incubated at room temperature with 2 ~g purified IHF (Lange-Gustafson BJ, Nash HA., Purification and properties of Int-h, a variant s protein involved in site-specific recombination of bacteriophage lambda., J
Biol Chem. 1984 Oct 25;259(20):12724-32) in a low salt buffer (50 mM NaCI, 10 mM Tris-HC1, pH
8.0, 1 mM
EDTA) for at least 30 minutes. Transfection of IHF-DNA complexes was achieved with FuGene (Boehringer Mannheim) and the efficiencies were always in the range of 80%. Cells were analyzed by flow cytometry after additional 48 hrs as described above.
~o CHO-DG44/dhfr ~~ cells (Urlaub, G. et al., (1983), Cell, 33, pp. 405), grown permanently in suspension in the serum-free medium CHO-S-SFMII (Invitrogen, Carlsbad, CA) supplemented with hypoxanthine and thymidine (Invitrogen, Carlsbad, CA), are incubated in cell culture flasks at 37°C in a humidified atmosphere containing 5% COz. Cells are seeded at a concentration of 1-is 3x105 cells/mL in fresh medium every two to three days.
Stable transfections of CHO-DG44 cells are conducted using Lipofectamine Plus reagent (Invitrogen, Carlsbad, CA). Per transfection 6x105 exponentially growing cells in 0,8 mL
hypoxanthine/thymidine (HT)-supplemented CHO-S-SFMII medium are seeded in a well of a 6-well chamber. A total of 1 pg plasmid DNA , 4 ~uL Lipofectamine and 6 pL Plus reagent in a zo volume of 200 yL is used for each transfection and added to the cells, following the protocol of the manufacturer. After incubation for 3 hours 2 mL of HT-supplemented CHO-S-SFMII
medium is added. In the case of neomycin phosphotransferase-based selection the medium is replaced 2 days after transfection with CHO-S-SFMII medium, supplemented with HT and 400 yg/mL 6418 (Invitrogen), and the mixed cell populations are selected for 2 to 3 weeks with z> medium changes every 3 to 4 days. For the DHFR-based selection of stable transfected CHO-DG44 cells CHO-S-SFMII medium without hypoxanthine/thymidine is used. DHFR-based gene amplification is achieved by adding 5 - 2000 nM methotrexate (Sigma, Deisenhofen, Germany) as amplifying selection agent to the medium.
30 3. sICAM and MCP-1 ELISA
sICAM titers in supernatants of stable transfected CHO-DG44 cells are quantified by ELISA
with standard protocols (Ausubel, F.M. et al., (1994, updated) Current protocols in molecular biology. New York: Greene Publishing Associated and Wiley-Interscience) using two in house developed sICAM specific monoclonal antibodies (as described for example in US
patents No.
5,284,931 and 5,475,091), whereby one of the antibodies is a HRPO-conjugated antibody.
Purified sICAM protein is used as a standard. Samples are analyzes using a Spectra Fluor Plus reader (TECAN, Crailsheim, Germany).
s MCP-1 titers in supernatants of stable transfected CHO-DG44 cells are quantified by ELISA
using the OptEIA human MCP-1 set according to the manufacturer's protocol (BD
Biosciences Pharmingen, Heidelberg, Germany).
Example 1: Kinetics of intra- and intermolecular recombination reactions io We showed in our previous studies that mutant Int catalyzed intramolecular integrative and excisive recombination reactions in the absence of natural accessory factors in E. coli and in human cells (Christ, N. et al. (1999), sz~pra; Lorbach, E. et al. (2000), szcpra). However, an interesting question with respect to interactions of episomal DNA segments inside mammalian cells concerns the ability of mutant Int to perform intermolecular recombination, i.e. when two is recombination sites are located on different DNA molecules in traps. We compared therefore first intra- and intermolecular integrative recombination reactions.
Intramolecular recombination was tested with a substrate that contains attB
and attP as direct repeats flanking a transcriptional stop signal. This recombination cassette, in turn, is flanked by zo a CMV promoter and the coding region for GFP. Recombination between attB
and attP
generates hybrid sites attL and attR, and leads to excision of the stop signal. Subsequent expression of the GFP gene thus serves as reporter of recombination (Figure 2A, top).
Expression vectors for either Int, Int-h, or Int-h/218 were co-transfected with the substrate ~s vector into HeLa cells. The expression vector backbone (mock) was used as negative control.
Transfection efficiencies independently determined for each experiment were in the range of 95 to 98% (data not shown). FAGS analyses from 3 experiments show that both mutant Int efficiently catalyzed recombination, leading in some experiments to about 30%
GFP-expressing cells (Figure 2A, bottom). The nucleotide sequence of recombination products, determined 3o indirectly by DNA sequencing of PCR fragments, confirmed that the strand-transfer-reactions catalyzed by mutant Int generated the expected hybrid att sites (data not shown).
It is apparent that the double mutant Int-h/218 was more active than Int-h, whereas wild-type Int was almost inactive. The fraction of GFP-expressing cells increased during 48 hrs after transfection and remained steady for the next 24 hrs. The time course of the reactions also indicates that a majority of recombination events must have occurred within the first 24 hrs.
This correlates well with the time course of Int-h/218 expression in HeLa cells (data not shown).
Although we cannot exclude the possibility that a fraction of GFP-expressing cells resulted from s inter- instead of intramolecular integrative recombination, the data set can be used as a reference for our analysis of intermolecular recombination.
We analyzed intermolecular integrative recombination by placing attB and attP
on separate plasmids. Recombination translocates the CMV promoter to a position upstream of the GFP
~o gene (Figure 2B, top). Hence, only intermolecular recombination between attB and attP will generate GFP-expressing cells. FACS analyses after co-transfection of the two substrate vectors with Int expression vectors yielded results which are comparable to those generated with substrates for intramolecular recombination (Figure 2B, bottom). Again, the majority of recombination events must have occurred within the first 24 hrs after transfection and Int-h/218 ~s was more active than Int-h. Wild-type Int generated only a very small fraction of GFP-expressing cells. These results demonstrate that over a time course of 24 to 72 hrs, intermolecular integrative recombination by mutant Int is at least as efficient as the corresponding intramolecular reaction.
Zo The same experimental strategy was then employed to compare intra- and intermolecular excisive (attL x attR) recombination pathways. The results revealed again that intermolecular recombination by mutant Int was as efficient as intramolecular recombination (Figure 2C and D). The efficiency of excisive recombination reactions, however, was slightly reduced compared to integrative recombination. Recombination by wild-type Int was again barely zs detectable.
Example 2: DNA arm-binding sites in att are not required, but stimulate recombination The results so far show that mutant Int catalyzed integrative and excisive recombination on episomal substrates in a significant number of transfected cells. In contrast, recombination 3o activities of wild-type Int was barely detectable above background. Since excisive recombination by wild-type Int depends on the presence of protein co-factors IHF and XIS, but does not require negative DNA supercoiling, this result demonstrates that eukaryotic counterparts of these co-factors are lacking in human cells. Further, it is known that episomal substrates are topologically relaxed soon after transfection (Schwikardi et al. (2000) FEBS
Letters, 471, pp. 147). It appears, therefore, that mutant Int perform recombination without the formation of defined nucleoprotein complexes, such as the intasome assembled at attP. This raises the question of the functional role of DNA arm-binding sites in recombination. They were present in at least one of the partner att sites employed so far.
s In order to investigate this question, we used intermolecular recombination with pairs of substrate vectors containing attB or attP in various combinations (Figure 3A).
The fraction of GFP-expressing cells that results from recombination was determined by FAGS at 48 hrs after co-transfection with Int expression vectors. Transfection efficiencies were always above 90%
~o (data not shown). The results from 3 experiments show that intermolecular recombination between pairs of attP was as efficient as recombination between attB and attP
(Figure 3B).
However, only Int-h/218 utilized pairs of attB sites as substrate to a significant extent. The efficiency of this reaction was, on average, about four-fold reduced compared to reactions between attP and attP or attB and attP (Figure 3B) Hence, the fraction of GFP-expressing cells ~s that results from recombination between two attB sites dropped to a level of 4 to 5%. These results demonstrate that the presence of arm-type sequences in att sites is not required for recombination by Int-h/218, but significantly stimulates the reaction. This stimulatory effect is even more pronounced (about eight-fold) when Int-h was used. Farther, the residual recombination activity observed with wild-type Int appears highly dependent on the presence of zo arm binding sites.
Example 3: Recombination by wild-type Int is stimulated by transfected IHF
protein Efficient integrative recombination catalyzed by wild-type Int in vitro and in E. coli requires the protein co-factor IHF and supercoiling of attP. The apparent lack of either co-factor in as mammalian cells thus led us to investigate whether the residual recombination activity of wild type Int is augmented if purified IHF, pre-incubated with a supercoiled substrate, is co-introduced into HeLa cells. To test this possibility, we introduced first expression vectors for either wild-type Int or Int-h. At 3 to 4 hrs after electroporation, substrates for intra- or intermolecular recombination were incubated either with or without purified IHF. Protein-DNA
~o mixtures as well as protein-free control samples were then transfected using Fugene (Figure 4A). The fractions of GFP-expressing cells were compared after additional 48 hrs.
The results from three experiments show that intramolecular recombination by wild-type Int was stimulated, on average, up to five-fold due to the presence of IHF. The fraction of GFP-positive cells increased, for example, in one experiment from about 1% in the absence of IHF to 6% in its presence. The stimulatory effect on intermolecular recombination was also significant, bLlt less pronounced (about three-fold). At 48 hrs after transfection, the stimulation was specific for wild-type Int since the activity of Int-h was not affected. Importantly, controls showed that s transfection efficiencies were also not affected by the presence of IHF
protein (data not shown).
Example 4: Improved protein expression system based on sequence-specific recombination of gene of interest CHO-DG44 cells are stably transfected with a linearized first plasmid DNA
expressing the ~o fluorescent protein ZsGreenl from Zoc~nthus sp. (Clontech Laboratories Inc., Palo Alto, CA, U.S.A.) and the antibiotic resistance gene neomycin phosphotransferase (Figure S). In addition, either an attB or an attP recombination sequence (natural or modified sequence or derivative thereof) is placed between the gene for the fluorescent protein and its promoter. The first plasmid DNA, linearized by using a restriction enzyme with a single restriction site outside the is transcription units for both selection markers, is introduced by random integration into the genome of CHO-DG44. Cells with a successful stable random integration of the first plasmid DNA are positively selected for by cultivation in the presence of the antibiotic 6418. Within the heterogeneous pool of stable transfectants cells with a high transcription activity at the integration site of the first plasmid DNA can be isolated simply by fluorescence activated cell zo sorting (FACS) based on the expression level of the introduced fluorescent protein ZsGreenl.
Cells with the highest ZsGreenl fluorescence are sorted and placed as single cells into the wells of a 96 well plate. The resulting cell subclones are expanded and tested by restriction endoncuclease mapping in Southern blot analysis for integration of a single plasmid sequence in a single chromosomal site. For the latter genomic DNA of the cell subclones is digested with zs restriction enzymes with no, one and multiple restriction sites within the introduced first plasmid DNA, respectively, electrophoresed on a 0.8% agarose gel and transferred to positvely charged nylon membrane (Amersham Biosciences, Freiburg, Germany).
Hybridization is performed overnight at 65°C in a hybridization oven with a random-primed FITC-dUTP labeled probe consisting of the ZsGreenl gene according to the protocol of the Gene Images random 3o prime labelling module (Amersham Biosciences). Candidate subclones with a single copy insert are subsequently tested in small scale bioreactors for their performance in a production-mimicking fedbatch process. Besides high expression levels during the complete production phase, monitored by measuring the ZsGreenl fluorescence, other important parameters such as high viability at high cell density, metabolism and reproducible performance are taken into account. This way a suitable host cell with an integrated first att recombination sequence is identified. To generate a production cell line producing a biopharmaceutical by sequence-specific recombination this host cell is transfected with a second plasmid DNA
(see Figure 5) containing a promoterless dihydrofolate reductase gene preceded by either an attB or an attP
s recombination sequence (natural or modified sequence or derivative thereof) and a complete transcription unit for the expression of the gene of interest, for example the common cold therapeutic sICAM (soluble intercellular adhesion molecule 1) or the human monocyte chemoattractant protein-1 (MCP-1). In addition the vector pCMVSSInt-h/218 expressing the mutated (modified) bacteriophage lambda integrase is co-transfected. After transfection, ~o transient expression of Int-h/218 is sufficient to perform the sequence-specific intermolecular recombination between the first att recombination site (either attP or attB) located at a preferred transcriptional active locus within the host cell genome and the second att recombination site (either attP or attB) on the introduced second DNA plasmid. To select for cells where a sequence-specific recombination between attP and attP, attP and attB or attB
and attB has ~s occurred, depending on the choice of the recombination sequence on the first and second DNA
plasmid, transfected cells are transferred and cultivated in CHO-S-SFMII
medium without the supplements hypoxanthin and thymidine. Only correct targeting results in cells surviving the selection by placing via recombination the promoterless dhfr-marker gene with an upstream att recombination site on the second DNA plasmid under the control of the promoter sequence of zo the ZsGreenl gene with a downstream att recombination site, thus allowing for the efficient expression of the dhfr selection marker gene. At the same time the functional expression cassette of the ZsGreenl gene is interrupted leaving behind a promoterless ZsGreenl gene. Thus cells do not express a fluorescent protein any longer. The non-fluorescing cells are identified and isolated by FACS providing a means to detect the cells producing the protein of interest. In zs addition, sequence-specific integration is verified by Southern Blot and PCR analysis with primers located in the sequences flanking the att sites before and after site-specific recombination followed by subsequent DNA sequencing. Expression of the protein of interest, sICAM or MCP-1, is assayed by ELISA.
The use of dhfr as marker gene for the generation of production cell lines offers not only the 3o advantage of positive selection but also the possibility to increase the productivity of the cell by methotrexate-induced DHFR-based gene amplification even further. This is achieved by supplementing the hypoxanthin/thymidine-free cultivation medium CHO-S-SFMII
with increasing amounts of methotrexate.
SEQUENCE LISTING
<110> BOEHRINGER INGELHEIM PHARMA GmbH & Co. KG
Droge, Peter <120> Sequence specific DNA recombination in eukaryotic cells <130> DRO-003 PCT
<140> unknown <141> 2003-11-28 <150> CA 2,413,175 <151> 2002-11-28 <150> US 10/310,695 <151> 2002-12-05 <160> 17 <170> PatentIn version 3.1 <210> 1 <211> 10 <212> DNA
<213> Artificial Sequence <220>
<223> Consensus sequence for Int binding-site <220>
<221> misc feature <222> (1)..(1) <223> c or a <400> 1 magtcactat 10 <210> 2 <211> 243 <212> DNA
<213> Artificial Sequence <220>
<223> attP derivative <400>
tctgttacaggtcactaataccatctaagtagttgattcatagtgactgcatatcttgtg 60 ttttacagtattatctagtctgttttttatccaaaatctaatttaatatattgatattta 120 tatcattttacgtttctcgttcagcttttttatactaagttggcattataaaaaagcatt 180 gcttatcaatttgttgcaacgaacaggtcactatcagtcaaaataaaatcattatttgat 240 ttc 243 <210> 3 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 3 gtcactatca gtcaaaatac aatca 25 <210> 4 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 4 tgattgtatt ttgactgata gtgac 25 <210> 5 <211> 32 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 5 ccaatgcatc ctctgttaca ggtcactaat ac 32 <210> 6 <211> 44 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 6 ataagaatgc ggccgcagat atcagggagt gggacaaaat tgaa 44 <210> 7 <211> 55 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 7 tttggataaa aaacagacta gataatactg taaaacacaa gatatgcagt cacta 55 <210> 8 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 8 taacgcttac aatttacgcg t 21 <210> 9 <211> 55 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 9 ctgcatatct tgtgttttac agtattatct agtctgtttt ttatccaaaa tctaa 55 <210> 10 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 10 ctggacgtag ccttcgggca tggc 24 <210> 11 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 11 gactgctgca gctctgttac aggtcac 27 <210> 12 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Primer <400> 12 gactgtctag agaaatcaaa taatgat 27 <210> 13 <211> 21 <212> DNA
<213> Escherichia coli <400> 13 ctgctttttt atactaactt g 21 <210> 14 <211> 243 <212> DNA
<213> Bacteriophage lambda <400>
tctgttacaggtcactaataccatctaagtagttgattcatagtgactgcatatgttgtg60 ttttacagtattatgtagtctgttttttatgcaaaatctaatttaatatattgatattta120 tatcattttacgtttctcgttcagcttttttatactaagttggcattata-~aaaaagcatt180 gcttatcaatttgttgcaacgaacaggtcactatcagtcaaaataaaatcattatttgat240 ttc 243 <210> 15 <211> 102 <212> DNA
<213> Escherichia coli <400> 15 ctgctttttt atactaagtt ggcattataa aaaagcattg cttatcaatt tgttgcaacg 60 aacaggtcac tatcagtcaa aataaaatca ttatttgatt tc 102 <210> 16 <211> 162 <212> DNA
<213> Escherichia coli <400> 16 tctgttacag gtcactaata ccatctaagt agttgattca tagtgactgc atatgttgtg 60 ttttacagta ttatgtagtc tgttttttat gcaaaatcta atttaatata ttgatattta 120 tatcatttta cgtttctcgt tcagcttttt tatactaact tg 162 <210> 17 <211> 27 <212> DNA
<213> Homo sapiens <400> 17 gaaattcttt ttgatactaa cttgtgt 27
Claims (23)
1. A method of sequence specific recombination of DNA in a eukaryotic cell, comprising a) introducing a DNA comprising a first attB, attP, attL or attR sequence or a derivative thereof into a cell;
b) introducing a DNA comprising a second attB, attP, attL or attR sequence or a derivative thereof into a cell, wherein if said first DNA sequence comprises an attB
sequence or a derivative thereof said second sequence comprises an attB, attL
or attR
sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attP sequence or a derivative thereof said second sequence comprises an attP, attL or attR sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attL sequence or a derivative thereof said second sequence comprises an attB, attP or attL sequence or a derivative thereof, or wherein if said first DNA
sequence comprises an attR sequence or a derivative thereof said second sequence comprises an attB, attP or attR sequence or a derivative thereof; and c) performing the sequence specific recombination by a bacteriophage lambda integrase Int.
    b) introducing a DNA comprising a second attB, attP, attL or attR sequence or a derivative thereof into a cell, wherein if said first DNA sequence comprises an attB
sequence or a derivative thereof said second sequence comprises an attB, attL
or attR
sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attP sequence or a derivative thereof said second sequence comprises an attP, attL or attR sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attL sequence or a derivative thereof said second sequence comprises an attB, attP or attL sequence or a derivative thereof, or wherein if said first DNA
sequence comprises an attR sequence or a derivative thereof said second sequence comprises an attB, attP or attR sequence or a derivative thereof; and c) performing the sequence specific recombination by a bacteriophage lambda integrase Int.
2. Method of sequence specific recombination of DNA in a eukaryotic cell having integrated the first att sequence or a derivative thereof according to claim 1 in an artificial-/minichromosome or the genome of said eukaryotic cell, comprising the steps b) and c) according to claim 1. 
    3. Method according to claim 1 or 2, wherein the first att sequence or a derivative thereof naturally occurs in the genome of said eukaryotic cell or is introduced previously. 
    4. Method of expressing at least one gene of interest encoding one or more desired polypeptide(s)/product(s) in a eukaryotic cell, comprising a) introducing a first DNA comprising an attB, attP, attL or attR sequence or a derivative thereof into a cell;
b) introducing a second DNA comprising an attB, attP, attL or attR sequence or a derivative thereof, and at least one gene of interest into a cell;
c) contacting said cell with a bacteriophage lambda integrase Int;
d) performing the sequence-specific recombination by a bacteriophage lambda integrase Int, wherein the second DNA is integrated into the first DNA; and e) cultivating said cell under conditions, wherein the gene(s) of interest is/are being expressed.
    b) introducing a second DNA comprising an attB, attP, attL or attR sequence or a derivative thereof, and at least one gene of interest into a cell;
c) contacting said cell with a bacteriophage lambda integrase Int;
d) performing the sequence-specific recombination by a bacteriophage lambda integrase Int, wherein the second DNA is integrated into the first DNA; and e) cultivating said cell under conditions, wherein the gene(s) of interest is/are being expressed.
5. Method according to claim 4, wherein if said first DNA sequence comprises an attB
sequence or a derivative thereof said second sequence comprises an attB, attL
or attR
sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attP
sequence or a derivative thereof said second sequence comprises an attP, attL
or attR
sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attL
sequence or a derivative thereof said second sequence comprises an attB, attP
or attL
sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attR
sequence or a derivative thereof said second sequence comprises an attB, attP
or attR
sequence or a derivative thereof.
    sequence or a derivative thereof said second sequence comprises an attB, attL
or attR
sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attP
sequence or a derivative thereof said second sequence comprises an attP, attL
or attR
sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attL
sequence or a derivative thereof said second sequence comprises an attB, attP
or attL
sequence or a derivative thereof, or wherein if said first DNA sequence comprises an attR
sequence or a derivative thereof said second sequence comprises an attB, attP
or attR
sequence or a derivative thereof.
6. Method according to claims 4 or 5, wherein, the first DNA has been integrated into the genome, an artificial-/minichromosome or an episomal element of the host cell, before said second DNA is introduced into said cell. 
    7. Method according to claim 6, wherein the first DNA has been integrated into the host cell genome. 
    8. A method of expressing at least one gene of interest encoding one or more desired polypeptide(s)/product(s) in a eukaryotic cell, having at least one naturally occurring recombination sequence which allows sequence-specific recombination mediated by an bacteriophage lambda Int or any functional mutant thereof, comprising a) introducing a DNA comprising an attB, attP, attL or attR sequence or a derivative thereof, and at least one gene of interest into said cell;
b) contacting said cell with a bacteriophage lambda integrase Int;
c) performing the sequence-specific recombination by a bacteriophage lambda integrase Int, between the recombination sequence naturally occurring in said cell and the DNA
introduced into said cell; and d) cultivating said cell under conditions, wherein the gene(s) of interest is/are being expressed.
    b) contacting said cell with a bacteriophage lambda integrase Int;
c) performing the sequence-specific recombination by a bacteriophage lambda integrase Int, between the recombination sequence naturally occurring in said cell and the DNA
introduced into said cell; and d) cultivating said cell under conditions, wherein the gene(s) of interest is/are being expressed.
9. Method of claim 8, wherein the naturally occurring sequence is attH. 
    10. Method according to anyone of claims 4 to 9, wherein said desired polypeptide(s)/product(s) is/are isolated from the host cell or the cell culture medium. 
    11. Method according to anyone of the preceding claims, wherein said derivative of the attP, attB, attL or attR sequence comprise one copy or more copies of the arm-binding site(s) for Int, or wherein said derivative comprise one copy or more copies of the core Int binding site(s), or wherein said derivative comprise of a combination of one copy or more copies of the arm-binding site(s) for Int and one copy or more copies of the core Int binding site(s). 
    12. Method according to claim 11, wherein said derivative of the attP, attB, attL or attR
sequence consist of one copy or more copies of the core Int binding site(s), or wherein said derivative consist of a combination of one copy or more copies of the arm-binding site(s) for Int and one copy or more copies of the core Int binding site(s).
    sequence consist of one copy or more copies of the core Int binding site(s), or wherein said derivative consist of a combination of one copy or more copies of the arm-binding site(s) for Int and one copy or more copies of the core Int binding site(s).
13. Method according to claim 11 or 12, wherein the core binding site consists of nine contiguous base pairs and relates to DNA sequences consisting for the B-sequence of the nucleotide sequence 5'-CTGCTTTTT-3', for the B'-sequence of the nucleotide sequence 5'-CAAGTTAGT-3' (reverse complementary strand), for the C-sequence of the nucleotide sequence 5'-CAGCTTTTT-3', and for the C'-sequence of the nucleotide sequence 5'-CAACTTAGT-3' (reverse complementary strand) in wild-type att sites or said sequences having functional nucleotide substitutions. 
    14. Method according to anyone of the preceding claims, wherein said sequence-specific recombination is performed by Int and one or more cofactors selected from XIS, FIS
and/or IHF.
    and/or IHF.
15. Method according to anyone of the preceding claims, wherein the sequence-specific recombination is performed by a modified Int, preferably the Int-h or Int-h/218. 
    16. The method of claim 14, wherein Int, Int-h or Int-h/218, XIS, FIS and/or IHF are added to the cell in purified form or are co-expressed by said host cell, wherein the sequence-specific recombination is performed. 
    17. Method according to anyone of the preceding claims, wherein additionally a third or a third and fourth DNA sequence comprising an Int gene, or an Int gene and one or more cofactor genes selected from XIS gene, FIS gene and/or IHF gene, respectively, is/are introduced into the cell. 
    18. Method according to anyone of the preceding claims, wherein neither XIS, FIS nor IHF is required, when sequence specific recombination is performed by a modified Int, preferably the Int-h or Int-h/218. 
    19. Method according to anyone of the preceding claims, said first and/or second recombination sequence further comprising a nucleic acid coding for a polypeptide of interest. 
    20. Method according to anyone of the preceding claims, wherein said polypeptide of interest is an antibody, hormone or growth factor. 
    21. Method according to anyone of the preceding claims, wherein the host cell is a mammalian cell. 
    22. Method according to claim 21, wherein the mammalian cell is a rodent cell, preferably a mouse or a hamster cell. 
    23. Method according to claim 22, wherein the hamster cell is a BHK or CHO 
cell and the mouse cell is a marine myeloma cells, preferably NS0 and Sp2/0 cell.
    cell and the mouse cell is a marine myeloma cells, preferably NS0 and Sp2/0 cell.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title | 
|---|---|---|---|
| CA002504010A CA2504010A1 (en) | 2002-11-28 | 2003-11-28 | Sequence specific dna recombination in eukaryotic cells | 
Applications Claiming Priority (6)
| Application Number | Priority Date | Filing Date | Title | 
|---|---|---|---|
| CA2,413,175 | 2002-11-28 | ||
| CA2413175A CA2413175C (en) | 2002-11-28 | 2002-11-28 | Sequence specific dna recombination in eukaryotic cells | 
| US10/310,695 US7491539B2 (en) | 2002-12-05 | 2002-12-05 | Sequence specific DNA recombination in eukaryotic cells | 
| US10/310,695 | 2002-12-05 | ||
| CA002504010A CA2504010A1 (en) | 2002-11-28 | 2003-11-28 | Sequence specific dna recombination in eukaryotic cells | 
| PCT/EP2003/013414 WO2004048584A1 (en) | 2002-11-28 | 2003-11-28 | Sequence specific dna recombination in eukaryotic cells | 
Publications (1)
| Publication Number | Publication Date | 
|---|---|
| CA2504010A1 true CA2504010A1 (en) | 2004-06-10 | 
Family
ID=34890675
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date | 
|---|---|---|---|
| CA002504010A Abandoned CA2504010A1 (en) | 2002-11-28 | 2003-11-28 | Sequence specific dna recombination in eukaryotic cells | 
Country Status (1)
| Country | Link | 
|---|---|
| CA (1) | CA2504010A1 (en) | 
- 
        2003
        - 2003-11-28 CA CA002504010A patent/CA2504010A1/en not_active Abandoned
 
Similar Documents
| Publication | Publication Date | Title | 
|---|---|---|
| CA2724908C (en) | Hgh polyadenylation signal | |
| JP2021505180A (en) | Manipulated Cas9 system for eukaryotic genome modification | |
| CA2706050C (en) | Novel att recombination sequences | |
| JP2024099583A (en) | Stable targeted integration | |
| EP1565562B1 (en) | Sequence specific dna recombination in eukaryotic cells | |
| JP2020174681A (en) | Efficient selectivity of recombinant proteins | |
| US20110136236A1 (en) | Genetically modified eukaryotic cells | |
| US7491539B2 (en) | Sequence specific DNA recombination in eukaryotic cells | |
| CA2413175C (en) | Sequence specific dna recombination in eukaryotic cells | |
| CA2504010A1 (en) | Sequence specific dna recombination in eukaryotic cells | |
| JP2004248509A (en) | Sequence-specific DNA recombination in eukaryotic cells | |
| HK1081998B (en) | Sequence specific dna recombination in eukaryotic cells | |
| 汪雪 | Development of a targeted gene integration procedure for the production of biopharmaceutical proteins | 
Legal Events
| Date | Code | Title | Description | 
|---|---|---|---|
| EEER | Examination request | ||
| FZDE | Discontinued | Effective date: 20121128 |