WO1999035275A1

WO1999035275A1 - Human genetic sequences which are homologous to the yeast genes involved in the proteolytic processing of prenylated proteins

Info

Publication number: WO1999035275A1
Application number: PCT/ES1999/000001
Authority: WO
Inventors: José María PEREZ FREIJE; Gloria Velasco Cotarelo; Alberto Martin Pendas; Pilar Blay Albors; Milagros Balbin Felechosa; Carlos Lopez Otin
Original assignee: Universidad De Oviedo; Fuji Yakuhin Kogyo Kabushiki Kaisha
Priority date: 1998-01-08
Filing date: 1999-01-08
Publication date: 1999-07-15
Also published as: ES2149672A1; ES2151459A1; ES2149672B1; ES2151459B1

Abstract

Human genetic sequences which are homologous to the yeast gene involved in the proteolytic processing of prenylated proteins. The invention comprises the identification of homologous fragments of human genes AFC1 and RCE1 of Saccharomyces cerevisiae, amplifying them through PCR of total RNA, using the amplified fragments as probes to hybridize cDNA and determining the sequence of the clones of cDNA which hybridize with the probes. The identified sequences are SEQ ID NO: 1 and SEQ ID NO: 2. The applications of said sequences concern essentially diagnosis and treatment of oncogenic troubles.

Description

Human gene sequences homologous to yeast genes involved in proteolytic processing of pre-piled proteins.

Field of the Invention

The invention is attached to the field of human oncology. Specifically, the present invention is about the enzymes that process pre-packed proteins and the genes that encode them. More particularly, the present invention addresses the identification of human proteases, and their possible inhibitors, responsible for post-translational modifications suffered by ras and other related proteins to activate and perform their pathological functions.

State of the art

Ras proteins are part of a guanine nucleotide binding protein superfamily that, after being activated by mutation, have the ability to transform eukaryotic cells. Mutated or oncogenic forms of ras genes have been found in a very significant percentage of human tumors, reaching 50% in colon and pancreas carcinomas (Annu. Rev. Biochem., 56, 779, (1987)). These observations indicate that ras genes contribute to the development of various types of human tumors, thus being molecular targets of therapeutic intervention.

Ras proteins are synthesized in the cell cytoplasm as precursor molecules that require various post-translational modifications to be inserted into the membrane and perform their biological functions there. The first of these modifications consists in the precursor or farnesylation of a cysteine residue, located in the carboxyl-terminal region, and forming part of a sequence. Cys-AAX, where A is usually an aliphatic amino acid and X is any amino acid. After pre-pilation, the three residues adjacent to the pre-piled cysteine are removed proteolytically and the resulting carboxyl group is methylated. So far, several proteins have been identified that participate in the pre-translation and methylation stages of ras proteins and other proteins that undergo analogous post-translational modifications, and among which fungal pheromones such as the yeast factor-a, γ subunits of various trimeric G proteins or small GTP binding proteins involved in cell vesicular traffic (Annu. Rev. Biochem. 6JL, 355, (1992)). However, there is currently no known human protein responsible for the proteolytic removal of residues adjacent to the pre-piled cysteine, an essential stage in the maturation of ras and related proteins.

One of the strategies for the identification of human protease activators of pre-piled proteins would be the search for homologous proteins in other phylogenetically remote species, such as yeasts, but which are also capable of developing analogous proteolytic processing reactions in Cys-A sequences. - AX. In this regard, it should be noted that recently Boyartchuk et al., (Science 275, 1797, (1997)) have described the existence in Saccharomyces cerevisiae of two proteins (Afc-1 and Rce-1) that participate in protein proteolytic maturation piled up. The Afc-1 protein has been described as a metalloprotease that participates in the processing of the pheromome called factor-a. On the contrary, the Rce-1 protein can contribute to the proteolytic processing of both factor-a and yeast ras proteins. The description of these two proteins opens the possibility of searching for analogous proteins in humans through a "homology cloning" strategy. One of the multiple ways of approaching this strategy, in a first step, seeks to search in publicly accessible databases of fragments of sequences of nucleotides of human genes that have similarity to the sequences of the AFC1 and RCE1 genes of Saccharomyces cerevisiae. Upon identification, the hypothetical homologous fragments can be amplified by PCR of total RNA from human tissues in which the expression of said genes is suspected, and used as probes to hybridize cDNA libraries prepared from RNA of the same tissues . Finally, the sequencing and subsequent characterization of human clones isolated by standard Molecular Biology techniques would confirm the possible role of the proteins encoded by said clones in the proteolytic processing of pre-piled proteins. Based on this idea, the authors of the invention, after the relevant experimental studies, have reached the objectives listed above that constitute the various aspects of the present invention.

Brief Description of the Invention

An object of the present invention is to identify the human gene that encodes a protein homologous to the Afc-1 protein of Saccharomyces cerevisiae.

Another object of the invention is to identify the human gene that encodes a protein homologous to the Rce-1 protein of Saccharomyces cerevisiae.

A third object of the invention is to analyze the expression in human tissues of the homologous genes to AFC1 and RCE1 of Saccharomyces cerevisiae.

Detailed description of the invention

The first object of the invention was the identification of a human gene that could encode a protein homologous to the Afc-1 protein of Saccharomyces cerevi - siae. For this, the amino acid sequence described for this protein was compared with the "Expressed Sequence Tags" (ESTs) division of the GenBank database used. leaving the TBLASTN program (J. Mol. Biol. 215, 403, (1990)). Six overlapping human ESTs were identified, whose access numbers are AA210930, F11310, Z43272, R54272, T35312 and N76181. From the overlap of these ESTs, we deduced a partial nucleotide sequence, which encodes a fragment of a hypothetical human protein with a degree of similarity of approximately 40% with the Afc-1 protein of Saccharomyces cerevisiae. This human protein was tentatively called Face-1, (Farnesylated-proteins cpnverting enzyme 3.). Its amino acid sequence, as well as the nucleotide sequence that encodes it, is shown as SEQ ID NO: 1.

From this sequence we design and synthesize two oligonucleotides, AFC1 (5 '-ATGAGGAGGTACTCGCTGTACTAGG-3') and AFC2 (5 '-GCTGGAACATGCTGCCCAGGAC-3'). These oligonucleotides were used to amplify the corresponding cDNA fragment using as a template total DNA isolated from a human ovarian cDNA library, constructed in Lambda DR2 (Clontech Catalog No. HL1146x). For this, 20 pmoles of each oligonucleotide were used, approximately 1 microgram of cDNA, 0.2 mM dNTPs and 1.25 U of Taq DNA polymerase in a total volume of 50 microliters of "ExpandLong buffer 3" (Boehringer Mannheim). The amplification was carried out in a Perkin-Elmer GeneAmp2400 apparatus, and consisted of an initial denaturation cycle (1 min, 94 ° C), 35 denaturation cycles (15 s, 94 ° C), hybridization (15 s, 60 ° C) and extension (1 min, 72 ^C C), followed by a final extension cycle of 10 min at 72 ° C. The resulting 516 base pair (bp) DNA fragment was purified by agarose gel electrophoresis and extraction with GeneClean. The identity of the amplified fragment with the partial sequence deduced for Face-1 was verified after subcloning it into pUC18 and determining its nucleotide sequence by standard Molecular Biology techniques.

In order to obtain a cDNA sequence containing the information encoding the Face-1 protein In full, the PCR product obtained with the oligonucleotides AFC1 and AFC2, described above, was radioactively labeled and hybridized with 10 ⁶ clones of the ovarian cDNA library cited above following standard procedures. 11 clones of lambda phage were obtained that hybridized specifically with the probe used. The isolated phage DNA was converted into the corresponding pDR2 plasmids by excision in vivo, following the instructions of the library provider. The recombinant pDR2 plasmids were analyzed by Southern blot, analysis that led us to select clone 1.3a, as it contained the longest insert. The nucleotide sequence of this plasmid was determined by the method of chain terminators described by Sanger (PNAS, 74 5463, (1977)). Sequencing revealed the existence of an open reading phase, which encodes a 475 amino acid protein that we call human Face-1. The comparison of this amino acid sequence with all the sequences present in the publicly accessible databases showed that the highest degree of similarity (40%) corresponded to the Afc-1 protein of Saccharomyces cerevisiae. A significant degree of similarity was also detected with the so-called hypothetical protein p59 of Schizosaccharomyces pombe and with an Escherichia coli protein of unknown function and called htpX. A more detailed analysis of the amino acid sequence of Face-1 revealed the presence of structural motifs that allow it to be classified as a metalloprotease of the gluzincin family (Methods Enzymol. 248, 183, (1995)). Thus, in position 335 is the HELGH sequence, which corresponds perfectly with the HEXXH sequence present in the metalloproteases and involved in the binding of metal ions. In addition, in position 415 there is a glutamic residue, conserved in the sequence of Afc-1, p59 and htpX, and which is also part of the metal binding site. Four residues away from this glutamic acid is an aspartic acid residue, essential for the catalytic activity of these proteases (Eur. J. Biochem. 221, 475, (1994)). The existence of a single human protein with these properties leads us to conclude that Face-1 is the human homologue of the Afc-1 protein of Saccharomyces cerevisiae and therefore its participation in the proteolytic maturation of prenilated proteins is presumable. Both the isolated DNA and the encoded polypeptide, represented in SEQ ID NO: 1, as partial sequences obtained from both, can also be chemically synthesized. The second object of the invention was the identification of a human gene that could encode a protein homologous to the Rce-1 protein of Saccharomyces cerevisiae. For this, the amino acid sequence described for this protein was compared with the "Expressed Sequence Tags" (ESTs) division of the GenBank database using the TBLASTN program. Six overlapping human ESTs were identified, whose access numbers are: W96412, W96411, AA220236, T97242, T97243 and D20146. From the overlap of these ESTs, we deduced a partial nucleotide sequence, which encodes a fragment of a hypothetical human protein with a degree of similarity of approximately 30% with the Rce-1 protein of Saccharomyces cerevisiae. This human protein was tentatively called Face-2, (Far-sylated-proteins converting enzyme 2). Its amino acid sequence, as well as the nucleotide sequence that encodes it, are shown in SEQ ID NO: 2. Similarly to that described for Face-1, two oligonucleotides, RCE1 (5 '-CTCTCACCCCTGTGCGTGCTGCTC-3') and RCE2 (5 '-CGCGCAAACAG CTGGGAAACCC-3') and were used to amplify the corresponding Face-2 fragment using as a template

Total DNA of a human breast cancer cDNA library built in lambda gtll (Clontech Catalog No. HL1059b). The obtained DNA fragment, 582 bp, was subcloned into pUC18 and sequenced by standard techniques in Molecular Biology, which allowed confirming its identity with the expected fragment of the Face-2 gene. In order to obtain a cDNA sequence containing the information encoding the complete Face-2 protein, the PCR product obtained with the oligonucleotides RCE1 and RCE2, described above, was radiolabelled and hybridized with 10 ⁶ clones of the ovarian cDNA library cited above following standard procedures. Four clones of lambda phage were obtained that hybridized specifically with the probe used. The isolated phage DNA was converted into the corresponding pDR2 plasmids by excision in vivo, following the instructions of the library provider. Analysis of the four plasmids revealed that they all contained cDNA inserts of similar size. The nucleotide sequence of the clone we call 1. Ib was determined by the method of chain terminators described by Sanger (PNAS, 74, 5463, (1977)). Sequencing revealed the existence of an open reading phase, which encodes a 329 amino acid protein that we call human Face-2. The comparison of this amino acid sequence with all the sequences present in the publicly accessible databases showed that the highest degree of similarity (30%) corresponded to the Rce-1 protein of Saccharomyces cerevisiae. Although the Rce-1 protein of Saccharomyces cerevisiae had been described as slightly similar to the signal peptidases type Ilb (Science 275, 1797, (1997)), Face-2 analysis did not confirm this hypothesis. On the contrary, a more detailed analysis of the amino acid sequence of Face-2 revealed the presence of structural motifs that allow it to be classified as a metalloprotease of the so-called Group IV, characterized by having ligand binding sites other than the HEXXH sequence (Methods Enzymol . 248, 183,

(nineteen ninety five) ) . The comparison of the Face-2 sequence with the proteins included so far in this group leads us to identify histidine residues in position 211 and glutamic acid in position 214 as residues involved in the binding of metal ions. Both residues are conserved in the sequence of Rce-1 and in the sequence of carboxypeptidases verses. An additional histidine residue at position 261, also conserved in these proteins, can participate in metal binding. The existence of a single human protein with these properties leads us to conclude that Face-2 is the human homologue of the Rce-1 protein of Saccharomyces cerevisiae, has structural motifs characteristic of proteases and more specifically of carboxypeptidases and therefore its participation is presumably in proteolytic maturation of pre-packed proteins. Both the isolated DNA and the encoded polypeptide, represented in SEQ ID NO: 2, as partial sequences obtained from both, can also be chemically synthesized.

The third object of the invention is to analyze the expression in human tissues of the homologous genes to AFC1 and RCE1 of Saccharomyces cerevisiae. To this end, two membranes containing polyadenylated RNA from multiple human tissues (leukocytes, colon, small intestine, ovary, testis, prostate, thymus, spleen, pancreas, kidney, skeletal muscle, liver, lung, placenta, brain and heart) are they hybridized with the radioactively labeled Face-1 and Face-2 probes. Two micrograms of polyadenylated RNA from the indicated tissues were hybridized with the Face-1 and Face-2 cDNAs. After a prehybridization at 42 ° C for three hours in 40% formamide, 5x PBS / EDTA (lx = 150 mM NaCl, 10 mM NaH ₂ P0 ₄ , lmM EDTA, pH 7.4), lOx Denhardt's solution (lx = Bovine serum albumin, 0.02%, polyvinylpyrrolidone, 0.02%, ficoll, 0.02%), 2% SDS and 100 mg / ml salmon sperm DNA, the probes were added and hybridized for 20 hours in The same conditions. The filters were washed with 1 x SSC (150 mM NaCl, 15 mM sodium citrate, pH 7.0) containing 0.1% SDS for 2 hours at 50 ° C, and finally exposed to autoradiography

(Fig. 1). As can be seen in Figure 1A, after hybridization with the Face-1 probe, a minor RNA of approximately 3.5 kilobases was detected in all the tissues analyzed. Similarly, when the filters hybridized with the Face-2 probe, a messenger RNA was detected. about 1.5 kilobases in all tissues, being especially abundant in testis. The ubiquitous expression of the Face-1 and Face-2 genes is consistent with the wide tissue distribution of pre-packed proteins, which in turn implies the need for the proteases responsible for their maturation to be present in all body tissues.

DESCRIPTION OF THE FIGURES

Figure 1. Northern analysis of human tissue expression of the Face-1 (A), Face-2 (B) genes and an Actin (C) control that is expressed in all tissues. The size of the RNAs used as markers is indicated on the left. (1) Leukocytes, (2) colon, (3) intestine, (4) ovary, (5) testis, (6) prostate, (7) thymus, (8) spleen, (9) pancreas, (10) kidney, (11) muscle, (12) liver, (13) lung, (14) placenta, (15) brain and (16) heart.

Claims

1. Procedure for the identification of human gene sequences homologous to the yeast genes involved in proteolytic processing of prenilated proteins characterized in that it comprises the following steps: a) Compare the nucleotide sequence of the Saccharomyces cerevisiae AFC1 and RCE1 genes with the Partial nucleotide sequences present in the expressed gene databases. b) Identify homologous fragments and amplify them by PCR of total RNA of human tissues in which said gene sequences can be expressed. c) Use the amplified fragments as probes to hybridize cDNA libraries prepared from RNA from human tissues. d) Isolate the cDNA clones that hybridize with the probes and determine their complete nucleotide sequence.

2. Identification method according to claim 1, characterized in that the identified gene sequences encode raster and other related protein processing proteases.

3. Identification method according to any of the preceding claims characterized in that the identified gene sequence and its deduced amino acid sequence are SEQ ID NO: 1.

4. Identification procedure according to claims 1 or 2, characterized in that the identified gene sequence and its deduced amino acid sequence are SEQ ID NO: 2.

5. Gene sequence SEQ ID NO: 1 and its mutations, derivatives or partial sequences, which code for proteolytic enzymatic activity of pre-piled proteins.

6. Gene sequence SEQ ID NO: 2 and its mutations, derivatives or partial sequences, which encode a proteolytic enzymatic activity of pre-piled proteins.

7. Use of the sequences SEQ ID NO: 1 and / or SEQ ID NO: 2 in the design of inhibitors of proteolytic enzymatic activity of pre-piled proteins.

8. Use of the sequences SEQ ID NO: 1 and / or SEQ ID NO: 2 in the production of recombinant or synthetic proteins.

9. Use of the sequences SEQ ID NO: 1 and / or SEQ ID NO: 2 in the production of antibodies.

10. Use of the sequences SEQ ID NO: 1 and / or SEQ ID NO: 2 in the production of proteolytic protein detection systems of pre-piled proteins and / or the genes encoding them.

11. Use of the sequences SEQ ID NO: 1 and / or SEQ ID NO: 2 in the production of active compositions in the treatment of pathological processes mediated by ras proteins or other pre-encapsulated proteins, and / or by genes encoding them .

12. Complete amino acid sequence or parts thereof, reflected in SEQ ID NO: 1.

13. Complete amino acid sequence or parts thereof, reflected in SEQ ID NO: 2.