CN119534824A - A blood metabolite marker combination for early diagnosis of colorectal cancer and its application - Google Patents
A blood metabolite marker combination for early diagnosis of colorectal cancer and its application Download PDFInfo
- Publication number
- CN119534824A CN119534824A CN202411692771.1A CN202411692771A CN119534824A CN 119534824 A CN119534824 A CN 119534824A CN 202411692771 A CN202411692771 A CN 202411692771A CN 119534824 A CN119534824 A CN 119534824A
- Authority
- CN
- China
- Prior art keywords
- colorectal cancer
- early diagnosis
- model
- blood metabolite
- marker combination
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 206010009944 Colon cancer Diseases 0.000 title claims abstract description 84
- 208000001333 Colorectal Neoplasms Diseases 0.000 title claims abstract description 75
- 238000013399 early diagnosis Methods 0.000 title claims abstract description 39
- 239000003550 marker Substances 0.000 title claims abstract description 36
- 239000002207 metabolite Substances 0.000 title claims abstract description 36
- 210000004369 blood Anatomy 0.000 title claims abstract description 34
- 239000008280 blood Substances 0.000 title claims abstract description 34
- 239000000090 biomarker Substances 0.000 claims abstract description 23
- 238000012216 screening Methods 0.000 claims abstract description 21
- FCVWHDAKNZMSON-UHFFFAOYSA-N Chrycorin Chemical compound O=C1CC2OCCCC2=C1C1=CC=CS1 FCVWHDAKNZMSON-UHFFFAOYSA-N 0.000 claims abstract description 19
- 238000001514 detection method Methods 0.000 claims abstract description 19
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 claims abstract description 16
- 238000000034 method Methods 0.000 claims abstract description 14
- DQWPFSLDHJDLRL-UHFFFAOYSA-N triethyl phosphate Chemical compound CCOP(=O)(OCC)OCC DQWPFSLDHJDLRL-UHFFFAOYSA-N 0.000 claims abstract description 14
- 238000007637 random forest analysis Methods 0.000 claims abstract description 11
- OZOJARPUDFLESM-UHFFFAOYSA-N 3-carboxy-4-methyl-5-(1-hydroxypropyl)-2-furanpropionic acid Chemical compound CCC(O)c1oc(CCC(O)=O)c(C(O)=O)c1C OZOJARPUDFLESM-UHFFFAOYSA-N 0.000 claims abstract description 9
- 229940114079 arachidonic acid Drugs 0.000 claims abstract description 8
- 235000021342 arachidonic acid Nutrition 0.000 claims abstract description 8
- JBKVHLHDHHXQEQ-UHFFFAOYSA-N epsilon-caprolactam Chemical compound O=C1CCCCCN1 JBKVHLHDHHXQEQ-UHFFFAOYSA-N 0.000 claims description 26
- 238000003745 diagnosis Methods 0.000 claims description 19
- 238000011156 evaluation Methods 0.000 claims description 19
- 238000004393 prognosis Methods 0.000 claims description 12
- 239000003153 chemical reaction reagent Substances 0.000 claims description 11
- 238000002360 preparation method Methods 0.000 claims description 8
- 238000007477 logistic regression Methods 0.000 claims description 7
- 238000012545 processing Methods 0.000 claims description 5
- 238000004590 computer program Methods 0.000 claims description 4
- 238000013210 evaluation model Methods 0.000 claims description 4
- 230000007170 pathology Effects 0.000 claims description 4
- 238000001793 Wilcoxon signed-rank test Methods 0.000 claims description 3
- 238000004364 calculation method Methods 0.000 claims description 3
- 230000000813 microbial effect Effects 0.000 claims description 3
- 238000002546 full scan Methods 0.000 claims description 2
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 claims description 2
- 230000035945 sensitivity Effects 0.000 abstract description 16
- 208000035965 Postoperative Complications Diseases 0.000 abstract description 3
- 208000002193 Pain Diseases 0.000 abstract description 2
- 238000012417 linear regression Methods 0.000 abstract description 2
- 230000036407 pain Effects 0.000 abstract description 2
- 239000000523 sample Substances 0.000 description 17
- 238000012360 testing method Methods 0.000 description 13
- 238000004458 analytical method Methods 0.000 description 9
- 201000010099 disease Diseases 0.000 description 8
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 8
- 206010028980 Neoplasm Diseases 0.000 description 7
- 201000011510 cancer Diseases 0.000 description 6
- -1 1-hydroxypropyl Chemical group 0.000 description 5
- 238000002052 colonoscopy Methods 0.000 description 5
- 102100025475 Carcinoembryonic antigen-related cell adhesion molecule 5 Human genes 0.000 description 4
- 230000002550 fecal effect Effects 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- YEBDWAHEIMUJQT-ZLCLUPBPSA-N (5z,8z,11z,14z)-icosa-5,8,11,14-tetraenoic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O.CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YEBDWAHEIMUJQT-ZLCLUPBPSA-N 0.000 description 3
- 208000003200 Adenoma Diseases 0.000 description 3
- 206010001233 Adenoma benign Diseases 0.000 description 3
- 239000002253 acid Substances 0.000 description 3
- 238000013473 artificial intelligence Methods 0.000 description 3
- 238000004422 calculation algorithm Methods 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 230000000968 intestinal effect Effects 0.000 description 3
- 210000002966 serum Anatomy 0.000 description 3
- 239000005747 Chlorothalonil Substances 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 2
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 2
- 229940097572 chloromycetin Drugs 0.000 description 2
- CRQQGFGUEAVUIL-UHFFFAOYSA-N chlorothalonil Chemical compound ClC1=C(Cl)C(C#N)=C(Cl)C(C#N)=C1Cl CRQQGFGUEAVUIL-UHFFFAOYSA-N 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 2
- 239000005457 ice water Substances 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 238000000513 principal component analysis Methods 0.000 description 2
- 239000013062 quality control Sample Substances 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000004704 ultra performance liquid chromatography Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- WSGYTJNNHPZFKR-UHFFFAOYSA-N 3-hydroxypropanenitrile Chemical compound OCCC#N WSGYTJNNHPZFKR-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108010022366 Carcinoembryonic Antigen Proteins 0.000 description 1
- 208000005623 Carcinogenesis Diseases 0.000 description 1
- 241000219061 Rheum Species 0.000 description 1
- 235000009411 Rheum rhabarbarum Nutrition 0.000 description 1
- WEVYAHXRMPXWCK-UHFFFAOYSA-N acetonitrile Substances CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- 238000003759 clinical diagnosis Methods 0.000 description 1
- 201000002758 colorectal adenoma Diseases 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 239000000104 diagnostic biomarker Substances 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000008463 key metabolic pathway Effects 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 230000036210 malignancy Effects 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000004066 metabolic change Effects 0.000 description 1
- 235000020938 metabolic status Nutrition 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 108090000623 proteins and genes Proteins 0.000 description 1
- 230000009711 regulatory function Effects 0.000 description 1
- 230000008672 reprogramming Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 238000004885 tandem mass spectrometry Methods 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 238000002137 ultrasound extraction Methods 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000003643 water by type Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/574—Immunoassay; Biospecific binding assay; Materials therefor for cancer
- G01N33/57407—Specifically defined cancers
- G01N33/57419—Specifically defined cancers of colon
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/30—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
Landscapes
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Immunology (AREA)
- Public Health (AREA)
- Pathology (AREA)
- Medical Informatics (AREA)
- Chemical & Material Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Urology & Nephrology (AREA)
- Molecular Biology (AREA)
- Hematology (AREA)
- Cell Biology (AREA)
- Medicinal Chemistry (AREA)
- Biotechnology (AREA)
- Databases & Information Systems (AREA)
- General Physics & Mathematics (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Primary Health Care (AREA)
- Epidemiology (AREA)
- Food Science & Technology (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Hospice & Palliative Care (AREA)
- Oncology (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
The invention belongs to the field of colorectal cancer detection, and relates to a blood metabolite marker combination for early diagnosis of colorectal cancer and application thereof. The screening of the invention results in a blood metabolite biomarker combination for early diagnosis of colorectal cancer comprising Ep silon-Caprolactam, triethyl Phosphate, 3-Carboxy-4-Methyl-5- (1-Hydroxypropyl) -2-Furanpropi onic Acid, arachidonic Acid, chrycorin; the method is used for diagnosing colorectal cancer through models such as random forests and linear regression, has high sensitivity and strong specificity, can avoid pains, discomfort and postoperative complications caused by enteroscopy invasive examination, obviously reduces the false positive rate of early diagnosis of colorectal cancer, and is widely used for early diagnosis of colorectal cancer.
Description
Technical Field
The invention belongs to the field of colorectal cancer detection, and particularly relates to a biomarker group for early diagnosis of colorectal cancer and application thereof.
Background
Colorectal cancer is a common malignancy worldwide, accounting for about one tenth of cancer cases, with morbidity and mortality of 6.1% and 9.2%, respectively. Colorectal cancer incidence rates in the 380 ten thousand new cancer cases in china in 2018 are fourth in men, third in women and second in the world. If colorectal lesions can be detected early and resected by surgery, survival rates are over 90% and later are reduced to 13%, so early screening is the most effective means of reducing colorectal mortality. With the development of sequencing technology and information technology, artificial intelligence has become an important tool for assisting clinicians, such as imaging diagnosis. The data information can be deeply mined by utilizing an artificial intelligence algorithm and bioinformatics, a diagnosis model is built, and the diagnosis accuracy can be improved. The data show that the artificial intelligent model based on the algorithms such as neural network and deep learning can reach more than 80% accuracy, so that the patient benefits. Thus, combining artificial intelligence learning with early diagnosis will further reduce colorectal cancer mortality and medical burden.
Currently, invasive colonoscopy is still the gold standard for CRC diagnosis, the most commonly used method of diagnosis in the clinic, and the results of case-control and cohort studies indicate that colonoscopy has the potential to prevent CRC and cancer death. China recruited 1,381,561 qualified participants from 16 provinces in china from 2012 to 2015, aged 40-69 years, established a risk scoring system, then recommended for colonoscopy, and finally only 25,593 participants were colonoscopy with a participation rate of 14.0%. Although colonoscopy is considered the gold standard for CRC screening, it is an invasive procedure, requires a high level of expertise, is costly, can cause postoperative complications, and patient experience is poor, which limits its choice as an early screening. The non-invasive detection method, the American cancer society recommends that the high sensitivity FOBT and FIT be checked once every three years, but the sensitivity is lower (12% -56%), and the sensitivity is greatly influenced by diet and medication, so that the sensitivity is still further improved. Multi-target fecal DNA detection Cologuard, FDA approval in 2014, recommended screening every 3 years, but its sensitivity to early adenoma is low (42.4%), detection cost (599 merits) is high, performance verification is currently only obtained in a few countries such as the united states, canada, etc., and no clinical experiments have been performed in china. The large-scale use of these methods is limited due to the low accuracy of non-invasive tests and the damage caused by invasive tests. Thus, a need exists for a non-invasive, accurate method of CRC detection.
The occurrence of cancer is accompanied by global changes in metabolic status, affecting the tumor tissue as well as the surrounding microenvironment and macroscopical environment. Metabolic changes in tumor cell status can be more directly observed than genomic and proteomic changes and are therefore a promising source of biomarkers for detecting tumorigenesis. Various studies have shown that metabolites produced by intestinal bacteria can enter the circulation and exert regulatory functions in distant organs, key metabolic pathways being disrupted in the pathogenesis of CRC. In recent years, serum metabolites closely related to CRC have been intensively studied to develop diagnostic biomarkers. Intestinal microbiome reprogramming in colorectal cancer patients is associated with changes in serum metabolome, and changes in intestinal microbiome related serum metabolites (GMSM) can effectively distinguish CRC and adenoma patients from normal individuals. GMSM-based models can distinguish CRC and adenoma patients from healthy normal subjects more effectively than the clinical marker carcinoembryonic antigen. The GMSM model is superior to the clinical biomarkers CEA and FOBT in colorectal abnormality detection. Colorectal abnormalities were detected using CEA at a clinical cut-off of 5U/mL, AUC of 0.72, sensitivity of 35.8%, specificity of 86.4%. In contrast, AUC of GMSM model reached 0.92 (sensitivity=83.5%, specificity=84.9%) well above CEA. The detection sensitivity of FOBT/FIT is 65.2%, which is equivalent to that of the prior report. These results indicate that the GMSM model is superior to the FOBT/FI T test in detecting CRC.
The invention screens specific blood metabolites as biomarker compositions for early diagnosis of colorectal cancer, wherein the biomarker compositions comprise Epsilon-Caprolactam (caprolactam), triethyl Phosphate (triethyl phosphate), 3-Carboxy-4-Methyl-5- (1-Hydroxypropyl) -2-Furanpropionic Acid (3-carboxyl-4-Methyl-5- (1-hydroxypropyl) -2-furopropionic acid), arachidonic Acid (arachidonic acid) and Chrycorin (chloromycetin), and the biomarker compositions are used for the diagnosis of colorectal cancer by using the models of random forest, linear regression and the like.
Disclosure of Invention
Aiming at the technical problems, the invention aims to provide a blood metabolite marker combination for early diagnosis of colorectal cancer and application thereof. The method specifically comprises the following steps:
In a first aspect, the invention provides a blood metabolite marker combination for early diagnosis of colorectal cancer, comprising Epsilon-Caprolactam (caprolactam), triethyl Phosphate (triethyl phosphate), 3-Carboxy-4-Methyl-5- (1-Hydroxypropyl) -2-Furanpropionic Acid (3-carboxy-4-Methyl-5- (1-hydroxypropyl) -2-furanpropanoic acid), arachidonic Acid (arachidonic acid), chrycorin (chlorothalonil).
In a second aspect, the present invention provides the use of a reagent for detecting a blood metabolite marker combination according to the first aspect above, for the preparation of a reagent, kit or chip for early diagnosis or prognosis of colorectal cancer.
In a third aspect, the present invention provides the use of a reagent for detecting a blood metabolite marker combination according to the first aspect above, for the preparation of a colorectal cancer patient screening evaluation model.
In a fourth aspect, the present invention provides the use of a blood metabolite marker combination according to the first aspect above for the preparation of a reagent, kit or chip for early diagnosis or prognosis of colorectal cancer.
In a fifth aspect, the present invention provides the use of a blood metabolite marker combination according to the first aspect above for the preparation of a colorectal cancer patient screening evaluation model.
In a sixth aspect, the present invention provides a kit for early diagnosis or prognosis of colorectal cancer, the kit comprising reagents or apparatus for detecting a blood metabolite marker combination according to the first aspect above.
In a seventh aspect, the present invention provides a model for early diagnosis or prognosis of colorectal cancer, the model comprising:
(1) Sample processing, namely extracting the blood metabolite marker combination in the first aspect of the blood sample to be detected and carrying out LC-MS/MS full-scan detection to obtain the abundance value of the marker;
(2) Marking colorectal cancer samples determined by colonoscope and pathology detection as a value 1, marking healthy crowd samples as a value 0, inputting abundance values of biomarker combinations of different obtained samples into a random forest, constructing a model, and predicting diagnosis results;
(3) The result judges that the colorectal cancer diagnosis result is positive when the predictive value is 1, and negative when the predictive value is 0.
In an eighth aspect, the present invention provides a method for determining a blood metabolite marker combination according to the first aspect, comprising:
(1) Group difference statistics (Wilcoxon test) were performed on markers of colorectal cancer group and healthy group, using species with P <0.05 as potential biomarkers;
(2) The optimal combination of microbial biomarkers was determined using the R software package AUCRF (random forest method);
(3) Determining the optimal combination by randomly removing the markers using AUC as an evaluation index (pROC packs);
(4) And simultaneously constructing a logistic regression model by using the markers, and comparing the logistic regression model with the marker to obtain the blood metabolite marker combination.
In a ninth aspect, the present invention provides a model system for early diagnosis or prognosis of colorectal cancer, comprising:
(1) The pre-input module is at least used for inputting data to be evaluated;
(2) The evaluation module is at least used for evaluating the data to be evaluated and is executed by the model in the seventh aspect;
(3) And the display module is at least used for displaying the evaluation result.
In a tenth aspect, the present invention provides a computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the following steps using the system of the eighth aspect when executing the program, comprising:
(1) Collecting and/or inputting evaluation data for early diagnosis or prediction of colorectal cancer;
(2) Substituting the evaluation data into the model for calculation to obtain a judgment conclusion of whether the sample is colorectal cancer;
(3) And outputting a judging conclusion of whether the sample is colorectal cancer.
The kit has the beneficial effects that ① is subjected to a large number of screening to obtain metabolites Epsi lon-Caprolactam (caprolactam), triethyl Phosphate (triethyl phosphate), 3-Carboxy-4-Methyl-5- (1-Hydroxypropyl) -2-Furanpropionic Acid (3-carboxyl-4-Methyl-5- (1-hydroxypropyl) -2-furopropionic acid), arachi donic Acid (arachidonic acid) and Chrycorin (chloromycetin) which are derived from blood samples for the first time, and ② is used for constructing an early diagnosis or prediction model of colorectal cancer by using the markers, provides a new direction for clinical diagnosis of colorectal cancer, avoids pain, discomfort and postoperative complications caused by the conventional early diagnosis of colorectal cancer, remarkably reduces false positive rate of early diagnosis of colorectal cancer, has the characteristics of good specificity and high sensitivity, has good clinical application value for auxiliary diagnosis of colorectal cancer, and can be widely used for early diagnosis of colorectal cancer.
Drawings
Fig. 1 shows a graph of AUC versus markers constructed from a random forest model using pROC markers from 52 markers, the curve showing that the model consisting of 11 markers is optimal.
FIG. 2 shows the AUC curve of a diagnostic model constructed using 5 fecal metabolites, RF represents a random forest model, glm represents a logistic regression model.
Detailed Description
The following examples facilitate a better understanding of the present invention, but are not intended to limit the same. The experimental methods in the following examples are conventional methods unless otherwise specified. The test materials used in the examples described below are commercially available unless otherwise specified.
In one embodiment, sensitivity, specificity, accuracy, and other combinations are used to describe the well and reliability of the detection methods of the present invention. Several terms used with descriptions of sensitivity, specificity, accuracy include True Positive (TP), true Negative (TN), false Positive (FP), false Negative (FN), wherein a test result is considered true if a patient is demonstrated to have a disease and a given screening test also indicates the presence of the disease, a test result is considered true if a patient is demonstrated to have no disease and a given screening test also indicates the absence of the disease, a test result is false positive if a screening test result indicates that a patient who does not actually have a disease has the disease, and a test result is false negative if a screening test result indicates that a patient who does not actually have a disease does not have the disease.
Sensitivity = TP/(tp+fn) = number of true positive assessments/number of all positive assessments;
specificity = TN/(tn+fp) = number of true negative evaluations/number of all negative evaluations;
accuracy= (tn+tp)/(tn+tp+fn+fp) =number of correct evaluations/number of all evaluations.
In the following examples, 63 patients suffering from colorectal cancer and adenoma and 40 healthy people are selected, the above cases are screened by using the marker combination provided by the invention, and the sensitivity, the specificity and the accuracy of the marker combination provided by the invention for colorectal cancer detection are judged according to the detection result.
EXAMPLE 1 screening of blood metabolites for early diagnosis of colorectal cancer
1. Blood sample collection
A5 ml vacuum blood collection tube was prepared, blood from the patient was collected, immediately centrifuged at 3000rpm/min for 15 minutes, and the supernatant was aspirated and packaged into sterile centrifuge tubes and immediately stored at-80 ℃.
2. Blood biomarker detection
2.1 Metabolite extraction
(1) Taking out a sample stored at-80 ℃, thawing in an ice-water mixture, and transferring 80 mu L of the sample to a 1.5mL EP tube;
(2) 320. Mu.L of protein precipitant methanol-acetonitrile (V: V=2:1, containing mixed internal standard, 4. Mu.g/mL) was added and vortexed for 1min;
(3) Ultrasonic extraction in ice water bath for 10min at-40 ℃ and standing overnight;
(4) Centrifuge for 10min (12000 rpm,4 ℃) aspirate 150 μl of supernatant with syringe, filter with 0.22 μm organic phase pinhole filter, transfer to LC sample vials, store at-80 ℃ until LC-MS analysis.
(5) The quality control sample (QC) is prepared by mixing all the extracting solutions of all the samples in equal volume. All extraction reagents were pre-chilled at-20 ℃.
2.2 Liquid chromatography-Mass Spectrometry analysis
(1) The analytical instrument for the experiment is a liquid-mass combination system consisting of Waters ACQUITY UPLC I-Class plus/Thermo QE ultra-high performance liquid tandem high resolution mass spectrometer.
(2) The chromatographic column was ACQUITY UPLC HSS T (100 mm. Times.2.1 mm,1.8 μm), column temperature 45 ℃, mobile phase A-water (0.1% formic acid), B-acetonitrile, flow rate 0.35mL/min, and sample volume 3. Mu.L.
(3) Mass spectrometry conditions :Spray Voltage(V)(3800 -3000),Capillary Temperature(℃)(320),Aux gas heater temperature(℃)(350),Sheath Gas Flow Rate(Arb)(35),Aux gas flow rate(Arb)(8),S-lens RF level(50),Mass range(m/z)(70-1050),Full ms resolution(70000),MS/MS resolution(17500),NCE/stepped NCE(10,20,40).
2.3 Data processing
(1) Data preprocessing before pattern recognition, the original data is subjected to baseline filtration, peak recognition, integration, retention time correction, peak alignment and normalization by metabonomics processing software Progenesis QI v 3.0.0, with main parameters of :pre cursor tolerance:5ppm(HMDB+Lipidmaps)/10ppm(LuMet-Animal+METLIN);product to lerance:10ppm(HMDB+Lipidmaps)/20ppm(LuMet-Animal+METLIN).
(2) Identification of compounds based on multiple dimensions of RT (retention time), exact mass number, secondary fragmentation, and isotope distribution, identification analysis was performed using The Human Metabolome Database (HMDB), LIPIDMAPS (v 2.3), and METLIN databases and LuMet-animate 3.0 local databases.
(3) The extracted data is subjected to deletion value processing, 0 value replacement, score scoring screening, data merging and the like. Missing value processing and 0 value substitution by deleting ion peaks with missing values (0 value) >50% in the group and substituting the remaining 0 value with half of the minimum value of all ion intensities of all samples. Score scoring screening, namely screening the compounds obtained qualitatively according to the Score of the qualitative results of the compounds (Score), wherein the screening standard is 36 points (full Score of 80 points), and the qualitative results are inaccurate and deleted under 36 points.
(4) The multivariate statistical analysis firstly adopts an unsupervised Principal Component Analysis (PCA) to observe the overall distribution among samples and the stability of the whole analysis process, and then uses a supervised partial least squares analysis (PLS-DA) and an orthogonal partial least squares analysis (OPLS-DA) to distinguish the overall differences of the metabolic profiles among groups and find the differential metabolites among groups.
2.4 Marker combination screening
(1) The markers of colorectal cancer group and healthy group were subjected to inter-group difference statistics (Wilcoxon test) using species with P <0.05 as potential biomarkers, and by Wilcoxon inter-group difference test, markers with P value less than 0.05 after Bonferroni correction were selected for total of 52 metabolites.
(2) Colorectal cancer (colonoscope and pathology detection determination) samples were labeled as value 1, healthy population samples were labeled as value 0, and modeling analysis and prediction were performed on different sample abundance matrices. The optimal combination of microbial biomarkers was determined using the R software package AUCRF (random forest method). The results are shown in FIG. 1, which shows that the diagnosis is best when the combination contains 11 markers.
(3) By randomly removing the markers, the AUC was used as an evaluation index (pROC packs) to determine the optimal combination. The final determination of 5 metabolic markers will retain the diagnostic properties of the model, epsilon-Caprolactam (caprolactam), triethyl Phosphate (triethyl phosphate), 3-Carboxy-4-Methyl-5- (1-Hydroxypropyl) -2-Furanpropionic Acid (3-carboxy-4-Methyl-5- (1-hydroxypropyl) -2-furanopropionic acid), arachidonic Acid (arachidonic acid), chrycorin (chlorothalonil).
(4) And simultaneously constructing a logistic regression model by using the markers, and comparing the logistic regression model with the marker.
By the above analysis, Epsilon-Caprolactam、Triethyl Phosphate、3-Carboxy-4-Methyl-5-(1-Hydroxypropyl)-2-Furanpropionic Acid、Arachidonic Acid、Chrycorin metabolites were selected as marker combinations for early diagnosis of colorectal cancer for healthy and colorectal cancer population discrimination.
Example 2 establishment and validation of early diagnosis or prediction model of colorectal cancer
1. Model construction
The metabolite Epsilon-Caprolactam、Triethyl Phosphate、3-Carboxy-4-Methyl-5-(1-Hydroxypropyl)-2-Furanpropionic Acid、Arachidonic Acid、Chrycorin obtained by the final screening in example 1 was selected as a marker combination for early diagnosis of colorectal cancer, and an early diagnosis or prediction model of colorectal cancer was constructed by the following specific method:
(1) Extracting metabolites Epsilon-Caprolactam、Triethyl Phosphate、3-Carboxy-4-Met hyl-5-(1-Hydroxypropyl)-2-Furanpropionic Acid、Arachidonic Acid、Chrycorin, in the blood sample to obtain the abundance value of the marker;
(2) Marking colorectal cancer (colonoscope and pathological detection determination) samples as a value 1, marking healthy crowd samples as a value 0, inputting obtained abundance values (n x n table) of biomarker combinations of different samples into a random forest, constructing a model, and predicting diagnosis results;
(3) When the abundance matrix of the marker is used, the predicted value of the random forest algorithm is 1, the diagnosis result of the sample is judged to be positive, namely colorectal cancer, and when the predicted value is 0, the diagnosis result of the sample is judged to be negative, namely health.
2. Accuracy verification
The accuracy is determined by comparing the above-described predictive diagnosis with the true diagnosis (colonoscope and pathology detection are determined to be colorectal cancer).
3. Analysis of results
Healthy and colorectal cancer population discrimination was performed using Epsilon-Caprolactam、Triethyl Phosphate、3-Carboxy-4-Methyl-5-(1-Hydroxypropyl)-2-Furanpropionic Acid、Arachidonic Acid、Chrycorin markers, and the results are shown in fig. 2, with a specificity of 90.0% and an accuracy of 92.9% when the sensitivity was 81.0%.
The results show that the biomarker combination provided by the invention is used for colorectal cancer diagnosis, and has higher accuracy, specificity and sensitivity.
EXAMPLE 3 colorectal cancer early diagnosis or prediction System
The present embodiment provides a system for early diagnosis or prognosis of colorectal cancer, comprising:
(1) The pre-input module is at least used for inputting the biomarker combination abundance data in the fecal sample and transmitting the biomarker combination abundance data to the evaluation module;
(2) The evaluation module is at least used for analyzing the abundance data of the biomarker combinations in the fecal sample according to the model described in the embodiment 2 to obtain a result, wherein the abundance data of the biomarker combinations can be collected through the pre-input module and can be obtained from other sources.
(3) The display module is at least used for displaying the evaluation result, and the colorectal cancer diagnosis result is negative when the predicted value is 0, and positive when the predicted value is 1.
Example 4 computer apparatus
The present embodiment provides an electronic device, which may be expressed in the form of a computing device (e.g., may be a server device), including a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor may implement the colorectal cancer early diagnosis or prediction model in embodiment 2 of the present invention when executing the computer program.
The processor, when executing the program, causes the system to perform the steps comprising:
(1) Collecting and/or inputting biomarker combination abundance data for early diagnosis or prediction of colorectal cancer;
(2) Substituting the evaluation data into the model for calculation to obtain a judgment conclusion that the sample is positive or negative to colorectal cancer;
(3) And outputting a judgment conclusion that the sample is positive or negative to colorectal cancer.
It should be noted that although several units/modules or sub-units/modules of an electronic device are mentioned in the above detailed description, such a division is merely exemplary and not mandatory. Indeed, the features and functionality of two or more units/modules described above may be embodied in one unit/module in accordance with embodiments of the present application. Conversely, the features and functions of one unit/module described above may be further divided into ones that are embodied by a plurality of units/modules.
Embodiment 5 computer-readable storage Medium
The present embodiment provides a computer-readable storage medium having the computer device stored thereon. The program is executed by the processor to implement the steps of the method for evaluating the geodetic properties of rhubarb in the first embodiment of the invention.
More specifically, a readable storage medium may include, but is not limited to, a portable disk, hard disk, random access memory, read only memory, erasable programmable read only memory, optical storage device, magnetic storage device, or any suitable combination of the foregoing.
In a possible embodiment, the invention may also be implemented in the form of a program product comprising program code for causing a terminal device to execute a model for carrying out the early diagnosis or prognosis of colorectal cancer in embodiment 2 of the invention, when said program product is run on the terminal device.
Wherein the program code for carrying out the invention may be written in any combination of one or more programming languages, which program code may execute entirely on the user device, partly on the user device, as a stand-alone software package, partly on the user device and partly on the remote device or entirely on the remote device.
The foregoing description is directed to the preferred embodiments of the present invention, but the embodiments are not intended to limit the scope of the invention, and all equivalent changes or modifications made under the technical spirit of the present invention should be construed to fall within the scope of the present invention.
Claims (10)
1. A blood metabolite marker combination for early diagnosis of colorectal cancer, characterized in that the blood metabolite marker combination comprises Epsilon-Caprolactam、Triethyl Phosphate、3-Carboxy-4-Methyl-5-(1-Hydroxyp ropyl)-2-Furanpropionic Acid、Arachidonic Acid、Chrycorin.
2. Use of a reagent for detecting a blood metabolite marker combination according to claim 1 for the preparation of a reagent, kit or chip for early diagnosis or prognosis of colorectal cancer.
3. Use of a reagent for detecting a blood metabolite marker combination according to claim 1 for the preparation of a colorectal cancer patient screening evaluation model.
4. Use of a blood metabolite marker combination according to claim 1 for the preparation of a reagent, kit or chip for early diagnosis or prognosis of colorectal cancer.
5. Use of a blood metabolite marker combination according to claim 1 for the preparation of a colorectal cancer patient screening evaluation model.
6. A kit for early diagnosis or prognosis of colorectal cancer, the kit comprising reagents or instruments for detecting the blood metabolite marker combination of claim 1.
7. A model for early diagnosis or prognosis of colorectal cancer, characterized in that the model comprises:
(1) Sample processing, namely extracting the blood metabolite marker combination in the first aspect of the blood sample to be detected and carrying out LC-MS/MS full-scan detection to obtain the abundance value of the marker;
(2) Marking colorectal cancer samples determined by colonoscope and pathology detection as a value 1, marking healthy crowd samples as a value 0, inputting abundance values of biomarker combinations of different obtained samples into a random forest, constructing a model, and predicting diagnosis results;
(3) The result judges that the colorectal cancer diagnosis result is positive when the predictive value is 1, and negative when the predictive value is 0.
8. The method for determining a blood metabolite marker combination for early diagnosis of colorectal cancer according to claim 1:
(1) Group difference statistics (Wilcoxon test) were performed on markers of colorectal cancer group and healthy group, using species with P <0.05 as potential biomarkers;
(2) The optimal combination of microbial biomarkers was determined using the R software package AUCRF (random forest method);
(3) Determining the optimal combination by randomly removing the markers using AUC as an evaluation index (pROC packs);
(4) And simultaneously constructing a logistic regression model by using the markers, and comparing the logistic regression model with the marker to obtain the blood metabolite marker combination.
9. A model system for early diagnosis or prognosis of colorectal cancer, comprising:
(1) The pre-input module is at least used for inputting data to be evaluated;
(2) An evaluation module at least for evaluating the data to be evaluated, performed by the model of claim 7 or 8;
(3) And the display module is at least used for displaying the evaluation result.
10. A computer device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor, when executing the program, implements the steps comprising using the system of claim 9:
(1) Collecting and/or inputting evaluation data for early diagnosis or prediction of colorectal cancer;
(2) Substituting the evaluation data into the model for calculation to obtain a judgment conclusion of whether the sample is colorectal cancer;
(3) And outputting a judging conclusion of whether the sample is colorectal cancer.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202411692771.1A CN119534824A (en) | 2024-11-25 | 2024-11-25 | A blood metabolite marker combination for early diagnosis of colorectal cancer and its application |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202411692771.1A CN119534824A (en) | 2024-11-25 | 2024-11-25 | A blood metabolite marker combination for early diagnosis of colorectal cancer and its application |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN119534824A true CN119534824A (en) | 2025-02-28 |
Family
ID=94706278
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202411692771.1A Pending CN119534824A (en) | 2024-11-25 | 2024-11-25 | A blood metabolite marker combination for early diagnosis of colorectal cancer and its application |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN119534824A (en) |
-
2024
- 2024-11-25 CN CN202411692771.1A patent/CN119534824A/en active Pending
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20170358434A1 (en) | Big-data analyzing method and mass spectrometric system using the same method | |
| CN102323351B (en) | Bladder cancer patient urine specific metabolite spectrum, establishing method and application | |
| CN108603859B (en) | Use of metabolites in urine in the preparation of kits for evaluation of cancer | |
| CN112748191A (en) | Small molecule metabolite biomarker for diagnosing acute diseases, and screening method and application thereof | |
| CN117388495B (en) | Application of metabolic marker for diagnosing lung cancer stage and kit | |
| CN118471335B (en) | Device, method, system for diagnosing cervical cancer lymphatic metastasis and application thereof | |
| CN116559453A (en) | A biomarker for lung cancer detection | |
| CN112201356A (en) | Construction method of oral squamous cell carcinoma diagnosis model, marker and application thereof | |
| CN109946411B (en) | Biomarkers for the diagnosis of ossification of the ligamentum flavum of the thoracic spine and their screening methods | |
| CN116859050A (en) | Application of protein marker combination in the preparation of early auxiliary diagnosis products for colorectal cancer | |
| CN114822854B (en) | Progression of gastric mucosal lesions and gastric cancer-related urinary protein markers and their application | |
| CN116106534A (en) | Application of biomarker combination in preparation of lung cancer prediction product | |
| CN118465282B (en) | Biomarker combination, kit, system and application thereof for predicting liver metastasis of colorectal cancer | |
| CN117877712A (en) | Castleman disease diagnosis, identification and prediction system based on random forest | |
| CN119534824A (en) | A blood metabolite marker combination for early diagnosis of colorectal cancer and its application | |
| CN111430030A (en) | Methods and systems for the application of biomarkers in the evaluation of ovarian cancer | |
| CN119643867A (en) | Fecal metabolite marker combination for early diagnosis of colorectal cancer and application | |
| CN115825258A (en) | Glycopeptide-based early liver cancer and liver cirrhosis assessment method | |
| CN114609389A (en) | Method for using ZNF326 protein as II-stage colorectal cancer curative effect prediction marker | |
| Hollfoth et al. | Accurate MS-based diagnostic amyloid typing using endogenously normalized protein intensities in formalin-fixed paraffin-embedded tissue | |
| CN119649906A (en) | A fecal biomarker combination for early diagnosis of colorectal cancer and its application | |
| CN118150830B (en) | Application of protein marker combination in preparation of colorectal cancer early diagnosis product | |
| Jiang et al. | Protocol for identifying metabolite biomarkers in patient uterine fluid for early ovarian cancer detection | |
| Gowda | Molecular Diagnosis of Glial Cell Brain Cancers Using Handheld Mass Spectrometry Device | |
| CN119506445A (en) | A biomarker combination for early diagnosis or prediction of colorectal cancer and its application |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination |