[go: up one dir, main page]

CN107358019B - Recommendation method for concept-shifted medical solutions - Google Patents

Recommendation method for concept-shifted medical solutions Download PDF

Info

Publication number
CN107358019B
CN107358019B CN201710379952.2A CN201710379952A CN107358019B CN 107358019 B CN107358019 B CN 107358019B CN 201710379952 A CN201710379952 A CN 201710379952A CN 107358019 B CN107358019 B CN 107358019B
Authority
CN
China
Prior art keywords
historical
case
classifier
sample
medical
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710379952.2A
Other languages
Chinese (zh)
Other versions
CN107358019A (en
Inventor
沈坤炜
陈小松
朱思吉
曹健
朱能军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ruinjin Hospital Affiliated to Shanghai Jiaotong University School of Medicine Co Ltd
Shanghai Jiao Tong University
Original Assignee
Ruinjin Hospital Affiliated to Shanghai Jiaotong University School of Medicine Co Ltd
Shanghai Jiao Tong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ruinjin Hospital Affiliated to Shanghai Jiaotong University School of Medicine Co Ltd, Shanghai Jiao Tong University filed Critical Ruinjin Hospital Affiliated to Shanghai Jiaotong University School of Medicine Co Ltd
Priority to CN201710379952.2A priority Critical patent/CN107358019B/en
Publication of CN107358019A publication Critical patent/CN107358019A/en
Application granted granted Critical
Publication of CN107358019B publication Critical patent/CN107358019B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Medical Treatment And Welfare Office Work (AREA)

Abstract

本发明提供了一种适用于概念漂移的医疗方案的推荐系统及方法,该系统包括:用户界面模块,与推荐模块相连,用于读取一开始的训练样本;推荐模块,与工作流系统相连,根据历史病例的数据,自动智能地为当前的病例计算出一个最合适的医疗方案;外部数据库,与推荐模块相连,用来读取一开始的训练样本;外部数据库用于存放每一个病例的详细信息,例如姓名身高体重等个人信息,以及具体的各项医疗指标信息。本发明能够考虑到了概念漂移现象对结果产生的影响,兼顾了样本的采集时间等因素,能检测到概念漂移的发生,并且修正了过时的样本以适应新的规律,使得此后的预测更加准确。

Figure 201710379952

The present invention provides a recommending system and method for a medical plan with concept drift. The system includes: a user interface module, which is connected with the recommending module and used to read the initial training samples; the recommending module, which is connected with the workflow system , according to the data of historical cases, automatically and intelligently calculate the most suitable medical plan for the current case; the external database, connected with the recommendation module, is used to read the initial training samples; the external database is used to store the information of each case. Detailed information, such as personal information such as name, height and weight, as well as specific medical indicator information. The present invention can take into account the influence of concept drift phenomenon on results, take into account factors such as sample collection time, can detect the occurrence of concept drift, and correct outdated samples to adapt to new rules, so that subsequent predictions are more accurate.

Figure 201710379952

Description

Recommendation method for concept-shifted medical solutions
Technical Field
The invention relates to the technical field of medical method recommendation, in particular to a recommendation method for a concept drifting medical scheme.
Background
In the medical decision making process, case-based recommendation algorithms such as kNN (k-nearest neighbor) can effectively utilize past similar cases to provide recommendations of medical solutions. However, in practice, the treatment regimen for patients with the same characteristics is constantly changing, and new medical research progresses, and new drugs are developed, which may cause the treatment regimen to change in an irregular manner. This phenomenon, in which the statistical properties of the target variable change irregularly over time, is commonly referred to as conceptual drift. The prior art, in order to eliminate the effect of outdated samples, simply discards outdated samples, loses the information implied in these samples, and also causes a decrease in the number of samples that can be utilized, thereby causing an increase in error.
Disclosure of Invention
Aiming at the defects in the prior art, the invention aims to provide a recommendation method of a medical scheme suitable for concept drift, which can take the influence of the concept drift phenomenon on the result into consideration, takes the factors such as the acquisition time of a sample into consideration, can detect the occurrence of the concept drift, and corrects the outdated sample to adapt to a new rule, so that the later prediction is more accurate.
The invention provides a recommendation method for a concept drifting medical scheme, which comprises the following steps:
step one, starting;
step two, inputting case information x of a scheme to be recommended;
reading historical case data Da from the historical case sample set;
step four, simultaneously calculating a detector sensitive to the concept;
step five, outputting a recommendation result Yac of the self-adaptive classifier, wherein the result of the self-adaptive classifier is output as a recommended medical scheme of the case;
step six, determining a scheme y adopted by time, performing discussion analysis by a doctor according to a recommendation result, and finally obtaining an actually adopted diagnosis and treatment scheme by taking individual characteristics of a patient as reference;
step seven, adding the case information x and the scheme y in the step six into a history and sample set, and adding the information of the patient into a history data sample set for future use;
step eight, judging whether the result of the self-adaptive classifier and the result of the CSD classifier are the same as the actually adopted scheme or not, if so, turning to the step nine, and otherwise, turning to the step ten;
step nine, updating the conflict list Ct of each historical sample Dt, wherein the self-adaptive classifier predicts wrongly, and the CSD classifier predicts correctly, the relevant data in the historical case sample set needs to be updated, so that the old sample adapts to a new rule, and the medical scheme under the current rule is predicted more accurately;
and step ten, finishing.
Preferably, in the second step, the case information, various clinical indexes and other attributes are used as characteristics to establish a model, the similarity between the new case and all existing case records is calculated item by using the model, and the recommendation scheme is comprehensively generated according to several results with the highest similarity according to the similarity ranking.
Preferably, the historical case data Da in the third step is equal to the training sample set, and each piece of historical case data Da is a training sample.
Compared with the prior art, the invention has the following beneficial effects: the method can consider the influence of the concept drift phenomenon on the result, considers the factors such as the acquisition time of the sample and the like, can detect the occurrence of the concept drift, and corrects the outdated sample to adapt to a new rule, so that the later prediction is more accurate.
Drawings
Other features, objects and advantages of the invention will become more apparent upon reading of the detailed description of non-limiting embodiments with reference to the following drawings:
fig. 1 is a block diagram of a recommendation system for a concept-shifted medical scenario in accordance with the present invention.
Detailed Description
The present invention will be described in detail with reference to specific examples. The following examples will assist those skilled in the art in further understanding the invention, but are not intended to limit the invention in any way. It should be noted that variations and modifications can be made by persons skilled in the art without departing from the spirit of the invention. All falling within the scope of the present invention.
As shown in fig. 1, the recommendation system for a concept-shifted medical scenario of the present invention includes:
the user interface module is connected with the recommending module and used for inputting case information of the scheme to be recommended;
and the recommendation module is connected with the workflow system and automatically and intelligently calculates a most appropriate medical scheme for the current case according to the data of the historical case.
The external database is connected with the recommendation module and used for storing the training samples; the external database is used for storing detailed information of each case, such as personal information including names, heights, weights and the like, and specific medical index information.
A workflow database connected to the workflow system for storing and reading the status of each case, such as what status a case is currently in (whether a medical solution has been determined, a solution is waiting for a system recommendation, specific information is insufficient, etc.)
A workflow system for controlling a basic flow of each case processed by the entire system.
The recommending module comprises a scheme recommending module and a scheme formulating module, wherein: the scheme recommending module is a process of recommending a scheme for the sample to be predicted by the recommending method; the solution formulation module is the process of determining the solution actually employed.
The recommendation method of the medical scheme suitable for concept drift comprises the following steps:
step one, starting;
step two, inputting case information x of a scheme to be recommended;
reading historical case data Da from the historical case sample set;
and step four, simultaneously calculating CSD (Concept-Sensitive Detector, which means a Detector Sensitive to concepts and essentially a classifier for detecting the occurrence of Concept drift).
A Classifier recommendation scheme Ycsd and an Adaptive Classifier (AC) recommendation scheme Yac respectively use a CSD Classifier and an AC Classifier to perform scheme recommendation, Yac represents the classification result of the Adaptive Classifier, and Ycsd represents the classification result of the Classifier CSD;
step five, outputting a recommendation result Yac of the self-adaptive classifier, wherein the result of the self-adaptive classifier is output as a recommended medical scheme of the case;
step six, determining a scheme y adopted by time, performing discussion analysis by a doctor according to a recommendation result, and finally obtaining an actually adopted diagnosis and treatment scheme by taking individual characteristics of a patient as reference;
step seven, adding < x, y > into the historical sample set, and adding the information of the patient into the historical data sample set for future use;
step eight, judging whether the result of the self-adaptive classifier and the result of the CSD classifier are the same as the actually adopted scheme or not, if so, turning to the step nine, and otherwise, turning to the step ten;
step nine, updating the conflict list Ct of each historical sample Dt, wherein the self-adaptive classifier predicts wrongly, and the CSD classifier predicts correctly, the relevant data in the historical case sample set needs to be updated, so that the old sample adapts to a new rule, and the medical scheme under the current rule is predicted more accurately;
and step ten, finishing.
And secondly, establishing a model by using the attributes of the case information, various clinical indexes and the like as characteristics, calculating the similarity of the new case and all existing case records one by using the model, sequencing according to the similarity, and comprehensively generating a recommendation scheme according to several results with the highest similarity, so that the use is convenient.
In the third step, the historical case data Da is equal to a training sample set, each piece of historical case data is a training sample, and the training sample set D is shown in the following formula (1):
Figure GDA0002706783520000041
wherein N represents the sample capacity of the training set, N represents the number of attributes of each sample, and in the training set, the samples are arranged according to the time sequence of acquisition, in other words, the smaller the serial number i, the earlier the acquisition time of the sample, and for the ith historical sample Di in the training set, as shown in the following formula (2):
Figure GDA0002706783520000042
wherein
Figure GDA0002706783520000043
Values, y, representing n different attributes of the ith sample, respectivelyiIndicating the category to which the ith sample belongs.
In the fourth step, the AC represents the self-adaptive classifier using all samples in the training set, and the classification result comprehensively considers all historical samples, so that the method is more stable; CSD represents that a classifier of recently acquired samples in a training set is used, and classification results of the classifier are more sensitive to new concepts and used for detecting concept drift; to detect the occurrence of concept drift, a set of kNN classifiers using different samples is required to classify the target samples.
Said seventh step, for a sample < x (now), y (now) >, to be predicted currently, if the result of the classifier AC is wrong and the result of the CSD is correct, it means that the historical data used in the process of generating this result by the AC and the current sample are inconsistent due to different rules of compliance.
Judging whether the Yac is consistent with the Ycsd or not, and if the Yac is consistent with the Ycsd, determining that no concept drift occurs; on the contrary, if the Yac is not equal to the Ycsd, the rule of obeying the recently collected sample is different from the historical sample, and concept drift may occur, and further detection is needed.
In the ninth step, Dt is represented by the above formula (2) and satisfies t > N, N is the sample capacity of the current training set, and other samples in the training set are used for predicting the category yt to which Dt belongs; ct is expressed by the following formula (3) for recording samples inconsistent with Dt, for each historical sample Dt used by the AC classifier, it needs to be marked that it conflicts with the currently predicted sample, namely, the sample is added into Ct in (now, y (now)), for any historical sample Dt, if the item in Ct reaches a certain threshold, the concept drift is considered to have great influence on the historical sample, yt in the historical sample is updated by the latest y value to adapt to the latest rule, and the next prediction and recommendation are more accurately served,
Ct={(i,y(i))|i>t∧x(t)∈AC.nearest(k,x(i))∧y(i)≠y(t)}......(3)
ct is a set used to record samples that do not match Dt. This formula is a mathematical collective expression meaning: each element in this set is of the form (i, y (i)) where i is the number of the inconsistent sample, and is required to be greater than t; x (i) is the characteristic attribute of this sample, which is required to satisfy that it belongs to the "k samples nearest to Dt".
Wherein, the nearest represents the set of k samples with the highest similarity to the current sample Xi to be predicted, as shown in the following formula (4).
Figure GDA0002706783520000051
a and b are the numbers of the two samples Da and Db, respectively, and d represents the total number of samples in the training sample.
For the discrete attributes of the class type, there is no specific magnitude relationship between the values, so the similarity between the samples cannot be directly measured by using the mahalanobis distance as simple as the numerical attribute, and generally, in order to calculate the sample DaAnd DbThe similarity between the two samples needs to define the distance on each attribute respectively, and finally the final result is obtained by weighted comprehensive calculation of each attributeaAnd DbThe value on the k-th attribute is the same, sample DaAnd DbThe similarity on this attribute is 1; if two samples DaAnd DbAny value on the kth attribute is unknown or missing, and the similarity is 0.5; otherwise, the similarity is 0, as shown in the following formulas (5) and (6):
Figure GDA0002706783520000061
Figure GDA0002706783520000062
the foregoing description of specific embodiments of the present invention has been presented. It is to be understood that the present invention is not limited to the specific embodiments described above, and that various changes and modifications may be made by one skilled in the art within the scope of the appended claims without departing from the spirit of the invention.

Claims (3)

1.一种用于概念漂移的医疗方案的推荐方法,其特征在于,包括:1. A method for recommending a medical solution for concept drift, comprising: 步骤一,开始;step one, start; 步骤二,输入待推荐方案的病例信息x;Step 2, input the case information x of the plan to be recommended; 步骤三,读取历史病例数据Da,从历史病历样本集合中读取历史病历数据Da;Step 3, read the historical case data Da, and read the historical medical record data Da from the historical medical record sample collection; 步骤四,同时计算CSD分类器和AC分类器;Step 4: Calculate the CSD classifier and the AC classifier at the same time; 步骤五,输出自适应分类器的推荐结果Yac,自适应分类器的结果将作为此病例的推荐医疗方案输出;Step 5, output the recommended result Yac of the adaptive classifier, and the result of the adaptive classifier will be output as the recommended medical plan for this case; 步骤六,确定实际采用的方案y,待医生根据推荐结果进行讨论分析,并将病人个体的特征作为参考,最终得出实际采用的诊疗方案之后;Step 6: Determine the actually adopted scheme y, after the doctor discusses and analyzes according to the recommended results, and takes the characteristics of the individual patient as a reference, and finally obtains the actually adopted diagnosis and treatment scheme; 步骤七,将病例信息x和步骤六的方案y加入历史并样本集合,将此例病人的信息加入历史数据样本集合中,以供将来使用;Step 7, add the case information x and the plan y of step 6 to the historical and sample collection, and add the patient's information to the historical data sample collection for future use; 步骤八,判断对比自适应分类器的结果、CSD分类器的结果和实际采用的方案是否相同,是则转步骤十,否则转步骤九;Step 8, judge whether the result of comparing the adaptive classifier, the result of the CSD classifier and the actually adopted solution are the same, if yes, go to step ten, otherwise go to step nine; 步骤九,更新每个历史样本Dt的冲突列表Ct,自适应分类器预测错误,而CSD分类器预测正确,则需要对历史病例样本集合中的相关数据进行更新,使旧的样本适应新的规律,从而更准确地预测当前规律下的医疗方案;Step 9: Update the conflict list Ct of each historical sample Dt. The adaptive classifier predicts incorrectly, but the CSD classifier predicts correctly. It is necessary to update the relevant data in the historical case sample set to adapt the old samples to the new rules. , so as to more accurately predict the medical plan under the current law; 步骤十,结束。Step ten, end. 2.根据权利要求1所述的用于概念漂移的医疗方案的推荐方法,其特征在于,所述步骤二使用病例信息以及各项临床指标作为特征建立模型,使用模型逐条计算新病例与现有的所有病例记录的相似度,并根据相似度排序,根据相似度最高的几条结果,综合产生推荐方案。2. The method for recommending a medical plan for concept drift according to claim 1, wherein the step 2 uses case information and various clinical indicators as a feature to build a model, and uses the model to calculate the new cases and existing ones one by one. The similarity of all case records is sorted according to the similarity, and the recommended scheme is comprehensively generated according to the results with the highest similarity. 3.根据权利要求1所述的用于概念漂移的医疗方案的推荐方法,其特征在于,所述步骤三中历史病例数据Da等于训练样本集合,每一条历史病例数据,就是一个训练样本。3 . The method for recommending a medical plan for concept drift according to claim 1 , wherein in the step 3, the historical case data Da is equal to the training sample set, and each piece of historical case data is a training sample. 4 .
CN201710379952.2A 2017-05-25 2017-05-25 Recommendation method for concept-shifted medical solutions Active CN107358019B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710379952.2A CN107358019B (en) 2017-05-25 2017-05-25 Recommendation method for concept-shifted medical solutions

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710379952.2A CN107358019B (en) 2017-05-25 2017-05-25 Recommendation method for concept-shifted medical solutions

Publications (2)

Publication Number Publication Date
CN107358019A CN107358019A (en) 2017-11-17
CN107358019B true CN107358019B (en) 2021-01-29

Family

ID=60271653

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710379952.2A Active CN107358019B (en) 2017-05-25 2017-05-25 Recommendation method for concept-shifted medical solutions

Country Status (1)

Country Link
CN (1) CN107358019B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109215795B (en) * 2018-08-10 2020-11-06 上海交通大学 Case complexity prediction method and system
CN112397161A (en) * 2019-08-16 2021-02-23 北京大数医达科技有限公司 Method, apparatus, device and medium for generating an assay model
CN111477321B (en) * 2020-03-11 2023-06-09 北京大学第三医院(北京大学第三临床医学院) Treatment effect prediction system with self-learning capability and treatment effect prediction terminal
CN112765324B (en) * 2021-01-25 2022-12-23 四川虹微技术有限公司 Concept drift detection method and device
CN113851208B (en) * 2021-09-26 2024-09-24 上海大学 Medical examination recommendation system and method based on explicit topic allocation technology
CN119400343A (en) * 2025-01-02 2025-02-07 浙江爱达科技有限公司 A wearable health solution recommendation system based on IoT gateway

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070260568A1 (en) * 2006-04-21 2007-11-08 International Business Machines Corporation System and method of mining time-changing data streams using a dynamic rule classifier having low granularity
CN101827002B (en) * 2010-05-27 2012-05-09 桂林电子科技大学 Concept drift detection method of data flow classification
WO2014075108A2 (en) * 2012-11-09 2014-05-15 The Trustees Of Columbia University In The City Of New York Forecasting system using machine learning and ensemble methods
CN103150470B (en) * 2013-02-18 2015-12-23 大连理工大学 A Visualization Method of Data Flow Concept Drift in Dynamic Data Environment
CN103345575B (en) * 2013-06-19 2016-07-13 华南师范大学 Method and system for detecting data flow concept drift
CN104766259A (en) * 2015-03-31 2015-07-08 华据医疗评估信息技术(北京)有限公司 Medical clinical quality monitoring and evaluation system based on single-disease model
CN105975794A (en) * 2016-05-23 2016-09-28 上海交通大学 Weighted KNN-based breast cancer chemotherapy scheme recommendation method

Also Published As

Publication number Publication date
CN107358019A (en) 2017-11-17

Similar Documents

Publication Publication Date Title
CN107358019B (en) Recommendation method for concept-shifted medical solutions
CN108648827B (en) Cardiovascular and cerebrovascular disease risk prediction method and device
US9861308B2 (en) Method and system for monitoring stress conditions
WO2020181805A1 (en) Diabetes prediction method and apparatus, storage medium, and computer device
CN110503155B (en) A method for information classification and related device and server
CN118711822A (en) Cardiovascular and cerebrovascular disease risk prediction method and system
CN113257421B (en) Construction method and system of hypertension prediction model
US11923075B2 (en) System and method associated with determining physician attribution related to in-patient care using prediction-based analysis
CN113361653A (en) Deep learning model depolarization method and device based on data sample enhancement
CN120299733B (en) A multi-dimensional medical data analysis and decision-making support system and method
Maram et al. A framework for performance analysis on machine learning algorithms using covid-19 dataset
CN114706985B (en) Text classification method, device, electronic device and storage medium
CN118312816B (en) Cluster weighted clustering integrated medical text processing method and system based on member selection
CN114373546A (en) Disease risk assessment method, device and storage medium
Siddiqa et al. Robust Length of Stay Prediction Model for Indoor Patients.
CN118522469A (en) Big data analysis medical decision method and system
CN113536069A (en) Mis-labeling data identification method and device, computer equipment and storage medium
CN110957046A (en) Medical health case knowledge matching method and system
Rajmohan et al. G-Sep: A deep learning algorithm for detection of long-term sepsis using bidirectional gated recurrent unit
Sreenivasan et al. PCPS: Personalized care through patient similarity
US20240232230A9 (en) Classification system
CN116842330A (en) Health care information processing method and device capable of comparing histories
Sharma et al. A multi-level decision-making framework for heart-related disease prediction and recommendation
Zhu et al. Design and development of a readmission risk assessment system for patients with cardiovascular disease
CN118366646B (en) Large model medical decision support method based on knowledge graph

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant