CN113723950B - Fraudulent transaction identification method, system and device based on dynamic weighted information entropy - Google Patents
Fraudulent transaction identification method, system and device based on dynamic weighted information entropy Download PDFInfo
- Publication number
- CN113723950B CN113723950B CN202010436168.2A CN202010436168A CN113723950B CN 113723950 B CN113723950 B CN 113723950B CN 202010436168 A CN202010436168 A CN 202010436168A CN 113723950 B CN113723950 B CN 113723950B
- Authority
- CN
- China
- Prior art keywords
- model
- class
- overlapping data
- information entropy
- weighted information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q20/00—Payment architectures, schemes or protocols
- G06Q20/38—Payment protocols; Details thereof
- G06Q20/382—Payment protocols; Details thereof insuring higher security of transaction
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Business, Economics & Management (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Accounting & Taxation (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Computer Security & Cryptography (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Finance (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Complex Calculations (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
本发明提供一种基于动态加权信息熵的欺诈交易识别方法、系统及装置,包括以下步骤:使用动态加权信息熵筛选one‑class‑SVM模型,选出重叠数据子集动态加权信息熵最大的one‑class‑SVM模型Mocsvm;使用one‑class‑SVM模型Mocsvm将原始数据划分为重叠数据子集和非重叠数据子集;使用one‑class‑SVM模型Mocsvm划分得到的重叠数据子集训练非线性分类器模型Mclf,使用非线性分类器模型Mclf区分重叠数据子集中的欺诈交易和正常交易;生成由one‑class‑SVM模型Mocsvm和非线性分类器模型Mclf构成的欺诈交易识别模型。本发明,采用分而治之的策略,为非线性机器学习模型排出大量易识别的正常交易数据,使得模型能仅关注于难以划分数据的学习,充分发挥了非线性模型的能力,提升欺诈交易识别模型的性能。
The invention provides a method, system and device for identifying fraudulent transactions based on dynamic weighted information entropy, which includes the following steps: using dynamic weighted information entropy to screen one-class-SVM models, and selecting the one with the largest dynamic weighted information entropy of overlapping data subsets ‑class‑SVM model M ocsvm ; use one‑class‑SVM model M ocsvm to divide the original data into overlapping data subsets and non-overlapping data subsets; use the one‑class‑SVM model M ocsvm to divide the overlapping data subsets for training Nonlinear classifier model M clf , uses the nonlinear classifier model M clf to distinguish fraudulent transactions and normal transactions in overlapping data subsets; generates fraudulent transactions composed of one‑class‑SVM model M ocsvm and nonlinear classifier model M clf Identify the model. The present invention adopts a divide-and-conquer strategy to discharge a large amount of easily identifiable normal transaction data for the nonlinear machine learning model, so that the model can only focus on the learning of data that is difficult to divide, giving full play to the ability of the nonlinear model and improving the performance of the fraudulent transaction identification model. performance.
Description
技术领域Technical Field
本发明涉及电子欺诈交易识别技术领域,特别是涉及一种基于动态加权信息熵的欺诈交易识别方法、系统及装置。The present invention relates to the technical field of electronic fraud transaction identification, and in particular to a fraud transaction identification method, system and device based on dynamic weighted information entropy.
背景技术Background Art
近年来,将金融和科技融合的金融科技成为热点研究领域之一。人工智能推动力了金融科技以提供更高质量的服务,与此同时,金融科技为人工智能研究和创新提供了广泛的平台和应用场景。电子交易欺诈检测是金融科技最重要的研究之一,已引起广泛关注。识别欺诈交易具有很大的挑战性,其中最重要的原因之一是数据不均衡问题,尤其是带有数据重叠(overlapping)的数据不均衡问题。当不同类别的数据之间的样本量差异很大时,就会发生类别不平衡,这在电子交易记录中很明显,因为欺诈交易的数量远远少于正常交易的数量。Overlapping是指不同类别的样本包含在同一数据空间区域中,这会增加分类器区分重叠区域中不同类别的样本的学习难度。由于欺诈者将不遗余力地模仿真实持卡人的交易行为以使欺诈检测系统失效,欺诈交易和合法交易将在某些数据空间区域中交织在一起,并导致overlapping问题。如果只是数据不均衡(non-overlapping非重叠)问题,即使数据不均衡比很高,也可能不会对分类器的性能带来很大的影响,因为有些分类模型的性能于不同类别样本的数量无关,例如基于最大间隔的分类模型等。但是,如果数据不均衡和overlapping问题同时出现,那么即使基于最大间隔的分类器也无法在正确地区分不同类别的样本方面取得良好的性能。In recent years, FinTech, which integrates finance and technology, has become one of the hot research areas. Artificial intelligence has driven FinTech to provide higher quality services. At the same time, FinTech has provided a wide range of platforms and application scenarios for artificial intelligence research and innovation. Electronic transaction fraud detection is one of the most important research areas in FinTech and has attracted widespread attention. Identifying fraudulent transactions is very challenging, and one of the most important reasons is the data imbalance problem, especially the data imbalance problem with data overlap. Class imbalance occurs when the sample size difference between different categories of data is large, which is obvious in electronic transaction records because the number of fraudulent transactions is much less than the number of normal transactions. Overlapping refers to the inclusion of samples of different categories in the same data space area, which increases the learning difficulty of the classifier to distinguish samples of different categories in the overlapping area. Since fraudsters will spare no effort to imitate the transaction behavior of real cardholders to make the fraud detection system ineffective, fraudulent transactions and legitimate transactions will be intertwined in certain data space areas and cause overlapping problems. If it is just a data imbalance (non-overlapping) problem, even if the data imbalance ratio is very high, it may not have a big impact on the performance of the classifier, because the performance of some classification models is independent of the number of samples of different categories, such as the classification model based on the maximum margin. However, if data imbalance and overlapping problems occur at the same time, even a classifier based on the maximum margin cannot achieve good performance in correctly distinguishing samples of different categories.
在在电子欺诈交易识别中,数据不均衡是影响欺诈交易识别模型性能的关键因素之一。而由于欺诈分子挖空心思来模仿持卡人的真实交易行为来避免被识别,从而使得真实交易与欺诈交易数据交叉,带来overlapping问题。带有overlapping的数据不均衡问题使得欺诈交易识别更加困难。In electronic fraud transaction identification, data imbalance is one of the key factors affecting the performance of fraud transaction identification models. Since fraudsters try their best to imitate the real transaction behavior of cardholders to avoid being identified, the real transaction data and fraud transaction data overlap, resulting in overlapping problems. The data imbalance problem with overlapping makes fraud transaction identification more difficult.
现有的研究主要是采用最近邻方法k-NN模型对原始数据集进行划分得到overlapping数据子集和non-overlapping数据子集。针对overlapping数据子集,通常采用采样方法去除其中的多数类样本,使得不同类别样本之间的边界更为清晰,对准确识别少数类样本更加有利。最后再将处理后的overlapping数据子集与non-overlapping数据子集合并为新的数据集,然后使用此新的数据集训练机器学习模型来区分不同类别的样本。这类方法存在一些明显的缺陷,首先overlapping数据子集中的部分样本被删除,虽然得到了更清晰的决策边界,但是可能会引起重要的样本信息丢失,使得决策边界出现错误。此外,overlapping数据子集由k-NN模型从原始数据中划分得到,但是k-NN模型参数的选择缺乏指导,通常是经过多次试验才能确定,需要消耗很多时间和计算资源,尤其是在有海量交易数据的欺诈交易识别场景中,k-NN模型难以适用。Existing research mainly uses the nearest neighbor method k-NN model to divide the original data set into overlapping data subsets and non-overlapping data subsets. For the overlapping data subset, sampling methods are usually used to remove the majority class samples, so that the boundaries between samples of different categories are clearer, which is more conducive to accurately identifying minority class samples. Finally, the processed overlapping data subset and non-overlapping data subset are merged into a new data set, and then the machine learning model is trained with this new data set to distinguish samples of different categories. This type of method has some obvious defects. First, some samples in the overlapping data subset are deleted. Although a clearer decision boundary is obtained, it may cause important sample information to be lost, resulting in errors in the decision boundary. In addition, the overlapping data subset is obtained by dividing the original data by the k-NN model, but the selection of k-NN model parameters lacks guidance and is usually determined after multiple experiments, which requires a lot of time and computing resources. Especially in the fraud transaction identification scenario with massive transaction data, the k-NN model is difficult to apply.
因此,希望能够解决如何有效、快速识别overlapping和non-overlapping数据子集,如何加速后续非线性机器学习模型的训练过程,减少模型训练的资源消耗,如何更好的进行电子欺诈交易识别的问题。Therefore, we hope to solve the problems of how to effectively and quickly identify overlapping and non-overlapping data subsets, how to accelerate the training process of subsequent nonlinear machine learning models, reduce the resource consumption of model training, and how to better identify electronic fraud transactions.
发明内容Summary of the invention
鉴于以上所述现有技术的缺点,本发明的目的在于提供一种基于动态加权信息熵的欺诈交易识别方法、系统及装置,用于解决现有技术中如何有效、快速识别overlapping和non-overlapping数据子集,如何加速后续非线性机器学习模型的训练过程,减少模型训练的资源消耗,如何更好的进行电子欺诈交易识别的问题。In view of the shortcomings of the prior art mentioned above, the purpose of the present invention is to provide a fraudulent transaction identification method, system and device based on dynamic weighted information entropy, which is used to solve the problems in the prior art of how to effectively and quickly identify overlapping and non-overlapping data subsets, how to accelerate the training process of subsequent nonlinear machine learning models, reduce resource consumption of model training, and how to better identify electronic fraud transactions.
为实现上述目的及其他相关目的,本发明提供一种基于动态加权信息熵的欺诈交易识别方法,包括以下步骤:使用欺诈交易样本训练含有超参数的one-class-SVM(单分类模型)模型;其中属于使用训练后的one-class-SVM模型将原始数据划分为重叠数据子集和非重叠数据子集;计算不同超参数 对应的one-class-SVM模型划分得到的重叠数据子集的动态加权信息熵,选择动态加权信息熵最大的重叠数据子集对应超参数对应的one-class-SVM模型,作为重叠数据子集的动态加权信息熵最大的one-class-SVM模型Mocsvm;使用one-class-SVM模型Mocsvm将原始数据划分为重叠数据子集和非重叠数据子集;使用one-class-SVM模型Mocsvm划分得到的重叠数据子集训练非线性分类器模型Mclf,使用非线性分类器模型Mclf区分重叠数据子集中的欺诈交易和正常交易;生成由one-class-SVM模型Mocsvm和非线性分类器模型Mclf构成的欺诈交易识别模型。To achieve the above-mentioned and other related purposes, the present invention provides a fraudulent transaction identification method based on dynamic weighted information entropy, comprising the following steps: using fraudulent transaction samples to train a hyperparameter One-class-SVM (single classification model) model; belong Use the trained one-class-SVM model to divide the original data into overlapping data subsets and non-overlapping data subsets; calculate different hyperparameters The dynamic weighted information entropy of the overlapping data subsets obtained by the corresponding one-class-SVM model division, and the corresponding hyperparameter of the overlapping data subset with the largest dynamic weighted information entropy is selected The corresponding one-class-SVM model is the one-class-SVM model Mocsvm with the largest dynamic weighted information entropy of the overlapping data subset; the original data is divided into overlapping data subsets and non-overlapping data subsets using the one-class-SVM model Mocsvm ; the nonlinear classifier model Mclf is trained on the overlapping data subsets obtained by the one-class-SVM model Mocsvm , and the nonlinear classifier model Mclf is used to distinguish fraudulent transactions from normal transactions in the overlapping data subsets; a fraudulent transaction recognition model composed of the one-class-SVM model Mocsvm and the nonlinear classifier model Mclf is generated.
于本发明的一实施例中,所述重叠数据子集的动态加权信息熵定义为:In one embodiment of the present invention, the dynamic weighted information entropy of the overlapping data subset is defined as:
GDWE(θ)=Wsnr*HG DWE (θ)=W snr *H
其中,θ表示one-class-SVM模型的一组超参数;H表示重叠数据子集的信息熵;Wsnr表示H的动态权重,由少数类样本的信噪比决定,H和Wsnr分别定义为:Among them, θ represents a set of hyperparameters of the one-class-SVM model; H represents the information entropy of the overlapping data subsets; W snr represents the dynamic weight of H, which is determined by the signal-to-noise ratio of the minority class samples. H and W snr are defined as:
其中i∈{0,1,...,k},k表示重叠数据子集中的类别(对于欺诈交易识别来说k=1),pi表示重叠数据子集中的某个样本属于类别i的概率,nall表示原始数据集中少数类样本的数量,noutliers表示原始数据集中的被判定为噪声数据的少数类样本数量。Where i∈{0, 1, ..., k}, k represents the category in the overlapping data subset (k=1 for fraudulent transaction identification), pi represents the probability that a sample in the overlapping data subset belongs to category i, n all represents the number of minority class samples in the original data set, and n outliers represents the number of minority class samples in the original data set that are judged to be noise data.
于本发明的一实施例中,所述非线性分类器模型为深度学习模型。In one embodiment of the present invention, the nonlinear classifier model is a deep learning model.
于本发明的一实施例中,所述使用one-class-SVM模型Mocsvm划分得到的重叠数据子集训练非线性分类器模型Mclf,包括:使用重叠数据子集的第一部分样本训练非线性分类器模型Mclf;使用重叠数据子集的第二部分样本验证非线性分类器模型Mclf;重复训练和验证步骤直到所述非线性分类器模型Mclf的准确性符合预设要求。In one embodiment of the present invention, the overlapping data subsets obtained by dividing the one-class-SVM model M ocsvm are used to train the nonlinear classifier model Mclf , including: using a first part of samples of the overlapping data subsets to train the nonlinear classifier model Mclf ; using a second part of samples of the overlapping data subsets to verify the nonlinear classifier model Mclf ; repeating the training and verification steps until the accuracy of the nonlinear classifier model Mclf meets the preset requirements.
为实现上述目的,本发明还提供一种基于动态加权信息熵的欺诈交易识别系统,包括:训练模块、划分模块、区分模块和模型生成模块;所述训练模块用于使用欺诈交易样本训练含有超参数的one-class-SVM模型;其中属于使用训练后的one-class-SVM模型将原始数据划分为重叠数据子集和非重叠数据子集;计算不同超参数对应的one-class-SVM模型划分得到的重叠数据子集的动态加权信息熵,选择动态加权信息熵最大的重叠数据子集对应超参数对应的one-class-SVM模型,作为重叠数据子集的动态加权信息熵最大的one-class-SVM模型Mocsvm;所述划分模块用于使用one-class-SVM模型Mocsvm将原始数据划分为重叠数据子集和非重叠数据子集;所述区分模块用于使用one-class-SVM模型Mocsvm划分得到的重叠数据子集训练非线性分类器模型Mclf,使用非线性分类器模型Mclf区分重叠数据子集中的欺诈交易和正常交易;所述模型生成模块用于生成由one-class-SVM模型Mocsvm和非线性分类器模型Mclf构成的欺诈交易识别模型。To achieve the above-mentioned purpose, the present invention also provides a fraudulent transaction identification system based on dynamic weighted information entropy, including: a training module, a division module, a distinction module and a model generation module; the training module is used to train a fraudulent transaction sample containing hyperparameters One-class-SVM model; belong Use the trained one-class-SVM model to divide the original data into overlapping data subsets and non-overlapping data subsets; calculate different hyperparameters The dynamic weighted information entropy of the overlapping data subsets obtained by the corresponding one-class-SVM model division, and the corresponding hyperparameter of the overlapping data subset with the largest dynamic weighted information entropy is selected The corresponding one-class-SVM model is the one-class-SVM model Mocsvm with the largest dynamic weighted information entropy of the overlapping data subset; the partitioning module is used to use the one-class-SVM model Mocsvm to divide the original data into overlapping data subsets and non-overlapping data subsets; the distinguishing module is used to train the nonlinear classifier model Mclf using the overlapping data subsets obtained by the division of the one-class-SVM model Mocsvm , and use the nonlinear classifier model Mclf to distinguish fraudulent transactions and normal transactions in the overlapping data subsets; the model generation module is used to generate a fraudulent transaction recognition model composed of the one-class-SVM model Mocsvm and the nonlinear classifier model Mclf .
于本发明的一实施例中,所述重叠数据子集的动态加权信息熵定义为:In one embodiment of the present invention, the dynamic weighted information entropy of the overlapping data subset is defined as:
GDWE(θ)=Wsnr*HG DWE (θ)=W snr *H
其中,θ表示one-class-SVM模型的一组超参数;H表示重叠数据子集的信息熵;Wsnr表示H的动态权重,由少数类样本的信噪比决定,H和Wsnr分别定义为:Among them, θ represents a set of hyperparameters of the one-class-SVM model; H represents the information entropy of the overlapping data subsets; W snr represents the dynamic weight of H, which is determined by the signal-to-noise ratio of the minority class samples. H and W snr are defined as:
其中i∈{0,1,...,k},k表示重叠数据子集中的类别(对于欺诈交易识别来说k=1),pi表示重叠数据子集中的某个样本属于类别i的概率,nall表示原始数据集中少数类样本的数量,noutliers表示原始数据集中的被判定为噪声数据的少数类样本数量。Where i∈{0, 1, ..., k}, k represents the category in the overlapping data subset (k=1 for fraudulent transaction identification), pi represents the probability that a sample in the overlapping data subset belongs to category i, n all represents the number of minority class samples in the original data set, and n outliers represents the number of minority class samples in the original data set that are judged to be noise data.
于本发明的一实施例中,所述非线性分类器模型为深度学习模型。In one embodiment of the present invention, the nonlinear classifier model is a deep learning model.
于本发明的一实施例中,所述使用one-class-SVM模型Mocsvm划分得到的重叠数据子集训练非线性分类器模型Mclf,包括:使用重叠数据子集的第一部分样本训练非线性分类器模型Mclf;使用重叠数据子集的第二部分样本验证非线性分类器模型Mclf;重复训练和验证步骤直到所述非线性分类器模型Mclf的准确性符合预设要求。In one embodiment of the present invention, the overlapping data subsets obtained by dividing the one-class-SVM model M ocsvm are used to train the nonlinear classifier model Mclf , including: using a first part of samples of the overlapping data subsets to train the nonlinear classifier model Mclf ; using a second part of samples of the overlapping data subsets to verify the nonlinear classifier model Mclf ; and repeating the training and verification steps until the accuracy of the nonlinear classifier model Mclf meets preset requirements.
为实现上述目的,本发明还提供一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现任一上述基于动态加权信息熵的欺诈交易识别方法。To achieve the above objectives, the present invention also provides a computer-readable storage medium having a computer program stored thereon, and when the computer program is executed by a processor, it implements any of the above-mentioned fraudulent transaction identification methods based on dynamic weighted information entropy.
为实现上述目的,本发明还提供一种基于动态加权信息熵的欺诈交易识别装置,包括:处理器和存储器;所述存储器用于存储计算机程序;所述处理器与所述存储器相连,用于执行所述存储器存储的计算机程序,以使所述基于动态加权信息熵的欺诈交易识别装置执行任一上述的基于动态加权信息熵的欺诈交易识别方法。To achieve the above-mentioned purpose, the present invention also provides a fraud transaction identification device based on dynamic weighted information entropy, comprising: a processor and a memory; the memory is used to store a computer program; the processor is connected to the memory, and is used to execute the computer program stored in the memory, so that the fraud transaction identification device based on dynamic weighted information entropy can execute any of the above-mentioned fraud transaction identification methods based on dynamic weighted information entropy.
如上所述,本发明的一种基于动态加权信息熵的欺诈交易识别方法、系统及装置,具有以下有益效果:提出基于分而治之的欺诈交易识别模型,先使用欺诈交易数据训练one-class-SVM模型,将易识别的部分正常交易识别出来归类到非重叠数据子集,而正常交易与欺诈交易难以区分的部分归类为重叠数据子集;然后采用非线性机器学习模型区分重叠数据子集中的欺诈和正常交易数据。为非线性机器学习模型排出大量易识别的正常交易数据,使得模型能仅关注于难以划分数据的学习,充分发挥了非线性模型的能力,提升欺诈交易识别模型的性能。为了保证one-class-SVM模型划分的重叠数据子集中尽可能使得正常交易数据量减少,同时尽可能不丢失欺诈交易数据,本发明提出动态加权信息熵来指导one-class-SVM模型的超参数选择,动态加权信息熵是对重叠数据子集中的信息熵和欺诈交易数据的信息损失的权衡。在原始数据划分阶段,采用动态加权信息熵指导one-class-SVM模型的参数选择,无需后续非线性机器学习模型测试结果的反馈,能够大大降低参数选择的计算复杂度,提升模型的效率。As described above, a fraudulent transaction identification method, system and device based on dynamic weighted information entropy of the present invention has the following beneficial effects: a fraudulent transaction identification model based on divide and conquer is proposed, firstly, the one-class-SVM model is trained with fraudulent transaction data, and the part of normal transactions that are easy to identify are identified and classified into non-overlapping data subsets, and the part of normal transactions that are difficult to distinguish from fraudulent transactions are classified into overlapping data subsets; then a nonlinear machine learning model is used to distinguish fraudulent and normal transaction data in the overlapping data subsets. A large amount of normal transaction data that is easy to identify is excluded for the nonlinear machine learning model, so that the model can only focus on the learning of data that is difficult to divide, give full play to the ability of the nonlinear model, and improve the performance of the fraudulent transaction identification model. In order to ensure that the amount of normal transaction data is reduced as much as possible in the overlapping data subsets divided by the one-class-SVM model, and at the same time, the fraudulent transaction data is not lost as much as possible, the present invention proposes dynamic weighted information entropy to guide the hyperparameter selection of the one-class-SVM model, and the dynamic weighted information entropy is a trade-off between the information entropy in the overlapping data subset and the information loss of the fraudulent transaction data. In the original data division stage, dynamic weighted information entropy is used to guide the parameter selection of the one-class-SVM model. There is no need for feedback on the subsequent nonlinear machine learning model test results, which can greatly reduce the computational complexity of parameter selection and improve the efficiency of the model.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1a显示为本发明的基于动态加权信息熵的欺诈交易识别方法于一实施例中的流程图;FIG. 1a is a flow chart showing a fraudulent transaction identification method based on dynamic weighted information entropy in one embodiment of the present invention;
图1b显示为本发明的基于动态加权信息熵的欺诈交易识别方法于一实施例中的One-class-SVM模型划分原始数据集示意图;FIG1b is a schematic diagram showing the division of the original data set by the One-class-SVM model in an embodiment of the fraudulent transaction identification method based on dynamic weighted information entropy of the present invention;
图1c显示为本发明的基于动态加权信息熵的欺诈交易识别方法于又一实施例中的流程图;FIG. 1c is a flow chart showing a fraudulent transaction identification method based on dynamic weighted information entropy in another embodiment of the present invention;
图1d显示为本发明的基于动态加权信息熵的欺诈交易识别方法于一实施例中的F1值随DWE的变化示意图;FIG. 1d is a schematic diagram showing the variation of the F1 value with DWE in a fraudulent transaction identification method based on dynamic weighted information entropy in one embodiment of the present invention;
图1e显示为本发明的基于动态加权信息熵的欺诈交易识别方法于一实施例中的one-class-SVM模型和非线性机器学习模型时间消耗示意图;FIG. 1e is a schematic diagram showing the time consumption of a one-class-SVM model and a nonlinear machine learning model in an embodiment of a fraudulent transaction identification method based on dynamic weighted information entropy according to the present invention;
图1f显示为本发明的基于动态加权信息熵的欺诈交易识别方法于再一实施例中的流程图;FIG. 1f is a flow chart showing a fraudulent transaction identification method based on dynamic weighted information entropy in yet another embodiment of the present invention;
图2显示为本发明的基于动态加权信息熵的欺诈交易识别系统于一实施例中的结构示意图;FIG2 is a schematic diagram showing the structure of a fraudulent transaction identification system based on dynamic weighted information entropy in one embodiment of the present invention;
图3显示为本发明的基于动态加权信息熵的欺诈交易识别装置于一实施例中的结构示意图。FIG. 3 is a schematic diagram showing the structure of a fraudulent transaction identification device based on dynamic weighted information entropy in one embodiment of the present invention.
元件标号说明Component number description
21 训练模块21 Training Modules
22 划分模块22 Divide Modules
23 区分模块23 Differentiate modules
24 模型生成模块24 Model Generation Module
31 处理器31 Processor
32 存储器32 Memory
具体实施方式DETAILED DESCRIPTION
以下通过特定的具体实例说明本发明的实施方式,本领域技术人员可由本说明书所揭露的内容轻易地了解本发明的其他优点与功效。本发明还可以通过另外不同的具体实施方式加以实施或应用,本说明书中的各项细节也可以基于不同观点与应用,在没有背离本发明的精神下进行各种修饰或改变。需说明的是,在不冲突的情况下,以下实施例及实施例中的特征可以相互组合。The following describes the embodiments of the present invention by specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the contents disclosed in this specification. The present invention can also be implemented or applied through other different specific embodiments, and the details in this specification can also be modified or changed in various ways based on different viewpoints and applications without departing from the spirit of the present invention. It should be noted that the following embodiments and features in the embodiments can be combined with each other without conflict.
需要说明的是,以下实施例中所提供的图示仅以示意方式说明本发明的基本构想,故图式中仅显示与本发明中有关的组件而非按照实际实施时的组件数目、形状及尺寸绘制,其实际实施时各组件的型态、数量及比例可为一种随意的改变,且其组件布局型态也可能更为复杂。It should be noted that the illustrations provided in the following embodiments are only used to illustrate the basic concept of the present invention in a schematic manner. Therefore, the drawings only show components related to the present invention rather than being drawn according to the number, shape and size of components in actual implementation. In actual implementation, the type, quantity and proportion of each component may be changed arbitrarily, and the component layout may also be more complicated.
本发明的基于动态加权信息熵的欺诈交易识别方法、系统及装置,有效、快速识别overlapping和non-overlapping数据子集,加速后续模型的训练过程,减少模型训练的资源消耗,更好的进行电子欺诈交易识别。The fraudulent transaction identification method, system and device based on dynamic weighted information entropy of the present invention can effectively and quickly identify overlapping and non-overlapping data subsets, accelerate the training process of subsequent models, reduce resource consumption of model training, and better identify electronic fraud transactions.
如图1a所示,于一实施例中,本发明的基于动态加权信息熵的欺诈交易识别方法,包括以下步骤:As shown in FIG. 1a , in one embodiment, the fraudulent transaction identification method based on dynamic weighted information entropy of the present invention comprises the following steps:
步骤S11、使用欺诈交易样本训练含有超参数的one-class-SVM模型;其中属于使用训练后的one-class-SVM模型将原始数据划分为overlapping数据子集和non-overlapping数据子集;所述划分模块用于计算不同超参数 对应的one-class-SVM模型划分得到的overlapping数据子集的动态加权信息熵,选择动态加权信息熵最大的overlapping数据子集对应超参数对应的one-class-SVM模型,作为overlapping数据子集的动态加权信息熵最大的one-class-SVM模型Mocsvm。Step S11: Use fraudulent transaction samples to train the hyperparameters One-class-SVM model; belong The trained one-class-SVM model is used to divide the original data into overlapping data subsets and non-overlapping data subsets; the division module is used to calculate different hyperparameters The dynamic weighted information entropy of the overlapping data subsets obtained by the corresponding one-class-SVM model division, and the corresponding hyperparameter of the overlapping data subset with the largest dynamic weighted information entropy is selected The corresponding one-class-SVM model is the one-class-SVM model Mocsvm with the largest dynamic weighted information entropy of the overlapping data subset.
原始数据集中包含大量正常交易数据和少量欺诈交易数据,其中的欺诈交易数据用来训练one-class-SVM模型。理想情况下,训练好的one-class-SVM模型可以将原始数据集划分为仅含正常交易数据的non-overlapping数据子集和同时含有正常和欺诈交易数据的overlapping数据子集,如图1b所示的overlapping数据子集-1。但是,欺诈交易数据中难免存在一些由于错误标记等产生的噪声数据,将这些少数类噪声数据划分到overlapping数据子集会使得one-class-SVM模型的决策边界向多数类样本偏移,使得大量多数类样本被划分到overlapping数据子集中,使得overlapping数据子集的数据不均衡比增加,给后续模型学习带来困难。因此,将部分少数类样本当作噪声数据判定为多数类样本,可以提升overlapping数据子集的质量,从而提升欺诈识别模型的整体性能。如图1b所示,将1个少数类噪声数据判定为多数类数据,可以使得one-class-SVM模型的决策边界收缩,形成多数类数据更少的overlapping数据子集-2,降低容易区分的数据对后续分类模型的影响。为了能够量化将部分少数类样本判定为多数类样本造成的损失以及由此对overlapping数据子集的影响,本发明提出动态加权信息熵。The original data set contains a large amount of normal transaction data and a small amount of fraudulent transaction data, of which the fraudulent transaction data is used to train the one-class-SVM model. Ideally, the trained one-class-SVM model can divide the original data set into a non-overlapping data subset containing only normal transaction data and an overlapping data subset containing both normal and fraudulent transaction data, as shown in the overlapping data subset-1 in Figure 1b. However, there are inevitably some noise data in the fraudulent transaction data due to incorrect labeling, etc. Dividing these minority class noise data into the overlapping data subset will cause the decision boundary of the one-class-SVM model to shift toward the majority class samples, causing a large number of majority class samples to be divided into the overlapping data subset, increasing the data imbalance ratio of the overlapping data subset, and bringing difficulties to subsequent model learning. Therefore, treating some minority class samples as noise data and determining them as majority class samples can improve the quality of the overlapping data subset, thereby improving the overall performance of the fraud recognition model. As shown in Figure 1b, judging a minority class noise data as majority class data can shrink the decision boundary of the one-class-SVM model, forming an overlapping data subset-2 with less majority class data, reducing the impact of easily distinguishable data on subsequent classification models. In order to quantify the loss caused by judging some minority class samples as majority class samples and the impact on the overlapping data subset, the present invention proposes dynamic weighted information entropy.
具体地,所述one-class-SVM模型为单分类模型。对于同一个one-class-SVM模型可以使用不同的超参数θocsvm,例如超参数而每个超参数会使训练后的one-class-SVM模型将原始数据划得到的overlapping数据子集的动态加权信息熵不同。Specifically, the one-class-SVM model is a single-class model. Different hyperparameters θ ocsvm can be used for the same one-class-SVM model. For example, the hyperparameter Each hyperparameter will cause the trained one-class-SVM model to divide the original data into overlapping data subsets with different dynamic weighted information entropy.
具体地,所述使用欺诈交易样本训练含超参数的one-class-SVM模型;包括:使用欺诈交易样本的第一部分样本训练超参数的one-class-SVM模型;使用欺诈交易样本的第二部分样本验证超参数的one-class-SVM模型;重复训练和验证步骤直到所述超参数的one-class-SVM模型的准确性符合第一预设要求。例如,所述欺诈交易样本是指包含欺诈交易数据的样本。利用欺诈交易样本训练超参数的one-class-SVM模型,可以使超参数的one-class-SVM模型识别欺诈交易数据和正常交易数据。使用欺诈交易样本的某个月前二十天的样本训练超参数的one-class-SVM模型;使用欺诈交易样本的某个月后十天的样本验证超参数的one-class-SVM模型;重复训练和验证步骤直到所述超参数的one-class-SVM模型的准确性符合第一预设要求。所述第一预设要求是指超参数的one-class-SVM模型识别overlapping数据子集和non-overlapping数据子集的准确率。所述non-overlapping数据子集由于只包含正常交易数据,overlapping数据子集由于同时含有正常和欺诈交易数据,而通过欺诈交易样本训练超参数的one-class-SVM模型,可以使超参数的one-class-SVM模型识别overlapping数据子集和non-overlapping数据子集。Specifically, the fraudulent transaction samples are used to train the hyperparameters One-class-SVM model; including: using the first part of the fraudulent transaction sample to train the hyperparameters One-class-SVM model of the fraudulent transaction sample; hyperparameters are verified using the second part of the fraudulent transaction sample One-class-SVM model; repeat the training and validation steps until the hyperparameters The accuracy of the one-class-SVM model meets the first preset requirement. For example, the fraudulent transaction sample refers to a sample containing fraudulent transaction data. The fraudulent transaction sample is used to train the hyperparameters The one-class-SVM model can make the hyperparameters The one-class-SVM model identifies fraudulent transaction data and normal transaction data. The hyperparameters are trained using samples of fraudulent transaction samples from the first twenty days of a certain month. One-class-SVM model; hyperparameters are verified using a sample of fraudulent transactions from the last ten days of a certain month One-class-SVM model; repeat the training and validation steps until the hyperparameters The accuracy of the one-class-SVM model meets the first preset requirement. The first preset requirement refers to the hyperparameter The accuracy of the one-class-SVM model in identifying overlapping data subsets and non-overlapping data subsets. The non-overlapping data subset contains only normal transaction data, while the overlapping data subset contains both normal and fraudulent transaction data. The hyperparameters are trained using fraudulent transaction samples. The one-class-SVM model can make the hyperparameters The one-class-SVM model identifies overlapping data subsets and non-overlapping data subsets.
具体地,使用训练后的one-class-SVM模型将原始数据划分为overlapping数据子集和non-overlapping数据子集。所述overlapping(重叠)数据子集和non-overlapping(非重叠)数据子集,其中仅含正常交易数据的non-overlapping数据子集和同时含有正常和欺诈交易数据的overlapping数据子集。因此,non-overlapping数据子集由于只包含正常交易数据,因此,non-overlapping数据子集不需要再进行区分,而overlapping数据子集由于同时含有正常和欺诈交易数据,因此,overlapping数据子集要再进行区分。欺诈交易数据中难免存在一些由于错误标记等产生的噪声数据,将这些少数类噪声数据划分到overlapping数据子集会使得one-class-SVM模型的决策边界向多数类样本偏移,使得大量多数类样本被划分到overlapping数据子集中,使得overlapping数据子集的数据不均衡比增加,给后续模型学习带来困难。因此,将部分少数类样本当作噪声数据判定为多数类样本,可以提升overlapping数据子集的质量,从而提升欺诈识别模型的整体性能。Specifically, the trained one-class-SVM model is used to divide the original data into overlapping data subsets and non-overlapping data subsets. The overlapping data subsets and non-overlapping data subsets include a non-overlapping data subset containing only normal transaction data and an overlapping data subset containing both normal and fraudulent transaction data. Therefore, since the non-overlapping data subset only contains normal transaction data, the non-overlapping data subset does not need to be distinguished, while since the overlapping data subset contains both normal and fraudulent transaction data, the overlapping data subset needs to be distinguished. It is inevitable that there are some noise data generated by incorrect labels in the fraudulent transaction data. Dividing these minority class noise data into the overlapping data subset will cause the decision boundary of the one-class-SVM model to shift toward the majority class samples, so that a large number of majority class samples are divided into the overlapping data subset, which increases the data imbalance ratio of the overlapping data subset, bringing difficulties to subsequent model learning. Therefore, treating some minority class samples as noise data and judging them as majority class samples can improve the quality of the overlapping data subset, thereby improving the overall performance of the fraud identification model.
具体地,可使用one-class-SVM模型对全部原始数据集进行数据划分。在one-class-SVM模型决策边界内的交易数据全部划分为overlapping数据子集,其中包含绝大部分欺诈交易数据和部分与欺诈交易相似度较高的正常交易数据,需要使用学习能力强的非线性机器学习模型来区分。在one-class-SVM模型决策边界外的交易数据全部划分为non-overlapping数据子集,其中仅包含少量欺诈交易的噪声数据,其余全部为正常交易数据,因此可将non-overlapping数据子集全部判定为正常交易,少量被判定为正常交易的欺诈交易不会对欺诈识别模型的整体性能带来显著影响。将原始数据集中容易区分的大量交易数据划分到non-overlapping数据子集中,降低了overlapping数据子集的不均衡比,同时排出了容易分类的数据对机器学习模型训练的干扰。Specifically, the one-class-SVM model can be used to divide the data of all original data sets. All transaction data within the decision boundary of the one-class-SVM model are divided into overlapping data subsets, which contain most of the fraudulent transaction data and some normal transaction data with high similarity to fraudulent transactions. Nonlinear machine learning models with strong learning ability are needed to distinguish them. All transaction data outside the decision boundary of the one-class-SVM model are divided into non-overlapping data subsets, which only contain a small amount of noise data of fraudulent transactions, and the rest are all normal transaction data. Therefore, all non-overlapping data subsets can be judged as normal transactions, and a small number of fraudulent transactions judged as normal transactions will not have a significant impact on the overall performance of the fraud identification model. The large amount of transaction data that is easy to distinguish in the original data set is divided into non-overlapping data subsets, which reduces the imbalance ratio of overlapping data subsets and eliminates the interference of easy-to-classify data on the training of machine learning models.
具体地,不同的超参数对应相应的one-class-SVM模型,而每个超参数都具有相应的one-class-SVM模型,也具有相应的one-class-SVM模型将原始数据划分为overlapping数据子集和non-overlapping数据子集,而每个划分后得到的overlapping数据子集具有动态加权信息熵。使用欺诈交易样本训练含有超参数的one-class-SVM模型,计算不同超参数对应的one-class-SVM模型将原始数据划分为overlapping数据子集的overlapping数据子集动态加权信息熵,选择overlapping数据子集的动态加权信息熵最大的超参数对应的one-class-SVM模型,作为overlapping数据子集的动态加权信息熵最大的one-class-SVM模型Mocsvm。Specifically, different hyperparameters correspond to corresponding one-class-SVM models, and each hyperparameter has a corresponding one-class-SVM model, and also has a corresponding one-class-SVM model to divide the original data into overlapping data subsets and non-overlapping data subsets, and each overlapping data subset obtained after division has a dynamic weighted information entropy. Using fraudulent transaction samples to train a model containing hyperparameters One-class-SVM model, calculate different hyperparameters The corresponding one-class-SVM model divides the original data into overlapping data subsets, and selects the hyperparameter with the largest dynamic weighted information entropy of the overlapping data subsets. The corresponding one-class-SVM model is the one-class-SVM model Mocsvm with the largest dynamic weighted information entropy of the overlapping data subset.
因此,计算不同超参数对应的one-class-SVM模型的overlapping数据子集的动态加权信息熵,选择overlapping数据子集的动态加权信息熵最大的超参数对应的one-class-SVM模型,作为动态加权信息熵最大的one-class-SVM模型Mocsvm。Therefore, calculating different hyperparameters The dynamic weighted information entropy of the overlapping data subset of the corresponding one-class-SVM model, select the hyperparameter with the largest dynamic weighted information entropy of the overlapping data subset The corresponding one-class-SVM model is the one-class-SVM model Mocsvm with the largest dynamic weighted information entropy.
具体地,所述overlapping数据子集的动态加权信息熵定义为:Specifically, the dynamic weighted information entropy of the overlapping data subset is defined as:
GDWE(θ)=Wsnr*HG DWE (θ)=W snr *H
其中,θ表示one-class-SVM模型的一组超参数;H表示overlapping数据子集的信息熵;Wsnr表示H的动态权重,由少数类样本的信噪比决定,H和Wsnr分别定义为:Among them, θ represents a set of hyperparameters of the one-class-SVM model; H represents the information entropy of the overlapping data subset; W snr represents the dynamic weight of H, which is determined by the signal-to-noise ratio of the minority class samples. H and W snr are defined as:
其中i∈{0,1,...,k},k表示overlapping数据子集中的类别(对于欺诈交易识别来说k=1),pi表示overlapping数据子集中的某个样本属于类别i的概率,nall表示原始数据集中少数类样本的数量,noutliers表示原始数据集中的被判定为噪声数据的少数类样本数量。本发明提出的动态加权信息熵是对少数类样本信息损失和overlapping数据质量之间的权衡,只有在H和Wsnr都达到比较大的值时才能取得较大的动态加权信息熵。将本发明提出的动态加权信息熵作为one-class-SVM模型超参数选择的评价指标,能够保证欺诈交易识别模型的整体性能。调整one-class-SVM模型的不同的超参数,选择其中使得动态加权信息熵最大的。例如动态加权信息熵最大的超参数对应的one-class-SVM模型,那么称之为作为动态加权信息熵最大的one-class-SVM模型Mocsvm。相比于现有的处理带有重叠的数据不均衡问题的方法(使用KNN模型划分重叠和非重叠数据子集),本发明使用one-class-SVM模型来划分重叠和非重叠数据子集,计算速度更快。提出动态加权信息熵作为one-class-SVM模型选择超参数的参考指标,在带有特定超参数的one-class-SVM完成训练时,就可计算得到动态加权信息熵,不需要完成整个欺诈交易识别模型的训练,就能对此one-class-SVM模型的性能进行评估。相比于现有处理带有重叠数据不均衡的方法(没有动态加权信息熵,需要完成整个模型的训练才能知道重叠和非重叠数据子集划分的好坏),可以省去数据划分之后的模型训练过程,加快模型训练过程,同时减少模型训练的资源消耗。Where i∈{0, 1, ..., k}, k represents the category in the overlapping data subset (k=1 for fraudulent transaction identification), pi represents the probability that a sample in the overlapping data subset belongs to category i, n all represents the number of minority class samples in the original data set, and n outliers represents the number of minority class samples in the original data set that are judged to be noise data. The dynamic weighted information entropy proposed in the present invention is a trade-off between the information loss of minority class samples and the quality of overlapping data. A larger dynamic weighted information entropy can only be obtained when both H and W snr reach relatively large values. Using the dynamic weighted information entropy proposed in the present invention as an evaluation indicator for the selection of hyperparameters of the one-class-SVM model can ensure the overall performance of the fraudulent transaction identification model. Adjust the different hyperparameters of the one-class-SVM model and select the one that maximizes the dynamic weighted information entropy. For example, the hyperparameter with the largest dynamic weighted information entropy The corresponding one-class-SVM model is then called the one-class-SVM model Mocsvm with the largest dynamic weighted information entropy. Compared with the existing method for processing the problem of data imbalance with overlap (using the KNN model to divide overlapping and non-overlapping data subsets), the present invention uses the one-class-SVM model to divide overlapping and non-overlapping data subsets, and the calculation speed is faster. The dynamic weighted information entropy is proposed as a reference indicator for selecting hyperparameters of the one-class-SVM model. When the one-class-SVM with specific hyperparameters completes the training, the dynamic weighted information entropy can be calculated. The performance of this one-class-SVM model can be evaluated without completing the training of the entire fraud transaction recognition model. Compared with the existing method for processing data imbalance with overlap (without dynamic weighted information entropy, the entire model needs to be trained to know the quality of the overlapping and non-overlapping data subset division), the model training process after data division can be omitted, the model training process can be accelerated, and the resource consumption of model training can be reduced.
一方面,信息熵H可以表示overlapping数据子集的平均信息量,如果overlapping数据子集中的多数类样本数量过多,即p0接近于1,p1接近于0(0表示多数类,1表示少数类),会使得信息熵H接近于0。仅当overlapping数据子集中的多数类和少数类样本比较均衡时,即p0和p1都接近于0体5,才能使得信息熵接近于最大值1。On the one hand, information entropy H can represent the average amount of information in the overlapping data subsets. If there are too many majority class samples in the overlapping data subsets, that is, p0 is close to 1 and p1 is close to 0 (0 represents the majority class and 1 represents the minority class), the information entropy H will be close to 0. Only when the majority class and minority class samples in the overlapping data subsets are relatively balanced, that is, p0 and p1 are both close to 0, can the information entropy be close to the maximum value of 1.
另一方面,基于原始数据集少数类样本信噪比的动态权值Wsnr能够反映少数类样本判定为噪声数据带来的损失。原始数据集少数类样本的信噪比随着one-class-SVM模型超参数的变化而变化。原始数据集中,越多的少数类样本被one-class-SVM模型判定为噪声数据,则其信噪比Wsnr越小,即少数类样本丢失的信息越多。On the other hand, the dynamic weight W snr based on the signal-to-noise ratio of the minority class samples in the original data set can reflect the loss caused by the minority class samples being judged as noise data. The signal-to-noise ratio of the minority class samples in the original data set changes with the changes in the hyperparameters of the one-class-SVM model. In the original data set, the more minority class samples are judged as noise data by the one-class-SVM model, the smaller their signal-to-noise ratio W snr is, that is, the more information the minority class samples lose.
步骤S12、使用one-class-SVM模型Mocsvm将原始数据划分为overlapping数据子集和non-overlapping数据子集。Step S12: Use the one-class-SVM model Mocsvm to divide the original data into overlapping data subsets and non-overlapping data subsets.
因此,本发明提出的动态加权信息熵是对少数类样本信息损失和overlapping数据质量之间的权衡,只有在H和Wsnr都达到比较大的值时才能取得较大的动态加权信息熵。将本发明提出的动态加权信息熵作为one-class-SVM模型超参数选择的评价指标,能够保证欺诈交易识别模型的整体性能。调整不同的one-class-SVM模型的超参数,选择其中使得动态加权信息熵最大的。再使用所述动态加权信息熵最大one-class-SVM模型Mocsvm将原始数据划分为overlapping数据子集和non-overlapping数据子集。Therefore, the dynamic weighted information entropy proposed in the present invention is a trade-off between the information loss of minority class samples and the quality of overlapping data. A larger dynamic weighted information entropy can be obtained only when both H and W snr reach relatively large values. Using the dynamic weighted information entropy proposed in the present invention as an evaluation indicator for selecting hyperparameters of the one-class-SVM model can ensure the overall performance of the fraudulent transaction identification model. Adjust the hyperparameters of different one-class-SVM models and select the one that maximizes the dynamic weighted information entropy. Then use the one-class-SVM model Mocsvm with the maximum dynamic weighted information entropy to divide the original data into overlapping data subsets and non-overlapping data subsets.
如图1c所示,在确定好one-class-SVM模型参数后,即可使用one-class-SVM模型对全部原始数据集进行数据划分。在one-class-SVM模型决策边界内的交易数据全部划分为overlapping数据子集,其中包含绝大部分欺诈交易数据和部分与欺诈交易相似度较高的正常交易数据。在one-class-SVM模型决策边界外的交易数据全部划分为non-overlapping数据子集,其中仅包含少量欺诈交易的噪声数据,其余全部为正常交易数据,因此可将non-overlapping数据子集全部判定为正常交易,少量被判定为正常交易的欺诈交易不会对欺诈识别模型的整体性能带来显著影响。将原始数据集中容易区分的大量交易数据划分到non-overlapping数据子集中,降低了overlapping数据子集的不均衡比,同时排出了容易分类的数据对机器学习模型训练的干扰。As shown in Figure 1c, after determining the parameters of the one-class-SVM model, the one-class-SVM model can be used to divide the data of the entire original data set. All transaction data within the decision boundary of the one-class-SVM model are divided into overlapping data subsets, which contain most of the fraudulent transaction data and some normal transaction data with a high similarity to fraudulent transactions. All transaction data outside the decision boundary of the one-class-SVM model are divided into non-overlapping data subsets, which only contain a small amount of noise data of fraudulent transactions, and the rest are all normal transaction data. Therefore, all non-overlapping data subsets can be judged as normal transactions, and a small number of fraudulent transactions judged as normal transactions will not have a significant impact on the overall performance of the fraud identification model. The large amount of transaction data that is easy to distinguish in the original data set is divided into the non-overlapping data subset, which reduces the imbalance ratio of the overlapping data subset and eliminates the interference of easy-to-classify data on the training of the machine learning model.
步骤S13、使用one-class-SVM模型Mocsvm划分得到的overlapping数据子集训练非线性分类器模型Mclf,使用非线性分类器模型Mclf区分overlapping数据子集中的欺诈交易和正常交易。Step S13: Use the one-class-SVM model Mocsvm to divide the overlapping data subsets to train a nonlinear classifier model Mclf , and use the nonlinear classifier model Mclf to distinguish fraudulent transactions from normal transactions in the overlapping data subsets.
具体地,所述使用one-class-SVM模型Mocsvm划分得到的overlapping数据子集训练非线性分类器模型Mclf,包括:使用overlapping数据子集的第一部分样本训练非线性分类器模型Mclf;使用overlapping数据子集的第二部分样本验证非线性分类器模型Mclf;重复训练和验证步骤直到所述非线性分类器模型Mclf的准确性符合预设要求。例如,使用overlapping数据子集的某个月的前20天的数据样本训练非线性分类器模型Mclf,使用overlapping数据子集的某个月的后10天的样本验证非线性分类器模型Mclf;重复训练和验证步骤直到所述非线性分类器模型Mclf的准确性符合预设要求。所述预设要求是指所述非线性分类器模型Mclf对于overlapping数据子集中欺诈交易数据和正常交易数据进行区分的准确率。Overlapping数据子集中正常交易数据和欺诈交易数据在原始特征空间中难以区分,需要使用非线性机器学习模型如深度学习模型等来学习更为复杂的特征组合将原始数据映射到新的特征空间中,使得正常交易和欺诈交易在新的特征空间中能够更容易区分,从而保证欺诈交易识别的性能。Specifically, the overlapping data subset obtained by dividing using the one-class-SVM model Mocsvm is trained with a nonlinear classifier model Mclf , including: using the first part of the samples of the overlapping data subset to train the nonlinear classifier model Mclf ; using the second part of the samples of the overlapping data subset to verify the nonlinear classifier model Mclf ; repeating the training and verification steps until the accuracy of the nonlinear classifier model Mclf meets the preset requirements. For example, the nonlinear classifier model Mclf is trained with the data samples of the first 20 days of a certain month of the overlapping data subset, and the nonlinear classifier model Mclf is verified with the samples of the last 10 days of a certain month of the overlapping data subset; repeating the training and verification steps until the accuracy of the nonlinear classifier model Mclf meets the preset requirements. The preset requirements refer to the accuracy of the nonlinear classifier model Mclf in distinguishing fraudulent transaction data and normal transaction data in the overlapping data subset. It is difficult to distinguish normal transaction data and fraudulent transaction data in the overlapping data subset in the original feature space. It is necessary to use nonlinear machine learning models such as deep learning models to learn more complex feature combinations to map the original data to a new feature space, so that normal transactions and fraudulent transactions can be more easily distinguished in the new feature space, thereby ensuring the performance of fraudulent transaction identification.
步骤S14、生成由one-class-SVM模型Mocsvm和非线性分类器模型Mclf构成的欺诈交易识别模型。Step S14: Generate a fraudulent transaction identification model consisting of a one-class-SVM model Mocsvm and a nonlinear classifier model Mclf .
具体地,基于动态加权信息熵的欺诈交易识别方法的算法如下所示:Specifically, the algorithm of the fraudulent transaction identification method based on dynamic weighted information entropy is as follows:
具体地,使用欺诈交易样本训练含有超参数的one-class-SVM模型;其中属于使用训练后的one-class-SVM模型将原始数据划分为overlapping数据子集和non-overlapping数据子集。对应步骤:Specifically, fraudulent transaction samples are used to train the hyperparameter One-class-SVM model; belong Use the trained one-class-SVM model to divide the original data into overlapping data subsets and non-overlapping data subsets. Corresponding steps:
具体地,计算不同超参数对应的one-class-SVM模型划分得到的overlapping数据子集的动态加权信息熵,选择动态加权信息熵最大的overlapping数据子集对应超参数对应的one-class-SVM模型,作为overlapping数据子集的动态加权信息熵最大的one-class-SVM模型Mocsvm。对应步骤:Specifically, calculate different hyperparameters The dynamic weighted information entropy of the overlapping data subsets obtained by the corresponding one-class-SVM model division, and the corresponding hyperparameter of the overlapping data subset with the largest dynamic weighted information entropy is selected The corresponding one-class-SVM model is the one-class-SVM model Mocsvm with the largest dynamic weighted information entropy of the overlapping data subset. The corresponding steps are:
具体地,使用one-class-SVM模型Mocsvm将原始数据划分为overlapping数据子集和non-overlapping数据子集。对应步骤:Specifically, the one-class-SVM model Mocsvm is used to divide the original data into overlapping data subsets and non-overlapping data subsets. The corresponding steps are:
14.使用Mocsvm将原始数据划分为overlapping和non-overlapping数据子集14. Use Mocsvm to divide the original data into overlapping and non-overlapping data subsets
具体地,使用one-class-SVM模型Mocsvm划分得到的overlapping数据子集训练非线性分类器模型Mclf,使用非线性分类器模型Mclf区分overlapping数据子集中的欺诈交易和正常交易,对应步骤:Specifically, the overlapping data subset obtained by dividing the one-class-SVM model Mocsvm is used to train the nonlinear classifier model Mclf , and the nonlinear classifier model Mclf is used to distinguish fraudulent transactions and normal transactions in the overlapping data subset. The corresponding steps are:
15.使用overlapping数据子集训练非线性分类器模型Mclf 15. Use overlapping data subsets to train the nonlinear classifier model Mclf
具体地,生成由one-class-SVM模型Mocsvm和非线性分类器模型Mclf构成的欺诈交易识别模型,对应步骤:Specifically, a fraudulent transaction identification model consisting of a one-class-SVM model Mocsvm and a nonlinear classifier model Mclf is generated, corresponding to the following steps:
16.Return由Mocsvm和Mclf构成的欺诈交易识别模型16.Return Fraudulent transaction identification model composed of Mocsvm and Mclf
上述步骤以中国某银行的交易数据为例展开实际测试。该数据集包含从2017年4月到6月的交易数据,数据量大约350万条,且都由银行专业人员进行标注。如表1所示,为此数据集的基本信息。The above steps are tested using the transaction data of a Chinese bank as an example. The data set contains transaction data from April to June 2017, with a data volume of about 3.5 million records, all of which are annotated by bank professionals. Table 1 shows the basic information of this data set.
表1电子交易数据信息Table 1 Electronic transaction data information
为了能够证明本发明所述方法在欺诈交易识别上的良好性能和较高的效率。我们将此数据集的全部数据按照每10天的交易数据分为1组,共形成9组数据。然后使用9组数据组成6个实验组。每个实验组由连续的4组数据组成,其中前3组数据用于模型训练,剩下1组数据用于模型测试。6个实验组的详细信息如表1所示,数据不均衡比Imbalance Ratio在48~69之间,而数据overlap率在0.26~0.29之间。In order to demonstrate the good performance and high efficiency of the method of the present invention in identifying fraudulent transactions. We divide all the data in this data set into 1 group according to the transaction data of every 10 days, forming a total of 9 groups of data. Then use the 9 groups of data to form 6 experimental groups. Each experimental group consists of 4 consecutive groups of data, of which the first 3 groups of data are used for model training and the remaining 1 group of data is used for model testing. The detailed information of the 6 experimental groups is shown in Table 1. The data imbalance ratio is between 48 and 69, and the data overlap rate is between 0.26 and 0.29.
表2实验组数据信息Table 2 Experimental group data information
其中,datasets(数据组)、no.ofsamples(样本数量)、no.ofmaj/min(多数类样本数量/少数类样本数量)、Imbalance Ratio(数据不均衡比)、degree ofoverlap(重叠率)Among them, datasets (data set), no.ofsamples (number of samples), no.ofmaj/min (number of majority class samples/number of minority class samples), Imbalance Ratio (data imbalance ratio), degree of overlap (overlap rate)
在每个实验组上,我们将本发明所述的基于动态加权信息熵的方法与4个处理数据不均衡问题的常用方法以及2个最新的方法进行对比实验:In each experimental group, we compared the method based on dynamic weighted information entropy described in the present invention with four common methods for dealing with data imbalance problems and two latest methods:
1)EditedNearest Neighbor(ENN);1)EditedNearest Neighbor(ENN);
2)Tomek links;2) Tomek links;
3)Synthetic Minority Oversampling Technique(SMOTE);3)Synthetic Minority Oversampling Technique(SMOTE);
4)One Class Support Vector Machine(OC-SVM);4)One Class Support Vector Machine(OC-SVM);
5)Overlap Sensitive Margin(OSM)Classifier;5)Overlap Sensitive Margin(OSM)Classifier;
6)Modified Tomek Link Search(NB-Tomek);6)Modified Tomek Link Search(NB-Tomek);
针对每个实验组,本发明所使用方法采用网格搜索的方式来对one-class-SVM模型的超参数进行评估筛选,其他对比方法的超参数按照其推荐值设置。我们使用F1值(综合性能)和G_mean(GM)(欺诈交易和正常交易识别准确率乘积的开平方)作为各个模型的评价指标。For each experimental group, the method used in this invention uses a grid search method to evaluate and screen the hyperparameters of the one-class-SVM model, and the hyperparameters of other comparison methods are set according to their recommended values. We use F1 value (comprehensive performance) and G_mean (GM) (square root of the product of fraudulent transaction and normal transaction identification accuracy) as evaluation indicators for each model.
首先,实验结果图1d中所示本文提出欺诈识别模型的整体F1值随动态加权信息熵DWE的变化情况,可知本发明提出的欺诈识别模型在DWE取得最大值时,能够得到更好的性能,证明了动态加权信息熵可以指导本文提出的欺诈交易识别模型的参数选择。然后,表3展示了不同方法的F1值和GM的对比结果,显然,本发明提出的方法对于overlapping的数据不均衡问题更加有效,ours(本申请采用的方法)的F1值和GM都相对于其他方法高。最后,实验结果图1e展示了本文提出方法中用于数据划分的one-class-SVM模型和识别overlapping数据子集中欺诈交易的非线性机器学习模型的平均时间消耗情况,可见采用本文提出的动态加权信息熵来直接指导one-class-SVM模型的超参数选择,而无需非线性机器学习模型的结果反馈,能够大大减少整体参数选择的计算复杂度,提升模型学习的效率。First, the experimental result Figure 1d shows the change of the overall F1 value of the fraud identification model proposed in this paper with the dynamic weighted information entropy DWE. It can be seen that the fraud identification model proposed in this paper can achieve better performance when DWE reaches the maximum value, proving that the dynamic weighted information entropy can guide the parameter selection of the fraud transaction identification model proposed in this paper. Then, Table 3 shows the comparison results of the F1 value and GM of different methods. Obviously, the method proposed in this paper is more effective for the problem of overlapping data imbalance. The F1 value and GM of ours (the method adopted in this application) are higher than those of other methods. Finally, the experimental result Figure 1e shows the average time consumption of the one-class-SVM model used for data partitioning in the method proposed in this paper and the nonlinear machine learning model for identifying fraudulent transactions in the overlapping data subset. It can be seen that the use of the dynamic weighted information entropy proposed in this paper to directly guide the hyperparameter selection of the one-class-SVM model without the result feedback of the nonlinear machine learning model can greatly reduce the computational complexity of the overall parameter selection and improve the efficiency of model learning.
表3实验测试结果Table 3 Experimental test results
如图1f所示,于一实施例中,本发明的基于动态加权信息熵的欺诈交易识别方法,包括:步骤S11、使用欺诈交易样本训练one-class-SVM模型,并采用动态加权信息熵选择模型的超参数:使用欺诈交易样本训练含有超参数的one-class-SVM模型;其中属于使用训练后的one-class-SVM模型将原始数据划分为overlapping数据子集和non-overlapping数据子集;计算不同超参数 对应的one-class-SVM模型划分得到的overlapping数据子集的动态加权信息熵,选择动态加权信息熵最大的overlapping数据子集对应超参数对应的one-class-SVM模型,作为overlapping数据子集的动态加权信息熵最大的one-class-SVM模型Mocsvm。步骤S12、使用确定好超参数的one-class-SVM模型将原始数据划分为overlapping和non-overlapping数据子集:使用one-class-SVM模型Mocsvm将原始数据划分为overlapping数据子集和non-overlapping数据子集。步骤S13、使用overlapping数据子集训练非线性分类器来区分欺诈和正常交易:使用one-class-SVM模型Mocsvm划分得到的overlapping数据子集训练非线性分类器模型Mclf,使用非线性分类器模型Mclf区分overlapping数据子集中的欺诈交易和正常交易。生成由one-class-SVM模型Mocsvm和非线性分类器模型Mclf构成的欺诈交易识别模型。As shown in FIG. 1f , in one embodiment, the fraudulent transaction identification method based on dynamic weighted information entropy of the present invention includes: step S11, using fraudulent transaction samples to train a one-class-SVM model, and using dynamic weighted information entropy to select hyperparameters of the model: using fraudulent transaction samples to train a one-class-SVM model containing hyperparameters One-class-SVM model; belong Use the trained one-class-SVM model to divide the original data into overlapping data subsets and non-overlapping data subsets; calculate different hyperparameters The dynamic weighted information entropy of the overlapping data subsets obtained by the corresponding one-class-SVM model division, and the corresponding hyperparameter of the overlapping data subset with the largest dynamic weighted information entropy is selected The corresponding one-class-SVM model is used as the one-class-SVM model Mocsvm with the largest dynamic weighted information entropy of the overlapping data subset. Step S12: Use the one-class-SVM model with the determined hyperparameters to divide the original data into overlapping and non-overlapping data subsets: Use the one-class-SVM model Mocsvm to divide the original data into overlapping data subsets and non-overlapping data subsets. Step S13: Use the overlapping data subset to train a nonlinear classifier to distinguish fraudulent and normal transactions: Use the overlapping data subset obtained by the one-class-SVM model Mocsvm to train a nonlinear classifier model Mclf , and use the nonlinear classifier model Mclf to distinguish fraudulent transactions from normal transactions in the overlapping data subset. Generate a fraudulent transaction recognition model composed of the one-class-SVM model Mocsvm and the nonlinear classifier model Mclf .
如图2所示,于一实施例中,本发明的基于动态加权信息熵的欺诈交易识别系统,包括:训练模块21、划分模块22、区分模块23和模型生成模块24;As shown in FIG2 , in one embodiment, the fraudulent transaction identification system based on dynamic weighted information entropy of the present invention includes: a training module 21 , a division module 22 , a distinction module 23 and a model generation module 24 ;
所述训练模块21用于使用欺诈交易样本训练含有超参数的one-class-SVM模型;其中属于使用训练后的one-class-SVM模型将原始数据划分为overlapping数据子集和non-overlapping数据子集;计算不同超参数 对应的one-class-SVM模型划分得到的overlapping数据子集的动态加权信息熵,选择动态加权信息熵最大的overlapping数据子集对应超参数对应的one-class-SVM模型,作为overlapping数据子集的动态加权信息熵最大的one-class-SVM模型Mocsvm。The training module 21 is used to train the fraudulent transaction samples with hyper parameters One-class-SVM model; belong Use the trained one-class-SVM model to divide the original data into overlapping data subsets and non-overlapping data subsets; calculate different hyperparameters The dynamic weighted information entropy of the overlapping data subsets obtained by the corresponding one-class-SVM model division, and the corresponding hyperparameter of the overlapping data subset with the largest dynamic weighted information entropy is selected The corresponding one-class-SVM model is the one-class-SVM model Mocsvm with the largest dynamic weighted information entropy of the overlapping data subset.
所述划分模块用于22使用one-class-SVM模型Mocsvm将原始数据划分为overlapping数据子集和non-overlapping数据子集。The partitioning module 22 is used to partition the original data into overlapping data subsets and non-overlapping data subsets using the one-class-SVM model Mocsvm .
所述区分模块23用于使用one-class-SVM模型Mocsvm划分得到的overlapping数据子集训练非线性分类器模型Mclf,使用非线性分类器模型Mclf区分overlapping数据子集中的欺诈交易和正常交易。The distinguishing module 23 is used to train a nonlinear classifier model Mclf using the overlapping data subset obtained by dividing the one-class-SVM model Mocsvm , and use the nonlinear classifier model Mclf to distinguish fraudulent transactions and normal transactions in the overlapping data subset.
所述模型生成模块24用于生成由one-class-SVM模型Mocsvm和非线性分类器模型Mclf构成的欺诈交易识别模型。The model generation module 24 is used to generate a fraudulent transaction identification model consisting of a one-class-SVM model M ocsvm and a nonlinear classifier model M clf .
具体地,所述overlapping数据子集的动态加权信息熵定义为:Specifically, the dynamic weighted information entropy of the overlapping data subset is defined as:
GDWE(θ)=Wsnr*HG DWE (θ)=W snr *H
其中,θ表示one-class-SVM模型的一组超参数;H表示overlapping数据子集的信息熵;Wsnr表示H的动态权重,由少数类样本的信噪比决定,H和Wsnr分别定义为:Among them, θ represents a set of hyperparameters of the one-class-SVM model; H represents the information entropy of the overlapping data subset; W snr represents the dynamic weight of H, which is determined by the signal-to-noise ratio of the minority class samples. H and W snr are defined as:
其中i∈{0,1,...,k},k表示overlapping数据子集中的类别(对于欺诈交易识别来说k=1),pi表示overlapping数据子集中的某个样本属于类别i的概率,nall表示原始数据集中少数类样本的数量,noutliers表示原始数据集中的被判定为噪声数据的少数类样本数量。Where i∈{0, 1, ..., k}, k represents the category in the overlapping data subset (k=1 for fraudulent transaction identification), pi represents the probability that a sample in the overlapping data subset belongs to category i, n all represents the number of minority class samples in the original data set, and n outliers represents the number of minority class samples in the original data set that are judged to be noise data.
具体地,所述非线性分类器模型为深度学习模型。Specifically, the nonlinear classifier model is a deep learning model.
具体地,所述使用one-class-SVM模型Mocsvm划分得到的overlapping数据子集训练非线性分类器模型Mclf,包括:使用overlapping数据子集的第一部分样本训练非线性分类器模型Mclf;使用overlapping数据子集的第二部分样本验证非线性分类器模型Mclf;重复训练和验证步骤直到所述非线性分类器模型Mclf的准确性符合预设要求。Specifically, the overlapping data subset obtained by dividing the one-class-SVM model Mocsvm is used to train the nonlinear classifier model Mclf , including: using a first part of samples of the overlapping data subset to train the nonlinear classifier model Mclf ; using a second part of samples of the overlapping data subset to verify the nonlinear classifier model Mclf ; repeating the training and verification steps until the accuracy of the nonlinear classifier model Mclf meets the preset requirements.
需要说明的是,训练模块21、划分模块22、区分模块23和模型生成模块24的结构和原理与上述基于动态加权信息熵的欺诈交易识别方法中的步骤一一对应,故在此不再赘述。It should be noted that the structures and principles of the training module 21, the division module 22, the differentiation module 23 and the model generation module 24 correspond one to one with the steps in the above-mentioned fraudulent transaction identification method based on dynamic weighted information entropy, so they will not be repeated here.
需要说明的是,应理解以上系统的各个模块的划分仅仅是一种逻辑功能的划分,实际实现时可以全部或部分集成到一个物理实体上,也可以物理上分开。且这些模块可以全部以软件通过处理元件调用的形式实现;也可以全部以硬件的形式实现;还可以部分模块通过处理元件调用软件的形式实现,部分模块通过硬件的形式实现。例如,x模块可以为单独设立的处理元件,也可以集成在上述装置的某一个芯片中实现,此外,也可以以程序代码的形式存储于上述装置的存储器中,由上述装置的某一个处理元件调用并执行以上x模块的功能。其它模块的实现与之类似。此外这些模块全部或部分可以集成在一起,也可以独立实现。这里所述的处理元件可以是一种集成电路,具有信号的处理能力。在实现过程中,上述方法的各步骤或以上各个模块可以通过处理器元件中的硬件的集成逻辑电路或者软件形式的指令完成。It should be noted that it should be understood that the division of the various modules of the above system is only a division of logical functions. In actual implementation, they can be fully or partially integrated into one physical entity, or they can be physically separated. And these modules can all be implemented in the form of software called by processing elements; they can also be all implemented in the form of hardware; some modules can also be implemented in the form of software called by processing elements, and some modules can be implemented in the form of hardware. For example, the x module can be a separately established processing element, or it can be integrated in a chip of the above device. In addition, it can also be stored in the memory of the above device in the form of program code, and called and executed by a processing element of the above device. The implementation of other modules is similar. In addition, these modules can be fully or partially integrated together, or they can be implemented independently. The processing element described here can be an integrated circuit with signal processing capabilities. In the implementation process, each step of the above method or each module above can be completed by an integrated logic circuit of hardware in the processor element or instructions in the form of software.
例如,以上这些模块可以是被配置成实施以上方法的一个或多个集成电路,例如:一个或多个特定集成电路(Application Specific Integrated Circuit,简称ASIC),或,一个或多个微处理器(Digital Singnal Processor,简称DSP),或,一个或者多个现场可编程门阵列(Field Programmable Gate Array,简称FPGA)等。再如,当以上某个模块通过处理元件调度程序代码的形式实现时,该处理元件可以是通用处理器,例如中央处理器(Central ProcessingUnit,简称CPU)或其它可以调用程序代码的处理器。再如,这些模块可以集成在一起,以片上系统(system-on-a-chip,简称SOC)的形式实现。For example, the above modules may be one or more integrated circuits configured to implement the above methods, such as one or more application specific integrated circuits (ASIC), or one or more microprocessors (DSP), or one or more field programmable gate arrays (FPGA). For another example, when a module is implemented in the form of a processing element scheduling program code, the processing element may be a general-purpose processor, such as a central processing unit (CPU) or other processor that can call program code. For another example, these modules may be integrated together and implemented in the form of a system-on-a-chip (SOC).
于本发明一实施例中,本发明还包括一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现上述任一所述基于动态加权信息熵的欺诈交易识别方法。In one embodiment of the present invention, the present invention also includes a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, any of the above-mentioned fraudulent transaction identification methods based on dynamic weighted information entropy is implemented.
本领域普通技术人员可以理解:实现上述各方法实施例的全部或部分步骤可以通过计算机程序相关的硬件来完成。前述的计算机程序可以存储于一计算机可读存储介质中。该程序在执行时,执行包括上述各方法实施例的步骤;而前述的存储介质包括:ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。Those skilled in the art can understand that all or part of the steps of implementing the above-mentioned method embodiments can be completed by hardware related to the computer program. The aforementioned computer program can be stored in a computer-readable storage medium. When the program is executed, the steps of the above-mentioned method embodiments are executed; and the aforementioned storage medium includes: ROM, RAM, magnetic disk or optical disk, etc., various media that can store program codes.
如图3所示,于一实施例中,本发明的基于动态加权信息熵的欺诈交易识别装置包括:处理器31和存储器32;所述存储器32用于存储计算机程序;所述处理器31与所述存储器32相连,用于执行所述存储器32存储的计算机程序,以使所述基于动态加权信息熵的欺诈交易识别装置执行任一所述的基于动态加权信息熵的欺诈交易识别方法。As shown in Figure 3, in one embodiment, the fraud transaction identification device based on dynamic weighted information entropy of the present invention includes: a processor 31 and a memory 32; the memory 32 is used to store computer programs; the processor 31 is connected to the memory 32, and is used to execute the computer program stored in the memory 32, so that the fraud transaction identification device based on dynamic weighted information entropy executes any of the fraud transaction identification methods based on dynamic weighted information entropy.
具体地,所述存储器32包括:ROM、RAM、磁碟、U盘、存储卡或者光盘等各种可以存储程序代码的介质。Specifically, the memory 32 includes: ROM, RAM, disk, USB flash drive, memory card or optical disk, etc., which can store program codes.
优选地,所述处理器31可以是通用处理器,包括中央处理器(Central ProcessingUnit,简称CPU)、网络处理器(Network Processor,简称NP)等;还可以是数字信号处理器(Digital Signal Processor,简称DSP)、专用集成电路(Application SpecificIntegrated Circuit,简称ASIC)、现场可编程门阵列(Field Programmable Gate Array,简称FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。Preferably, the processor 31 can be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), etc.; it can also be a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components.
综上所述,本发明基于动态加权信息熵的欺诈交易识别方法、系统及装置,采用分而治之的策略,为非线性机器学习模型排出大量易识别的正常交易数据,使得模型能仅关注于难以划分数据的学习,充分发挥了非线性模型的能力,提升欺诈交易识别模型的性能。所以,本发明有效克服了现有技术中的种种缺点而具高度产业利用价值。In summary, the fraudulent transaction identification method, system and device based on dynamic weighted information entropy of the present invention adopts a divide-and-conquer strategy to exclude a large amount of easily identifiable normal transaction data for the nonlinear machine learning model, so that the model can only focus on the learning of data that is difficult to divide, giving full play to the ability of the nonlinear model and improving the performance of the fraudulent transaction identification model. Therefore, the present invention effectively overcomes various shortcomings in the prior art and has a high industrial utilization value.
上述实施例仅例示性说明本发明的原理及其功效,而非用于限制本发明。任何熟悉此技术的人士皆可在不违背本发明的精神及范畴下,对上述实施例进行修饰或改变。因此,举凡所属技术领域中具有通常知识者在未脱离本发明所揭示的精神与技术思想下所完成的一切等效修饰或改变,仍应由本发明的权利要求所涵盖。The above embodiments are merely illustrative of the principles and effects of the present invention, and are not intended to limit the present invention. Anyone familiar with the art may modify or alter the above embodiments without departing from the spirit and scope of the present invention. Therefore, all equivalent modifications or alterations made by a person of ordinary skill in the art without departing from the spirit and technical concept disclosed by the present invention shall still be covered by the claims of the present invention.
Claims (8)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010436168.2A CN113723950B (en) | 2020-05-21 | 2020-05-21 | Fraudulent transaction identification method, system and device based on dynamic weighted information entropy |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010436168.2A CN113723950B (en) | 2020-05-21 | 2020-05-21 | Fraudulent transaction identification method, system and device based on dynamic weighted information entropy |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN113723950A CN113723950A (en) | 2021-11-30 |
| CN113723950B true CN113723950B (en) | 2024-03-05 |
Family
ID=78671283
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010436168.2A Active CN113723950B (en) | 2020-05-21 | 2020-05-21 | Fraudulent transaction identification method, system and device based on dynamic weighted information entropy |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN113723950B (en) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN118260683B (en) * | 2024-05-29 | 2024-08-02 | 天逸财金科技服务(武汉)有限公司 | Big data-based anti-fraud model training method and system |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110930218A (en) * | 2019-11-07 | 2020-03-27 | 中诚信征信有限公司 | Method and device for identifying fraudulent customer and electronic equipment |
| CN111079805A (en) * | 2019-12-03 | 2020-04-28 | 浙江工业大学 | Abnormal image detection method combining attention mechanism and information entropy minimization |
| CN111079283A (en) * | 2019-12-13 | 2020-04-28 | 四川新网银行股份有限公司 | Method for processing information saturation unbalanced data |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7668769B2 (en) * | 2005-10-04 | 2010-02-23 | Basepoint Analytics, LLC | System and method of detecting fraud |
-
2020
- 2020-05-21 CN CN202010436168.2A patent/CN113723950B/en active Active
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110930218A (en) * | 2019-11-07 | 2020-03-27 | 中诚信征信有限公司 | Method and device for identifying fraudulent customer and electronic equipment |
| CN111079805A (en) * | 2019-12-03 | 2020-04-28 | 浙江工业大学 | Abnormal image detection method combining attention mechanism and information entropy minimization |
| CN111079283A (en) * | 2019-12-13 | 2020-04-28 | 四川新网银行股份有限公司 | Method for processing information saturation unbalanced data |
Non-Patent Citations (3)
| Title |
|---|
| Computational Data and Social Networks;jiang changjun;《7th International Conference, CSoNet 2018》;全文 * |
| Overlapping One-Class SVMs for Utterance Verification in Speech Recognition;Cuiqin Hou;《2011 International Joint Conference of IEEE TrustCom-11/IEEE ICESS-11/FCST-11》;全文 * |
| 基于信息熵加权去噪的半监督SVM文本分类研究;李英英;《中国优秀硕士学位论文全文数据库》;全文 * |
Also Published As
| Publication number | Publication date |
|---|---|
| CN113723950A (en) | 2021-11-30 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN107103171B (en) | Modeling method and device of machine learning model | |
| CN111461855B (en) | Credit card fraud detection method, system, medium, and equipment based on undersampling | |
| CN107103332B (en) | A Relevance Vector Machine Classification Method for Large-Scale Datasets | |
| CN103632168B (en) | Classifier integration method for machine learning | |
| CN109902722A (en) | Classifier, neural network model training method, data processing equipment and medium | |
| CN111539733B (en) | Fraud transaction identification method, system and device based on full-center loss function | |
| CN110570312B (en) | Sample data acquisition method and device, computer equipment and readable storage medium | |
| CN108960304B (en) | A deep learning detection method for online transaction fraud | |
| WO2020250730A1 (en) | Fraud detection device, fraud detection method, and fraud detection program | |
| CN116030266A (en) | Pavement crack detection and classification method in natural scenes based on improved YOLOv3 | |
| CN111340086A (en) | Method, system, medium and terminal for processing unlabeled data | |
| WO2024045989A1 (en) | Graph network data set processing method and apparatus, electronic device, program, and medium | |
| CN111292182A (en) | Credit fraud detection method and system | |
| CN118261695A (en) | Risk assessment method and device, computer storage medium and electronic equipment | |
| CN113723950B (en) | Fraudulent transaction identification method, system and device based on dynamic weighted information entropy | |
| CN113052244A (en) | Classification model training method and classification model training device | |
| CN110324178A (en) | A kind of network inbreak detection method based on mostly through checking study | |
| CN112766320A (en) | Classification model training method and computer equipment | |
| CN116821688A (en) | Method for processing data set in credit card fraud transaction based on clustering downsampling technology | |
| CN117273060A (en) | A data optimization method based on influence function | |
| US11609936B2 (en) | Graph data processing method, device, and computer program product | |
| CN115454863A (en) | Software defect prediction method and system | |
| CN114022712A (en) | User classification method and device, computer equipment and storage medium | |
| CN112926697A (en) | Abrasive particle image classification method and device based on semantic segmentation | |
| CN118035125B (en) | Random test case generation method and system based on double-level probability selection |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |