CN105023025B - An open-set trace image classification method and system - Google Patents
An open-set trace image classification method and system Download PDFInfo
- Publication number
- CN105023025B CN105023025B CN201510481067.6A CN201510481067A CN105023025B CN 105023025 B CN105023025 B CN 105023025B CN 201510481067 A CN201510481067 A CN 201510481067A CN 105023025 B CN105023025 B CN 105023025B
- Authority
- CN
- China
- Prior art keywords
- image
- trace
- classified
- category
- images
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/2163—Partitioning the feature space
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Analysis (AREA)
Abstract
Description
技术领域technical field
本发明涉及一种分类方法及系统,特别是关于一种开集痕迹图像分类方法及系统。The invention relates to a classification method and system, in particular to an open-set trace image classification method and system.
背景技术Background technique
现场痕迹是刑事侦查的重要物证之一,而提取的痕迹多数以图像形式保存,如何对现场痕迹图像进行高效的归类管理成为了一个迫切需要解决的问题。On-site traces are one of the important physical evidence in criminal investigation, and most of the extracted traces are stored in the form of images. How to efficiently classify and manage on-site trace images has become an urgent problem to be solved.
痕迹图像分类是图像分类的一种,目前图像分类方法主要用于:(1)判定图像中是否含有感兴趣的物体,如人、飞机等;(2)判断图像所代表的场景,如森林、客厅和工厂等;(3)判定图像中物体的形态属性,如颜色、形状等。图像分类方法的基本框架是:(1)需要对图像库中的每一幅图像进行人工或自动标记;(2)根据标记的样本进行学习训练得到分类器;(3)提取待分类图像的特征;(4)相似性度量;(5)利用分类器进行分类。Trace image classification is a kind of image classification. At present, image classification methods are mainly used to: (1) determine whether the image contains objects of interest, such as people, airplanes, etc.; (2) determine the scene represented by the image, such as forest, Living room and factory, etc.; (3) Determine the morphological attributes of objects in the image, such as color, shape, etc. The basic framework of the image classification method is: (1) need to manually or automatically mark each image in the image library; (2) learn and train the classifier according to the marked samples; (3) extract the features of the image to be classified ; (4) Similarity measure; (5) Use classifier to classify.
目前图像分类方法存在的问题是:(1)分类器的精度依赖于标记的精度,由于痕迹图像非常多,且质量较差,而痕迹图像的标记多为自动标记,因此标记错误极为正常,这就极大影响了分类精度。(2)目前图像分类方法多是依据待分类图像与各类图像的共性特征的相似性进行分类,而痕迹图像多为受各种干扰以及残缺的图像,若求取类内的共性特征,可能造成各类图像的有用信息减弱或丢失。(3)目前图像分类方法若往图像库中新增一幅图像需要重新训练,由于痕迹图像库中图像非常多,若每次新增图像都要重新训练,在实施上有困难。The problems existing in the current image classification methods are: (1) The accuracy of the classifier depends on the accuracy of the markers. Since there are many trace images and their quality is poor, and most of the markers of the trace images are automatically marked, it is very normal for the markers to be wrong. It greatly affects the classification accuracy. (2) At present, most image classification methods are based on the similarity between the image to be classified and the common features of various types of images, and the trace images are mostly disturbed and incomplete images. If the common features within the class are obtained, it is possible The useful information of various images is weakened or lost. (3) In the current image classification method, if an image is added to the image library, retraining is required. Since there are many images in the trace image library, it is difficult to implement if each new image needs to be retrained.
发明内容Contents of the invention
针对上述问题,本发明的目的是提供一种高效的开集痕迹图像分类方法及系统,实现对现场提取的痕迹的分类和对痕迹图像库的自动扩充,通过对现场痕迹的分类实现案件的串并,为警方办案提供很大的帮助。In view of the above problems, the object of the present invention is to provide an efficient open-collection trace image classification method and system to realize the classification of traces extracted on the spot and the automatic expansion of the trace image library, and realize the serialization of cases through the classification of traces on the spot. And, provide great help for the police handling the case.
为实现上述目的,本发明采取以下技术方案:一种开集痕迹图像分类方法,它包括以下步骤:1)针对待分类图像进行预处理,使其形成痕迹图像库内存储的痕迹图像的规格,其中痕迹图像库包括若干痕迹图像和每一痕迹图像的所属类别;2)根据预处理后的待分类图像与痕迹图像库中每一痕迹图像的相似性筛选待分类图像的候选类别;3)根据待分类图像与候选类别中排在第一名的痕迹图像及其对应代表图像的相似性判断待分类图像为所属类别或是新增类别,若不能直接给出该待分类图像的分类情况,则进入下一步,否则,进入步骤6);4)根据多数表决原则或排在第一名和第二名痕迹图像之间的相似性落差判定条件判断待分类图像为所属类别,若不能直接给出该待分类图像的分类情况,则进入下一步,否则,进入步骤6);5)将待分类图像的候选类别中的代表图像提供给用户,由其交互式判别待分类图像的所属类别,进入步骤6);6)痕迹图像库更新,自动扩充其内存储痕迹图像。In order to achieve the above object, the present invention adopts the following technical solutions: an open-set trace image classification method, which includes the following steps: 1) preprocessing the image to be classified to form the specification of the trace image stored in the trace image library, Wherein the trace image library includes several trace images and the category of each trace image; 2) according to the similarity of each trace image in the preprocessed image to be classified and the trace image library, the candidate category of the image to be classified is screened; 3) according to The similarity between the image to be classified and the trace image ranked first in the candidate category and its corresponding representative image is used to determine whether the image to be classified belongs to the category or a new category. If the classification of the image to be classified cannot be directly given, then Go to the next step, otherwise, go to step 6); 4) According to the majority voting principle or the similarity gap judgment condition between the first and second trace images, it is judged that the image to be classified belongs to the category, if it cannot be directly given the The classification situation of the image to be classified, then enter the next step, otherwise, enter step 6); 5) provide the representative image in the candidate category of the image to be classified to the user, and interactively determine the category of the image to be classified by it, and enter the step 6); 6) The trace image library is updated, and the trace image stored in it is automatically expanded.
所述步骤2)包括以下步骤:①计算待分类图像的特征,记为②根据待分类图像的特征计算待分类图像与痕迹图像库中每一痕迹图像的相似性,记为S={s1,…,si,…,sN}其中si=sim(Tx,xi)表示待分类图像与痕迹图像di的相似性;xi为痕迹图像库D中每一痕迹图像的特征集合,即特征库记为X={x1,…,xi,…,xN},其中表示第i幅痕迹图像的特征,特征是d维实数;③将S中各值降序排列,得到并从中顺序选择前f类图像及其所在类别,作为待分类图像T的候选类别,记为P。Described step 2) comprises the following steps: 1. calculate the feature of image to be classified, denoted as ②Calculate the similarity between the image to be classified and each trace image in the trace image library according to the characteristics of the image to be classified, recorded as S={s 1 ,...,s i ,...,s N } where s i =sim(T x ,xi ) represents the similarity between the image to be classified and the trace image d i ; xi is the feature set of each trace image in the trace image library D, that is, the feature library is recorded as X={x 1 ,…, xi ,… , x N }, where Represents the feature of the i-th trace image, and the feature is a d-dimensional real number; ③ Arrange the values in S in descending order, and get and from Sequentially select the first f types of images and their categories, as the candidate category of the image T to be classified, denoted as P.
所述步骤3)包括以下步骤:①将待分类图像与待分类图像的候选类别P中排在第一名的痕迹图像的相似性记为sF,将待分类图像与待分类图像的候选类别P中排在第一名的痕迹图像所在类别代表图像的相似性记为sR;②若sF<sNew,则将待分类图像直接认定为新增类别,进入步骤6);否则进入下一步,其中sNew表示认定为新增类别的阈值;③若sF≥sDir且sR≥sRth,则将待分类图像直接认定为排在第一名痕迹图像所在的类别,进入步骤6);否则进入步骤4),其中sDir表示待分类图像与排在第一名痕迹图像的相似性阈值,sRth表示待分类图像与排在第一名痕迹图像所在类代表图像的相似性阈值,且sDir>sRth。Said step 3) includes the following steps: ① record the similarity between the image to be classified and the trace image ranked first in the candidate category P of the image to be classified as s F , and record the image to be classified and the candidate category of the image to be classified The similarity of the representative image of the category of the trace image ranked first in P is recorded as s R ; ②If s F <s New , then directly identify the image to be classified as a new category, and enter step 6); otherwise, enter the next step Step 1, where s New represents the threshold value for the newly added category; ③ If s F ≥ s Dir and s R ≥ s Rth , then the image to be classified is directly identified as the category of the trace image that ranks first, and proceeds to step 6 ); otherwise, go to step 4), where s Dir represents the similarity threshold between the image to be classified and the first trace image, and s Rth represents the similarity threshold between the image to be classified and the class representative image of the first trace image , and s Dir >s Rth .
所述步骤3)中代表图像的获取过程如下:①提取痕迹图像库中每一痕迹图像的特征,形成特征库X={x1,…,xi,…,xN}其中表示第i幅痕迹图像的特征,特征是d维实数;②计算痕迹图像库中每个类别中各成员之间的相似性,得到每一类别中成员之间的相似性矩阵,记为Wi,j(k),其中li=lj=k,且k=1,…,K;③根据Wi,j(k)按照计算得到各类别代表图像即在痕迹图像库的每类痕迹图像中选出与该类其他所有痕迹图像相似性积最大的痕迹图像作为代表图像。The acquisition process of the representative image in the step 3) is as follows: 1. extract the feature of each trace image in the trace image library to form a feature library X={x 1 ,..., xi ,...,x N } where Represents the feature of the i-th trace image, and the feature is a d-dimensional real number; ② Calculate the similarity between members of each category in the trace image library, and obtain the similarity matrix between members of each category, denoted as W i ,j (k), where l i =l j =k, and k=1,...,K; ③According to W i,j (k) according to Calculate the representative image of each category That is, in each type of trace image in the trace image library, select the trace image with the largest similarity product with all other trace images of this type as the representative image.
所述步骤4)包括以下步骤:①在待分类图像的候选类别P中,排在前q名的痕迹图像个数多的属于同一个类别,则待分类图像为该类别,进入步骤6),否则进入下一步;②在待分类图像的候选类别P中,若与排在第一名痕迹图像相似性远大于与排在第二名痕迹图像的相似性,即相似性落差大于Δs时,则待分类图像的类别为第一名痕迹图像所在的类别,进入步骤6),否则进入步骤5)。Said step 4) comprises the following steps: 1. among the candidate categories P of images to be classified, if the number of trace images ranked in the top q number belongs to the same category, then the image to be classified belongs to this category, and enters step 6), Otherwise, go to the next step; ②In the candidate category P of the image to be classified, if the similarity with the first-ranked trace image is much greater than the similarity with the second-ranked trace image, that is, when the similarity gap is greater than Δs, then The category of the image to be classified is the category of the first trace image, go to step 6), otherwise go to step 5).
所述步骤5)包括以下步骤:①将待分类图像的候选类别P中各图像所在类别的代表图像按序呈现给用户,并同时显示待分类图像;②用户根据每个代表图像查看其所在类别在待分类图像的候选类别P中出现的图像和名次;③用户根据①和②信息,判决待分类图像所属类别。The step 5) includes the following steps: 1. present the representative images of the categories of the images in the candidate category P of the images to be classified to the user in order, and display the images to be classified at the same time; 2. The user checks the category according to each representative image The images and rankings that appear in the candidate category P of the image to be classified; ③The user judges the category of the image to be classified according to the information in ① and ②.
所述步骤6)包括以下步骤:①痕迹图像库的更新,即D=D∪{T},新进入痕迹图像 库中的痕迹图像的下标索引号为N+1;痕迹图像库D包括若干痕迹图像和每一痕迹图像的所 属类别,其中,D={d1,…,di,…,dN},且表示第i幅痕迹图像,该图像的高度和宽 度分别为m和n,痕迹图像库的标记记为L={l1,…,li,…,lN},其中表示痕迹图像di 的标记类别号,痕迹图像库共有K个类别;②待分类图像所属类别lt类代表图像R(lt)的更 新,若待分类图像为新增类别,则待分类图像直接为代表图像,即R(lt)=N+1;否则,按如下 步骤进行:A、计算lt类内每一非待分类图像与该类所有图像的相似性积,并找到所有相似 性积中的最大值R1及该最大值的图像所对应的索引号i*:其中B、计算待 分类图像与lt类内其他图像的相似度积,即C、更新后的lt类的代表图像为:③标记的更新,即L=L∪{lt};④特征库的更新,即X=X∪{Tx}; ⑤痕迹图像库中痕迹图像数目增加1,即N=N+1。 Described step 6) comprises the following steps: 1. the update of the trace image database, namely D=D∪{T}, the subscript index number of the trace image newly entered in the trace image database is N+1; the trace image database D includes several Trace images and the category to which each trace image belongs, wherein, D={d 1 ,...,d i ,...,d N }, and represents the i-th trace image, the height and width of which are m and n respectively, The marks of the trace image library are marked as L={l 1 ,…,l i ,…,l N }, which represents the tag category number of the trace image d i , and there are K categories in the trace image library; ②The image to be classified belongs to the category l Class t represents the update of the image R(l t ). If the image to be classified is a new category, the image to be classified is directly a representative image, that is, R(l t )=N+1; otherwise, proceed as follows: A, Calculate the similarity product between each non-to-be-classified image and all images of this class in class l t , and find the maximum value R 1 in all similarity products and the index number i * corresponding to the image with the maximum value: in B. Calculate the similarity product between the image to be classified and other images in the l t class, that is, the representative image of the C, updated l t class is: ③Mark update, ie L=L∪{l t }; ④Feature library update, ie X=X∪{T x }; ⑤Increase the number of trace images in the trace image library by 1, ie N=N+1.
一种开集痕迹图像分类系统,其特征在于:它包括预处理模块、初始筛选模块、痕迹图像库、直接认定模块、补录模块和主观判决模块,且所述痕迹图像库内存储痕迹图像、每一痕迹图像所属类别;所述预处理模块将待分类图像预处理成所述痕迹图像库内存储的痕迹图像的规格,并将预处理之后的待分类图像传送给所述初始筛选模块;所述初始筛选模块根据预处理后的待分类图像与所述痕迹图像库中每一痕迹图像的相似性筛选待分类图像的候选类别,并就该候选类别传送给所述直接认定模块;所述直接认定模块根据待分类图像与候选类别中排在第一名的痕迹图像及其对应的代表图像的相似性判断待分类图像为所属类别或是新增类别,若该模块不能直接给出该待分类图像的分类情况,则将待分类图像的候选类别传送给所述补录模块,否则,将待分类图像的所属类别传送给所述痕迹图像库进行更新;所述补录模块根据多数表决原则或排在第一名和第二名痕迹图像之间的相似性落差判定条件判断待分类图像的所属类别,若该模块不能直接给出该待分类图像的分类情况,则将待分类图像的候选类别传送给所述主观判决模块,否则,将待分类图像的所属类别传送给所述痕迹图像库进行更新;所述主观判决模块将待分类图像的候选类别中的代表图像提供给用户,由其交互式判别待分类图像的所属类别,并将待分类图像的所属类别传送给所述痕迹图像库进行更新;所述痕迹图像库将待分类图像补充到所述痕迹图像库中,若为已有类别则补充到相应的类别,更新该类别的代表图像;否则为新增类别,设立新增类别,更新该类别的代表图像;同时,更新特征库、标记和痕迹图像数目。An open-collection trace image classification system is characterized in that it includes a preprocessing module, an initial screening module, a trace image library, a direct identification module, a supplementary recording module, and a subjective judgment module, and the trace image library stores trace images, Each trace image belongs to the category; the preprocessing module preprocesses the image to be classified into the specification of the trace image stored in the trace image library, and transmits the preprocessed image to be classified to the initial screening module; The initial screening module screens the candidate category of the image to be classified according to the similarity between the preprocessed image to be classified and each trace image in the trace image library, and transmits the candidate category to the direct identification module; the direct The identification module determines whether the image to be classified belongs to the category or a new category according to the similarity between the image to be classified and the trace image ranked first in the candidate category and its corresponding representative image. If the module cannot directly give the image to be classified If the classification situation of the image is different, then the candidate category of the image to be classified is sent to the supplementary recording module, otherwise, the category of the image to be classified is transmitted to the trace image library for updating; the supplementary recording module is based on the principle of majority voting or The similarity gap judgment condition between the first and second trace images judges the category of the image to be classified. If the module cannot directly give the classification of the image to be classified, the candidate category of the image to be classified will be sent to the subjective judgment module, otherwise, the category of the image to be classified is sent to the trace image library for updating; the subjective judgment module provides the representative image of the candidate category of the image to be classified to the user, and the interactive Discriminate the category of the image to be classified, and transmit the category of the image to be classified to the trace image database for updating; the trace image database will supplement the image to be classified into the trace image database, if it is an existing category, then Add to the corresponding category, update the representative image of this category; otherwise, add a new category, set up a new category, update the representative image of this category; at the same time, update the number of feature databases, marks and trace images.
本发明由于采取以上技术方案,其具有以下优点:1、本发明方法包括:1)针对待分类图像进行预处理,使其形成痕迹图像库内存储的痕迹图像的规格,其中痕迹图像库包括若干痕迹图像和每一痕迹图像的所属类别;2)根据预处理后的待分类图像与痕迹图像库中每一痕迹图像的相似性筛选待分类图像的候选类别;3)根据待分类图像与候选类别中排名第一位的痕迹图像及其对应代表图像的相似性判断待分类图像为所属类别或是新增类别,若不能直接给出该待分类图像的分类情况,则进入下一步,否则,痕迹图像库更新,自动扩充其内存储痕迹图像;4)根据多数表决原则或排在第一名和第二名痕迹图像之间的相似性落差判定条件判断待分类图像的所属类别,若不能直接给出该待分类图像的分类情况,则进入下一步,否则,痕迹图像库更新,自动扩充其内存储痕迹图像;5)将待分类图像的候选类别中的代表图像提供给用户,由其交互式判决待分类图像的所属类别,痕迹图像库更新,自动扩充其内存储痕迹图像;采用以上方式实现痕迹图像库的自动扩充和标记。另外,上述步骤3)、4)和5)为多层级联的分类策略,采用该策略从多个层面进行痕迹图像分类,从而大大提高了分类精度;尤其在步骤5)中,在分类可靠性不高时,还提供候选类别的代表图像供人机交互判决,只有该步骤需要人工识别,从而节省警方办案时间。2、本发明采用代表图像作为每个类别的代表与待分类图像进行相似性比较,同有的传统图像分类方法只利用每个图像跟待分类图像进行相似性比较而言,解决了因事先自动标记错误而带来的分类精度不高的问题。3、本发明将待分类图像与痕迹图像库中每一痕迹图像的相似性进行比对,同有的传统图像分类方法采用待分类图像与所有图像的共性特征相比,受各种干扰的影响较小,从而提高分类精度、结果相对准确且全面。4、本发明在新增图像的情况下,不需要对整个痕迹图像库的痕迹图像进行重新计算,只是在其所属类别中更新代表图像即可,相较传统方法每次新增图像都要训练而言,大大减少算法运行时间,相对速度较快。因此,本发明可以广泛用于痕迹分类领域。Because the present invention adopts the above technical scheme, it has the following advantages: 1. The method of the present invention includes: 1) performing preprocessing on the images to be classified to form the specifications of the trace images stored in the trace image database, wherein the trace image database includes several The trace image and the category of each trace image; 2) according to the similarity between the preprocessed image to be classified and each trace image in the trace image library, the candidate category of the image to be classified is screened; 3) according to the image to be classified and the candidate category The similarity between the first-ranked trace image and its corresponding representative image judges whether the image to be classified belongs to the category or a newly added category. If the classification of the image to be classified cannot be directly given, go to the next step; otherwise, the trace The image library is updated, and the trace images stored in it are automatically expanded; 4) According to the principle of majority voting or the similarity gap judgment condition between the first and second trace images, the category of the image to be classified is judged. If it cannot be directly given The classification situation of the image to be classified, then enter the next step, otherwise, the trace image database is updated, and the trace image stored in it is automatically expanded; 5) The representative image in the candidate category of the image to be classified is provided to the user, and the user is interactively judged The category of the image to be classified, the trace image library is updated, and the trace image stored in it is automatically expanded; the above method is used to realize the automatic expansion and marking of the trace image library. In addition, the above steps 3), 4) and 5) are multi-layer cascaded classification strategies, which are used to classify trace images from multiple levels, thereby greatly improving the classification accuracy; especially in step 5), the classification reliability When it is not high, representative images of candidate categories are also provided for human-computer interaction judgment, and only this step requires manual identification, thereby saving police time in handling cases. 2. The present invention uses the representative image as the representative of each category to carry out similarity comparison with the image to be classified. With the traditional image classification method that only utilizes each image to compare the similarity with the image to be classified, it solves the problem caused by the prior automatic The problem of low classification accuracy caused by labeling errors. 3. The present invention compares the image to be classified with the similarity of each trace image in the trace image database. Compared with the traditional image classification method that uses the common features of the image to be classified and all images, it is affected by various interferences. Smaller, so as to improve the classification accuracy, the result is relatively accurate and comprehensive. 4. In the case of adding new images, the present invention does not need to recalculate the trace images of the entire trace image database, but only updates the representative images in its category. Compared with the traditional method, training is required every time a new image is added. As far as the algorithm is concerned, the running time of the algorithm is greatly reduced, and the speed is relatively fast. Therefore, the present invention can be widely used in the field of trace classification.
附图说明Description of drawings
图1是本发明方法的整体流程示意图Fig. 1 is the overall flow chart diagram of the inventive method
图2是痕迹图像库中代表图像的生成过程Figure 2 is the generation process of representative images in the trace image library
图3是本发明系统的结构示意图Fig. 3 is the structural representation of the system of the present invention
具体实施方式Detailed ways
下面结合附图和实施例对本发明进行详细的描述。The present invention will be described in detail below in conjunction with the accompanying drawings and embodiments.
如图1所示,一种开集痕迹图像分类算法,它包括以下步骤:As shown in Figure 1, an open-set trace image classification algorithm includes the following steps:
1)针对待分类图像进行预处理,使其形成痕迹图像库内存储的痕迹图像的规格,其包括以下步骤:1) Preprocessing the image to be classified to form the specification of the trace image stored in the trace image library, which includes the following steps:
①将待分类图像的采集分辨率和采集角度调整成痕迹图像库内存储的痕迹图像同一规格上,以提高分类精度,其中待分类图像的高度和宽度分别为m和n;①The image to be classified The acquisition resolution and acquisition angle are adjusted to be on the same specification as the trace images stored in the trace image library to improve the classification accuracy, wherein the height and width of the image to be classified are m and n respectively;
②将待分类图像中所关注的痕迹从背景中提取出来,以便去除背景干扰;② Extract the traces of interest in the image to be classified from the background in order to remove background interference;
由于案发现场背景复杂,目前还没有较实用的自动提取方法,比较有效的方法就是通过人机交互的方式将痕迹从背景中提取出来;Due to the complex background of the crime scene, there is no practical automatic extraction method at present. The more effective method is to extract the traces from the background through human-computer interaction;
③采用均值滤波器去除痕迹图像所受的高斯噪声干扰,滤波器窗口大小为高斯噪声标准差的6倍,高斯噪声标准差用图像中平滑区的标准差近似;③A mean value filter is used to remove Gaussian noise interference from the trace image, the filter window size is 6 times the standard deviation of Gaussian noise, and the standard deviation of Gaussian noise is approximated by the standard deviation of the smooth area in the image;
④采用直方图规定化将待分类图像采集的光照条件调整与痕迹图像库中图像的采集条件一致,得到预处理后的待分类图像。④Using the histogram specification to adjust the illumination conditions of the images to be classified to be consistent with the acquisition conditions of the images in the trace image database, and obtain the preprocessed images to be classified.
上述痕迹图像库,记为D,包括若干痕迹图像和每一痕迹图像的所属类别,其中,D={d1,…,di,…,dN},且表示第i幅痕迹图像,该图像的高度和宽度分别为m和n,痕迹图像库的标记记为L={l1,…,li,…,lN},其中表示痕迹图像di的标记类别号,痕迹图像库共有K个类别。The above trace image database, denoted as D, includes several trace images and the category of each trace image, where D={d 1 ,...,d i ,...,d N }, and Represents the i-th trace image, the height and width of which are m and n respectively, and the marks of the trace image library are marked as L={l 1 ,…,l i ,…,l N }, where Indicates the tag category number of the trace image d i , and there are K categories in the trace image library.
2)根据预处理后的待分类图像与痕迹图像库中每一痕迹图像的相似性筛选待分类图像的候选类别,其包括以下步骤:2) According to the similarity between the preprocessed image to be classified and each trace image in the trace image database, the candidate category of the image to be classified is screened, which includes the following steps:
①计算待分类图像的特征,记为 ① Calculate the features of the image to be classified, denoted as
根据待分类图像中痕迹的种类采用相应的方式提取特征,通常采用的方法为SIFT(Scale-invariant feature transform,尺度不变特征变换)、SURF(Speeded up robustfeatures,加速鲁棒特征)以及小波傅里叶变换系数等;According to the type of traces in the image to be classified, features are extracted in a corresponding way. The commonly used methods are SIFT (Scale-invariant feature transform, scale-invariant feature transform), SURF (Speeded up robust features, accelerated robust features) and wavelet Fourier leaf transformation coefficients, etc.;
②根据待分类图像的特征计算待分类图像与痕迹图像库中每一痕迹图像的相似性,记为S={s1,…,si,…,sN},其中si=sim(Tx,xi)表示待分类图像与痕迹图像di的相似性;相似性函数sim(·)的定义有很多种,在本实施中采用的是归一化互相关系数;xi为痕迹图像库D中每一痕迹图像的特征集合,即特征库记为X={x1,…,xi,…,xN},其中表示第i幅痕迹图像的特征,特征是d维实数;②Calculate the similarity between the image to be classified and each trace image in the trace image library according to the characteristics of the image to be classified, recorded as S={s 1 ,…,s i ,…,s N }, where s i =sim(T x , x i ) represent the similarity between the image to be classified and the trace image d i ; there are many definitions of the similarity function sim( ), and in this implementation, the normalized cross-correlation coefficient is used; xi is the trace image The feature set of each trace image in library D, that is, the feature library is recorded as X={x 1 ,…, xi ,…,x N }, where Represents the feature of the i-th trace image, and the feature is a d-dimensional real number;
③将S中各值降序排列,得到并从中顺序选择前f类图像及其所在类别,作为待分类图像T的候选类别,记为P。③ Arrange the values in S in descending order to get and from Sequentially select the first f types of images and their categories, as the candidate category of the image T to be classified, denoted as P.
上述f一般取100。The above f generally takes 100.
3)根据待分类图像与候选类别中排在第一名的痕迹图像及其对应代表图像的相似性判断待分类图像为所属类别或是新增类别,若不能直接给出该待分类图像的分类情况,则进入下一步,否则,进入步骤6),其包括以下步骤:3) According to the similarity between the image to be classified and the trace image ranked first in the candidate category and its corresponding representative image, it is judged whether the image to be classified belongs to the category or a new category, if the classification of the image to be classified cannot be directly given situation, then enter the next step, otherwise, enter step 6), which includes the following steps:
①将待分类图像与待分类图像的候选类别P中排在第一名的痕迹图像的相似性记为sF,将待分类图像与待分类图像的候选类别P中排在第一名的痕迹图像所在类别代表图像的相似性记为sR;① Denote the similarity between the image to be classified and the trace image that ranks first in the candidate category P of the image to be classified as s F , and the trace image that ranks first in the candidate category P of the image to be classified and the image to be classified The category of the image represents the similarity of the image and is recorded as s R ;
②若sF<sNew,则将待分类图像直接认定为新增类别,进入步骤6);否则进入下一步,其中sNew表示认定为新增类别的阈值,该值若设得过大,可能将非新增图像认定为新增图像,该阈值一般取0.55,因为图像相似性的取值范围是0到1之间,因此可以定义相似性为0.55以下就是不像的两幅痕迹图像,符合人的主观思维;② If s F < s New , the image to be classified is directly identified as a newly added category, and proceeds to step 6); otherwise, enters the next step, where s New represents the threshold value for identifying as a newly added category, if the value is set too large, It is possible to identify a non-new image as a new image, and the threshold is generally 0.55, because the value range of image similarity is between 0 and 1, so it can be defined that the similarity below 0.55 is two different trace images, in line with human subjective thinking;
③若sF≥sDir且sR≥sRth,则将待分类图像直接认定为排在第一名痕迹图像所在的类别,进入步骤6);否则不能直接给出该待分类图像的分类情况,进入下一步,其中sDir表示待分类图像与排在第一名痕迹图像的相似性阈值,该阈值一般取0.71,在0.71以上就是相像的两幅图像,0.55-0.71之间需要继续判断两幅痕迹图像是否相像,0.55以下就是不相像的两幅痕迹图像;sRth表示待分类图像与排在第一名痕迹图像所在类代表图像的相似性阈值,该阈值一般取0.63,大于0.63就认为待分类的痕迹图像属于该类别,否则不属于该类别,且sDir>sRth。③If s F ≥ s Dir and s R ≥ s Rth , then the image to be classified is directly identified as the category of the first trace image, and enter step 6); otherwise, the classification of the image to be classified cannot be directly given , enter the next step, where s Dir represents the similarity threshold between the image to be classified and the first-ranked trace image. The threshold is generally 0.71. If it is above 0.71, it is two similar images. Between 0.55 and 0.71, it is necessary to continue to judge the two images. Whether the two trace images are similar or not, below 0.55 is the two trace images that are not similar; s Rth represents the similarity threshold between the image to be classified and the representative image of the category of the first trace image, the threshold is generally 0.63, and it is considered to be greater than 0.63 The trace image to be classified belongs to this category, otherwise it does not belong to this category, and s Dir >s Rth .
如图2所示,上述代表图像的获取过程如下:As shown in Figure 2, the above representative image acquisition process is as follows:
①提取痕迹图像库中每一痕迹图像的特征,形成特征库;① Extract the features of each trace image in the trace image library to form a feature library;
特征需要根据痕迹的种类而采用不同的方式提取,如SIFT,SURF以及小波傅里叶变换系数等。特征库记为X={x1,…,xi,…,xN},其中表示第i幅痕迹图像的特征,特征是d维实数;Features need to be extracted in different ways according to the type of traces, such as SIFT, SURF, and wavelet Fourier transform coefficients. The feature library is recorded as X={x 1 ,…,xi i ,…,x N }, where Represents the feature of the i-th trace image, and the feature is a d-dimensional real number;
②计算痕迹图像库中每个类别中各成员之间的相似性,得到每一类别中成员之间的相似性矩阵,记为Wi,j(k),其中li=lj=k,且k=1,…,K;②Calculate the similarity between members of each category in the trace image library, and obtain the similarity matrix between members of each category, which is denoted as W i,j (k), where l i =l j =k, And k=1,...,K;
③根据Wi,j(k)按照计算得到各类别代表图像即在痕迹图像库的每类痕迹图像中选出与该类其他所有痕迹图像相似性积最大的痕迹图像作为代表图像。③ According to W i,j (k) according to Calculate the representative image of each category That is, in each type of trace image in the trace image database, select the trace image with the largest similarity product with all other trace images of this type as the representative image.
4)根据多数表决原则或排在第一名和第二名痕迹图像之间的相似度落差判定条件判断待分类图像为所属类别,若不能直接给出该待分类图像的分类情况,则进入下一步,否则,进入步骤6),其包括以下步骤:4) Judging that the image to be classified belongs to the category according to the principle of majority voting or the similarity difference between the first and second trace images, if the classification of the image to be classified cannot be directly given, go to the next step , otherwise, enter step 6), which includes the following steps:
①在待分类图像的候选类别P中,排在前q名的痕迹图像个数多的属于同一个类别(多数表决原则),则待分类图像为该类别,然后进入步骤6),否则进入下一步;① Among the candidate categories P of images to be classified, if the number of trace images ranked in the top q is large and belong to the same category (majority voting principle), then the image to be classified belongs to this category, and then go to step 6), otherwise go to the next step step;
上述q一般取3。The above q generally takes 3.
②在待分类图像的候选类别P中,若与排在第一名痕迹图像相似性远大于与排在第二名痕迹图像的相似性,即相似性落差大于Δs时,则待分类图像的类别为第一名痕迹图像所在的类别,进入步骤6),否则进入下一步;②In the candidate category P of the image to be classified, if the similarity with the trace image ranked first is much greater than the similarity with the trace image ranked second, that is, when the similarity gap is greater than Δs, the category of the image to be classified For the category of the first trace image, go to step 6), otherwise go to the next step;
上述Δs一般取0.08。The above Δs is generally taken as 0.08.
5)将待分类图像的候选类别中的代表图像提供给用户,由其交互式判决待分类图像的所属类别,进入步骤6),其包括以下步骤:5) The representative image in the candidate category of the image to be classified is provided to the user, and the category of the image to be classified is interactively judged by it, and enters step 6), which includes the following steps:
①将待分类图像的候选类别P中各图像所在类别的代表图像按序呈现给用户,并同时显示待分类图像;① Present the representative images of the categories of the images in the candidate category P of the images to be classified to the user in sequence, and display the images to be classified at the same time;
②用户根据每个代表图像查看其所在类别在待分类图像的候选类别P中出现的图像和名次;②According to each representative image, the user checks the images and rankings of its category in the candidate category P of the image to be classified;
③用户根据①和②信息,判别待分类图像所属类别。③The user judges the category of the image to be classified according to the information in ① and ②.
6)痕迹图像库更新自动扩充,其包括以下步骤:6) The trace image library is updated and automatically expanded, which includes the following steps:
①痕迹图像库的更新,即D=D∪{T},新进入痕迹图像库中的痕迹图像的下标索引号为N+1,实现开集更新;① The update of the trace image library, that is, D=D∪{T}, the subscript index number of the trace image newly entered into the trace image library is N+1, and the open set update is realized;
②待分类图像所属类别lt类代表图像R(lt)的更新,若待分类图像为新增类,则待分类图直接为代表图像,即R(lt)=N+1;否则,按如下步骤进行:② The category l t of the image to be classified represents the update of the image R(l t ), if the image to be classified is a newly added class, the image to be classified is directly the representative image, that is, R(l t )=N+1; otherwise, Proceed as follows:
A、计算lt类内每一非待分类图像与该类所有图像的相似性积,并找到所有相似性 积中的最大值R1及该最大值的痕迹图像所对应的索引号i*:其中 A. Calculate the similarity product of each non-to-be-classified image and all images of this class in the l t class, and find the index number i * corresponding to the maximum value R in all similarity products and the trace image of the maximum value: in
B、计算待分类图像与lt类内其他痕迹图像的相似性积,即 B. Calculate the similarity product between the image to be classified and other trace images in the class l t , that is
C、更新后的lt类的代表图像为: C. The representative image of the updated l t class is:
③标记的更新,即L=L∪{lt};③The update of the mark, that is, L=L∪{l t };
④特征库的更新,即X=X∪{Tx};④Update of feature database, that is, X=X∪{T x };
⑤痕迹图像库中痕迹图像数目增加1,即N=N+1。⑤ The number of trace images in the trace image library is increased by 1, that is, N=N+1.
需要说明的是,若初次应用本发明时:若痕迹图像库为空,则自动将待分类图像作为新增类别加入到痕迹图像库,并作标记;若痕迹图像库存在已标记类别的痕迹图像,则根据待分类图像与痕迹图像库中痕迹图像的相似性,分三种情况进行处理:It should be noted that, if the present invention is applied for the first time: if the trace image library is empty, the image to be classified is automatically added to the trace image library as a new category and marked; if there are trace images of the marked category in the trace image library , according to the similarity between the image to be classified and the trace image in the trace image database, it is divided into three cases for processing:
1)自动认定待分类图像为痕迹图像库中的一个类别;1) automatically identify the image to be classified as a category in the trace image library;
2)自动认定可靠性不高时,提供候选类别的代表图像列表,由人机主观交互判决;2) When the reliability of the automatic determination is not high, a list of representative images of the candidate categories is provided, and the judgment is made by human-computer subjective interaction;
3)自动认定待分类图像为新增类别,对痕迹图像库的类别自动进行扩充。3) Automatically identify the image to be classified as a newly added category, and automatically expand the category of the trace image library.
如图3所示,一种开集痕迹图像分类系统包括预处理模块1、初始筛选模块2、痕迹图像库3、直接认定模块4、补录模块5和主观判决模块6。As shown in Figure 3, an open-set trace image classification system includes a preprocessing module 1, an initial screening module 2, a trace image library 3, a direct identification module 4, a supplementary recording module 5, and a subjective judgment module 6.
预处理模块1针对待分类图像进行预处理,并将预处理后的待分类图像传送给初始筛选模块2。The preprocessing module 1 performs preprocessing on the images to be classified, and sends the preprocessed images to the initial screening module 2 .
初始筛选模块2将预处理后的待分类图像与痕迹图像库3中痕迹图像的相似性筛选待分类图像的候选类别,并将该候选类别传送给直接认定模块4。The initial screening module 2 selects the candidate category of the image to be classified based on the similarity between the preprocessed image to be classified and the trace image in the trace image library 3 , and sends the candidate category to the direct identification module 4 .
直接认定模块4根据待分类图像与候选类别中排在第一名的痕迹图像及其对应的代表图像的相似性判定待分类图像为所属类别或是新增类别,若该模块不能直接给出该待分类图像的分类情况,则将待分类图像的候选类别传送给补录模块5,否则,将待分类图像的所属类别传送给痕迹图像库3进行更新。The direct identification module 4 determines whether the image to be classified belongs to a category or a newly added category according to the similarity between the image to be classified and the trace image ranked first in the candidate category and its corresponding representative image, if the module cannot directly give the For the classification status of the image to be classified, the candidate category of the image to be classified is sent to the supplementary recording module 5, otherwise, the category of the image to be classified is sent to the trace image library 3 for updating.
补录模块5根据多数表决原则或排在第一名和第二名痕迹图像之间的相似性落差判定条件判断待分类图像为所属类别,若该模块不能直接给出该待分类图像的分类情况,则将待分类图像的候选类别传送给主观判决模块6,否则,将待分类图像的所属类别传送给痕迹图像库3进行更新。The supplementary recording module 5 judges that the image to be classified belongs to the category according to the principle of majority voting or the similarity gap judgment condition between the first and second trace images, if the module cannot directly provide the classification of the image to be classified, The candidate category of the image to be classified is sent to the subjective decision module 6, otherwise, the category of the image to be classified is sent to the trace image database 3 for updating.
主观判决模块6将待分类图像的候选类别中的代表图像提供给用户,由其交互式判决待分类图像的所属类别,并将待分类图像的所属类别传送给痕迹图像库3进行更新。The subjective judgment module 6 provides representative images of the candidate categories of the images to be classified to the user, who interactively determines the categories of the images to be classified, and transmits the categories of the images to be classified to the trace image database 3 for updating.
痕迹图像库3将待分类图像补充到痕迹图像库3中,若为已有类别则补充到相应的类别,更新该类别的代表图像;否则为新增类别,设立新增类别,更新该类别的代表图像;同时,更新特征库、标记和痕迹图像数目。The trace image library 3 supplements the image to be classified into the trace image library 3, if it is an existing category, it is added to the corresponding category, and the representative image of this category is updated; otherwise, it is a new category, a new category is set up, and the category is updated. Represents an image; at the same time, updates the number of feature databases, markers, and trace images.
最后应当说明,以上实施例仅用以说明而非限制本发明的技术方案,本发明还可有其他多种实施例。在不脱离本发明精神及其实质的情况下,本领域的普通技术人员可以对本发明进行各种改动和变形。若这些改动和变形属于本发明权利要求及其等同技术的范围之内,则这些改动和变形都应属于本发明所附的权利要求的保护范围。Finally, it should be noted that the above embodiments are only used to illustrate rather than limit the technical solution of the present invention, and the present invention may also have other various embodiments. Those skilled in the art can make various changes and modifications to the present invention without departing from the spirit and essence of the present invention. If these changes and deformations fall within the scope of the claims of the present invention and their equivalent technologies, then these changes and deformations shall all belong to the protection scope of the appended claims of the present invention.
Claims (7)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510481067.6A CN105023025B (en) | 2015-08-03 | 2015-08-03 | An open-set trace image classification method and system |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510481067.6A CN105023025B (en) | 2015-08-03 | 2015-08-03 | An open-set trace image classification method and system |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN105023025A CN105023025A (en) | 2015-11-04 |
| CN105023025B true CN105023025B (en) | 2018-09-25 |
Family
ID=54412979
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201510481067.6A Active CN105023025B (en) | 2015-08-03 | 2015-08-03 | An open-set trace image classification method and system |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN105023025B (en) |
Families Citing this family (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105740850A (en) * | 2016-03-04 | 2016-07-06 | 北京小米移动软件有限公司 | Method and device for classifying photos |
| CN106326927B (en) * | 2016-08-24 | 2019-06-04 | 大连海事大学 | A new category detection method for shoe prints |
| CN107862344B (en) * | 2017-12-01 | 2021-06-11 | 中南大学 | Image classification method |
| CN109214411B (en) * | 2018-07-12 | 2024-08-27 | 杭州吉吉知识产权运营有限公司 | Verification method and system for identifying typical pictures to newly added entities based on training model |
| CN110674823A (en) * | 2019-09-26 | 2020-01-10 | 中国科学院声学研究所 | Sample library construction method based on automatic identification of deep sea large benthonic animals |
| CN113239804B (en) * | 2021-05-13 | 2023-06-02 | 杭州睿胜软件有限公司 | Image recognition method, readable storage medium, and image recognition system |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4887304A (en) * | 1987-09-30 | 1989-12-12 | Raytheon Company | Library image optimization |
| EP0357010A2 (en) * | 1988-09-01 | 1990-03-07 | E.I. Du Pont De Nemours And Company | Method of identifying spectra |
| CN1776717A (en) * | 2005-12-01 | 2006-05-24 | 上海交通大学 | Method for Shoeprint Recognition at Crime Scene |
| CN103207879A (en) * | 2012-01-17 | 2013-07-17 | 阿里巴巴集团控股有限公司 | Method and equipment for generating image index |
| CN103377376A (en) * | 2012-04-13 | 2013-10-30 | 阿里巴巴集团控股有限公司 | Method and system for image classification, and method and system for image retrieval |
| CN103839072A (en) * | 2013-12-31 | 2014-06-04 | 浙江工业大学 | False fingerprint detecting method based on naive Bayes classifiers |
| CN104331642A (en) * | 2014-10-28 | 2015-02-04 | 山东大学 | Integrated learning method for recognizing ECM (extracellular matrix) protein |
-
2015
- 2015-08-03 CN CN201510481067.6A patent/CN105023025B/en active Active
Patent Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4887304A (en) * | 1987-09-30 | 1989-12-12 | Raytheon Company | Library image optimization |
| EP0357010A2 (en) * | 1988-09-01 | 1990-03-07 | E.I. Du Pont De Nemours And Company | Method of identifying spectra |
| CN1776717A (en) * | 2005-12-01 | 2006-05-24 | 上海交通大学 | Method for Shoeprint Recognition at Crime Scene |
| CN103207879A (en) * | 2012-01-17 | 2013-07-17 | 阿里巴巴集团控股有限公司 | Method and equipment for generating image index |
| CN103377376A (en) * | 2012-04-13 | 2013-10-30 | 阿里巴巴集团控股有限公司 | Method and system for image classification, and method and system for image retrieval |
| CN103839072A (en) * | 2013-12-31 | 2014-06-04 | 浙江工业大学 | False fingerprint detecting method based on naive Bayes classifiers |
| CN104331642A (en) * | 2014-10-28 | 2015-02-04 | 山东大学 | Integrated learning method for recognizing ECM (extracellular matrix) protein |
Also Published As
| Publication number | Publication date |
|---|---|
| CN105023025A (en) | 2015-11-04 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN105023025B (en) | An open-set trace image classification method and system | |
| CN106226050B (en) | A kind of TFDS fault picture automatic identifying method based on convolutional neural networks | |
| Cheng et al. | Scene recognition with objectness | |
| CN106650806B (en) | A Collaborative Deep Network Model Method for Pedestrian Detection | |
| Wang et al. | Deep people counting in extremely dense crowds | |
| CN108171184B (en) | Method for re-identifying pedestrians based on Simese network | |
| CN104866829B (en) | A cross-age face verification method based on feature learning | |
| CN107230267B (en) | Intelligence In Baogang Kindergarten based on face recognition algorithms is registered method | |
| CN105138953B (en) | A method of action recognition in the video based on continuous more case-based learnings | |
| CN109948447B (en) | Discovery and evolution presentation method of character network relationship based on video image recognition | |
| CN106022300A (en) | Traffic sign identifying method and traffic sign identifying system based on cascading deep learning | |
| CN104376105B (en) | The Fusion Features system and method for image low-level visual feature and text description information in a kind of Social Media | |
| CN102054176B (en) | Method used for establishing semantic scene models for scene images of moving targets by utilizing computer | |
| CN103838835B (en) | A kind of network sensitive video detection method | |
| CN107133601A (en) | A kind of pedestrian's recognition methods again that network image super-resolution technique is resisted based on production | |
| CN112132014A (en) | Target re-identification method and system based on non-supervised pyramid similarity learning | |
| CN103336971B (en) | Target matching method between multiple-camera based on multiple features fusion and incremental learning | |
| CN106981063A (en) | A kind of grid equipment state monitoring apparatus based on deep learning | |
| CN103984943A (en) | Scene text identification method based on Bayesian probability frame | |
| CN109919060A (en) | A kind of identity card content identifying system and method based on characteristic matching | |
| CN110796131A (en) | Chinese character writing evaluation system | |
| CN110765285A (en) | Multimedia information content control method and system based on visual characteristics | |
| CN107688830A (en) | It is a kind of for case string and show survey visual information association figure layer generation method | |
| CN115760758A (en) | An intelligent identification and evaluation method for power grid infrastructure suitable for common quality problems of tower nuts | |
| CN110008899B (en) | Method for extracting and classifying candidate targets of visible light remote sensing image |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |