CN100462978C - An image retrieval method and system - Google Patents
An image retrieval method and system Download PDFInfo
- Publication number
- CN100462978C CN100462978C CNB2007100984941A CN200710098494A CN100462978C CN 100462978 C CN100462978 C CN 100462978C CN B2007100984941 A CNB2007100984941 A CN B2007100984941A CN 200710098494 A CN200710098494 A CN 200710098494A CN 100462978 C CN100462978 C CN 100462978C
- Authority
- CN
- China
- Prior art keywords
- image
- similarity
- standard sample
- retrieved
- retrieval
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims description 33
- 239000013598 vector Substances 0.000 claims description 38
- 238000005516 engineering process Methods 0.000 description 11
- 238000004364 calculation method Methods 0.000 description 10
- 238000010586 diagram Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 230000007812 deficiency Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- FFBHFFJDDLITSX-UHFFFAOYSA-N benzyl N-[2-hydroxy-4-(3-oxomorpholin-4-yl)phenyl]carbamate Chemical compound OC1=C(NC(=O)OCC2=CC=CC=C2)C=CC(=C1)N1CCOCC1=O FFBHFFJDDLITSX-UHFFFAOYSA-N 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Processing Or Creating Images (AREA)
Abstract
Description
技术领域 technical field
本发明涉及数据检索领域,特别涉及一种图像检索方法及系统。The invention relates to the field of data retrieval, in particular to an image retrieval method and system.
背景技术 Background technique
从20世纪70年代开始有关图像检索的研究就已经开始了,一种是基于图像文本标引的图像检索技术(TBIR,Text-based Image Retrieval),也是当前应用最为广泛、最为成熟的一种图像检索方法,主要是利用图像文本标引方式描述图像的特征,如图像说明、来源等。另一种是基于内容的图像检索(CBIR,Content-based Image Retrieval)技术,是代表图像检索技术未来的发展方向,主要是对图像的内容语义,如图像的颜色、纹理、布局等进行分析和检索的图像检索技术。随着计算机技术和通信技术的迅速发展,检索和浏览海量的包括数字图像在内的多媒体数据成为日益迫切的问题。Research on image retrieval has started since the 1970s. One is the image retrieval technology based on image text indexing (TBIR, Text-based Image Retrieval), which is also the most widely used and mature image retrieval technology. The retrieval method mainly uses the image text indexing method to describe the characteristics of the image, such as image description, source, etc. The other is content-based image retrieval (CBIR, Content-based Image Retrieval) technology, which represents the future development direction of image retrieval technology. Retrieval by Image Retrieval Techniques. With the rapid development of computer technology and communication technology, retrieving and browsing massive multimedia data including digital images has become an increasingly urgent problem.
发明人在发明过程中注意到:对于绝大多数采用基于图像文本标引的图像检索技术,其存在如下的不足:The inventor noticed in the process of invention that for most of the image retrieval technologies based on image text indexing, there are the following deficiencies:
1)文本标引本身没有客观统一的标准,带有一定的主观性,譬如对同一幅图像不同的“人”可能标引的文本信息可能不一样;1) There is no objective and unified standard for text indexing itself, and it has a certain degree of subjectivity. For example, different "persons" may index different text information for the same image;
2)通过图像文本标引信息检索,可能由于检索的文本关键词与检索的图像库中的图像文本标引不一致而导致检索失败,同时检索结果中可能存在大量的不相关的检索结果。即,由于标签与图像内容之间的关系是人为设置,因此标签与图像内容之间并不存在必然关联,不能检索出图像的真实内容。2) Through image text indexing information retrieval, the retrieval may fail due to the inconsistency between the retrieved text keywords and the image text index in the retrieved image database, and there may be a large number of irrelevant retrieval results in the retrieval results. That is, since the relationship between the tag and the image content is artificially set, there is no necessary correlation between the tag and the image content, and the real content of the image cannot be retrieved.
对于目前基于内容的图像检索技术,一般收集和加工图像资源,提取诸如颜色特征、纹理特征、形状特征等特征作为图像数据库图像索引,然后计算检索图像与图像库中的图像的相似度大小,提取出满足阈值的记录作为结果,按照相似度降序的方式输出。但发明人注意到其存在以下不足:For the current content-based image retrieval technology, generally collect and process image resources, extract features such as color features, texture features, shape features, etc. Records that meet the threshold are output as a result in descending order of similarity. But the inventor has noticed that it has the following deficiencies:
1)由于采用颜色直方图来计算图像间的相似性的方法比较简单,因此它不能反映图像中对象的空间特征;1) Since the method of using the color histogram to calculate the similarity between images is relatively simple, it cannot reflect the spatial characteristics of the objects in the image;
2)由于纹理描述比较困难,对纹理的检索都采用示例查询方式,多次交互,逐步求精,从而导致查询速度慢;2) Due to the difficulty of texture description, the retrieval of textures adopts the example query method, multiple interactions, and gradual refinement, resulting in slow query speed;
3)在采用基于形状特征进行检索时,用户通过勾勒图像的形状或轮廓,从图像库中检出形状类似的图像。基于此特征的检索方法有两种:i)分割图像经过边缘提取后,得到目标图像的轮廓线,针对这种轮廓线进行的形状特征检索。ii)直接针对图形寻找适当的矢量特征用于检索算法。显然,处理这种结构化检索更为复杂,需做更多的预处理等等,使得该方法不仅慢而且实现很难。3) When searching based on shape features, the user can detect images with similar shapes from the image library by outlining the shape or outline of the image. There are two retrieval methods based on this feature: i) After the segmented image is extracted through edge extraction, the contour line of the target image is obtained, and the shape feature retrieval is carried out for this contour line. ii) Find the appropriate vector features directly for the graph to be used in the retrieval algorithm. Obviously, it is more complicated to deal with this kind of structured retrieval, more preprocessing and so on are required, which makes this method not only slow but also difficult to implement.
由上述可见,以上技术存在:1)检索返回的查询结果不够精确;或者2)计算复杂,需要多次交互,查询速度慢。因而导致现有图像检索技术并没有在实际广泛应用。It can be seen from the above that the above technologies exist: 1) the query results returned by the retrieval are not accurate enough; or 2) the calculation is complex, requiring multiple interactions, and the query speed is slow. As a result, the existing image retrieval technology has not been widely used in practice.
发明内容 Contents of the invention
本发明提供一种图像检索方法及系统、装置,用以解决现有技术中存在的在图像检索时不能同时满足检索高效且检索结果准确的问题。The present invention provides an image retrieval method, system and device, which are used to solve the problems in the prior art that the retrieval efficiency and accurate retrieval results cannot be satisfied at the same time during image retrieval.
本发明提供了一种图像检索方法,包括如下步骤:The invention provides an image retrieval method, comprising the following steps:
比较需检索图像与标准样本图像集每一标准样本图像的第一相似度集;Comparing the first similarity set of each standard sample image between the image to be retrieved and the standard sample image set;
根据相似度集库中第二相似度集与所述第一相似度集的相似度高低,从图像库中检索出与所述第二相似度集对应的待检索图像,所述相似度集库是所述图像库中的每一待检索图像与所述标准样本图像集每一标准样本图像的第二相似度集组成的相似度集库。According to the degree of similarity between the second similarity set in the similarity set library and the first similarity set, retrieve the image to be retrieved corresponding to the second similarity set from the image library, and the similarity set library It is a similarity set library composed of each image to be retrieved in the image library and the second similarity set of each standard sample image in the standard sample image set.
较佳地,进一步包括如下步骤:Preferably, further comprising the following steps:
当向图像库添加待检索图像时,将所述添加的待检索图像与所述标准样本图像集每一标准样本图像的第二相似度集添加至相似度集库。When an image to be retrieved is added to the image database, a second similarity set between the added image to be retrieved and each standard sample image in the standard sample image set is added to the similarity set library.
较佳地,进一步包括如下步骤:Preferably, further comprising the following steps:
将图像相互之间相似度小的图像作为所述标准样本图像集中的标准样本图像。An image with a small similarity between images is used as a standard sample image in the standard sample image set.
较佳地,所述相似度是根据图像之间的特征向量来比较的。Preferably, the similarity is compared according to feature vectors between images.
本发明还提供了一种图像检索系统,包括:The present invention also provides an image retrieval system, comprising:
第一比较模块,用于比较需检索图像与标准样本图像集每一标准样本图像的第一相似度集;The first comparison module is used to compare the first similarity set of each standard sample image between the image to be retrieved and the standard sample image set;
第二比较模块,用于比较图像库中的每一待检索图像与所述标准样本图像集每一标准样本图像的第二相似度集;The second comparison module is used to compare the second similarity set between each image to be retrieved in the image library and each standard sample image in the standard sample image set;
存储模块,与所述第二比较模块相连,用于存储所述第二比较模块比较的第二相似度集组成的相似度集库;A storage module, connected to the second comparison module, for storing a similarity set library composed of the second similarity sets compared by the second comparison module;
检索模块,与第一比较模块、存储模块相连,用于根据所述第一比较模块比较的所述第一相似度集与所述存储模块中的所述相似度集库中第二相似度集的相似度高低,从图像库中检索出与所述第二相似度集对应的待检索图像。The retrieval module is connected with the first comparison module and the storage module, and is used for comparing the first similarity set compared with the first comparison module with the second similarity set in the similarity set library in the storage module The degree of similarity is high or low, and the image to be retrieved corresponding to the second similarity set is retrieved from the image database.
较佳地,进一步包括:Preferably, it further includes:
输出模块,与检索模块相连,用于输出所述检索模块检索从图像库中检索出的待检索图像。The output module is connected with the retrieval module, and is used for outputting the image to be retrieved from the image database retrieved by the retrieval module.
较佳地,进一步包括:Preferably, it further includes:
添加模块,与第二比较模块、存储模块相连,用于当向图像库添加待检索图像时,将所述第二比较模块比较的所述添加的待检索图像与所述标准样本图像集每一标准样本图像的第二相似度集添加至所述存储模块的相似度集库。The adding module is connected with the second comparison module and the storage module, and is used to compare the added image to be retrieved by the second comparison module with each of the standard sample image sets when adding the image to be retrieved to the image library The second similarity set of the standard sample image is added to the similarity set library of the storage module.
较佳地,进一步包括:Preferably, it further includes:
标准样本图像选取模块,用于将图像相互之间相似度小的图像作为所述标准样本图像集中的标准样本图像。The standard sample image selection module is configured to use images with a small similarity between images as standard sample images in the standard sample image set.
较佳地,进一步包括:Preferably, it further includes:
特征向量输入模块,用于输入图像的特征向量;Feature vector input module, for the feature vector of input image;
所述系统根据图像的特征向量之间的所述相似度来检索图像。The system retrieves images based on the similarity between feature vectors of the images.
本发明有益效果如下:The beneficial effects of the present invention are as follows:
由于本发明先比较图像库中的每一待检索图像与所述标准样本图像集每一标准样本图像的第二相似度集;并将比较的第二相似度集组成的相似度集库;这种采用事先确定的标准样本图像集作为图像特征度量尺度,从而具有较强的客观性、一致性;Because the present invention first compares the second similarity set of each standard sample image between each image to be retrieved in the image library and the standard sample image set; and the similarity set library formed by the second similarity set compared; A standard sample image set determined in advance is used as the image feature measurement scale, so it has strong objectivity and consistency;
由于在建立了相似度集库后,通过比较需检索图像与标准样本图像集每一标准样本图像的第一相似度集;再根据第一相似度集与相似度集库从图像库中检索待检索图像。从而通过引入标准样本图像集作为比较的中介,扩大了图像的特征空间,包含了更多地图像特征信息,改善了检索返回结果集的精度。同时,不需要将检索图像与图像库中的图像一一比较计算其相似度,而是采用引入标准样本图像集作为比较的中介,即事先计算好图像库中的图像与标准样本图像集的各个标准样本图像的相似度作为其索引向量,从而提高了检索速度。After the similarity set library is established, by comparing the first similarity set of each standard sample image between the image to be retrieved and the standard sample image set; Retrieve an image. Therefore, by introducing the standard sample image set as the intermediary of comparison, the feature space of the image is expanded, more image feature information is included, and the accuracy of the retrieval result set is improved. At the same time, it is not necessary to compare the retrieved images with the images in the image library to calculate their similarity, but to use the standard sample image set as the intermediary for comparison, that is, to calculate in advance the differences between the images in the image library and the standard sample image set. The similarity of the standard sample image is used as its index vector, which improves the retrieval speed.
进一步的,由于采用了确定图像特征向量的方法来进行相似度的检索,使得发明实施简单、高效、易于实施。Further, since the method of determining the feature vector of the image is used to retrieve the similarity, the invention is simple, efficient and easy to implement.
附图说明 Description of drawings
图1为本发明实施例中所述图像检索方法实施流程示意图;Fig. 1 is a schematic diagram of the implementation flow of the image retrieval method described in the embodiment of the present invention;
图2为本发明实施例中所述图像检索系统结构示意图;Fig. 2 is a schematic structural diagram of the image retrieval system described in the embodiment of the present invention;
图3为本发明实施例中所述图像检索流程实施示意图。Fig. 3 is a schematic diagram of the implementation of the image retrieval process described in the embodiment of the present invention.
具体实施方式 Detailed ways
下面结合附图对本发明的具体实施方式进行说明。Specific embodiments of the present invention will be described below in conjunction with the accompanying drawings.
本发明的目的是针对通常的图像检索方法实际在检索过程中实现复杂,检索速度较慢,返回的结果集精度欠佳,从而提出了一个能够改善检索性能的检索方法,并籍此提出相应的实现方法。实现简单,既提高了检索的速度,又改善了返回结果集的精度效果。The purpose of the present invention is to propose a retrieval method that can improve the retrieval performance in view of the fact that the usual image retrieval method is complex in the retrieval process, the retrieval speed is slow, and the accuracy of the returned result set is not good, and a corresponding retrieval method is proposed accordingly Implementation. The implementation is simple, which not only improves the retrieval speed, but also improves the accuracy of the returned result set.
图1为图像检索方法实施流程示意图,如图所示,检索中包括如下步骤:Figure 1 is a schematic diagram of the implementation process of the image retrieval method. As shown in the figure, the retrieval includes the following steps:
步骤101、将图像库中的每一待检索图像与标准样本图像集每一标准样本图像进行比较,得到每一待检索图像的第二相似度集;
优选实施中,在选择标准样本图像时,可以将图像之间的相似性作为根据,将图像相互之间相似度小的图像作为标准样本图像集中的标准样本图像,从而使得相似度集库中图像的特征向量更为精准地刻画出图像的特征,换句话说,相似度集库中图像的特征向量能够更好地表达出相似度集库中图像之间的相互差异,从而使得后续的检索精度变高。In a preferred implementation, when selecting a standard sample image, the similarity between images can be used as a basis, and an image with a small similarity between images is used as a standard sample image in the standard sample image set, so that the image in the similarity set library The eigenvector of the image can more accurately describe the characteristics of the image. In other words, the eigenvector of the image in the similarity set library can better express the mutual differences between the images in the similarity set library, so that the subsequent retrieval accuracy Becomes high.
步骤102、将每一个第二相似度集组成的相似度集库;
通过以上两个步骤的准备,建立了待检索图像的相似度集库,以后的检索过程中便可由以前对图像内容的检索变成了相似度的检索,从而避免了基于图像内容直接检索的弊端。Through the preparation of the above two steps, the similarity set library of the image to be retrieved is established, and the subsequent retrieval process can be changed from the previous retrieval of image content to similarity retrieval, thereby avoiding the disadvantages of direct retrieval based on image content .
实施中还可以视需要在向图像库添加待检索图像时,也将所添加的待检索图像与标准样本图像集每一标准样本图像的第二相似度集添加至相似度集库。In implementation, if necessary, when adding the image to be retrieved to the image database, the second similarity set between the added image to be retrieved and each standard sample image in the standard sample image set may also be added to the similarity set library.
步骤103、比较需检索图像与标准样本图像集每一标准样本图像的第一相似度集;
步骤104、根据第一相似度集与相似度集库从图像库中检索待检索图像。Step 104: Retrieve the image to be retrieved from the image library according to the first similarity set and the similarity set library.
优选实施中,根据所述相似度集库中第二相似度集与所述第一相似度集的相似度高低,从图像库中检索出与所述第二相似度集对应的待检索图像,由于通过相似度检索出来的待检索图像并非一一精确匹配,因此在输出检索结果时,可以根据实际的需要按相似性匹配的高低输出。In a preferred implementation, according to the degree of similarity between the second similarity set in the similarity set library and the first similarity set, the image to be retrieved corresponding to the second similarity set is retrieved from the image library, Since the images to be retrieved through similarity retrieval are not exactly matched one by one, when outputting the retrieval results, they can be output according to the level of similarity matching according to actual needs.
实施中,相似度可以根据图像之间的特征向量的匹配高低来得出相似度的高低,可以视需要选取图像特征向量的数量,需精度高则选取较多的特征向量。In implementation, the similarity can be obtained according to the matching level of feature vectors between images, and the number of image feature vectors can be selected as required, and more feature vectors can be selected if high precision is required.
下面再以一实例来阐述基于图像相似性比对的检索方法实施,包括以下步骤:Next, an example is used to illustrate the implementation of the retrieval method based on image similarity comparison, including the following steps:
1)确定图像库利用图像特征向量获得的与标准样本图像集中每一标准样本图像根据相似度组成的相似度集库。1) Determine the similarity set library formed by the similarity between each standard sample image in the standard sample image set and each standard sample image in the standard sample image set obtained by using the image feature vector.
实施例中,S表示图像为标准样本图像,[k]表示标准样本图像序号;D表示图像为图像库中待检索图像,[n]表示待检索图像序号,IS[i][j]表示第i标准样本图像与第j待检索图像之间的相似度,DX[i]=(IS[i][1],IS[i][2],...,IS[i][K])为第i标准样本图像的相似度集(第二相似度集)。In the embodiment, S indicates that the image is a standard sample image, [k] indicates the serial number of the standard sample image; D indicates that the image is an image to be retrieved in the image database, [n] indicates the serial number of the image to be retrieved, and IS[i][j] indicates the number of the image to be retrieved. The similarity between the i standard sample image and the jth image to be retrieved, DX[i]=(IS[i][1], IS[i][2], ..., IS[i][K]) is the similarity set (second similarity set) of the i-th standard sample image.
输入标准样本图像(S[1],S[2],...,S[K])和图像库图像(D[1],D[2],...,D[n]),图像库图像逐一分别同标准样本图像进行相似性比对,分别计算得到它们之间的相似度IS[i][j],如下表所示:Input standard sample images (S[1], S[2], ..., S[K]) and image library images (D[1], D[2], ..., D[n]), images The library images are compared with the standard sample images one by one, and the similarity IS[i][j] between them is calculated respectively, as shown in the following table:
于是,得到图像库待检索图像D[i]相对标准样本图像(S[1],S[2],...,S[K])的相似特征向量集DX[i]=(IS[i][1],IS[i][2],...,IS[i][K])。Thus, the similar feature vector set DX[i]=(IS[i ][1], IS[i][2], ..., IS[i][K]).
2)重复过程1)计算并保存所有图像库图像相对标准样本图像(S[1],S[2],...,S[K])的特征向量后,组成相似度集库。2) After repeating the process 1) to calculate and save the feature vectors of all image library images relative to the standard sample images (S[1], S[2], ..., S[K]), a similarity set library is formed.
3)利用需检索图像特征向量确定相似度集(第一相似度集)。3) Determine the similarity set (first similarity set) by using the feature vector of the image to be retrieved.
输入标准样本图像(S[1],S[2],...,S[K])和需检索图像KEY,需检索图像分别同标准样本图像进行相似性比对,分别计算得到它们之间的初步相似度KS[j],于是,得到需检索图像KEY相对标准样本图像(S[1],S[2],...,S[K])的相似特征向量的相似度集KEYX=(KS[1],KS[2],...,KS[K])。Input the standard sample image (S[1], S[2], ..., S[K]) and the image to be retrieved KEY, the image to be retrieved is compared with the standard sample image for similarity, and the relationship between them is calculated respectively. The preliminary similarity KS[j], so, obtain the similarity set KEYX=of the similar feature vector of the image KEY to be retrieved relative to the standard sample image (S[1], S[2], . (KS[1], KS[2], ..., KS[K]).
4)进行图像检索计算。4) Carry out image retrieval calculation.
分别计算需检索图像特征向量KEYX与图像库相似度集库中图像特征向量DX[i]之间的相似度FS[i]作为其最终的相似度。假设初步相似度KS[j]与IS[i][j]的取值范围为0-100,相似度FS[i]的计算方法可以这样定义,FS[i]=100-(ABS(KS[1]-IS[i][1])+ABS(KS[2]-IS[i][2])+...+ABS(KS[K]-IS[i][K]))/K,其中ABS(a)表示a的绝对值计算。Calculate the similarity FS[i] between the image feature vector KEYX to be retrieved and the image feature vector DX[i] in the image library similarity set as the final similarity. Assuming that the value range of preliminary similarity KS[j] and IS[i][j] is 0-100, the calculation method of similarity FS[i] can be defined as follows, FS[i]=100-(ABS(KS[ 1]-IS[i][1])+ABS(KS[2]-IS[i][2])+...+ABS(KS[K]-IS[i][K]))/K , where ABS(a) represents the absolute value calculation of a.
具体实施中,基于利用需检索图像获得的相似度集与相似性集库来进行检索的原理实施方式很多,因此,本实施例提供了相似度FS[i]的计算方法但易知并不仅限于此方式。In the specific implementation, there are many implementations based on the principle of using the similarity set and the similarity set library obtained from the images to be retrieved. Therefore, this embodiment provides a calculation method for the similarity FS[i], but it is easy to know that it is not limited to this way.
5)输出查询结果。5) Output query results.
FS[i]按倒序排序,最前面的结果即为最相似检索输出结果。FS[i] is sorted in reverse order, and the top result is the most similar retrieval output result.
实施中,在进行图像检索前,应先确定好所有图像库图像相对标准样本图像相似特征向量,即可利用图像库图像相对标准样本图像的相似特征向量与检索图像相对标准样本图像的相似特征向量之间的相似度得到最相似的检索输出结果。In the implementation, before image retrieval, the similar feature vectors of all images in the image library relative to the standard sample images should be determined first, and the similar feature vectors of the image library images relative to the standard sample images and the similar feature vectors of the retrieved images relative to the standard sample images can be used The similarity between them gets the most similar retrieval output.
为了增强检索效果,优选实施中可以采取如下一些辅助措施:In order to enhance the retrieval effect, the following auxiliary measures can be taken in the preferred implementation:
1)选择标准样本图像集。选取标准样本图像时,将图像相互之间相似度小的图像作为标准样本图像集中的标准样本图像,尽可能地选择那些各张图像之间存在比较大的差异作为标准样本图像。譬如:可以从人物图像、山水图像等不同类别图像中各选择一张作为样本图像。1) Select a standard sample image set. When selecting standard sample images, images with small similarities between images are used as standard sample images in the standard sample image set, and those images with relatively large differences between each image are selected as standard sample images as much as possible. For example, one can be selected as a sample image from images of different categories such as person images and landscape images.
2)选择图像与图像之间初步相似度计算方法。选择初步相似度计算方法时,可以选择那些计算方法能够反映图像之间差异,并具有较强的客观性和一致性。2) Select a preliminary similarity calculation method between images. When choosing a preliminary similarity calculation method, you can choose those calculation methods that can reflect the differences between images and have strong objectivity and consistency.
本发明还提供了一种图像检索系统,图2为图像检索系统结构示意图,如图所示,图像检索系统中包括:第一比较模块、第二比较模块、存储模块、检索模块,其中:The present invention also provides an image retrieval system. FIG. 2 is a schematic structural diagram of the image retrieval system. As shown in the figure, the image retrieval system includes: a first comparison module, a second comparison module, a storage module, and a retrieval module, wherein:
第一比较模块用于比较需检索图像与标准样本图像集每一标准样本图像的第一相似度集;The first comparison module is used to compare the first similarity set of each standard sample image between the image to be retrieved and the standard sample image set;
第二比较模块用于比较图像库中的每一待检索图像与所述标准样本图像集每一标准样本图像的第二相似度集;The second comparison module is used to compare the second similarity set between each image to be retrieved in the image database and each standard sample image in the standard sample image set;
存储模块与所述第二比较模块相连,用于存储所述第二比较模块比较的第二相似度集组成的相似度集库;The storage module is connected to the second comparison module, and is used to store the similarity set library composed of the second similarity sets compared by the second comparison module;
检索模块与第一比较模块、存储模块相连,用于根据所述第一比较模块比较的所述第一相似度集与所述存储模块中的所述相似度集库从图像库中检索待检索图像。The retrieval module is connected with the first comparison module and the storage module, and is used for retrieving from the image library to be retrieved according to the first similarity set compared by the first comparison module and the similarity set library in the storage module. image.
优选实施中还可以进一步包括:In the preferred implementation, it can further include:
输出模块,与检索模块相连,用于输出所述检索模块从图像库中检索出与所述第二相似度集对应的待检索图像。The output module is connected with the retrieval module, and is used for outputting the image to be retrieved corresponding to the second similarity set retrieved by the retrieval module from the image database.
添加模块,与第二比较模块、存储模块相连,用于当向图像库添加待检索图像时,将所述第二比较模块比较的所述添加的待检索图像与所述标准样本图像集每一标准样本图像的第二相似度集添加至所述存储模块的相似度集库。The adding module is connected with the second comparison module and the storage module, and is used to compare the added image to be retrieved by the second comparison module with each of the standard sample image sets when adding the image to be retrieved to the image library The second similarity set of the standard sample image is added to the similarity set library of the storage module.
标准样本图像选取模块,用于将图像相互之间相似度小的图像作为所述标准样本图像集中的标准样本图像。The standard sample image selection module is configured to use images with a small similarity between images as standard sample images in the standard sample image set.
优选实施中,还可以进一步包括:In a preferred implementation, it may further include:
特征向量输入模块,用于输入图像的特征向量;图像检索系统根据图像的特征向量之间的所述相似度来检索图像。The feature vector input module is used to input the feature vectors of the images; the image retrieval system retrieves the images according to the similarity between the feature vectors of the images.
图3为图像检索流程实施示意图,简要地,如图所示,图像检索可按以下基本流程来实施:Figure 3 is a schematic diagram of the implementation of the image retrieval process. Briefly, as shown in the figure, the image retrieval can be implemented according to the following basic processes:
步骤301、新增图像库待检索图像;Step 301, adding images to be retrieved in the image library;
步骤302、计算待检索特征向量并添加;Step 302, calculating and adding feature vectors to be retrieved;
步骤303、图像库待检索图像特征向量;Step 303, the feature vector of the image to be retrieved in the image library;
步骤304、输入需检索图像;Step 304, input the image to be retrieved;
步骤305、计算需检索图像特征向量;Step 305, calculating the feature vector of the image to be retrieved;
步骤306、利用图像库图像特征向量进行图像检索计算;Step 306, using the image feature vector of the image library to perform image retrieval calculation;
步骤307、输出查询结果。Step 307, output the query result.
下面结合实例对上述实施进行说明。The above implementation will be described below in combination with examples.
假设现在有一个拥有100万张图像的图像库。Suppose now there is an image library with 1 million images.
首先,选取5张标准样本图像(S[1],S[2],...,S[5])作为标准样本图像集。First, 5 standard sample images (S[1], S[2], ..., S[5]) are selected as the standard sample image set.
其次,分别计算图像库100万张待检索图像相对5张标准样本图像的相似特征向量DX[i]=(IS[i][1],IS[i][2],...,IS[i][5]),i=1,2,...,1000000。Secondly, calculate the similar feature vector DX[i]=(IS[i][1],IS[i][2],...,IS[ i][5]), i=1, 2, . . . , 1000000.
再次,计算需检索图像KEY相对5张标准样本图像的相似特征向量KEYX=(KS[1],KS[2],...,KS[5])。Again, calculate the similar feature vector KEYX=(KS[1], KS[2], . . . , KS[5]) of the image KEY to be retrieved relative to the five standard sample images.
然后,分别计算需检索图像特征向量KEYX与图像库图像特征向量DX[i]之间的最终相似度FS[i],i=1,2,...,1000000。Then, calculate the final similarity FS[i] between the feature vector KEYX of the image to be retrieved and the feature vector DX[i] of the image database, i=1, 2, . . . , 1000000.
最后,FS[i](i=1,2,...,1000000)按倒序排序,最前面的结果(譬如前10个)即为最相似检索输出结果。Finally, FS[i] (i=1, 2, . . . , 1000000) is sorted in reverse order, and the top results (for example, the top 10) are the most similar retrieval output results.
通过本发明所述的方案,能够在现有技术基础上非常有效地获得较快速度与较精确的的图像检索效果。同时本发明提供的基于图像相似性比对地检索方案,可以根据要求的不同,可以采用不同的初步相似度计算方法来获得所需的检索效果。实现简单,易于操作。Through the solution described in the present invention, a faster and more accurate image retrieval effect can be obtained very effectively on the basis of the prior art. At the same time, in the retrieval scheme based on image similarity comparison provided by the present invention, different preliminary similarity calculation methods can be used to obtain the desired retrieval effect according to different requirements. Simple to implement and easy to operate.
由上述实施例可见,本发明:1)提出了一种新的图像特征提取方法,采用事先确定的标准样本图像集作为图像特征(向量)度量尺度,具有较强的客观性、一致性;2)确定图像特征(向量)的方法比较简单,不需要将检索图像与图像库中的图像一一比较计算其相似度,而是采用引入标准样本图像集作为比较的中介,即事先计算好图像库中的图像与标准样本图像集的各个标准样本图像的相似度作为其索引向量,提高了检索速度;3)由于引入标准样本图像集作为比较的中介,扩大了图像的特征空间,包含了更多地图像特征信息,改善了检索返回结果集的精度。As can be seen from the foregoing embodiments, the present invention: 1) proposes a new image feature extraction method, adopts a standard sample image set determined in advance as the image feature (vector) measurement scale, which has strong objectivity and consistency; 2 ) to determine the image features (vectors) is relatively simple. It is not necessary to compare the retrieved image with the images in the image library to calculate their similarity one by one, but to introduce a standard sample image set as a comparison intermediary, that is, to calculate the image library in advance The similarity between the image in the image and each standard sample image in the standard sample image set is used as its index vector, which improves the retrieval speed; 3) Due to the introduction of the standard sample image set as an intermediary for comparison, the feature space of the image is expanded, including more The feature information of the map improves the accuracy of the retrieval result set.
由上述实施还可知,由于现有的各种图像检索方法普遍存在对于大数据量图像库的检索返回结果集精确度不高、速度慢,很难到达实际应用要求。因此本发明在现有的图像检索方法基础上提出了一种基于图像相似性比对的检索方案。采用本发明所述的方案,能够有效的改善图像检索返回结果集的精确度,同时提高图像的检索速度。此外还可以应用于两个或多个图像库之间的图像相似性自动匹配以用作更多的检索运用。It can also be seen from the above implementation that, due to the ubiquity of various existing image retrieval methods, the accuracy and speed of the returned result sets of large-scale image database retrieval are not high, and it is difficult to meet the actual application requirements. Therefore, the present invention proposes a retrieval scheme based on image similarity comparison on the basis of existing image retrieval methods. By adopting the scheme of the present invention, the accuracy of the returned result set of image retrieval can be effectively improved, and at the same time, the image retrieval speed can be increased. In addition, it can also be applied to automatic matching of image similarities between two or more image databases for more retrieval applications.
显然,本领域的技术人员可以对本发明进行各种改动和变型而不脱离本发明的精神和范围。这样,倘若本发明的这些修改和变型属于本发明权利要求及其等同技术的范围之内,则本发明也意图包含这些改动和变型在内。Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from the spirit and scope of the present invention. Thus, if these modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalent technologies, the present invention also intends to include these modifications and variations.
Claims (9)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB2007100984941A CN100462978C (en) | 2007-04-18 | 2007-04-18 | An image retrieval method and system |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CNB2007100984941A CN100462978C (en) | 2007-04-18 | 2007-04-18 | An image retrieval method and system |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN101030230A CN101030230A (en) | 2007-09-05 |
| CN100462978C true CN100462978C (en) | 2009-02-18 |
Family
ID=38715572
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CNB2007100984941A Expired - Fee Related CN100462978C (en) | 2007-04-18 | 2007-04-18 | An image retrieval method and system |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN100462978C (en) |
Families Citing this family (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101739428B (en) * | 2008-11-10 | 2012-04-18 | 中国科学院计算技术研究所 | A method of indexing multimedia |
| CN101859326B (en) * | 2010-06-09 | 2012-04-18 | 南京大学 | Image searching method |
| CN102902826B (en) * | 2012-11-08 | 2016-07-06 | 公安部第三研究所 | A kind of image method for quickly retrieving based on reference picture index |
| CN106033443B (en) * | 2015-03-16 | 2019-05-07 | 北京大学 | An extended query method and device in vehicle retrieval |
| CN106528568A (en) * | 2015-09-11 | 2017-03-22 | 中兴通讯股份有限公司 | Method and system for information supply |
| US10405003B2 (en) * | 2017-01-20 | 2019-09-03 | Google Llc | Image compression based on semantic relevance |
| CN108805148B (en) * | 2017-04-28 | 2022-01-11 | 富士通株式会社 | Method of processing image and apparatus for processing image |
| CN112163112A (en) * | 2020-07-07 | 2021-01-01 | 北京国电通网络技术有限公司 | Image processing method and device in image sample database |
| CN112668632B (en) * | 2020-12-25 | 2022-04-08 | 浙江大华技术股份有限公司 | Data processing method and device, computer equipment and storage medium |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5586197A (en) * | 1993-09-02 | 1996-12-17 | Canon Kabushiki Kaisha | Image searching method and apparatus thereof using color information of an input image |
| US20020110276A1 (en) * | 2000-12-12 | 2002-08-15 | Nec Corporation | Image search system and image search method thereof |
| CN1551017A (en) * | 2003-05-08 | 2004-12-01 | ���ְ�˹��ʽ���� | Image searching device, method, programmme and storage medium storing said programme |
| US20050041863A1 (en) * | 2003-08-19 | 2005-02-24 | Ray Ajoy K. | Enabling content-based search of objects in an image database with reduced matching |
| CN1851703A (en) * | 2006-05-10 | 2006-10-25 | 南京大学 | Active semi-monitoring-related feedback method for digital image search |
-
2007
- 2007-04-18 CN CNB2007100984941A patent/CN100462978C/en not_active Expired - Fee Related
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5586197A (en) * | 1993-09-02 | 1996-12-17 | Canon Kabushiki Kaisha | Image searching method and apparatus thereof using color information of an input image |
| US20020110276A1 (en) * | 2000-12-12 | 2002-08-15 | Nec Corporation | Image search system and image search method thereof |
| CN1551017A (en) * | 2003-05-08 | 2004-12-01 | ���ְ�˹��ʽ���� | Image searching device, method, programmme and storage medium storing said programme |
| US20050041863A1 (en) * | 2003-08-19 | 2005-02-24 | Ray Ajoy K. | Enabling content-based search of objects in an image database with reduced matching |
| CN1851703A (en) * | 2006-05-10 | 2006-10-25 | 南京大学 | Active semi-monitoring-related feedback method for digital image search |
Also Published As
| Publication number | Publication date |
|---|---|
| CN101030230A (en) | 2007-09-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN100462978C (en) | An image retrieval method and system | |
| CN108280187B (en) | Hierarchical image retrieval method based on depth features of convolutional neural network | |
| CN111914107B (en) | Instance retrieval method based on multi-channel attention area expansion | |
| CN102902826B (en) | A kind of image method for quickly retrieving based on reference picture index | |
| CN114461836A (en) | Cross-modal retrieval method for image-text | |
| CN106033426A (en) | Image retrieval method based on latent semantic minimum hash | |
| CN103810252A (en) | Image retrieval method based on group sparse feature selection | |
| CN104317867B (en) | System for carrying out entity clustering on webpage pictures returned by search engine | |
| CN110110120B (en) | Image retrieval method and device based on deep learning | |
| Zou et al. | aware representation for sketch-based 3d model retrieval | |
| CN104142978B (en) | A kind of image indexing system and method based on multiple features and rarefaction representation | |
| Manisha et al. | Content-based image retrieval through semantic image segmentation | |
| CN106611016A (en) | Image retrieval method based on decomposable word pack model | |
| CN118779458B (en) | A sensitive information analysis and identification method, system, device and readable storage medium | |
| Minarno et al. | Batik image retrieval using convolutional autoencoder | |
| CN119719462A (en) | Legal case generation method and device based on large model driving | |
| CN107423294A (en) | A kind of community image search method and system | |
| CN106547825B (en) | 3D Model Retrieval Method Based on Global Features and Sub-block Features | |
| CN107491814B (en) | A Process Case Hierarchical Knowledge Model Construction Method for Knowledge Push | |
| Li et al. | A novel approach to remote sensing image retrieval with multi-feature VP-tree indexing and online feature selection | |
| CN110162654A (en) | It is a kind of that image retrieval algorithm is surveyed based on fusion feature and showing for search result optimization | |
| CN103198117B (en) | Content-based image spurious correlation method for reordering | |
| KR20230045866A (en) | Apparatus for searching video | |
| CN102831216B (en) | Image retrieval system and method for maintaining geometrical shape information of object | |
| CN108959650A (en) | Image search method based on symbiosis SURF feature |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| CF01 | Termination of patent right due to non-payment of annual fee | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20090218 Termination date: 20170418 |