[go: up one dir, main page]

CN103440262A - Image Retrieval System and Method Based on Correlation Feedback and Bag-of-Features - Google Patents

Image Retrieval System and Method Based on Correlation Feedback and Bag-of-Features Download PDF

Info

Publication number
CN103440262A
CN103440262A CN201310330706XA CN201310330706A CN103440262A CN 103440262 A CN103440262 A CN 103440262A CN 201310330706X A CN201310330706X A CN 201310330706XA CN 201310330706 A CN201310330706 A CN 201310330706A CN 103440262 A CN103440262 A CN 103440262A
Authority
CN
China
Prior art keywords
image
feature
features
retrieval
images
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310330706XA
Other languages
Chinese (zh)
Inventor
罗笑南
姜涛
肖剑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sun Yat Sen University
Institute of Dongguan of Sun Yat Sen University
Original Assignee
Sun Yat Sen University
Institute of Dongguan of Sun Yat Sen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sun Yat Sen University, Institute of Dongguan of Sun Yat Sen University filed Critical Sun Yat Sen University
Priority to CN201310330706XA priority Critical patent/CN103440262A/en
Publication of CN103440262A publication Critical patent/CN103440262A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an image retrieval system and method based on relevant feedback and Bag-of-Features, the system includes: the characteristic extraction module is used for preprocessing the images and extracting the local characteristics of each image; the characteristic dictionary generating module is used for finding out key characteristics from the whole image database and forming a dictionary set; the frequency characteristic vector generating module is used for constructing a characteristic vector for each image; the characteristic weighting module is used for generating weight for each key characteristic in the dictionary, and then multiplying the weight by the corresponding component in the frequency characteristic vector to construct a weighted characteristic vector for each image; the similarity measurement module is used for calculating the similarity between the two images; and the related feedback module is used for enabling the user to participate in the retrieval process. By implementing the invention, the efficiency and the accuracy of image retrieval are improved.

Description

基于相关反馈和Bag-of-Features的图像检索系统及方法Image Retrieval System and Method Based on Correlation Feedback and Bag-of-Features

技术领域technical field

本发明涉及计算机领域,具体涉及一种基于相关反馈和Bag-of-Features的图像检索方法。The invention relates to the field of computers, in particular to an image retrieval method based on correlation feedback and Bag-of-Features.

背景技术Background technique

随着信息科技的发展,各个领域各种图像越来越多,如何从大量图像数据中快速而准确地检索出相关图像逐渐成为人们的关注热点。近些年,在工业应用界和学术研究界,大规模图像检索日益受到广泛重视,不断提出各种图像检索的方法,包括基于文本的图像检索、基于分类的图像检索等等。With the development of information technology, there are more and more images in various fields. How to quickly and accurately retrieve relevant images from a large amount of image data has gradually become a hot spot of attention. In recent years, large-scale image retrieval has received increasing attention in the industrial application and academic research circles, and various image retrieval methods have been proposed, including text-based image retrieval, classification-based image retrieval, and so on.

基于文本的图像检索沿用了传统文本检索技术,回避对图像可视化元素的分析,而是从图像名称、图像尺寸、压缩类型、作者、年代等方面标引图像,一般以关键词来查询图像,或者是根据等级目录的形式浏览查找特定类目下的图像。另外,图像所在页面的主题、图像的文件名称、与图像密切环绕的文字内容、图像链接地址等都被用作为图像分析的一句,根据这些文本分析结果推断其中图像的特征。Text-based image retrieval follows the traditional text retrieval technology, avoids the analysis of image visualization elements, but indexes images from image name, image size, compression type, author, year, etc., and generally uses keywords to query images, or It is to browse and find images under a specific category in the form of a hierarchical directory. In addition, the theme of the page where the image is located, the file name of the image, the text content closely surrounding the image, and the link address of the image are all used as a sentence for image analysis, and the characteristics of the image in it are inferred based on the text analysis results.

基于分类的图像检索是一种利用图像分类进行检索的技术。该技术需要对数据库中的图像进行明确的分类,并为每个类别选取出最具代表性的一些图像。用户输入一张查询图像时,系统将该图像与数据库中每个类别的代表图像进行相似度量,从而确定该查询图像所属的类别,然后将该类别的所有图像作为检索结果返回给用户。Classification-based image retrieval is a technique that utilizes image classification for retrieval. The technique requires explicit classification of images in a database and selection of some of the most representative images for each category. When the user inputs a query image, the system measures the similarity between the image and the representative images of each category in the database to determine the category to which the query image belongs, and then returns all images of the category to the user as retrieval results.

基于文本的图像检索方法虽然方便快捷,根据关键字就可以快速查询到所需图像,但是该检索方法完全脱离了图像的可视化内容,仅依靠与其关联的关键字,需要预先为所有图像进行文本标识,并且标识的准确性直接影响检索的准确性。而基于分类的图像检索方法依赖于数据库中图像的分类,目前并没有准确有效地分类各种图像。这些传统图像检索方法的弊端日益突出,人们亟待更新更有效地方法,关于图像检索方法的研究继续前行。Although the text-based image retrieval method is convenient and quick, and the desired image can be quickly queried according to the keyword, this retrieval method is completely separated from the visual content of the image, and only relies on the associated keyword, which requires text identification for all images in advance. , and the accuracy of identification directly affects the accuracy of retrieval. However, classification-based image retrieval methods rely on the classification of images in the database, and currently do not classify various images accurately and effectively. The disadvantages of these traditional image retrieval methods are becoming more and more prominent, and people urgently need to update more effective methods, and the research on image retrieval methods continues to move forward.

由于传统的图像检索方法存在各种弊端,已逐渐不能满足人们的需求。人们提出了一种不同的解决方案:基于内容的图像检索方法。基于内容的图像检索是使用图像的可视特征对图像进行检索,它提取图像的低层特征,包括颜色、形状、纹理等,然后将查询图像的低层特征与数据库中的特征进行比较,找出与查询特征相似的图像返回给用户。Due to various drawbacks in the traditional image retrieval methods, it has gradually been unable to meet people's needs. A different solution has been proposed: content-based image retrieval methods. Content-based image retrieval is to use the visual features of images to retrieve images. It extracts the low-level features of images, including color, shape, texture, etc., and then compares the low-level features of the query image with the features in the database to find out the relevant features. Images with similar query features are returned to the user.

基于内容的图像检索方法虽然是脱离了图像文本标注,而对图像的可视化内容进行检索,但是该方法只是提取图像的颜色、形状、纹理等低层特征,无法表示图像的高层语义内容,因此检索结果往往差强人意。Although the content-based image retrieval method breaks away from the image text annotation and retrieves the visual content of the image, this method only extracts low-level features such as color, shape, and texture of the image, and cannot represent the high-level semantic content of the image. Therefore, the retrieval results More often than not.

发明内容Contents of the invention

本发明提供了一种基于相关反馈和Bag-of-Features的图像检索方法,采用Bag-of-Features方式,将每一张图像看作是一些局部特征的集合,并挑选出图像库中的关键特征,然后基于关键特征集合将每一张图像表示成一个向量,这样通过向量的比较来实现图像检索的目的,这种方式简洁有效。同时,为了兼顾图像的高层语义,结合相关反馈的方法,让用户参与检索的过程,用户对每次检索结果进行正相关和负相关的标示,系统根据用户的反馈重新调整检索参数,以此迭代,最终得到用户满意的检索结果。The present invention provides an image retrieval method based on correlation feedback and Bag-of-Features, adopts the Bag-of-Features method, regards each image as a collection of some local features, and selects the key points in the image library features, and then represent each image as a vector based on the key feature set, so that the purpose of image retrieval can be achieved through the comparison of vectors, which is concise and effective. At the same time, in order to take into account the high-level semantics of the image, combined with the method of relevant feedback, let the user participate in the retrieval process, the user marks the positive correlation and negative correlation of each retrieval result, and the system readjusts the retrieval parameters according to the user's feedback, so as to iterate , and finally get the user-satisfied retrieval results.

相应的,本发明实施例提供了一种基于相关反馈和Bag-of-Features的图像检索系统,包括:Correspondingly, an embodiment of the present invention provides an image retrieval system based on relevant feedback and Bag-of-Features, including:

特征提取模块,用于对图像进行预处理,提取出每张图像的局部特征;The feature extraction module is used to preprocess the image and extract the local features of each image;

特征词典生成模块,用于从整个图像数据库中找出关键性特征,并组成一个词典集合;The feature dictionary generation module is used to find key features from the entire image database and form a dictionary set;

频率特征向量生成模块,用于为每张图像构建一个特征向量;A frequency eigenvector generation module is used to construct a eigenvector for each image;

特征加权模块,用于为词典中每个关键特征生成权重,然后用该权重乘以频率特征向量中对应的分量,为每张图像构建出带权特征向量;The feature weighting module is used to generate a weight for each key feature in the dictionary, and then multiplies the weight by the corresponding component in the frequency feature vector to construct a weighted feature vector for each image;

相似性度量模块,用于计算两张图像之间的相似性;Similarity measurement module for calculating the similarity between two images;

相关反馈模块,用于让用户参与到检索过程,在用户输入查询条件后,检索系统返回查询结果,然后用户对查询结果进行筛选,认为有用的就标识成正相关,无用的标识成负相关,系统根据用户的反馈,重新调整查询条件进行检索,以此循环,直到得到用户满意的结果。The relevant feedback module is used to allow users to participate in the retrieval process. After the user enters the query conditions, the retrieval system returns the query results, and then the user screens the query results, and marks the useful ones as positive correlations, and the useless ones as negative correlations. According to the user's feedback, readjust the query conditions for retrieval, and repeat this cycle until the user is satisfied with the result.

相应的,本发明实施例还提供了一种基于相关反馈和Bag-of-Features的图像检索方法,包括:Correspondingly, the embodiment of the present invention also provides an image retrieval method based on relevant feedback and Bag-of-Features, including:

步骤一、对每张图像进行特征提取,找出局部特征,并将其用SIFT算子表示;Step 1, perform feature extraction on each image, find out local features, and express it with SIFT operator;

步骤二、将所有图像的局部特征集合在一起,采用K-means聚类的方式,生成指定数量的关键特征,组成特征词典;Step 2. Collect the local features of all the images together, and use the K-means clustering method to generate a specified number of key features to form a feature dictionary;

步骤三、对每张图像,依次将其每个局部特征分配给最近邻的关键特征,表示关键特征的频数,这样基于特征词典,可以为每张图像生成一个频率特征向量;Step 3. For each image, assign each local feature to the key feature of the nearest neighbor in turn, indicating the frequency of the key feature, so that based on the feature dictionary, a frequency feature vector can be generated for each image;

步骤四、统计出每个关键特征出现的图像数,即在多少张图像中出现过,然后除以图像总数,得到关键特征的IDF值,作为关键特征的权重;Step 4, count the number of images in which each key feature appears, that is, how many images it has appeared in, and then divide it by the total number of images to obtain the IDF value of the key feature as the weight of the key feature;

步骤五、对每张图像,将其特征向量中的每个分量乘以对应关键特征的权重,得到带权特征向量;Step 5. For each image, multiply each component in its feature vector by the weight of the corresponding key feature to obtain a weighted feature vector;

步骤六、算查询图像的向量与数据中图像向量的相似性,并按相似程度从高到低的顺序排序输出;Step 6, calculate the similarity between the vector of the query image and the image vector in the data, and sort the output in order of similarity from high to low;

步骤七、用户检查检索结果,如果满足要求,则结束;否则进入步骤八;Step 7. The user checks the search results, if the requirements are met, then end; otherwise, go to step 8;

步骤八、用户对检索结果不满意,则对检索结果进行正相关和负相关的标注,然后重新输入给检索系统;Step 8. If the user is not satisfied with the search result, he will mark the search result with positive correlation and negative correlation, and then re-input it to the retrieval system;

步骤九、系统根据用户的反馈,重新调整检索条件,进入步骤六。Step 9. The system readjusts the retrieval conditions according to the user's feedback, and proceeds to step 6.

本发明具有如下有益效果,本发明是将Bag-of-Features和相关反馈两种方式完美地结合在一起,提高了图像检索的效率和准确性。首先采用Bag-of-Features方式,将图像表示成特征的集合,进而表示成一个特征向量,这种量化的方式使得图像的表示和相似性计算变得方便简捷。同时,相关反馈的方式也让用户充分参与到检索过程,避免了图像高层语义的丢失。Bag-of-Features和相关反馈两种方式的结合,大大简化了图像的表示和相似性比较,同时也兼顾了图像的低层可视化特征和高层语义内容,使得图像检索更加简单、更加准确。The present invention has the following beneficial effects. The present invention perfectly combines the Bag-of-Features and related feedback methods to improve the efficiency and accuracy of image retrieval. First, the Bag-of-Features method is used to represent the image as a collection of features, and then as a feature vector. This quantization method makes image representation and similarity calculation convenient and simple. At the same time, the way of relevant feedback also allows users to fully participate in the retrieval process, avoiding the loss of high-level semantics of images. The combination of Bag-of-Features and relevant feedback greatly simplifies the image representation and similarity comparison, and also takes into account the low-level visual features and high-level semantic content of the image, making image retrieval simpler and more accurate.

附图说明Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其它的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only These are some embodiments of the present invention. Those skilled in the art can also obtain other drawings based on these drawings without creative work.

图1是本发明实施例中的基于相关反馈和Bag-of-Features的图像检索系统结构示意图;Fig. 1 is a schematic structural diagram of an image retrieval system based on correlation feedback and Bag-of-Features in an embodiment of the present invention;

图2是本发明实施例中的基于相关反馈和Bag-of-Features的图像检索方法流程图。FIG. 2 is a flowchart of an image retrieval method based on relevant feedback and Bag-of-Features in an embodiment of the present invention.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其它实施例,都属于本发明保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

本发明提供了一种基于相关反馈和Bag-of-Features的图像检索系统及方法,采用Bag-of-Features方式,将每一张图像看作是一些局部特征的集合,并挑选出图像库中的关键特征,然后基于关键特征集合将每一张图像表示成一个向量,这样通过向量的比较来实现图像检索的目的,这种方式简洁有效。同时,为了兼顾图像的高层语义,结合相关反馈的方法,让用户参与检索的过程,用户对每次检索结果进行正相关和负相关的标示,系统根据用户的反馈重新调整检索参数,以此迭代,最终得到用户满意的检索结果。The present invention provides an image retrieval system and method based on correlation feedback and Bag-of-Features, adopts the Bag-of-Features method, regards each image as a collection of some local features, and selects the The key features, and then represent each image as a vector based on the key feature set, so that the purpose of image retrieval can be achieved by comparing the vectors, which is simple and effective. At the same time, in order to take into account the high-level semantics of the image, combined with the method of relevant feedback, let the user participate in the retrieval process, the user marks the positive correlation and negative correlation of each retrieval result, and the system readjusts the retrieval parameters according to the user's feedback, so as to iterate , and finally get the user-satisfied retrieval results.

图1示出了本发明实施例中的基于相关反馈和Bag-of-Features的图像检索系统,其主要包括特征提取模块、特征词典生成模块、频率特征向量生成模块、特征加权模块、相似性度量模块、相关反馈模块等。Figure 1 shows an image retrieval system based on correlation feedback and Bag-of-Features in an embodiment of the present invention, which mainly includes a feature extraction module, a feature dictionary generation module, a frequency feature vector generation module, a feature weighting module, and a similarity measure modules, related feedback modules, etc.

特征提取模块是对图像进行预处理,提取出每张图像的局部特征,这样每张图像就可以表示成局部特征的集合,无需考虑局部特征的位置关系。同时,也需要将每个局部特征进行量化表示,采用SIFT算子来提取局部特征并表示成128维的向量。The feature extraction module is to preprocess the image and extract the local features of each image, so that each image can be represented as a collection of local features without considering the positional relationship of the local features. At the same time, it is also necessary to quantify each local feature, and use the SIFT operator to extract local features and express them as 128-dimensional vectors.

特征词典生成模块负责从整个图像数据库中找出关键性特征,并组成一个词典集合。因为每张图像有很多局部特征,这样整个数据库中所有局部特征的数量非常庞大,因此需要采用K-means聚类的方式,找出具有代表性的特征,由这些关键特征组成一个特征词单。The feature dictionary generation module is responsible for finding key features from the entire image database and forming a dictionary set. Because each image has many local features, the number of all local features in the entire database is very large, so it is necessary to use K-means clustering to find representative features and form a feature word list from these key features.

频率特征向量生成模块是为每张图像构建一个特征向量。基于特征词典可以为每张图像生成一个特征向量,向量的每个分量表示对应关键特征的频数。因为词典中的关键特征是通过聚类而来,因此在每张图像中,不可能完全找到关键特征,我们将图像中的局部特征分配给距离最近的关键特征,表示该关键特征出现一次。通过频率特征向量生成模块,每张图像都用维数相同的向量进行表示。The frequency feature vector generation module is to construct a feature vector for each image. A feature vector can be generated for each image based on the feature dictionary, and each component of the vector represents the frequency of the corresponding key feature. Because the key features in the dictionary are obtained by clustering, it is impossible to find the key features completely in each image. We assign the local features in the image to the nearest key feature, indicating that the key feature appears once. Through the frequency feature vector generation module, each image is represented by a vector with the same dimension.

特征加权模块是为词典中每个关键特征生成权重,然后用该权重乘以频率特征向量中对应的分量,为每张图像构建出带权特征向量。词典中的每个关键特征在图像库中的重要性不同,可以采用IDF算法,计算出每个关键特征的IDF值,将其作为关键特征的权值。然后计算出每张图像的带权特征向量。The feature weighting module generates a weight for each key feature in the dictionary, and then multiplies the weight by the corresponding component in the frequency feature vector to construct a weighted feature vector for each image. The importance of each key feature in the dictionary is different in the image library. The IDF algorithm can be used to calculate the IDF value of each key feature and use it as the weight of the key feature. Then calculate the weighted feature vector of each image.

相似性度量模块是计算两张图像之间的相似性。数据库中的图像都已经用维数相同的向量量化表示,因此通过计算向量之间的距离来度量对应图像之间的相似性,按相似程度从高到低的顺序进行排序。The similarity measurement module is to calculate the similarity between two images. The images in the database have been quantized and represented by vectors with the same dimension, so the similarity between corresponding images is measured by calculating the distance between vectors, and sorted in descending order of similarity.

相关反馈模块是让用户参与到检索过程。用户输入查询条件后,检索系统返回查询结果,然后用户对查询结果进行筛选,认为有用的就标识成正相关,无用的标识成负相关,系统根据用户的反馈,重新调整查询条件进行检索,以此循环,直到得到用户满意的结果。The relevant feedback module is to allow users to participate in the retrieval process. After the user enters the query conditions, the retrieval system returns the query results, and then the user filters the query results, and marks the useful ones as positive correlations, and the useless ones as negative correlations, and the system re-adjusts the query conditions for retrieval based on user feedback. Loop until the user is satisfied with the result.

图2示出了本发明实施例中的一种基于相关反馈和Bag-of-Features的图像检索方法流程图,具体步骤如下:Fig. 2 shows a flow chart of an image retrieval method based on relevant feedback and Bag-of-Features in an embodiment of the present invention, and the specific steps are as follows:

第1步,对每张图像进行特征提取,找出局部特征,并将其用SIFT算子表示。The first step is to perform feature extraction on each image, find out local features, and express them with SIFT operator.

第2步,将所有图像的局部特征集合在一起,采用K-means聚类的方式,生成指定数量的关键特征,组成特征词典。In the second step, the local features of all images are gathered together, and K-means clustering is used to generate a specified number of key features to form a feature dictionary.

第3步,对每张图像,依次将其每个局部特征分配给最近邻的关键特征,表示关键特征的频数。这样基于特征词典,可以为每张图像生成一个频率特征向量。In the third step, for each image, each local feature is assigned to the nearest neighbor key feature in turn, indicating the frequency of key features. In this way, based on the feature dictionary, a frequency feature vector can be generated for each image.

第4步,结合第3步,统计出每个关键特征出现的图像数,即在多少张图像中出现过,然后除以图像总数,得到关键特征的IDF值,作为关键特征的权重。Step 4, combined with step 3, count the number of images in which each key feature appears, that is, how many images it has appeared in, and then divide it by the total number of images to obtain the IDF value of the key feature as the weight of the key feature.

第5步,对每张图像,将其特征向量中的每个分量乘以对应关键特征的权重,得到带权特征向量。Step 5: For each image, multiply each component in its feature vector by the weight of the corresponding key feature to obtain a weighted feature vector.

第6步,计算查询图像的向量与数据中图像向量的相似性,并按相似程度从高到低的顺序排序输出。Step 6: Calculate the similarity between the vector of the query image and the image vector in the data, and sort the output in order of similarity from high to low.

第7步,用户检查检索结果,如果满足要求,则结束;否则进入第8步。In step 7, the user checks the search results, if the requirements are met, then end; otherwise, go to step 8.

第8步,用户对检索结果不满意,则对检索结果进行正相关和负相关的标注,然后重新输入给检索系统。Step 8, if the user is not satisfied with the search result, he will mark the search result with positive correlation and negative correlation, and then input it to the retrieval system again.

第9步,系统根据用户的反馈,重新调整检索条件,进入第6步。In step 9, the system readjusts the retrieval conditions according to the user's feedback, and then enters step 6.

综上,本发明是将Bag-of-Features和相关反馈两种方式完美地结合在一起,提高了图像检索的效率和准确性。首先采用Bag-of-Features方式,将图像表示成特征的集合,进而表示成一个特征向量,这种量化的方式使得图像的表示和相似性计算变得方便简捷。同时,相关反馈的方式也让用户充分参与到检索过程,避免了图像高层语义的丢失。Bag-of-Features和相关反馈两种方式的结合,大大简化了图像的表示和相似性比较,同时也兼顾了图像的低层可视化特征和高层语义内容,使得图像检索更加简单、更加准确。To sum up, the present invention perfectly combines Bag-of-Features and related feedback methods to improve the efficiency and accuracy of image retrieval. First, the Bag-of-Features method is used to represent the image as a collection of features, and then as a feature vector. This quantization method makes image representation and similarity calculation convenient and simple. At the same time, the way of relevant feedback also allows users to fully participate in the retrieval process, avoiding the loss of high-level semantics of images. The combination of Bag-of-Features and relevant feedback greatly simplifies the image representation and similarity comparison, and also takes into account the low-level visual features and high-level semantic content of the image, making image retrieval simpler and more accurate.

本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序可以存储于一计算机可读存储介质中,存储介质可以包括:只读存储器(ROM,Read Only Memory)、随机存取存储器(RAM,Random AccessMemory)、磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above-mentioned embodiments can be completed by instructing related hardware through a program, and the program can be stored in a computer-readable storage medium, and the storage medium can include: Read-only memory (ROM, Read Only Memory), random access memory (RAM, Random AccessMemory), disk or CD-ROM, etc.

以上对本发明实施例所提供的基于相关反馈方式和Bag-of-Features的图像检索系统及方法进行了详细介绍,本文中应用了具体个例对本发明的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法及其核心思想;同时,对于本领域的一般技术人员,依据本发明的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本发明的限制。The image retrieval system and method based on the relevant feedback mode and Bag-of-Features provided by the embodiments of the present invention have been introduced in detail above. In this paper, specific examples have been used to illustrate the principle and implementation of the present invention. The above embodiments The description is only used to help understand the method of the present invention and its core idea; at the same time, for those of ordinary skill in the art, according to the idea of the present invention, there will be changes in the specific implementation and scope of application. In summary, As stated above, the content of this specification should not be construed as limiting the present invention.

Claims (2)

1.一种基于相关反馈和Bag-of-Features的图像检索系统,其特征在于,包括:1. An image retrieval system based on relevant feedback and Bag-of-Features, characterized in that, comprising: 特征提取模块,用于对图像进行预处理,提取出每张图像的局部特征;The feature extraction module is used to preprocess the image and extract the local features of each image; 特征词典生成模块,用于从整个图像数据库中找出关键性特征,并组成一个词典集合;The feature dictionary generation module is used to find key features from the entire image database and form a dictionary set; 频率特征向量生成模块,用于为每张图像构建一个特征向量;A frequency eigenvector generation module is used to construct a eigenvector for each image; 特征加权模块,用于为词典中每个关键特征生成权重,然后用该权重乘以频率特征向量中对应的分量,为每张图像构建出带权特征向量;The feature weighting module is used to generate a weight for each key feature in the dictionary, and then multiplies the weight by the corresponding component in the frequency feature vector to construct a weighted feature vector for each image; 相似性度量模块,用于计算两张图像之间的相似性;Similarity measurement module for calculating the similarity between two images; 相关反馈模块,用于让用户参与到检索过程,在用户输入查询条件后,检索系统返回查询结果,然后用户对查询结果进行筛选,认为有用的就标识成正相关,无用的标识成负相关,系统根据用户的反馈,重新调整查询条件进行检索,以此循环,直到得到用户满意的结果。The relevant feedback module is used to allow users to participate in the retrieval process. After the user enters the query conditions, the retrieval system returns the query results, and then the user screens the query results, and marks the useful ones as positive correlations, and the useless ones as negative correlations. According to the user's feedback, readjust the query conditions for retrieval, and repeat this cycle until the user is satisfied with the result. 2.一种基于相关反馈和Bag-of-Features的图像检索方法,其特征在于,包括:2. An image retrieval method based on relevant feedback and Bag-of-Features, characterized in that, comprising: 步骤一、对每张图像进行特征提取,找出局部特征,并将其用SIFT算子表示;Step 1, perform feature extraction on each image, find out local features, and express it with SIFT operator; 步骤二、将所有图像的局部特征集合在一起,采用K-means聚类的方式,生成指定数量的关键特征,组成特征词典;Step 2. Collect the local features of all the images together, and use the K-means clustering method to generate a specified number of key features to form a feature dictionary; 步骤三、对每张图像,依次将其每个局部特征分配给最近邻的关键特征,表示关键特征的频数,这样基于特征词典,可以为每张图像生成一个频率特征向量;Step 3. For each image, assign each local feature to the key feature of the nearest neighbor in turn, indicating the frequency of the key feature, so that based on the feature dictionary, a frequency feature vector can be generated for each image; 步骤四、统计出每个关键特征出现的图像数,即在多少张图像中出现过,然后除以图像总数,得到关键特征的IDF值,作为关键特征的权重;Step 4, count the number of images in which each key feature appears, that is, how many images it has appeared in, and then divide it by the total number of images to obtain the IDF value of the key feature as the weight of the key feature; 步骤五、对每张图像,将其特征向量中的每个分量乘以对应关键特征的权重,得到带权特征向量;Step 5. For each image, multiply each component in its feature vector by the weight of the corresponding key feature to obtain a weighted feature vector; 步骤六、算查询图像的向量与数据中图像向量的相似性,并按相似程度从高到低的顺序排序输出;Step 6, calculate the similarity between the vector of the query image and the image vector in the data, and sort the output in order of similarity from high to low; 步骤七、用户检查检索结果,如果满足要求,则结束;否则进入步骤八;Step 7. The user checks the search results, if the requirements are met, then end; otherwise, go to step 8; 步骤八、用户对检索结果不满意,则对检索结果进行正相关和负相关的标注,然后重新输入给检索系统;Step 8. If the user is not satisfied with the search result, he will mark the search result with positive correlation and negative correlation, and then re-input it to the retrieval system; 步骤九、系统根据用户的反馈,重新调整检索条件,进入步骤六。Step 9. The system readjusts the retrieval conditions according to the user's feedback, and proceeds to step 6.
CN201310330706XA 2013-07-31 2013-07-31 Image Retrieval System and Method Based on Correlation Feedback and Bag-of-Features Pending CN103440262A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310330706XA CN103440262A (en) 2013-07-31 2013-07-31 Image Retrieval System and Method Based on Correlation Feedback and Bag-of-Features

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310330706XA CN103440262A (en) 2013-07-31 2013-07-31 Image Retrieval System and Method Based on Correlation Feedback and Bag-of-Features

Publications (1)

Publication Number Publication Date
CN103440262A true CN103440262A (en) 2013-12-11

Family

ID=49693954

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310330706XA Pending CN103440262A (en) 2013-07-31 2013-07-31 Image Retrieval System and Method Based on Correlation Feedback and Bag-of-Features

Country Status (1)

Country Link
CN (1) CN103440262A (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462381A (en) * 2014-12-11 2015-03-25 北京中细软移动互联科技有限公司 Trademark image retrieval method
CN104978395A (en) * 2015-05-22 2015-10-14 北京交通大学 Vision dictionary construction and application method and apparatus
CN105653723A (en) * 2016-01-19 2016-06-08 中国科学技术大学 Inquired image characteristic cutting method for image retrieval
CN107622071A (en) * 2016-07-15 2018-01-23 上海交通大学 Clothes Image Retrieval System and Method Using Indirect Correlation Feedback Without Checking Sources
CN107766373A (en) * 2016-08-19 2018-03-06 阿里巴巴集团控股有限公司 The determination method and its system of the affiliated classification of picture
CN107909687A (en) * 2017-11-07 2018-04-13 惠州市瑞时智控科技有限公司 A kind of Intelligent door control system and its control method based on machine learning
CN110826627A (en) * 2019-11-06 2020-02-21 广东三维家信息科技有限公司 Image similarity measuring method and device and electronic equipment
CN113343009A (en) * 2021-01-05 2021-09-03 辽宁工程技术大学 Image retrieval method based on K-centroid clustering and related feedback
CN115357739A (en) * 2022-08-16 2022-11-18 珠高智能科技(深圳)有限公司 Image retrieval method and device, electronic equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101071439A (en) * 2007-05-24 2007-11-14 北京交通大学 Interactive video searching method based on multi-view angle
US20100195914A1 (en) * 2009-02-02 2010-08-05 Michael Isard Scalable near duplicate image search with geometric constraints

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101071439A (en) * 2007-05-24 2007-11-14 北京交通大学 Interactive video searching method based on multi-view angle
US20100195914A1 (en) * 2009-02-02 2010-08-05 Michael Isard Scalable near duplicate image search with geometric constraints

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
唐建鑫: ""基于BagofFeatures的图像分类和检索研究"", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462381A (en) * 2014-12-11 2015-03-25 北京中细软移动互联科技有限公司 Trademark image retrieval method
CN104462381B (en) * 2014-12-11 2019-03-19 中细软移动互联科技有限公司 Trademark image retrieval method
CN104978395A (en) * 2015-05-22 2015-10-14 北京交通大学 Vision dictionary construction and application method and apparatus
CN104978395B (en) * 2015-05-22 2019-05-21 北京交通大学 Visual dictionary building and application method and device
CN105653723B (en) * 2016-01-19 2019-03-01 中国科学技术大学 A kind of query image feature method of cutting out for image retrieval
CN105653723A (en) * 2016-01-19 2016-06-08 中国科学技术大学 Inquired image characteristic cutting method for image retrieval
CN107622071A (en) * 2016-07-15 2018-01-23 上海交通大学 Clothes Image Retrieval System and Method Using Indirect Correlation Feedback Without Checking Sources
CN107622071B (en) * 2016-07-15 2020-01-07 上海媒智科技有限公司 Clothes image retrieval system and method under non-source-retrieval condition through indirect correlation feedback
CN107766373A (en) * 2016-08-19 2018-03-06 阿里巴巴集团控股有限公司 The determination method and its system of the affiliated classification of picture
CN107766373B (en) * 2016-08-19 2021-07-20 阿里巴巴集团控股有限公司 Method and system for determining categories to which pictures belong
CN107909687A (en) * 2017-11-07 2018-04-13 惠州市瑞时智控科技有限公司 A kind of Intelligent door control system and its control method based on machine learning
CN110826627A (en) * 2019-11-06 2020-02-21 广东三维家信息科技有限公司 Image similarity measuring method and device and electronic equipment
CN113343009A (en) * 2021-01-05 2021-09-03 辽宁工程技术大学 Image retrieval method based on K-centroid clustering and related feedback
CN115357739A (en) * 2022-08-16 2022-11-18 珠高智能科技(深圳)有限公司 Image retrieval method and device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN111753099B (en) Method and system for enhancing relevance of archive entity based on knowledge graph
CN103440262A (en) Image Retrieval System and Method Based on Correlation Feedback and Bag-of-Features
Zhu et al. Unsupervised visual hashing with semantic assistant for content-based image retrieval
CN105760507B (en) Cross-modal topic correlation modeling method based on deep learning
CN111680173A (en) A CMR Model for Unified Retrieval of Cross-Media Information
CN101364239B (en) A classification catalog automatic construction method and related system
US11783389B2 (en) Methods and systems for predicting a price of any subtractively manufactured part utilizing artificial intelligence at a computing device
CN103927358A (en) Text search method and system
Yang et al. Tag tagging: Towards more descriptive keywords of image content
CN107391565B (en) Matching method of cross-language hierarchical classification system based on topic model
CN111753067A (en) A method, device and equipment for evaluating the innovativeness of technical disclosure text
Bleik et al. Text categorization of biomedical data sets using graph kernels and a controlled vocabulary
CN103425740B (en) A kind of material information search method based on Semantic Clustering of internet of things oriented
CN109960730B (en) Short text classification method, device and equipment based on feature expansion
CN105045875A (en) Personalized information retrieval method and apparatus
CN114997288B (en) A design resource association method
CN109446313B (en) Sequencing system and method based on natural language analysis
Sumathi et al. An overview of automated image annotation approaches
Li et al. Bi-CMR: Bidirectional reinforcement guided hashing for effective cross-modal retrieval
CN104216993A (en) A Tag Clustering Method Based on Tag Co-occurrence
JP2011248596A (en) Searching system and searching method for picture-containing documents
Qin et al. Feature selection for text classification based on part of speech filter and synonym merge
CN105678244A (en) Approximate video retrieval method based on improvement of editing distance
CN116738068A (en) Trending topic mining method, device, storage medium and equipment
CN104077419B (en) With reference to semantic method for reordering is retrieved with the long query image of visual information

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C53 Correction of patent of invention or patent application
CB03 Change of inventor or designer information

Inventor after: Luo Xiaonan

Inventor after: Xu Songhua

Inventor after: Jiang Tao

Inventor after: Xiao Jian

Inventor after: Lin Shujin

Inventor before: Luo Xiaonan

Inventor before: Jiang Tao

Inventor before: Xiao Jian

COR Change of bibliographic data

Free format text: CORRECT: INVENTOR; FROM: LUO XIAONAN JIANG TAO XIAO JIAN TO: LUO XIAONAN XU SONGHUA JIANG TAO XIAO JIAN LIN SHUJIN

RJ01 Rejection of invention patent application after publication

Application publication date: 20131211

RJ01 Rejection of invention patent application after publication