[go: up one dir, main page]

CN109272013B - Similarity measurement method based on learning - Google Patents

Similarity measurement method based on learning Download PDF

Info

Publication number
CN109272013B
CN109272013B CN201810879213.4A CN201810879213A CN109272013B CN 109272013 B CN109272013 B CN 109272013B CN 201810879213 A CN201810879213 A CN 201810879213A CN 109272013 B CN109272013 B CN 109272013B
Authority
CN
China
Prior art keywords
model
view
stereo
hypergraph
learning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810879213.4A
Other languages
Chinese (zh)
Other versions
CN109272013A (en
Inventor
王坤
韦莎
程雨航
王伟忠
聂为之
刘安安
苏育挺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
China Electronics Standardization Institute
Original Assignee
Tianjin University
China Electronics Standardization Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University, China Electronics Standardization Institute filed Critical Tianjin University
Priority to CN201810879213.4A priority Critical patent/CN109272013B/en
Publication of CN109272013A publication Critical patent/CN109272013A/en
Application granted granted Critical
Publication of CN109272013B publication Critical patent/CN109272013B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/217Validation; Performance evaluation; Active pattern learning techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本发明公开了一种基于学习的相似性度量方法,包括:给定立体模型的视图,筛选具有代表性的视图构造一个基于视图的超图来表征立体对象之间的关系;利用立体模型数据,从每个立体模型中提取空间结构圆形描述符,并且使用每个立体模型之间的距离来生成简单基于模型的图以探索立体模型之间的关联性;选定合适的学习框架,生成初始的学习权重,将基于视图生成的超图和基于立体模型生成的图作为学习框架的输入,通过联合学习框架来学习两种图的最优组合权重,从而通过基于视图的超图和基于立体模型的图来估计立体对象之间的相关性。本发明通过提取视图特征信息和立体模型空间结构信息,使得对立体模型的描述更加全面,在相似度量化方面更加准确和科学。

Figure 201810879213

The invention discloses a learning-based similarity measurement method, comprising: given a view of a stereo model, screening representative views to construct a view-based hypergraph to represent the relationship between the stereo objects; using the stereo model data, Spatial structure circular descriptors are extracted from each solid model, and the distance between each solid model is used to generate a simple model-based graph to explore the correlation between the solid models; an appropriate learning framework is selected to generate an initial The learning weights of the view-based hypergraph and the stereo model-based graph are used as the input of the learning framework, and the optimal combination weights of the two graphs are learned through the joint learning framework, so that the view-based hypergraph and the stereo model-based to estimate the correlation between stereo objects. By extracting the view feature information and the spatial structure information of the three-dimensional model, the invention makes the description of the three-dimensional model more comprehensive, and is more accurate and scientific in the aspect of similarity quantification.

Figure 201810879213

Description

Similarity measurement method based on learning
Technical Field
The invention relates to the fields of similarity measurement, three-dimensional model retrieval and the like, in particular to a similarity measurement method based on learning.
Background
Due to the rapid development of graphics hardware, computer technology and networks, stereoscopic objects have been widely used in a variety of applications, such as: computer graphics, the medical industry, and the field of virtual reality. The large-scale database of stereo objects is rapidly increasing, which leads to an increasing demand for efficient stereo object retrieval algorithms.
Recently, extensive research efforts have been devoted to stereoscopic object retrieval techniques[1]-[4]. Existing stereo object retrieval methods can be simply divided into two paradigms, namely model-based methods and view-based methods.
In a model-based approach[5]-[7]A stereo object is described as a model-based feature, such as: low level features (e.g. volume descriptors)[7]Surface distribution of[6]And surface geometry[5]) And advanced features, e.g. in the literature[8]Visual and geometric features are considered and the high level semantic space resulting from the low level feature mapping is further learned with user relevance feedback, which is another euclidean space and can be considered a dimension reduction or feature selection method. One advantage of model-based approaches is that they can preserve global spatial information of the stereo object. While model-based methods are effective, they explicitly require stereo model information, which limits the application of model-based methods. The stereo model information is not always available, especially in some practical applications.
In view-based approaches, a stereoscopic object is represented by a set of images from different directions. For different approaches, these views may be captured with a still camera array or without such camera array constraints. For view-based approaches, matching between two stereo objects is done by multi-view matching. View-based approaches benefit from existing image processing and matching techniques. These methods make the retrieval of stereo objects more flexible because they do not require stereo model information. Existing work also shows that view-based methods can highly differentiate three-dimensional objects, which provides better retrieval performance than model-based methods. One disadvantage of view-based approaches compared to model-based approaches is that it is difficult to describe the spatial relationship between different views when camera array information is not available.
A typical scenario where stereo model information is not available is when one wants to search for objects in the real world, e.g. when a tourist finds something interesting and wants to find similar objects in a data set, the information of the model is usually obtained by taking several pictures. In this case, the model-based approach does not work and only the image-based approach can be applied. CAD is a very important field of application for model-based methods. In the entertainment field, model-based methods perform well, such as stereoscopic television and gaming, and in the medical field, such as telemedicine treatment and diagnosis. Notably, visual information has recently become more important in the above applications. Both model information and view-based information may bring useful perspectives, which may further improve performance.
The main challenges currently faced by similarity metrics are:
1) because the information content of the three-dimensional model is large, how to describe the model in the database, extract the features with high distinguishing degree and match the features, and the calculation of the similarity directly influences the retrieval result;
2) the problems of calculation amount and calculation complexity are also considered while the retrieval accuracy is ensured, so that the retrieval time is controlled within an acceptable range.
Disclosure of Invention
The invention provides a similarity measurement method based on learning, which extracts two different types of information, namely view characteristic information and spatial structure information of a stereo model, so that the description of the stereo model is more comprehensive, the quantification of the similarity is more accurate and scientific, and the details are described as follows:
a learning-based similarity metric method, the method comprising the steps of:
given the views of the stereo model, screening representative views to construct a view-based hypergraph to represent the relationship between stereo objects;
extracting spatial structure circle descriptors from each of the stereoscopic models using the stereoscopic model data, and generating simple model-based maps using distances between each of the stereoscopic models to explore associations between the stereoscopic models;
selecting a proper learning frame, generating initial learning weight, using the generated hypergraph based on the view and the generated graph based on the solid model as the input of the learning frame, and learning the optimal combined weight of the two graphs through a joint learning frame, thereby estimating the correlation between the solid objects through the generated hypergraph based on the view and the graph based on the solid model.
Wherein, the screening of the representative views constructs a hypergraph based on the views to represent the relationship between the three-dimensional objects, specifically:
1) calculating Zernike moments between any two views, selecting the first K views closest to the first view as a representative view of each view cluster, and forming a new view set;
2) and constructing a view-based hypergraph by adopting star unfolding according to the new view set so as to express the relationship between the three-dimensional models.
Further, the air conditioner is provided with a fan,
hierarchically clustering a given view comprising a plurality of views, dividing the views into view clusters,
in the hypergraph, each vertex is an object, each edge is a view cluster, and the weight of each edge is defined according to the similarity between any two views in the view cluster.
Wherein the extracting spatial structure circular descriptors from each of the stereo models, and using the distances between each of the stereo models to generate a simple model-based map to explore the relevance between the stereo models are specifically:
extracting a spatial structure circular descriptor of the three-dimensional model as a three-dimensional model feature, wherein the purpose of the spatial structure circular descriptor is to represent depth information of the surface of the three-dimensional model on a projection minimum bounding box of the 3D model and generate a depth histogram as the feature of the 3D model;
bipartite matching is performed to measure the distance between each two 3D models,i.e. dSSCD(Oi,Oj)。
In a specific implementation, the defining the weight of each edge according to the similarity between any two views in the view cluster specifically includes:
Figure BDA0001754064360000031
wherein d isSSCD(vi,vj) Is a 3D object OiAnd OjDistance between, σsSet to the median of the distances between all pairs of stereoscopic models.
Further, the step of learning the optimal combination weights of the two graphs through a joint learning framework so as to estimate the correlation between the stereo objects through the view-based hypergraph and the stereo model-based graph is:
1) setting an initial learning frame, namely setting two modes of a view-based hypergraph and a graph model based on a stereo model as the same weight, wherein the retrieval task of the stereo model is to learn the optimal pair-wise object correlation under the two information of the view-based hypergraph and the graph based on the stereo model;
2) learning the combined weight, and further learning the optimal weight of the view information and the stereo model data according to different influences caused by the view information and the stereo model information;
and optimizing and exploring the stereo model-based data and the view-based data simultaneously by using the learned combined weight, and obtaining a vector of the related similarity measurement, wherein the vector is the correlation of all the stereo models in the data set relative to the query model.
Further, the constructing a view-based hypergraph by star unfolding specifically includes:
a hypergraph for constructing a three-dimensional model using star expansion is denoted as GH=(VH,EH,WH) (ii) a Wherein V represents a vertex, E represents an edge, W is the weight of the edge E, and H is a correlation matrix;
hypergraph GHWeight W inH
Figure BDA0001754064360000032
Wherein v iscIs a central view of the super edge, vxIs and vcOne of the closest K views, d (v)x,vc) Is vxAnd vcDistance between, σHEmpirically set as the median of the distances between all views;
the correlation matrix H is generated by the following formula:
Figure BDA0001754064360000041
wherein, h (v)H,eH) Is the data in the correlation matrix H, where the vertex vH∈VHEdge eH∈EH
Vertex vHThe degree of (d) is defined as:
Figure BDA0001754064360000042
wherein, w (e)H)∈W。
Edge eHThe degree of (d) is defined as:
Figure BDA0001754064360000043
in specific implementation, the setting of an initial learning framework, namely setting two modes of a hypergraph based on a view and a graph model based on a three-dimensional model as the same weight specifically comprises the following steps of;
setting an initial learning frame, setting the same weight for the two data, and expressing the learning process by the following objective function:
Figure BDA0001754064360000044
in this formula, f is the correlation vector to be learned, ΩV(f) Is a regularization term on the view-based hypergraph structure, ΩM(f) Is a regularization term on the graph structure based on a stereo model, r (f) is an empirical penalty, and μ > 0 is a weighting parameter.
Wherein the objective function is further modified to:
Figure BDA0001754064360000045
in the calculation process, let:
Figure BDA0001754064360000046
then:
ΔH=I-ΘH
ΔS=I-ΘS
wherein, DeltaHAnd ΔSLaplacian as hypergraph; y is an initial label vector; eta > 0 is a weighting parameter, and H is a correlation matrix; w is a diagonal matrix of edge weights; dvA diagonal matrix which is a vertex degree; deIs a diagonal matrix of edge degrees.
The technical scheme provided by the invention has the beneficial effects that:
1. by extracting the view characteristic information and the spatial structure information of the three-dimensional model, the three-dimensional model is more comprehensively described, and the similarity is more accurate and scientific in quantification;
2. according to the method, when the similarity expression vector is calculated, a network model based on learning is used, so that the obtained weight is ensured to be an optimal solution, and the flexibility and the stability of the similarity measurement are improved;
3. by selecting representative view information to represent the three-dimensional model, the calculation amount is reduced, and the similarity measurement is efficient;
4. the present invention is the first task of co-exploring view-based and stereo-model-based correlations between stereo models in a graph-based framework;
5. the method avoids incomplete extraction of the stereo model information caused by only adopting a stereo model-based or view-based method, and can ensure the scientificity and accuracy of calculating the similarity of the stereo model.
Drawings
FIG. 1 is a flow chart of a method of learning-based similarity measurement;
FIG. 2 is an exemplary diagram of a content of a stereo model database;
FIG. 3 is a schematic representation of F and ANMRR in the NTU database;
FIG. 4 is a schematic representation of F and ANMRR in the SHREC database;
FIG. 5 is a schematic representation of F and ANMRR in the PSB database;
FIG. 6 is a chart of a full-check standard curve of the NTU database model retrieval method proposed by the present invention;
FIG. 7 is a chart of a full-check quasi-curve of the SHREC database model retrieval method proposed by the present invention;
fig. 8 is a check-reference curve diagram of the PSB database model retrieval method according to the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention are described in further detail below.
At this stage, most approaches employ only model-based or view-based approaches, which may result in incomplete representation of the stereo model information. The present patent proposes joint learning to characterize a stereo model based on view and based on model information. In the view-based section, a representative view is first selected for each object, and then a view distance is calculated. Following reference [9 ]]The method of (1), constructing a view-based hypergraph using a view star extension. In the model part, a spatial structure circular descriptor is extracted[10]And generates a simple model-based map using the pair-wise object distances. Thus, the view information and the model data can be represented by two graphs. Learn the two figuresIn order to estimate the correlation between stereo objects, the weights that the maps occupy in learning can also be optimized. The results of comparison between the method and other methods are provided at the end of the text, and evaluation of three data sets shows excellent three-dimensional object retrieval accuracy.
Example 1
The most important part of the similarity measurement method based on learning is two different relational graphs of a previous extraction model, namely a view-based hypergraph and a model-based graph, aiming at the view of the model and passing through an HAC[11](HAC is one of hierarchical clustering methods, and the basic idea is that a single document is regarded as different classes, then different methods are utilized to combine the different classes, the number of the classes is gradually reduced until a class is finally obtained or the required class number is obtained), a view cluster is constructed, and a view-based hypergraph is established according to the view cluster; for model-specific data, by spatially structuring the circular descriptor SSCD[12](in SSCD, the spatial structure of the 3D model is described by a 2D image, and the attribute value of each pixel represents 3D spatial information, SSCD can preserve the global spatial structure of the 3D model with rotation and scaling invariance.) method extracts the depth histogram of the model and then establishes a model-based graph; and finally, learning according to the two relation graphs to obtain a final similarity quantization vector.
The method provided by the embodiment of the invention is a method for searching by jointly learning the relevance between the view of the three-dimensional model and the three-dimensional model, and the specific implementation steps are as follows:
101: given the views of the stereo model, screening representative views to construct a view-based hypergraph to represent the relationship between stereo objects;
102: extracting the SSCD from each of the stereoscopic models using the stereoscopic model data and using the distance between each of the stereoscopic models to generate a simple model-based map to explore the correlation between the stereoscopic models;
103: selecting a proper learning frame, generating initial learning weight, using the generated hypergraph based on the view and the generated graph based on the solid model as the input of the learning frame, and learning the optimal combined weight of the two graphs through a joint learning frame, thereby estimating the correlation between the solid objects through the generated hypergraph based on the view and the graph based on the solid model.
Wherein, the step 101 of screening the representative views to construct a view-based hypergraph to represent the relationship between the three-dimensional objects specifically includes:
1) performing hierarchical clustering on given views comprising a plurality of view angles, dividing the views into view clusters, firstly calculating Zernike moments between any two views in order to reduce redundant data, and selecting the first K views closest to the views as representative views of each cluster to form a new view set;
2) and constructing a view-based hypergraph by adopting star unfolding according to the new view set so as to express the relationship between the three-dimensional models, wherein each vertex is an object and each edge is a view cluster in the hypergraph. Thus, an edge connects multiple vertices, and the weight of each edge is defined according to the similarity between any two views within the cluster.
The specific steps of generating a simple model-based map using the distance between each three-dimensional model to explore the relationship between the three-dimensional models in step 102 to construct a three-dimensional model-based map are as follows:
extracting a spatial structure circular descriptor SSCD of the stereoscopic model as a stereoscopic model feature (wherein the purpose of SSCD is to generate a depth histogram as a feature of the 3D model on a projection minimum bounding box of the 3D model for representing depth information of a surface of the stereoscopic model), and after extracting SSCD, performing bipartite graph matching to measure a distance between every two 3D models, namely DSSCD(Oi,Oj)。
The relationship between objects is represented by a simple graph structure G ═ (V, E, W), where each vertex in the graph structure G represents an object, i.e., there are n vertices in G. According to two corresponding 3D objects OiAnd OjThe weight of the edge e (i, j) in the graph structure G is calculated as:
Figure BDA0001754064360000071
wherein d isSSCD(vi,vj) Is OiAnd OjA distance between, and σsSet to the median of the distances between all pairs of stereoscopic models.
The specific steps of learning the optimal combination weights of the two graphs through the joint learning framework in the step 103, and estimating the correlation between the stereo objects through the view-based hypergraph and the stereo model-based graph are as follows:
1) setting an initial learning framework, namely setting two modes of a view-based hypergraph and a three-dimensional model-based graph model as the same weight, and establishing a retrieval task of the three-dimensional model as a class of classification work, wherein the main aim is to learn the optimal pair-wise object correlation under two information of the view-based hypergraph and the three-dimensional model-based graph.
Where, given initial labeled data (in this case, a model of the query), an empirical loss term may be added as a constraint to the learning process.
2) Learning the combined weight, further learning the optimal weight of the view information and the stereo model data according to different influences caused by the view information and the stereo model information, and then adding the combined weight into a learning frame, wherein the target of the learning process consists of three parts, namely a structure regulator, an experience loss regulator and a combined weight regulator based on a hypergraph of the view and a graph based on the stereo model, so as to determine a final learning model;
the contents of the structure adjuster, the experience loss adjuster, and the combination weight adjuster are well known to those skilled in the art, and are not described in detail in the embodiments of the present invention.
By utilizing the learned combination weight, the data based on the stereo model and the data based on the view can be optimized and explored at the same time, and a vector f of the related similarity measurement is obtained, wherein the vector f is the correlation of all the stereo models in the data set relative to the query model, and a larger correlation value represents the high similarity between the stereo model and the query model. The higher the corresponding correlation value, the more similar the two objects are.
In summary, in the embodiment of the present invention, two different types of information, namely view feature information and stereo model spatial structure information, are extracted through the above steps 101 to 103, so that the description of the stereo model is more comprehensive, and the similarity quantization is more accurate and scientific.
Example 2
The scheme in example 1 is further described below with reference to specific calculation formulas, fig. 1 and fig. 2, and is described in detail below:
using O ═ O1,O2,...,OnDenotes n stereoscopic models, and Vi={vi1,vi2,...,vimRepresents a plurality of views of the ith stereoscopic model, from which a representative view is selected, assuming that the selected representative view is Vi={vi1,vi2,...,vimAnd then constructing a hypergraph of the three-dimensional model by adopting star expansion, wherein the hypergraph is represented as GH=(VH,EH,WH) (ii) a Where V represents a vertex, E represents an edge, W is the weight of edge E, and H is the correlation matrix.
Assuming n total stereo modelsrAnd (3) firstly calculating the distance between every two representative views based on Zernike moments, and generating top K nearest views for each representative view, wherein the value of K is set to be 10 in the embodiment of the invention. Calculate the hypergraph G byHWeight W inH
Figure BDA0001754064360000081
Wherein v iscIs a central view of the super edge, vxIs and vcOne of the closest K views, d (v)x,vc) Is vxAnd vcDistance between, σHEmpirically set as the median of the distances between all views.
The correlation matrix H may be generated by the following equation:
Figure BDA0001754064360000082
wherein, h (v)H,eH) Is the data in the correlation matrix H, where the vertex vH∈VHEdge eH∈EH
Further, vertex vH∈VHVertex vHThe degree of (c) can be defined as:
Figure BDA0001754064360000083
wherein, w (e)H)∈W。
Further, edge eH∈EHEdge eHThe degree of (c) can be defined as:
Figure BDA0001754064360000084
in concrete implementation, the vertex degree matrix vHAnd an edge degree matrix eHTwo diagonal matrices D may be usedvAnd DeAnd (4) showing.
In the constructed hypergraph GHWhen two stereo models share more similar views, they may be connected by a higher weight super-concealment, which may indicate a high correlation between these stereo models.
Given the stereo model data of the stereo object, the object relationships based on the stereo model are further explored here. Here, a Spatial Structure Circle Descriptor (SSCD) is used as a feature of the stereoscopic model. The method for generating the model-based map has been described above.
After the view-based hypergraph and the model-based hypergraph are obtained, an initial learning framework is set, the same weight is set for the two data, and the learning process can be expressed by the following formula:
Figure BDA0001754064360000091
in this formula, f is the correlation vector to be learned, ΩV(f) Is a regularization term on the view-based hypergraph structure, ΩM(f) Is a regularization term on the graph structure based on a stereo model, r (f) is an empirical penalty, and μ > 0 is a weighting parameter.
This objective function aims to minimize the empirical loss values of the stereomodel-based map and the view-based hypergraph, which can generate the optimal correlation vector f for retrieval.
Where the vector f is the relevance of all objects in the dataset with respect to the query object. A larger relevance value represents a high degree of similarity between the object and the query. The higher the corresponding correlation value, the more similar the two objects are. With the generated object correlation vector f, all objects in the data set may be sorted in descending order of the vector f.
Learning the combining weights: note that the view information and the stereo model information may not have the same effect on the object representation. In some cases, the view information may be more important, while in other cases, the stereo model data may play an important role. In this case, the optimal weights of the view information and the stereo model data are further learned.
Let α and β denote the combining weights of the view-based and the stereo-model-based information, respectively, where α + β is 1. Adding/to the combining weights2After the norm, the objective function can be further modified as:
Figure BDA0001754064360000092
in the calculation process, let:
Figure BDA0001754064360000093
then:
ΔH=I-ΘH
ΔS=I-ΘS
wherein, DeltaHAnd ΔSLaplacian that can be considered as a hypergraph; y is an initial label vector; eta > 0 is a weighting parameter.
The above alternative optimization can be processed under the optimal vector f value, and thus can be used for solving the object similarity measure. By using the learned combining weights, the exploration of the stereo model-based and view-based data can be optimized simultaneously, and the correlation vector f is obtained.
In summary, the embodiments of the present invention enhance the expressiveness of the stereo model through the above steps, and eliminate the influence of the single feature of the stereo model on the similarity calculation result, so that the accuracy of the stereo model search is improved, the calculation amount is reduced, and the search efficiency is improved.
Example 3
The following examples are presented to demonstrate the feasibility of the embodiments of examples 1 and 2, and are described in detail below:
the database in the embodiment of the invention is based on NTU and PSB[7]To proceed with. The three-dimensional models are drawn by workers through three-dimensional model processing software such as 3DMax or collected from websites with different domain names, the three-dimensional model database has different storage formats including an obj format, an off format and the like, a representative NTU549 database is used in the experimental design, 47 types of three-dimensional models are contained in the database, the PSB database contains 161 types and 1814 types of three-dimensional models in total, and the SHREC database contains 40 types and 800 types of three-dimensional models in total. Some models used in the embodiment of the present invention are shown in fig. 2, and the used three-dimensional model is in an off format.
An example of a stereo model dataset proposed by an embodiment of the present invention is shown in fig. 2, where F-measure considers the top 20 returned results for each query, and ANMRR (average normalized retrieval rank) evaluates ranking performance by considering ranking order. A low ANMRR value indicates the highest accuracy of the returned results. The above-mentioned metrics are used for evaluation, and the algorithm is shown in FIG. 3 to FIG. 5, wherein VMJR is the method of the present invention, in NTU, and,ANMRR value ED of VMJR on SHREC and PSB data sets[13]ERD[14]QVS[15]HL[9]and DC[15]The F-measure has a value higher than those of the methods, thereby highlighting the superiority of the method.
Wherein Precision is Precision, Recall is Recall, and the larger the area enclosed by the Recall curve and the horizontal and vertical coordinates, the better the retrieval performance is represented. Fig. 6 to 8 show that the performance of the method is better under the NTU and PSB databases, and on the NTU, SHREC and PSB data sets, the PR curve of the VMJR method is above the PR curves of the other five methods, and the area around the abscissa and ordinate axes is the largest, thereby verifying the feasibility of the method and meeting various requirements in practical application.
Reference to the literature
[1]J.W.H.Tangelder and R.C.Veltkamp,“A survey of content based 3D shape retrieval methods,”Multimedia Tools and Applications,vol.39,pp.441–471,2008.
[2]Y.Yang,H.Lin,and Y.Zhang,“Content-based 3D model retrieval:A survey,”IEEE Transactions on Systems,Man,and Cybernetics-Part C:Applications and Reviews,vol.37,pp.1081–1035,2007.
[3]K.Lu,Q.Wang,J.Xue,and W.Pan,“3d model retrieval and classification by semi-supervised learning with content-based similarity,”Information Sciences,vol.281,pp.703–713,2014.
[4]K.L¨u,N.He,and J.Xue,“Content-based similarity for 3d model retrieval and classification,”Progress in Natural Science,vol.19,no.4,pp.495–499,2009.
[5]A.E.Johnson and M.Hebert,“Using spin images for efficient object recognition in cluttered3D scenes,”IEEE Transactions on Pattern Analysis and Machine Intelligence,vol.21,no.5,pp.433–449,1999.
[6]R.Osada,T.Funkhouser,B.Chazelle,and D.Dobkin,“Shape distributions,”ACM Transactions on Graphic,vol.21,no.4,pp.807–832,2002.
[7]E.Paquet and M.Rioux,“Nefertiti:A query by content system for threedimensional model and image databases management,”Image Vision Computing,vol.17,pp.157–166,1999.
[8]B.Leng and Z.Xiong,“Modelseek:an effective 3D model retrieval system,”Multimedia Tools and Applications,2010.
[9]Gao Y,Wang M,Tao D,et al.3-D Object Retrieval and Recognition With Hypergraph Analysis[J].Acta Electronica Sinica,2012,21(9):4290-4303.
[10]Gao Y,Dai Q,Zhang N Y.3D model comparison using spatial structure circular descriptor[J].Pattern Recognition,2010,43(3):1142-1151.
[11]Steinbach M,Karypis G,Kumar V.A Comparison of Document Clustering Techniques[C]//2000.
[12]Gao Y,Dai Q,Zhang N Y.3D model comparison using spatial structure circular descriptor[J].Pattern Recognition,2010,43(3):1142-1151.
[13]Shih J L,Lee C H,Wang J T.A new 3D model retrieval approach based on the elevation descriptor[J].Pattern Recognition,2007,40(1):283-295.
[14]Vranic D V.An improvement of rotation invariant 3D-shape based on functions on concentric spheres[C]//Image Processing,2003.ICIP 2003.Proceedings.2003International Conference on.IEEE,2003,3:III-757.
[15]Gao Y,Wang M,Zha Z J,et al.Less is more:Efficient 3-D object retrieval with query view selection[J].IEEE Transactions on Multimedia,2011,13(5):1007-1018.
In the embodiment of the present invention, except for the specific description of the model of each device, the model of other devices is not limited, as long as the device can perform the above functions.
Those skilled in the art will appreciate that the drawings are only schematic illustrations of preferred embodiments, and the above-described embodiments of the present invention are merely provided for description and do not represent the merits of the embodiments.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims (8)

1.一种基于学习的相似性度量方法,其特征在于,所述方法包括以下步骤:1. A learning-based similarity measurement method, wherein the method comprises the following steps: 给定立体模型的视图,筛选具有代表性的视图构造一个基于视图的超图来表征立体对象之间的关系;Given a view of a solid model, filter representative views to construct a view-based hypergraph to represent the relationship between solid objects; 利用立体模型数据,从每个立体模型中提取空间结构圆形描述符,并且使用每个立体模型之间的距离来生成简单基于模型的图以探索立体模型之间的关联性;Using the stereo model data, the spatial structure circular descriptors are extracted from each stereo model, and the distance between each stereo model is used to generate a simple model-based graph to explore the association between the stereo models; 选定合适的学习框架,生成初始的学习权重,将基于视图生成的超图和基于立体模型生成的图作为学习框架的输入,通过联合学习框架来学习两种图的最优组合权重,从而通过基于视图的超图和基于立体模型的图来估计立体对象之间的相关性;Select an appropriate learning framework, generate initial learning weights, take the hypergraph generated based on the view and the graph generated based on the stereo model as the input of the learning framework, and learn the optimal combination weight of the two graphs through the joint learning framework, so as to pass View-based hypergraph and stereo model-based graph to estimate the correlation between stereo objects; 所述通过联合学习框架来学习两种图的最优组合权重,从而通过基于视图的超图和基于立体模型的图来估计立体对象之间的相关性的步骤为:The joint learning framework is used to learn the optimal combination weights of the two graphs, so that the steps of estimating the correlation between the stereo objects through the view-based hypergraph and the stereo model-based graph are: 1)设定初始的学习框架,即把基于视图的超图和基于立体模型的图模型两种模式设为相同的权重,立体模型的检索任务是在基于视图的超图和基于立体模型的图两种信息下学习最佳的成对对象相关性;1) Set the initial learning framework, that is, set the two modes of the view-based hypergraph and the stereo model-based graph model to the same weight. The retrieval task of the stereo model is between the view-based hypergraph and the stereo model-based graph. learn the best pairwise object correlation under two kinds of information; 2)学习组合权重,根据视图信息和立体模型信息造成的不同影响,将进一步学习视图信息和立体模型数据的最佳权重;2) Learning the combined weight, according to the different influences caused by the view information and the stereo model information, the optimal weight of the view information and the stereo model data will be further learned; 利用学习的组合权重,同时优化探索基于立体模型和基于视图的数据,并获得相关相似性度量的向量,向量是数据集中所有立体模型相对于查询模型的相关性。Using the learned combined weights, both the stereo model-based and view-based data are optimally explored, and a vector of relevant similarity measures is obtained, which is the correlation of all stereo models in the dataset with respect to the query model. 2.根据权利要求1所述的一种基于学习的相似性度量方法,其特征在于,所述筛选具有代表性的视图构造一个基于视图的超图来表征立体对象之间的关系具体为:2. a kind of similarity measurement method based on learning according to claim 1, is characterized in that, described screening representative view constructs a view-based hypergraph to characterize the relationship between three-dimensional objects is specifically: 1)计算任意两个视图之间的Zernike矩,选择最接近的前K个视图作为每个视图集群的代表性视图,组成新的视图集合;1) Calculate the Zernike moment between any two views, select the closest top K views as the representative views of each view cluster, and form a new view set; 2)根据新的视图集合,采用星形展开来构建一个基于视图的超图,以表达立体模型之间的关系。2) According to the new set of views, a star-expansion is used to build a view-based hypergraph to express the relationship between the solid models. 3.根据权利要求2所述的一种基于学习的相似性度量方法,其特征在于,3. a kind of similarity measurement method based on learning according to claim 2, is characterized in that, 对给定的包含多个视角的视图进行层次聚类,将这些视图分为视图集群,perform hierarchical clustering on a given view that contains multiple perspectives, and divide these views into view clusters, 在超图中,每个顶点都是一个对象,每条边都是一个视图集群,根据视图集群内任意两个视图之间的相似性来定义每条边的权重。In a hypergraph, each vertex is an object, each edge is a view cluster, and the weight of each edge is defined according to the similarity between any two views within the view cluster. 4.根据权利要求1所述的一种基于学习的相似性度量方法,其特征在于,所述从每个立体模型中提取空间结构圆形描述符,使用每个立体模型之间的距离来生成简单基于模型的图以探索立体模型之间的关联性具体为:4. A learning-based similarity measurement method according to claim 1, wherein the spatial structure circular descriptor is extracted from each solid model, and the distance between each solid model is used to generate A simple model-based graph to explore the associations between solid models is as follows: 提取立体模型的空间结构圆形描述符作为立体模型特征,空间结构圆形描述符的目的是在3D模型的投影最小边界框上,用于表示立体模型表面的深度信息,生成深度直方图作为3D模型的特征;Extract the spatial structure circular descriptor of the stereo model as the stereo model feature. The purpose of the spatial structure circular descriptor is to represent the depth information of the surface of the stereo model on the projected minimum bounding box of the 3D model, and generate a depth histogram as a 3D model. characteristics of the model; 进行二分图匹配以测量每两个3D模型之间的距离,即dSSCD(Oi,Oj)。Bipartite graph matching is performed to measure the distance between each two 3D models, ie dSSCD (O i ,O j ). 5.根据权利要求3所述的一种基于学习的相似性度量方法,其特征在于,所述根据视图集群内任意两个视图之间的相似性来定义每条边的权重具体为:5. A learning-based similarity measurement method according to claim 3, wherein the weight of each edge defined according to the similarity between any two views in the view cluster is specifically:
Figure FDA0003151813290000021
Figure FDA0003151813290000021
其中,dSSCD(vi,vj)是3D对象Oi和Oj之间的距离,σs被设置为所有成对立体模型之间距离的中值。where dSSCD (vi, vj ) is the distance between the 3D objects Oi and Oj , and σs is set as the median of the distances between all pairs of solid models.
6.根据权利要求2所述的一种基于学习的相似性度量方法,其特征在于,所述采用星形展开来构建一个基于视图的超图具体为:6. a kind of similarity measurement method based on learning according to claim 2, is characterized in that, described adopting star-shaped expansion to construct a view-based hypergraph is specifically: 采用星型拓展构建立体模型的超图,表示为GH=(VH,EH,WH);其中V代表顶点,E代表边,W是边E的权重,H为关联矩阵;The hypergraph of the three-dimensional model is constructed by star expansion, which is expressed as G H = (V H , E H , W H ); where V represents the vertex, E represents the edge, W is the weight of the edge E, and H is the association matrix; 超图GH中的权重WHWeights WH in hypergraph GH :
Figure FDA0003151813290000022
Figure FDA0003151813290000022
其中,vc是超边的中心视图,vx是与vc最接近的K视点之一,d(vx,vc)是vx和vc之间的距离,σH经验地设置为所有视图之间距离的中值;where vc is the center view of the hyperedge, vx is one of the K views closest to vc, d ( vx , vc ) is the distance between vx and vc , and σH is empirically set as median of distances between all views; 关联矩阵H通过如下公式生成:The correlation matrix H is generated by the following formula:
Figure FDA0003151813290000023
Figure FDA0003151813290000023
其中,h(vH,eH)为关联矩阵H中的数据,顶点vH∈VH,边eH∈EHAmong them, h(v H , e H ) is the data in the association matrix H, the vertex v H ∈ V H , and the edge e H ∈ E H ; 顶点vH的度定义为:The degree of vertex v H is defined as:
Figure FDA0003151813290000024
Figure FDA0003151813290000024
其中,w(eH)∈W;where w(e H )∈W; 边eH的度定义为:The degree of edge e H is defined as:
Figure FDA0003151813290000031
Figure FDA0003151813290000031
7.根据权利要求1所述的一种基于学习的相似性度量方法,其特征在于,所述设定初始的学习框架,即把基于视图的超图和基于立体模型的图模型两种模式设为相同的权重具体为;7. a kind of similarity measurement method based on learning according to claim 1, is characterized in that, described setting initial learning frame, namely setting two modes of the hypergraph based on view and the graph model based on solid model. For the same weight specifically; 设定初始学习框架,对两种数据设置相同的权重,学习过程用以下目标函数表示:The initial learning framework is set, and the same weights are set for the two kinds of data. The learning process is represented by the following objective function:
Figure FDA0003151813290000032
Figure FDA0003151813290000032
在该公式中,f是待学习的相关性向量,ΩV(f)是基于视图的超图结构上的正则化项,ΩM(f)是基于立体模型的图结构上的正则化项,R(f)是经验损失,μ>0为加权参数。In this formula, f is the correlation vector to be learned, Ω V (f) is the regularization term on the view-based hypergraph structure, Ω M (f) is the regularization term on the stereo model-based graph structure, R(f) is the empirical loss, and μ>0 is the weighting parameter.
8.根据权利要求7所述的一种基于学习的相似性度量方法,其特征在于,所述目标函数进一步修改为:8. a kind of similarity measurement method based on learning according to claim 7, is characterized in that, described objective function is further modified as:
Figure FDA0003151813290000033
Figure FDA0003151813290000033
计算过程中,令:During the calculation, let:
Figure FDA0003151813290000034
Figure FDA0003151813290000034
Figure FDA0003151813290000035
Figure FDA0003151813290000035
则:but: ΔH=I-ΘH ΔH =I- ΘH ΔS=I-ΘS ΔS = I- ΘS 其中,ΔH和ΔS看作为超图的拉普拉斯算子;y为初始标签向量;η>0为加权参数,H为关联矩阵;W为边权重的对角矩阵;Dv为顶点度的对角矩阵;De为边缘度的对角矩阵。Among them, ΔH and ΔS are regarded as the Laplace operator of the hypergraph; y is the initial label vector; η>0 is the weighting parameter, H is the correlation matrix; W is the diagonal matrix of edge weights; D v is the vertex The diagonal matrix of degrees; De is the diagonal matrix of edge degrees.
CN201810879213.4A 2018-08-03 2018-08-03 Similarity measurement method based on learning Active CN109272013B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810879213.4A CN109272013B (en) 2018-08-03 2018-08-03 Similarity measurement method based on learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810879213.4A CN109272013B (en) 2018-08-03 2018-08-03 Similarity measurement method based on learning

Publications (2)

Publication Number Publication Date
CN109272013A CN109272013A (en) 2019-01-25
CN109272013B true CN109272013B (en) 2021-09-14

Family

ID=65153103

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810879213.4A Active CN109272013B (en) 2018-08-03 2018-08-03 Similarity measurement method based on learning

Country Status (1)

Country Link
CN (1) CN109272013B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114049357B (en) * 2021-09-08 2024-06-14 匀熵科技(无锡)有限公司 Mammary gland ultrasonic segmentation method based on feature set association degree

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104268140A (en) * 2014-07-31 2015-01-07 浙江大学 Image retrieval method based on weight learning hypergraphs and multivariate information combination
CN106951501A (en) * 2017-03-16 2017-07-14 天津大学 A kind of method for searching three-dimension model based on many figure matchings

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1586020A2 (en) * 2003-01-25 2005-10-19 Purdue Research Foundation Methods, systems, and data structures for performing searches on three dimensional objects

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104268140A (en) * 2014-07-31 2015-01-07 浙江大学 Image retrieval method based on weight learning hypergraphs and multivariate information combination
CN106951501A (en) * 2017-03-16 2017-07-14 天津大学 A kind of method for searching three-dimension model based on many figure matchings

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
3D models retrieval algorithm based on multimodal data;Anan Liu et al.;《Neurocomputing》;20170214;第1节,第3节 *
Graph-based characteristic view set extraction and matching for 3D model retrieval;Anan Liu et al.;《Information Sciences》;20150509;摘要,图1 *
基于多特征融合的三维模型检索算法;周燕 等;《计算机科学》;20160731;第43卷(第7期);第2.3节,第3节 *

Also Published As

Publication number Publication date
CN109272013A (en) 2019-01-25

Similar Documents

Publication Publication Date Title
Yang et al. Content-based 3-D model retrieval: A survey
JP2023540917A (en) 3D reconstruction and related interactions, measurement methods and related devices and equipment
CN101477529B (en) Three-dimensional object retrieval method and apparatus
CN110019914B (en) Three-dimensional model database retrieval method supporting three-dimensional scene interaction
CN104751511B (en) A kind of three-dimensional scene construction method and device
JP2012043419A (en) Apparatus and method for three-dimensional model retrieval, and apparatus and method for model library generation
Zhou et al. 3D shape classification and retrieval based on polar view
JP2002288687A (en) Apparatus and method for calculating feature amount
Zhou et al. 2D compressive sensing and multi-feature fusion for effective 3D shape retrieval
Akgül et al. Density-based 3D shape descriptors
Peng et al. ZS-SBPRnet: A zero-shot sketch-based point cloud retrieval network based on feature projection and cross-reconstruction
TW202201249A (en) Device, method and program of generating two-dimensional mapping wherein the device includes a two-dimensional processing unit and a two-dimensional mapping generation unit
CN109272013B (en) Similarity measurement method based on learning
CN114971802A (en) Commodity recommendation method and device based on BIM (building information modeling), electronic equipment and medium
Yang et al. 3D model retrieval using constructive-learning for cross-model correlation
CN111597367B (en) Three-dimensional model retrieval method based on view and hash algorithm
Zou et al. A 3D model feature extraction method using curvature-based shape distribution
Varga et al. Template matching for 3d objects in large point clouds using dbms
Wang et al. A 3D shape descriptor based on spherical harmonics through evolutionary optimization
Pu et al. Invariant feature extraction for 3D model retrieval: An adaptive approach using Euclidean and topological metrics
CN111414802A (en) Protein data feature extraction method
Hou et al. Dynamic query interface for 3D shape search
CN106845543B (en) Three-dimensional model classification method based on feature matching
Liu et al. An approach to 3D building model retrieval based on topology structure and view feature
Tan et al. Local features and manifold ranking coupled method for sketch-based 3D model retrieval

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant