Disclosure of Invention
In view of the defects of the prior art, the present invention provides a low-dimensional bundle adjustment calculation method and system. The invention expresses the depths of field of the multiple views as functions of the relative motion parameters of every two views, recovers the motion parameters directly from the multiple views, and then obtains the three-dimensional scene point coordinates analytically from the motion parameters.
The invention provides a low-dimensional bundle adjustment calculation method, which comprises the following steps:
step 1: determining an initial value of the motion parameter;
step 2: minimizing the objective function of the motion parameters to obtain optimized motion parameters;
step 3: calculating the coordinates of the three-dimensional scene points according to the optimized motion parameters.
Preferably, the step 1 comprises the steps of:
step 1.1: for the dual view formed by the j-th view and the (j+1)-th view, j = 1, 2, ..., n-1, a Direct Linear Transformation (DLT) algorithm is applied to the image feature points corresponding to the common matching feature point set {j, j+1} on the dual view to solve the relative pose (R_{j,j+1}, t_{j,j+1}) of the (j+1)-th view relative to the j-th view;
Wherein:
n is the number of views participating in bundle adjustment;
R_{j,j+1} represents the relative attitude of the (j+1)-th view relative to the j-th view;
t_{j,j+1} is the unit relative displacement vector of the (j+1)-th view relative to the j-th view, i.e. ||t_{j,j+1}|| = 1;
calculating the three-dimensional coordinates of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1} in the j-th view coordinate system,
and the three-dimensional coordinates of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1} in the (j+1)-th view coordinate system;
Wherein:
i = 1, 2, ..., m_{(j,j+1)};
m_{(j,j+1)} represents the number of matched image point pairs in the dual view formed by the j-th view and the (j+1)-th view;
the normalized image point coordinates, on the j-th view, of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1};
the normalized image point coordinates, on the (j+1)-th view, of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1};
the three-dimensional coordinates, in the j-th view coordinate system, of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1};
the three-dimensional coordinates, in the (j+1)-th view coordinate system, of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1};
step 1.2: without loss of generality, fix ||T_{1,2}|| = 1; for the three-view formed by the (j-1)-th, j-th and (j+1)-th views, j = 2, 3, ..., n-1, calculate the relative displacement scale ||T_{j,j+1}||/||T_{j-1,j}|| using the common matching feature point set {j-1, j, j+1}, obtaining the relative displacement vector T_{j,j+1} with a uniform scale:
T_{j,j+1} = ||T_{j,j+1}|| t_{j,j+1};
Wherein:
T_{1,2} is the relative displacement vector of the 2nd view relative to the 1st view;
T_{j,j+1} is the relative displacement vector of the (j+1)-th view relative to the j-th view;
T_{j-1,j} is the relative displacement vector of the j-th view relative to the (j-1)-th view;
m_{(j-1,j,j+1)} represents the number of common matching image point pairs in the three-view formed by the (j-1)-th, j-th and (j+1)-th views;
representing the three-dimensional coordinates, in the j-th view coordinate system, of the i-th matching image point pair corresponding to the common matching feature point set {j-1, j} on the (j-1)-th and j-th views;
representing the three-dimensional coordinates, in the j-th view coordinate system, of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1} on the j-th and (j+1)-th views;
t_{j,j+1} is the unit relative displacement vector of the (j+1)-th view relative to the j-th view;
step 1.3: from the absolute pose (R_j, T_j) of the j-th view, calculate the absolute pose (R_{j+1}, T_{j+1}) of the (j+1)-th view:
R_{j+1} = R_{j,j+1} R_j
T_{j+1} = T_{j,j+1} + R_{j,j+1} T_j
Wherein:
R_j represents the absolute attitude of the j-th view;
R_{j+1} represents the absolute attitude of the (j+1)-th view;
R_{j,j+1} represents the relative attitude of the (j+1)-th view relative to the j-th view;
T_j represents the absolute displacement vector of the j-th view;
T_{j+1} represents the absolute displacement vector of the (j+1)-th view;
T_{j,j+1} represents the relative displacement vector of the (j+1)-th view relative to the j-th view;
with the first view taken as the reference:
(R_1, T_1) ≡ (I_3, 0_{3×1})
wherein:
R_1 represents the absolute attitude of the first view;
T_1 represents the absolute displacement vector of the first view;
I_3 represents the 3×3 identity matrix;
0_{3×1} represents the 3×1 zero vector.
Preferably, in step 2, the objective function of the motion parameter is specifically as follows:
the motion parameters θ = (R_j, T_j)_{j=1,2,...,n} are optimized by minimizing the objective function δ(θ), in which
e_3 = [0 0 1]^T
wherein:
θ represents the set of absolute pose parameters of all views;
δ(·) represents the minimization objective function;
m_{(j,k)} represents the number of matched image point pairs in the dual view formed by the j-th view and the k-th view;
the normalized image point coordinates, on the k-th view, of the i-th matched image point pair corresponding to the common matching feature point set {j, k};
the normalized image point coordinates, on the j-th view, of the i-th matched image point pair corresponding to the common matching feature point set {j, k};
R_{j,k} is the relative attitude of the k-th view relative to the j-th view;
T_{j,k} is the relative displacement vector of the k-th view relative to the j-th view.
Preferably, the premise of minimizing the objective function δ(θ) of the motion parameters θ = (R_j, T_j)_{j=1,2,...,n} given in step 2 is: the distance from the same three-dimensional scene point to the same view is the same regardless of which dual view it is computed from.
Preferably, the step 3 comprises the steps of:
using the motion parameters θ = (R_j, T_j)_{j=1,2,...,n} obtained by the optimization, for the dual view formed by the j-th view and the k-th view, the three-dimensional scene point coordinates are calculated in a weighted manner, with
T_{j,k} = T_k - R_{j,k} T_j;
wherein:
X_i represents the three-dimensional coordinates of the i-th three-dimensional scene point, and X_i corresponds to the s-th image feature point in the dual view formed by the j-th view and the k-th view;
the identification (indicator) function denotes whether the i-th three-dimensional scene point X_i is visible in the dual view formed by the j-th and k-th views, taking one value when X_i is visible in this dual view and the other value otherwise;
R_j represents the absolute attitude of the j-th view;
T_{j,k} is the relative displacement vector of the k-th view relative to the j-th view;
the normalized image point coordinates, on the j-th view, of the s-th matched image point pair corresponding to the common matching feature point set {j, k};
the normalized image point coordinates, on the k-th view, of the s-th matched image point pair corresponding to the common matching feature point set {j, k};
R_k represents the absolute attitude of the k-th view;
T_j represents the absolute displacement vector of the j-th view;
T_k represents the absolute displacement vector of the k-th view;
R_{j,k} represents the relative attitude of the k-th view relative to the j-th view;
T_{j,k} represents the relative displacement vector of the k-th view relative to the j-th view.
Preferably, the low-dimensional bundle adjustment calculation method considers the situation that the camera is calibrated, and assumes that matching image point pairs between the views have been determined.
According to the present invention, a low-dimensional bundle adjustment calculation system includes a computer readable storage medium storing a computer program, and the computer program, when executed by a processor, implements the steps of the low-dimensional bundle adjustment calculation method described above.
Compared with the prior art, the invention has the following beneficial effects:
the invention is a low-dimensional cluster adjustment method with simple and convenient initialization, good Lu bang property, higher calculation speed and higher calculation precision. The invention can be used as a core calculation engine for unmanned vehicle/unmanned aerial vehicle visual navigation, visual three-dimensional reconstruction, augmented reality and other applications.
Detailed Description
The present invention will be described in detail below with reference to specific embodiments. The following embodiments will help those skilled in the art further understand the invention, but do not limit the invention in any way. It should be noted that various changes and modifications can be made by those skilled in the art without departing from the spirit of the invention; all of these fall within the scope of the present invention.
The invention represents the depth of field as a function of the motion parameters, thereby eliminating the three-dimensional scene point coordinates from the parameter optimization of bundle adjustment. For the case of m three-dimensional scene points and n views, the parameter space is 6n-dimensional, whereas conventional bundle adjustment, which also optimizes the scene points, works in a (6n + 3m)-dimensional parameter space. Compared with the current mainstream methods, the bundle adjustment method provided by the invention therefore greatly reduces the dimensionality of the parameter space.
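The dimension reduction above can be illustrated with a short arithmetic sketch; the view and point counts below are hypothetical, chosen only to make the comparison concrete.

```python
# Illustrative comparison of parameter-space dimensions (the view and
# point counts are hypothetical, chosen only for the arithmetic).
n_views, m_points = 100, 10_000

dim_low = 6 * n_views                  # proposed method: 6n motion parameters
dim_full = 6 * n_views + 3 * m_points  # conventional BA: 6n + 3m parameters

print(dim_low, dim_full)  # 600 30600
```

Even for a modest scene, eliminating the 3m scene-point coordinates shrinks the optimization problem by roughly two orders of magnitude.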
The present invention considers the situation where the camera is calibrated and assumes that matching pairs of image points between the views have been determined.
The following definitions are illustrative of the general forms of the formulae:
let n be the number of views participating in bundle adjustment, numbered sequentially as view 1, view 2, ..., view n;
(R_i, T_i) represents the absolute pose of the i-th view;
R_i represents the absolute attitude of the i-th view;
T_i = ||T_i|| t_i represents the absolute displacement vector of the i-th view;
t_i represents the unit absolute displacement vector of the i-th view, i.e. ||t_i|| = 1;
θ represents the set of absolute pose parameters of all views;
R_{j,k} ≡ R_k R_j^{-1} represents the relative attitude of the k-th view relative to the j-th view;
T_{j,k} ≡ T_k - R_{j,k} T_j represents the relative displacement vector of the k-th view relative to the j-th view;
T_{j,k} = ||T_{j,k}|| t_{j,k}, where t_{j,k} is the unit relative displacement vector of the k-th view relative to the j-th view, i.e. ||t_{j,k}|| = 1;
(R_{j,k}, t_{j,k}) represents the relative pose of the k-th view relative to the j-th view;
{ j } represents all feature point sets on the jth view;
{j, k} represents the common matching feature point set on the j-th and k-th views; {j, k, ...} and so on represent common matching feature point sets on three or more views;
(j, k) represents a dual view composed of the jth view and the kth view;
m_{(j,k)} represents the number of matched image point pairs in the dual view formed by the j-th view and the k-th view;
the image point coordinates of the i-th matching image point pair on the j-th and k-th views of the dual view (j, k) are normalized image point coordinates, i.e. the first two components are the calibrated image point coordinates and the third component is 1.
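The relative-pose definitions above can be sketched as a small helper. Note that the rotation relation R_{j,k} = R_k R_j^T is inferred here from the composition rule R_{j+1} = R_{j,j+1} R_j of step 1.3 (using the orthonormality of rotation matrices); to the extent it is not stated explicitly in the text, it is an assumption.

```python
import numpy as np

def relative_pose(R_j, T_j, R_k, T_k):
    """Relative pose of view k with respect to view j from absolute poses.

    Follows the definitions above: R_{j,k} = R_k R_j^T (assuming
    orthonormal rotation matrices, so R_j^{-1} = R_j^T) and
    T_{j,k} = T_k - R_{j,k} T_j.
    """
    R_jk = R_k @ R_j.T
    T_jk = T_k - R_jk @ T_j
    return R_jk, T_jk
```

By construction the pair (R_{j,k}, T_{j,k}) satisfies the composition rule of step 1.3: R_k = R_{j,k} R_j and T_k = T_{j,k} + R_{j,k} T_j.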
The invention provides a low-dimensional bundling adjustment method, which comprises the following steps:
step 1: determining an initial value of the motion parameter;
step 2: minimizing the objective function of the motion parameters to obtain optimized motion parameters;
step 3: calculating the coordinates of the three-dimensional scene points according to the optimized motion parameters.
The respective steps will be described in detail below.
The step 1 comprises the following steps:
step 1.1: for the dual view formed by the j-th view and the (j+1)-th view, j = 1, 2, ..., n-1, a Direct Linear Transformation (DLT) algorithm is applied to the image feature points corresponding to the common matching feature point set {j, j+1} on the dual view to solve the relative pose (R_{j,j+1}, t_{j,j+1}) of the (j+1)-th view relative to the j-th view;
Wherein:
n is the number of views participating in bundle adjustment;
R_{j,j+1} represents the relative attitude of the (j+1)-th view relative to the j-th view;
t_{j,j+1} is the unit relative displacement vector of the (j+1)-th view relative to the j-th view, i.e. ||t_{j,j+1}|| = 1;
calculating the three-dimensional coordinates of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1} in the j-th view coordinate system,
and the three-dimensional coordinates of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1} in the (j+1)-th view coordinate system;
Wherein:
i = 1, 2, ..., m_{(j,j+1)};
m_{(j,j+1)} represents the number of matched image point pairs in the dual view formed by the j-th view and the (j+1)-th view;
the normalized image point coordinates, on the j-th view, of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1};
the normalized image point coordinates, on the (j+1)-th view, of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1};
the three-dimensional coordinates, in the j-th view coordinate system, of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1};
the three-dimensional coordinates, in the (j+1)-th view coordinate system, of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1};
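The two-view triangulation of step 1.1 can be sketched as follows. This is not the patent's exact formula (which is not reproduced in the text); the least-squares depth recovery below is an assumed, standard realization for normalized image points and a known relative pose.

```python
import numpy as np

def triangulate_pair(x_j, x_k, R, t):
    """Two-view triangulation sketch for step 1.1 (assumed realization,
    not the patent's exact formula).

    x_j, x_k : normalized homogeneous image points (3-vectors, 3rd component 1)
    R, t     : relative pose of view j+1 w.r.t. view j, with ||t|| = 1

    Solves least-squares depths (lam_j, lam_k) such that
        lam_k * x_k  ≈  R @ (lam_j * x_j) + t,
    then returns the 3-D point in both view coordinate systems.
    """
    A = np.column_stack((R @ x_j, -x_k))      # 3x2 system in (lam_j, lam_k)
    lam, *_ = np.linalg.lstsq(A, -t, rcond=None)
    X_j = lam[0] * x_j                        # point in the j-th view frame
    X_k = R @ X_j + t                         # point in the (j+1)-th view frame
    return X_j, X_k
```

With noise-free inputs the linear system is exactly consistent and the recovered depths equal the true ones (up to the unit-baseline scale fixed by ||t|| = 1).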
step 1.2: without loss of generality, fix ||T_{1,2}|| = 1; for the three-view formed by the (j-1)-th, j-th and (j+1)-th views, j = 2, 3, ..., n-1, calculate the relative displacement scale ||T_{j,j+1}||/||T_{j-1,j}|| using the common matching feature point set {j-1, j, j+1}, obtaining the relative displacement vector T_{j,j+1} with a uniform scale:
T_{j,j+1} = ||T_{j,j+1}|| t_{j,j+1};
Wherein:
T_{1,2} is the relative displacement vector of the 2nd view relative to the 1st view;
T_{j,j+1} is the relative displacement vector of the (j+1)-th view relative to the j-th view;
T_{j-1,j} is the relative displacement vector of the j-th view relative to the (j-1)-th view;
m_{(j-1,j,j+1)} represents the number of common matching image point pairs in the three-view formed by the (j-1)-th, j-th and (j+1)-th views;
representing the three-dimensional coordinates, in the j-th view coordinate system, of the i-th matching image point pair corresponding to the common matching feature point set {j-1, j} on the (j-1)-th and j-th views;
representing the three-dimensional coordinates, in the j-th view coordinate system, of the i-th matching image point pair corresponding to the common matching feature point set {j, j+1} on the j-th and (j+1)-th views;
t_{j,j+1} is the unit relative displacement vector of the (j+1)-th view relative to the j-th view;
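The scale recovery of step 1.2 can be sketched as below. The patent's exact formula is not reproduced in the text, so this is one common choice, not necessarily the patent's: since each pairwise triangulation uses a unit baseline, the true coordinates of a common point are ||T_{j-1,j}|| times the (j-1, j) estimate and ||T_{j,j+1}|| times the (j, j+1) estimate; equating the two gives the scale ratio per point, and a median makes the estimate robust.

```python
import numpy as np

def scale_ratio(X_prev, X_next):
    """Sketch of the scale recovery in step 1.2 (assumed realization).

    X_prev : (m, 3) coordinates of the common points in the j-th view
             frame, triangulated from the pair (j-1, j) with unit
             baseline ||t_{j-1,j}|| = 1
    X_next : (m, 3) the same points in the same frame, triangulated
             from the pair (j, j+1) with unit baseline ||t_{j,j+1}|| = 1

    Equating the true coordinates ||T_{j-1,j}|| * X_prev and
    ||T_{j,j+1}|| * X_next gives, per point,
        ||T_{j,j+1}|| / ||T_{j-1,j}|| = ||X_prev_i|| / ||X_next_i||.
    The median over the common points is returned as a robust estimate.
    """
    ratios = np.linalg.norm(X_prev, axis=1) / np.linalg.norm(X_next, axis=1)
    return float(np.median(ratios))
```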
step 1.3: from the absolute pose (R_j, T_j) of the j-th view, calculate the absolute pose (R_{j+1}, T_{j+1}) of the (j+1)-th view:
R_{j+1} = R_{j,j+1} R_j
T_{j+1} = T_{j,j+1} + R_{j,j+1} T_j
Wherein:
R_j represents the absolute attitude of the j-th view;
R_{j+1} represents the absolute attitude of the (j+1)-th view;
R_{j,j+1} represents the relative attitude of the (j+1)-th view relative to the j-th view;
T_j represents the absolute displacement vector of the j-th view;
T_{j+1} represents the absolute displacement vector of the (j+1)-th view;
T_{j,j+1} represents the relative displacement vector of the (j+1)-th view relative to the j-th view;
with the first view taken as the reference:
(R_1, T_1) ≡ (I_3, 0_{3×1})
wherein:
R_1 represents the absolute attitude of the first view;
T_1 represents the absolute displacement vector of the first view;
I_3 represents the 3×3 identity matrix;
0_{3×1} represents the 3×1 zero vector;
it should be noted that:
in step 1.1, j takes the values j = 1, 2, ..., n-1;
in step 1.2, j takes the values j = 2, 3, ..., n-1;
in step 1.3, j takes the values j = 1, 2, ..., n-1.
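The pose chaining of step 1.3 can be sketched directly from the two composition formulas above, starting from the reference pose (R_1, T_1) = (I_3, 0):

```python
import numpy as np

def chain_poses(relative_poses):
    """Chain the pairwise poses of step 1.3 into absolute poses.

    relative_poses : list of (R_{j,j+1}, T_{j,j+1}) for j = 1, ..., n-1
    Returns [(R_1, T_1), ..., (R_n, T_n)] with the first view as
    reference, (R_1, T_1) = (I_3, 0), using
        R_{j+1} = R_{j,j+1} R_j
        T_{j+1} = T_{j,j+1} + R_{j,j+1} T_j
    """
    poses = [(np.eye(3), np.zeros(3))]
    for R_rel, T_rel in relative_poses:
        R_j, T_j = poses[-1]
        poses.append((R_rel @ R_j, T_rel + R_rel @ T_j))
    return poses
```

For example, two identity-rotation steps with translations (1, 0, 0) and (0, 1, 0) accumulate to the absolute displacement (1, 1, 0) for the third view.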
In step 2, the objective function of the motion parameter is specifically as follows:
on the premise that the distance from the same three-dimensional scene point to the same view is the same regardless of which dual view it is computed from, the motion parameters θ = (R_j, T_j)_{j=1,2,...,n} are optimized by minimizing the objective function δ(θ), in which
e_3 = [0 0 1]^T
wherein:
θ represents the set of absolute pose parameters of all views;
δ(·) represents the minimization objective function;
m_{(j,k)} represents the number of matched image point pairs in the dual view formed by the j-th view and the k-th view;
the normalized image point coordinates, on the k-th view, of the i-th matched image point pair corresponding to the common matching feature point set {j, k};
the normalized image point coordinates, on the j-th view, of the i-th matched image point pair corresponding to the common matching feature point set {j, k};
R_{j,k} is the relative attitude of the k-th view relative to the j-th view;
T_{j,k} is the relative displacement vector of the k-th view relative to the j-th view;
Step 2 optimizes the initial values of the motion parameters obtained in step 1 and yields the optimized values of the motion parameters; step 3 then proceeds from these optimized values. Specifically, the step 3 includes the following steps:
using the motion parameters θ = (R_j, T_j)_{j=1,2,...,n} obtained by the optimization, for the dual view formed by the j-th view and the k-th view, the three-dimensional scene point coordinates are calculated in a weighted manner, with
T_{j,k} = T_k - R_{j,k} T_j;
wherein:
X_i represents the three-dimensional coordinates of the i-th three-dimensional scene point, and X_i corresponds to the s-th image feature point in the dual view formed by the j-th view and the k-th view;
the identification (indicator) function denotes whether the i-th three-dimensional scene point X_i is visible in the dual view formed by the j-th and k-th views, taking one value when X_i is visible in this dual view and the other value otherwise;
R_j represents the absolute attitude of the j-th view;
T_{j,k} is the relative displacement vector of the k-th view relative to the j-th view;
the normalized image point coordinates, on the j-th view, of the s-th matched image point pair corresponding to the common matching feature point set {j, k};
the normalized image point coordinates, on the k-th view, of the s-th matched image point pair corresponding to the common matching feature point set {j, k};
R_k represents the absolute attitude of the k-th view;
T_j represents the absolute displacement vector of the j-th view;
T_k represents the absolute displacement vector of the k-th view;
R_{j,k} represents the relative attitude of the k-th view relative to the j-th view;
T_{j,k} represents the relative displacement vector of the k-th view relative to the j-th view.
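The weighted recovery of step 3 can be sketched as follows. The patent's weighting formula is not reproduced in the text, so a plain visibility-weighted mean over the per-dual-view estimates is used here as a placeholder, not as the patent's exact rule.

```python
import numpy as np

def fuse_point(estimates, visible):
    """Sketch of the weighted scene-point recovery in step 3
    (assumed realization: a visibility-weighted mean).

    estimates : (p, 3) triangulations of the same scene point from p
                dual views, already transformed into the world frame
    visible   : (p,) indicator, 1 if the point is visible in that
                dual view and 0 otherwise
    """
    w = np.asarray(visible, dtype=float)
    est = np.asarray(estimates, dtype=float)
    # Weighted mean over the dual views in which the point is visible.
    return (w[:, None] * est).sum(axis=0) / w.sum()
```

Estimates from dual views in which the point is not visible receive weight 0 and so do not influence the fused coordinates.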
The foregoing description of specific embodiments of the present invention has been presented. It is to be understood that the present invention is not limited to the specific embodiments described above, and that various changes or modifications may be made by one skilled in the art within the scope of the appended claims without departing from the spirit of the invention. The embodiments and features of the embodiments of the present application may be combined with each other arbitrarily without conflict.