JP2008042424A

JP2008042424A - Image matching apparatus and image matching method

Info

Publication number: JP2008042424A
Application number: JP2006212743A
Authority: JP
Inventors: Isao Karube; 軽部勲; Masahiro Kageyama; 影山昌広; Shigeki Nagaya; 長屋茂喜; Tomokazu Murakami; 村上智一; Hiroo Ito; 伊藤浩朗; Masashi Takahashi; 高橋昌史
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2006-08-04
Filing date: 2006-08-04
Publication date: 2008-02-21

Abstract

【課題】
より高精度な画像マッチングをより少ない演算量でより行う。
【解決手段】
整数画素精度で画像マッチングをおこなう整数画素精度マッチング部と、所定の軸方向における複数の画像マッチングの類似度を求め、該類似度から求めたフィッティング関数をもちいることにより、複数の画像マッチング位置と該位置での類似度を算出し、該複数の画像マッチング位置と該位置での類似度を比較することにより、一の画像マッチング位置を求める小数画素精度で画像マッチングをおこなう小数画素精度マッチング部とを備える。
【選択図】図１
【Task】
More accurate image matching is performed with a smaller amount of computation.
[Solution]
An integer pixel accuracy matching unit that performs image matching with integer pixel accuracy, and a plurality of image matching similarities in a predetermined axial direction are obtained, and a plurality of image matching positions are obtained by using a fitting function obtained from the similarity. A fractional pixel accuracy matching unit that calculates the degree of similarity at the position and compares the plurality of image matching positions with the degree of similarity at the position to perform image matching with a fractional pixel precision to obtain one image matching position; Is provided.
[Selection] Figure 1

Description

本発明は、ディジタル動画像の符号化技術に関するものである。 The present invention relates to a digital moving image encoding technique.

画像処理の分野の画像の高精度な位置合わせ技術として、パラボラフィッティングや等角直線フィッティングといった、コストの分布をフィッティング関数で近似し、それによってコスト最小画素を算出する方式が特許文献1に開示されている。 Patent Document 1 discloses a method for approximating a cost distribution with a fitting function, such as parabolic fitting or equiangular straight line fitting, and calculating a minimum cost pixel by using a fitting function as a high-precision positioning technique for images in the field of image processing. ing.

また、動画像符号化における画像ブロックマッチング方法として、複数タップのフィルタによって小数精度の画素値を補完したのち、それらの画素に対してブロックマッチングを行う方式である。複数タップによる補完画像の作成方法は、例えば非特許文献1(H.264 /AVC)に開示されている。 In addition, as an image block matching method in moving image encoding, a pixel value with decimal precision is complemented by a multi-tap filter, and then block matching is performed on these pixels. A method of creating a complementary image using a plurality of taps is disclosed, for example, in Non-Patent Document 1 (H.264 / AVC).

特願2004-525640, ”画像のサブピクセルマッチングにおける多パラメータ高精度同時推定方法及び多パラメータ高精度同時推定プログラム”, 財団法人理工学振興会(清水, 奥富)Japanese Patent Application No. 2004-525640, "Multi-parameter high-accuracy simultaneous estimation method and multi-parameter high-accuracy simultaneous estimation program for image sub-pixel matching", Science and Technology Promotion Association (Shimizu, Okutomi) H.264/AVC (ITU-T Recommendation H.264 and ISO/IEC 14496-10 (MPEG-4 part 10) Advanced Video Coding), JVT(Joint Video Team)H.264 / AVC (ITU-T Recommendation H.264 and ISO / IEC 14496-10 (MPEG-4 part 10) Advanced Video Coding), JVT (Joint Video Team)

特許文献1ではコスト最小画素を更に正確に求めるために、水平、垂直軸に対してそれぞれ複数の小数画素精度の位置にある画素をフィッティング関数を用いて求め、軸ごとにそれらの小数精度画素に対する近似直線を作成し、その交点を最終的に採用する小数精度画素とするが、各軸毎に小数精度画素を複数求め、更にそれらの近似直線を求める処理が含まれ、演算量が膨大になってしまう問題点があった。 In Patent Document 1, in order to obtain the minimum cost pixel more accurately, pixels at a plurality of decimal pixel precision positions with respect to the horizontal and vertical axes are obtained using a fitting function, and the decimal precision pixels for each axis are obtained. Create an approximate straight line and use the intersection as the decimal precision pixel that will eventually be adopted, but it includes the process of obtaining multiple decimal precision pixels for each axis and further obtaining those approximate straight lines, resulting in an enormous amount of computation. There was a problem.

一方、非特許文献１における方法は、複数タップのフィルタによって画素値を補間し、ブロックマッチングを行っていたため、補間処理によって非常に処理量が大きくなる問題があった。 On the other hand, the method in Non-Patent Document 1 has a problem that the amount of processing becomes very large due to interpolation processing because pixel values are interpolated by a multi-tap filter and block matching is performed.

本発明は、上記課題に鑑みて為されたものであり、その目的は、高精度の画像マッチングをより少ない演算量で実現することにある。 The present invention has been made in view of the above problems, and an object thereof is to realize high-precision image matching with a smaller amount of calculation.

上記目的を達成するために、本発明は、整数画素精度で画像マッチングをおこなう整数画素精度マッチング部と、小数画素精度で画像マッチングをおこなう小数画素精度マッチング部とを備えることを特徴とする。 In order to achieve the above object, the present invention includes an integer pixel accuracy matching unit that performs image matching with integer pixel accuracy, and a decimal pixel accuracy matching unit that performs image matching with decimal pixel accuracy.

本発明によれば、高精度の画像マッチングをより少ない演算量で実現することがである。 According to the present invention, highly accurate image matching can be realized with a smaller amount of calculation.

以下に発明を実施するための実施の形態を図面を参照しつつ説明する。なお、各図面において、同一の符号が付されている構成要素は同一の機能を有することとする。 Embodiments for carrying out the invention will be described below with reference to the drawings. In addition, in each drawing, the component to which the same code | symbol is attached | subjected shall have the same function.

まず、図5を用いて、本発明が適用される画像記憶再生装置の一例を説明する。この画像記憶再生装置は、例えば画像を撮像してテープや光ディスクなどの記憶媒体に記憶するビデオカメラ（そのような撮影機能を備えた携帯電話も含む）、または、例えば受信したテレビジョン放送を録画するためのハードディスクレコーダやDVDレコーダ、もしくはそれらのレコーダが内蔵されたテレビジョン装置などである。すなわち、本発明は、ビデオカメラ、ハードディスクレコーダ、DVDレコーダ、テレビジョン装置等に適用され得るものである。勿論、これ以外の、動画像を符号化して録画する機能を有する装置であれば同様に適用できる。 First, an example of an image storage / playback apparatus to which the present invention is applied will be described with reference to FIG. This image storage / playback apparatus records a video camera (including a mobile phone having such a photographing function) that captures an image and stores it in a storage medium such as a tape or an optical disc, or a received television broadcast, for example. For example, a hard disk recorder or a DVD recorder, or a television apparatus incorporating such a recorder. That is, the present invention can be applied to a video camera, a hard disk recorder, a DVD recorder, a television device, and the like. Of course, any other apparatus having a function of encoding and recording a moving image can be applied in the same manner.

図５において、入力端子INからは、例えばCCD等の撮像素子で撮像された画像、または受信したデジタルテレビジョン放送の画像が入力される。この入力画像は動画像であるものとする。入力画像は、符号化部501でリアルタイムに符号化されて記憶媒体502に供給される。この符号化部における符号化処理の詳細については後述するものとする。記憶媒体502は、符号化された動画像を記憶するものであり、例えば光ディスク、ハードディスク、フラッシュメモリなどの半導体メモリ、テープレコーダ等で構成される。復号部503は、記憶媒体502に記憶された符号化画像を復号し、例えばRGBまたはコンポーネント(Y/Cb/Cr)形式のビデオ信号を生成する。このビデオ信号は、必要に応じ、例えばLCDやPDPなどの表示素子を有する表示部504に供給される。表示部504は、供給されたビデオ信号に基づき、記憶媒体２から再生された動画像を表示する。 In FIG. 5, from an input terminal IN, for example, an image captured by an imaging device such as a CCD or a received digital television broadcast image is input. This input image is assumed to be a moving image. The input image is encoded in real time by the encoding unit 501 and supplied to the storage medium 502. Details of the encoding process in this encoding unit will be described later. The storage medium 502 stores encoded moving images, and includes, for example, a semiconductor memory such as an optical disk, a hard disk, and a flash memory, a tape recorder, and the like. The decoding unit 503 decodes the encoded image stored in the storage medium 502 and generates, for example, a video signal in RGB or component (Y / Cb / Cr) format. This video signal is supplied to a display unit 504 having a display element such as an LCD or a PDP as necessary. The display unit 504 displays a moving image reproduced from the storage medium 2 based on the supplied video signal.

次に、図１を用いて上記符号化部501の詳細について説明する。図１は、本発明に係る動画像符号化装置の一構成例を示しており、フィッティング関数を用いて小数画素精度のベクトル推定を行うものである。入力部100に入力された入力画像101は、減算器102、フィッティング関数を利用した動き推定部111、及び動きを補償部113にそれぞれ供給される。まず、減算器102では、入力部100から入力された入力画像101の一ブロックの画像と動き補償部113からの出力である予測ブロック114との差分がとられ、誤差信号として出力される。この誤差信号は、変換器103に入力され、DCT(離散コサイン変換)係数に変換される。変換器103から出力されたDCT係数は、量子化器104で量子化されて量子化変換係数105が生成される。このとき、量子化変換係数105と共に、誤差信号の符号量も出力される。量子化変換係数105は、伝送情報として多重化器116に供給される。また、量子化変換係数105は、フレーム間の予測画像を合成するための情報としても用いられる。すなわち、量子化変換係数105は逆量子化器106で逆量子化され、逆変換器107で逆変換された後、動き補償部113からの出力画像114と加算器108で加算され、現フレームの復号画像としてフレームメモリ109に記憶される。これによって、現フレームの画像は1フレーム分の時間だけ遅延され、前フレーム画像110として動き推定部111へ供給される。 Next, details of the encoding unit 501 will be described with reference to FIG. FIG. 1 shows an example of the configuration of a moving picture encoding apparatus according to the present invention, which performs vector estimation with decimal pixel accuracy using a fitting function. The input image 101 input to the input unit 100 is supplied to the subtractor 102, the motion estimation unit 111 using the fitting function, and the motion to the compensation unit 113, respectively. First, in the subtracter 102, a difference between one block image of the input image 101 input from the input unit 100 and the prediction block 114 output from the motion compensation unit 113 is obtained and output as an error signal. This error signal is input to the converter 103 and converted into DCT (discrete cosine transform) coefficients. The DCT coefficient output from the converter 103 is quantized by the quantizer 104 to generate a quantized transform coefficient 105. At this time, the code amount of the error signal is also output together with the quantized transform coefficient 105. The quantized transform coefficient 105 is supplied to the multiplexer 116 as transmission information. The quantized transform coefficient 105 is also used as information for combining predicted images between frames. That is, the quantized transform coefficient 105 is inversely quantized by the inverse quantizer 106, inversely transformed by the inverse transformer 107, then added by the adder 108 with the output image 114 from the motion compensation unit 113, and the current frame The decoded image is stored in the frame memory 109. As a result, the current frame image is delayed by one frame time and supplied to the motion estimation unit 111 as a previous frame image 110.

上記動き推定部111は、現フレームの画像である入力画像101と前フレーム画像110とを用いて動きベクトルを推定する処理を行う。動きベクトル推定とは、前フレームの復号画像（参照画像）から対象マクロブロックの内容と類似する部分を検索し（一般的には、前フレームの探索範囲に対して、輝度信号ブロック内の予測誤差信号の絶対値和が小さい部分を選択する）、その動き情報（動きベクトル）及び動き予測モード情報を得るための処理であって、本願ではこれをブロックマッチング（位置合わせ）処理と称する。ブロックマッチング処理における対象マクロブロックと参照画像との類似の度合いの判断処理は、例えば、特開2004-357086号公報に開示される技術を用いればよい。 The motion estimation unit 111 performs a process of estimating a motion vector using the input image 101 and the previous frame image 110 that are images of the current frame. In motion vector estimation, a portion similar to the content of the target macroblock is searched from the decoded image (reference image) of the previous frame (generally, the prediction error in the luminance signal block is compared with the search range of the previous frame. This is processing for obtaining motion information (motion vector) and motion prediction mode information, which is referred to as block matching (positioning) processing in the present application. For example, a technique disclosed in Japanese Patent Application Laid-Open No. 2004-357086 may be used for determining the degree of similarity between the target macroblock and the reference image in the block matching process.

上記動き推定部111は、動きベクトル、残差などの動き情報112を動き補償部113 へ出力する。動き補償部113は予測マクロブロック情報とブロックマッチング処理によって動き情報及び動き予測モード情報115とを生成する。予測マクロブロック画像114は前述のように減算器102に出力され、動き情報及び動き予測モード情報115は多重化器116に供給される。多重化器116は、上記量子化係数105と動き情報及び動き予測モード情報115とを多重化し、符号化する。 The motion estimation unit 111 outputs motion information 112 such as motion vectors and residuals to the motion compensation unit 113. The motion compensation unit 113 generates motion information and motion prediction mode information 115 by predictive macroblock information and block matching processing. The predicted macroblock image 114 is output to the subtractor 102 as described above, and the motion information and motion prediction mode information 115 are supplied to the multiplexer 116. The multiplexer 116 multiplexes and encodes the quantization coefficient 105 and the motion information and motion prediction mode information 115.

本実施形態では、上記動き推定部111における動き推定処理にてコストに対するフィッティング関数の情報を用いることを特徴とするものである。本実施形態における上記動き推定部111は、予め従来方式で求められた整数画素精度のコスト値からフィッティング関数を作成し、小数画素精度での最小コスト画素探索に係る演算量の低減するものである。 The present embodiment is characterized in that information on the fitting function for the cost is used in the motion estimation processing in the motion estimation unit 111. The motion estimation unit 111 in the present embodiment creates a fitting function from the cost value of integer pixel accuracy obtained in advance by the conventional method, and reduces the amount of calculation related to the minimum cost pixel search with decimal pixel accuracy. .

ここで、本願において、コストとは、前述した対象マクロブロックと参照画像との類似の度合い（類似度）を示す数値をいう。また、フィッティング関数とは、座標に与えられた離散的な数点の座標位置における値を通る関数もしくはこれに近似した関数をいう。例えば、座標に与えられた離散的な2点の値に対する直線関数、座標に与えられた離散的な3点の値に対する近似直線関数、座標に与えられた離散的な３点の値に対する２次曲線関数、座標に与えられた離散的な４点の値に対する近似２次曲線関数、座標に与えられた離散的な４点の値のうち２点を通る２対の直線による関数などが挙げられる。 Here, in the present application, the cost refers to a numerical value indicating the degree of similarity (similarity) between the above-described target macroblock and the reference image. The fitting function refers to a function that passes through values at coordinate positions of discrete points given to coordinates, or a function approximate to this. For example, a linear function for a discrete two-point value given to coordinates, an approximate linear function for a discrete three-point value given to coordinates, and a quadratic to a discrete three-point value given to coordinates Curve function, approximate quadratic curve function for discrete four point values given to coordinates, function by two pairs of straight lines passing through two points among discrete four point values given to coordinates, etc. .

ここで、コスト値からフィッティング関数を作成するもしくは求めるとは、コスト値をサンプリング値として、このサンプリング値を通るようにもしくはこのサンプリング値に近似するように、上記のような関数を与えることをいう。 Here, creating or obtaining a fitting function from a cost value means giving the above function so that the cost value is taken as a sampling value and passes through or approximates the sampling value. .

以下、その詳細な動作について説明する。 The detailed operation will be described below.

図2は、本実施形態に係るフィッティング関数を利用した上記動き推定部111にの構成を示している。 FIG. 2 shows a configuration of the motion estimation unit 111 using the fitting function according to the present embodiment.

入力画像101は複数のブロック（マクロブロック）に分割されて処理されるものとする。まず、整数画素精度動き推定部201において入力画像101のブロックに関して従来方式のマッチングを行い、整数画素精度の探索におけるコスト最小点を求める。次に、整数画素精度動き推定部201は整数画素精度のコスト最小点の情報202を出力し、小数画素精度動き推定部203にて整数画素精度のコスト最小点の情報202の周辺における小数画素精度のコスト最小画素を算出する。なお、本実施例では従来方式のマッチングにて整数画素精度のコスト最小点を算出し、小数画素精度のコスト最小画素をフィッティング関数を利用して算出するが、フィッティング関数を利用するのは小数画素に限るものではない。つまり、最初に何画素かおきにコストを作成した後、それらのコストから整数画素精度のコスト最小画素の位置を推定することも可能である。 Assume that the input image 101 is processed by being divided into a plurality of blocks (macroblocks). First, the integer pixel accuracy motion estimation unit 201 performs matching according to the conventional method for the block of the input image 101 to obtain the minimum cost point in the search for integer pixel accuracy. Next, the integer pixel accuracy motion estimator 201 outputs the integer pixel accuracy cost minimum point information 202, and the fractional pixel accuracy motion estimator 203 outputs the fractional pixel accuracy around the integer pixel accuracy cost minimum point information 202. The minimum cost pixel is calculated. In this embodiment, the minimum cost of integer pixel accuracy is calculated by matching in the conventional method, and the minimum cost pixel of decimal pixel accuracy is calculated using a fitting function, but the fitting function is used for decimal pixels. It is not limited to. That is, it is also possible to estimate the position of the minimum cost pixel with integer pixel accuracy from these costs after first creating a cost every several pixels.

なお、図2は動画像符号化装置の動き推定部として説明したが、画像マッチング装置として用いる場合は、整数画素精度動き推定部201は整数画素精度画像マッチング部、小数画素精度動き推定部203は小数画素画像マッチング部とすればよい。 2 has been described as the motion estimation unit of the video encoding device, but when used as an image matching device, the integer pixel accuracy motion estimation unit 201 is an integer pixel accuracy image matching unit, the decimal pixel accuracy motion estimation unit 203 is A decimal pixel image matching unit may be used.

次に、図3において、水平軸方向に対するフィッティング関数を利用したコスト最小画素算出の方式を説明する。301は入力画像、302は参照画像を表している。ここで、入力画像内のあるブロック303についてコスト最小となる画素を求めるとする。所定の範囲において、ブロック303と最もマッチするブロックが304であった場合、そのブロックにおけるコスト値と、そのブロックを水平方向に±１画素ずつ移動させたブロック(それぞれ305、306)におけるコストからフィッティング関数307を作成する。例えば、２次曲線関数であるフィッティング関数を用いる。このフィッティング関数の最小値の座標308を最小コストを与える画素の水平軸の座標として採用する。また、垂直軸に関しても図3 と同様の処理にて最小画素の座標を求め、水平、垂直軸の値から一意に決まる画素をコスト採用点として符号化を進める。ここではフィッテング関数の最小点をコスト最小点としたが、フィッティング関数を利用したコスト最小点の求め方は例えば2直線の交点など、関数の最小点以外の方法によって求めることも可能である。また、この例では±1画素ずつずらしたブロックでのコストに対してフィッティングを行うが、フィッティングを適用するブロックはそれに限るものではない。 Next, referring to FIG. 3, a method for calculating the minimum cost pixel using a fitting function in the horizontal axis direction will be described. 301 represents an input image, and 302 represents a reference image. Here, it is assumed that the pixel having the minimum cost is obtained for a certain block 303 in the input image. If the block that best matches the block 303 within a given range is 304, fitting is performed from the cost value of that block and the cost of the block (305 and 306, respectively) that moved the block by ± 1 pixel in the horizontal direction. Create function 307. For example, a fitting function that is a quadratic curve function is used. The coordinate 308 of the minimum value of this fitting function is adopted as the coordinate of the horizontal axis of the pixel that gives the minimum cost. Also, with respect to the vertical axis, the coordinates of the minimum pixel are obtained by the same processing as in FIG. Here, the minimum point of the fitting function is the minimum cost point. However, the minimum cost point using the fitting function can be determined by a method other than the minimum point of the function, such as an intersection of two straight lines. In this example, fitting is performed for the cost of blocks shifted by ± 1 pixel. However, the block to which the fitting is applied is not limited thereto.

また、図4に図２の小数画素精度動き推定部203におけるフィッティング関数を用いた小数画素精度動き推定のフローチャートを示す。小数画素精度動き推定部203では入力された従来方式にて求められた整数画素精度のコスト最小点の情報をもとに、ステップ400にて小数画素精度動き推定を開始する。次に、ステップ401にて水平軸に関して周辺画素のコスト値を計算し、ステップ403でフィッティング関数を用いてコストが最小になると推定される座標を決定する。垂直軸に関してもステップ405、ステップ407にて、同様の処理を行い、ステップ409にてステップ403から及びステップ407の出力情報から、最終的なコスト最小の推定点の水平軸方向の座標位置、垂直軸方向の座標位置を決定する。例えば、このとき、各軸上におけるフィッティング関数を用いてコストが最小になると推定される座標のうち、コストの小さい一方の座標を、最終的なコスト最小の推定点の座標としてもよい。また、最終的なコスト最小の推定点の座標の水平軸方向の座標値を、ステップ403から取得した垂直軸上のコストが最小位置座標とし、最終的なコスト最小の推定点の座標の垂直軸方向の座標値を、ステップ407から取得した垂直軸上のコストが最小位置座標としてもよい。 FIG. 4 shows a flowchart of sub-pixel accuracy motion estimation using a fitting function in the sub-pixel accuracy motion estimation unit 203 of FIG. The fractional pixel accuracy motion estimation unit 203 starts the fractional pixel accuracy motion estimation in step 400 based on the input information on the minimum cost of integer pixel accuracy obtained by the conventional method. Next, in step 401, cost values of neighboring pixels are calculated with respect to the horizontal axis, and in step 403, coordinates estimated to minimize the cost are determined using a fitting function. Regarding the vertical axis, the same processing is performed in step 405 and step 407, and in step 409, from the output information of step 403 and step 407, the coordinate position in the horizontal axis direction of the final estimated point of the minimum cost is determined to be vertical. Determine the axial coordinate position. For example, at this time, out of the coordinates estimated to minimize the cost using the fitting function on each axis, one of the coordinates having the smallest cost may be set as the coordinate of the estimated point having the smallest cost. In addition, the coordinate value in the horizontal axis direction of the coordinates of the estimated point of the lowest cost is the minimum position coordinate on the vertical axis acquired from step 403, and the vertical axis of the coordinate of the estimated point of the lowest cost The cost on the vertical axis acquired from step 407 may be the minimum position coordinate for the direction coordinate value.

なお、図４のフローチャートでは、水平軸と垂直軸との処理を並列に処理するステップを示したが、これらを直列に処理するステップとしても構わない。 In the flowchart of FIG. 4, the steps of processing the horizontal axis and the vertical axis in parallel are shown, but these steps may be processed in series.

上記までにフィッティング関数を利用した小数画素精度でのコスト最小点の求め方を説明したが、それだけでは、コストの分布が異方性を有し、水平軸及び垂直軸とに沿っていない場合に必ずしもコストが最小となる画素が選択されないという問題点がある。 Up to this point, the method of obtaining the minimum cost point with decimal pixel accuracy using the fitting function has been described. However, it is only that when the cost distribution has anisotropy and is not along the horizontal axis and the vertical axis. There is a problem that the pixel with the lowest cost is not necessarily selected.

すなわち、図6aに、コストの分布に異方性があり、水平軸及び垂直軸とに沿っていない場合の模式図を示す。図6aは異方性のあるコスト分布に対して、701は水平軸、垂直軸を利用してコスト最小点を推定した場合を示している。703はコスト分布の等高線で、ここでは楕円の中心方向にコストが小さくなっているとする。また705は推定によって求めたコスト最小値の画素、706は実際にコストが最小となる点を示している。 That is, FIG. 6a shows a schematic diagram when the cost distribution is anisotropic and not along the horizontal axis and the vertical axis. FIG. 6a shows the case where the cost minimum point is estimated using the horizontal axis and the vertical axis for the anisotropic cost distribution. Reference numeral 703 denotes a contour line of the cost distribution. Here, it is assumed that the cost is reduced toward the center of the ellipse. Reference numeral 705 denotes a pixel having a minimum cost value obtained by estimation, and reference numeral 706 denotes a point at which the cost is actually minimized.

これに対し、本実施例における小数画素精度動き推定では、水平軸、垂直軸軸以外を用いたコスト最小点を推定方法も用いる。例えば、図6bに示されるような45度傾けた軸を用いる。図6bにおいて707は推定によって求めたコスト最小値の画素を示している。 On the other hand, in the fractional pixel precision motion estimation in the present embodiment, a cost minimum point estimation method using other than the horizontal axis and the vertical axis is also used. For example, an axis inclined 45 degrees as shown in FIG. 6b is used. In FIG. 6b, reference numeral 707 denotes a pixel having the minimum cost value obtained by estimation.

図6aでは推定値705と実際の値706とが大きくずれているのに対し、図6bではコストの分布が軸に近く、推定値707と実際の値706が近い位置にあるため、図6a水平軸、垂直軸を利用する場合よりもの高い精度でコスト最小点を求められることが出来る。 In FIG. 6a, the estimated value 705 and the actual value 706 are significantly different from each other, whereas in FIG. 6b, the cost distribution is close to the axis, and the estimated value 707 and the actual value 706 are close to each other. It is possible to obtain the minimum cost point with higher accuracy than when using the axis and the vertical axis.

また、図6cのようにさらに軸を45度以外の角度に傾けた場合も、また、異なるコスト最小値と、コスト最小値の画素位置708が取得される。 Further, when the axis is further tilted to an angle other than 45 degrees as shown in FIG. 6c, the pixel position 708 having a different minimum cost value and a minimum cost value is also acquired.

なお、図6a,b,cでは、2軸においてフィッティング関数を求め、各軸におけるコスト最小の小さいほうの画素をコスト最小の画素位置として推定した。しかし、各軸におけるコスト最小値の位置を各軸方向の座標値とする画素の位置をコスト最小の画素位置として推定しても構わない。 In FIGS. 6a, b, and c, the fitting function is obtained on the two axes, and the smaller pixel with the smallest cost on each axis is estimated as the pixel position with the smallest cost. However, the position of the pixel having the position of the minimum cost value on each axis as the coordinate value in the direction of each axis may be estimated as the pixel position having the minimum cost.

なお、図6では互いに直交した2軸を用いて説明したが、直交しない2軸を用いてもよい。 In FIG. 6, the description is made using two axes orthogonal to each other, but two axes that are not orthogonal may be used.

このように、異方性のあるコスト分布においては、フィッティング関数を適用する軸をいずれの軸とするかにより、異なる最小コスト値と、最小コスト値が画素位置が得られることが分かる。 Thus, it can be seen that in an anisotropic cost distribution, pixel positions can be obtained with different minimum cost values and minimum cost values depending on which axis is used as the axis to which the fitting function is applied.

このように本実施例では従来の水平軸、垂直軸方向だけでなく、別の角度においても隣接した画素を用いてコスト最小画素を推定する。さらに、各方向においてコスト最小画素を推定した結果のうちコストの小さい画素を採用する。これにより、特定の2軸におけるフィッティング関数を用いた小数画素精度の推定よりも、より高精度な推定が可能となる。 As described above, in this embodiment, the minimum cost pixel is estimated using not only the conventional horizontal axis and vertical axis directions but also adjacent pixels at different angles. Further, a pixel having a low cost is adopted among the results of estimating the minimum cost pixel in each direction. Thereby, estimation with higher accuracy than estimation of decimal pixel accuracy using a fitting function in two specific axes can be performed.

次に、水平、垂直の軸に対して45度傾いた軸方向に対するフィッティング関数を利用したコスト最小画素算出の方式を図7を用いて説明する。ここでは水平、垂直の軸に対して45度の方向にて隣接する画素についてコストのフィッティング関数を求め、各方向で求まったコスト最小画素を基に、座標を決定するものである。801は入力画像、802は参照画像を表している。ここで、入力画像内のあるブロック803についてコスト最小となる画素を求めるとする。所定の範囲において、ブロック803と最もマッチするブロックが304であった場合、そのブロックにおけるコスト値と、そのブロックを±１画素ずつ水平+45度方向に移動させたブロック(それぞれ805、806)におけるコストからフィッティング関数307を作成する。このフィッティング関数の最小値の座標308を最小コストを与える画素の水平+45度方向軸の座標として採用する。また、垂直＋45度方向軸に関しても図7 と同様の処理にて最小画素の座標を求め、水平、垂直軸の値から一意に決まる画素をコスト採用点として符号化を進める。 Next, a method for calculating the minimum cost pixel using a fitting function for an axial direction inclined by 45 degrees with respect to the horizontal and vertical axes will be described with reference to FIG. Here, a cost fitting function is obtained for pixels adjacent in the direction of 45 degrees with respect to the horizontal and vertical axes, and coordinates are determined based on the minimum cost pixel obtained in each direction. Reference numeral 801 denotes an input image, and 802 denotes a reference image. Here, it is assumed that a pixel having a minimum cost is obtained for a certain block 803 in the input image. In the predetermined range, if the block that best matches the block 803 is 304, the cost value in that block and the block in which the block is moved in the horizontal +45 degree direction by ± 1 pixel (805 and 806 respectively) A fitting function 307 is created from the cost. The coordinate 308 of the minimum value of this fitting function is adopted as the coordinate of the horizontal +45 degree direction axis of the pixel giving the minimum cost. Also, with respect to the vertical +45 degree direction axis, the coordinates of the minimum pixel are obtained by the same processing as in FIG.

ここではフィッティング関数の最小点をコスト最小点としたが、フィッティング関数を利用したコスト最小点の求め方は例えば2直線の交点など、関数の最小点以外の方法によって求めることも可能である。また、この例では軸上に隣接した画素分ずらしたブロックでのコストに対してフィッティングを行ったが、フィッティングを適用するブロックはそれに限るものではない。 Here, the minimum point of the fitting function is the minimum cost point, but the method of determining the minimum cost point using the fitting function can also be determined by a method other than the minimum point of the function, such as the intersection of two straight lines. In this example, the fitting is performed on the cost of the block shifted by the adjacent pixels on the axis, but the block to which the fitting is applied is not limited thereto.

なお、上記の説明は、水平、垂直の軸に対して45度傾いた軸方向において説明したが、フィッティング関数を適用できる整数画素を含む軸であれば、例えば図6cのような軸においても同様に行えばよい。すなわちこの場合は、前述のブロック803と最もマッチするブロックのコストと、当該軸の傾き方向に移動させたブロックにおけるコストからフィッティング関数を求めればよい。 In the above description, the axis direction is inclined by 45 degrees with respect to the horizontal and vertical axes. However, as long as the axis includes integer pixels to which the fitting function can be applied, the same applies to the axis as shown in FIG. 6c, for example. You can go to That is, in this case, the fitting function may be obtained from the cost of the block that best matches the block 803 and the cost of the block moved in the inclination direction of the axis.

このようにすれば、水平、垂直及びこれらから45度傾いた軸以外の角度の軸においてもフィッティング関数を求めることが可能となり、より高精度な推定ができる可能性が高くなる。 In this way, it is possible to obtain the fitting function also in the axis other than the horizontal, vertical, and axes tilted by 45 degrees from these, and there is a high possibility that more accurate estimation can be performed.

次に、図２の小数画素精度動き推定部203において、図6bや図6cなどのような水平、垂直以外の軸におけるフィッティング関数を用いた小数画素精度動き推定のフローチャートを示す。 Next, a flowchart of sub-pel accuracy motion estimation using a fitting function in axes other than horizontal and vertical as shown in FIGS. 6b and 6c in the sub-pixel accuracy motion estimation unit 203 of FIG.

まず、ステップ900にて小数画素精度動き推定を開始する。次に、ステップ901では一方の軸(例えば水平+45度方向の軸)に関して近傍画素のコスト演算を行い、ステップ903で軸上の座標を求める。ステップ905ではもう一方の軸(例えば垂直+45度方向の軸)に関して同様に近傍画素のコスト演算を行い、ステップ907にて軸上の座標を求める。これら2つの軸上での座標を基に、ステップ909でコスト最小画素を決定する。ステップ909での処理は、図４のステップ409の処理における水平軸、垂直軸を、一方の軸ともう一方の軸としたのみの違いであり、基本的に同様の処理である。 First, in step 900, decimal pixel precision motion estimation is started. Next, in step 901, the cost calculation of neighboring pixels is performed with respect to one axis (for example, the axis in the horizontal +45 degree direction), and in step 903, the coordinates on the axis are obtained. In step 905, the cost of neighboring pixels is similarly calculated for the other axis (for example, the axis in the direction of vertical +45 degrees), and in step 907, the coordinates on the axis are obtained. Based on the coordinates on these two axes, the minimum cost pixel is determined in step 909. The process in step 909 is basically the same process, except that the horizontal axis and the vertical axis in the process of step 409 in FIG. 4 are changed to one axis and the other axis.

なお、図８のフローチャートでは、一方の軸ともう一方の軸との処理を並列に処理するステップを示したが、これらを直列に処理するステップとしても構わない。 In the flowchart of FIG. 8, the steps of processing the one axis and the other axis in parallel are shown, but these steps may be processed in series.

さらに、図２における小数画素精度動き推定部203においては、さらに図６a、図６b、図6cに示されるような軸ペアのうちから、複数の軸ペアにおいて求めたコスト最小値やコスト最小画素位置を比較して、よりコストの低いほうを選択するようにしても良い。この処理のフローチャートの一例を図９に示す。 Further, in the fractional pixel accuracy motion estimation unit 203 in FIG. 2, the minimum cost value and the minimum cost pixel position obtained in a plurality of axis pairs from among the axis pairs as shown in FIGS. 6 a, 6 b, and 6 c. May be selected so that the lower cost is selected. An example of a flowchart of this process is shown in FIG.

図９の例においては、図６a、図６b、図6cに示される軸ペアのうち、二つの軸ペアにおいて、それぞれフィッティング関数を用いた小数画素精度動き推定を行うこととする。ここで、第1の軸ペアを水平軸と垂直軸とし、第２の軸ペアを水平+45度方向の軸と、垂直+45度方向の軸として説明する。本処理においては、この二つの軸ペアにおけるフィッティング関数を用いた小数画素精度動き推定の結果を比較する。 In the example of FIG. 9, the fractional pixel accuracy motion estimation using the fitting function is performed on each of the two axis pairs out of the axis pairs shown in FIGS. 6a, 6b, and 6c. Here, the first axis pair will be described as a horizontal axis and a vertical axis, and the second axis pair will be described as an axis in the horizontal +45 degree direction and an axis in the vertical +45 degree direction. In this process, the results of sub-pixel precision motion estimation using the fitting function in the two axis pairs are compared.

まず、ステップ1000にて小数画素精度動き推定を開始する。次に、ステップ1001において、第１の軸ペアである水平軸と垂直軸とに対し、フィッティング関数を用いた小数画素精度動き推定を行う。すなわちステップ1001における処理においては、例えば、図４のフローチャートにおけるステップ400〜409までの処理が行われる。また、ステップ1002において、第2の軸ペアである水平+45度方向の軸と垂直+45度方向の軸とに対し、フィッティング関数を用いた小数画素精度動き推定を行う。すなわちステップ1002における処理においては、例えば、図8のフローチャートにおけるステップ900〜909までの処理が行われる。ステップ1003での処理は、フィッティング関数の最小値もしくはその近似値を比較し、コスト値が最小の方の画素位置を選択してもよい。また、複数の軸ペアにおけるフィッティング関数を用いた小数画素精度動き推定により得た複数の最小コスト値及び最小コスト画素位置を用いてさらに演算を行ってあらたな最小コスト値及び最小コスト画素位置を算出しても良い。 First, in step 1000, decimal pixel precision motion estimation is started. Next, in step 1001, sub-pixel accuracy motion estimation using a fitting function is performed on the horizontal axis and the vertical axis that are the first axis pair. That is, in the process in step 1001, for example, the processes in steps 400 to 409 in the flowchart of FIG. 4 are performed. Also, in step 1002, sub-pixel precision motion estimation using a fitting function is performed for the second axis pair, the horizontal +45 degree direction axis and the vertical +45 degree direction axis. That is, in the processing at step 1002, for example, the processing from steps 900 to 909 in the flowchart of FIG. 8 is performed. The processing in step 1003 may compare the minimum value of the fitting function or its approximate value, and select the pixel position with the smallest cost value. In addition, a new minimum cost value and minimum cost pixel position are calculated by performing further calculations using a plurality of minimum cost values and minimum cost pixel positions obtained by motion estimation of decimal pixels using fitting functions in a plurality of axis pairs. You may do it.

図9では予め定められた2つの軸ペアを比較しているが、複数の軸ペアの中から任意の2つ軸の軸ペアを選択し、最もコストが小さくなると推定される点を採用する方法としてもよい。 In FIG. 9, two predetermined axis pairs are compared, but a method of selecting an arbitrary two-axis axis pair from a plurality of axis pairs and adopting the point that the cost is estimated to be the smallest It is good.

本実施例では動画像符号化におけるマッチングを示したが、本発明は動画像符号化に限らず、マッチング処理一般に適用が可能である。 In the present embodiment, matching in moving image coding is shown, but the present invention is not limited to moving image coding, and can be applied to matching processing in general.

マッチングに利用するブロックサイズも本実施例では例として4×4のブロックで説明したがこれに限るものではない。 Although the block size used for matching has been described as a 4 × 4 block as an example in the present embodiment, it is not limited to this.

なお、本実施例は動画像符号化装置として説明したが、画像マッチング装置として用いる場合は、図2の整数画素精度動き推定部201を整数画素精度画像マッチング部、小数画素精度動き推定部203を小数画素画像マッチング部とし、動きベクトルの算出ではなく、画像のマッチング位置を算出すればよい。 Although this embodiment has been described as a moving image encoding device, when used as an image matching device, the integer pixel accuracy motion estimation unit 201 in FIG. 2 is replaced with an integer pixel accuracy image matching unit and a decimal pixel accuracy motion estimation unit 203. The decimal pixel image matching unit may be used to calculate the matching position of the image instead of calculating the motion vector.

以上説明した実施例によれば、動画像符号化の画像マッチングにおいて水平、垂直軸以外の軸においてもフィッティング関数を用いた小数画素精度動き推定をおこなうことにより、より高精度に動きベクトルの推定を行うことが可能となる。 According to the embodiment described above, motion vector estimation can be performed with higher accuracy by performing sub-pixel precision motion estimation using a fitting function in axes other than the horizontal and vertical axes in image matching of video coding. Can be done.

また、複数の軸ペアにおいてフィッティング関数を用いた小数画素精度動き推定を行い、その結果を比較して動きベクトルの推定することにより、少ない演算量でより高精度な動きベクトルの推定を行うことが可能となる。 Also, by performing fractional pixel precision motion estimation using a fitting function in a plurality of axis pairs and comparing the results to estimate the motion vector, it is possible to estimate the motion vector with high accuracy with a small amount of computation. It becomes possible.

すなわち、H.264をはじめとする動画像符号化で広く利用されている処理が、補間画像を作成しマッチングを行うものであり、補間処理及びマッチング処理が非常に大きくなってしまうのに対し、本実施例における動きベクトルの探索処理は、整数画素精度で求めたコスト値にフィッティング関数を適用して小数画素精度の動きベクトルを推定し、高精度で高速な小数画素精度の動きベクトル探索を実現するものである。 In other words, the processing widely used in video coding including H.264 is to create an interpolated image and perform matching, while the interpolation processing and matching processing become very large, The motion vector search processing in the present embodiment applies a fitting function to the cost value obtained with integer pixel accuracy to estimate the motion vector with decimal pixel accuracy, and realizes a high-precision and high-speed motion vector search with decimal pixel accuracy. To do.

また、以上説明した実施例はいずれを組み合わせて用いても、本発明の１つの実施の形態となりえる。 Further, any of the embodiments described above can be used in combination to form an embodiment of the present invention.

以上説明した本発明のマッチング方法によれば、より高精度な画像マッチングをより少ない演算量でより行うことが可能となる。 According to the matching method of the present invention described above, more accurate image matching can be performed with a smaller amount of calculation.

本発明の一実施例に係る動画像符号化装置の一例を示す図The figure which shows an example of the moving image encoder which concerns on one Example of this invention. 本発明の一実施例に係る動き推定部の構成例を示す図The figure which shows the structural example of the motion estimation part which concerns on one Example of this invention. 本発明の一実施例に係るコスト最小画素推定方法の一例の説明図Explanatory drawing of an example of the cost minimum pixel estimation method which concerns on one Example of this invention. 本発明の一実施例に係るコスト最小画素推定方法のフローチャートの一例An example of a flowchart of a minimum cost pixel estimation method according to an embodiment of the present invention 本発明が適用される画像記録再生装置の一例を示す図The figure which shows an example of the image recording / reproducing apparatus to which this invention is applied 本発明の一実施例に係る軸ペアとコスト分布の一例を示す図The figure which shows an example of the axis pair and cost distribution which concern on one Example of this invention 本発明の一実施例に係る軸ペアとコスト分布の一例を示す図The figure which shows an example of the axis pair and cost distribution which concern on one Example of this invention 本発明の一実施例に係る軸ペアとコスト分布の一例を示す図The figure which shows an example of the axis pair and cost distribution which concern on one Example of this invention 本発明の一実施例に係るコスト最小画素推定方法の一例の説明図Explanatory drawing of an example of the cost minimum pixel estimation method which concerns on one Example of this invention. 本発明の一実施例に係るコスト最小画素推定方法のフローチャートの一例An example of a flowchart of a minimum cost pixel estimation method according to an embodiment of the present invention 本発明の一実施例に係るコスト最小画素推定方法のフローチャートの一例An example of a flowchart of a minimum cost pixel estimation method according to an embodiment of the present invention

Explanation of symbols

100…入力部、101…入力画像、102…減算器、103…変換部、104…量子化部、105…予測誤差及び予測誤差符号量、106…逆量子化部、107…逆変換部、108…加算器109…フレームメモリ、110…復号画像、111…フィッティング関数を利用した動き推定部、112…動き推定結果情報、113…動き補償部、114…予測画像、116…多重化部、201…整数画素精度動き推定部、203…小数画素精度動き推定部
DESCRIPTION OF SYMBOLS 100 ... Input part, 101 ... Input image, 102 ... Subtractor, 103 ... Conversion part, 104 ... Quantization part, 105 ... Prediction error and prediction error code amount, 106 ... Dequantization part, 107 ... Inverse conversion part, 108 ... adder 109 ... frame memory, 110 ... decoded image, 111 ... motion estimation unit using fitting function, 112 ... motion estimation result information, 113 ... motion compensation unit, 114 ... predicted image, 116 ... multiplexing unit, 201 ... Integer pixel accuracy motion estimator, 203 ... Decimal pixel accuracy motion estimator

Claims

An integer pixel accuracy matching unit that performs image matching with integer pixel accuracy;
An image matching apparatus comprising: a decimal pixel accuracy matching unit that performs image matching with decimal pixel accuracy.

The decimal pixel accuracy matching unit calculates similarity of a plurality of image matching in a predetermined axial direction, and performs image matching with decimal pixel accuracy by using a fitting function calculated from the similarity. Image matching device.

The decimal pixel accuracy matching unit calculates a plurality of image matching similarities in a predetermined axial direction, and uses a fitting function calculated from the similarities to obtain a plurality of image matching positions and similarities at the positions. An image matching apparatus characterized in that a single image matching position is obtained by calculating and comparing the similarity at the plurality of image matching positions.

Performing image matching with integer pixel accuracy;
And a step of performing image matching with decimal pixel accuracy.

The step of performing image matching with the decimal pixel accuracy is as follows.
Obtaining a plurality of image matching similarities in a predetermined axial direction;
And a step of performing image matching with decimal pixel accuracy by using a fitting function obtained from the similarity.

The step of performing image matching with the decimal pixel accuracy is as follows.
Obtaining a plurality of image matching similarities in a predetermined axial direction;
Calculating a plurality of image matching positions and similarities at the positions by using a fitting function obtained from the similarities;
An image matching method comprising: obtaining a single image matching position by comparing the plurality of image matching positions and the similarity at the positions.

An image matching apparatus used for encoding a moving image includes a block matching processing unit that performs a block matching process for each block obtained by dividing the moving image into a plurality of blocks, and uses a fitting function from a cost value of integer pixel accuracy. An image matching apparatus that performs motion estimation with decimal pixel accuracy.

8. The image matching apparatus according to claim 7, wherein a further detailed integer pixel precision or decimal precision search point can be obtained from a search result with integer pixel precision obtained by thinning points. .

The image matching apparatus according to claim 7, wherein the feature detection unit performs fitting using samples in an oblique direction as well as in the horizontal axis and vertical axis directions, and uses both results or has a lower cost. A moving picture coding apparatus characterized by obtaining a decimal precision pixel.