JP7460384B2

JP7460384B2 - Prediction device, encoding device, decoding device, and program

Info

Publication number: JP7460384B2
Application number: JP2020020955A
Authority: JP
Inventors: 俊輔岩村; 敦郎市ヶ谷; 慎平根本
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2020-02-10
Filing date: 2020-02-10
Publication date: 2024-04-02
Anticipated expiration: 2040-02-10
Also published as: JP7749052B2; JP2024069638A; JP2021129148A

Description

本発明は、予測装置、符号化装置、復号装置、及びプログラムに関する。 The present invention relates to a prediction device, an encoding device, a decoding device, and a program.

動画像（映像）の符号化方式においては、符号化装置は、原画像をブロックに分割し、フレーム間の時間的相関を利用したインター予測とフレーム内の空間的相関を利用したイントラ予測とを切り替えながら符号化対象ブロックを予測し、予測されたブロックと符号化対象ブロックとの差を表す予測残差に対して変換処理、量子化処理、及びエントロピー符号化処理を施し、ビットストリームである符号化データを出力する。 In a video encoding system, the encoding device divides the original image into blocks and performs inter prediction using temporal correlation between frames and intra prediction using spatial correlation within frames. The block to be encoded is predicted while switching, and the prediction residual representing the difference between the predicted block and the block to be encoded is subjected to transformation processing, quantization processing, and entropy encoding processing, and a code that is a bit stream is generated. Output converted data.

次世代の符号化方式であるＶＶＣ（ＶｅｒｓａｔｉｌｅＶｉｄｅｏＣｏｄｉｎｇ）では、インター予測のモードの１つとして三角形分割予測（ＴＰＭ：ＴｒｉａｎｇｌｅＰａｒｔｉｔｉｏｎｉｎｇＭｏｄｅ）が採用されている。ＴＰＭは、符号化対象ブロックを対角線で分割した領域のそれぞれに動きべクトルを割り当て、動き補償予測を行うものである。 Versatile Video Coding (VVC), a next-generation coding method, employs triangular partitioning prediction (TPM: Triangle Partitioning Mode) as one of the inter-prediction modes. TPM divides the coding target block diagonally, assigning motion vectors to each area, and performs motion compensation prediction.

ＴＰＭでは、領域境界付近の予測画像については、領域ごとの予測画像の位置に応じた重み付け合成（加重平均）より混合する混合処理（ｂｌｅｎｄｉｎｇと呼ばれる）を行うことで、領域ごとの動き予測補償による不連続性を抑制している。スクリーンコンテンツなどのようにフラットな領域及びエッジが多く含まれているシーケンスにおいては、ｂｌｅｎｄｉｎｇによってエッジのぼやけを引き起こす場合がある。このため、非特許文献１においては、ＴＰＭによる予測において、ｂｌｅｎｄｉｎｇを行わなわずに予測画像を合成するＮｏｎ－ｂｌｅｎｄｉｎｇモードを導入している。 In TPM, for predicted images near region boundaries, a mixing process (called blending) is performed in which predicted images for each region are mixed using weighted synthesis (weighted average) according to their position, thereby suppressing discontinuity caused by motion prediction compensation for each region. In sequences that contain many flat regions and edges, such as screen content, blending can cause edge blurring. For this reason, Non-Patent Document 1 introduces a non-blending mode in TPM prediction that synthesizes predicted images without blending.

ＪＶＥＴ－Ｏ１１７２,“Ｎｏｎ－ＣＥ４／８：ＯｎｄｉｓａｂｌｉｎｇｂｌｅｎｄｉｎｇｐｒｏｃｅｓｓｉｎＴＰＭ”JVET-O1172, “Non-CE4/8: On disabling blending process in TPM”

非特許文献１のようにＴＰＭにおいて複数のｂｌｅｎｄｉｎｇモード（複数の合成方式）を導入する場合、どのモードを用いるかを示すフラグを符号化側から復号側にシグナリングする必要がある。このため、シグナリングするフラグ量が増加し、符号化効率の低下を招くという問題がある。 When introducing multiple blending modes (multiple combining methods) in TPM as in Non-Patent Document 1, it is necessary to signal a flag indicating which mode is to be used from the encoding side to the decoding side. Therefore, there is a problem in that the amount of flags to be signaled increases, leading to a decrease in encoding efficiency.

そこで、本発明は、複数の合成方式を分割予測に導入する場合であっても、シグナリングするフラグ量の増加を抑制できる予測装置、符号化装置、復号装置、及びプログラムを提供することを目的とする。 SUMMARY OF THE INVENTION Therefore, an object of the present invention is to provide a prediction device, an encoding device, a decoding device, and a program that can suppress an increase in the amount of flags to be signaled even when a plurality of combining methods are introduced into segmented prediction. do.

第１の態様に係る予測装置は、画像を分割したブロック単位で予測を行う予測装置であって、予測対象ブロックを直線により分割して少なくとも２つの予測領域を出力する分割部と、前記２つの予測領域のそれぞれについて、動きベクトルを用いて領域予測画像を生成する生成部と、前記生成部が生成した２つの領域予測画像間の類似度を示す類似度情報を算出する算出部と、前記算出部が算出した前記類似度情報に基づいて、前記２つの領域予測画像の合成に用いる合成方式を複数の方式の中から決定する決定部と、前記決定部が決定した前記合成方式を用いて前記２つの領域予測画像を合成し、前記予測対象ブロックに対応する予測ブロックを出力する合成部とを備えることを要旨とする。 A prediction device according to a first aspect is a prediction device that performs prediction in units of blocks obtained by dividing an image, and includes a dividing unit that divides a prediction target block along a straight line and outputs at least two prediction regions; For each prediction region, a generation unit that generates a region prediction image using a motion vector, a calculation unit that calculates similarity information indicating a degree of similarity between two region prediction images generated by the generation unit, and the calculation unit. a determining unit that determines a combining method to be used for combining the two region predicted images from among a plurality of methods based on the similarity information calculated by the determining unit; The gist of the present invention is to include a combining unit that combines two region predicted images and outputs a predicted block corresponding to the prediction target block.

第２の態様に係る符号化装置は、第１の態様に係る予測装置を備えることを要旨とする。 The encoding device according to the second aspect is characterized in that it includes the prediction device according to the first aspect.

第３の態様に係る復号装置は、第１の態様に係る予測装置を備えることを要旨とする。 The decoding device according to the third aspect is characterized in that it includes the prediction device according to the first aspect.

第４の態様に係るプログラムは、コンピュータを第１の態様に係る予測装置として機能させることを要旨とする。 The gist of the program according to the fourth aspect is to cause a computer to function as the prediction device according to the first aspect.

本発明によれば、複数の合成方式を分割予測に導入する場合であっても、シグナリングするフラグ量の増加を抑制できる予測装置、符号化装置、復号装置、及びプログラムを提供できる。 According to the present invention, it is possible to provide a prediction device, an encoding device, a decoding device, and a program that can suppress an increase in the amount of flags to be signaled even when a plurality of combining methods are introduced into divided prediction.

実施形態に係る符号化装置の構成を示す図である。FIG. 1 is a diagram illustrating a configuration of an encoding device according to an embodiment. 実施形態に係る符号化装置の三角形分割予測部の構成を示す図である。FIG. 2 is a diagram illustrating a configuration of a triangulation prediction unit of an encoding device according to an embodiment. 実施形態に係る分割部の動作を示す図である。11A and 11B are diagrams illustrating an operation of a division unit according to the embodiment. 実施形態に係る算出部及び決定部の動作を示す図である。6A and 6B are diagrams illustrating operations of a calculation unit and a determination unit according to the embodiment. 実施形態に係る合成部の動作を示す図である。6A and 6B are diagrams illustrating an operation of a synthesis unit according to the embodiment. 実施形態に係る復号装置の構成を示す図である。FIG. 1 is a diagram showing the configuration of a decoding device according to an embodiment. 実施形態に係る復号装置の三角形分割予測部の構成を示す図である。FIG. 2 is a diagram showing a configuration of a triangulation prediction unit of a decoding device according to an embodiment. 実施形態に係る三角形分割予測部の動作フローを示す図である。FIG. 11 is a diagram showing an operation flow of a triangulation prediction unit according to the embodiment.

図面を参照して、実施形態に係る符号化装置及び復号装置について説明する。実施形態に係る符号化装置及び復号装置は、ＭＰＥＧに代表される動画像の符号化及び復号をそれぞれ行う。以下の図面の記載において、同一又は類似の部分には同一又は類似の符号を付している。 An encoding device and a decoding device according to an embodiment will be described with reference to the drawings. The encoding device and the decoding device according to the embodiment encode and decode moving images represented by MPEG, respectively. In the description of the drawings below, the same or similar parts are designated by the same or similar symbols.

＜符号化装置の構成＞
まず、本実施形態に係る符号化装置の構成について説明する。図１は、本実施形態に係る符号化装置１の構成を示す図である。符号化装置１は、画像を分割して得たブロック単位で符号化を行う装置である。 <Configuration of encoding device>
First, the configuration of the encoding device according to this embodiment will be explained. FIG. 1 is a diagram showing the configuration of an encoding device 1 according to this embodiment. The encoding device 1 is a device that performs encoding in units of blocks obtained by dividing an image.

図１に示すように、符号化装置１は、ブロック分割部１００と、減算部１１０と、変換・量子化部１２０と、エントロピー符号化部１３０と、逆量子化・逆変換部１４０と、合成部１５０と、メモリ１６０と、予測部１７０とを有する。 As shown in FIG. 1, the encoding device 1 includes a block division unit 100, a subtraction unit 110, a transformation and quantization unit 120, an entropy encoding unit 130, an inverse quantization and inverse transformation unit 140, a synthesis unit 150, a memory 160, and a prediction unit 170.

ブロック分割部１００は、動画像を構成するフレーム（或いはピクチャ）単位の入力画像を複数の画像ブロックに分割し、分割により得た画像ブロックを減算部１１０に出力する。画像ブロックのサイズは、例えば３２×３２画素、１６×１６画素、８×８画素、又は４×４画素等である。画像ブロックの形状は正方形に限らず矩形（非正方形）であってもよい。画像ブロックは、符号化装置１が符号化を行う単位（符号化対象ブロック）であり、且つ復号装置が復号を行う単位（復号対象ブロック）である。このような画像ブロックはＣＵ（ＣｏｄｉｎｇＵｎｉｔ）と呼ばれることがある。 The block division unit 100 divides an input image of a frame (or picture) constituting a moving image into a plurality of image blocks, and outputs the image blocks obtained by division to the subtraction unit 110. The size of the image block is, for example, 32 x 32 pixels, 16 x 16 pixels, 8 x 8 pixels, or 4 x 4 pixels. The shape of the image block is not limited to a square, but may be a rectangle (non-square). The image block is the unit (block to be coded) for coding by the coding device 1, and is the unit (block to be coded) for decoding by the decoding device. Such an image block is sometimes called a CU (Coding Unit).

ブロック分割部１００は、輝度信号と色差信号とに対してブロック分割を行う。以下において、ブロック分割の形状が輝度信号と色差信号とで同じである場合について主として説明するが、輝度信号と色差信号とで分割を独立に制御可能であってもよい。輝度ブロック及び色差ブロックを特に区別しないときは単に符号化対象ブロックと呼ぶ。 The block division unit 100 performs block division on the luminance signal and the color difference signal. In the following, the case where the shape of the block division is the same for the luminance signal and the color difference signal will be mainly described, but it may be possible to control the division independently for the luminance signal and the color difference signal. When there is no particular distinction between the luminance block and the color difference block, they are simply referred to as the block to be coded.

減算部１１０は、ブロック分割部１００が出力する符号化対象ブロックと、符号化対象ブロックを予測部１７０が予測して得た予測ブロックとの差分（誤差）を表す予測残差を算出する。減算部１１０は、ブロックの各画素値から予測ブロックの各画素値を減算することにより予測残差を算出し、算出した予測残差を変換・量子化部１２０に出力する。 The subtraction unit 110 calculates a prediction residual that represents the difference (error) between the block to be coded output by the block division unit 100 and a predicted block obtained by predicting the block to be coded by the prediction unit 170. The subtraction unit 110 calculates the prediction residual by subtracting each pixel value of the predicted block from each pixel value of the block, and outputs the calculated prediction residual to the transformation/quantization unit 120.

変換・量子化部１２０は、ブロック単位で変換処理及び量子化処理を行う。変換・量子化部１２０は、変換部１２１と、量子化部１２２とを有する。 The conversion/quantization unit 120 performs conversion processing and quantization processing on a block-by-block basis. The conversion/quantization unit 120 includes a conversion unit 121 and a quantization unit 122.

変換部１２１は、減算部１１０が出力する予測残差に対して変換処理を行って周波数成分ごとの変換係数を算出し、算出した変換係数を量子化部１２２に出力する。変換処理（変換）とは、画素領域の信号を周波数領域の信号に変換する処理をいい、例えば、離散コサイン変換（ＤＣＴ）や離散サイン変換（ＤＳＴ）、カルーネンレーブ変換（ＫＬＴ）、及びそれらを整数化した変換等をいう。 The transform unit 121 performs a transform process on the prediction residual output by the subtraction unit 110 to calculate a transform coefficient for each frequency component, and outputs the calculated transform coefficient to the quantization unit 122. The transform process (transformation) refers to a process of converting a pixel domain signal into a frequency domain signal, such as discrete cosine transform (DCT), discrete sine transform (DST), Karhunen-Loeb transform (KLT), and transforms that convert these to integers.

量子化部１２２は、変換部１２１が出力する変換係数を量子化パラメータ（Ｑｐ）及び量子化行列を用いて量子化し、量子化した変換係数をエントロピー符号化部１３０及び逆量子化・逆変換部１４０に出力する。なお、量子化パラメータ（Ｑｐ）は、ブロック内の各変換係数に対して共通して適用されるパラメータであって、量子化の粗さを定めるパラメータである。量子化行列は、各変換係数を量子化する際の量子化値を要素として有する行列である。 The quantization unit 122 quantizes the transform coefficients output by the transform unit 121 using a quantization parameter (Qp) and a quantization matrix, and outputs the quantized transform coefficients to the entropy coding unit 130 and the inverse quantization and inverse transform unit 140. Note that the quantization parameter (Qp) is a parameter that is commonly applied to each transform coefficient in a block and determines the coarseness of quantization. The quantization matrix is a matrix whose elements are the quantization values used when quantizing each transform coefficient.

エントロピー符号化部１３０は、量子化部１２２が出力する変換係数に対してエントロピー符号化を行い、データ圧縮を行って符号化ストリーム（ビットストリーム）を生成し、符号化ストリームを符号化装置１の外部に出力する。エントロピー符号化には、ハフマン符号やＣＡＢＡＣ（Ｃｏｎｔｅｘｔ－ｂａｓｅｄＡｄａｐｔｉｖｅＢｉｎａｒｙＡｒｉｔｈｍｅｔｉｃＣｏｄｉｎｇ；コンテキスト適応型２値算術符号）等を用いることができる。なお、エントロピー符号化部１３０は、ブロック分割部１００から各符号化対象ブロックのサイズ・形状等の情報を取得し、予測部１７０から予測に関する情報（例えば、予測モードや動きベクトルの情報）を取得し、これらの情報の符号化も行う。 The entropy encoding unit 130 performs entropy encoding on the transform coefficients output by the quantization unit 122, performs data compression to generate an encoded stream (bit stream), and transmits the encoded stream to the encoding device 1. Output to outside. For entropy encoding, a Huffman code, CABAC (Context-based Adaptive Binary Arithmetic Coding), or the like can be used. Note that the entropy encoding unit 130 acquires information such as the size and shape of each block to be encoded from the block division unit 100, and acquires information regarding prediction (for example, information on prediction mode and motion vector) from the prediction unit 170. It also encodes this information.

逆量子化・逆変換部１４０は、ブロック単位で逆量子化処理及び逆変換処理を行う。逆量子化・逆変換部１４０は、逆量子化部１４１と、逆変換部１４２とを有する。 The inverse quantization and inverse transform unit 140 performs inverse quantization and inverse transform processing on a block-by-block basis. The inverse quantization and inverse transform unit 140 includes an inverse quantization unit 141 and an inverse transform unit 142.

逆量子化部１４１は、量子化部１２２が行う量子化処理に対応する逆量子化処理を行う。具体的には、逆量子化部１４１は、量子化部１２２が出力する変換係数を、量子化パラメータ（Ｑｐ）及び量子化行列を用いて逆量子化することにより変換係数を復元し、復元した変換係数を逆変換部１４２に出力する。 The dequantization unit 141 performs dequantization processing corresponding to the quantization processing performed by the quantization unit 122. Specifically, the dequantization unit 141 restores the transform coefficients by dequantizing the transform coefficients output by the quantization unit 122 using a quantization parameter (Qp) and a quantization matrix. The transform coefficients are output to the inverse transform section 142.

逆変換部１４２は、変換部１２１が行う変換処理に対応する逆変換処理を行う。例えば、変換部１２１がＤＣＴを行った場合には、逆変換部１４２は逆ＤＣＴを行う。逆変換部１４２は、逆量子化部１４１が出力する変換係数に対して逆変換処理を行って予測残差を復元し、復元した予測残差である復元予測残差を合成部１５０に出力する。 The inverse transform unit 142 performs an inverse transform process corresponding to the transform process performed by the transform unit 121. For example, when the transformer 121 performs DCT, the inverse transformer 142 performs inverse DCT. The inverse transform unit 142 performs inverse transform processing on the transform coefficients output by the inverse quantization unit 141 to restore the prediction residual, and outputs the restored prediction residual, which is the restored prediction residual, to the synthesis unit 150. .

合成部１５０は、逆変換部１４２が出力する復元予測残差を、予測部１７０が出力する予測ブロックと画素単位で合成する。合成部１５０は、復元予測残差の各画素値と予測ブロックの各画素値を加算して符号化対象ブロックを復元（復号）し、復元したブロック単位の復号画像（復元ブロック）をメモリ１６０に出力する。 The synthesis unit 150 synthesizes the reconstructed prediction residual output by the inverse transform unit 142 with the predicted block output by the prediction unit 170 on a pixel-by-pixel basis. The synthesis unit 150 adds each pixel value of the reconstructed prediction residual to each pixel value of the predicted block to reconstruct (decode) the block to be coded, and outputs the reconstructed decoded image (reconstructed block) on a block-by-block basis to the memory 160.

メモリ１６０は、合成部１５０が出力する復元ブロックをフレーム単位で復号画像として蓄積する。メモリ１６０は、記憶している復号画像を予測部１７０に出力する。なお、合成部１５０とメモリ１６０との間にループフィルタが介在してもよい。 The memory 160 accumulates the reconstructed blocks output by the synthesis unit 150 as decoded images on a frame-by-frame basis. The memory 160 outputs the stored decoded images to the prediction unit 170. Note that a loop filter may be interposed between the synthesis unit 150 and the memory 160.

予測部１７０は、ブロック単位で予測処理を行うことにより、符号化対象ブロックに対応する予測ブロックを生成し、生成した予測ブロックを減算部１１０及び合成部１５０に出力する。予測処理が行われる符号化対象ブロックを予測対象ブロックと呼ぶ。 The prediction unit 170 performs prediction processing on a block-by-block basis to generate a prediction block corresponding to the block to be coded, and outputs the generated prediction block to the subtraction unit 110 and the synthesis unit 150. The block to be coded on which prediction processing is performed is called the block to be predicted.

予測部１７０は、イントラ予測部１７１と、インター予測部１７２と、切替部１７３とを有する。本実施形態において、インター予測部１７２は、ブロック単位で予測処理を行う予測装置に相当する。 The prediction unit 170 has an intra prediction unit 171, an inter prediction unit 172, and a switching unit 173. In this embodiment, the inter prediction unit 172 corresponds to a prediction device that performs prediction processing on a block-by-block basis.

イントラ予測部１７１は、複数のイントラ予測モードの中から、予測対象ブロックに適用する最適なイントラ予測モードを選択し、選択したイントラ予測モードを用いて予測対象ブロックを予測する。イントラ予測部１７１は、メモリ１６０に記憶された復号画像のうち、予測対象ブロックに隣接する復号済み画素値を参照してイントラ予測ブロックを生成し、生成したイントラ予測ブロックを切替部１７３に出力する。また、イントラ予測部１７１は、選択したイントラ予測モードに関する情報をエントロピー符号化部１３０に出力する。 The intra prediction unit 171 selects an optimal intra prediction mode to be applied to the prediction target block from among multiple intra prediction modes, and predicts the prediction target block using the selected intra prediction mode. The intra prediction unit 171 generates an intra prediction block by referring to decoded pixel values adjacent to the prediction target block in the decoded image stored in the memory 160, and outputs the generated intra prediction block to the switching unit 173. The intra prediction unit 171 also outputs information related to the selected intra prediction mode to the entropy coding unit 130.

インター予測部１７２は、メモリ１６０に記憶された復号画像を参照画像として用いて、ブロックマッチング等の手法により動きベクトルを算出し、予測対象ブロックを予測してインター予測ブロックを生成し、生成したインター予測ブロックを切替部１７３に出力する。インター予測部１７２は、複数の参照画像を用いるインター予測（典型的には、双予測）や、１つの参照画像を用いるインター予測（片方向予測）の中から最適なインター予測方法を選択し、選択したインター予測方法を用いてインター予測を行う。インター予測部１７２は、インター予測に関する情報（動きベクトル情報等）をエントロピー符号化部１３０に出力する。 The inter prediction unit 172 uses the decoded image stored in the memory 160 as a reference image, calculates a motion vector by a method such as block matching, predicts the prediction target block to generate an inter prediction block, and uses the generated inter prediction block. The predicted block is output to the switching unit 173. The inter prediction unit 172 selects the optimal inter prediction method from among inter prediction using multiple reference images (typically, bi-prediction) and inter prediction using one reference image (unidirectional prediction), Perform inter prediction using the selected inter prediction method. The inter prediction unit 172 outputs information related to inter prediction (motion vector information, etc.) to the entropy encoding unit 130.

本実施形態において、インター予測部１７２は、三角形分割予測（ＴＰＭ）を行う三角形分割予測部１７２ａを有する（図２参照）。三角形分割予測部１７２ａは、符号化対象ブロックを対角線で分割した領域のそれぞれに動きべクトルを割り当て、動き補償予測を行うものである。インター予測部１７２は、予測対象ブロックに三角形分割予測を適用する場合、この予測対象ブロックに三角形分割予測を適用することを示す三角形分割予測適用フラグをエントロピー符号化部１３０に出力し、エントロピー符号化部１３０から復号側に三角形分割予測適用フラグを伝送する。三角形分割予測部１７２ａの詳細については後述する。 In this embodiment, the inter prediction unit 172 includes a triangulation prediction unit 172a that performs triangulation prediction (TPM) (see FIG. 2). The triangulation prediction unit 172a assigns a motion vector to each region obtained by dividing the current block to be encoded diagonally, and performs motion compensation prediction. When applying triangulation prediction to the prediction target block, the inter prediction unit 172 outputs a triangulation prediction application flag indicating that triangulation prediction is applied to the prediction target block to the entropy encoding unit 130, and performs entropy encoding. The triangulation prediction application flag is transmitted from the unit 130 to the decoding side. Details of the triangulation prediction unit 172a will be described later.

切替部１７３は、インター予測部１７２が出力するインター予測ブロックとイントラ予測部１７１が出力するイントラ予測ブロックとを切り替えて、いずれかの予測ブロックを減算部１１０及び合成部１５０に出力する。 The switching unit 173 switches between the inter prediction block output by the inter prediction unit 172 and the intra prediction block output by the intra prediction unit 171, and outputs one of the prediction blocks to the subtraction unit 110 and the synthesis unit 150.

次に、本実施形態に係る三角形分割予測部１７２ａについて説明する。図２は、本実施形態に係る三角形分割予測部１７２ａの構成を示す図である。 Next, the triangulation prediction unit 172a according to this embodiment will be explained. FIG. 2 is a diagram showing the configuration of the triangulation prediction unit 172a according to this embodiment.

図２に示すように、三角形分割予測部１７２ａは、分割部１７２１と、生成部１７２２と、算出部１７２３と、決定部１７２４と、合成部１７２５とを有する。 As shown in FIG. 2, the triangulation prediction unit 172a includes a division unit 1721, a generation unit 1722, a calculation unit 1723, a determination unit 1724, and a synthesis unit 1725.

分割部１７２１は、予測対象ブロックを対角線で分割し、２つの三角形領域を生成部１７２２に出力する。三角形領域は、予測対象ブロックを直線により分割した予測領域の一例である。分割方法には、図３（ａ）に示す方法と図３（ｂ）に示す方法との２種類がある。図３（ａ）に示す方法では、分割部１７２１は、符号化対象ブロックの左上の頂天位置及び右下の頂天位置を通る分割線で分割する。図３（ｂ）に示す方法では、分割部１７２１は、符号化対象ブロックの右上の頂天位置及び左下の頂天位置を通る分割線で分割する。分割部１７２１は、分割の方向が図３（ａ）に示す対角線方向及び図３（ｂ）に示す対角線方向のいずれであるかを示す分割方向フラグをエントロピー符号化部１３０に出力し、エントロピー符号化部１３０から復号側に分割方向フラグを伝送する。 The dividing unit 1721 divides the prediction target block diagonally, and outputs two triangular areas to the generating unit 1722. The triangular region is an example of a prediction region obtained by dividing the prediction target block by straight lines. There are two division methods: the method shown in FIG. 3(a) and the method shown in FIG. 3(b). In the method shown in FIG. 3A, the dividing unit 1721 divides the block to be encoded along a dividing line passing through the upper left apex position and the lower right apex position. In the method shown in FIG. 3B, the dividing unit 1721 divides the block to be encoded along a dividing line passing through the upper right zenith position and the lower left zenith position. The dividing unit 1721 outputs a dividing direction flag indicating whether the direction of division is the diagonal direction shown in FIG. 3(a) or the diagonal direction shown in FIG. 3(b) to the entropy encoding unit 130, and generates an entropy code. The division direction flag is transmitted from the encoding unit 130 to the decoding side.

生成部１７２２は、２つの三角形領域のそれぞれについて動きベクトルを用いて領域予測画像を生成し、２つの領域予測画像を算出部１７２３及び合成部１７２５に出力する。具体的には、生成部１７２２は、予測対象ブロックを分割した各領域に動きべクトルを割り当て、割り当てた動きべクトルを用いて各領域の予測画像を生成する。ここで、生成部１７２２は、動きベクトルの参照元とする複数の候補を優先度順に並べて、上位一定数の動きベクトルの中から三角形領域ごとに１つの動きベクトルを選択する。生成部１７２２は、選択した動きベクトルを示す動きベクトル情報をエントロピー符号化部１３０に出力し、エントロピー符号化部１３０から復号側に動きベクトル情報を伝送する。 The generation unit 1722 generates a region predicted image using the motion vector for each of the two triangular regions, and outputs the two region prediction images to the calculation unit 1723 and the synthesis unit 1725. Specifically, the generation unit 1722 assigns a motion vector to each region into which the prediction target block is divided, and generates a predicted image for each region using the assigned motion vector. Here, the generation unit 1722 arranges a plurality of motion vector reference sources in order of priority, and selects one motion vector for each triangular region from among the fixed number of high-ranking motion vectors. The generation unit 1722 outputs motion vector information indicating the selected motion vector to the entropy encoding unit 130, and transmits the motion vector information from the entropy encoding unit 130 to the decoding side.

算出部１７２３は、生成部１７２２が生成した２つの領域予測画像間の類似度を示す類似度情報を算出し、算出した類似度情報を決定部１７２４に出力する。例えば、算出部１７２３は、図４（ａ）に示すように、領域予測画像１における領域予測画像２との境界領域Ｒ１と、領域予測画像２における領域予測画像１との境界領域Ｒ２との類似度を示す類似度情報を算出する。類似度情報は、類似度を示す情報であればどのような情報であってもよいが、例えば、差の絶対値の総和（ＳＡＤ：ＳｕｍｏｆＡｂｕｓｏｌｕｔｅＤｉｆｆｅｒｅｎｃｅ）を類似度情報として用いることができる。 The calculation unit 1723 calculates similarity information indicating the similarity between the two region predicted images generated by the generation unit 1722, and outputs the calculated similarity information to the determination unit 1724. For example, as shown in FIG. 4A, the calculation unit 1723 calculates the similarity between the boundary region R1 between the region predicted image 1 and the region predicted image 2 and the boundary region R2 between the region predicted image 1 and the region predicted image 2. Similarity information indicating the degree of similarity is calculated. The similarity information may be any information as long as it indicates the degree of similarity; for example, the sum of absolute values of differences (SAD) can be used as the similarity information.

算出部１７２３は、境界領域Ｒ１内の画素値（複数）と、対応する境界領域Ｒ２内の画素値（複数）との間のＳＡＤを類似度情報として算出してもよい。ＳＡＤが小さいほど類似度が高いことを示し、ＳＡＤが大きいほど類似度が低いことを示す。 The calculation unit 1723 may calculate the SAD between the pixel values (plurality) in the boundary region R1 and the pixel values (plurality) in the corresponding boundary region R2 as similarity information. A smaller SAD indicates a higher degree of similarity, and a larger SAD indicates a lower degree of similarity.

類似度情報を算出することは、２つの領域予測画像に対応する２つの参照画像間の類似度を算出することであってもよい。参照画像間の類似度は、一方の参照画像内の画素値（複数）と、対応する他方の参照画像内の画素値（複数）との間のＳＡＤとして算出されてもよい。 Calculating the similarity information may be calculating the similarity between two reference images corresponding to the two region predicted images. The degree of similarity between reference images may be calculated as the SAD between pixel values in one reference image and corresponding pixel values in the other reference image.

決定部１７２４は、算出部１７２３が算出した類似度情報に基づいて、２つの領域予測画像の合成に用いる合成方式を複数の方式の中から決定し、決定した合成方式を示す情報を合成部１７２５に出力する。例えば、複数の方式は、２つの領域予測画像間の境界領域を、画素位置ごとの重み係数を用いた重み付け合成により混合する第１方式（以下、「Ｂｌｅｎｄｉｎｇ方式」と呼ぶ）を含む。決定部１７２４は、算出部１７２３が算出した類似度情報に基づいて、２つの領域予測画像の合成に用いる合成方式としてＢｌｅｎｄｉｎｇ方式を用いるか否かを決定する。Ｂｌｅｎｄｉｎｇ方式によれば、領域ごとの動き予測補償による不連続性を抑制できる。 Based on the similarity information calculated by the calculation unit 1723, the determination unit 1724 determines a combination method to be used for combining the two region predicted images from among a plurality of methods, and sends information indicating the determined combination method to the combination unit 1725. Output to. For example, the plurality of methods includes a first method (hereinafter referred to as a "Blending method") that blends a boundary region between two regionally predicted images by weighted synthesis using a weighting coefficient for each pixel position. The determining unit 1724 determines, based on the similarity information calculated by the calculating unit 1723, whether to use the blending method as the combining method used to combine the two region predicted images. According to the blending method, discontinuity caused by motion prediction compensation for each region can be suppressed.

本実施形態において、複数の方式は、境界領域を重み付け合成により混合しない第２方式（以下、「Ｎｏｎ－ｂｌｅｎｄｉｎｇ方式」と呼ぶ）をさらに含む。決定部１７２４は、算出部１７２３が算出した類似度情報に基づいて、２つの領域予測画像の合成に用いる合成方式としてＢｌｅｎｄｉｎｇ方式及びＮｏｎ－ｂｌｅｎｄｉｎｇ方式のいずれを用いるかを決定する。Ｎｏｎ－ｂｌｅｎｄｉｎｇ方式によれば、フラットな領域及びエッジが多く含まれているシーケンスについて、Ｂｌｅｎｄｉｎｇ方式によるエッジのぼやけを回避できる。 In this embodiment, the plurality of methods further includes a second method (hereinafter referred to as a "Non-blending method") in which boundary regions are not mixed by weighted synthesis. Based on the similarity information calculated by the calculating unit 1723, the determining unit 1724 determines which of the blending method and the non-blending method to use as the combining method for combining the two region predicted images. According to the non-blending method, blurring of edges caused by the blending method can be avoided for sequences that include many flat areas and edges.

このようなＢｌｅｎｄｉｎｇ方式及びＮｏｎ－ｂｌｅｎｄｉｎｇ方式の切り替えを可能とする場合、Ｂｌｅｎｄｉｎｇ方式及びＮｏｎ－ｂｌｅｎｄｉｎｇ方式のいずれを適用するかを示すフラグを復号側に伝送する必要があり得る。しかしながら、画像における背景領域(例えば、空などの滑らかな領域)では、Ｂｌｅｎｄｉｎｇ方式であってもＮｏｎ－ｂｌｅｎｄｉｎｇ方式であっても生成される予測画像の特徴は変わることはなく、どちらの方式を選択しても符号化効率に影響は生じない。 When switching between the blending method and the non-blending method is enabled, it may be necessary to transmit a flag indicating which of the blending method and the non-blending method is applied to the decoding side. However, in the background area of an image (for example, a smooth area such as the sky), the characteristics of the predicted image generated will not change whether the blending method or the non-blending method is used, and it is difficult to select which method. However, the encoding efficiency is not affected.

本実施形態では、決定部１７２４は、算出部１７２３が算出した類似度情報に基づいてＢｌｅｎｄｉｎｇ方式及びＮｏｎ－ｂｌｅｎｄｉｎｇ方式のいずれかを決定する。このような類似度情報は復号側でも算出可能な情報であるため、Ｂｌｅｎｄｉｎｇ方式及びＮｏｎ－ｂｌｅｎｄｉｎｇ方式のいずれを適用するかを示すフラグを復号側に伝送しなくても、符号化側及び復号側で共通の方式を暗黙的に決定できる。これにより、このようなフラグの復号側への伝送（シグナリング）を不要とし、シグナリングするフラグ量の増加を抑制できる。 In this embodiment, the determining unit 1724 determines either the blending method or the non-blending method based on the similarity information calculated by the calculating unit 1723. Such similarity information can be calculated on the decoding side as well, so it can be calculated on both the encoding and decoding sides without transmitting a flag indicating whether to apply the blending method or the non-blending method to the decoding side. The common method can be determined implicitly. This eliminates the need for such flag transmission (signaling) to the decoding side, and can suppress an increase in the amount of flags to be signaled.

例えば、決定部１７２４は、図４（ｂ）及び（ｃ）に示すように、算出部１７２３が算出した類似度情報に基づいて、類似度情報が示す類似度が閾値よりも低い場合はＢｌｅｎｄｉｎｇ方式を決定し、類似度情報が示す類似度が閾値よりも高い場合はＮｏｎ－ｂｌｅｎｄｉｎｇ方式を決定する。この閾値は、予め定められた固定値であってもよいし、符号化側から復号側に伝送する可変値であってもよい。 For example, as shown in Figures 4(b) and (c), based on the similarity information calculated by the calculation unit 1723, the determination unit 1724 determines the blending method when the similarity indicated by the similarity information is lower than a threshold, and determines the non-blending method when the similarity indicated by the similarity information is higher than the threshold. This threshold may be a predetermined fixed value, or may be a variable value transmitted from the encoding side to the decoding side.

合成部１７２５は、決定部１７２４が決定した合成方式を用いて、生成部１７２２が出力する２つの領域予測画像を合成し、予測対象ブロックに対応する予測ブロックを出力する。本実施形態では、合成部１７２５は、決定部１７２４がＢｌｅｎｄｉｎｇ方式を決定した場合は２つの領域予測画像をＢｌｅｎｄｉｎｇ方式で合成し、決定部１７２４がＮｏｎ－ｂｌｅｎｄｉｎｇ方式を決定した場合は２つの領域予測画像をＮｏｎ－ｂｌｅｎｄｉｎｇ方式で合成する。 The combining unit 1725 combines the two region predicted images output by the generating unit 1722 using the combining method determined by the determining unit 1724, and outputs a predicted block corresponding to the prediction target block. In the present embodiment, the combining unit 1725 combines the two area predicted images using the blending method when the determining unit 1724 determines the blending method, and combines the two area predicted images using the blending method when the determining unit 1724 determines the non-blending method. Images are combined using a non-blending method.

図５は、２つの領域予測画像Ｐ_１及びＰ_２を合成する動作を示す図である。図５（ａ）に示すように、合成部１７２５は、２つの領域予測画像Ｐ_１及びＰ_２をＢｌｅｎｄｉｎｇ方式で合成する場合、予測対象ブロックのブロックサイズ及びブロック形状に応じた重み係数マップ（Ｗｅｉｇｈｔｍａｐ）を用いて２つの領域予測画像Ｐ_１及びＰ_２を加重平均により合成する。これにより、図４（ａ）に示す境界領域Ｒ１及びＲ２が連続的になるよう調整される。一方、図５（ｂ）に示すように、合成部１７２５は、２つの領域予測画像Ｐ_１及びＰ_２をＮｏｎ－ｂｌｅｎｄｉｎｇ方式で合成する場合、２つの領域予測画像Ｐ_１及びＰ_２を加重平均せずに合成する。 FIG. 5 is a diagram showing the operation of combining two region predicted images P ₁ and P ₂ . As shown in FIG. 5A, when combining two region predicted images P ₁ and P ₂ using the blending method, the combining unit 1725 creates a weight coefficient map (Weight) according to the block size and block shape of the prediction target block. map), the two region predicted images P ₁ and P ₂ are combined by weighted averaging. Thereby, the boundary regions R1 and R2 shown in FIG. 4(a) are adjusted to be continuous. On the other hand, as shown in FIG. 5B, when combining the two region predicted images P ₁ and P ₂ using the Non-blending method, the composition unit 1725 performs a weighted average of the two region prediction images P ₁ and P ₂ . Synthesize without

＜復号装置の構成＞
次に、本実施形態に係る復号装置の構成について、上述した符号化装置の構成との相違点を主として説明する。図６は、本実施形態に係る復号装置２の構成を示す図である。復号装置２は、符号化ストリームから復号対象ブロックを復号する装置である。 <Configuration of Decoding Device>
Next, the configuration of a decoding device according to this embodiment will be described, focusing mainly on the differences from the configuration of the encoding device described above. Fig. 6 is a diagram showing the configuration of a decoding device 2 according to this embodiment. The decoding device 2 is a device that decodes a current block from an encoded stream.

図６に示すように、復号装置２は、エントロピー復号部２００と、逆量子化・逆変換部２１０と、合成部２２０と、メモリ２３０と、予測部２４０とを有する。 As shown in FIG. 6, the decoding device 2 has an entropy decoding unit 200, an inverse quantization/inverse transform unit 210, a synthesis unit 220, a memory 230, and a prediction unit 240.

エントロピー復号部２００は、符号化装置１により生成された符号化ストリームを復号し、量子化された変換係数を取得し、取得した変換係数を逆量子化・逆変換部２１０（逆量子化部２１１）に出力する。また、エントロピー復号部２００は、各種のシグナリング情報を取得する。例えば、エントロピー復号部２００は、復号対象ブロックに適用する予測処理に関する情報を取得し、取得した情報を予測部２４０に出力する。 The entropy decoding unit 200 decodes the coding stream generated by the coding device 1, obtains quantized transform coefficients, and outputs the obtained transform coefficients to the inverse quantization/inverse transform unit 210 (inverse quantization unit 211). The entropy decoding unit 200 also obtains various types of signaling information. For example, the entropy decoding unit 200 obtains information related to the prediction process to be applied to the block to be decoded, and outputs the obtained information to the prediction unit 240.

逆量子化・逆変換部２１０は、ブロック単位で逆量子化処理及び逆変換処理を行う。逆量子化・逆変換部２１０は、逆量子化部２１１と、逆変換部２１２とを有する。 The inverse quantization and inverse transform unit 210 performs inverse quantization processing and inverse transform processing on a block-by-block basis. The inverse quantization and inverse transform unit 210 has an inverse quantization unit 211 and an inverse transform unit 212.

逆量子化部２１１は、符号化装置１の量子化部１２２が行う量子化処理に対応する逆量子化処理を行う。逆量子化部２１１は、エントロピー復号部２００が出力する量子化変換係数を、量子化パラメータ（Ｑｐ）及び量子化行列を用いて逆量子化することにより、復号対象ブロックの変換係数を復元し、復元した変換係数を逆変換部２１２に出力する。 The inverse quantization unit 211 performs an inverse quantization process corresponding to the quantization process performed by the quantization unit 122 of the encoding device 1. The inverse quantization unit 211 restores the transform coefficients of the block to be decoded by inverse quantizing the quantized transform coefficients output by the entropy decoding unit 200 using a quantization parameter (Qp) and a quantization matrix, and outputs the restored transform coefficients to the inverse transform unit 212.

逆変換部２１２は、符号化装置１の変換部１２１が行う変換処理に対応する逆変換処理を行う。逆変換部２１２は、逆量子化部２１１が出力する変換係数に対して逆変換処理を行って予測残差を復元し、復元した予測残差（復元予測残差）を合成部２２０に出力する。 The inverse transform unit 212 performs inverse transform processing corresponding to the transform processing performed by the transform unit 121 of the encoding device 1. The inverse transform unit 212 performs inverse transform processing on the transform coefficients output by the inverse quantization unit 211 to restore the prediction residual, and outputs the restored prediction residual (restored prediction residual) to the synthesis unit 220.

合成部２２０は、逆変換部２１２が出力する予測残差と、予測部２４０が出力する予測ブロックとを画素単位で合成することにより、復号対象ブロックを復元（復号）し、復元ブロックをメモリ２３０に出力する。 The synthesis unit 220 reconstructs (decodes) the block to be decoded by synthesizing the prediction residual output by the inverse transform unit 212 and the prediction block output by the prediction unit 240 on a pixel-by-pixel basis, and outputs the reconstructed block to the memory 230.

メモリ２３０は、合成部２２０が出力する復元ブロックをフレーム単位で復号画像として記憶する。メモリ２３０は、フレーム単位の復号画像を復号装置２の外部に出力する。なお、合成部２２０とメモリ２３０との間にループフィルタが介在してもよい。 The memory 230 stores the restored blocks output by the combining unit 220 in units of frames as decoded images. The memory 230 outputs the decoded image in units of frames to the outside of the decoding device 2 . Note that a loop filter may be interposed between the synthesis section 220 and the memory 230.

予測部２４０は、ブロック単位で予測を行う。予測部２４０は、イントラ予測部２４１と、インター予測部２４２と、切替部２４３とを有する。 The prediction unit 240 performs prediction on a block-by-block basis. The prediction unit 240 has an intra prediction unit 241, an inter prediction unit 242, and a switching unit 243.

イントラ予測部２４１は、メモリ２３０に記憶された復号画像のうち復号対象ブロックに隣接する参照画素を参照し、エントロピー復号部２００が出力する情報に基づいて、復号対象ブロックをイントラ予測により予測する。そして、イントラ予測部２４１は、イントラ予測ブロックを生成し、生成したイントラ予測ブロックを切替部２４３に出力する。 The intra prediction unit 241 refers to reference pixels adjacent to the decoding target block in the decoded image stored in the memory 230, and predicts the decoding target block by intra prediction based on the information output from the entropy decoding unit 200. Then, the intra prediction unit 241 generates an intra prediction block and outputs the generated intra prediction block to the switching unit 243.

インター予測部２４２は、メモリ２３０に記憶された復号画像を参照画像として用いて、復号対象ブロックをインター予測により予測する。インター予測部２４２は、エントロピー復号部２００が出力する動きベクトル情報を用いてインター予測を行うことによりインター予測ブロックを生成し、生成したインター予測ブロックを切替部２４３に出力する。 The inter prediction unit 242 predicts the block to be decoded by inter prediction using the decoded image stored in the memory 230 as a reference image. The inter prediction unit 242 generates an inter prediction block by performing inter prediction using the motion vector information output by the entropy decoding unit 200, and outputs the generated inter prediction block to the switching unit 243.

本実施形態において、インター予測部２４２は、三角形分割予測（ＴＰＭ）を行う三角形分割予測部２４２ａを有する（図７参照）。インター予測部２４２は、予測対象ブロックに三角形分割予測を適用することを示す三角形分割予測適用フラグをエントロピー復号部２００が取得した場合、予測対象ブロックに三角形分割予測を適用する。三角形分割予測部２４２ａの詳細については後述する。 In this embodiment, the inter prediction unit 242 includes a triangulation prediction unit 242a that performs triangulation prediction (TPM) (see FIG. 7). If the entropy decoding unit 200 acquires a triangulation prediction application flag indicating that triangulation prediction is to be applied to the prediction target block, the inter prediction unit 242 applies triangulation prediction to the prediction target block. Details of the triangulation prediction unit 242a will be described later.

切替部２４３は、イントラ予測部２４１が出力するイントラ予測ブロックとインター予測部２４２が出力するインター予測ブロックとを切り替えて、いずれかの予測ブロックを合成部２２０に出力する。 The switching unit 243 switches between the intra prediction block output by the intra prediction unit 241 and the inter prediction block output by the inter prediction unit 242, and outputs one of the prediction blocks to the combining unit 220.

次に、本実施形態に係る三角形分割予測部２４２ａについて説明する。図７は、本実施形態に係る三角形分割予測部２４２ａの構成を示す図である。 Next, the triangular decomposition prediction unit 242a according to this embodiment will be described. FIG. 7 is a diagram showing the configuration of the triangular decomposition prediction unit 242a according to this embodiment.

図７に示すように、三角形分割予測部２４２ａは、分割部２４２１と、生成部２４２２と、算出部２４２３と、決定部２４２４と、合成部２４２５とを有する。 As shown in FIG. 7, the triangulation prediction unit 242a includes a division unit 2421, a generation unit 2422, a calculation unit 2423, a determination unit 2424, and a synthesis unit 2425.

分割部２４２１は、エントロピー復号部２００が取得した分割方向フラグに基づいて予測対象ブロックを対角線で分割し、２つの三角形領域を生成部２４２２に出力する。 The dividing unit 2421 divides the prediction target block diagonally based on the dividing direction flag acquired by the entropy decoding unit 200, and outputs two triangular areas to the generating unit 2422.

生成部２４２２は、エントロピー復号部２００が取得した動きベクトル情報に基づいて２つの三角形領域のそれぞれについて領域予測画像を生成し、２つの領域予測画像を算出部２４２３及び合成部２４２５に出力する。 The generation unit 2422 generates a region predicted image for each of the two triangular regions based on the motion vector information acquired by the entropy decoding unit 200, and outputs the two region prediction images to the calculation unit 2423 and the composition unit 2425.

算出部２４２３は、生成部２４２２が生成した２つの領域予測画像間の類似度を示す類似度情報を算出し、算出した類似度情報を決定部２４２４に出力する。類似度情報の算出方法は符号化側と同じ方法であり、符号化側及び復号側で共通の規則（アルゴリズム）を用いて類似度情報を算出する。 The calculation unit 2423 calculates similarity information indicating the similarity between the two region predicted images generated by the generation unit 2422, and outputs the calculated similarity information to the determination unit 2424. The method of calculating the similarity information is the same as that on the encoding side, and the similarity information is calculated using a common rule (algorithm) on the encoding side and the decoding side.

決定部２４２４は、算出部２４２３が算出した類似度情報に基づいて、２つの領域予測画像の合成に用いる合成方式を複数の方式の中から決定し、決定した合成方式を示す情報を合成部２４２５に出力する。本実施形態において、決定部２４２４は、算出部２４２３が算出した類似度情報に基づいて、２つの領域予測画像の合成に用いる合成方式としてＢｌｅｎｄｉｎｇ方式及びＮｏｎ－ｂｌｅｎｄｉｎｇ方式のいずれを用いるかを決定する。合成方式の決定方法は符号化側と同じ方法であり、符号化側及び復号側で共通の規則を用いて合成方式を決定する。すなわち、決定部２４２４は、Ｂｌｅｎｄｉｎｇ方式及びＮｏｎ－ｂｌｅｎｄｉｎｇ方式のいずれを適用するかを示すフラグを基づくことなく、類似度情報に基づいて合成方式を決定する。 The determination unit 2424 determines the synthesis method to be used for synthesizing the two area prediction images from among a plurality of methods based on the similarity information calculated by the calculation unit 2423, and outputs information indicating the determined synthesis method to the synthesis unit 2425. In this embodiment, the determination unit 2424 determines whether to use the blending method or the non-blending method as the synthesis method to be used for synthesizing the two area prediction images based on the similarity information calculated by the calculation unit 2423. The method of determining the synthesis method is the same as that on the encoding side, and the synthesis method is determined using a rule common to the encoding side and the decoding side. In other words, the determination unit 2424 determines the synthesis method based on the similarity information, without being based on a flag indicating whether the blending method or the non-blending method is to be applied.

合成部２４２５は、決定部２４２４が決定した合成方式を用いて、生成部２４２２が出力する２つの領域予測画像を合成し、予測対象ブロックに対応する予測ブロックを出力する。本実施形態では、合成部２４２５は、決定部２４２４がＢｌｅｎｄｉｎｇ方式を決定した場合は２つの領域予測画像をＢｌｅｎｄｉｎｇ方式で合成し、決定部２４２４がＮｏｎ－ｂｌｅｎｄｉｎｇ方式を決定した場合は２つの領域予測画像をＮｏｎ－ｂｌｅｎｄｉｎｇ方式で合成する。 The combining unit 2425 combines the two region predicted images output by the generating unit 2422 using the combining method determined by the determining unit 2424, and outputs a predicted block corresponding to the prediction target block. In this embodiment, the combining unit 2425 combines two area predicted images using a blending method when the determining unit 2424 determines a blending method, and combines two area predicted images using a blending method when the determining unit 2424 determines a non-blending method. Images are combined using a non-blending method.

＜三角形分割予測部の動作＞
次に、本実施形態に係る三角形分割予測部１７２ａ及び２４２ａの動作について説明する。三角形分割予測部１７２ａ及び２４２ａは同様な動作を行うため、ここでは三角形分割予測部２４２ａを例に挙げて説明する。図８は、本実施形態に係る三角形分割予測部２４２ａの動作フローを示す図である。 <Operation of Triangulation Prediction Unit>
Next, the operation of the triangular decomposition prediction units 172a and 242a according to this embodiment will be described. Since the triangular decomposition prediction units 172a and 242a perform similar operations, the triangular decomposition prediction unit 242a will be taken as an example for description here. Fig. 8 is a diagram showing the operation flow of the triangular decomposition prediction unit 242a according to this embodiment.

図８に示すように、ステップＳ１において、分割部２４２１は、エントロピー復号部２００が取得した分割方向フラグに基づいて予測対象ブロックを対角線で分割し、２つの三角形領域を生成部２４２２に出力する。 As shown in FIG. 8, in step S1, the division unit 2421 divides the prediction target block diagonally based on the division direction flag acquired by the entropy decoding unit 200, and outputs two triangular regions to the generation unit 2422.

ステップＳ２において、生成部２４２２は、エントロピー復号部２００が取得した動きベクトル情報に基づいて２つの三角形領域のそれぞれについて領域予測画像を生成し、２つの領域予測画像を算出部２４２３及び合成部２４２５に出力する。 In step S2, the generation unit 2422 generates a region prediction image for each of the two triangular regions based on the motion vector information acquired by the entropy decoding unit 200, and outputs the two region prediction images to the calculation unit 2423 and the synthesis unit 2425.

ステップＳ３において、算出部２４２３は、生成部２４２２が生成した２つの領域予測画像間の類似度を示す類似度情報を算出し、算出した類似度情報を決定部２４２４に出力する。 In step S3, the calculation unit 2423 calculates similarity information indicating the similarity between the two region predicted images generated by the generation unit 2422, and outputs the calculated similarity information to the determination unit 2424.

ステップＳ４において、決定部２４２４は、算出部２４２３が算出した類似度情報に基づいて、２つの領域予測画像の合成に用いる合成方式としてＢｌｅｎｄｉｎｇ方式及びＮｏｎ－ｂｌｅｎｄｉｎｇ方式のいずれを用いるかを決定する。 In step S4, the determination unit 2424 determines whether the blending method or the non-blending method is to be used as the synthesis method for synthesizing the two area prediction images based on the similarity information calculated by the calculation unit 2423.

ステップＳ５において、合成部２４２５は、決定部２４２４が決定した合成方式を用いて、生成部２４２２が出力する２つの領域予測画像を合成し、予測対象ブロックに対応する予測ブロックを出力する。 In step S5, the synthesis unit 2425 synthesizes the two area prediction images output by the generation unit 2422 using the synthesis method determined by the determination unit 2424, and outputs a prediction block corresponding to the block to be predicted.

このように、本実施形態によれば、Ｂｌｅｎｄｉｎｇ方式及びＮｏｎ－ｂｌｅｎｄｉｎｇ方式のいずれを適用するかを示すフラグを復号側に伝送しなくても、符号化側及び復号側で共通の方式を暗黙的に決定できる。これにより、複数の合成方式を三角形分割予測に導入する場合であっても、合成方式を示すフラグの復号側への伝送（シグナリング）を不要とし、シグナリングするフラグ量の増加を抑制できる。 In this way, according to this embodiment, a common method can be implicitly determined on the encoding side and the decoding side without transmitting a flag indicating whether the blending method or the non-blending method is to be applied to the decoding side. This makes it unnecessary to transmit (signal) a flag indicating the blending method to the decoding side even when multiple blending methods are introduced into the triangulation prediction, and can suppress an increase in the amount of flags to be signaled.

＜その他の実施形態＞
上述した実施形態において、類似度情報に基づいて、２つの領域予測画像の合成に用いる合成方式としてＢｌｅｎｄｉｎｇ方式及びＮｏｎ－ｂｌｅｎｄｉｎｇ方式のいずれを用いるかを決定する一例について説明した。しかしながら、複数のＢｌｅｎｄｉｎｇ方式を導入する場合、類似度情報に基づいて複数のＢｌｅｎｄｉｎｇ方式のいずれを用いるかを決定してもよい。 <Other embodiments>
In the above-mentioned embodiment, an example is described in which the blending method or the non-blending method is determined as the synthesis method used for synthesizing two area prediction images based on similarity information. However, when introducing a plurality of blending methods, it is also possible to determine which of the plurality of blending methods is used based on similarity information.

複数のＢｌｅｎｄｉｎｇ方式は、境界領域を混合するパラメータが互いに異なっていてもよい。ここで、パラメータは、境界領域の範囲及び重み係数のセットのうち少なくとも１つであってもよい。例えば、Ｂｌｅｎｄｉｎｇ処理を適用する境界領域の範囲が第１範囲（狭範囲）であり、重み係数の第１セットを用いる第１Ｂｌｅｎｄｉｎｇ方式と、Ｂｌｅｎｄｉｎｇ処理を適用する境界領域の範囲が第２範囲（広範囲）であり、重み係数の第２セットを用いる第２Ｂｌｅｎｄｉｎｇ方式とを導入する。ここで、第２セットを構成する重み係数は、第１セットに比べて多い。 The plurality of blending methods may have different parameters for blending the boundary areas. Here, the parameter may be at least one of a boundary region range and a set of weighting factors. For example, the range of the boundary area to which the blending process is applied is the first range (narrow range), the first blending method uses the first set of weighting coefficients, and the range of the boundary area to which the blending process is applied is the second range (wide range). ) and a second blending scheme using a second set of weighting coefficients. Here, the number of weighting coefficients forming the second set is larger than that of the first set.

このような前提下において、決定部１７２４及び２４２４は、算出部１７２３及び２４２３が算出した類似度情報が示す類似度が閾値よりも低い場合は第２Ｂｌｅｎｄｉｎｇ方式を決定し、この類似度情報が示す類似度が閾値よりも高い場合は第１Ｂｌｅｎｄｉｎｇ方式を決定する。 Under such a premise, the determining units 1724 and 2424 determine the second blending method when the similarity indicated by the similarity information calculated by the calculating units 1723 and 2423 is lower than the threshold, and If the degree is higher than the threshold, the first blending method is determined.

或いは、第１閾値と、この第１閾値よりも大きい第２閾値とを導入してもよい。決定部１７２４及び２４２４は、算出部１７２３及び２４２３が算出した類似度情報が示す類似度が第２閾値よりも高い場合はＮｏｎ－ｂｌｅｎｄｉｎｇ方式を決定し、この類似度情報が示す類似度が第１閾値よりも低い場合は第２Ｂｌｅｎｄｉｎｇ方式を決定し、この類似度情報が示す類似度が第１閾値乃至第２閾値の範囲内にある場合は第１Ｂｌｅｎｄｉｎｇ方式を決定する。 Alternatively, a first threshold and a second threshold greater than the first threshold may be introduced. The decision units 1724 and 2424 decide on the non-blending method if the similarity indicated by the similarity information calculated by the calculation units 1723 and 2423 is higher than the second threshold, decide on the second blending method if the similarity indicated by this similarity information is lower than the first threshold, and decide on the first blending method if the similarity indicated by this similarity information is within the range from the first threshold to the second threshold.

上述した実施形態において、予測対象ブロックを対角線により分割して２つの三角形領域を出力する一例について説明したが、直線で分割された分割形状であればよく、分割形状は三角形には限らない。また、予測対象ブロックの分割数を３つ以上とすることもできる。その場合には、３つ以上の予測領域における各領域のペアについての境界付近の類似度をそれぞれ算出し、それらの情報に基づいて当該領域のペアの合成に用いる重み係数を決定することになる。 In the above-described embodiment, an example was described in which the prediction target block is divided along diagonal lines to output two triangular areas, but the divided shape is not limited to triangles, and may be any shape that is divided along straight lines. Further, the number of divisions of the prediction target block can also be three or more. In that case, the degree of similarity near the boundary for each pair of regions in three or more prediction regions is calculated, and the weighting coefficient used for compositing the pair of regions is determined based on that information. .

なお、符号化装置１が行う各処理をコンピュータに実行させるプログラムが提供されてもよい。復号装置２が行う各処理をコンピュータに実行させるプログラムが提供されてもよい。プログラムは、コンピュータ読取り可能媒体に記録されていてもよい。コンピュータ読取り可能媒体を用いれば、コンピュータにプログラムをインストールすることが可能である。ここで、プログラムが記録されたコンピュータ読取り可能媒体は、非一過性の記録媒体であってもよい。非一過性の記録媒体は、特に限定されるものではないが、例えば、ＣＤ－ＲＯＭやＤＶＤ－ＲＯＭ等の記録媒体であってもよい。 Note that a program that causes a computer to execute each process performed by the encoding device 1 may be provided. A program that causes a computer to execute each process performed by the decoding device 2 may be provided. The program may be recorded on a computer readable medium. Computer-readable media allow programs to be installed on a computer. Here, the computer-readable medium on which the program is recorded may be a non-transitory recording medium. The non-transitory recording medium is not particularly limited, and may be, for example, a recording medium such as a CD-ROM or a DVD-ROM.

符号化装置１が行う各処理を実行する回路を集積化し、符号化装置１を半導体集積回路（チップセット、ＳｏＣ）により構成してもよい。復号装置２が行う各処理を実行する回路を集積化し、復号装置２を半導体集積回路（チップセット、ＳｏＣ）により構成してもよい。 The circuits that execute the processes performed by the encoding device 1 may be integrated, and the encoding device 1 may be configured as a semiconductor integrated circuit (chip set, SoC). The circuits that execute the processes performed by the decoding device 2 may be integrated, and the decoding device 2 may be configured as a semiconductor integrated circuit (chip set, SoC).

以上、図面を参照して実施形態について詳しく説明したが、具体的な構成は上述のものに限られることはなく、要旨を逸脱しない範囲内において様々な設計変更等をすることが可能である。 Although the embodiments have been described above in detail with reference to the drawings, the specific configuration is not limited to that described above, and various design changes can be made without departing from the scope of the invention.

１：符号化装置
２：復号装置
１００：ブロック分割部
１１０：減算部
１２０：変換・量子化部
１２１：変換部
１２２：量子化部
１３０：エントロピー符号化部
１４０：逆量子化・逆変換部
１４１：逆量子化部
１４２：逆変換部
１５０：合成部
１６０：メモリ
１７０：予測部
１７１：イントラ予測部
１７２：インター予測部
１７２ａ：三角形分割予測部
１７３：切替部
２００：エントロピー復号部
２１０：逆変換部
２１１：逆量子化部
２１２：逆変換部
２２０：合成部
２３０：メモリ
２４０：予測部
２４１：イントラ予測部
２４２：インター予測部
２４２ａ：三角形分割予測部
２４３：切替部
１７２１：分割部
１７２２：生成部
１７２３：算出部
１７２４：決定部
１７２５：合成部
２４２１：分割部
２４２２：生成部
２４２３：算出部
２４２４：決定部
２４２５：合成部 1: Encoding device 2: Decoding device 100: Block division unit 110: Subtraction unit 120: Transformation and quantization unit 121: Transformation unit 122: Quantization unit 130: Entropy encoding unit 140: Inverse quantization and inverse transform unit 141: Inverse quantization unit 142: Inverse transform unit 150: Synthesis unit 160: Memory 170: Prediction unit 171: Intra prediction unit 172: Inter prediction unit 172a: Triangular partition prediction unit 173: Switching unit 200: Entropy decoding unit 210: Inverse transformation unit 211: Inverse quantization unit 212: Inverse transformation unit 220: Synthesis unit 230: Memory 240: Prediction unit 241: Intra prediction unit 242: Inter prediction unit 242a: Triangular partition prediction unit 243: Switching unit 1721: Division unit 1722 : Generation unit 1723 : Calculation unit 1724 : Determination unit 1725 : Combination unit 2421 : Division unit 2422 : Generation unit 2423 : Calculation unit 2424 : Determination unit 2425 : Combination unit

Claims

A prediction device that performs prediction in units of blocks obtained by dividing an image,
a dividing unit that divides the prediction target block along straight lines and outputs at least two prediction regions;
a generation unit that generates a region predicted image using a motion vector for each of the two prediction regions;
a calculation unit that calculates similarity information indicating the similarity between the two region predicted images generated by the generation unit;
a determining unit that determines a combination method to be used for combining the two region predicted images from among a plurality of methods, based on the similarity information calculated by the calculation unit;
A prediction device comprising: a synthesis section that synthesizes the two region predicted images using the synthesis method determined by the determination section and outputs a prediction block corresponding to the prediction target block.

The plurality of methods include a first method of mixing boundary regions between the two region predicted images by weighted synthesis using weighting coefficients for each pixel position,
The prediction device according to claim 1, wherein the determining unit determines whether to use the first method as the combining method based on the similarity information calculated by the calculating unit.

The plurality of methods further includes a second method of not blending the boundary region by the weighted blending,
The prediction device according to claim 2 , wherein the determination unit determines which of the first method and the second method to use as the combining method based on the similarity information calculated by the calculation unit.

The first method includes a plurality of first methods having mutually different parameters for mixing the boundary regions,
The parameter is at least one of the range of the boundary region and the set of weighting coefficients,
When the first method is used, the determining section determines which of the plurality of first methods is to be used as the combining method based on the similarity information calculated by the calculating section. The prediction device according to claim 2 or 3.

An encoding device comprising the prediction device according to any one of claims 1 to 4.

A decoding device comprising the prediction device according to any one of claims 1 to 4.

A program that causes a computer to function as a prediction device according to any one of claims 1 to 4.