WO2013065570A1 - Image processing device and method - Google Patents
Image processing device and method Download PDFInfo
- Publication number
- WO2013065570A1 WO2013065570A1 PCT/JP2012/077581 JP2012077581W WO2013065570A1 WO 2013065570 A1 WO2013065570 A1 WO 2013065570A1 JP 2012077581 W JP2012077581 W JP 2012077581W WO 2013065570 A1 WO2013065570 A1 WO 2013065570A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- unit
- image
- motion
- information
- prediction mode
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/109—Selection of coding mode or of prediction mode among a plurality of temporal predictive coding modes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/523—Motion estimation or motion compensation with sub-pixel accuracy
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/55—Motion estimation with spatial constraints, e.g. at image or region borders
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/567—Motion estimation based on rate distortion criteria
Definitions
- the present disclosure relates to an image processing apparatus and method, and relates to an image processing apparatus and method that suppresses a reduction in encoding efficiency.
- MPEG compressed by orthogonal transform such as discrete cosine transform and motion compensation
- a device that conforms to a method such as Moving (Pictures Experts Group) has been widely used for both information distribution in broadcasting stations and information reception in general households.
- MPEG2 International Organization for Standardization
- IEC International Electrotechnical Commission
- MPEG2 was mainly intended for high-quality encoding suitable for broadcasting, but it did not support encoding methods with a lower code amount (bit rate) than MPEG1, that is, a higher compression rate. With the widespread use of mobile terminals, the need for such an encoding system is expected to increase in the future, and the MPEG4 encoding system has been standardized accordingly. Regarding the image coding system, the standard was approved as an international standard in December 1998 as ISO / IEC 14496-2.
- H.26L International Telecommunication Union Telecommunication Standardization Sector
- Q6 / 16 VCEG Video Coding Expert Group
- H.26L is known to achieve higher encoding efficiency than the conventional encoding schemes such as MPEG2 and MPEG4, although a large amount of calculation is required for encoding and decoding.
- Joint ⁇ ⁇ ⁇ ⁇ Model of Enhanced-Compression Video Coding has been implemented based on this H.26L and incorporating functions not supported by H.26L to achieve higher coding efficiency. It was broken.
- AVC Advanced Video Coding
- HEVC High Efficiency Efficiency Video Video Coding
- JCTVC Joint Collaboration Collaboration Team Video Coding
- a motion compensation filter (MC (Motion Compensated) filter
- MC Motion Compensated filter
- inter prediction interframe prediction
- the MC filter for example, in the case of HEVC, an 8-tap FIR (Finite Impulse Response) filter is used for the luminance component, and a 4-tap FIR filter is used for the color difference component.
- FIR Finite Impulse Response
- an encoded image is extracted from a frame memory as a reference image based on a block size, a prediction mode, a motion vector (MV), and the like, and an MC filter is applied on the basis of a fractional part of MV.
- a predicted image is generated. Note that MV is obtained from motion estimation in the encoder and stream information in the decoder.
- the pixel value of the reference image read out to generate a certain pixel of the predicted image can be used to generate other pixels around the pixel, generally, the larger the size of the predicted image to be generated, The use efficiency of the reference image is increased. In other words, the number of pixels of the reference image necessary for generating the predicted image is reduced as the size of the predicted image is larger when converted per pixel of the predicted image. In other words, the smaller the size of the predicted image, the lower the utilization efficiency of the reference image, and the larger the number of pixels of the reference image necessary for generating one pixel of the predicted image.
- the resources used by the decoder are limited depending on the level. Specifically, there is an upper limit on the number of motion vectors that can be used in two consecutive macroblocks. Due to this limitation, inter prediction with a small block size is not continuously generated. In addition, bi-directional prediction of 8 ⁇ 8 block size macroblocks is also prohibited.
- the present disclosure has been made in view of such a situation, and by limiting motion prediction in consideration of not only the block size and prediction mode but also the accuracy of motion vectors, unnecessary reduction in encoding efficiency can be achieved.
- the purpose is to suppress.
- One aspect of the present disclosure includes a control unit that controls an inter prediction mode candidate for performing motion compensation based on motion vector accuracy, and motion only in an inter prediction mode that is permitted to be the candidate by the control unit.
- a motion search unit that performs a search and generates a motion vector, and a motion compensation that generates a predicted image by performing the motion compensation in an inter prediction mode selected from each candidate generated by the motion search unit.
- An image processing apparatus that performs a search and generates a motion vector, and a motion compensation that generates a predicted image by performing the motion compensation in an inter prediction mode selected from each candidate generated by the motion search unit.
- the control unit can control whether or not the inter prediction mode is set as the candidate depending on whether the accuracy of the motion vector is integer accuracy or fractional accuracy.
- the control unit can control whether or not the inter prediction mode is set as the candidate depending on whether the vertical and horizontal accuracy of the motion vector is integer accuracy or fractional accuracy.
- the control unit can further control whether or not the inter prediction mode is set as the candidate depending on whether the prediction mode is uni-prediction or bi-prediction.
- the control unit can further control whether or not the inter prediction mode is set as the candidate according to the size of the region that is a prediction processing unit.
- the control unit determines the inter prediction mode as the candidate depending on whether information on the amount of data read from the memory determined according to the accuracy of the motion vector exceeds an upper limit due to a resource, a user instruction, or the like. Whether or not to do so can be controlled.
- a transmission unit that generates and transmits transmission information indicating the content of control by the control unit can be further provided.
- the transmission unit may define a profile and a level as the transmission information, and transmit the information indicating the profile and the level in a sequence parameter set.
- a cost function calculation unit For each candidate for which a motion vector is generated by the motion search unit, a cost function calculation unit that calculates a cost function value and an interface that performs the motion compensation based on the cost function value calculated by the cost calculation unit.
- a mode selection unit that selects a prediction mode from the candidates, and the motion compensation unit may perform the motion compensation in the inter prediction mode selected by the mode selection unit.
- One aspect of the present disclosure is also an image processing method of the image processing device, wherein the control unit controls inter prediction mode candidates for performing motion compensation based on the accuracy of the motion vector, and the motion search unit includes: The motion search is performed only in the inter prediction mode permitted to be the candidate, and a motion vector is generated. The motion compensation unit performs the motion in the inter prediction mode selected from the candidates from which the motion vector is generated. This is an image processing method for generating a predicted image by performing compensation.
- an acquisition unit that acquires control information used in inter prediction when encoding image data to control inter prediction mode candidates for motion compensation based on the accuracy of a motion vector; Based on the control information acquired by the acquisition unit, a determination unit that determines whether to decode the encoded data generated by encoding the image data, and a determination result by the determination unit,
- An image processing apparatus comprising: a control unit that controls decoding of the encoded data.
- the determination unit performs the determination based on whether or not the information regarding the amount of data read from the memory specified in the control information acquired by the acquisition unit exceeds an upper limit based on a resource or a user instruction. It can be carried out.
- the determination unit can make the determination as to whether the information related to the amount of data read from the memory defined by the profile and level exceeds an upper limit based on resources, user instructions, or the like.
- the control unit can execute decoding of the encoded data only when the determination unit determines that decoding is possible.
- control unit may further include an error processing unit that performs error processing under the control of the control unit.
- Another aspect of the present disclosure is also an image processing method of an image processing apparatus, in which an acquisition unit selects inter prediction mode candidates for motion compensation used in inter prediction when encoding image data. Control information to be controlled is acquired based on the accuracy of the motion vector, and the determination unit determines whether or not to decode the encoded data generated by encoding the image data based on the acquired control information.
- the control unit is an image processing method for controlling the decoding of the encoded data based on the determination result.
- a candidate for an inter prediction mode for performing motion compensation is controlled based on the accuracy of a motion vector, and a motion search is performed only in an inter prediction mode that is permitted to be the candidate.
- a motion vector is generated, motion compensation is performed in an inter prediction mode selected from the candidates for which the motion vector is generated, and a predicted image is generated.
- control information used to control inter prediction mode candidates for motion compensation which is used in inter prediction when encoding image data, is acquired based on the accuracy of a motion vector. Based on the acquired control information, it is determined whether or not the encoded data generated by encoding the image data can be decoded, and the decoding of the encoded data is controlled based on the determination result.
- an image can be processed.
- a reduction in encoding efficiency can be suppressed.
- FIG. 20 is a block diagram illustrating a main configuration example of a computer. It is a block diagram which shows the main structural examples of a television apparatus. It is a block diagram which shows the main structural examples of a mobile terminal device. It is a block diagram which shows the main structural examples of a recording / reproducing machine. It is a block diagram which shows the main structural examples of an imaging device.
- FIG. 1 is a block diagram illustrating a main configuration example of an image encoding device that is an image processing device to which the present technology is applied.
- the image encoding apparatus 100 shown in FIG. 1 is, for example, a HEVC (High Efficiency Video Coding) encoding scheme
- the image data of the moving image is encoded as in the H.264 and MPEG (Moving Picture Experts Group) 4 Part 10 (AVC (Advanced Video Coding)) coding system.
- H.264 and MPEG Motion Picture Experts Group
- AVC Advanced Video Coding
- the image encoding device 100 includes an A / D conversion unit 101, a screen rearrangement buffer 102, a calculation unit 103, an orthogonal transformation unit 104, a quantization unit 105, a lossless encoding unit 106, and a storage buffer. 107.
- the image coding apparatus 100 also includes an inverse quantization unit 108, an inverse orthogonal transform unit 109, a calculation unit 110, a loop filter 111, a frame memory 112, a selection unit 113, an intra prediction unit 114, a motion prediction / compensation unit 115, and a prediction.
- An image selection unit 116 and a rate control unit 117 are included.
- the image encoding device 100 includes a motion vector control unit 121.
- the A / D conversion unit 101 A / D converts the input image data, supplies the converted image data (digital data) to the screen rearrangement buffer 102, and stores it.
- the screen rearrangement buffer 102 rearranges the images of the frames in the stored display order in the order of frames for encoding in accordance with GOP (Group Of Picture), and the images in which the order of the frames is rearranged. This is supplied to the calculation unit 103.
- the screen rearrangement buffer 102 supplies each frame image to the calculation unit 103 for each predetermined partial area that is a processing unit (encoding unit) of the encoding process.
- the screen rearrangement buffer 102 also supplies the images in which the order of frames is rearranged to the intra prediction unit 114 and the motion prediction / compensation unit 115 for each partial region.
- the calculation unit 103 subtracts the prediction image supplied from the intra prediction unit 114 or the motion prediction / compensation unit 115 via the prediction image selection unit 116 from the image read from the screen rearrangement buffer 102, and the difference information Is output to the orthogonal transform unit 104. For example, in the case of an image on which intra coding is performed, the calculation unit 103 subtracts the prediction image supplied from the intra prediction unit 114 from the image read from the screen rearrangement buffer 102. For example, in the case of an image on which inter coding is performed, the arithmetic unit 103 subtracts the predicted image supplied from the motion prediction / compensation unit 115 from the image read from the screen rearrangement buffer 102.
- the orthogonal transform unit 104 performs orthogonal transform such as discrete cosine transform and Karhunen-Loeve transform on the difference information supplied from the computation unit 103. Note that this orthogonal transformation method is arbitrary.
- the orthogonal transform unit 104 supplies the transform coefficient obtained by the orthogonal transform to the quantization unit 105.
- the quantization unit 105 quantizes the transform coefficient supplied from the orthogonal transform unit 104.
- the quantization unit 105 supplies the quantized transform coefficient to the lossless encoding unit 106.
- the lossless encoding unit 106 encodes the transform coefficient quantized by the quantization unit 105 using an arbitrary encoding method, and generates encoded data (bit stream). Since the coefficient data is quantized under the control of the rate control unit 117, the code amount of the encoded data becomes the target value set by the rate control unit 117 (or approximates the target value).
- the lossless encoding unit 106 acquires intra prediction information including information indicating an intra prediction mode from the intra prediction unit 114, and moves inter prediction information including information indicating an inter prediction mode, motion vector information, and the like. Obtained from the prediction / compensation unit 115.
- the lossless encoding unit 106 acquires information (transmission information) regarding the motion prediction limitation supplied from the motion vector control unit 121.
- the lossless encoding unit 106 encodes these various types of information using an arbitrary encoding method, and includes (multiplexes) the information in the encoded data (bit stream).
- the lossless encoding unit 106 supplies the encoded data generated in this way to the storage buffer 107 for storage.
- Examples of the encoding scheme of the lossless encoding unit 106 include variable length encoding or arithmetic encoding.
- Examples of variable length coding include H.264.
- CAVLC Context-Adaptive Variable Length Coding
- Examples of arithmetic coding include CABAC (Context-Adaptive Binary Arithmetic Coding).
- the accumulation buffer 107 temporarily holds the encoded data supplied from the lossless encoding unit 106.
- the accumulation buffer 107 outputs the stored encoded data as a bit stream at a predetermined timing, for example, to a recording device (recording medium) or a transmission path (not shown) in the subsequent stage. That is, various types of encoded information are supplied to a device that decodes encoded data obtained by encoding image data by the image encoding device 100 (hereinafter also referred to as a decoding-side device).
- the transform coefficient quantized by the quantization unit 105 is also supplied to the inverse quantization unit 108.
- the inverse quantization unit 108 inversely quantizes the quantized transform coefficient by a method corresponding to the quantization by the quantization unit 105.
- the inverse quantization unit 108 supplies the obtained transform coefficient to the inverse orthogonal transform unit 109.
- the inverse orthogonal transform unit 109 performs inverse orthogonal transform on the transform coefficient supplied from the inverse quantization unit 108 by a method corresponding to the orthogonal transform performed by the orthogonal transform unit 104.
- the inversely orthogonally transformed output (difference information restored locally) is supplied to the calculation unit 110.
- the calculation unit 110 converts the inverse orthogonal transform result supplied from the inverse orthogonal transform unit 109, that is, locally restored difference information, into the intra prediction unit 114 or the motion prediction / compensation unit 115 via the predicted image selection unit 116. Are added to the predicted image to obtain a locally reconstructed image (hereinafter referred to as a reconstructed image).
- the reconstructed image is supplied to the loop filter 111 or the frame memory 112.
- the loop filter 111 includes a deblock filter, an adaptive loop filter, and the like, and appropriately performs a filtering process on the decoded image supplied from the calculation unit 110.
- the loop filter 111 removes block distortion of the decoded image by performing a deblocking filter process on the decoded image.
- the loop filter 111 performs image quality improvement by performing loop filter processing using a Wiener filter on the deblock filter processing result (decoded image from which block distortion has been removed). Do.
- the loop filter 111 may perform arbitrary filter processing on the decoded image. Further, the loop filter 111 can supply information such as filter coefficients used for the filter processing to the lossless encoding unit 106 and encode it as necessary. The loop filter 111 supplies a filter processing result (hereinafter referred to as a decoded image) to the frame memory 112.
- a filter processing result hereinafter referred to as a decoded image
- the frame memory 112 stores the reconstructed image supplied from the calculation unit 110 and the decoded image supplied from the loop filter 111, respectively.
- the frame memory 112 supplies the stored reconstructed image to the intra prediction unit 114 via the selection unit 113 at a predetermined timing or based on a request from the outside such as the intra prediction unit 114.
- the frame memory 112 also stores the decoded image stored at a predetermined timing or based on a request from the outside such as the motion prediction / compensation unit 115 via the selection unit 113. 115.
- the selection unit 113 indicates the supply destination of the image output from the frame memory 112. For example, in the case of intra prediction, the selection unit 113 reads an image (reconstructed image) that has not been subjected to filter processing from the frame memory 112 and supplies it to the intra prediction unit 114 as peripheral pixels.
- the selection unit 113 reads out an image (decoded image) that has been filtered from the frame memory 112, and supplies it as a reference image to the motion prediction / compensation unit 115.
- the intra prediction unit 114 When the intra prediction unit 114 acquires an image (peripheral image) of a peripheral region located around the processing target region from the frame memory 112, the intra prediction unit 114 basically uses a pixel value of the peripheral image to predict a prediction unit (PU ( Prediction (Unit))) is used as a processing unit to perform intra prediction (in-screen prediction) for generating a predicted image.
- the intra prediction unit 114 performs this intra prediction in a plurality of modes (intra prediction modes) prepared in advance.
- the intra prediction unit 114 generates predicted images in all candidate intra prediction modes, evaluates the cost function value of each predicted image using the input image supplied from the screen rearrangement buffer 102, and selects the optimum mode. select. When the intra prediction unit 114 selects the optimal intra prediction mode, the intra prediction unit 114 supplies the predicted image generated in the optimal mode to the predicted image selection unit 116.
- the intra prediction unit 114 appropriately supplies intra prediction information including information related to intra prediction, such as an optimal intra prediction mode, to the lossless encoding unit 106 to be encoded.
- the motion prediction / compensation unit 115 basically performs motion prediction (inter prediction) using the input image supplied from the screen rearrangement buffer 102 and the reference image supplied from the frame memory 112 as a processing unit. And a motion compensation process is performed according to the detected motion vector to generate a predicted image (inter predicted image information).
- the motion prediction / compensation unit 115 performs such inter prediction in a plurality of modes (inter prediction modes) prepared in advance.
- the motion prediction / compensation unit 115 performs motion prediction only in the permitted mode under the control of the motion vector control unit 121. That is, the motion vector control unit 121 may limit candidate inter prediction modes.
- the motion prediction / compensation unit 115 generates prediction images in all candidate inter prediction modes, evaluates the cost function value of each prediction image, and selects an optimal mode. When the optimal inter prediction mode is selected, the motion prediction / compensation unit 115 supplies the predicted image generated in the optimal mode to the predicted image selection unit 116.
- the motion prediction / compensation unit 115 appropriately supplies inter prediction information including information related to inter prediction, such as an optimal inter prediction mode, to the lossless encoding unit 106 to be encoded.
- the predicted image selection unit 116 selects a supply source of a predicted image to be supplied to the calculation unit 103 or the calculation unit 110.
- the prediction image selection unit 116 selects the intra prediction unit 114 as a supply source of the prediction image, and supplies the prediction image supplied from the intra prediction unit 114 to the calculation unit 103 and the calculation unit 110.
- the predicted image selection unit 116 selects the motion prediction / compensation unit 115 as a supply source of the predicted image, and calculates the predicted image supplied from the motion prediction / compensation unit 115 as the calculation unit 103. To the arithmetic unit 110.
- the rate control unit 117 controls the quantization operation rate of the quantization unit 105 based on the code amount of the encoded data stored in the storage buffer 107 so that overflow or underflow does not occur.
- the motion vector control unit 121 controls the mode of motion prediction performed by the motion prediction / compensation unit 115 in order to secure resources for reading out the reference image by the motion prediction / compensation unit 115. That is, the motion vector control unit 121 appropriately limits the generation of motion vectors so that the memory bandwidth of the frame memory 112 is not unnecessarily occupied for reading the reference image. At this time, the motion vector control unit 121 performs restriction in consideration of not only the block size and the prediction mode (prediction direction) but also the accuracy of the generated motion vector.
- the motion vector control unit 121 supplies control information indicating such restriction contents to the motion prediction / compensation unit 115.
- the motion prediction / compensation unit 115 performs motion prediction according to the supplied control information.
- the motion prediction / compensation unit 115 performs motion prediction according to a restriction that also considers the accuracy of the motion vector.
- the motion vector control unit 121 supplies information related to this restriction to the lossless encoding unit 106 as transmission information for transmission.
- FIG. 2 is a diagram for explaining an example of a state of motion prediction / compensation processing with 1/4 pixel accuracy defined in the AVC encoding method.
- each square represents a pixel.
- A indicates the position of the integer precision pixel stored in the frame memory
- b, c, d indicate the position of 1/2 pixel precision
- e 1 , e 2 , e 3 indicate 1/4 pixel. The position of accuracy is shown.
- Clip1 (a) 0; if (a ⁇ 0) a; otherwise max_pix; if (a> max_pix) ... (1)
- the pixel values at the positions b and d are generated as shown in the following equations (2) and (3) using a 6 tap FIR filter.
- the pixel value at the position of c is generated as shown in the following formulas (4) to (6) by applying a 6 tap FIR filter in the horizontal direction and the vertical direction.
- Clip processing is performed only once at the end after performing both horizontal and vertical product-sum processing.
- e 1 to e 3 are generated by linear interpolation as in the following formulas (7) to (9).
- coding unit a coding unit (CU (Coding Unit)) defined in the HEVC encoding method will be described.
- the coding unit is also called a coding tree block (CTB (Coding Tree Block)), and is a partial region of a multi-layer structure of a picture unit image that plays the same role as a macroblock in AVC. That is, CU is a unit (encoding unit) of encoding processing. While the size of the macroblock is fixed to 16 ⁇ 16 pixels, the size of the CU is not fixed, and is specified in the image compression information in each sequence.
- CTB Coding Tree Block
- a CU having the largest size is referred to as a large coding unit (LCU (Largest Coding Unit)), and a CU having the smallest size is referred to as a smallest coding unit (SCU (Smallest Coding Unit)). That is, the LCU is the maximum coding unit, and the SCU is the minimum coding unit.
- the sizes of these areas are specified, It is a square and is limited to a size represented by a power of 2. That is, each area obtained by dividing a (square) CU at a certain level into 2 ⁇ 2 is a (square) CU one level below.
- Figure 3 shows an example of coding unit (Coding Unit) defined in HEVC.
- split_flag When the value of split_flag is “1”, the CU having the size of 2Nx2N is divided into CUs having the size of NxN that is one level below.
- the CU is divided into prediction units (Prediction Units (PU)) that are regions (partial regions of images in units of pictures) that are processing units of intra or inter prediction, and are regions that are processing units of orthogonal transformation It is divided into transform units (Transform Unit (TU)), which is (a partial area of an image in units of pictures).
- Prediction Units PU
- TU Transform Unit
- inter prediction PU Inter Prediction Unit
- 4N sizes of 2Nx2N, 2NxN, Nx2N, and NxN can be set for a 2Nx2N CU.
- one PU of the same size as that CU two PUs obtained by dividing the CU vertically or horizontally, or four PUs obtained by dividing the CU into two vertically and horizontally respectively.
- the image encoding apparatus 100 performs each process related to encoding using a partial region of an image in units of pictures as a processing unit.
- a processing unit uses a CU defined by HEVC as a coding unit. That is, LCU is the maximum coding unit and SCU is the minimum coding unit.
- the processing unit of each encoding process by the image encoding apparatus 100 is not limited to this, and is arbitrary.
- a macroblock or sub-macroblock defined by AVC may be used as a processing unit.
- the “(partial) area” includes all the above-mentioned various areas (for example, macroblock, sub-macroblock, LCU, CU, SCU, PU, TU, etc.). May be). Of course, units other than those described above may be included, and units that are impossible according to the content of the description are appropriately excluded.
- the PU serving as a processing unit for motion prediction can be set to various sizes.
- the smaller the size of the PU the lower the utilization efficiency of the reference image, and the greater the number of reference image pixels necessary for generating one predicted image pixel.
- the number of reference images increases compared to unidirectional prediction.
- the resources used by the decoder are limited depending on the level. Specifically, there is an upper limit on the number of motion vectors that can be used in two consecutive macroblocks. Due to this limitation, inter prediction with a small block size is not continuously generated. In addition, bidirectional prediction of macroblocks having a block size smaller than 8 ⁇ 8 is also prohibited.
- the number of reference image pixels necessary for generating a predicted image will be described more specifically. Since the MC filter outputs n pixels, it is necessary to input (n + number of taps-1) pixels. That is, assuming that the vertical size of the PU is BV and the horizontal size is BH, the number of pixels N of the reference image necessary for generating the predicted image can be expressed as the following Expression (10). .
- N (BH + number of taps-1) x (BV + number of taps-1) ... (10)
- both L0 and L1 predictions are required, so the number of pixels N of the reference image necessary for generating a predicted image is expressed by the following equation (11). be able to.
- N (pixel corresponding to equation (10) of L0) + (pixel corresponding to equation (10) of L1) (11)
- N AH x (1 + ROUNDUP (((BH + number of taps-1)-1) / AH))) ⁇ AV ⁇ (1 + ROUNDUP ((((BV + number of taps-1)-1) / AV))) (12)
- ROUNDUP (x) represents the smallest integer greater than x.
- N of a reference image necessary for generating a predicted image can be expressed as the following equation (13).
- N (pixel corresponding to equation (12) of L0) + (pixel corresponding to equation (12) of L1) ... (13)
- the number of pixels N when the number of pixels of the reference image necessary for generating the predicted image is converted per pixel of the predicted image, the number of pixels N can be expressed as the following equation (14). In the case of prediction, the number of pixels N can be expressed as the following Expression (15).
- the number of pixels per pixel of the prediction image of the reference image necessary for generating the prediction image not only depends on the block size and the prediction mode (uni-prediction or bi-prediction), but also the number of taps, That is, it depends on whether the accuracy of the motion vector to be generated is integer accuracy or fractional accuracy.
- Prediction mode PU size, motion vector accuracy, and reference image necessary for prediction image generation when the horizontal memory alignment pixel number AH is 8 and the vertical memory alignment pixel number AV is 2.
- the relationship of the number of pixels per predicted image pixel is as shown in the tables shown in FIGS.
- FIG. 4 shows the relationship between the PU size, the motion vector accuracy, and the number of pixels per predicted image pixel of the reference image necessary for generating the predicted image when the prediction mode is uni-prediction (Uni).
- 5 shows the relationship between the PU size, the motion vector accuracy, and the number of pixels per pixel of the predicted image of the reference image necessary for generating the predicted image when the prediction mode is bi-prediction (Bi).
- the value in the column “Bi / Uni” indicates the prediction mode (whether it is single prediction (Uni) or bi-prediction (Bi)).
- the numbers in the “BH” column indicate the horizontal size of the PU, and the numbers in the “BV” column indicate the vertical size of the PU.
- the value in the “Sub pel” column indicates the accuracy of the motion vector.
- “Integer” indicates that the motion vector has integer precision in both the vertical direction and the horizontal direction.
- “Abc” indicates that the motion vector has integer precision in the vertical direction and fractional precision in the horizontal direction.
- “Dhn” indicates that the motion vector has fractional accuracy in the vertical direction and integer accuracy in the horizontal direction.
- “Efgijkpqr” indicates that the motion vector has fractional accuracy in both the vertical and horizontal directions.
- the number in the column “number of input pixels” indicates the number of pixels per pixel of the predicted image of the reference image necessary for generating the predicted image. That is, it represents the amount of data read from the frame memory.
- the maximum value of the number of input pixels is “13.5” (restricted by “13.5”).
- the hatched portions in FIGS. 4 and 5 show examples in which the value of “number of input pixels” is larger than “13.5” (larger than the AVC limit).
- the number of pixels per predicted image pixel of the reference image necessary for predicted image generation may be below the upper limit of AVC. there were. That is, since the influence on the load due to the accuracy of the motion vector is not taken into consideration, the options of the inter prediction mode are unnecessarily limited, and as a result, the coding efficiency may be reduced.
- the motion vector control unit 121 limits motion prediction in consideration of not only the block size and the prediction mode but also the accuracy of the motion vector in order to suppress unnecessary reduction in encoding efficiency as much as possible.
- FIG. 6 is a block diagram illustrating a main configuration example of the motion prediction / compensation unit 115 and the motion vector control unit 121 of FIG.
- the motion prediction / compensation unit 115 includes a motion search unit 151, a cost function calculation unit 152, a mode determination unit 153, a motion compensation unit 154, and a motion information buffer 155.
- the motion vector control unit 121 includes a restriction information storage unit 161, a restriction control unit 162, and a transmission information providing unit 163.
- the motion search unit 151 of the motion prediction / compensation unit 115 uses all the input image pixel values acquired from the screen rearrangement buffer 102 and the reference image pixel values acquired from the frame memory 112 as candidates for all inter prediction modes. Then, a motion search is performed for each region of the prediction processing unit, a motion vector is obtained, and a difference image (difference pixel value) between the input image and the reference image is obtained.
- the motion search unit 151 acquires the motion vector control information supplied from the restriction control unit 162 of the motion vector control unit 121, and performs a motion search according to the restriction indicated by the motion vector control information. That is, the motion search unit 151 sets only a mode permitted by the restriction control unit 162 as a candidate inter prediction mode, and performs a motion search only for that mode. In other words, the motion search unit 151 does not perform a motion search in a mode prohibited by the restriction control unit 162.
- the motion search unit 151 appropriately acquires peripheral motion information, which is information on motion vectors of peripheral PUs located around the attention PU that is the processing target PU, from the motion information buffer 155, and performs median prediction, merge mode, and the like. Perform the process.
- the motion search unit 151 supplies the cost function calculation unit 152 with information (motion information) regarding the motion of each inter prediction mode obtained as described above.
- the motion search part 151 supplies the difference pixel value corresponding to motion information to the cost function calculation part 152 as needed.
- the cost function calculation unit 152 calculates cost function values for all inter prediction modes based on the information supplied from the motion search unit 151.
- the cost function calculation unit 152 supplies the calculated cost function value of each inter prediction mode to the mode determination unit 153 together with the motion information.
- the mode determination unit 153 determines the inter prediction mode having the smallest cost function value as the optimal inter prediction mode.
- the mode determination unit 153 supplies the optimal prediction mode information indicating the optimal inter prediction mode to the motion compensation unit 154 together with the motion information of the optimal inter prediction mode.
- the motion compensation unit 154 performs motion compensation in an optimal inter prediction mode and generates a predicted image.
- the motion compensation unit 154 acquires the reference image pixel value corresponding to the motion information from the frame memory 112 in the optimal inter prediction mode according to the motion information and the optimal prediction mode information acquired from the mode determination unit 153.
- the optimal inter prediction mode is a mode in which no motion vector is supplied from the mode determination unit 153, such as the MV competition mode or the merge mode
- the motion compensation unit 154 receives the motion information of the peripheral PU from the motion information buffer 155. (Peripheral motion information) is acquired.
- the motion search unit 151 only the prediction mode permitted by the restriction control unit 162 is set as the candidate inter prediction mode, and the motion search is performed only for that mode. Therefore, the mode set as the optimal inter prediction mode is also one of the modes permitted by the restriction control unit 162. That is, the motion compensation unit 154 performs motion compensation in the inter prediction mode permitted by the restriction control unit 162.
- the motion compensation unit 154 supplies the generated predicted image pixel value to the predicted image selection unit 116 and causes the calculation unit 103 to subtract from the input image or causes the calculation unit 110 to add to the difference image.
- the motion compensation unit 154 supplies the optimal prediction mode information and the motion information to the lossless encoding unit 106 for encoding. Further, the motion compensation unit 154 supplies the motion information to the motion information buffer 155 and stores it.
- the motion information buffer 155 acquires and stores motion information supplied from the motion compensation unit 154.
- the motion information buffer 155 supplies the stored motion information to the motion search unit 151 and the motion compensation unit 154 as peripheral motion information.
- the restriction information storage unit 161 of the motion vector control unit 121 stores restriction information that is information related to the restriction of the inter prediction mode in advance, and at a predetermined timing or a request from another processing unit such as the restriction control unit 162, for example. Accordingly, the restriction information is supplied to the restriction controller 162.
- This restriction information is, for example, “4 ⁇ 4 block size is prohibited in L0 prediction or L1 prediction. 4 ⁇ 4, 8 ⁇ 4, 4 ⁇ 8 and 8 ⁇ 8“ efgijkpqr ”are prohibited in bi-prediction.
- the restriction information may be any information as long as such restriction can be provided. Further, which mode (prediction mode (single prediction, bi-prediction), combination of block size and motion vector accuracy) is specifically limited is arbitrary. That is, in the example of FIGS. 4 and 5, any row in the table may be prohibited.
- the restriction control unit 162 When the restriction control unit 162 acquires the restriction information from the restriction information storage unit 161, the restriction control unit 162 generates motion vector control information for notifying the motion search unit 151 of the restriction content according to the restriction information, and performs motion search on the generated motion vector control information. To the unit 151.
- the motion search unit 151 can perform a motion search only in a mode permitted by the restriction information by performing a motion search according to the restriction of the motion vector control information.
- the motion compensation unit 154 can also perform motion compensation in the mode permitted by the restriction information. That is, the motion vector control unit 121 (restriction control unit 162) can limit motion prediction based on the accuracy of the motion vector. Therefore, the motion vector control unit 121 (restriction control unit 162) can suppress an increase in load while suppressing a decrease in encoding efficiency, and can encode an image. Therefore, the image encoding device 100 can reduce suppression of encoding efficiency.
- the restriction control unit 162 supplies the transmission information providing unit 163 with control information instructing to transmit information indicating the restriction content.
- This control information includes the restriction content by the restriction information.
- the transmission information providing unit 163 generates transmission information indicating the content of restriction by the restriction information in accordance with the supplied control information, and supplies the transmission information to the lossless encoding unit 106 for transmission.
- This transmission information may be any information as long as it indicates the restriction content of the restriction information.
- the restriction information may be included in the transmission information, the restriction content may be included in the transmission information, or information equivalent to the restriction content may be included in the transmission information.
- the restriction by the motion vector control information may be any restriction as long as the restriction by the accuracy of the motion vector is included. For example, if the upper limit of the “number of input pixels” is set to the same level as AVC (13.5) in the tables of FIG. 4 and FIG. ”And 8 ⁇ 8 block size“ efgijkpqr ”are prohibited, others are allowed, and in bi-prediction, all Sub pels with block sizes 4 ⁇ 4, 8 ⁇ 4, and 4 ⁇ 8, and 8 ⁇ 8 Block sizes “dhn” and “efgijkpqr” may be prohibited and others may be permitted.
- limiting by the precision of a motion vector are arbitrary, restrictions other than prediction mode and block size may be included in the restriction
- the transmission information only needs to include the content of restriction by the motion vector control information.
- the restriction information may be any information as long as the motion vector control information and the transmission information include a restriction due to the accuracy of the motion vector.
- step S101 the A / D converter 101 performs A / D conversion on the input image.
- step S102 the screen rearrangement buffer 102 stores the A / D converted image, and rearranges the picture from the display order to the encoding order.
- step S103 the intra prediction unit 114 performs an intra prediction process in the intra prediction mode.
- step S104 the motion vector control unit 121 performs a motion vector control process that restricts motion vector generation in the motion prediction / compensation process.
- step S105 the motion prediction / compensation unit 115 performs inter motion prediction processing for performing motion prediction and motion compensation in the inter prediction mode in accordance with the motion vector restriction in step S104.
- step S106 the predicted image selection unit 116 determines the optimal prediction mode based on the cost function values output from the intra prediction unit 114 and the motion prediction / compensation unit 115. That is, the predicted image selection unit 116 selects one of the predicted image generated by the intra prediction unit 114 and the predicted image generated by the motion prediction / compensation unit 115.
- step S107 the calculation unit 103 calculates a difference between the image rearranged by the process of step S102 and the predicted image selected by the process of step S106.
- the data amount of the difference data is reduced compared to the original image data. Therefore, the data amount can be compressed as compared with the case where the image is encoded as it is.
- step S108 the orthogonal transform unit 104 orthogonally transforms the difference information generated by the process in step S107. Specifically, orthogonal transformation such as discrete cosine transformation and Karhunen-Loeve transformation is performed, and transformation coefficients are output.
- orthogonal transformation such as discrete cosine transformation and Karhunen-Loeve transformation is performed, and transformation coefficients are output.
- step S109 the quantization unit 105 quantizes the orthogonal transform coefficient obtained by the process in step S108.
- step S108 The difference information quantized by the processing in step S108 is locally decoded as follows. That is, in step S110, the inverse quantization unit 108 inversely quantizes the orthogonal transform coefficient quantized by the process in step S109 by a method corresponding to the quantization in step S109. In step S111, the inverse orthogonal transform unit 109 performs inverse orthogonal transform on the orthogonal transform coefficient obtained by the process of step S110 by a method corresponding to the process of step S108.
- step S112 the calculation unit 110 adds the predicted image to the locally decoded difference information, and generates a locally decoded image (an image corresponding to an input to the calculation unit 103).
- step S113 the loop filter 111 performs a loop filter process on the image generated by the process of step S112. As a result, block distortion or the like is removed or image quality is improved.
- step S114 the frame memory 112 stores an image from which block distortion has been removed by the process of step S113.
- the frame memory 112 also stores an image that has not been filtered.
- the image stored in the frame memory 112 is used for the process in step S103 and the process in step S105.
- step S115 the lossless encoding unit 106 encodes the transform coefficient quantized by the process of step S109, and generates encoded data. That is, lossless encoding such as variable length encoding or arithmetic encoding is performed on the difference image (secondary difference image in the case of inter).
- the lossless encoding unit 106 encodes information related to the prediction mode of the prediction image selected by the processing in step S106, and adds the encoded information to the encoded data obtained by encoding the difference image. For example, when the intra prediction mode is selected, the lossless encoding unit 106 encodes the intra prediction mode information. For example, when the inter prediction mode is selected, the lossless encoding unit 106 encodes the inter prediction mode information. These pieces of information are added (multiplexed) to the encoded data as header information, for example.
- step S116 the accumulation buffer 107 accumulates the encoded data generated by the process in step S115.
- the encoded data stored in the storage buffer 107 is read out as appropriate, and transmitted to a decoding-side device via an arbitrary transmission path (including not only a communication path but also a storage medium).
- step S117 the rate control unit 117 controls the quantization operation rate of the quantization unit 105 based on the compressed image accumulated in the accumulation buffer 107 by the processing in step S116 so that overflow or underflow does not occur. .
- step S117 ends, the encoding process ends.
- the restriction control unit 162 of the motion vector control unit 121 determines a motion vector to be prohibited from use based on the restriction information read from the restriction information storage unit 161 in step S131.
- step S132 the restriction control unit 162 generates motion vector control information indicating the content of the use restriction of the motion vector determined in step S131, and supplies the motion vector control information to the motion search unit 151.
- step S133 the transmission information providing unit 163 generates transmission information for transmission indicating the content of the use restriction of the motion vector determined in step S131, and supplies the transmission information to the lossless encoding unit 106 for transmission.
- step S133 the transmission information providing unit 163 ends the motion vector control process and returns the process to FIG.
- step S151 the motion search unit 151 determines each of the permitted candidate inter prediction modes according to the restriction content indicated in the motion vector control information generated by the process of step S132.
- a motion search is performed to generate motion information and a difference pixel value.
- step S152 the cost function calculation unit 152 calculates a cost function value for each of the permitted candidate inter prediction modes based on the motion information and the difference pixel value obtained by the process in step S151.
- step S153 the mode determination unit 153 determines the mode having the smallest cost function value calculated in step S152 as the optimal inter prediction mode.
- step S154 the motion compensation unit 154 performs motion compensation in the optimal inter prediction mode determined by the process in step S153, and generates a predicted image.
- This optimal inter prediction mode is a mode that is permitted to be used by the motion vector control information generated by the process of step S132.
- step S155 the motion compensation unit 154 supplies the predicted image generated by the process of step S154 to the predicted image selection unit 116, and generates difference image information and decoded image information. Also, in step S156, the motion compensation unit 154 supplies information related to the optimal inter prediction mode (optimum prediction mode information and motion information) to the lossless encoding unit 106 for encoding.
- the optimal inter prediction mode optimum prediction mode information and motion information
- step S157 the motion information buffer 155 stores the motion information in the optimal inter prediction mode.
- step S157 the motion information buffer 155 ends the optimal inter prediction mode and returns the process to FIG.
- the image encoding device 100 can reduce suppression of encoding efficiency.
- the image encoding apparatus 100 may provide a restriction according to a profile and a level when generating a stream. This is because the encoder considers the available resources of the decoder. Here, the resource indicates a processing amount, a memory bandwidth, and the like.
- the profile determines the available encoding tools. Levels are set to further limit the tools available in the profile.
- the encoder guarantees decoding by the decoder by generating a bit stream so as not to exceed the upper limit. be able to.
- the transmission information providing unit 163 includes information indicating which profile or level the image encoding device 100 has generated the bitstream in the transmission information. Normally, information on profiles and levels is added (multiplexed) to header information of a bit stream called a sequence parameter set (SPS (Sequence Parameter Set)).
- SPS Sequence Parameter Set
- the decoder can read the information about the profile and level included in the bit stream, and can grasp the resources necessary for decoding the stream.
- the image encoding device 100 adds a restriction to the stream to be generated. At this time, the image encoding apparatus 100 places a restriction on the resources available for the assumed image decoding apparatus.
- a profile in which the P slice can be used is referred to as BP (Baseline profile), and a profile in which the P slice and B slice can be used is referred to as HP (High profile).
- the image encoding device 100 considers whether or not the image decoding device can process the B slice. If not, the image encoding device 100 generates a stream based on the BP restriction, and the stream is limited to the BP restriction. Include information indicating that it was generated in. On the other hand, when the image decoding apparatus can process the B slice, or when the capability of the image decoding apparatus is not considered, the image encoding apparatus 100 generates a stream based on the limitation of HP, and Include information indicating that it is generated by HP.
- the image encoding device 100 determines the level in consideration of whether the maximum memory bandwidth of the image decoding device is 18 or more or 14 or more in the tables of FIGS.
- the image encoding apparatus 100 includes information indicating the level at which the stream is generated in the stream.
- the image encoding device 100 When encoding an image, the image encoding device 100 generates a stream within a range permitted by the profile and level determined by itself. Regarding the memory bandwidth, the number of input pixels read to generate one pixel in the tables of FIGS. 4 and 5 does not exceed the level specification based on the prediction mode of the block, the vertical and horizontal block sizes, and the MV. Encode with something.
- restriction information is arbitrary and is not limited to the example described above.
- a plurality of restriction candidates may be included, and a restriction to be adopted may be selected from among them according to various conditions.
- FIG. 10 is a block diagram illustrating a main configuration example of the motion vector control unit 121 in such a case.
- the motion vector control unit 121 includes a restriction control unit 172 instead of the restriction control unit 162 in FIG. 6.
- the motion vector control unit 121 in FIG. 10 further includes a condition acquisition unit 171.
- the condition acquisition unit 171 acquires various conditions related to motion vector generation, and notifies the restriction control unit 172 of the acquired conditions as condition information.
- This condition is arbitrary and may be any information. For example, it may be a motion vector permitted by a user or the like, a restriction specification, information on a resource such as an available memory band, or information on a restriction defined by a coding standard. Also good.
- the restriction control unit 172 determines the restriction content of the motion vector based on such condition information and the restriction information read from the restriction information storage unit 161.
- a plurality of patterns of restriction contents are prepared in the restriction information, and the restriction control unit 172 can select either one according to the condition information.
- the restriction information is as described above, “4 ⁇ 4 block size is prohibited in L0 prediction or L1 prediction.
- bi-prediction “ efgijkpqr of 4 ⁇ 4, 8 ⁇ 4, 4 ⁇ 8 and 8 ⁇ 8
- a plurality of conditions relating to the accuracy of the motion vector may be included, such as “prohibit”.
- the restriction control unit 172 can determine the restriction content according to the condition information by selecting any restriction content according to the condition information.
- the restriction information may include a predetermined arithmetic expression that can obtain a plurality of arithmetic results depending on conditions.
- the restriction control unit 172 can determine the restriction content according to the condition information by determining the value of the coefficient of the arithmetic expression according to the condition information and performing the calculation using the value.
- the restriction information may include table information shown in FIG. 4 and FIG.
- the restriction control unit 172 can determine the restriction content according to the condition information by determining the “number of input pixels” as the upper limit according to the condition information.
- the restriction information may include such table information for each memory alignment pattern.
- the restriction control unit 172 specifies (determines) the memory alignment according to the condition information, selects the table to be applied, and further determines the “number of input pixels” as the upper limit according to the condition information. Therefore, it is possible to determine the restriction content according to the condition information.
- a table in which whether to permit or prohibit each row in the tables of FIGS. 4 and 5 may be included in the restriction information. . That is, in this example, it is determined in advance which is permitted and which is prohibited for the combination of the prediction mode, the block size, and the accuracy of the motion vector. Therefore, the restriction control unit 172 can determine the restriction content according to the condition information only by specifying (determining) the memory alignment according to the condition information.
- restrictions other than the restriction due to the accuracy of the motion vector are arbitrary.
- restrictions other than the prediction mode and the block size may be included, or the restriction by the prediction mode and the block size may be omitted.
- the transmission information only needs to include the content of restriction by the motion vector control information.
- the restriction information and the condition may be any information.
- condition acquisition unit 171 of the motion vector control unit 121 acquires a condition related to the motion vector restriction in step S171.
- step S172 the restriction control unit 172 determines a motion vector that is prohibited from use based on the condition acquired in step S171 and the restriction information read from the restriction information storage unit 161.
- step S173 the restriction control unit 172 generates motion vector control information indicating the content of the use restriction of the motion vector determined in step S172, and supplies the motion vector control information to the motion search unit 151.
- step S174 the transmission information providing unit 163 generates transmission information for transmission indicating the content of the use restriction of the motion vector determined in step S172, and supplies the transmission information to the lossless encoding unit 106 for transmission.
- step S174 the transmission information providing unit 163 ends the motion vector control process and returns the process to FIG.
- the image encoding device 100 can reduce suppression of encoding efficiency.
- Second Embodiment> [Image decoding device] Since the encoded data (bitstream) generated by the image encoding apparatus 100 as described above is limited only in the motion vector to be used, the encoding standard is the same as that of the image encoding apparatus 100. A conventional image decoding apparatus that conforms can perform decoding by a conventional method.
- the image decoding apparatus analyzes the transmission information added (multiplexed) to the encoded data by the image encoding apparatus 100 as, for example, a profile or a level, so that the image decoding apparatus can safely and correctly decode the encoded data. You may make it determine whether it can do. Further, when it is determined that decoding cannot be performed, the image decoding apparatus may stop the decoding process. By doing so, encoded data can be decoded more reliably and safely without causing unnecessary delay due to overflow, underflow, or the like.
- FIG. 14 is a block diagram illustrating a main configuration example of an image decoding device that is an image processing device to which the present technology is applied.
- An image decoding apparatus 200 shown in FIG. 14 corresponds to the above-described image encoding apparatus 100, correctly decodes a bit stream (encoded data) generated by encoding image data by the image encoding apparatus 100, and decodes a decoded image. Is generated.
- the image decoding device 200 includes a storage buffer 201, a lossless decoding unit 202, an inverse quantization unit 203, an inverse orthogonal transform unit 204, a calculation unit 205, a loop filter 206, a screen rearrangement buffer 207, and a D A / A converter 208 is included.
- the image decoding apparatus 200 includes a frame memory 209, a selection unit 210, an intra prediction unit 211, a motion prediction / compensation unit 212, and a selection unit 213.
- the image decoding device 200 includes a decoding control unit 221.
- the accumulation buffer 201 accumulates the transmitted encoded data, and supplies the encoded data to the lossless decoding unit 202 at a predetermined timing.
- the lossless decoding unit 202 decodes the information supplied from the accumulation buffer 201 and encoded by the lossless encoding unit 106 in FIG. 1 using a method corresponding to the encoding method of the lossless encoding unit 106.
- the lossless decoding unit 202 supplies the quantized coefficient data of the difference image obtained by decoding to the inverse quantization unit 203.
- the lossless decoding unit 202 refers to information on the optimal prediction mode obtained by decoding the encoded data, and determines whether the intra prediction mode or the inter prediction mode is selected as the optimal prediction mode. . That is, the lossless decoding unit 202 determines whether the prediction mode employed in the transmitted encoded data is intra prediction or inter prediction.
- the lossless decoding unit 202 supplies information on the prediction mode to the intra prediction unit 211 or the motion prediction / compensation unit 212 based on the determination result.
- the lossless decoding unit 202 is intra prediction information, which is information about the selected intra prediction mode supplied from the encoding side. Is supplied to the intra prediction unit 211.
- the lossless decoding unit 202 is an inter that is information about the selected inter prediction mode supplied from the encoding side. The prediction information is supplied to the motion prediction / compensation unit 212.
- the lossless decoding unit 202 supplies transmission information added (multiplexed) to the encoded data, that is, information on the motion vector restriction content to the decoding control unit 221.
- the lossless decoding unit 202 supplies information such as a profile and level described in the sequence parameter set to the decoding control unit 221.
- the lossless decoding unit 202 stops its own processing and also stops the processing of other processing units, forcing the decoding process. Terminate.
- the inverse quantization unit 203 uses the method corresponding to the quantization method of the quantization unit 105 in FIG. 1 (similar to the inverse quantization unit 108) for the quantized coefficient data obtained by decoding by the lossless decoding unit 202. Method).
- the inverse quantization unit 203 supplies the inversely quantized coefficient data to the inverse orthogonal transform unit 204.
- the inverse orthogonal transform unit 204 performs inverse orthogonal transform on the coefficient data supplied from the inverse quantization unit 203 in a method corresponding to the orthogonal transform method of the orthogonal transform unit 104 in FIG.
- the inverse orthogonal transform unit 204 obtains a difference image corresponding to the difference image before being orthogonally transformed in the image encoding device 100 by the inverse orthogonal transform process.
- the difference image obtained by the inverse orthogonal transform is supplied to the calculation unit 205.
- a prediction image is supplied to the calculation unit 205 from the intra prediction unit 211 or the motion prediction / compensation unit 212 via the selection unit 213.
- the calculation unit 205 adds the difference image and the prediction image, and obtains a reconstructed image corresponding to the image before the prediction image is subtracted by the calculation unit 103 of the image encoding device 100.
- the arithmetic unit 205 supplies the reconstructed image to the loop filter 206.
- the loop filter 206 appropriately performs a loop filter process including a deblock filter process and an adaptive loop filter process on the supplied reconstructed image to generate a decoded image.
- the loop filter 206 removes block distortion by performing a deblocking filter process on the reconstructed image.
- the loop filter 206 performs image quality improvement by performing loop filter processing using a Wiener filter on the deblock filter processing result (reconstructed image from which block distortion has been removed). I do.
- the type of filter processing performed by the loop filter 206 is arbitrary, and filter processing other than that described above may be performed. Further, the loop filter 206 may perform filter processing using the filter coefficient supplied from the image encoding device 100 of FIG.
- the loop filter 206 supplies the decoded image as the filter processing result to the screen rearrangement buffer 207 and the frame memory 209. Note that the filter processing by the loop filter 206 can be omitted. That is, the output of the calculation unit 205 can be stored in the frame memory 209 without being subjected to filter processing.
- the intra prediction unit 211 uses pixel values of pixels included in this image as pixel values of peripheral pixels.
- the screen rearrangement buffer 207 rearranges the supplied decoded images. That is, the order of frames rearranged for the encoding order by the screen rearrangement buffer 102 in FIG. 1 is rearranged in the original display order.
- the D / A conversion unit 208 D / A converts the decoded image supplied from the screen rearrangement buffer 207, and outputs and displays the decoded image on a display (not shown).
- the frame memory 209 stores supplied reconstructed images and decoded images. Also, the frame memory 209 selects the stored reconstructed image or decoded image from the selection unit 210 at a predetermined timing or based on an external request such as the intra prediction unit 211 or the motion prediction / compensation unit 212. To the intra prediction unit 211 and the motion prediction / compensation unit 212.
- the intra prediction unit 211 performs intra prediction based on the intra prediction information supplied from the lossless decoding unit 202, and generates a predicted image. Note that the intra prediction unit 211 is based on the intra prediction information supplied from the lossless decoding unit 202 only for the region in which the prediction image is generated by the intra prediction at the time of encoding, as shown in FIG. Intra prediction is performed in a mode similar to the mode of the process performed by.
- the motion prediction / compensation unit 212 performs inter prediction based on the inter prediction information supplied from the lossless decoding unit 202, and generates a prediction image. Note that the motion prediction / compensation unit 212 is based on the inter prediction information supplied from the lossless decoding unit 202, and the motion prediction / compensation unit in FIG. Inter prediction is performed in a mode similar to the mode of processing performed by 115.
- the intra prediction unit 211 or the motion prediction / compensation unit 212 supplies the generated predicted image to the calculation unit 205 via the selection unit 213 for each region of the prediction processing unit.
- the selection unit 213 supplies the prediction image supplied from the intra prediction unit 211 or the prediction image supplied from the motion prediction / compensation unit 212 to the calculation unit 205.
- the decoding control unit 221 controls whether or not to decode the encoded data based on the transmission information supplied from the lossless decoding unit 202.
- the image decoding apparatus 200 can more reliably decode the encoded data in which the reduction of the encoding efficiency is suppressed, and more reliably realize the suppression of the reduction of the encoding efficiency. Can do.
- FIG. 15 is a block diagram illustrating a main configuration example of the decoding control unit 221 of FIG.
- the decoding control unit 221 includes a transmission information acquisition unit 251, a restriction information storage unit 252, an availability determination unit 253, a control unit 254, and an error processing unit 255.
- the transmission information acquisition unit 251 acquires transmission information supplied from the lossless decoding unit 202.
- This transmission information may be any information, may be information directly indicating the restriction content of the motion vector, or may be information specifying a profile or a level.
- the transmission information acquisition unit 251 supplies the acquired transmission information to the availability determination unit 253.
- the restriction information storage unit 252 stores restriction information for restricting usable motion vectors. This restriction information is for suppressing the occupancy of the memory band due to the reading of the reference image from the frame memory 209. The restriction information only needs to include at least information that can be substantially restricted by the accuracy of the motion vector in the availability determination unit 253.
- the availability determination unit 253 may allow the image decoding apparatus 200 to safely and correctly decode the encoded data. Determine if you can.
- the transmission information includes information on restrictions based on the accuracy of motion vectors in the encoded data. Therefore, the availability determining unit 253 can grasp the memory bandwidth necessary for decoding the encoded data from the transmission information. Further, the availability determination unit 253 can grasp the upper limit of the available memory bandwidth from the restriction based on the accuracy of the motion vector included in the restriction information. That is, the availability determination unit 253 determines whether the encoded data can be decoded by comparing them.
- the availability determination unit 253 determines that this encoded data is decodable when the memory bandwidth necessary for decoding the encoded data is equal to or less than the upper limit of the available memory bandwidth. Conversely, if the memory bandwidth necessary for decoding the encoded data is greater than the upper limit of the available memory bandwidth, the availability determining unit 253 determines that this encoded data is not decodable.
- the availability determination unit 253 does not directly compare the memory bandwidth, but compares information substantially equivalent to the memory bandwidth, for example, information such as prediction mode, block size, and motion vector accuracy. Also good.
- the availability determination unit 253 supplies such a determination result to the control unit 254.
- the control unit 254 controls the decoding process based on the supplied determination result. For example, when the encoded data can be decoded, the control unit 254 supplies control information that instructs the lossless decoding unit 202 to perform decoding. The lossless decoding unit 202 performs decoding according to the control information.
- the other processing units also execute the processing of each unit related to decoding.
- the control unit 254 supplies control information that instructs the error processing unit 255 to execute error processing.
- the error processing unit 255 performs error processing such as notifying the user that the decoding processing is to be stopped according to the control information.
- the control unit 254 supplies control information that instructs the lossless decoding unit 202 to stop decoding.
- the lossless decoding unit 202 stops the decoding process according to this control information.
- the other processing units also stop the processing of each unit related to decoding.
- the decoding control unit 221 can control the decoding process using the restriction based on the accuracy of the motion vector. Therefore, the decoding control unit 221 can forcibly terminate the decoding process of encoded data that cannot be decoded. Therefore, the decoding control unit 221 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the image decoding apparatus 200 can more reliably suppress the reduction in encoding efficiency.
- step S201 the accumulation buffer 201 accumulates the transmitted encoded data.
- step S202 the lossless decoding unit 202 decodes the encoded data supplied from the accumulation buffer 201. That is, the I picture, P picture, and B picture encoded by the lossless encoding unit 106 in FIG. 1 are decoded.
- information such as reference frame information, prediction mode information (intra prediction mode or inter prediction mode), and transmission information is also decoded.
- step S203 the decoding control unit 221 performs a decoding control process based on the transmission information decoded by the process in step S202.
- the decoding process is terminated in step S203.
- the processes after step S204 are executed.
- step S204 the inverse quantization unit 203 inversely quantizes the quantized orthogonal transform coefficient obtained by the process in step S202.
- step S205 the inverse orthogonal transform unit 204 performs inverse orthogonal transform on the orthogonal transform coefficient obtained by inverse quantization in step S203 by a method corresponding to the orthogonal transform unit 104 in FIG. As a result, the difference information corresponding to the input of the orthogonal transform unit 104 (output of the calculation unit 103) in FIG. 1 is decoded.
- step S ⁇ b> 206 the intra prediction unit 211 or the motion prediction / compensation unit 212 performs image prediction processing corresponding to the prediction mode information supplied from the lossless decoding unit 202. That is, when intra prediction mode information is supplied from the lossless decoding unit 202, the intra prediction unit 211 performs intra prediction processing in the intra prediction mode.
- the motion prediction / compensation unit 212 uses various pieces of information regarding the unit size obtained by the process of step S203 to perform inter prediction processing (motion prediction and motion prediction). Compensation).
- step S207 the calculation unit 205 adds the predicted image obtained by the process of step S206 to the difference information obtained by the process of step S205. Thereby, the original image data is decoded (a reconstructed image is obtained).
- step S208 the loop filter 206 performs loop filter processing.
- a loop filter process is appropriately performed on the reconstructed image obtained by the process of step S207.
- This loop filter process is basically performed in the same manner as the loop filter process described with reference to the flowchart of FIG.
- step S209 the screen rearrangement buffer 207 rearranges the frames of the decoded image that has been subjected to the loop filter processing in step S208. That is, the order of frames of the decoded image data rearranged for encoding by the screen rearrangement buffer 102 (FIG. 1) of the image encoding device 100 is rearranged to the original display order.
- step S210 the D / A conversion unit 208 performs D / A conversion on the decoded image data in which the frames are rearranged by the processing in step S209.
- the decoded image data is output to a display (not shown), and the image is displayed.
- step S211 the frame memory 209 stores the decoded image data subjected to the loop filter process by the process of step S208.
- the transmission information acquisition unit 251 acquires the transmission information supplied from the lossless decoding unit 202 in step S231.
- step S232 the availability determination unit 253 determines whether the encoded data supplied to the lossless decoding unit 202 is decodable based on the control information and the transmission information.
- step S233 the control unit 254 selects a process based on the determination result. That is, when it is determined that decoding is possible, the control unit 254 ends the decoding control process, returns the process to FIG. 16, and causes the processes after step S204 to be executed.
- step S234 the control unit 254 advances the process to step S234 in step S233.
- step S234 the error processing unit 255 performs predetermined error processing.
- step S235 the control unit 254 controls the lossless decoding unit 202 and the like to forcibly terminate the decoding process. When the process of step S235 ends, the control unit 254 ends the decoding control process.
- the image decoding apparatus 200 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the image decoding apparatus 200 can more reliably suppress the reduction in encoding efficiency.
- restriction information is arbitrary and is not limited to the example described above.
- a plurality of restriction candidates may be included, and a restriction to be adopted may be selected from among them according to various conditions.
- FIG. 18 is a block diagram illustrating a main configuration example of the decoding control unit 221 in such a case.
- the decoding control unit 221 includes an availability determination unit 273 instead of the availability determination unit 253 in FIG. 15. Further, the decoding control unit 221 in FIG. 18 further includes a condition acquisition unit 271.
- the condition acquisition unit 271 acquires various conditions related to motion vector generation, and notifies the availability determination unit 273 of the acquired conditions as condition information.
- This condition is arbitrary and may be any information. For example, it may be a motion vector permitted by a user or the like, a restriction specification, information on a resource such as an available memory band, or information on a restriction defined by a coding standard. Also good.
- the availability determination unit 273 determines the restriction content of the motion vector based on such condition information and the restriction information read from the restriction information storage unit 252.
- a plurality of patterns of restriction contents are prepared in the restriction information, and the availability determination unit 273 can select either one according to the condition information.
- the restriction information is as described above, “4 ⁇ 4 block size is prohibited in L0 prediction or L1 prediction.
- bi-prediction “ efgijkpqr of 4 ⁇ 4, 8 ⁇ 4, 4 ⁇ 8 and 8 ⁇ 8
- a plurality of conditions relating to the accuracy of the motion vector may be included, such as “prohibit”.
- the availability determination unit 273 can determine the restriction content according to the condition information by selecting any restriction content according to the condition information.
- the restriction information may include a predetermined arithmetic expression that can obtain a plurality of arithmetic results depending on conditions.
- the availability determination unit 273 can determine the restriction content according to the condition information by determining the value of the coefficient of the arithmetic expression according to the condition information and performing the calculation using the value.
- the restriction information may include table information shown in FIG. 4 and FIG.
- the availability determination unit 273 can determine the restriction content according to the condition information by determining the “number of input pixels” as the upper limit according to the condition information.
- the restriction information may include such table information for each memory alignment pattern.
- the availability determination unit 273 specifies (determines) the memory alignment according to the condition information, selects a table to be applied, and further determines the upper limit “number of input pixels” according to the condition information. Therefore, it is possible to determine the restriction content according to the condition information.
- a table in which whether to permit or prohibit each row in the tables of FIGS. 4 and 5 may be included in the restriction information. . That is, in this example, it is determined in advance which is permitted and which is prohibited for the combination of the prediction mode, the block size, and the accuracy of the motion vector. Therefore, the availability determination unit 273 can determine the restriction content according to the condition information only by specifying (determining) the memory alignment according to the condition information.
- restrictions other than the restriction due to the accuracy of the motion vector are arbitrary.
- restrictions other than the prediction mode and the block size may be included, or the restriction by the prediction mode and the block size may be omitted.
- the restriction information and the condition may be any information.
- the availability determination unit 273 compares the thus obtained restriction with transmission information as in the case of FIG. 15 (for example, various kinds of information such as prediction mode, block size, and motion vector accuracy may be compared). Then, they may be converted into a memory band and compared) to determine whether or not the decoding process is possible.
- the availability determination unit 273 supplies the determination result to the control unit 254. Based on the determination result, the control unit 254 controls the decoding process as in the case of FIG.
- the decoding control unit 221 can control the decoding process by using the restriction based on the accuracy of the motion vector. Therefore, the decoding control unit 221 can forcibly terminate the decoding process of encoded data that cannot be decoded. Therefore, the decoding control unit 221 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the image decoding apparatus 200 can more reliably suppress the reduction in encoding efficiency.
- the condition acquisition unit 271 acquires a condition in step S251.
- the transmission information acquisition unit 251 acquires the transmission information supplied from the lossless decoding unit 202.
- step S253 the availability determination unit 273 supplies the lossless decoding unit 202 based on the conditions acquired in step S251, the control information read from the restriction information storage unit 252, and the transmission information acquired in step S252. It is determined whether the encoded data to be decoded is decodable.
- step S254 the control unit 254 selects a process based on the determination result. That is, when it is determined that decoding is possible, the control unit 254 ends the decoding control process, returns the process to FIG. 16, and causes the processes after step S204 to be executed.
- step S255 the control unit 254 advances the process to step S255.
- step S255 the error processing unit 255 performs predetermined error processing.
- step S256 the control unit 254 controls the lossless decoding unit 202 and the like to forcibly terminate the decoding process.
- the control unit 254 ends the decoding control process.
- the image decoding apparatus 200 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the image decoding apparatus 200 can more reliably suppress the reduction in encoding efficiency.
- a CPU (Central Processing Unit) 701 of a computer 700 has various programs according to a program stored in a ROM (Read Only Memory) 702 or a program loaded from a storage unit 713 into a RAM (Random Access Memory) 703. Execute the process.
- the RAM 703 also appropriately stores data necessary for the CPU 701 to execute various processes.
- the CPU 701, ROM 702, and RAM 703 are connected to each other via a bus 704.
- An input / output interface 710 is also connected to the bus 704.
- the input / output interface 710 includes an input unit 711 including a keyboard, a mouse, a touch panel, and an input terminal, a display including a CRT (Cathode Ray Tube), an LCD (Liquid Crystal Display), and an OELD (Organic ElectroLuminescence Display), and the like.
- An output unit 712 including an arbitrary output device such as a speaker or an output terminal, a storage unit 713 configured by an arbitrary storage medium such as a hard disk or a flash memory, a control unit for controlling input / output of the storage medium, a modem,
- the communication unit 714 performs communication processing with other communication devices via a network including the Internet, for example.
- a drive 715 is also connected to the input / output interface 710 as necessary.
- a removable medium 721 such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory is appropriately attached to the drive 715.
- the drive 715 reads out a computer program, data, and the like from the removable medium 721 attached to the drive 715 according to the control of the CPU 701, for example.
- the read data and computer program are supplied to the RAM 703, for example.
- the computer program read from the removable medium 721 is installed in the storage unit 713 as necessary.
- a program constituting the software is installed from a network or a recording medium.
- the recording medium is distributed to distribute the program to the user separately from the apparatus main body, and includes a magnetic disk (including a flexible disk) on which the program is recorded, an optical disk ( It is only composed of removable media 721 consisting of CD-ROM (compact disc-read only memory), DVD (including digital versatile disc), magneto-optical disc (including MD (mini disc)), or semiconductor memory. Rather, it is composed of a ROM 702 in which a program is recorded and a hard disk included in the storage unit 713, which is distributed to the user in a state of being incorporated in the apparatus main body in advance.
- a magnetic disk including a flexible disk
- an optical disk It is only composed of removable media 721 consisting of CD-ROM (compact disc-read only memory), DVD (including digital versatile disc), magneto-optical disc (including MD (mini disc)), or semiconductor memory. Rather, it is composed of a ROM 702 in which a program is recorded and a hard disk included in the storage unit 713, which is distributed to the user in a
- the program executed by the computer may be a program that is processed in time series in the order described in this specification, or in parallel or at a necessary timing such as when a call is made. It may be a program for processing.
- the step of describing the program recorded on the recording medium is not limited to the processing performed in chronological order according to the described order, but may be performed in parallel or It also includes processes that are executed individually.
- system represents the entire apparatus composed of a plurality of devices (apparatuses).
- the configuration described as one device (or processing unit) may be divided and configured as a plurality of devices (or processing units).
- the configurations described above as a plurality of devices (or processing units) may be combined into a single device (or processing unit).
- a configuration other than that described above may be added to the configuration of each device (or each processing unit).
- a part of the configuration of a certain device (or processing unit) may be included in the configuration of another device (or other processing unit).
- the image encoding device 100 (FIG. 1) and the image decoding device 200 (FIG. 14) according to the above-described embodiment are distributed to satellite broadcasting, cable broadcasting such as cable TV, distribution on the Internet, and terminals by cellular communication.
- the present invention can be applied to various electronic devices such as a transmitter or receiver, a recording device that records an image on a medium such as an optical disk, a magnetic disk, and a flash memory, or a reproducing device that reproduces an image from these storage media.
- a transmitter or receiver a recording device that records an image on a medium such as an optical disk, a magnetic disk, and a flash memory, or a reproducing device that reproduces an image from these storage media.
- FIG. 21 illustrates an example of a schematic configuration of a television device to which the above-described embodiment is applied.
- the television apparatus 900 includes an antenna 901, a tuner 902, a demultiplexer 903, a decoder 904, a video signal processing unit 905, a display unit 906, an audio signal processing unit 907, a speaker 908, an external interface 909, a control unit 910, a user interface 911, And a bus 912.
- Tuner 902 extracts a signal of a desired channel from a broadcast signal received via antenna 901, and demodulates the extracted signal. Then, the tuner 902 outputs the encoded bit stream obtained by the demodulation to the demultiplexer 903. That is, the tuner 902 has a role as a transmission unit in the television device 900 that receives an encoded stream in which an image is encoded.
- the demultiplexer 903 separates the video stream and audio stream of the viewing target program from the encoded bit stream, and outputs each separated stream to the decoder 904. Further, the demultiplexer 903 extracts auxiliary data such as EPG (Electronic Program Guide) from the encoded bit stream, and supplies the extracted data to the control unit 910. Note that the demultiplexer 903 may perform descrambling when the encoded bit stream is scrambled.
- EPG Electronic Program Guide
- the decoder 904 decodes the video stream and audio stream input from the demultiplexer 903. Then, the decoder 904 outputs the video data generated by the decoding process to the video signal processing unit 905. In addition, the decoder 904 outputs audio data generated by the decoding process to the audio signal processing unit 907.
- the video signal processing unit 905 reproduces the video data input from the decoder 904 and causes the display unit 906 to display the video.
- the video signal processing unit 905 may cause the display unit 906 to display an application screen supplied via a network.
- the video signal processing unit 905 may perform additional processing such as noise removal on the video data according to the setting.
- the video signal processing unit 905 may generate a GUI (Graphical User Interface) image such as a menu, a button, or a cursor, and superimpose the generated image on the output image.
- GUI Graphic User Interface
- the display unit 906 is driven by a drive signal supplied from the video signal processing unit 905, and displays an image on a video screen of a display device (for example, a liquid crystal display, a plasma display, or an OELD (Organic ElectroLuminescence Display) (organic EL display)). Or an image is displayed.
- a display device for example, a liquid crystal display, a plasma display, or an OELD (Organic ElectroLuminescence Display) (organic EL display)). Or an image is displayed.
- the audio signal processing unit 907 performs reproduction processing such as D / A conversion and amplification on the audio data input from the decoder 904, and outputs audio from the speaker 908.
- the audio signal processing unit 907 may perform additional processing such as noise removal on the audio data.
- the external interface 909 is an interface for connecting the television apparatus 900 to an external device or a network.
- a video stream or an audio stream received via the external interface 909 may be decoded by the decoder 904. That is, the external interface 909 also has a role as a transmission unit in the television apparatus 900 that receives an encoded stream in which an image is encoded.
- the control unit 910 includes a processor such as a CPU and memories such as a RAM and a ROM.
- the memory stores a program executed by the CPU, program data, EPG data, data acquired via a network, and the like.
- the program stored in the memory is read and executed by the CPU when the television apparatus 900 is activated.
- the CPU executes the program to control the operation of the television device 900 according to an operation signal input from the user interface 911, for example.
- the user interface 911 is connected to the control unit 910.
- the user interface 911 includes, for example, buttons and switches for the user to operate the television device 900, a remote control signal receiving unit, and the like.
- the user interface 911 detects an operation by the user via these components, generates an operation signal, and outputs the generated operation signal to the control unit 910.
- the bus 912 connects the tuner 902, the demultiplexer 903, the decoder 904, the video signal processing unit 905, the audio signal processing unit 907, the external interface 909, and the control unit 910 to each other.
- the decoder 904 has the function of the image decoding apparatus 200 (FIG. 14) according to the above-described embodiment. Therefore, the decoder 904 can decode only the encoded data generated by the encoding in which the accuracy of the motion vector is considered in the limitation of motion prediction. Therefore, the decoder 904 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the television apparatus 900 can more reliably suppress the reduction in encoding efficiency.
- FIG. 22 shows an example of a schematic configuration of a mobile phone to which the above-described embodiment is applied.
- a mobile phone 920 includes an antenna 921, a communication unit 922, an audio codec 923, a speaker 924, a microphone 925, a camera unit 926, an image processing unit 927, a demultiplexing unit 928, a recording / reproducing unit 929, a display unit 930, a control unit 931, an operation A portion 932 and a bus 933.
- the antenna 921 is connected to the communication unit 922.
- the speaker 924 and the microphone 925 are connected to the audio codec 923.
- the operation unit 932 is connected to the control unit 931.
- the bus 933 connects the communication unit 922, the audio codec 923, the camera unit 926, the image processing unit 927, the demultiplexing unit 928, the recording / reproducing unit 929, the display unit 930, and the control unit 931 to each other.
- the mobile phone 920 has various operation modes including a voice call mode, a data communication mode, a shooting mode, and a videophone mode, and is used for sending and receiving voice signals, sending and receiving e-mail or image data, taking images, and recording data. Perform the action.
- the analog voice signal generated by the microphone 925 is supplied to the voice codec 923.
- the audio codec 923 converts an analog audio signal into audio data, A / D converts the compressed audio data, and compresses it. Then, the audio codec 923 outputs the compressed audio data to the communication unit 922.
- the communication unit 922 encodes and modulates the audio data and generates a transmission signal. Then, the communication unit 922 transmits the generated transmission signal to a base station (not shown) via the antenna 921. In addition, the communication unit 922 amplifies a radio signal received via the antenna 921 and performs frequency conversion to acquire a received signal.
- the communication unit 922 demodulates and decodes the received signal to generate audio data, and outputs the generated audio data to the audio codec 923.
- the audio codec 923 decompresses the audio data and performs D / A conversion to generate an analog audio signal. Then, the audio codec 923 supplies the generated audio signal to the speaker 924 to output audio.
- the control unit 931 generates character data constituting the e-mail in response to an operation by the user via the operation unit 932.
- the control unit 931 causes the display unit 930 to display characters.
- the control unit 931 generates e-mail data in response to a transmission instruction from the user via the operation unit 932, and outputs the generated e-mail data to the communication unit 922.
- the communication unit 922 encodes and modulates email data and generates a transmission signal. Then, the communication unit 922 transmits the generated transmission signal to a base station (not shown) via the antenna 921.
- the communication unit 922 amplifies a radio signal received via the antenna 921 and performs frequency conversion to acquire a received signal.
- the communication unit 922 demodulates and decodes the received signal to restore the email data, and outputs the restored email data to the control unit 931.
- the control unit 931 displays the content of the electronic mail on the display unit 930 and stores the electronic mail data in the storage medium of the recording / reproducing unit 929.
- the recording / reproducing unit 929 has an arbitrary readable / writable storage medium.
- the storage medium may be a built-in storage medium such as a RAM or a flash memory, or an externally mounted storage medium such as a hard disk, a magnetic disk, a magneto-optical disk, an optical disk, a USB memory, or a memory card. May be.
- the camera unit 926 images a subject to generate image data, and outputs the generated image data to the image processing unit 927.
- the image processing unit 927 encodes the image data input from the camera unit 926 and stores the encoded stream in the storage medium of the recording / playback unit 929.
- the demultiplexing unit 928 multiplexes the video stream encoded by the image processing unit 927 and the audio stream input from the audio codec 923, and the multiplexed stream is the communication unit 922. Output to.
- the communication unit 922 encodes and modulates the stream and generates a transmission signal. Then, the communication unit 922 transmits the generated transmission signal to a base station (not shown) via the antenna 921.
- the communication unit 922 amplifies a radio signal received via the antenna 921 and performs frequency conversion to acquire a received signal.
- These transmission signal and reception signal may include an encoded bit stream.
- the communication unit 922 demodulates and decodes the received signal to restore the stream, and outputs the restored stream to the demultiplexing unit 928.
- the demultiplexing unit 928 separates the video stream and the audio stream from the input stream, and outputs the video stream to the image processing unit 927 and the audio stream to the audio codec 923.
- the image processing unit 927 decodes the video stream and generates video data.
- the video data is supplied to the display unit 930, and a series of images is displayed on the display unit 930.
- the audio codec 923 decompresses the audio stream and performs D / A conversion to generate an analog audio signal. Then, the audio codec 923 supplies the generated audio signal to the speaker 924 to output audio.
- the image processing unit 927 has the function of the image encoding device 100 (FIG. 1) according to the above-described embodiment. Therefore, the image processing unit 927 can limit motion prediction in consideration of the accuracy of the motion vector. Therefore, the image processing unit 927 can encode an image while suppressing an increase in load while suppressing a decrease in encoding efficiency. Therefore, the mobile phone 920 can reduce suppression of encoding efficiency.
- the image processing unit 927 also has the function of the image decoding device 200 (FIG. 14) according to the above-described embodiment. Therefore, the image processing unit 927 can decode only the encoded data generated by the encoding in which the accuracy of the motion vector is considered in the limitation of motion prediction. Therefore, the image processing unit 927 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Accordingly, the mobile phone 920 can more reliably suppress the reduction in encoding efficiency.
- the mobile phone 920 has been described.
- an imaging function similar to that of the mobile phone 920 such as a PDA (Personal Digital Assistant), a smartphone, an UMPC (Ultra Mobile Personal Computer), a netbook, a notebook personal computer, or the like.
- the image encoding device and the image decoding device to which the present technology is applied can be applied to any device as in the case of the mobile phone 920.
- FIG. 23 shows an example of a schematic configuration of a recording / reproducing apparatus to which the above-described embodiment is applied.
- the recording / reproducing device 940 encodes audio data and video data of a received broadcast program and records the encoded data on a recording medium.
- the recording / reproducing device 940 may encode audio data and video data acquired from another device and record them on a recording medium, for example.
- the recording / reproducing device 940 reproduces data recorded on the recording medium on a monitor and a speaker, for example, in accordance with a user instruction. At this time, the recording / reproducing device 940 decodes the audio data and the video data.
- the recording / reproducing apparatus 940 includes a tuner 941, an external interface 942, an encoder 943, an HDD (Hard Disk Drive) 944, a disk drive 945, a selector 946, a decoder 947, an OSD (On-Screen Display) 948, a control unit 949, and a user interface. 950.
- Tuner 941 extracts a signal of a desired channel from a broadcast signal received via an antenna (not shown), and demodulates the extracted signal. Then, the tuner 941 outputs the encoded bit stream obtained by the demodulation to the selector 946. That is, the tuner 941 serves as a transmission unit in the recording / reproducing apparatus 940.
- the external interface 942 is an interface for connecting the recording / reproducing apparatus 940 to an external device or a network.
- the external interface 942 may be, for example, an IEEE1394 interface, a network interface, a USB interface, or a flash memory interface.
- video data and audio data received via the external interface 942 are input to the encoder 943. That is, the external interface 942 serves as a transmission unit in the recording / reproducing device 940.
- the encoder 943 encodes video data and audio data when the video data and audio data input from the external interface 942 are not encoded. Then, the encoder 943 outputs the encoded bit stream to the selector 946.
- the HDD 944 records an encoded bit stream in which content data such as video and audio is compressed, various programs, and other data on an internal hard disk. Further, the HDD 944 reads out these data from the hard disk when reproducing video and audio.
- the disk drive 945 performs recording and reading of data to and from the mounted recording medium.
- the recording medium mounted on the disk drive 945 is, for example, a DVD disk (DVD-Video, DVD-RAM, DVD-R, DVD-RW, DVD + R, DVD + RW, etc.) or a Blu-ray (registered trademark) disk. It may be.
- the selector 946 selects an encoded bit stream input from the tuner 941 or the encoder 943 when recording video and audio, and outputs the selected encoded bit stream to the HDD 944 or the disk drive 945. In addition, the selector 946 outputs the encoded bit stream input from the HDD 944 or the disk drive 945 to the decoder 947 during video and audio reproduction.
- the decoder 947 decodes the encoded bit stream and generates video data and audio data. Then, the decoder 947 outputs the generated video data to the OSD 948. The decoder 904 outputs the generated audio data to an external speaker.
- OSD 948 reproduces the video data input from the decoder 947 and displays the video. Further, the OSD 948 may superimpose a GUI image such as a menu, a button, or a cursor on the video to be displayed.
- the control unit 949 includes a processor such as a CPU and memories such as a RAM and a ROM.
- the memory stores a program executed by the CPU, program data, and the like.
- the program stored in the memory is read and executed by the CPU when the recording / reproducing apparatus 940 is activated, for example.
- the CPU controls the operation of the recording / reproducing apparatus 940 in accordance with an operation signal input from the user interface 950, for example, by executing the program.
- the user interface 950 is connected to the control unit 949.
- the user interface 950 includes, for example, buttons and switches for the user to operate the recording / reproducing device 940, a remote control signal receiving unit, and the like.
- the user interface 950 detects an operation by the user via these components, generates an operation signal, and outputs the generated operation signal to the control unit 949.
- the encoder 943 has the function of the image encoding apparatus 100 (FIG. 1) according to the above-described embodiment. Therefore, the encoder 943 can limit motion prediction in consideration of the accuracy of the motion vector. Therefore, the encoder 943 can encode an image while suppressing an increase in load while suppressing a decrease in encoding efficiency. Therefore, the recording / reproducing apparatus 940 can reduce suppression of encoding efficiency.
- the decoder 947 has the function of the image decoding apparatus 200 (FIG. 14) according to the above-described embodiment. Therefore, the decoder 947 can decode only the encoded data generated by the encoding in which the accuracy of the motion vector is considered in the limitation of motion prediction. Therefore, the decoder 947 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the recording / reproducing apparatus 940 can more reliably suppress the reduction in encoding efficiency.
- FIG. 24 illustrates an example of a schematic configuration of an imaging apparatus to which the above-described embodiment is applied.
- the imaging device 960 images a subject to generate an image, encodes the image data, and records it on a recording medium.
- the imaging device 960 includes an optical block 961, an imaging unit 962, a signal processing unit 963, an image processing unit 964, a display unit 965, an external interface 966, a memory 967, a media drive 968, an OSD 969, a control unit 970, a user interface 971, and a bus. 972.
- the optical block 961 is connected to the imaging unit 962.
- the imaging unit 962 is connected to the signal processing unit 963.
- the display unit 965 is connected to the image processing unit 964.
- the user interface 971 is connected to the control unit 970.
- the bus 972 connects the image processing unit 964, the external interface 966, the memory 967, the media drive 968, the OSD 969, and the control unit 970 to each other.
- the optical block 961 includes a focus lens and a diaphragm mechanism.
- the optical block 961 forms an optical image of the subject on the imaging surface of the imaging unit 962.
- the imaging unit 962 includes an image sensor such as a CCD or a CMOS, and converts an optical image formed on the imaging surface into an image signal as an electrical signal by photoelectric conversion. Then, the imaging unit 962 outputs the image signal to the signal processing unit 963.
- the signal processing unit 963 performs various camera signal processing such as knee correction, gamma correction, and color correction on the image signal input from the imaging unit 962.
- the signal processing unit 963 outputs the image data after the camera signal processing to the image processing unit 964.
- the image processing unit 964 encodes the image data input from the signal processing unit 963 and generates encoded data. Then, the image processing unit 964 outputs the generated encoded data to the external interface 966 or the media drive 968. The image processing unit 964 also decodes encoded data input from the external interface 966 or the media drive 968 to generate image data. Then, the image processing unit 964 outputs the generated image data to the display unit 965. In addition, the image processing unit 964 may display the image by outputting the image data input from the signal processing unit 963 to the display unit 965. Further, the image processing unit 964 may superimpose display data acquired from the OSD 969 on an image output to the display unit 965.
- the OSD 969 generates a GUI image such as a menu, a button, or a cursor, and outputs the generated image to the image processing unit 964.
- the external interface 966 is configured as a USB input / output terminal, for example.
- the external interface 966 connects the imaging device 960 and a printer, for example, when printing an image.
- a drive is connected to the external interface 966 as necessary.
- a removable medium such as a magnetic disk or an optical disk is attached to the drive, and a program read from the removable medium can be installed in the imaging device 960.
- the external interface 966 may be configured as a network interface connected to a network such as a LAN or the Internet. That is, the external interface 966 has a role as a transmission unit in the imaging device 960.
- the recording medium mounted on the media drive 968 may be any readable / writable removable medium such as a magnetic disk, a magneto-optical disk, an optical disk, or a semiconductor memory.
- a recording medium may be fixedly mounted on the media drive 968, and a non-portable storage unit such as an internal hard disk drive or an SSD (Solid State Drive) may be configured.
- the control unit 970 includes a processor such as a CPU and memories such as a RAM and a ROM.
- the memory stores a program executed by the CPU, program data, and the like.
- the program stored in the memory is read and executed by the CPU when the imaging device 960 is activated, for example.
- the CPU controls the operation of the imaging device 960 according to an operation signal input from the user interface 971 by executing the program.
- the user interface 971 is connected to the control unit 970.
- the user interface 971 includes, for example, buttons and switches for the user to operate the imaging device 960.
- the user interface 971 detects an operation by the user via these components, generates an operation signal, and outputs the generated operation signal to the control unit 970.
- the image processing unit 964 has the function of the image encoding device 100 (FIG. 1) according to the above-described embodiment. Therefore, the image processing unit 964 can limit the motion prediction in consideration of the accuracy of the motion vector. Accordingly, the image processing unit 964 can encode an image while suppressing an increase in load while suppressing a decrease in encoding efficiency. Therefore, the imaging device 960 can reduce suppression of encoding efficiency.
- the image processing unit 964 also has the function of the image decoding device 200 (FIG. 14) according to the above-described embodiment. Therefore, the image processing unit 964 can decode only the encoded data generated by the encoding in which the accuracy of the motion vector is considered in the limitation of motion prediction. Therefore, the image processing unit 964 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the imaging device 960 can more reliably suppress the reduction in encoding efficiency.
- the image encoding device and the image decoding device to which the present technology is applied can be applied to devices and systems other than the above-described devices.
- the term “associate” means that an image (which may be a part of an image such as a slice or a block) included in the bitstream and information corresponding to the image can be linked at the time of decoding. Means. That is, information may be transmitted on a transmission path different from that of the image (or bit stream). Information may be recorded on a recording medium (or another recording area of the same recording medium) different from the image (or bit stream). Furthermore, the information and the image (or bit stream) may be associated with each other in an arbitrary unit such as a plurality of frames, one frame, or a part of the frame.
- this technique can also take the following structures.
- a control unit that controls inter prediction mode candidates for performing motion compensation based on the accuracy of motion vectors;
- a motion search unit that performs a motion search only in the inter prediction mode permitted to be the candidate by the control unit, and generates a motion vector;
- An image processing apparatus comprising: a motion compensation unit that performs the motion compensation in an inter prediction mode selected from the candidates for which motion vectors are generated by the motion search unit, and generates a predicted image.
- the control unit controls whether or not an inter prediction mode is set as the candidate depending on whether the accuracy of the motion vector is integer accuracy or fractional accuracy. .
- the control unit controls whether or not the inter prediction mode is set as the candidate depending on whether the vertical and horizontal accuracy of the motion vector is integer accuracy or fractional accuracy. ).
- the control unit further controls whether or not the inter prediction mode is set as the candidate depending on whether the prediction mode is uni-prediction or bi-prediction.
- the control unit further controls whether or not an inter prediction mode is set as the candidate according to a size of a region serving as a prediction processing unit.
- the control unit determines the inter prediction mode depending on whether or not the information regarding the amount of data read from the memory determined according to the accuracy of the motion vector exceeds an upper limit due to a resource, a user instruction, or the like.
- the image processing apparatus controls whether or not the candidate is used.
- the image processing apparatus controls whether or not the candidate is used.
- the image processing apparatus further including: a transmission unit that generates and transmits transmission information indicating the content of control by the control unit.
- the transmission unit defines a profile and a level as the transmission information, and transmits information indicating the profile and the level in a sequence parameter set.
- a cost function calculation unit that calculates a cost function value for each candidate for which a motion vector is generated by the motion search unit;
- a mode selection unit that selects an inter prediction mode for performing the motion compensation from the candidates based on the cost function value calculated by the cost calculation unit;
- the control unit controls inter prediction mode candidates for motion compensation based on the accuracy of the motion vector
- the motion search unit performs a motion search only in the inter prediction mode permitted to be the candidate, generates a motion vector
- An acquisition unit that acquires control information that is used in inter prediction when encoding image data and controls inter prediction mode candidates for motion compensation based on the accuracy of motion vectors;
- a determination unit that determines whether or not decoding of encoded data generated by encoding the image data is based on the control information acquired by the acquisition unit;
- An image processing apparatus comprising: a control unit that controls decoding of the encoded data based on a determination result by the determination unit.
- the determination unit determines whether the information on the amount of data read from the memory specified in the control information acquired by the acquisition unit exceeds an upper limit due to a resource, a user instruction, or the like.
- the image processing apparatus according to (11), wherein determination is performed.
- the determination unit performs the determination as to whether the information on the amount of data read from the memory defined by the profile and the level exceeds an upper limit based on a resource, a user instruction, or the like. ).
- the image processing device according to any one of (11) to (13), wherein the control unit executes decoding of the encoded data only when the determination unit determines that decoding is possible.
- An image processing method for an image processing apparatus The acquisition unit acquires control information used in inter prediction when encoding image data, and controls inter prediction mode candidates for motion compensation based on the accuracy of a motion vector, The determination unit determines whether or not to decode the encoded data generated by encoding the image data based on the acquired control information, An image processing method in which a control unit controls decoding of the encoded data based on the determination result.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
This disclosure pertains to an image processing device and method with which it is possible to suppress a reduction in coding efficiency. This image processing device is provided with: a control unit for controlling inter-prediction mode candidates for performing motion compensation, on the basis of motion vector accuracy; a motion search unit for generating a motion vector by performing a motion search only in inter-prediction modes which the control unit has allowed as candidates; and a motion compensation unit for generating a predicted image by performing the motion compensation in the inter-prediction mode selected from among the candidates with which a motion vector was generated by the motion search unit. This disclosure can be applied to an image processing device.
Description
本開示は、画像処理装置および方法に関し、符号化効率の低減を抑制する画像処理装置および方法に関する。
The present disclosure relates to an image processing apparatus and method, and relates to an image processing apparatus and method that suppresses a reduction in encoding efficiency.
近年、画像情報をデジタルとして取り扱い、その際、効率の高い情報の伝送、蓄積を目的とし、画像情報特有の冗長性を利用して、離散コサイン変換等の直交変換と動き補償により圧縮するMPEG(Moving Picture Experts Group)などの方式に準拠した装置が、放送局などの情報配信、及び一般家庭における情報受信の双方において普及した。
In recent years, image information is handled as digital data, and MPEG (compressed by orthogonal transform such as discrete cosine transform and motion compensation is used for the purpose of efficient transmission and storage of information. A device that conforms to a method such as Moving (Pictures Experts Group) has been widely used for both information distribution in broadcasting stations and information reception in general households.
特に、MPEG2(ISO(International Organization for Standardization)/IEC(International Electrotechnical Commission) 13818-2)は、汎用画像符号化方式として定義されており、飛び越し走査画像及び順次走査画像の双方、並びに標準解像度画像及び高精細画像を網羅する標準で、プロフェッショナル用途及びコンシューマ用途の広範なアプリケーションに現在広く用いられている。MPEG2圧縮方式を用いることにより、例えば720×480画素を持つ標準解像度の飛び越し走査画像であれば4~8Mbps、1920×1088画素を持つ高解像度の飛び越し走査画像であれば18~22Mbpsの符号量(ビットレート)を割り当てることで、高い圧縮率と良好な画質の実現が可能である。
In particular, MPEG2 (ISO (International Organization for Standardization) / IEC (International Electrotechnical Commission) 13818-2) is defined as a general-purpose image coding system, and includes both interlaced scanning images and sequential scanning images, as well as standard resolution images and This standard covers high-definition images and is currently widely used in a wide range of professional and consumer applications. By using the MPEG2 compression method, for example, a standard resolution interlaced scanning image having 720 × 480 pixels is 4 to 8 Mbps, and a high resolution interlaced scanning image having 1920 × 1088 pixels is 18 to 22 Mbps. (Bit rate) can be assigned to achieve a high compression rate and good image quality.
MPEG2は主として放送用に適合する高画質符号化を対象としていたが、MPEG1より低い符号量(ビットレート)、つまりより高い圧縮率の符号化方式には対応していなかった。携帯端末の普及により、今後そのような符号化方式のニーズは高まると思われ、これに対応してMPEG4符号化方式の標準化が行われた。画像符号化方式に関しては、1998年12月にISO/IEC 14496-2としてその規格が国際標準に承認された。
MPEG2 was mainly intended for high-quality encoding suitable for broadcasting, but it did not support encoding methods with a lower code amount (bit rate) than MPEG1, that is, a higher compression rate. With the widespread use of mobile terminals, the need for such an encoding system is expected to increase in the future, and the MPEG4 encoding system has been standardized accordingly. Regarding the image coding system, the standard was approved as an international standard in December 1998 as ISO / IEC 14496-2.
更に、近年、当初テレビ会議用の画像符号化を目的として、H.26L (ITU-T(International Telecommunication Union Telecommunication Standardization Sector) Q6/16 VCEG(Video Coding Expert Group))という標準の規格化が進んでいる。H.26LはMPEG2やMPEG4といった従来の符号化方式に比べ、その符号化、復号化により多くの演算量が要求されるものの、より高い符号化効率が実現されることが知られている。また、現在、MPEG4の活動の一環として、このH.26Lをベースに、H.26Lではサポートされない機能をも取り入れ、より高い符号化効率を実現する標準化がJoint Model of Enhanced-Compression Video Codingとして行われた。
In recent years, the standardization of H.26L (ITU-T (International Telecommunication Union Telecommunication Standardization Sector) Q6 / 16 VCEG (Video Coding Expert Group)) has been progressing for the purpose of initial video coding for video conferences. Yes. H.26L is known to achieve higher encoding efficiency than the conventional encoding schemes such as MPEG2 and MPEG4, although a large amount of calculation is required for encoding and decoding. In addition, as part of MPEG4 activities, Joint 取 り 入 れ Model of Enhanced-Compression Video Coding has been implemented based on this H.26L and incorporating functions not supported by H.26L to achieve higher coding efficiency. It was broken.
標準化のスケジュールとしては、2003年3月にはH.264及びMPEG-4 Part10 (Advanced Video Coding、以下AVCと記す)という名の元に国際標準となった。
The standardization schedule became an international standard in March 2003 under the names H.264 and MPEG-4 Part 10 (Advanced Video Coding, hereinafter referred to as AVC).
更に、その拡張として、RGBや4:2:2、4:4:4といった、業務用に必要な符号化ツールや、MPEG-2で規定されていた8x8DCTや量子化マトリクスをも含んだFRExt(Fidelity Range Extension)の標準化が2005年2月に完了し、これにより、AVCを用いて、映画に含まれるフィルムノイズをも良好に表現することが可能な符号化方式となって、Blu-Ray Disc等の幅広いアプリケーションに用いられる運びとなった。
Furthermore, as an extension, FRExt (including RGB, 4: 2: 2, 4: 4: 4, such as encoding tools necessary for business use, 8x8DCT and quantization matrix specified by MPEG-2) The standardization of Fidelity (Range Extension) was completed in February 2005. As a result, Blu-Ray Disc has become an encoding method that can well express film noise contained in movies using AVC. It has been used for a wide range of applications.
しかしながら、昨今、ハイビジョン画像の4倍の、UHD(Ultra High Definition;4000画素×2000画素)の画像を圧縮したい、或いは、インターネットのような、限られた伝送容量の環境において、ハイビジョン画像を配信したいといった、更なる高圧縮率符号化に対するニーズが高まっている。このため、先述の、ITU-T傘下のVCEG(Video Coding Expert Group)において、符号化効率の改善に関する検討が継続され行なわれている。
However, recently, I want to compress UHD (Ultra High Definition: 4000 pixels x 2000 pixels), which is four times higher than high-definition images, or distribute high-definition images in a limited transmission capacity environment such as the Internet. There is a growing need for further high-compression coding. For this reason, in the above-mentioned VCEG (Video Coding Expert Group) under the ITU-T, studies on improving the coding efficiency are being continued.
そして、AVCより更なる符号化効率の向上を目的として、ITU-Tと、ISO/IECの共同の標準化団体であるJCTVC(Joint Collaboration Team - Video Coding)により、HEVC(High Efficiency Video Coding)と呼ばれる符号化方式の標準化が進められている(例えば、特許文献1参照)。
And with the aim of further improving encoding efficiency than AVC, it is called HEVC (High Efficiency Efficiency Video Video Coding) by JCTVC (Joint Collaboration Collaboration Team Video Coding), a joint standardization organization of ITU-T and ISO / IEC. Standardization of the encoding method is underway (see, for example, Patent Document 1).
このようなAVCやHEVCにおいては、インター予測(フレーム間予測)の際に、分数精度の動き補償が行えるように動き補償フィルタ(MC(Motion Compensated)フィルタ)が用いられる。MCフィルタとして、例えばHEVCの場合、輝度成分に対して8タップのFIR(Finite Impulse Response)フィルタが用いられ、色差成分に対しては、4タップのFIRフィルタが用いられる。
In such AVC and HEVC, a motion compensation filter (MC (Motion Compensated) filter) is used so that fractional precision motion compensation can be performed in inter prediction (interframe prediction). As the MC filter, for example, in the case of HEVC, an 8-tap FIR (Finite Impulse Response) filter is used for the luminance component, and a 4-tap FIR filter is used for the color difference component.
インター予測においては、ブロックサイズ、予測モード、および動きベクトル(MV)等に基づいて、フレームメモリから符号化済み画像が参照画像として取り出され、MVの少数部に基づいてMCフィルタが適用されてインター予測画像が生成される。なお、MVは、エンコーダにおいては動き推定により、デコーダにおいてはストリーム情報から得られる。
In inter prediction, an encoded image is extracted from a frame memory as a reference image based on a block size, a prediction mode, a motion vector (MV), and the like, and an MC filter is applied on the basis of a fractional part of MV. A predicted image is generated. Note that MV is obtained from motion estimation in the encoder and stream information in the decoder.
分数精度のMVの場合、予測画像生成のためのMCフィルタのタップ数が大きくなる。つまり、MVが整数の場合よりも、予測画像生成に必要な参照画像の画素数は多くなる。したがって、フレームメモリからより多くの画素データを読み出さなければならない。
場合 In the case of MV with fractional accuracy, the number of taps of the MC filter for predictive image generation becomes large. That is, the number of pixels of the reference image necessary for predictive image generation is larger than when MV is an integer. Therefore, more pixel data must be read from the frame memory.
また、予測画像のある画素を生成するために読み出した参照画像の画素値は、その画素周辺の他の画素の生成にも利用できるので、一般的には、生成する予測画像のサイズが大きいほど参照画像の利用効率は高くなる。つまり、予測画像の生成に必要な参照画像の画素数は、予測画像の1画素当たりに換算すると、予測画像のサイズが大きいほど少なくなる。換言するに、予測画像のサイズが小さいほど、参照画像の利用効率は低くなり、予測画像1画素生成するために必要な参照画像の画素数が多くなる。
In addition, since the pixel value of the reference image read out to generate a certain pixel of the predicted image can be used to generate other pixels around the pixel, generally, the larger the size of the predicted image to be generated, The use efficiency of the reference image is increased. In other words, the number of pixels of the reference image necessary for generating the predicted image is reduced as the size of the predicted image is larger when converted per pixel of the predicted image. In other words, the smaller the size of the predicted image, the lower the utilization efficiency of the reference image, and the larger the number of pixels of the reference image necessary for generating one pixel of the predicted image.
さらに、双方向予測の場合、単方向予測に比べて参照画像の数が増大する。
Furthermore, in the case of bidirectional prediction, the number of reference images increases compared to unidirectional prediction.
このように、読み出す参照画像のサイズが大きくなると、符号化の処理速度を維持するためには、フレームメモリとインター予測部との間のメモリ帯域もより大きく確保する必要があり、負荷が大きくなる。
As described above, when the size of the reference image to be read increases, in order to maintain the encoding processing speed, it is necessary to secure a larger memory bandwidth between the frame memory and the inter prediction unit, which increases the load. .
そのため、AVCでは、レベルによってデコーダが使用するリソースに制限がかけられるようになっている。具体的には、連続する2つのマクロブロックで用いることができる動きベクトルの本数に上限が設けられている。この制限により、小さいブロックサイズのインター予測が連続して発生しないようになされている。また、8×8ブロックサイズのマクロブロックの双方向予測も禁止されている。
Therefore, in AVC, the resources used by the decoder are limited depending on the level. Specifically, there is an upper limit on the number of motion vectors that can be used in two consecutive macroblocks. Due to this limitation, inter prediction with a small block size is not continuously generated. In addition, bi-directional prediction of 8 × 8 block size macroblocks is also prohibited.
以上のような制限は、メモリ帯域への負荷の過剰な増大は抑制するためのものであるが、この制限により、使用できる予測モードとブロックサイズが少なくなってしまうことから、符号化効率を低減させてしまう。
The above restrictions are intended to suppress an excessive increase in the load on the memory band, but this restriction reduces the usable prediction mode and block size, thus reducing the coding efficiency. I will let you.
しかしながら、従来の場合、このような制限において、動きベクトルの精度による負荷への影響は考慮されていなかった。そのため、使用できる予測モードやブロックサイズが不要に制限されてしまい、符号化効率を不要に低減させてしまう恐れがあった。
However, in the conventional case, in such a restriction, the influence on the load due to the accuracy of the motion vector has not been considered. Therefore, usable prediction modes and block sizes are unnecessarily limited, and there is a possibility that the encoding efficiency is unnecessarily reduced.
本開示は、このような状況に鑑みてなされたものであり、ブロックサイズや予測モードだけでなく、動きベクトルの精度も考慮して動き予測を制限することにより、符号化効率の不要な低減を抑制することを目的とする。
The present disclosure has been made in view of such a situation, and by limiting motion prediction in consideration of not only the block size and prediction mode but also the accuracy of motion vectors, unnecessary reduction in encoding efficiency can be achieved. The purpose is to suppress.
本開示の一側面は、動き補償を行うインター予測モードの候補を、動きベクトルの精度に基づいて制御する制御部と、前記制御部により前記候補とすることを許可されたインター予測モードでのみ動き探索を行い、動きベクトルを生成する動き探索部と、前記動き探索部により動きベクトルが生成された各候補の中から選択されたインター予測モードで前記動き補償を行い、予測画像を生成する動き補償部とを備える画像処理装置である。
One aspect of the present disclosure includes a control unit that controls an inter prediction mode candidate for performing motion compensation based on motion vector accuracy, and motion only in an inter prediction mode that is permitted to be the candidate by the control unit. A motion search unit that performs a search and generates a motion vector, and a motion compensation that generates a predicted image by performing the motion compensation in an inter prediction mode selected from each candidate generated by the motion search unit. An image processing apparatus.
前記制御部は、前記動きベクトルの精度が整数精度であるか分数精度であるかによって、インター予測モードを前記候補とするか否かを制御することができる。
The control unit can control whether or not the inter prediction mode is set as the candidate depending on whether the accuracy of the motion vector is integer accuracy or fractional accuracy.
前記制御部は、前記動きベクトルの縦と横の各方向の精度が整数精度であるか分数精度であるかによって、インター予測モードを前記候補とするか否かを制御することができる。
The control unit can control whether or not the inter prediction mode is set as the candidate depending on whether the vertical and horizontal accuracy of the motion vector is integer accuracy or fractional accuracy.
前記制御部は、さらに、予測モードが単予測であるか双予測であるかによって、インター予測モードを前記候補とするか否かを制御することができる。
The control unit can further control whether or not the inter prediction mode is set as the candidate depending on whether the prediction mode is uni-prediction or bi-prediction.
前記制御部は、さらに、予測処理単位とする領域の大きさによって、インター予測モードを前記候補とするか否かを制御することができる。
The control unit can further control whether or not the inter prediction mode is set as the candidate according to the size of the region that is a prediction processing unit.
前記制御部は、前記動きベクトルの精度に応じて決定されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、インター予測モードを前記候補とするか否かを制御することができる。
The control unit determines the inter prediction mode as the candidate depending on whether information on the amount of data read from the memory determined according to the accuracy of the motion vector exceeds an upper limit due to a resource, a user instruction, or the like. Whether or not to do so can be controlled.
前記制御部による制御の内容を示す伝送情報を生成し、伝送させる伝送部をさらに備えることができる。
A transmission unit that generates and transmits transmission information indicating the content of control by the control unit can be further provided.
前記伝送部は、前記伝送情報として、プロファイルとレベルを定義し、前記プロファイルおよび前記レベルを示す情報をシーケンスパラメータセットに含めて伝送させることができる。
The transmission unit may define a profile and a level as the transmission information, and transmit the information indicating the profile and the level in a sequence parameter set.
前記動き探索部により動きベクトルが生成された各候補について、コスト関数値を算出するコスト関数算出部と、前記コスト関す算出部により算出された前記コスト関数値に基づいて、前記動き補償を行うインター予測モードを前記候補の中から選択するモード選択部とをさらに備え、前記動き補償部は、前記モード選択部により選択されたインター予測モードで前記動き補償を行うことができる。
For each candidate for which a motion vector is generated by the motion search unit, a cost function calculation unit that calculates a cost function value and an interface that performs the motion compensation based on the cost function value calculated by the cost calculation unit. A mode selection unit that selects a prediction mode from the candidates, and the motion compensation unit may perform the motion compensation in the inter prediction mode selected by the mode selection unit.
本開示の一側面は、また、画像処理装置の画像処理方法であって、制御部が、動き補償を行うインター予測モードの候補を、動きベクトルの精度に基づいて制御し、動き探索部が、前記候補とすることを許可されたインター予測モードでのみ動き探索を行い、動きベクトルを生成し、動き補償部が、動きベクトルが生成された各候補の中から選択されたインター予測モードで前記動き補償を行い、予測画像を生成する画像処理方法である。
One aspect of the present disclosure is also an image processing method of the image processing device, wherein the control unit controls inter prediction mode candidates for performing motion compensation based on the accuracy of the motion vector, and the motion search unit includes: The motion search is performed only in the inter prediction mode permitted to be the candidate, and a motion vector is generated. The motion compensation unit performs the motion in the inter prediction mode selected from the candidates from which the motion vector is generated. This is an image processing method for generating a predicted image by performing compensation.
本開示の他の側面は、画像データを符号化する際のインター予測において用いられた、動き補償を行うインター予測モードの候補を動きベクトルの精度に基づいて制御する制御情報を取得する取得部と、前記取得部により取得された前記制御情報に基づいて、前記画像データが符号化されて生成された符号化データの復号の可否判定を行う判定部と、前記判定部による判定結果に基づいて、前記符号化データの復号を制御する制御部とを備える画像処理装置である。
According to another aspect of the present disclosure, an acquisition unit that acquires control information used in inter prediction when encoding image data to control inter prediction mode candidates for motion compensation based on the accuracy of a motion vector; Based on the control information acquired by the acquisition unit, a determination unit that determines whether to decode the encoded data generated by encoding the image data, and a determination result by the determination unit, An image processing apparatus comprising: a control unit that controls decoding of the encoded data.
前記判定部は、前記取得部により取得された前記制御情報において指定されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、前記可否判定を行うことができる。
The determination unit performs the determination based on whether or not the information regarding the amount of data read from the memory specified in the control information acquired by the acquisition unit exceeds an upper limit based on a resource or a user instruction. It can be carried out.
前記判定部は、プロファイルとレベルによって定義されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、前記可否判定を行うことができる。
The determination unit can make the determination as to whether the information related to the amount of data read from the memory defined by the profile and level exceeds an upper limit based on resources, user instructions, or the like.
前記制御部は、前記判定部により復号可能と判定された場合のみ、前記符号化データの復号を実行させることができる。
The control unit can execute decoding of the encoded data only when the determination unit determines that decoding is possible.
前記判定部により復号不可能と判定された場合、前記制御部に制御されて、エラー処理を行うエラー処理部をさらに備えることができる。
If the determination unit determines that decoding is impossible, the control unit may further include an error processing unit that performs error processing under the control of the control unit.
本開示の他の側面は、また、画像処理装置の画像処理方法であって、取得部が、画像データを符号化する際のインター予測において用いられた、動き補償を行うインター予測モードの候補を動きベクトルの精度に基づいて制御する制御情報を取得し、判定部が、取得された前記制御情報に基づいて、前記画像データが符号化されて生成された符号化データの復号の可否判定を行い、制御部が、その判定結果に基づいて、前記符号化データの復号を制御する画像処理方法である。
Another aspect of the present disclosure is also an image processing method of an image processing apparatus, in which an acquisition unit selects inter prediction mode candidates for motion compensation used in inter prediction when encoding image data. Control information to be controlled is acquired based on the accuracy of the motion vector, and the determination unit determines whether or not to decode the encoded data generated by encoding the image data based on the acquired control information. The control unit is an image processing method for controlling the decoding of the encoded data based on the determination result.
本開示の一側面においては、動き補償を行うインター予測モードの候補が、動きベクトルの精度に基づいて制御され、その候補とすることを許可されたインター予測モードでのみ動き探索が行われて、動きベクトルが生成され、動きベクトルが生成された各候補の中から選択されたインター予測モードで動き補償が行われ、予測画像が生成される。
In one aspect of the present disclosure, a candidate for an inter prediction mode for performing motion compensation is controlled based on the accuracy of a motion vector, and a motion search is performed only in an inter prediction mode that is permitted to be the candidate. A motion vector is generated, motion compensation is performed in an inter prediction mode selected from the candidates for which the motion vector is generated, and a predicted image is generated.
本開示の他の側面においては、画像データを符号化する際のインター予測において用いられた、動き補償を行うインター予測モードの候補を動きベクトルの精度に基づいて制御する制御情報が取得され、その取得された制御情報に基づいて、画像データが符号化されて生成された符号化データの復号の可否判定が行われ、その判定結果に基づいて、符号化データの復号が制御される。
In another aspect of the present disclosure, control information used to control inter prediction mode candidates for motion compensation, which is used in inter prediction when encoding image data, is acquired based on the accuracy of a motion vector. Based on the acquired control information, it is determined whether or not the encoded data generated by encoding the image data can be decoded, and the decoding of the encoded data is controlled based on the determination result.
本開示によれば、画像を処理することができる。特に、符号化効率の低減を抑制することができる。
According to the present disclosure, an image can be processed. In particular, a reduction in encoding efficiency can be suppressed.
以下、本開示を実施するための形態(以下実施の形態とする)について説明する。なお、説明は以下の順序で行う。
1.第1の実施の形態(画像符号化装置)
2.第2の実施の形態(画像復号装置)
3.第3の実施の形態(コンピュータ)
4.第4の実施の形態(テレビジョン受像機)
5.第5の実施の形態(携帯電話機)
6.第6の実施の形態(記録再生装置)
7.第7の実施の形態(撮像装置) Hereinafter, modes for carrying out the present disclosure (hereinafter referred to as embodiments) will be described. The description will be given in the following order.
1. First Embodiment (Image Encoding Device)
2. Second embodiment (image decoding apparatus)
3. Third embodiment (computer)
4). Fourth embodiment (television receiver)
5. Fifth embodiment (mobile phone)
6). Sixth embodiment (recording / reproducing apparatus)
7). Seventh embodiment (imaging device)
1.第1の実施の形態(画像符号化装置)
2.第2の実施の形態(画像復号装置)
3.第3の実施の形態(コンピュータ)
4.第4の実施の形態(テレビジョン受像機)
5.第5の実施の形態(携帯電話機)
6.第6の実施の形態(記録再生装置)
7.第7の実施の形態(撮像装置) Hereinafter, modes for carrying out the present disclosure (hereinafter referred to as embodiments) will be described. The description will be given in the following order.
1. First Embodiment (Image Encoding Device)
2. Second embodiment (image decoding apparatus)
3. Third embodiment (computer)
4). Fourth embodiment (television receiver)
5. Fifth embodiment (mobile phone)
6). Sixth embodiment (recording / reproducing apparatus)
7). Seventh embodiment (imaging device)
<1.第1の実施の形態>
[画像符号化装置]
図1は、本技術を適用した画像処理装置である画像符号化装置の主な構成例を示すブロック図である。 <1. First Embodiment>
[Image encoding device]
FIG. 1 is a block diagram illustrating a main configuration example of an image encoding device that is an image processing device to which the present technology is applied.
[画像符号化装置]
図1は、本技術を適用した画像処理装置である画像符号化装置の主な構成例を示すブロック図である。 <1. First Embodiment>
[Image encoding device]
FIG. 1 is a block diagram illustrating a main configuration example of an image encoding device that is an image processing device to which the present technology is applied.
図1に示される画像符号化装置100は、例えばHEVC(High Efficiency Video Coding)符号化方式や、H.264及びMPEG(Moving Picture Experts Group)4 Part10(AVC(Advanced Video Coding))符号化方式のように、動画像の画像データを符号化する。
The image encoding apparatus 100 shown in FIG. 1 is, for example, a HEVC (High Efficiency Video Coding) encoding scheme, The image data of the moving image is encoded as in the H.264 and MPEG (Moving Picture Experts Group) 4 Part 10 (AVC (Advanced Video Coding)) coding system.
図1に示されるように画像符号化装置100は、A/D変換部101、画面並べ替えバッファ102、演算部103、直交変換部104、量子化部105、可逆符号化部106、および蓄積バッファ107を有する。また、画像符号化装置100は、逆量子化部108、逆直交変換部109、演算部110、ループフィルタ111、フレームメモリ112、選択部113、イントラ予測部114、動き予測・補償部115、予測画像選択部116、およびレート制御部117を有する。
As shown in FIG. 1, the image encoding device 100 includes an A / D conversion unit 101, a screen rearrangement buffer 102, a calculation unit 103, an orthogonal transformation unit 104, a quantization unit 105, a lossless encoding unit 106, and a storage buffer. 107. The image coding apparatus 100 also includes an inverse quantization unit 108, an inverse orthogonal transform unit 109, a calculation unit 110, a loop filter 111, a frame memory 112, a selection unit 113, an intra prediction unit 114, a motion prediction / compensation unit 115, and a prediction. An image selection unit 116 and a rate control unit 117 are included.
さらに、画像符号化装置100は、動きベクトル制御部121を有する。
Furthermore, the image encoding device 100 includes a motion vector control unit 121.
A/D変換部101は、入力された画像データをA/D変換し、変換後の画像データ(デジタルデータ)を画面並べ替えバッファ102に供給し、記憶させる。
The A / D conversion unit 101 A / D converts the input image data, supplies the converted image data (digital data) to the screen rearrangement buffer 102, and stores it.
画面並べ替えバッファ102は、記憶した表示の順番のフレームの画像を、GOP(Group Of Picture)に応じて、符号化のためのフレームの順番に並べ替え、フレームの順番を並び替えた画像を、演算部103に供給する。画面並べ替えバッファ102は、各フレーム画像を、符号化処理の処理単位(符号化単位)となる所定の部分領域毎に演算部103に供給する。また、画面並べ替えバッファ102は、フレームの順番を並び替えた画像を、同様に部分領域毎に、イントラ予測部114および動き予測・補償部115にも供給する。
The screen rearrangement buffer 102 rearranges the images of the frames in the stored display order in the order of frames for encoding in accordance with GOP (Group Of Picture), and the images in which the order of the frames is rearranged. This is supplied to the calculation unit 103. The screen rearrangement buffer 102 supplies each frame image to the calculation unit 103 for each predetermined partial area that is a processing unit (encoding unit) of the encoding process. The screen rearrangement buffer 102 also supplies the images in which the order of frames is rearranged to the intra prediction unit 114 and the motion prediction / compensation unit 115 for each partial region.
演算部103は、画面並べ替えバッファ102から読み出された画像から、予測画像選択部116を介してイントラ予測部114若しくは動き予測・補償部115から供給される予測画像を減算し、その差分情報を直交変換部104に出力する。例えば、イントラ符号化が行われる画像の場合、演算部103は、画面並べ替えバッファ102から読み出された画像から、イントラ予測部114から供給される予測画像を減算する。また、例えば、インター符号化が行われる画像の場合、演算部103は、画面並べ替えバッファ102から読み出された画像から、動き予測・補償部115から供給される予測画像を減算する。
The calculation unit 103 subtracts the prediction image supplied from the intra prediction unit 114 or the motion prediction / compensation unit 115 via the prediction image selection unit 116 from the image read from the screen rearrangement buffer 102, and the difference information Is output to the orthogonal transform unit 104. For example, in the case of an image on which intra coding is performed, the calculation unit 103 subtracts the prediction image supplied from the intra prediction unit 114 from the image read from the screen rearrangement buffer 102. For example, in the case of an image on which inter coding is performed, the arithmetic unit 103 subtracts the predicted image supplied from the motion prediction / compensation unit 115 from the image read from the screen rearrangement buffer 102.
直交変換部104は、演算部103から供給される差分情報に対して、離散コサイン変換やカルーネン・レーベ変換等の直交変換を施す。なお、この直交変換の方法は任意である。直交変換部104は、その直交変換により得られた変換係数を量子化部105に供給する。
The orthogonal transform unit 104 performs orthogonal transform such as discrete cosine transform and Karhunen-Loeve transform on the difference information supplied from the computation unit 103. Note that this orthogonal transformation method is arbitrary. The orthogonal transform unit 104 supplies the transform coefficient obtained by the orthogonal transform to the quantization unit 105.
量子化部105は、直交変換部104から供給される変換係数を量子化する。量子化部105は、量子化された変換係数を可逆符号化部106に供給する。
The quantization unit 105 quantizes the transform coefficient supplied from the orthogonal transform unit 104. The quantization unit 105 supplies the quantized transform coefficient to the lossless encoding unit 106.
可逆符号化部106は、量子化部105において量子化された変換係数を、任意の符号化方式で符号化し、符号化データ(ビットストリーム)を生成する。係数データは、レート制御部117の制御の下で量子化されているので、この符号化データの符号量は、レート制御部117が設定した目標値となる(若しくは目標値に近似する)。
The lossless encoding unit 106 encodes the transform coefficient quantized by the quantization unit 105 using an arbitrary encoding method, and generates encoded data (bit stream). Since the coefficient data is quantized under the control of the rate control unit 117, the code amount of the encoded data becomes the target value set by the rate control unit 117 (or approximates the target value).
また、可逆符号化部106は、イントラ予測のモードを示す情報等を含むイントラ予測情報をイントラ予測部114から取得し、インター予測のモードを示す情報や動きベクトル情報などを含むインター予測情報を動き予測・補償部115から取得する。
Further, the lossless encoding unit 106 acquires intra prediction information including information indicating an intra prediction mode from the intra prediction unit 114, and moves inter prediction information including information indicating an inter prediction mode, motion vector information, and the like. Obtained from the prediction / compensation unit 115.
さらに、可逆符号化部106は、動きベクトル制御部121から供給される動き予測の制限に関する情報(伝送情報)を取得する。
Furthermore, the lossless encoding unit 106 acquires information (transmission information) regarding the motion prediction limitation supplied from the motion vector control unit 121.
可逆符号化部106は、これらの各種情報を任意の符号化方式で符号化し、符号化データ(ビットストリーム)に含める(多重化する)。
The lossless encoding unit 106 encodes these various types of information using an arbitrary encoding method, and includes (multiplexes) the information in the encoded data (bit stream).
可逆符号化部106は、このように生成された符号化データを蓄積バッファ107に供給して蓄積させる。可逆符号化部106の符号化方式としては、例えば、可変長符号化または算術符号化等が挙げられる。可変長符号化としては、例えば、H.264/AVC方式で定められているCAVLC(Context-Adaptive Variable Length Coding)などが挙げられる。算術符号化としては、例えば、CABAC(Context-Adaptive Binary Arithmetic Coding)などが挙げられる。
The lossless encoding unit 106 supplies the encoded data generated in this way to the storage buffer 107 for storage. Examples of the encoding scheme of the lossless encoding unit 106 include variable length encoding or arithmetic encoding. Examples of variable length coding include H.264. CAVLC (Context-Adaptive Variable Length Coding) defined in the H.264 / AVC format. Examples of arithmetic coding include CABAC (Context-Adaptive Binary Arithmetic Coding).
蓄積バッファ107は、可逆符号化部106から供給された符号化データを、一時的に保持する。蓄積バッファ107は、所定のタイミングにおいて、保持している符号化データを、ビットストリームとして、例えば、後段の図示せぬ記録装置(記録媒体)や伝送路などに出力する。つまり、符号化された各種情報が、画像符号化装置100によって画像データが符号化されて得られた符号化データを復号する装置(以下、復号側の装置とも称する)に供給される。
The accumulation buffer 107 temporarily holds the encoded data supplied from the lossless encoding unit 106. The accumulation buffer 107 outputs the stored encoded data as a bit stream at a predetermined timing, for example, to a recording device (recording medium) or a transmission path (not shown) in the subsequent stage. That is, various types of encoded information are supplied to a device that decodes encoded data obtained by encoding image data by the image encoding device 100 (hereinafter also referred to as a decoding-side device).
また、量子化部105において量子化された変換係数は、逆量子化部108にも供給される。逆量子化部108は、その量子化された変換係数を、量子化部105による量子化に対応する方法で逆量子化する。逆量子化部108は、得られた変換係数を、逆直交変換部109に供給する。
Also, the transform coefficient quantized by the quantization unit 105 is also supplied to the inverse quantization unit 108. The inverse quantization unit 108 inversely quantizes the quantized transform coefficient by a method corresponding to the quantization by the quantization unit 105. The inverse quantization unit 108 supplies the obtained transform coefficient to the inverse orthogonal transform unit 109.
逆直交変換部109は、逆量子化部108から供給された変換係数を、直交変換部104による直交変換に対応する方法で逆直交変換する。逆直交変換された出力(局所的に復元された差分情報)は、演算部110に供給される。
The inverse orthogonal transform unit 109 performs inverse orthogonal transform on the transform coefficient supplied from the inverse quantization unit 108 by a method corresponding to the orthogonal transform performed by the orthogonal transform unit 104. The inversely orthogonally transformed output (difference information restored locally) is supplied to the calculation unit 110.
演算部110は、逆直交変換部109から供給された逆直交変換結果、すなわち、局所的に復元された差分情報に、予測画像選択部116を介してイントラ予測部114若しくは動き予測・補償部115から供給される予測画像を加算し、局所的に再構成された画像(以下、再構成画像と称する)を得る。その再構成画像は、ループフィルタ111またはフレームメモリ112に供給される。
The calculation unit 110 converts the inverse orthogonal transform result supplied from the inverse orthogonal transform unit 109, that is, locally restored difference information, into the intra prediction unit 114 or the motion prediction / compensation unit 115 via the predicted image selection unit 116. Are added to the predicted image to obtain a locally reconstructed image (hereinafter referred to as a reconstructed image). The reconstructed image is supplied to the loop filter 111 or the frame memory 112.
ループフィルタ111は、デブロックフィルタや適応ループフィルタ等を含み、演算部110から供給される復号画像に対して適宜フィルタ処理を行う。例えば、ループフィルタ111は、復号画像に対してデブロックフィルタ処理を行うことにより復号画像のブロック歪を除去する。また、例えば、ループフィルタ111は、そのデブロックフィルタ処理結果(ブロック歪みの除去が行われた復号画像)に対して、ウィナーフィルタ(Wiener Filter)を用いてループフィルタ処理を行うことにより画質改善を行う。
The loop filter 111 includes a deblock filter, an adaptive loop filter, and the like, and appropriately performs a filtering process on the decoded image supplied from the calculation unit 110. For example, the loop filter 111 removes block distortion of the decoded image by performing a deblocking filter process on the decoded image. In addition, for example, the loop filter 111 performs image quality improvement by performing loop filter processing using a Wiener filter on the deblock filter processing result (decoded image from which block distortion has been removed). Do.
なお、ループフィルタ111が、復号画像に対して任意のフィルタ処理を行うようにしてもよい。また、ループフィルタ111は、必要に応じて、フィルタ処理に用いたフィルタ係数等の情報を可逆符号化部106に供給し、それを符号化させるようにすることもできる。ループフィルタ111は、フィルタ処理結果(以下、復号画像と称する)をフレームメモリ112に供給する。
Note that the loop filter 111 may perform arbitrary filter processing on the decoded image. Further, the loop filter 111 can supply information such as filter coefficients used for the filter processing to the lossless encoding unit 106 and encode it as necessary. The loop filter 111 supplies a filter processing result (hereinafter referred to as a decoded image) to the frame memory 112.
フレームメモリ112は、演算部110から供給される再構成画像と、ループフィルタ111から供給される復号画像とをそれぞれ記憶する。フレームメモリ112は、所定のタイミングにおいて、若しくは、イントラ予測部114等の外部からの要求に基づいて、記憶している再構成画像を、選択部113を介してイントラ予測部114に供給する。また、フレームメモリ112は、所定のタイミングにおいて、若しくは、動き予測・補償部115等の外部からの要求に基づいて、記憶している復号画像を、選択部113を介して、動き予測・補償部115に供給する。
The frame memory 112 stores the reconstructed image supplied from the calculation unit 110 and the decoded image supplied from the loop filter 111, respectively. The frame memory 112 supplies the stored reconstructed image to the intra prediction unit 114 via the selection unit 113 at a predetermined timing or based on a request from the outside such as the intra prediction unit 114. The frame memory 112 also stores the decoded image stored at a predetermined timing or based on a request from the outside such as the motion prediction / compensation unit 115 via the selection unit 113. 115.
選択部113は、フレームメモリ112から出力される画像の供給先を示す。例えば、イントラ予測の場合、選択部113は、フレームメモリ112からフィルタ処理されていない画像(再構成画像)を読み出し、周辺画素として、イントラ予測部114に供給する。
The selection unit 113 indicates the supply destination of the image output from the frame memory 112. For example, in the case of intra prediction, the selection unit 113 reads an image (reconstructed image) that has not been subjected to filter processing from the frame memory 112 and supplies it to the intra prediction unit 114 as peripheral pixels.
また、例えば、インター予測の場合、選択部113は、フレームメモリ112からフィルタ処理された画像(復号画像)を読み出し、参照画像として、それを動き予測・補償部115に供給する。
Also, for example, in the case of inter prediction, the selection unit 113 reads out an image (decoded image) that has been filtered from the frame memory 112, and supplies it as a reference image to the motion prediction / compensation unit 115.
イントラ予測部114は、フレームメモリ112から、処理対象領域の周辺に位置する周辺領域の画像(周辺画像)を取得すると、その周辺画像の画素値を用いて、基本的にプレディクションユニット(PU(Prediction Unit))を処理単位として予測画像を生成するイントラ予測(画面内予測)を行う。イントラ予測部114は、予め用意された複数のモード(イントラ予測モード)でこのイントラ予測を行う。
When the intra prediction unit 114 acquires an image (peripheral image) of a peripheral region located around the processing target region from the frame memory 112, the intra prediction unit 114 basically uses a pixel value of the peripheral image to predict a prediction unit (PU ( Prediction (Unit))) is used as a processing unit to perform intra prediction (in-screen prediction) for generating a predicted image. The intra prediction unit 114 performs this intra prediction in a plurality of modes (intra prediction modes) prepared in advance.
イントラ予測部114は、候補となる全てのイントラ予測モードで予測画像を生成し、画面並べ替えバッファ102から供給される入力画像を用いて各予測画像のコスト関数値を評価し、最適なモードを選択する。イントラ予測部114は、最適なイントラ予測モードを選択すると、その最適なモードで生成された予測画像を、予測画像選択部116に供給する。
The intra prediction unit 114 generates predicted images in all candidate intra prediction modes, evaluates the cost function value of each predicted image using the input image supplied from the screen rearrangement buffer 102, and selects the optimum mode. select. When the intra prediction unit 114 selects the optimal intra prediction mode, the intra prediction unit 114 supplies the predicted image generated in the optimal mode to the predicted image selection unit 116.
また、イントラ予測部114は、最適なイントラ予測モード等、イントラ予測に関する情報を含むイントラ予測情報を、適宜可逆符号化部106に供給し、符号化させる。
In addition, the intra prediction unit 114 appropriately supplies intra prediction information including information related to intra prediction, such as an optimal intra prediction mode, to the lossless encoding unit 106 to be encoded.
動き予測・補償部115は、画面並べ替えバッファ102から供給される入力画像と、フレームメモリ112から供給される参照画像とを用いて、基本的にPUを処理単位として、動き予測(インター予測)を行い、検出された動きベクトルに応じて動き補償処理を行い、予測画像(インター予測画像情報)を生成する。動き予測・補償部115は、予め用意された複数のモード(インター予測モード)でこのようなインター予測を行う。
The motion prediction / compensation unit 115 basically performs motion prediction (inter prediction) using the input image supplied from the screen rearrangement buffer 102 and the reference image supplied from the frame memory 112 as a processing unit. And a motion compensation process is performed according to the detected motion vector to generate a predicted image (inter predicted image information). The motion prediction / compensation unit 115 performs such inter prediction in a plurality of modes (inter prediction modes) prepared in advance.
なお、動き予測・補償部115は、動きベクトル制御部121の制御に従って、許可されたモードでのみ動き予測を行う。つまり、動きベクトル制御部121により、候補となるインター予測モードが制限される場合がある。
Note that the motion prediction / compensation unit 115 performs motion prediction only in the permitted mode under the control of the motion vector control unit 121. That is, the motion vector control unit 121 may limit candidate inter prediction modes.
動き予測・補償部115は、候補となる全てのインター予測モードで予測画像を生成し、各予測画像のコスト関数値を評価し、最適なモードを選択する。動き予測・補償部115は、最適なインター予測モードを選択すると、その最適なモードで生成された予測画像を、予測画像選択部116に供給する。
The motion prediction / compensation unit 115 generates prediction images in all candidate inter prediction modes, evaluates the cost function value of each prediction image, and selects an optimal mode. When the optimal inter prediction mode is selected, the motion prediction / compensation unit 115 supplies the predicted image generated in the optimal mode to the predicted image selection unit 116.
また、動き予測・補償部115は、最適なインター予測モード等、インター予測に関する情報を含むインター予測情報を、適宜可逆符号化部106に供給し、符号化させる。
Also, the motion prediction / compensation unit 115 appropriately supplies inter prediction information including information related to inter prediction, such as an optimal inter prediction mode, to the lossless encoding unit 106 to be encoded.
予測画像選択部116は、演算部103や演算部110に供給する予測画像の供給元を選択する。例えば、イントラ符号化の場合、予測画像選択部116は、予測画像の供給元としてイントラ予測部114を選択し、そのイントラ予測部114から供給される予測画像を演算部103や演算部110に供給する。また、例えば、インター符号化の場合、予測画像選択部116は、予測画像の供給元として動き予測・補償部115を選択し、その動き予測・補償部115から供給される予測画像を演算部103や演算部110に供給する。
The predicted image selection unit 116 selects a supply source of a predicted image to be supplied to the calculation unit 103 or the calculation unit 110. For example, in the case of intra coding, the prediction image selection unit 116 selects the intra prediction unit 114 as a supply source of the prediction image, and supplies the prediction image supplied from the intra prediction unit 114 to the calculation unit 103 and the calculation unit 110. To do. Also, for example, in the case of inter coding, the predicted image selection unit 116 selects the motion prediction / compensation unit 115 as a supply source of the predicted image, and calculates the predicted image supplied from the motion prediction / compensation unit 115 as the calculation unit 103. To the arithmetic unit 110.
レート制御部117は、蓄積バッファ107に蓄積された符号化データの符号量に基づいて、オーバーフローあるいはアンダーフローが発生しないように、量子化部105の量子化動作のレートを制御する。
The rate control unit 117 controls the quantization operation rate of the quantization unit 105 based on the code amount of the encoded data stored in the storage buffer 107 so that overflow or underflow does not occur.
動きベクトル制御部121は、動き予測・補償部115による参照画像の読み出しについてのリソースの確保のために、動き予測・補償部115による動き予測のモードを制御する。つまり、動きベクトル制御部121は、参照画像読み出しのためにフレームメモリ112のメモリ帯域が不要に占有されないように、動きベクトルの生成を適宜制限する。その際、動きベクトル制御部121は、ブロックサイズや予測モード(予測方向)だけでなく、生成する動きベクトルの精度も考慮して制限を行う。
The motion vector control unit 121 controls the mode of motion prediction performed by the motion prediction / compensation unit 115 in order to secure resources for reading out the reference image by the motion prediction / compensation unit 115. That is, the motion vector control unit 121 appropriately limits the generation of motion vectors so that the memory bandwidth of the frame memory 112 is not unnecessarily occupied for reading the reference image. At this time, the motion vector control unit 121 performs restriction in consideration of not only the block size and the prediction mode (prediction direction) but also the accuracy of the generated motion vector.
動きベクトル制御部121は、そのような制限内容を示す制御情報を動き予測・補償部115に供給する。動き予測・補償部115は、供給された制御情報に従って動き予測を行う。これにより、動き予測・補償部115は、動きベクトルの精度も考慮された制限に従って動き予測を行うことになる。
The motion vector control unit 121 supplies control information indicating such restriction contents to the motion prediction / compensation unit 115. The motion prediction / compensation unit 115 performs motion prediction according to the supplied control information. As a result, the motion prediction / compensation unit 115 performs motion prediction according to a restriction that also considers the accuracy of the motion vector.
なお、動きベクトル制御部121は、この制限に関する情報を、伝送情報として可逆符号化部106に供給し、伝送させる。
Note that the motion vector control unit 121 supplies information related to this restriction to the lossless encoding unit 106 as transmission information for transmission.
[1/4画素精度動き予測]
図2は、AVC符号化方式において規定されている、1/4画素精度の動き予測・補償処理の様子の例を説明する図である。図2において、各四角は、画素を示している。その内、Aはフレームメモリに格納されている整数精度画素の位置を示し、b,c,dは、1/2画素精度の位置を示し、e1,e2,e3は1/4画素精度の位置を示している。 [1/4 pixel precision motion prediction]
FIG. 2 is a diagram for explaining an example of a state of motion prediction / compensation processing with 1/4 pixel accuracy defined in the AVC encoding method. In FIG. 2, each square represents a pixel. Among them, A indicates the position of the integer precision pixel stored in the frame memory, b, c, d indicate the position of 1/2 pixel precision, and e 1 , e 2 , e 3 indicate 1/4 pixel. The position of accuracy is shown.
図2は、AVC符号化方式において規定されている、1/4画素精度の動き予測・補償処理の様子の例を説明する図である。図2において、各四角は、画素を示している。その内、Aはフレームメモリに格納されている整数精度画素の位置を示し、b,c,dは、1/2画素精度の位置を示し、e1,e2,e3は1/4画素精度の位置を示している。 [1/4 pixel precision motion prediction]
FIG. 2 is a diagram for explaining an example of a state of motion prediction / compensation processing with 1/4 pixel accuracy defined in the AVC encoding method. In FIG. 2, each square represents a pixel. Among them, A indicates the position of the integer precision pixel stored in the frame memory, b, c, d indicate the position of 1/2 pixel precision, and e 1 , e 2 , e 3 indicate 1/4 pixel. The position of accuracy is shown.
以下においては、関数Clip1()を以下の式(1)のように定義する。
In the following, the function Clip1 () is defined as in the following equation (1).
Clip1(a)=
0;if(a<0)
a;otherwise
max_pix;if(a>max_pix)
・・・(1) Clip1 (a) =
0; if (a <0)
a; otherwise
max_pix; if (a> max_pix)
... (1)
0;if(a<0)
a;otherwise
max_pix;if(a>max_pix)
・・・(1) Clip1 (a) =
0; if (a <0)
a; otherwise
max_pix; if (a> max_pix)
... (1)
例えば、入力画像が8ビット精度である場合、式(1)のmax_pixの値は255となる。
For example, when the input image has 8-bit precision, the value of max_pix in Expression (1) is 255.
b及びdの位置における画素値は、6tapのFIRフィルタを用いて、以下の式(2)および式(3)のように生成される。
The pixel values at the positions b and d are generated as shown in the following equations (2) and (3) using a 6 tap FIR filter.
F = A-2-5・A-1+20・A0+20・A1-5・A2+A3 ・・・(2)
b,d = Clip1((F+16)>>5) ・・・(3) F = A -2 -5 ・ A -1 +20 ・ A 0 +20 ・ A 1 -5 ・ A 2 + A 3 (2)
b, d = Clip1 ((F + 16) >> 5) (3)
b,d = Clip1((F+16)>>5) ・・・(3) F = A -2 -5 ・ A -1 +20 ・ A 0 +20 ・ A 1 -5 ・ A 2 + A 3 (2)
b, d = Clip1 ((F + 16) >> 5) (3)
cの位置における画素値は、水平方向及び垂直方向に6tapのFIRフィルタを適用し、以下の式(4)乃至式(6)のように生成される。
The pixel value at the position of c is generated as shown in the following formulas (4) to (6) by applying a 6 tap FIR filter in the horizontal direction and the vertical direction.
F = b-2-5・b-1+20・b0+20・b1-5・b2+b3 ・・・(4)
もしくは、
F = d-2-5・d-1+20・d0+20・d1-5・d2+d3 ・・・(5)
c = Clip1((F+512)>>10) ・・・(6) F = b -2 -5 ・ b -1 +20 ・ b 0 +20 ・ b 1 -5 ・ b 2 + b 3 (4)
Or
F = d -2 -5 ・ d -1 +20 ・ d 0 +20 ・ d 1 -5 ・ d 2 + d 3 (5)
c = Clip1 ((F + 512) >> 10) (6)
もしくは、
F = d-2-5・d-1+20・d0+20・d1-5・d2+d3 ・・・(5)
c = Clip1((F+512)>>10) ・・・(6) F = b -2 -5 ・ b -1 +20 ・ b 0 +20 ・ b 1 -5 ・ b 2 + b 3 (4)
Or
F = d -2 -5 ・ d -1 +20 ・ d 0 +20 ・ d 1 -5 ・ d 2 + d 3 (5)
c = Clip1 ((F + 512) >> 10) (6)
なお、Clip処理は、水平方向及び垂直方向の積和処理の両方を行った後、最後に1度のみ行われる。
Note that the Clip processing is performed only once at the end after performing both horizontal and vertical product-sum processing.
e1乃至e3は、以下の式(7)乃至式(9)のように、線形内挿により生成される。
e 1 to e 3 are generated by linear interpolation as in the following formulas (7) to (9).
e1 = (A+b+1)>>1 ・・・(7)
e2 = (b+d+1)>>1 ・・・(8)
e3 = (b+c+1)>>1 ・・・(9) e 1 = (A + b + 1) >> 1 (7)
e 2 = (b + d + 1) >> 1 (8)
e 3 = (b + c + 1) >> 1 (9)
e2 = (b+d+1)>>1 ・・・(8)
e3 = (b+c+1)>>1 ・・・(9) e 1 = (A + b + 1) >> 1 (7)
e 2 = (b + d + 1) >> 1 (8)
e 3 = (b + c + 1) >> 1 (9)
このように動きベクトルが分数精度の場合、予測画像生成に、動きベクトルが整数精度の場合よりも多くの画素の参照画像が必要になる。
In this way, when the motion vector has fractional accuracy, a reference image with more pixels is required to generate a predicted image than when the motion vector has integer accuracy.
[コーディングユニット]
ここで、HEVC符号化方式において定められている、コーディングユニット(CU(Coding Unit))について説明する。 [Coding unit]
Here, a coding unit (CU (Coding Unit)) defined in the HEVC encoding method will be described.
ここで、HEVC符号化方式において定められている、コーディングユニット(CU(Coding Unit))について説明する。 [Coding unit]
Here, a coding unit (CU (Coding Unit)) defined in the HEVC encoding method will be described.
コーディングユニット(CU)は、コーディングツリーブロック(CTB(Coding Tree Block))とも呼ばれ、AVCにおけるマクロブロックと同様の役割を果たす、ピクチャ単位の画像の多層構造の部分領域である。つまり、CUは、符号化処理の単位(符号化単位)である。マクロブロックの大きさが16×16画素に固定されているのに対し、CUの大きさは固定されておらず、それぞれのシーケンスにおいて画像圧縮情報中に指定されることになる。
The coding unit (CU) is also called a coding tree block (CTB (Coding Tree Block)), and is a partial region of a multi-layer structure of a picture unit image that plays the same role as a macroblock in AVC. That is, CU is a unit (encoding unit) of encoding processing. While the size of the macroblock is fixed to 16 × 16 pixels, the size of the CU is not fixed, and is specified in the image compression information in each sequence.
特に、最大の大きさを持つCUを、ラージストコーディングユニット(LCU(Largest Coding Unit)と呼び、また、最小の大きさを持つCUをスモーレストコーディングユニット(SCU(Smallest Coding Unit))と称する。すなわち、LCUは、最大符号化単位であり、SCUは、最小符号化単位である。例えば画像圧縮情報に含まれるシーケンスパラメータセットにおいて、これらの領域のサイズが指定されることになるが、それぞれ、正方形で、2の冪乗で表される大きさに限定される。つまり、ある階層の(正方形の)CUが2x2に4分割された各領域が1階層下の(正方形の)CUとなる。
In particular, a CU having the largest size is referred to as a large coding unit (LCU (Largest Coding Unit)), and a CU having the smallest size is referred to as a smallest coding unit (SCU (Smallest Coding Unit)). That is, the LCU is the maximum coding unit, and the SCU is the minimum coding unit.For example, in the sequence parameter set included in the image compression information, the sizes of these areas are specified, It is a square and is limited to a size represented by a power of 2. That is, each area obtained by dividing a (square) CU at a certain level into 2 × 2 is a (square) CU one level below.
図3に、HEVCで定義されているコーディングユニット(Coding Unit)の例を示す。図3の例では、LCUの大きさが128(2N(N=64))であり、最大階層深度が5(Depth=4)となる。2Nx2Nの大きさのCUは、split_flagの値が「1」である場合、1つ下の階層となる、NxNの大きさのCUに分割される。
Figure 3 shows an example of coding unit (Coding Unit) defined in HEVC. In the example of FIG. 3, the LCU size is 128 (2N (N = 64)), and the maximum hierarchical depth is 5 (Depth = 4). When the value of split_flag is “1”, the CU having the size of 2Nx2N is divided into CUs having the size of NxN that is one level below.
更に、CUは、イントラ若しくはインター予測の処理単位となる領域(ピクチャ単位の画像の部分領域)であるプレディクションユニット(Prediction Unit(PU))に分割され、また、直交変換の処理単位となる領域(ピクチャ単位の画像の部分領域)である、トランスフォームユニット(Transform Unit(TU))に分割される。
Further, the CU is divided into prediction units (Prediction Units (PU)) that are regions (partial regions of images in units of pictures) that are processing units of intra or inter prediction, and are regions that are processing units of orthogonal transformation It is divided into transform units (Transform Unit (TU)), which is (a partial area of an image in units of pictures).
インター予測のPU(Inter Prediction Unit)の場合、大きさ2Nx2NのCUに対して、2Nx2N、2NxN、Nx2N、NxNの4種類のサイズが設定可能である。つまり、1つのCUに対して、そのCUと同サイズの1つPU、そのCUを縦若しくは横に2分割した2つのPU、若しくは、そのCUを縦および横にそれぞれ2分割した4つのPUを定義することができる。
In the case of inter prediction PU (Inter Prediction Unit), 4N sizes of 2Nx2N, 2NxN, Nx2N, and NxN can be set for a 2Nx2N CU. In other words, for one CU, one PU of the same size as that CU, two PUs obtained by dividing the CU vertically or horizontally, or four PUs obtained by dividing the CU into two vertically and horizontally respectively. Can be defined.
画像符号化装置100は、このようなピクチャ単位の画像の部分領域を処理単位として符号化に関する各処理を行う。以下においては、画像符号化装置100が、HEVCで定義されるCUを符号化単位とする場合について説明する。つまり、LCUが最大符号化単位となり、SCUが最小符号化単位となる。ただし、画像符号化装置100による符号化の各処理の処理単位は、これに限らず、任意である。例えば、AVCで定義されるマクロブロックやサブマクロブロックを処理単位とするようにしてもよい。
The image encoding apparatus 100 performs each process related to encoding using a partial region of an image in units of pictures as a processing unit. In the following, a case will be described in which the image coding apparatus 100 uses a CU defined by HEVC as a coding unit. That is, LCU is the maximum coding unit and SCU is the minimum coding unit. However, the processing unit of each encoding process by the image encoding apparatus 100 is not limited to this, and is arbitrary. For example, a macroblock or sub-macroblock defined by AVC may be used as a processing unit.
なお、以下において、「(部分)領域」には、上述した各種領域(例えば、マクロブロック、サブマクロブロック、LCU、CU、SCU、PU、およびTU等)が全て含まれる(それらのいずれかであってもよい)。もちろん、上述した以外の単位が含まれてもよいし、説明の内容に応じて不可能な単位は、適宜、除外するものとする。
In the following, the “(partial) area” includes all the above-mentioned various areas (for example, macroblock, sub-macroblock, LCU, CU, SCU, PU, TU, etc.). May be). Of course, units other than those described above may be included, and units that are impossible according to the content of the description are appropriately excluded.
[参照画像の画素数制限]
以上のように動き予測(動きベクトル生成)の処理単位となるPUは様々な大きさに設定することができる。一般的には、PUのサイズが小さいほど、参照画像の利用効率は低くなり、予測画像1画素生成するために必要な参照画像の画素数が多くなる。また、双方向予測の場合、単方向予測に比べて参照画像の数が増大する。 [Reference image pixel limit]
As described above, the PU serving as a processing unit for motion prediction (motion vector generation) can be set to various sizes. In general, the smaller the size of the PU, the lower the utilization efficiency of the reference image, and the greater the number of reference image pixels necessary for generating one predicted image pixel. Further, in the case of bidirectional prediction, the number of reference images increases compared to unidirectional prediction.
以上のように動き予測(動きベクトル生成)の処理単位となるPUは様々な大きさに設定することができる。一般的には、PUのサイズが小さいほど、参照画像の利用効率は低くなり、予測画像1画素生成するために必要な参照画像の画素数が多くなる。また、双方向予測の場合、単方向予測に比べて参照画像の数が増大する。 [Reference image pixel limit]
As described above, the PU serving as a processing unit for motion prediction (motion vector generation) can be set to various sizes. In general, the smaller the size of the PU, the lower the utilization efficiency of the reference image, and the greater the number of reference image pixels necessary for generating one predicted image pixel. Further, in the case of bidirectional prediction, the number of reference images increases compared to unidirectional prediction.
そのため、AVCでは、レベルによってデコーダが使用するリソースに制限がかけられるようになっている。具体的には、連続する2つのマクロブロックで用いることができる動きベクトルの本数に上限が設けられている。この制限により、小さいブロックサイズのインター予測が連続して発生しないようになされている。また、8×8より小さいブロックサイズのマクロブロックの双方向予測も禁止されている。
Therefore, in AVC, the resources used by the decoder are limited depending on the level. Specifically, there is an upper limit on the number of motion vectors that can be used in two consecutive macroblocks. Due to this limitation, inter prediction with a small block size is not continuously generated. In addition, bidirectional prediction of macroblocks having a block size smaller than 8 × 8 is also prohibited.
このような制限は、利用できる予測モードとブロックサイズが少なくなってしまうことから、符号化効率を低減させてしまうが、メモリ帯域への負荷の過剰な増大は抑制するために必要なものであった。
Such a restriction reduces the encoding efficiency because it reduces the available prediction modes and block sizes, but is necessary to suppress an excessive increase in the load on the memory bandwidth. It was.
[必要な参照画像画素数]
ここで、予測画像の生成に必要な参照画像画素数についてより具体的に説明する。MCフィルタはn画素を出力するために、(n+タップ数-1)画素を入力する必要がある。つまり、PUの縦方向のサイズをBVとし、横方向のサイズをBHとすると、予測画像生成のために必要な参照画像の画素数Nは、以下の式(10)のように表すことができる。 [Required number of reference image pixels]
Here, the number of reference image pixels necessary for generating a predicted image will be described more specifically. Since the MC filter outputs n pixels, it is necessary to input (n + number of taps-1) pixels. That is, assuming that the vertical size of the PU is BV and the horizontal size is BH, the number of pixels N of the reference image necessary for generating the predicted image can be expressed as the following Expression (10). .
ここで、予測画像の生成に必要な参照画像画素数についてより具体的に説明する。MCフィルタはn画素を出力するために、(n+タップ数-1)画素を入力する必要がある。つまり、PUの縦方向のサイズをBVとし、横方向のサイズをBHとすると、予測画像生成のために必要な参照画像の画素数Nは、以下の式(10)のように表すことができる。 [Required number of reference image pixels]
Here, the number of reference image pixels necessary for generating a predicted image will be described more specifically. Since the MC filter outputs n pixels, it is necessary to input (n + number of taps-1) pixels. That is, assuming that the vertical size of the PU is BV and the horizontal size is BH, the number of pixels N of the reference image necessary for generating the predicted image can be expressed as the following Expression (10). .
N=(BH+タップ数-1)×(BV+タップ数-1) ・・・(10)
N = (BH + number of taps-1) x (BV + number of taps-1) ... (10)
動きベクトルが整数(少数部が0)の場合、FIRフィルタは1タップのフィルタが用いられることに相当する。つまり、式(10)において、タップ数=1となる。また、双予測(Bi-prediction)の場合、L0およびL1予測の両方が必要になるため、予測画像生成のために必要な参照画像の画素数Nは、以下の式(11)のように表すことができる。
When the motion vector is an integer (the decimal part is 0), the FIR filter corresponds to a 1-tap filter being used. That is, in Equation (10), the number of taps = 1. In addition, in the case of bi-prediction, both L0 and L1 predictions are required, so the number of pixels N of the reference image necessary for generating a predicted image is expressed by the following equation (11). be able to.
N=(L0の式(10)に相当する画素)+(L1の式(10)に相当する画素)
・・・(11) N = (pixel corresponding to equation (10) of L0) + (pixel corresponding to equation (10) of L1)
(11)
・・・(11) N = (pixel corresponding to equation (10) of L0) + (pixel corresponding to equation (10) of L1)
(11)
さらに、実際のメモリには1画素単位のアクセスはできず、メモリアラインメント単位で読み込みが必要になる。横と縦のメモリアライメントの画素数がAHおよびAVだとすると、予測画像生成のために必要な参照画像の画素数Nは、以下の式(12)のように表すことができる。
N=AH×(1+ROUNDUP(((BH+タップ数-1)-1)/AH)))
×AV×(1+ROUNDUP(((BV+タップ数-1)-1)/AV)))
・・・(12) Furthermore, the actual memory cannot be accessed in units of one pixel, and must be read in units of memory alignment. Assuming that the number of pixels in the horizontal and vertical memory alignments is AH and AV, the number of pixels N of the reference image necessary for generating the predicted image can be expressed as the following Expression (12).
N = AH x (1 + ROUNDUP (((BH + number of taps-1)-1) / AH)))
× AV × (1 + ROUNDUP ((((BV + number of taps-1)-1) / AV)))
(12)
N=AH×(1+ROUNDUP(((BH+タップ数-1)-1)/AH)))
×AV×(1+ROUNDUP(((BV+タップ数-1)-1)/AV)))
・・・(12) Furthermore, the actual memory cannot be accessed in units of one pixel, and must be read in units of memory alignment. Assuming that the number of pixels in the horizontal and vertical memory alignments is AH and AV, the number of pixels N of the reference image necessary for generating the predicted image can be expressed as the following Expression (12).
N = AH x (1 + ROUNDUP (((BH + number of taps-1)-1) / AH)))
× AV × (1 + ROUNDUP ((((BV + number of taps-1)-1) / AV)))
(12)
なお、式(12)において、ROUNDUP(x)はxより大きい最小の整数を示す。双予測の場合、予測画像生成のために必要な参照画像の画素数Nは、以下の式(13)のように表すことができる。
In equation (12), ROUNDUP (x) represents the smallest integer greater than x. In the case of bi-prediction, the number of pixels N of a reference image necessary for generating a predicted image can be expressed as the following equation (13).
N=(L0の式(12)に相当する画素)+(L1の式(12)に相当する画素)
・・・(13) N = (pixel corresponding to equation (12) of L0) + (pixel corresponding to equation (12) of L1)
... (13)
・・・(13) N = (pixel corresponding to equation (12) of L0) + (pixel corresponding to equation (12) of L1)
... (13)
したがって、単予測の場合、予測画像生成に必要な参照画像の画素数を、予測画像1画素当たりに換算すると、その画素数Nは、以下の式(14)のように表すことができ、双予測の場合、その画素数Nは、以下の式(15)のように表すことができる。
Therefore, in the case of uni-prediction, when the number of pixels of the reference image necessary for generating the predicted image is converted per pixel of the predicted image, the number of pixels N can be expressed as the following equation (14). In the case of prediction, the number of pixels N can be expressed as the following Expression (15).
N=(式(12)に相当する画素数)/(BH×BV) ・・・(14)
N=(式(13)に相当する画素数)/(BH×BV) ・・・(15) N = (number of pixels corresponding to equation (12)) / (BH × BV) (14)
N = (number of pixels corresponding to equation (13)) / (BH × BV) (15)
N=(式(13)に相当する画素数)/(BH×BV) ・・・(15) N = (number of pixels corresponding to equation (12)) / (BH × BV) (14)
N = (number of pixels corresponding to equation (13)) / (BH × BV) (15)
以上のように、予測画像生成のために必要な参照画像の、予測画像1画素当たりの画素数は、ブロックサイズと予測モード(単予測か双予測か)に依存するだけでなく、タップ数、すなわち、生成する動きベクトルの精度が整数精度であるか分数精度であるかにも依存する。
As described above, the number of pixels per pixel of the prediction image of the reference image necessary for generating the prediction image not only depends on the block size and the prediction mode (uni-prediction or bi-prediction), but also the number of taps, That is, it depends on whether the accuracy of the motion vector to be generated is integer accuracy or fractional accuracy.
横方向のメモリアライメントの画素数AHを8とし、縦方向のメモリアライメントの画素数AVを2とした場合の、予測モード、PUサイズ、動きベクトル精度、および、予測画像生成に必要な参照画像の予測画像1画素当たりの画素数の関係は、図4および図5に示される表のようにになる。図4は、予測モードが単予測(Uni)の場合の、PUサイズ、動きベクトル精度、および、予測画像生成に必要な参照画像の予測画像1画素当たりの画素数の関係を示しており、図5は、予測モードが双予測(Bi)の場合の、PUサイズ、動きベクトル精度、および、予測画像生成に必要な参照画像の予測画像1画素当たりの画素数の関係を示している。
Prediction mode, PU size, motion vector accuracy, and reference image necessary for prediction image generation when the horizontal memory alignment pixel number AH is 8 and the vertical memory alignment pixel number AV is 2. The relationship of the number of pixels per predicted image pixel is as shown in the tables shown in FIGS. FIG. 4 shows the relationship between the PU size, the motion vector accuracy, and the number of pixels per predicted image pixel of the reference image necessary for generating the predicted image when the prediction mode is uni-prediction (Uni). 5 shows the relationship between the PU size, the motion vector accuracy, and the number of pixels per pixel of the predicted image of the reference image necessary for generating the predicted image when the prediction mode is bi-prediction (Bi).
図4および図5に示される表において、「Bi/Uni」の列の値は、予測モード(単予測(Uni)であるか双予測(Bi)であるか)を示している。
4 and 5, the value in the column “Bi / Uni” indicates the prediction mode (whether it is single prediction (Uni) or bi-prediction (Bi)).
「BH」の列の数字は、PUの横方向のサイズを示し、「BV」の列の数字は、PUの縦方向のサイズを示している。
The numbers in the “BH” column indicate the horizontal size of the PU, and the numbers in the “BV” column indicate the vertical size of the PU.
「Sub pel」の列の値は、動きベクトルの精度を示している。「integer」は、動きベクトルが、縦方向と横方向の両方とも整数精度であることを示している。「abc」は、動きベクトルが、縦方向が整数精度で横方向が分数精度であることを示している。「dhn」は、動きベクトルが、縦方向が分数精度で横方向が整数精度であることを示している。「efgijkpqr」は、動きベクトルが、縦方向と横方向の両方とも分数精度であることを示している。
The value in the “Sub pel” column indicates the accuracy of the motion vector. “Integer” indicates that the motion vector has integer precision in both the vertical direction and the horizontal direction. “Abc” indicates that the motion vector has integer precision in the vertical direction and fractional precision in the horizontal direction. “Dhn” indicates that the motion vector has fractional accuracy in the vertical direction and integer accuracy in the horizontal direction. “Efgijkpqr” indicates that the motion vector has fractional accuracy in both the vertical and horizontal directions.
「入力画素数」の列の数字は、予測画像生成に必要な参照画像の予測画像1画素当たりの画素数を示している。つまり、フレームメモリから読み出されるデータ量の大きさを表している。なお、AVCの場合、この入力画素数の最大値は、「13.5」である(「13.5」で制限される)。図4および図5の斜線部分は、この「入力画素数」の値が、「13.5」より大きい(AVCの制限より大きい)例を示している。
The number in the column “number of input pixels” indicates the number of pixels per pixel of the predicted image of the reference image necessary for generating the predicted image. That is, it represents the amount of data read from the frame memory. In the case of AVC, the maximum value of the number of input pixels is “13.5” (restricted by “13.5”). The hatched portions in FIGS. 4 and 5 show examples in which the value of “number of input pixels” is larger than “13.5” (larger than the AVC limit).
AVCのような従来の場合、予測モードとブロックサイズのみで制限されるため、8×8より小さいブロックサイズと、8×8ブロックサイズの双予測は、全て禁止とされてしまっていた。
In the conventional case such as AVC, since it is limited only by the prediction mode and the block size, block prediction smaller than 8 × 8 and bi-prediction of 8 × 8 block size are all prohibited.
しかしながら、図4や図5に示されるように、8×8より小さいブロックサイズであっても、予測画像生成に必要な参照画像の予測画像1画素当たりの画素数がAVCの上限を下回る場合があった。つまり、動きベクトルの精度による負荷への影響は考慮されていないため、インター予測モードの選択肢が不要に制限されており、結果として符号化効率が低減する恐れがあった。
However, as shown in FIG. 4 and FIG. 5, even if the block size is smaller than 8 × 8, the number of pixels per predicted image pixel of the reference image necessary for predicted image generation may be below the upper limit of AVC. there were. That is, since the influence on the load due to the accuracy of the motion vector is not taken into consideration, the options of the inter prediction mode are unnecessarily limited, and as a result, the coding efficiency may be reduced.
そこで動きベクトル制御部121は、符号化効率の不要な低減をできるだけ抑制するために、ブロックサイズや予測モードだけでなく、動きベクトルの精度も考慮して動き予測を制限する。
Therefore, the motion vector control unit 121 limits motion prediction in consideration of not only the block size and the prediction mode but also the accuracy of the motion vector in order to suppress unnecessary reduction in encoding efficiency as much as possible.
[動き予測・補償部、動きベクトル制御部]
図6は、図1の動き予測・補償部115および動きベクトル制御部121の主な構成例を示すブロック図である。 [Motion prediction / compensation unit, motion vector control unit]
FIG. 6 is a block diagram illustrating a main configuration example of the motion prediction /compensation unit 115 and the motion vector control unit 121 of FIG.
図6は、図1の動き予測・補償部115および動きベクトル制御部121の主な構成例を示すブロック図である。 [Motion prediction / compensation unit, motion vector control unit]
FIG. 6 is a block diagram illustrating a main configuration example of the motion prediction /
図6に示されるように、動き予測・補償部115は、動き探索部151、コスト関数算出部152、モード判定部153、動き補償部154、および動き情報バッファ155を有する。
6, the motion prediction / compensation unit 115 includes a motion search unit 151, a cost function calculation unit 152, a mode determination unit 153, a motion compensation unit 154, and a motion information buffer 155.
また、動きベクトル制御部121は、制限情報記憶部161、制限制御部162、および伝送情報提供部163を有する。
In addition, the motion vector control unit 121 includes a restriction information storage unit 161, a restriction control unit 162, and a transmission information providing unit 163.
動き予測・補償部115の動き探索部151は、画面並べ替えバッファ102から取得した入力画像画素値と、フレームメモリ112から取得した参照画像画素値とを用いて、候補とする全てのインター予測モードで予測処理単位の領域毎に動き探索を行い、動きベクトルを求め、さらに、入力画像と参照画像との差分画像(差分画素値)を求める。
The motion search unit 151 of the motion prediction / compensation unit 115 uses all the input image pixel values acquired from the screen rearrangement buffer 102 and the reference image pixel values acquired from the frame memory 112 as candidates for all inter prediction modes. Then, a motion search is performed for each region of the prediction processing unit, a motion vector is obtained, and a difference image (difference pixel value) between the input image and the reference image is obtained.
その際、動き探索部151は、動きベクトル制御部121の制限制御部162から供給される動きベクトル制御情報を取得し、その動きベクトル制御情報により示される制限に従って、動き探索を行う。つまり、動き探索部151は、制限制御部162に許可されたモードのみを候補のインター予測モードとし、そのモードについてのみ動き探索を行う。換言するに、動き探索部151は、制限制御部162により禁止されたモードの動き探索を行わない。
At that time, the motion search unit 151 acquires the motion vector control information supplied from the restriction control unit 162 of the motion vector control unit 121, and performs a motion search according to the restriction indicated by the motion vector control information. That is, the motion search unit 151 sets only a mode permitted by the restriction control unit 162 as a candidate inter prediction mode, and performs a motion search only for that mode. In other words, the motion search unit 151 does not perform a motion search in a mode prohibited by the restriction control unit 162.
また、動き探索部151は、処理対象のPUである注目PUの周辺に位置する周辺PUの動きベクトルに関する情報である周辺動き情報を、動き情報バッファ155から適宜取得し、メディアン予測やマージモード等の処理を行う。
In addition, the motion search unit 151 appropriately acquires peripheral motion information, which is information on motion vectors of peripheral PUs located around the attention PU that is the processing target PU, from the motion information buffer 155, and performs median prediction, merge mode, and the like. Perform the process.
動き探索部151は、以上のように求めた各インター予測モードの動きに関する情報(動き情報)をコスト関数算出部152に供給する。なお、動き探索部151は、必要に応じて、動き情報に対応する差分画素値もコスト関数算出部152に供給する。
The motion search unit 151 supplies the cost function calculation unit 152 with information (motion information) regarding the motion of each inter prediction mode obtained as described above. In addition, the motion search part 151 supplies the difference pixel value corresponding to motion information to the cost function calculation part 152 as needed.
コスト関数算出部152は、動き探索部151から供給される情報に基づいて、全てのインター予測モードについてコスト関数値を算出する。コスト関数算出部152は、算出した各インター予測モードのコスト関数値を動き情報とともにモード判定部153に供給する。
The cost function calculation unit 152 calculates cost function values for all inter prediction modes based on the information supplied from the motion search unit 151. The cost function calculation unit 152 supplies the calculated cost function value of each inter prediction mode to the mode determination unit 153 together with the motion information.
モード判定部153は、コスト関数値が最小のインター予測モードを最適なインター予測モードとして判定する。モード判定部153は、この最適なインター予測モードを示す最適予測モード情報を、その最適なインター予測モードの動き情報とともに、動き補償部154に供給する。
The mode determination unit 153 determines the inter prediction mode having the smallest cost function value as the optimal inter prediction mode. The mode determination unit 153 supplies the optimal prediction mode information indicating the optimal inter prediction mode to the motion compensation unit 154 together with the motion information of the optimal inter prediction mode.
動き補償部154は、最適なインター予測モードで動き補償を行い、予測画像を生成する。動き補償部154は、モード判定部153から取得した動き情報と最適予測モード情報に従って、最適なインター予測モードで、動き情報に対応する参照画像画素値をフレームメモリ112から取得する。なお、最適なインター予測モードが、MVコンペティションモードやマージモードのように、モード判定部153から動きベクトルが供給されないモードの場合、動き補償部154は、動き情報バッファ155から、周辺PUの動き情報(周辺動き情報)を取得する。
The motion compensation unit 154 performs motion compensation in an optimal inter prediction mode and generates a predicted image. The motion compensation unit 154 acquires the reference image pixel value corresponding to the motion information from the frame memory 112 in the optimal inter prediction mode according to the motion information and the optimal prediction mode information acquired from the mode determination unit 153. Note that when the optimal inter prediction mode is a mode in which no motion vector is supplied from the mode determination unit 153, such as the MV competition mode or the merge mode, the motion compensation unit 154 receives the motion information of the peripheral PU from the motion information buffer 155. (Peripheral motion information) is acquired.
上述したように、動き探索部151においては、制限制御部162により許可された予測モードのみが候補のインター予測モードとされ、そのモードについてのみ動き探索が行われる。したがって、最適インター予測モードとされるモードも、制限制御部162により許可されたモードの中の1つである。つまり、動き補償部154は、制限制御部162により許可されたインター予測モードで動き補償を行う。
As described above, in the motion search unit 151, only the prediction mode permitted by the restriction control unit 162 is set as the candidate inter prediction mode, and the motion search is performed only for that mode. Therefore, the mode set as the optimal inter prediction mode is also one of the modes permitted by the restriction control unit 162. That is, the motion compensation unit 154 performs motion compensation in the inter prediction mode permitted by the restriction control unit 162.
動き補償部154は、生成した予測画像画素値を予測画像選択部116に供給し、演算部103において入力画像から減算させたり、演算部110において差分画像に加算させたりする。
The motion compensation unit 154 supplies the generated predicted image pixel value to the predicted image selection unit 116 and causes the calculation unit 103 to subtract from the input image or causes the calculation unit 110 to add to the difference image.
また、動き補償部154は、最適予測モード情報および動き情報を可逆符号化部106に供給し、符号化させる。さらに、動き補償部154は、動き情報を動き情報バッファ155に供給し、記憶させる。
Also, the motion compensation unit 154 supplies the optimal prediction mode information and the motion information to the lossless encoding unit 106 for encoding. Further, the motion compensation unit 154 supplies the motion information to the motion information buffer 155 and stores it.
動き情報バッファ155は、動き補償部154から供給される動き情報を取得し、記憶する。動き情報バッファ155は、記憶している動き情報を、周辺動き情報として、動き探索部151や動き補償部154に供給する。
The motion information buffer 155 acquires and stores motion information supplied from the motion compensation unit 154. The motion information buffer 155 supplies the stored motion information to the motion search unit 151 and the motion compensation unit 154 as peripheral motion information.
動きベクトル制御部121の制限情報記憶部161は、インター予測モードの制限に関する情報である制限情報を予め記憶し、所定のタイミングにおいて、若しくは、例えば制限制御部162等の他の処理部からの要求に従って、その制限情報を制限制御部162に供給する。
The restriction information storage unit 161 of the motion vector control unit 121 stores restriction information that is information related to the restriction of the inter prediction mode in advance, and at a predetermined timing or a request from another processing unit such as the restriction control unit 162, for example. Accordingly, the restriction information is supplied to the restriction controller 162.
この制限情報は、例えば、「L0予測若しくはL1予測では、4×4ブロックサイズを禁止する。双予測では、4×4,8×4,4×8および8×8の「efgijkpqr」を禁止する」といったように、動きベクトル制御情報において、少なくとも、動きベクトルの精度による制限(図4および図5における「Sub pel」の列による制限)を含むようにすることができる情報である。換言するに、そのような制限を提供することができる限り制限情報はどのような情報であっても良い。また、具体的にどのモード(予測モード(単予測、双予測)、ブロックサイズ、動きベクトルの精度の組み合わせ)を制限するかは、任意である。つまり、図4や図5の例の場合、テーブルのどの行が禁止されるようにしてもよい。
This restriction information is, for example, “4 × 4 block size is prohibited in L0 prediction or L1 prediction. 4 × 4, 8 × 4, 4 × 8 and 8 × 8“ efgijkpqr ”are prohibited in bi-prediction. In the motion vector control information, information that can include at least a restriction due to the accuracy of the motion vector (a restriction due to the column “Sub pel” in FIGS. 4 and 5). In other words, the restriction information may be any information as long as such restriction can be provided. Further, which mode (prediction mode (single prediction, bi-prediction), combination of block size and motion vector accuracy) is specifically limited is arbitrary. That is, in the example of FIGS. 4 and 5, any row in the table may be prohibited.
制限制御部162は、制限情報記憶部161から制限情報を取得すると、その制限情報に従って、動き探索部151に制限内容を通知する動きベクトル制御情報を生成し、生成した動きベクトル制御情報を動き探索部151に供給する。動き探索部151は、この動きベクトル制御情報の制限に従って動き探索を行うことにより、制限情報により許可されたモードでのみ動き探索を行うことができる。
When the restriction control unit 162 acquires the restriction information from the restriction information storage unit 161, the restriction control unit 162 generates motion vector control information for notifying the motion search unit 151 of the restriction content according to the restriction information, and performs motion search on the generated motion vector control information. To the unit 151. The motion search unit 151 can perform a motion search only in a mode permitted by the restriction information by performing a motion search according to the restriction of the motion vector control information.
このようにすることにより、動き補償部154も、制限情報により許可されたモードで動き補償を行うことができる。つまり、動きベクトル制御部121(制限制御部162)は、動きベクトルの精度によって動き予測を制限することができる。したがって、動きベクトル制御部121(制限制御部162)は、符号化効率の低減を抑制しながら負荷の増大を抑制し、画像を符号化させることができる。したがって、画像符号化装置100は、符号化効率の抑制を低減させることができる。
By doing in this way, the motion compensation unit 154 can also perform motion compensation in the mode permitted by the restriction information. That is, the motion vector control unit 121 (restriction control unit 162) can limit motion prediction based on the accuracy of the motion vector. Therefore, the motion vector control unit 121 (restriction control unit 162) can suppress an increase in load while suppressing a decrease in encoding efficiency, and can encode an image. Therefore, the image encoding device 100 can reduce suppression of encoding efficiency.
また、制限制御部162は、制限内容を示す情報を伝送させるように指示する制御情報を伝送情報提供部163に供給する。この制御情報には、制限情報による制限内容も含まれる。伝送情報提供部163は、供給された制御情報に従って、制限情報による制限の内容を示す伝送情報を生成し、その伝送情報を可逆符号化部106に供給し、伝送させる。
Further, the restriction control unit 162 supplies the transmission information providing unit 163 with control information instructing to transmit information indicating the restriction content. This control information includes the restriction content by the restriction information. The transmission information providing unit 163 generates transmission information indicating the content of restriction by the restriction information in accordance with the supplied control information, and supplies the transmission information to the lossless encoding unit 106 for transmission.
この伝送情報は、制限情報の制限内容を示す情報であればどのようなものであってもよい。例えば、制限情報を伝送情報に含めるようにしてもよいし、制限の内容を伝送情報に含めるようにしてもよいし、制限内容と等価の情報を伝送情報に含めるようにしてもよい。
This transmission information may be any information as long as it indicates the restriction content of the restriction information. For example, the restriction information may be included in the transmission information, the restriction content may be included in the transmission information, or information equivalent to the restriction content may be included in the transmission information.
以上に説明したように、動きベクトルの精度による制限が含まれていれば、動きベクトル制御情報による制限はどのような制限であっても良い。例えば、図4や図5のテーブルにおいて「入力画素数」の上限をAVC並み(13.5)とした場合、単予測において、4×4ブロックサイズの「abc」、「dhn」、および「efgijkpqr」と、8×8ブロックサイズの「efgijkpqr」を禁止し、それ以外を許可し、双予測において、ブロックサイズ4×4、8×4、および4×8の全てのSub pelと、8×8ブロックサイズの「dhn」および「efgijkpqr」を禁止し、それ以外を許可するようにしてもよい。
As described above, the restriction by the motion vector control information may be any restriction as long as the restriction by the accuracy of the motion vector is included. For example, if the upper limit of the “number of input pixels” is set to the same level as AVC (13.5) in the tables of FIG. 4 and FIG. ”And 8 × 8 block size“ efgijkpqr ”are prohibited, others are allowed, and in bi-prediction, all Sub pels with block sizes 4 × 4, 8 × 4, and 4 × 8, and 8 × 8 Block sizes “dhn” and “efgijkpqr” may be prohibited and others may be permitted.
このように制限することにより、従来のようにブロックサイズと予測モードのみによって制限する方法において禁止されていたモード(例えば、単予測の、4×4ブロックサイズの「integer」、8×4ブロックサイズ、4×8ブロックサイズの「integer」、「abc」、および「dhn」、並びに、双予測の8×8ブロックサイズの「integer」および「abc」)が許可されるようになる。したがって、画像符号化装置100は、予測モードの候補数を不要に低減させることを抑制することができるので、符号化効率の抑制を低減させることができる。
By limiting in this way, a mode that has been prohibited in the conventional method of limiting only by block size and prediction mode (for example, “integer” of 4 × 4 block size of single prediction, 8 × 4 block size) 4 × 8 block size “integer”, “abc”, and “dhn”, and bi-prediction 8 × 8 block size “integer” and “abc”). Therefore, since the image coding apparatus 100 can suppress unnecessarily reducing the number of prediction mode candidates, it is possible to reduce suppression of encoding efficiency.
なお、動きベクトルの精度による制限以外の制限は任意であるので、動きベクトル制御情報による制限において、予測モードやブロックサイズ以外の制限が含まれていても良いし、予測モードやブロックサイズによる制限が省略されても良い。また、伝送情報は、動きベクトル制御情報による制限の内容が含まれていればよい。さらに、動きベクトル制御情報や伝送情報に、動きベクトルの精度による制限が含まれている限り、制限情報は、どのような情報であっても良い。
In addition, since restrictions other than the restriction | limiting by the precision of a motion vector are arbitrary, restrictions other than prediction mode and block size may be included in the restriction | limiting by motion vector control information, and the restriction | limiting by prediction mode or block size may be included. It may be omitted. Further, the transmission information only needs to include the content of restriction by the motion vector control information. Furthermore, the restriction information may be any information as long as the motion vector control information and the transmission information include a restriction due to the accuracy of the motion vector.
[符号化処理の流れ]
次に、以上のような画像符号化装置100により実行される各処理の流れについて説明する。最初に、図7のフローチャートを参照して、符号化処理の流れの例を説明する。 [Flow of encoding process]
Next, the flow of each process executed by theimage encoding device 100 as described above will be described. First, an example of the flow of the encoding process will be described with reference to the flowchart of FIG.
次に、以上のような画像符号化装置100により実行される各処理の流れについて説明する。最初に、図7のフローチャートを参照して、符号化処理の流れの例を説明する。 [Flow of encoding process]
Next, the flow of each process executed by the
ステップS101において、A/D変換部101は入力された画像をA/D変換する。ステップS102において、画面並べ替えバッファ102は、A/D変換された画像を記憶し、各ピクチャの表示する順番から符号化する順番への並べ替えを行う。
In step S101, the A / D converter 101 performs A / D conversion on the input image. In step S102, the screen rearrangement buffer 102 stores the A / D converted image, and rearranges the picture from the display order to the encoding order.
ステップS103において、イントラ予測部114は、イントラ予測モードのイントラ予測処理を行う。
In step S103, the intra prediction unit 114 performs an intra prediction process in the intra prediction mode.
ステップS104において、動きベクトル制御部121は、動き予測・補償処理の動きベクトル生成を制限する動きベクトル制御処理を行う。
In step S104, the motion vector control unit 121 performs a motion vector control process that restricts motion vector generation in the motion prediction / compensation process.
ステップS105において、動き予測・補償部115は、ステップS104による動きベクトルの制限に従って、インター予測モードでの動き予測や動き補償を行うインター動き予測処理を行う。
In step S105, the motion prediction / compensation unit 115 performs inter motion prediction processing for performing motion prediction and motion compensation in the inter prediction mode in accordance with the motion vector restriction in step S104.
ステップS106において、予測画像選択部116は、イントラ予測部114および動き予測・補償部115から出力された各コスト関数値に基づいて、最適予測モードを決定する。つまり、予測画像選択部116は、イントラ予測部114により生成された予測画像と、動き予測・補償部115により生成された予測画像のいずれか一方を選択する。
In step S106, the predicted image selection unit 116 determines the optimal prediction mode based on the cost function values output from the intra prediction unit 114 and the motion prediction / compensation unit 115. That is, the predicted image selection unit 116 selects one of the predicted image generated by the intra prediction unit 114 and the predicted image generated by the motion prediction / compensation unit 115.
ステップS107において、演算部103は、ステップS102の処理により並び替えられた画像と、ステップS106の処理により選択された予測画像との差分を演算する。差分データは元の画像データに較べてデータ量が低減される。したがって、画像をそのまま符号化する場合に較べて、データ量を圧縮することができる。
In step S107, the calculation unit 103 calculates a difference between the image rearranged by the process of step S102 and the predicted image selected by the process of step S106. The data amount of the difference data is reduced compared to the original image data. Therefore, the data amount can be compressed as compared with the case where the image is encoded as it is.
ステップS108において、直交変換部104は、ステップS107の処理により生成された差分情報を直交変換する。具体的には、離散コサイン変換、カルーネン・レーベ変換等の直交変換が行われ、変換係数が出力される。
In step S108, the orthogonal transform unit 104 orthogonally transforms the difference information generated by the process in step S107. Specifically, orthogonal transformation such as discrete cosine transformation and Karhunen-Loeve transformation is performed, and transformation coefficients are output.
ステップS109において、量子化部105は、ステップS108の処理により得られた直交変換係数を量子化する。
In step S109, the quantization unit 105 quantizes the orthogonal transform coefficient obtained by the process in step S108.
ステップS108の処理により量子化された差分情報は、次のようにして局部的に復号される。すなわち、ステップS110において、逆量子化部108はステップS109の処理により量子化された直交変換係数を、ステップS109の量子化に対応する方法で逆量子化する。ステップS111において、逆直交変換部109は、ステップS110の処理により得られた直交変換係数を、ステップS108の処理に対応する方法で逆直交変換する。
The difference information quantized by the processing in step S108 is locally decoded as follows. That is, in step S110, the inverse quantization unit 108 inversely quantizes the orthogonal transform coefficient quantized by the process in step S109 by a method corresponding to the quantization in step S109. In step S111, the inverse orthogonal transform unit 109 performs inverse orthogonal transform on the orthogonal transform coefficient obtained by the process of step S110 by a method corresponding to the process of step S108.
ステップS112において、演算部110は、予測画像を局部的に復号された差分情報に加算し、局部的に復号された画像(演算部103への入力に対応する画像)を生成する。ステップS113において、ループフィルタ111は、ステップS112の処理により生成された画像に対してループフィルタ処理を行う。これによりブロック歪み等が除去されたり画質改善が行われたりする。
In step S112, the calculation unit 110 adds the predicted image to the locally decoded difference information, and generates a locally decoded image (an image corresponding to an input to the calculation unit 103). In step S113, the loop filter 111 performs a loop filter process on the image generated by the process of step S112. As a result, block distortion or the like is removed or image quality is improved.
ステップS114において、フレームメモリ112は、ステップS113の処理によりブロック歪みの除去等が行われた画像を記憶する。なお、フレームメモリ112にはフィルタ処理されていない画像も記憶される。このフレームメモリ112に記憶された画像は、ステップS103の処理やステップS105の処理に利用される。
In step S114, the frame memory 112 stores an image from which block distortion has been removed by the process of step S113. The frame memory 112 also stores an image that has not been filtered. The image stored in the frame memory 112 is used for the process in step S103 and the process in step S105.
ステップS115おいて、可逆符号化部106は、ステップS109の処理により量子化された変換係数を符号化し、符号化データを生成する。すなわち、差分画像(インターの場合、2次差分画像)に対して、可変長符号化や算術符号化等の可逆符号化が行われる。
In step S115, the lossless encoding unit 106 encodes the transform coefficient quantized by the process of step S109, and generates encoded data. That is, lossless encoding such as variable length encoding or arithmetic encoding is performed on the difference image (secondary difference image in the case of inter).
なお、可逆符号化部106は、ステップS106の処理により選択された予測画像の予測モードに関する情報を符号化し、差分画像を符号化して得られる符号化データに付加する。例えば、イントラ予測モードが選択された場合、可逆符号化部106は、イントラ予測モード情報を符号化する。また、例えば、インター予測モードが選択された場合、可逆符号化部106は、インター予測モード情報を符号化する。これらの情報は、例えばヘッダ情報等として符号化データに付加(多重化)される。
Note that the lossless encoding unit 106 encodes information related to the prediction mode of the prediction image selected by the processing in step S106, and adds the encoded information to the encoded data obtained by encoding the difference image. For example, when the intra prediction mode is selected, the lossless encoding unit 106 encodes the intra prediction mode information. For example, when the inter prediction mode is selected, the lossless encoding unit 106 encodes the inter prediction mode information. These pieces of information are added (multiplexed) to the encoded data as header information, for example.
ステップS116において蓄積バッファ107は、ステップS115の処理により生成された符号化データを蓄積する。蓄積バッファ107に蓄積された符号化データは、適宜読み出され、任意の伝送路(通信路だけでなく記憶媒体等も含む)を介して復号側の装置に伝送される。
In step S116, the accumulation buffer 107 accumulates the encoded data generated by the process in step S115. The encoded data stored in the storage buffer 107 is read out as appropriate, and transmitted to a decoding-side device via an arbitrary transmission path (including not only a communication path but also a storage medium).
ステップS117においてレート制御部117は、ステップS116の処理により蓄積バッファ107に蓄積された圧縮画像に基づいて、オーバーフローあるいはアンダーフローが発生しないように、量子化部105の量子化動作のレートを制御する。
In step S117, the rate control unit 117 controls the quantization operation rate of the quantization unit 105 based on the compressed image accumulated in the accumulation buffer 107 by the processing in step S116 so that overflow or underflow does not occur. .
ステップS117の処理が終了すると、符号化処理が終了される。
When the process of step S117 ends, the encoding process ends.
[動きベクトル制御処理の流れ]
次に、図8のフローチャートを参照して、図7のステップS104において実行される動きベクトル制御処理の流れの例を説明する。 [Flow of motion vector control processing]
Next, an example of the flow of the motion vector control process executed in step S104 of FIG. 7 will be described with reference to the flowchart of FIG.
次に、図8のフローチャートを参照して、図7のステップS104において実行される動きベクトル制御処理の流れの例を説明する。 [Flow of motion vector control processing]
Next, an example of the flow of the motion vector control process executed in step S104 of FIG. 7 will be described with reference to the flowchart of FIG.
動きベクトル制御処理が開始されると、動きベクトル制御部121の制限制御部162は、ステップS131において、制限情報記憶部161から読み出した制限情報に基づいて、使用を禁止する動きベクトルを決定する。
When the motion vector control process is started, the restriction control unit 162 of the motion vector control unit 121 determines a motion vector to be prohibited from use based on the restriction information read from the restriction information storage unit 161 in step S131.
ステップS132において、制限制御部162は、ステップS131において決定した動きベクトルの使用制限の内容を示す動きベクトル制御情報を生成し、動き探索部151に供給する。
In step S132, the restriction control unit 162 generates motion vector control information indicating the content of the use restriction of the motion vector determined in step S131, and supplies the motion vector control information to the motion search unit 151.
ステップS133において、伝送情報提供部163は、ステップS131において決定した動きベクトルの使用制限の内容を示す、伝送用の伝送情報を生成し、可逆符号化部106に供給して、伝送させる。
In step S133, the transmission information providing unit 163 generates transmission information for transmission indicating the content of the use restriction of the motion vector determined in step S131, and supplies the transmission information to the lossless encoding unit 106 for transmission.
ステップS133の処理が終了すると、伝送情報提供部163は、動きベクトル制御処理を終了し、処理を図7に戻す。
When the process of step S133 is completed, the transmission information providing unit 163 ends the motion vector control process and returns the process to FIG.
[インター動き予測処理の流れ]
次に、図9のフローチャートを参照して、図7のステップS105において実行されるインター動き予測処理の流れの例を説明する。 [Flow of inter motion prediction processing]
Next, an example of the flow of inter motion prediction processing executed in step S105 of FIG. 7 will be described with reference to the flowchart of FIG.
次に、図9のフローチャートを参照して、図7のステップS105において実行されるインター動き予測処理の流れの例を説明する。 [Flow of inter motion prediction processing]
Next, an example of the flow of inter motion prediction processing executed in step S105 of FIG. 7 will be described with reference to the flowchart of FIG.
インター動き予測処理が開始されると、ステップS151において、動き探索部151は、ステップS132の処理により生成された動きベクトル制御情報に示される制限内容に従って、許可された候補のインター予測モードのそれぞれについて動き探索を行い、動き情報と差分画素値を生成する。
When the inter motion prediction process is started, in step S151, the motion search unit 151 determines each of the permitted candidate inter prediction modes according to the restriction content indicated in the motion vector control information generated by the process of step S132. A motion search is performed to generate motion information and a difference pixel value.
ステップS152において、コスト関数算出部152は、ステップS151の処理により得られた動き情報および差分画素値に基づいて、許可された候補のインター予測モードのそれぞれについてのコスト関数値を算出する。
In step S152, the cost function calculation unit 152 calculates a cost function value for each of the permitted candidate inter prediction modes based on the motion information and the difference pixel value obtained by the process in step S151.
ステップS153において、モード判定部153は、ステップS152において算出されたコスト関数値が最も小さいモードを最適インター予測モードに決定する。
In step S153, the mode determination unit 153 determines the mode having the smallest cost function value calculated in step S152 as the optimal inter prediction mode.
ステップS154において、動き補償部154は、ステップS153の処理により決定された最適インター予測モードで動き補償を行い、予測画像を生成する。この最適インター予測モードは、ステップS132の処理により生成された動きベクトル制御情報により使用が許可されたモードである。
In step S154, the motion compensation unit 154 performs motion compensation in the optimal inter prediction mode determined by the process in step S153, and generates a predicted image. This optimal inter prediction mode is a mode that is permitted to be used by the motion vector control information generated by the process of step S132.
ステップS155において、動き補償部154は、ステップS154の処理により生成した予測画像を、予測画像選択部116に供給し、差分画像情報および復号画像情報を生成させる。また、動き補償部154は、ステップS156において、最適インター予測モードに関する情報(最適予測モード情報や動き情報)を可逆符号化部106に供給し、符号化させる。
In step S155, the motion compensation unit 154 supplies the predicted image generated by the process of step S154 to the predicted image selection unit 116, and generates difference image information and decoded image information. Also, in step S156, the motion compensation unit 154 supplies information related to the optimal inter prediction mode (optimum prediction mode information and motion information) to the lossless encoding unit 106 for encoding.
ステップS157において、動き情報バッファ155は、最適インター予測モードの動き情報を記憶する。
In step S157, the motion information buffer 155 stores the motion information in the optimal inter prediction mode.
ステップS157の処理が終了すると、動き情報バッファ155は、最適インター予測モードを終了し、処理を図7に戻す。
When the process of step S157 is completed, the motion information buffer 155 ends the optimal inter prediction mode and returns the process to FIG.
以上のように各処理を実行することにより、画像符号化装置100は、符号化効率の抑制を低減させることができる。
By executing each process as described above, the image encoding device 100 can reduce suppression of encoding efficiency.
[プロファイルとレベル]
画像符号化装置100は、ストリームを生成する際、プロファイル(profile)とレベル(level)に応じた制限を設けるようにしてもよい。これは、デコーダの利用可能な資源をエンコーダで考慮するためである。ここで、資源とは処理量やメモリ帯域などを指している。プロファイルは利用可能なエンコード・ツールを決める。レベルはプロファイルで利用可能なツールの中で、更に制限を加えるために決められる。 Profile and level
Theimage encoding apparatus 100 may provide a restriction according to a profile and a level when generating a stream. This is because the encoder considers the available resources of the decoder. Here, the resource indicates a processing amount, a memory bandwidth, and the like. The profile determines the available encoding tools. Levels are set to further limit the tools available in the profile.
画像符号化装置100は、ストリームを生成する際、プロファイル(profile)とレベル(level)に応じた制限を設けるようにしてもよい。これは、デコーダの利用可能な資源をエンコーダで考慮するためである。ここで、資源とは処理量やメモリ帯域などを指している。プロファイルは利用可能なエンコード・ツールを決める。レベルはプロファイルで利用可能なツールの中で、更に制限を加えるために決められる。 Profile and level
The
このプロファイルやレベルの具体的な制限内容が標準化規格で決められていれば、その規格に基づいて生成されたストリームが、規格に準拠するデコーダで再生できることを保証することができるようになる。
If the specific restriction contents of this profile and level are determined by the standardization standard, it can be ensured that the stream generated based on the standard can be reproduced by a decoder compliant with the standard.
例えば、動き補償(MC)処理に関するメモリ帯域の上限が、このプロファイルやレベルにより定義されている場合、エンコーダは、その上限を超えないようにビットストリームを生成することにより、デコーダによる復号を保証することができる。
For example, when the upper limit of the memory bandwidth related to motion compensation (MC) processing is defined by this profile or level, the encoder guarantees decoding by the decoder by generating a bit stream so as not to exceed the upper limit. be able to.
[プロファイルとレベルの転送]
その場合、伝送情報提供部163は、画像符号化装置100が、ビットストリームをいずれのプロファイルやレベルに基づいて生成したかを示す情報を伝送情報に含める。通常、プロファイルやレベルに関する情報はシーケンスパラメータセット(SPS(Sequence Parameter Set)と呼ばれるビットストリームのヘッダ情報に付加(多重化)される。 Transfer profile and level
In that case, the transmissioninformation providing unit 163 includes information indicating which profile or level the image encoding device 100 has generated the bitstream in the transmission information. Normally, information on profiles and levels is added (multiplexed) to header information of a bit stream called a sequence parameter set (SPS (Sequence Parameter Set)).
その場合、伝送情報提供部163は、画像符号化装置100が、ビットストリームをいずれのプロファイルやレベルに基づいて生成したかを示す情報を伝送情報に含める。通常、プロファイルやレベルに関する情報はシーケンスパラメータセット(SPS(Sequence Parameter Set)と呼ばれるビットストリームのヘッダ情報に付加(多重化)される。 Transfer profile and level
In that case, the transmission
デコーダは、ビットストリームに含められたプロファイルやレベルに関する情報を読みだして、そのストリームをデコードするために必要な資源を把握することができる。
The decoder can read the information about the profile and level included in the bit stream, and can grasp the resources necessary for decoding the stream.
[具体例]
より具体的に説明する。画像符号化装置100は生成するストリームに制限を加える。この時、想定する画像復号装置の利用可能な資源から、画像符号化装置100は制限を加えることになる。Pスライスが使用可能なプロファイルをBP(Baseline profile)と称し、PスライスおよびBスライスが使用可能なプロファイルをHP(High profile)と称するとする。 [Concrete example]
This will be described more specifically. Theimage encoding device 100 adds a restriction to the stream to be generated. At this time, the image encoding apparatus 100 places a restriction on the resources available for the assumed image decoding apparatus. A profile in which the P slice can be used is referred to as BP (Baseline profile), and a profile in which the P slice and B slice can be used is referred to as HP (High profile).
より具体的に説明する。画像符号化装置100は生成するストリームに制限を加える。この時、想定する画像復号装置の利用可能な資源から、画像符号化装置100は制限を加えることになる。Pスライスが使用可能なプロファイルをBP(Baseline profile)と称し、PスライスおよびBスライスが使用可能なプロファイルをHP(High profile)と称するとする。 [Concrete example]
This will be described more specifically. The
画像符号化装置100は、画像復号装置がBスライスを処理することができるか否かを考慮し、できないのであれば、BPの制限に基づいてストリームを生成し、ストリームにこのストリームがBPの制限で生成されていることを示す情報を含める。これに対して、画像復号装置がBスライスを処理することができる場合、若しくは画像復号装置の能力を考慮しない場合、画像符号化装置100は、HPの制限に基づいてストリームを生成し、ストリームにHPで生成されていることを示す情報を含める。
The image encoding device 100 considers whether or not the image decoding device can process the B slice. If not, the image encoding device 100 generates a stream based on the BP restriction, and the stream is limited to the BP restriction. Include information indicating that it was generated in. On the other hand, when the image decoding apparatus can process the B slice, or when the capability of the image decoding apparatus is not considered, the image encoding apparatus 100 generates a stream based on the limitation of HP, and Include information indicating that it is generated by HP.
画像符号化装置100は、さらに、レベルによって制限を加える。例えば、メモリ帯域で図4や図5の表で入力画素数18以上を制限するレベル(level)をlevel=1とし、14以上を制限するレベル(level)をlevel=2とするとする。
The image encoding apparatus 100 further restricts depending on the level. For example, in the table of FIG. 4 or FIG. 5 in the memory band, the level (level) for limiting the number of input pixels of 18 or more is set to level = 1, and the level (level) for limiting 14 or more is set to level = 2.
画像符号化装置100は、画像復号装置のメモリ帯域の最大が図4や図5の表で18以上か14以上かを考慮してレベルを決める。画像符号化装置100はストリームを生成する際にこのストリームがどのレベルで生成されたかを示す情報をストリームに含める。
The image encoding device 100 determines the level in consideration of whether the maximum memory bandwidth of the image decoding device is 18 or more or 14 or more in the tables of FIGS. When the image encoding apparatus 100 generates a stream, the image encoding apparatus 100 includes information indicating the level at which the stream is generated in the stream.
画像符号化装置100は、画像をエンコードする際、自身が決定したプロファイルやレベルで許される範囲でストリームを生成する。メモリ帯域に関して言えば、当該ブロックの予測モード、縦と横のブロックサイズ、MVに基づいて、図4や図5の表で1画素を生成するために読み込む入力画素数がレベルの規定を超えないものを用いてエンコードする。
When encoding an image, the image encoding device 100 generates a stream within a range permitted by the profile and level determined by itself. Regarding the memory bandwidth, the number of input pixels read to generate one pixel in the tables of FIGS. 4 and 5 does not exceed the level specification based on the prediction mode of the block, the vertical and horizontal block sizes, and the MV. Encode with something.
[動きベクトル制御部の他の例]
なお、制限情報は、任意であり、上述した例に限らない。例えば、複数の制限の候補を含み、各種条件に従ってその中から採用する制限が選択されるようにしてもよい。 [Another example of motion vector control unit]
The restriction information is arbitrary and is not limited to the example described above. For example, a plurality of restriction candidates may be included, and a restriction to be adopted may be selected from among them according to various conditions.
なお、制限情報は、任意であり、上述した例に限らない。例えば、複数の制限の候補を含み、各種条件に従ってその中から採用する制限が選択されるようにしてもよい。 [Another example of motion vector control unit]
The restriction information is arbitrary and is not limited to the example described above. For example, a plurality of restriction candidates may be included, and a restriction to be adopted may be selected from among them according to various conditions.
図10は、そのような場合の、動きベクトル制御部121の主な構成例を示すブロック図である。図10に示される例の場合、動きベクトル制御部121は、図6の制限制御部162の代わりに、制限制御部172を有する。また、図10の動きベクトル制御部121は、さらに、条件取得部171を有する。
FIG. 10 is a block diagram illustrating a main configuration example of the motion vector control unit 121 in such a case. In the example illustrated in FIG. 10, the motion vector control unit 121 includes a restriction control unit 172 instead of the restriction control unit 162 in FIG. 6. In addition, the motion vector control unit 121 in FIG. 10 further includes a condition acquisition unit 171.
条件取得部171は、動きベクトル生成に関する各種条件を取得し、それを条件情報として制限制御部172に通知する。この条件は任意であり、どのような情報であってもよい。例えば、ユーザ等による許可する動きベクトルや制限の指定であってもよいし、利用可能なメモリ帯域等のリソースに関する情報であってもよいし、符号化規格により規定される制限に関する情報であってもよい。
The condition acquisition unit 171 acquires various conditions related to motion vector generation, and notifies the restriction control unit 172 of the acquired conditions as condition information. This condition is arbitrary and may be any information. For example, it may be a motion vector permitted by a user or the like, a restriction specification, information on a resource such as an available memory band, or information on a restriction defined by a coding standard. Also good.
制限制御部172は、このような条件情報と、制限情報記憶部161から読み出した制限情報に基づいて、動きベクトルの制限内容を決定する。
The restriction control unit 172 determines the restriction content of the motion vector based on such condition information and the restriction information read from the restriction information storage unit 161.
この場合、制限情報において、制限内容が複数パターン用意されており、制限制御部172が、条件情報に応じていずれかを選択することができるようになされている。例えば、制限情報が、上述したような、「L0予測若しくはL1予測では、4×4ブロックサイズを禁止する。双予測では、4×4,8×4,4×8および8×8の「efgijkpqr」を禁止する」といったように、少なくとも、動きベクトルの精度に関する条件を複数含むようにしてもよい。この場合、制限制御部172は、条件情報に従っていずれかの制限内容を選択することにより、条件情報に応じた制限内容を決定することができる。
In this case, a plurality of patterns of restriction contents are prepared in the restriction information, and the restriction control unit 172 can select either one according to the condition information. For example, the restriction information is as described above, “4 × 4 block size is prohibited in L0 prediction or L1 prediction. In bi-prediction,“ efgijkpqr of 4 × 4, 8 × 4, 4 × 8 and 8 × 8 A plurality of conditions relating to the accuracy of the motion vector may be included, such as “prohibit”. In this case, the restriction control unit 172 can determine the restriction content according to the condition information by selecting any restriction content according to the condition information.
また、制限情報が、条件によって複数通りの演算結果が得られる所定の演算式を含むようにしてもよい。この場合、制限制御部172は、条件情報に従って演算式の係数の値を決定し、その値を用いて演算を行うことにより、条件情報に応じた制限内容を決定することができる。
Also, the restriction information may include a predetermined arithmetic expression that can obtain a plurality of arithmetic results depending on conditions. In this case, the restriction control unit 172 can determine the restriction content according to the condition information by determining the value of the coefficient of the arithmetic expression according to the condition information and performing the calculation using the value.
さらに、制限情報が、図4や図5に示されるテーブル情報を含むようにしてもよい。この場合、制限制御部172は、条件情報に従って、上限とする「入力画素数」を決定することにより、条件情報に応じた制限内容を決定することができる。
Furthermore, the restriction information may include table information shown in FIG. 4 and FIG. In this case, the restriction control unit 172 can determine the restriction content according to the condition information by determining the “number of input pixels” as the upper limit according to the condition information.
なお、図4および図5の例は、メモリアラインメントを固定した例であるが、制限情報が、各メモリアラインメントのパターンについての、このようなテーブル情報を含むようにしてもよい。その場合、制限制御部172は、条件情報に従って、メモリアラインメントを特定(決定)することにより、適用するテーブルを選択し、さらに、条件情報に従って、上限とする「入力画素数」を決定することにより、条件情報に応じた制限内容を決定することができる。
4 and 5 are examples in which the memory alignment is fixed. However, the restriction information may include such table information for each memory alignment pattern. In that case, the restriction control unit 172 specifies (determines) the memory alignment according to the condition information, selects the table to be applied, and further determines the “number of input pixels” as the upper limit according to the condition information. Therefore, it is possible to determine the restriction content according to the condition information.
また、図11や図12に示される例のように、図4および図5のテーブルの各行について「許可」するか「禁止」するかを予め決定したテーブルを制限情報に含めるようにしてもよい。つまり、この例の場合、予測モード、ブロックサイズ、および動きベクトルの精度の組み合わせについて、どれを許可し、どれを禁止するかが予め定められている。したがって、制限制御部172は、条件情報に従ってメモリアラインメントを特定(決定)するだけで、条件情報に応じた制限内容を決定することができる。
Further, as in the examples shown in FIGS. 11 and 12, a table in which whether to permit or prohibit each row in the tables of FIGS. 4 and 5 may be included in the restriction information. . That is, in this example, it is determined in advance which is permitted and which is prohibited for the combination of the prediction mode, the block size, and the accuracy of the motion vector. Therefore, the restriction control unit 172 can determine the restriction content according to the condition information only by specifying (determining) the memory alignment according to the condition information.
この場合も、動きベクトルの精度による制限以外の制限は任意である。動きベクトル制御情報による制限において、予測モードやブロックサイズ以外の制限が含まれていても良いし、予測モードやブロックサイズによる制限が省略されても良い。また、伝送情報は、動きベクトル制御情報による制限の内容が含まれていればよい。さらに、動きベクトル制御情報や伝送情報に、動きベクトルの精度による制限が含まれている限り、制限情報や条件は、どのような情報であっても良い。
Also in this case, restrictions other than the restriction due to the accuracy of the motion vector are arbitrary. In the restriction by the motion vector control information, restrictions other than the prediction mode and the block size may be included, or the restriction by the prediction mode and the block size may be omitted. Further, the transmission information only needs to include the content of restriction by the motion vector control information. Furthermore, as long as the motion vector control information and the transmission information include a restriction due to the accuracy of the motion vector, the restriction information and the condition may be any information. *
[動きベクトル制御処理の流れ]
図13のフローチャートを参照して、この場合の動きb稀有撮る制御処理の流れの他の例を説明する。 [Flow of motion vector control processing]
With reference to the flowchart of FIG. 13, another example of the flow of control processing for taking a rare motion b in this case will be described.
図13のフローチャートを参照して、この場合の動きb稀有撮る制御処理の流れの他の例を説明する。 [Flow of motion vector control processing]
With reference to the flowchart of FIG. 13, another example of the flow of control processing for taking a rare motion b in this case will be described.
動きベクトル制御処理が開始されると、動きベクトル制御部121の条件取得部171は、ステップS171において、動きベクトルの制限に関する条件を取得する。
When the motion vector control process is started, the condition acquisition unit 171 of the motion vector control unit 121 acquires a condition related to the motion vector restriction in step S171.
ステップS172において、制限制御部172は、ステップS171において取得した条件と、制限情報記憶部161から読み出した制限情報とに基づいて、使用を禁止する動きベクトルを決定する。
In step S172, the restriction control unit 172 determines a motion vector that is prohibited from use based on the condition acquired in step S171 and the restriction information read from the restriction information storage unit 161.
ステップS173において、制限制御部172は、ステップS172において決定した動きベクトルの使用制限の内容を示す動きベクトル制御情報を生成し、動き探索部151に供給する。
In step S173, the restriction control unit 172 generates motion vector control information indicating the content of the use restriction of the motion vector determined in step S172, and supplies the motion vector control information to the motion search unit 151.
ステップS174において、伝送情報提供部163は、ステップS172において決定した動きベクトルの使用制限の内容を示す、伝送用の伝送情報を生成し、可逆符号化部106に供給して、伝送させる。
In step S174, the transmission information providing unit 163 generates transmission information for transmission indicating the content of the use restriction of the motion vector determined in step S172, and supplies the transmission information to the lossless encoding unit 106 for transmission.
ステップS174の処理が終了すると、伝送情報提供部163は、動きベクトル制御処理を終了し、処理を図7に戻す。
When the process of step S174 is completed, the transmission information providing unit 163 ends the motion vector control process and returns the process to FIG.
以上のように処理を行うことにより、画像符号化装置100は、符号化効率の抑制を低減させることができる。
By performing the processing as described above, the image encoding device 100 can reduce suppression of encoding efficiency.
<2.第2の実施の形態>
[画像復号装置]
以上のようにして画像符号化装置100により生成された符号化データ(ビットストリーム)は、使用される動きベクトルが制限されているのみであるので、画像符号化装置100と同様の符号化規格に準拠する従来の画像復号装置により、従来の方法で復号することができる。 <2. Second Embodiment>
[Image decoding device]
Since the encoded data (bitstream) generated by theimage encoding apparatus 100 as described above is limited only in the motion vector to be used, the encoding standard is the same as that of the image encoding apparatus 100. A conventional image decoding apparatus that conforms can perform decoding by a conventional method.
[画像復号装置]
以上のようにして画像符号化装置100により生成された符号化データ(ビットストリーム)は、使用される動きベクトルが制限されているのみであるので、画像符号化装置100と同様の符号化規格に準拠する従来の画像復号装置により、従来の方法で復号することができる。 <2. Second Embodiment>
[Image decoding device]
Since the encoded data (bitstream) generated by the
ただし、画像復号装置が、例えばプロファイルやレベル等として、画像符号化装置100により符号化データに付加(多重化)された伝送情報を解析することにより、自身がその符号化データを安全に正しく復号する事ができるか否かを判定するようにしてもよい。また、復号する事ができないと判定された場合、画像復号装置が、その復号処理を中止するようにしてもよい。このようにすることにより、オーバーフローやアンダーフロー等による不要な遅延を生じさせずに、より確実に、安全に正しく符号化データを復号することができる。
However, the image decoding apparatus analyzes the transmission information added (multiplexed) to the encoded data by the image encoding apparatus 100 as, for example, a profile or a level, so that the image decoding apparatus can safely and correctly decode the encoded data. You may make it determine whether it can do. Further, when it is determined that decoding cannot be performed, the image decoding apparatus may stop the decoding process. By doing so, encoded data can be decoded more reliably and safely without causing unnecessary delay due to overflow, underflow, or the like.
このような画像復号装置について以下に説明する。
Such an image decoding apparatus will be described below.
図14は、本技術を適用した画像処理装置である画像復号装置の主な構成例を示すブロック図である。図14に示される画像復号装置200は、上述した画像符号化装置100に対応し、画像符号化装置100が画像データを符号化して生成したビットストリーム(符号化データ)を正しく復号し、復号画像を生成する。
FIG. 14 is a block diagram illustrating a main configuration example of an image decoding device that is an image processing device to which the present technology is applied. An image decoding apparatus 200 shown in FIG. 14 corresponds to the above-described image encoding apparatus 100, correctly decodes a bit stream (encoded data) generated by encoding image data by the image encoding apparatus 100, and decodes a decoded image. Is generated.
図14に示されるように画像復号装置200は、蓄積バッファ201、可逆復号部202、逆量子化部203、逆直交変換部204、演算部205、ループフィルタ206、画面並べ替えバッファ207、およびD/A変換部208を有する。また、画像復号装置200は、フレームメモリ209、選択部210、イントラ予測部211、動き予測・補償部212、および選択部213を有する。
As illustrated in FIG. 14, the image decoding device 200 includes a storage buffer 201, a lossless decoding unit 202, an inverse quantization unit 203, an inverse orthogonal transform unit 204, a calculation unit 205, a loop filter 206, a screen rearrangement buffer 207, and a D A / A converter 208 is included. The image decoding apparatus 200 includes a frame memory 209, a selection unit 210, an intra prediction unit 211, a motion prediction / compensation unit 212, and a selection unit 213.
さらに、画像復号装置200は、復号制御部221を有する。
Furthermore, the image decoding device 200 includes a decoding control unit 221.
蓄積バッファ201は、伝送されてきた符号化データを蓄積し、所定のタイミングにおいてその符号化データを可逆復号部202に供給する。
The accumulation buffer 201 accumulates the transmitted encoded data, and supplies the encoded data to the lossless decoding unit 202 at a predetermined timing.
可逆復号部202は、蓄積バッファ201より供給された、図1の可逆符号化部106により符号化された情報を、可逆符号化部106の符号化方式に対応する方式で復号する。可逆復号部202は、復号して得られた差分画像の量子化された係数データを、逆量子化部203に供給する。
The lossless decoding unit 202 decodes the information supplied from the accumulation buffer 201 and encoded by the lossless encoding unit 106 in FIG. 1 using a method corresponding to the encoding method of the lossless encoding unit 106. The lossless decoding unit 202 supplies the quantized coefficient data of the difference image obtained by decoding to the inverse quantization unit 203.
また、可逆復号部202は、符号化データを復号して得られた最適な予測モードに関する情報を参照し、最適な予測モードにイントラ予測モードが選択されたかインター予測モードが選択されたかを判定する。つまり、可逆復号部202は、伝送されてきた符号化データにおいて採用された予測モードが、イントラ予測であるか、インター予測であるかを判定する。
In addition, the lossless decoding unit 202 refers to information on the optimal prediction mode obtained by decoding the encoded data, and determines whether the intra prediction mode or the inter prediction mode is selected as the optimal prediction mode. . That is, the lossless decoding unit 202 determines whether the prediction mode employed in the transmitted encoded data is intra prediction or inter prediction.
可逆復号部202は、その判定結果に基づいて、その予測モードに関する情報を、イントラ予測部211若しくは動き予測・補償部212に供給する。例えば、画像符号化装置100において最適な予測モードとしてイントラ予測モードが選択された場合、可逆復号部202は、符号化側から供給された、その選択されたイントラ予測モードに関する情報であるイントラ予測情報をイントラ予測部211に供給する。また、例えば、画像符号化装置100において最適な予測モードとしてインター予測モードが選択された場合、可逆復号部202は、符号化側から供給された、その選択されたインター予測モードに関する情報であるインター予測情報を動き予測・補償部212に供給する。
The lossless decoding unit 202 supplies information on the prediction mode to the intra prediction unit 211 or the motion prediction / compensation unit 212 based on the determination result. For example, when the intra prediction mode is selected as the optimal prediction mode in the image encoding device 100, the lossless decoding unit 202 is intra prediction information, which is information about the selected intra prediction mode supplied from the encoding side. Is supplied to the intra prediction unit 211. Further, for example, when the inter prediction mode is selected as the optimum prediction mode in the image encoding device 100, the lossless decoding unit 202 is an inter that is information about the selected inter prediction mode supplied from the encoding side. The prediction information is supplied to the motion prediction / compensation unit 212.
さらに、可逆復号部202は、符号化データに付加(多重化)された伝送情報、すなわち、動きベクトルの制限内容に関する情報を復号制御部221に供給する。例えば、可逆復号部202は、シーケンスパラメータセット等に記述されるプロファイルやレベル等の情報を復号制御部221に供給する。
Further, the lossless decoding unit 202 supplies transmission information added (multiplexed) to the encoded data, that is, information on the motion vector restriction content to the decoding control unit 221. For example, the lossless decoding unit 202 supplies information such as a profile and level described in the sequence parameter set to the decoding control unit 221.
なお、この伝送情報に基づいて復号制御部221により復号処理の中止を指示された場合、可逆復号部202は、自身の処理を中止し、その他の処理部の処理も中止させ、復号処理を強制終了させる。
When the decoding control unit 221 is instructed to stop the decoding process based on this transmission information, the lossless decoding unit 202 stops its own processing and also stops the processing of other processing units, forcing the decoding process. Terminate.
逆量子化部203は、可逆復号部202により復号されて得られた量子化された係数データを、図1の量子化部105の量子化方式に対応する方式(逆量子化部108と同様の方式)で逆量子化を行う。逆量子化部203は、逆量子化された係数データを逆直交変換部204に供給する。
The inverse quantization unit 203 uses the method corresponding to the quantization method of the quantization unit 105 in FIG. 1 (similar to the inverse quantization unit 108) for the quantized coefficient data obtained by decoding by the lossless decoding unit 202. Method). The inverse quantization unit 203 supplies the inversely quantized coefficient data to the inverse orthogonal transform unit 204.
逆直交変換部204は、図1の直交変換部104の直交変換方式に対応する方式で逆量子化部203から供給される係数データを逆直交変換する。逆直交変換部204は、この逆直交変換処理により、画像符号化装置100において直交変換される前の差分画像に対応する差分画像を得る。
The inverse orthogonal transform unit 204 performs inverse orthogonal transform on the coefficient data supplied from the inverse quantization unit 203 in a method corresponding to the orthogonal transform method of the orthogonal transform unit 104 in FIG. The inverse orthogonal transform unit 204 obtains a difference image corresponding to the difference image before being orthogonally transformed in the image encoding device 100 by the inverse orthogonal transform process.
逆直交変換されて得られた差分画像は、演算部205に供給される。また、演算部205には、選択部213を介して、イントラ予測部211若しくは動き予測・補償部212から予測画像が供給される。
The difference image obtained by the inverse orthogonal transform is supplied to the calculation unit 205. In addition, a prediction image is supplied to the calculation unit 205 from the intra prediction unit 211 or the motion prediction / compensation unit 212 via the selection unit 213.
演算部205は、差分画像と予測画像とを加算し、画像符号化装置100の演算部103により予測画像が減算される前の画像に対応する再構成画像を得る。演算部205は、その再構成画像をループフィルタ206に供給する。
The calculation unit 205 adds the difference image and the prediction image, and obtains a reconstructed image corresponding to the image before the prediction image is subtracted by the calculation unit 103 of the image encoding device 100. The arithmetic unit 205 supplies the reconstructed image to the loop filter 206.
ループフィルタ206は、供給された再構成画像に対して、デブロックフィルタ処理や適応ループフィルタ処理等を含むループフィルタ処理を適宜施して復号画像を生成する。例えば、ループフィルタ206は、再構成画像に対してデブロックフィルタ処理を行うことにより、ブロック歪を除去する。また、例えば、ループフィルタ206は、そのデブロックフィルタ処理結果(ブロック歪みの除去が行われた再構成画像)に対して、ウィナーフィルタ(Wiener Filter)を用いてループフィルタ処理を行うことにより画質改善を行う。
The loop filter 206 appropriately performs a loop filter process including a deblock filter process and an adaptive loop filter process on the supplied reconstructed image to generate a decoded image. For example, the loop filter 206 removes block distortion by performing a deblocking filter process on the reconstructed image. Further, for example, the loop filter 206 performs image quality improvement by performing loop filter processing using a Wiener filter on the deblock filter processing result (reconstructed image from which block distortion has been removed). I do.
なお、ループフィルタ206が行うフィルタ処理の種類は任意であり、上述した以外のフィルタ処理を行ってもよい。また、ループフィルタ206が、図1の画像符号化装置100から供給されたフィルタ係数を用いてフィルタ処理を行うようにしてもよい。
Note that the type of filter processing performed by the loop filter 206 is arbitrary, and filter processing other than that described above may be performed. Further, the loop filter 206 may perform filter processing using the filter coefficient supplied from the image encoding device 100 of FIG.
ループフィルタ206は、フィルタ処理結果である復号画像を画面並べ替えバッファ207およびフレームメモリ209に供給する。なお、このループフィルタ206によるフィルタ処理は省略することもできる。つまり、演算部205の出力が、フィルタ処理されずに、フレームメモリ209に格納されるようにすることもできる。例えば、イントラ予測部211は、この画像に含まれる画素の画素値を周辺画素の画素値として利用する。
The loop filter 206 supplies the decoded image as the filter processing result to the screen rearrangement buffer 207 and the frame memory 209. Note that the filter processing by the loop filter 206 can be omitted. That is, the output of the calculation unit 205 can be stored in the frame memory 209 without being subjected to filter processing. For example, the intra prediction unit 211 uses pixel values of pixels included in this image as pixel values of peripheral pixels.
画面並べ替えバッファ207は、供給された復号画像の並べ替えを行う。すなわち、図1の画面並べ替えバッファ102により符号化の順番のために並べ替えられたフレームの順番が、元の表示の順番に並べ替えられる。D/A変換部208は、画面並べ替えバッファ207から供給された復号画像をD/A変換し、図示せぬディスプレイに出力し、表示させる。
The screen rearrangement buffer 207 rearranges the supplied decoded images. That is, the order of frames rearranged for the encoding order by the screen rearrangement buffer 102 in FIG. 1 is rearranged in the original display order. The D / A conversion unit 208 D / A converts the decoded image supplied from the screen rearrangement buffer 207, and outputs and displays the decoded image on a display (not shown).
フレームメモリ209は、供給される再構成画像や復号画像を記憶する。また、フレームメモリ209は、所定のタイミングにおいて、若しくは、イントラ予測部211や動き予測・補償部212等の外部の要求に基づいて、記憶している再構成画像や復号画像を、選択部210を介してイントラ予測部211や動き予測・補償部212に供給する。
The frame memory 209 stores supplied reconstructed images and decoded images. Also, the frame memory 209 selects the stored reconstructed image or decoded image from the selection unit 210 at a predetermined timing or based on an external request such as the intra prediction unit 211 or the motion prediction / compensation unit 212. To the intra prediction unit 211 and the motion prediction / compensation unit 212.
なお、画像符号化装置100の符号化処理において、上述したように動きベクトルの使用が制限されているので、このフレームメモリ209からの参照画像の呼び出しによる負荷の増大(すなわち、使用するメモリ帯域の増大)も、従来の場合より抑制される。
Note that, in the encoding process of the image encoding device 100, the use of motion vectors is limited as described above, so that an increase in load due to the reference image call from the frame memory 209 (that is, the memory bandwidth to be used) Increase) is also suppressed as compared with the conventional case.
イントラ予測部211は、可逆復号部202から供給されるイントラ予測情報に基づいてイントラ予測を行い、予測画像を生成する。なお、イントラ予測部211は、可逆復号部202から供給されるイントラ予測情報に基づいて、符号化の際にイントラ予測により予測画像が生成された領域に対してのみ、図1のイントラ予測部114が行った処理のモードと同様のモードでイントラ予測を行う。
The intra prediction unit 211 performs intra prediction based on the intra prediction information supplied from the lossless decoding unit 202, and generates a predicted image. Note that the intra prediction unit 211 is based on the intra prediction information supplied from the lossless decoding unit 202 only for the region in which the prediction image is generated by the intra prediction at the time of encoding, as shown in FIG. Intra prediction is performed in a mode similar to the mode of the process performed by.
動き予測・補償部212は、可逆復号部202から供給されるインター予測情報に基づいてインター予測を行い、予測画像を生成する。なお、動き予測・補償部212は、可逆復号部202から供給されるインター予測情報に基づいて、符号化の際にインター予測が行われた領域に対してのみ、図1の動き予測・補償部115が行った処理のモードと同様のモードでインター予測を行う。
The motion prediction / compensation unit 212 performs inter prediction based on the inter prediction information supplied from the lossless decoding unit 202, and generates a prediction image. Note that the motion prediction / compensation unit 212 is based on the inter prediction information supplied from the lossless decoding unit 202, and the motion prediction / compensation unit in FIG. Inter prediction is performed in a mode similar to the mode of processing performed by 115.
イントラ予測部211若しくは動き予測・補償部212は、予測処理単位の領域毎に、生成した予測画像を、選択部213を介して演算部205に供給する。選択部213は、イントラ予測部211から供給される予測画像、若しくは、動き予測・補償部212から供給される予測画像を演算部205に供給する。
The intra prediction unit 211 or the motion prediction / compensation unit 212 supplies the generated predicted image to the calculation unit 205 via the selection unit 213 for each region of the prediction processing unit. The selection unit 213 supplies the prediction image supplied from the intra prediction unit 211 or the prediction image supplied from the motion prediction / compensation unit 212 to the calculation unit 205.
復号制御部221は、可逆復号部202から供給される伝送情報に基づいて、その符号化データを復号するか否かを制御する。
The decoding control unit 221 controls whether or not to decode the encoded data based on the transmission information supplied from the lossless decoding unit 202.
このようにすることにより、画像復号装置200は、より確実に符号化効率の低減が抑制された符号化データを復号することができ、より確実に、符号化効率の低減の抑制を実現することができる。
By doing in this way, the image decoding apparatus 200 can more reliably decode the encoded data in which the reduction of the encoding efficiency is suppressed, and more reliably realize the suppression of the reduction of the encoding efficiency. Can do.
[復号制御部]
図15は、図14の復号制御部221の主な構成例を示すブロック図である。 [Decryption control unit]
FIG. 15 is a block diagram illustrating a main configuration example of thedecoding control unit 221 of FIG.
図15は、図14の復号制御部221の主な構成例を示すブロック図である。 [Decryption control unit]
FIG. 15 is a block diagram illustrating a main configuration example of the
図15に示されるように、復号制御部221は、伝送情報取得部251、制限情報記憶部252、可否判定部253、制御部254、およびエラー処理部255を有する。
15, the decoding control unit 221 includes a transmission information acquisition unit 251, a restriction information storage unit 252, an availability determination unit 253, a control unit 254, and an error processing unit 255.
伝送情報取得部251は、可逆復号部202から供給される伝送情報を取得する。この伝送情報は、どのような情報であっても良く、動きベクトルの制限内容を直接示す情報であってもよいし、プロファイルやレベルを指定する情報であってもよい。伝送情報取得部251は、取得した伝送情報を可否判定部253に供給する。
The transmission information acquisition unit 251 acquires transmission information supplied from the lossless decoding unit 202. This transmission information may be any information, may be information directly indicating the restriction content of the motion vector, or may be information specifying a profile or a level. The transmission information acquisition unit 251 supplies the acquired transmission information to the availability determination unit 253.
制限情報記憶部252は、利用可能な動きベクトルを制限する制限情報を記憶する。この制限情報は、フレームメモリ209からの参照画像の読み出しによるメモリ帯域の占有度を抑制するためのものである。この制限情報には、少なくとも、可否判定部253において、実質的に、動きベクトルの精度による制限を行うことができる情報が含まれていればよい。
The restriction information storage unit 252 stores restriction information for restricting usable motion vectors. This restriction information is for suppressing the occupancy of the memory band due to the reading of the reference image from the frame memory 209. The restriction information only needs to include at least information that can be substantially restricted by the accuracy of the motion vector in the availability determination unit 253.
可否判定部253は、制限情報記憶部252から読み出した制限情報と、伝送情報取得部251から供給された伝送情報とに基づいて、画像復号装置200が符号化データを安全に正しく復号することができるか否かを判定する。
Based on the restriction information read from the restriction information storage unit 252 and the transmission information supplied from the transmission information acquisition unit 251, the availability determination unit 253 may allow the image decoding apparatus 200 to safely and correctly decode the encoded data. Determine if you can.
伝送情報には、符号化データにおける動きベクトルの精度に基づいた制限についての情報が含まれる。したがって、可否判定部253は、この伝送情報から、この符号化データを復号するのに必要なメモリ帯域を把握することができる。また、可否判定部253は、制限情報に含まれる動きベクトルの精度に基づいた制限から、利用可能なメモリ帯域の上限を把握することができる。つまり、可否判定部253は、それらを比較することにより、符号化データを復号可能か否かを判定する。
The transmission information includes information on restrictions based on the accuracy of motion vectors in the encoded data. Therefore, the availability determining unit 253 can grasp the memory bandwidth necessary for decoding the encoded data from the transmission information. Further, the availability determination unit 253 can grasp the upper limit of the available memory bandwidth from the restriction based on the accuracy of the motion vector included in the restriction information. That is, the availability determination unit 253 determines whether the encoded data can be decoded by comparing them.
すなわち、可否判定部253は、符号化データを復号するのに必要なメモリ帯域が、利用可能なメモリ帯域の上限以下の場合、この符号化データを復号可能と判定する。逆に、可否判定部253は、符号化データを復号するのに必要なメモリ帯域が、利用可能なメモリ帯域の上限より大きい場合、この符号化データを復号不可能と判定する。なお、可否判定部253は、直接メモリ帯域を比較するのではなく、実質的にメモリ帯域に相当する情報、例えば、予測モード、ブロックサイズ、および動きベクトルの精度等の情報を比較するようにしてもよい。
That is, the availability determination unit 253 determines that this encoded data is decodable when the memory bandwidth necessary for decoding the encoded data is equal to or less than the upper limit of the available memory bandwidth. Conversely, if the memory bandwidth necessary for decoding the encoded data is greater than the upper limit of the available memory bandwidth, the availability determining unit 253 determines that this encoded data is not decodable. The availability determination unit 253 does not directly compare the memory bandwidth, but compares information substantially equivalent to the memory bandwidth, for example, information such as prediction mode, block size, and motion vector accuracy. Also good.
可否判定部253は、このような可否判定結果を制御部254に供給する。制御部254は、供給された可否判定結果に基づいて、復号処理を制御する。例えば、符号化データが復号可能な場合、制御部254は、可逆復号部202に、復号を行うように指示する制御情報を供給する。可逆復号部202は、この制御情報に従って、復号を行う。また、その他の処理部も復号に関する各部の処理を実行する。
The availability determination unit 253 supplies such a determination result to the control unit 254. The control unit 254 controls the decoding process based on the supplied determination result. For example, when the encoded data can be decoded, the control unit 254 supplies control information that instructs the lossless decoding unit 202 to perform decoding. The lossless decoding unit 202 performs decoding according to the control information. The other processing units also execute the processing of each unit related to decoding.
また、例えば、符号化データが復号不可能な場合、制御部254は、エラー処理部255にエラー処理を実行するように指示する制御情報を供給する。エラー処理部255は、この制御情報に従って、復号処理を中止することをユーザに通知する等のエラー処理を行う。さらに制御部254は、可逆復号部202に対して、復号を中止するように指示する制御情報を供給する。可逆復号部202は、この制御情報に従って、復号処理を中止する。また、その他の処理部も復号に関する各部の処理を中止する。
For example, when the encoded data cannot be decoded, the control unit 254 supplies control information that instructs the error processing unit 255 to execute error processing. The error processing unit 255 performs error processing such as notifying the user that the decoding processing is to be stopped according to the control information. Furthermore, the control unit 254 supplies control information that instructs the lossless decoding unit 202 to stop decoding. The lossless decoding unit 202 stops the decoding process according to this control information. The other processing units also stop the processing of each unit related to decoding.
以上のように、復号制御部221は、動きベクトルの精度による制限を用いて、復号処理を制御することができる。したがって、復号制御部221は、復号できない符号化データの復号処理を強制終了させることができる。したがって、復号制御部221は、より確実に符号化効率の低減が抑制された符号化データを復号することができる。したがって、画像復号装置200は、より確実に、符号化効率の低減の抑制を実現することができる。
As described above, the decoding control unit 221 can control the decoding process using the restriction based on the accuracy of the motion vector. Therefore, the decoding control unit 221 can forcibly terminate the decoding process of encoded data that cannot be decoded. Therefore, the decoding control unit 221 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the image decoding apparatus 200 can more reliably suppress the reduction in encoding efficiency.
[復号処理の流れ]
次に、以上のような画像復号装置200により実行される各処理の流れについて説明する。最初に、図16のフローチャートを参照して、復号処理の流れの例を説明する。 [Decoding process flow]
Next, the flow of each process executed by theimage decoding apparatus 200 as described above will be described. First, an example of the flow of decoding processing will be described with reference to the flowchart of FIG.
次に、以上のような画像復号装置200により実行される各処理の流れについて説明する。最初に、図16のフローチャートを参照して、復号処理の流れの例を説明する。 [Decoding process flow]
Next, the flow of each process executed by the
復号処理が開始されると、ステップS201において、蓄積バッファ201は、伝送されてきた符号化データを蓄積する。ステップS202において、可逆復号部202は、蓄積バッファ201から供給される符号化データを復号する。すなわち、図1の可逆符号化部106により符号化されたIピクチャ、Pピクチャ、並びにBピクチャが復号される。
When the decoding process is started, in step S201, the accumulation buffer 201 accumulates the transmitted encoded data. In step S202, the lossless decoding unit 202 decodes the encoded data supplied from the accumulation buffer 201. That is, the I picture, P picture, and B picture encoded by the lossless encoding unit 106 in FIG. 1 are decoded.
このとき、参照フレーム情報、予測モード情報(イントラ予測モード、またはインター予測モード)、並びに伝送情報等の情報も復号される。
At this time, information such as reference frame information, prediction mode information (intra prediction mode or inter prediction mode), and transmission information is also decoded.
ステップS203において、復号制御部221は、ステップS202の処理により復号された伝送情報に基づいて復号制御処理を行う。この復号制御処理において、復号処理を強制終了すると判定された場合、このステップS203において復号処理が終了される。また、復号制御処理において、復号処理を実行すると判定された場合、ステップS204以降の処理が実行される。
In step S203, the decoding control unit 221 performs a decoding control process based on the transmission information decoded by the process in step S202. In this decoding control process, when it is determined that the decoding process is forcibly terminated, the decoding process is terminated in step S203. In the decoding control process, when it is determined that the decoding process is to be executed, the processes after step S204 are executed.
ステップS204において、逆量子化部203は、ステップS202の処理により得られた、量子化された直交変換係数を逆量子化する。
In step S204, the inverse quantization unit 203 inversely quantizes the quantized orthogonal transform coefficient obtained by the process in step S202.
ステップS205において逆直交変換部204は、ステップS203において逆量子化されて得られた直交変換係数を、図1の直交変換部104に対応する方法で逆直交変換する。これにより図1の直交変換部104の入力(演算部103の出力)に対応する差分情報が復号されたことになる。
In step S205, the inverse orthogonal transform unit 204 performs inverse orthogonal transform on the orthogonal transform coefficient obtained by inverse quantization in step S203 by a method corresponding to the orthogonal transform unit 104 in FIG. As a result, the difference information corresponding to the input of the orthogonal transform unit 104 (output of the calculation unit 103) in FIG. 1 is decoded.
ステップS206において、イントラ予測部211または動き予測・補償部212は、可逆復号部202から供給される予測モード情報に対応して、それぞれ画像の予測処理を行う。すなわち、可逆復号部202からイントラ予測モード情報が供給された場合、イントラ予測部211は、イントラ予測モードのイントラ予測処理を行う。また、可逆復号部202からインター予測モード情報が供給された場合、動き予測・補償部212は、ステップS203の処理により得られたユニットサイズに関する各種情報を用いて、インター予測処理(動き予測および動き補償を含む)を行う。
In step S <b> 206, the intra prediction unit 211 or the motion prediction / compensation unit 212 performs image prediction processing corresponding to the prediction mode information supplied from the lossless decoding unit 202. That is, when intra prediction mode information is supplied from the lossless decoding unit 202, the intra prediction unit 211 performs intra prediction processing in the intra prediction mode. When the inter prediction mode information is supplied from the lossless decoding unit 202, the motion prediction / compensation unit 212 uses various pieces of information regarding the unit size obtained by the process of step S203 to perform inter prediction processing (motion prediction and motion prediction). Compensation).
その際、符号化データは、動きベクトルの精度によって、使用される動きベクトルが制限されているので、フレームメモリ209からの参照画像の読み出しの負荷は、結果として制限される。
At that time, since the motion data used in the encoded data is limited by the accuracy of the motion vector, the load of reading the reference image from the frame memory 209 is limited as a result.
ステップS207において、演算部205は、ステップS205の処理により得られた差分情報に、ステップS206の処理により得られた予測画像を加算する。これにより元の画像データが復号される(再構成画像が得られる)。
In step S207, the calculation unit 205 adds the predicted image obtained by the process of step S206 to the difference information obtained by the process of step S205. Thereby, the original image data is decoded (a reconstructed image is obtained).
ステップS208において、ループフィルタ206は、ループフィルタ処理を行う。この処理により、ステップS207の処理により得られた再構成画像に対して、ループフィルタ処理が適宜行われる。なお、このループフィルタ処理は、図7のフローチャートを参照して説明したループフィルタ処理と基本的に同様に行われる。
In step S208, the loop filter 206 performs loop filter processing. By this process, a loop filter process is appropriately performed on the reconstructed image obtained by the process of step S207. This loop filter process is basically performed in the same manner as the loop filter process described with reference to the flowchart of FIG.
ステップS209において、画面並べ替えバッファ207は、ステップS208の処理によりループフィルタ処理が施された復号画像のフレームの並べ替えを行う。すなわち、復号画像データの、画像符号化装置100の画面並べ替えバッファ102(図1)により符号化のために並べ替えられたフレームの順序が、元の表示の順序に並べ替えられる。
In step S209, the screen rearrangement buffer 207 rearranges the frames of the decoded image that has been subjected to the loop filter processing in step S208. That is, the order of frames of the decoded image data rearranged for encoding by the screen rearrangement buffer 102 (FIG. 1) of the image encoding device 100 is rearranged to the original display order.
ステップS210において、D/A変換部208は、ステップS209の処理によりフレームが並べ替えられた復号画像データをD/A変換する。この復号画像データが図示せぬディスプレイに出力され、その画像が表示される。
In step S210, the D / A conversion unit 208 performs D / A conversion on the decoded image data in which the frames are rearranged by the processing in step S209. The decoded image data is output to a display (not shown), and the image is displayed.
ステップS211において、フレームメモリ209は、ステップS208の処理によりループフィルタ処理された復号画像データを記憶する。
In step S211, the frame memory 209 stores the decoded image data subjected to the loop filter process by the process of step S208.
[復号制御処理の流れ]
次に、図17のフローチャートを参照して、図16のステップS203において実行される復号制御処理の流れの例を説明する。 [Decoding control process flow]
Next, an example of the flow of the decoding control process executed in step S203 of FIG. 16 will be described with reference to the flowchart of FIG.
次に、図17のフローチャートを参照して、図16のステップS203において実行される復号制御処理の流れの例を説明する。 [Decoding control process flow]
Next, an example of the flow of the decoding control process executed in step S203 of FIG. 16 will be described with reference to the flowchart of FIG.
復号制御処理が開始されると、伝送情報取得部251は、ステップS231において、可逆復号部202から供給される伝送情報を取得する。
When the decoding control process is started, the transmission information acquisition unit 251 acquires the transmission information supplied from the lossless decoding unit 202 in step S231.
ステップS232において、可否判定部253は、制御情報と伝送情報とに基づいて、可逆復号部202に供給される符号化データが復号可能であるか否かを判定する。ステップS233において、制御部254は、その判定結果に基づいて処理を選択する。すなわち、復号可能であると判定された場合、制御部254は、復号制御処理を終了し、処理を図16に戻し、ステップS204以降の処理を実行させる。
In step S232, the availability determination unit 253 determines whether the encoded data supplied to the lossless decoding unit 202 is decodable based on the control information and the transmission information. In step S233, the control unit 254 selects a process based on the determination result. That is, when it is determined that decoding is possible, the control unit 254 ends the decoding control process, returns the process to FIG. 16, and causes the processes after step S204 to be executed.
また、復号不可能であると判定された場合、ステップS233において、制御部254は、処理をステップS234に進める。ステップS234において、エラー処理部255は、所定のエラー処理を行う。ステップS235において、制御部254は、可逆復号部202等を制御し、復号処理を強制終了させる。ステップS235の処理が終了すると、制御部254は、復号制御処理を終了する。
If it is determined that decryption is impossible, the control unit 254 advances the process to step S234 in step S233. In step S234, the error processing unit 255 performs predetermined error processing. In step S235, the control unit 254 controls the lossless decoding unit 202 and the like to forcibly terminate the decoding process. When the process of step S235 ends, the control unit 254 ends the decoding control process.
以上のように各処理を行うことにより、画像復号装置200は、より確実に符号化効率の低減が抑制された符号化データを復号することができる。したがって、画像復号装置200は、より確実に、符号化効率の低減の抑制を実現することができる。
By performing each process as described above, the image decoding apparatus 200 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the image decoding apparatus 200 can more reliably suppress the reduction in encoding efficiency.
[復号制御部の他の例]
なお、制限情報は、任意であり、上述した例に限らない。例えば、複数の制限の候補を含み、各種条件に従ってその中から採用する制限が選択されるようにしてもよい。 [Another example of decoding control unit]
The restriction information is arbitrary and is not limited to the example described above. For example, a plurality of restriction candidates may be included, and a restriction to be adopted may be selected from among them according to various conditions.
なお、制限情報は、任意であり、上述した例に限らない。例えば、複数の制限の候補を含み、各種条件に従ってその中から採用する制限が選択されるようにしてもよい。 [Another example of decoding control unit]
The restriction information is arbitrary and is not limited to the example described above. For example, a plurality of restriction candidates may be included, and a restriction to be adopted may be selected from among them according to various conditions.
図18は、そのような場合の、復号制御部221の主な構成例を示すブロック図である。図18に示される例の場合、復号制御部221は、図15の可否判定部253の代わりに、可否判定部273を有する。また、図18の復号制御部221は、さらに、条件取得部271を有する。
FIG. 18 is a block diagram illustrating a main configuration example of the decoding control unit 221 in such a case. In the case of the example shown in FIG. 18, the decoding control unit 221 includes an availability determination unit 273 instead of the availability determination unit 253 in FIG. 15. Further, the decoding control unit 221 in FIG. 18 further includes a condition acquisition unit 271.
条件取得部271は、動きベクトル生成に関する各種条件を取得し、それを条件情報として可否判定部273に通知する。この条件は任意であり、どのような情報であってもよい。例えば、ユーザ等による許可する動きベクトルや制限の指定であってもよいし、利用可能なメモリ帯域等のリソースに関する情報であってもよいし、符号化規格により規定される制限に関する情報であってもよい。
The condition acquisition unit 271 acquires various conditions related to motion vector generation, and notifies the availability determination unit 273 of the acquired conditions as condition information. This condition is arbitrary and may be any information. For example, it may be a motion vector permitted by a user or the like, a restriction specification, information on a resource such as an available memory band, or information on a restriction defined by a coding standard. Also good.
可否判定部273は、このような条件情報と、制限情報記憶部252から読み出した制限情報に基づいて、動きベクトルの制限内容を決定する。
The availability determination unit 273 determines the restriction content of the motion vector based on such condition information and the restriction information read from the restriction information storage unit 252.
この場合、制限情報において、制限内容が複数パターン用意されており、可否判定部273が、条件情報に応じていずれかを選択することができるようになされている。例えば、制限情報が、上述したような、「L0予測若しくはL1予測では、4×4ブロックサイズを禁止する。双予測では、4×4,8×4,4×8および8×8の「efgijkpqr」を禁止する」といったように、少なくとも、動きベクトルの精度に関する条件を複数含むようにしてもよい。この場合、可否判定部273は、条件情報に従っていずれかの制限内容を選択することにより、条件情報に応じた制限内容を決定することができる。
In this case, a plurality of patterns of restriction contents are prepared in the restriction information, and the availability determination unit 273 can select either one according to the condition information. For example, the restriction information is as described above, “4 × 4 block size is prohibited in L0 prediction or L1 prediction. In bi-prediction,“ efgijkpqr of 4 × 4, 8 × 4, 4 × 8 and 8 × 8 A plurality of conditions relating to the accuracy of the motion vector may be included, such as “prohibit”. In this case, the availability determination unit 273 can determine the restriction content according to the condition information by selecting any restriction content according to the condition information.
また、制限情報が、条件によって複数通りの演算結果が得られる所定の演算式を含むようにしてもよい。この場合、可否判定部273は、条件情報に従って演算式の係数の値を決定し、その値を用いて演算を行うことにより、条件情報に応じた制限内容を決定することができる。
Also, the restriction information may include a predetermined arithmetic expression that can obtain a plurality of arithmetic results depending on conditions. In this case, the availability determination unit 273 can determine the restriction content according to the condition information by determining the value of the coefficient of the arithmetic expression according to the condition information and performing the calculation using the value.
さらに、制限情報が、図4や図5に示されるテーブル情報を含むようにしてもよい。この場合、可否判定部273は、条件情報に従って、上限とする「入力画素数」を決定することにより、条件情報に応じた制限内容を決定することができる。
Furthermore, the restriction information may include table information shown in FIG. 4 and FIG. In this case, the availability determination unit 273 can determine the restriction content according to the condition information by determining the “number of input pixels” as the upper limit according to the condition information.
なお、図4および図5の例は、メモリアラインメントを固定した例であるが、制限情報が、各メモリアラインメントのパターンについての、このようなテーブル情報を含むようにしてもよい。その場合、可否判定部273は、条件情報に従って、メモリアラインメントを特定(決定)することにより、適用するテーブルを選択し、さらに、条件情報に従って、上限とする「入力画素数」を決定することにより、条件情報に応じた制限内容を決定することができる。
4 and 5 are examples in which the memory alignment is fixed. However, the restriction information may include such table information for each memory alignment pattern. In that case, the availability determination unit 273 specifies (determines) the memory alignment according to the condition information, selects a table to be applied, and further determines the upper limit “number of input pixels” according to the condition information. Therefore, it is possible to determine the restriction content according to the condition information.
また、図11や図12に示される例のように、図4および図5のテーブルの各行について「許可」するか「禁止」するかを予め決定したテーブルを制限情報に含めるようにしてもよい。つまり、この例の場合、予測モード、ブロックサイズ、および動きベクトルの精度の組み合わせについて、どれを許可し、どれを禁止するかが予め定められている。したがって、可否判定部273は、条件情報に従ってメモリアラインメントを特定(決定)するだけで、条件情報に応じた制限内容を決定することができる。
Further, as in the examples shown in FIGS. 11 and 12, a table in which whether to permit or prohibit each row in the tables of FIGS. 4 and 5 may be included in the restriction information. . That is, in this example, it is determined in advance which is permitted and which is prohibited for the combination of the prediction mode, the block size, and the accuracy of the motion vector. Therefore, the availability determination unit 273 can determine the restriction content according to the condition information only by specifying (determining) the memory alignment according to the condition information.
この場合も、動きベクトルの精度による制限以外の制限は任意である。動きベクトル制御情報による制限において、予測モードやブロックサイズ以外の制限が含まれていても良いし、予測モードやブロックサイズによる制限が省略されても良い。また、動きベクトル制御情報や伝送情報に、動きベクトルの精度による制限が含まれている限り、制限情報や条件は、どのような情報であっても良い。
Also in this case, restrictions other than the restriction due to the accuracy of the motion vector are arbitrary. In the restriction by the motion vector control information, restrictions other than the prediction mode and the block size may be included, or the restriction by the prediction mode and the block size may be omitted. Further, as long as the motion vector control information and the transmission information include a restriction due to the accuracy of the motion vector, the restriction information and the condition may be any information.
可否判定部273は、このように求めた制限を、図15の場合と同様に伝送情報と比較する(例えば、予測モード、ブロックサイズ、動きベクトルの精度のような各種情報を比較しても良いし、それらをメモリ帯域に換算して比較しても良い)ことにより、復号処理の可否判定を行う。可否判定部273は、その判定結果を制御部254に供給する。制御部254は、この判定結果に基づいて、図15の場合と同様に復号処理を制御する。
The availability determination unit 273 compares the thus obtained restriction with transmission information as in the case of FIG. 15 (for example, various kinds of information such as prediction mode, block size, and motion vector accuracy may be compared). Then, they may be converted into a memory band and compared) to determine whether or not the decoding process is possible. The availability determination unit 273 supplies the determination result to the control unit 254. Based on the determination result, the control unit 254 controls the decoding process as in the case of FIG.
この場合も、復号制御部221は、動きベクトルの精度による制限を用いて、復号処理を制御することができる。したがって、復号制御部221は、復号できない符号化データの復号処理を強制終了させることができる。したがって、復号制御部221は、より確実に符号化効率の低減が抑制された符号化データを復号することができる。したがって、画像復号装置200は、より確実に、符号化効率の低減の抑制を実現することができる。
Also in this case, the decoding control unit 221 can control the decoding process by using the restriction based on the accuracy of the motion vector. Therefore, the decoding control unit 221 can forcibly terminate the decoding process of encoded data that cannot be decoded. Therefore, the decoding control unit 221 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the image decoding apparatus 200 can more reliably suppress the reduction in encoding efficiency.
[復号制御処理の流れ]
次に、図19のフローチャートを参照して、この場合の復号制御処理の流れの例を説明する。 [Decoding control process flow]
Next, an example of the flow of decoding control processing in this case will be described with reference to the flowchart of FIG.
次に、図19のフローチャートを参照して、この場合の復号制御処理の流れの例を説明する。 [Decoding control process flow]
Next, an example of the flow of decoding control processing in this case will be described with reference to the flowchart of FIG.
復号制御処理が開始されると、条件取得部271は、ステップS251において、条件を取得する。ステップS252において、伝送情報取得部251は、可逆復号部202から供給される伝送情報を取得する。
When the decoding control process is started, the condition acquisition unit 271 acquires a condition in step S251. In step S252, the transmission information acquisition unit 251 acquires the transmission information supplied from the lossless decoding unit 202.
ステップS253において、可否判定部273は、ステップS251において取得された条件、制限情報記憶部252から読み出した制御情報、並びに、ステップS252において取得された伝送情報とに基づいて、可逆復号部202に供給される符号化データが復号可能であるか否かを判定する。ステップS254において、制御部254は、その判定結果に基づいて処理を選択する。すなわち、復号可能であると判定された場合、制御部254は、復号制御処理を終了し、処理を図16に戻し、ステップS204以降の処理を実行させる。
In step S253, the availability determination unit 273 supplies the lossless decoding unit 202 based on the conditions acquired in step S251, the control information read from the restriction information storage unit 252, and the transmission information acquired in step S252. It is determined whether the encoded data to be decoded is decodable. In step S254, the control unit 254 selects a process based on the determination result. That is, when it is determined that decoding is possible, the control unit 254 ends the decoding control process, returns the process to FIG. 16, and causes the processes after step S204 to be executed.
また、復号不可能であると判定された場合、ステップS255において、制御部254は、処理をステップS255に進める。ステップS255において、エラー処理部255は、所定のエラー処理を行う。ステップS256において、制御部254は、可逆復号部202等を制御し、復号処理を強制終了させる。ステップS235の処理が終了すると、制御部254は、復号制御処理を終了する。
If it is determined that decoding is impossible, in step S255, the control unit 254 advances the process to step S255. In step S255, the error processing unit 255 performs predetermined error processing. In step S256, the control unit 254 controls the lossless decoding unit 202 and the like to forcibly terminate the decoding process. When the process of step S235 ends, the control unit 254 ends the decoding control process. *
以上のように各処理を行うことにより、画像復号装置200は、より確実に符号化効率の低減が抑制された符号化データを復号することができる。したがって、画像復号装置200は、より確実に、符号化効率の低減の抑制を実現することができる。
By performing each process as described above, the image decoding apparatus 200 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the image decoding apparatus 200 can more reliably suppress the reduction in encoding efficiency.
<3.第3の実施の形態>
[コンピュータ]
上述した一連の処理は、ハードウエアにより実行させることもできるし、ソフトウエアにより実行させることもできる。この場合、例えば、図29に示されるようなコンピュータとして構成されるようにしてもよい。 <3. Third Embodiment>
[Computer]
The series of processes described above can be executed by hardware or can be executed by software. In this case, for example, the computer may be configured as shown in FIG.
[コンピュータ]
上述した一連の処理は、ハードウエアにより実行させることもできるし、ソフトウエアにより実行させることもできる。この場合、例えば、図29に示されるようなコンピュータとして構成されるようにしてもよい。 <3. Third Embodiment>
[Computer]
The series of processes described above can be executed by hardware or can be executed by software. In this case, for example, the computer may be configured as shown in FIG.
図20において、コンピュータ700のCPU(Central Processing Unit)701は、ROM(Read Only Memory)702に記憶されているプログラム、または記憶部713からRAM(Random Access Memory)703にロードされたプログラムに従って各種の処理を実行する。RAM703にはまた、CPU701が各種の処理を実行する上において必要なデータなども適宜記憶される。
In FIG. 20, a CPU (Central Processing Unit) 701 of a computer 700 has various programs according to a program stored in a ROM (Read Only Memory) 702 or a program loaded from a storage unit 713 into a RAM (Random Access Memory) 703. Execute the process. The RAM 703 also appropriately stores data necessary for the CPU 701 to execute various processes.
CPU701、ROM702、およびRAM703は、バス704を介して相互に接続されている。このバス704にはまた、入出力インタフェース710も接続されている。
The CPU 701, ROM 702, and RAM 703 are connected to each other via a bus 704. An input / output interface 710 is also connected to the bus 704.
入出力インタフェース710には、キーボード、マウス、タッチパネル、および入力端子などよりなる入力部711、CRT(Cathode Ray Tube)、LCD(Liquid Crystal Display)、およびOELD(Organic ElectroLuminescence Display)などよりなるディスプレイ、並びにスピーカ等の任意の出力デバイスや出力端子等よりなる出力部712、ハードディスクやフラッシュメモリ等の任意の記憶媒体やその記憶媒体の入出力を制御する制御部等により構成される記憶部713、モデム、LANインタフェース、USB(Universal Serial Bus)、並びにBluetooth(登録商標)等、有線や無線の任意の通信デバイスよりなる通信部714が接続されている。通信部714は、例えばインターネットを含むネットワークを介して他の通信デバイスとの通信処理を行う。
The input / output interface 710 includes an input unit 711 including a keyboard, a mouse, a touch panel, and an input terminal, a display including a CRT (Cathode Ray Tube), an LCD (Liquid Crystal Display), and an OELD (Organic ElectroLuminescence Display), and the like. An output unit 712 including an arbitrary output device such as a speaker or an output terminal, a storage unit 713 configured by an arbitrary storage medium such as a hard disk or a flash memory, a control unit for controlling input / output of the storage medium, a modem, A communication unit 714 including any wired or wireless communication device such as a LAN interface, USB (Universal Serial Bus), and Bluetooth (registered trademark) is connected. The communication unit 714 performs communication processing with other communication devices via a network including the Internet, for example.
入出力インタフェース710にはまた、必要に応じてドライブ715が接続される。そのドライブ715には、磁気ディスク、光ディスク、光磁気ディスク、或いは半導体メモリなどのリムーバブルメディア721が適宜装着される。ドライブ715は、例えばCPU701の制御に従って、自身に装着されたリムーバブルメディア721からコンピュータプログラムやデータ等を読み出す。その読み出されたデータやコンピュータプログラムは、例えば、RAM703に供給される。また、リムーバブルメディア721から読み出されたコンピュータプログラムは、必要に応じて記憶部713にインストールされる。
A drive 715 is also connected to the input / output interface 710 as necessary. A removable medium 721 such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory is appropriately attached to the drive 715. The drive 715 reads out a computer program, data, and the like from the removable medium 721 attached to the drive 715 according to the control of the CPU 701, for example. The read data and computer program are supplied to the RAM 703, for example. The computer program read from the removable medium 721 is installed in the storage unit 713 as necessary.
上述した一連の処理をソフトウエアにより実行させる場合には、そのソフトウエアを構成するプログラムが、ネットワークや記録媒体からインストールされる。
When the above-described series of processing is executed by software, a program constituting the software is installed from a network or a recording medium.
この記録媒体は、例えば、図20に示されるように、装置本体とは別に、ユーザにプログラムを配信するために配布される、プログラムが記録されている磁気ディスク(フレキシブルディスクを含む)、光ディスク(CD-ROM(Compact Disc - Read Only Memory),DVD(Digital Versatile Disc)を含む)、光磁気ディスク(MD(Mini Disc)を含む)、若しくは半導体メモリなどよりなるリムーバブルメディア721により構成されるだけでなく、装置本体に予め組み込まれた状態でユーザに配信される、プログラムが記録されているROM702や、記憶部713に含まれるハードディスクなどで構成される。
For example, as shown in FIG. 20, the recording medium is distributed to distribute the program to the user separately from the apparatus main body, and includes a magnetic disk (including a flexible disk) on which the program is recorded, an optical disk ( It is only composed of removable media 721 consisting of CD-ROM (compact disc-read only memory), DVD (including digital versatile disc), magneto-optical disc (including MD (mini disc)), or semiconductor memory. Rather, it is composed of a ROM 702 in which a program is recorded and a hard disk included in the storage unit 713, which is distributed to the user in a state of being incorporated in the apparatus main body in advance.
なお、コンピュータが実行するプログラムは、本明細書で説明する順序に沿って時系列に処理が行われるプログラムであっても良いし、並列に、あるいは呼び出しが行われたとき等の必要なタイミングで処理が行われるプログラムであっても良い。
The program executed by the computer may be a program that is processed in time series in the order described in this specification, or in parallel or at a necessary timing such as when a call is made. It may be a program for processing.
また、本明細書において、記録媒体に記録されるプログラムを記述するステップは、記載された順序に沿って時系列的に行われる処理はもちろん、必ずしも時系列的に処理されなくとも、並列的あるいは個別に実行される処理をも含むものである。
Further, in the present specification, the step of describing the program recorded on the recording medium is not limited to the processing performed in chronological order according to the described order, but may be performed in parallel or It also includes processes that are executed individually.
また、本明細書において、システムとは、複数のデバイス(装置)により構成される装置全体を表すものである。
In addition, in this specification, the system represents the entire apparatus composed of a plurality of devices (apparatuses).
また、以上において、1つの装置(または処理部)として説明した構成を分割し、複数の装置(または処理部)として構成するようにしてもよい。逆に、以上において複数の装置(または処理部)として説明した構成をまとめて1つの装置(または処理部)として構成されるようにしてもよい。また、各装置(または各処理部)の構成に上述した以外の構成を付加するようにしてももちろんよい。さらに、システム全体としての構成や動作が実質的に同じであれば、ある装置(または処理部)の構成の一部を他の装置(または他の処理部)の構成に含めるようにしてもよい。つまり、本技術の実施の形態は、上述した実施の形態に限定されるものではなく、本技術の要旨を逸脱しない範囲において種々の変更が可能である。
Also, in the above, the configuration described as one device (or processing unit) may be divided and configured as a plurality of devices (or processing units). Conversely, the configurations described above as a plurality of devices (or processing units) may be combined into a single device (or processing unit). Of course, a configuration other than that described above may be added to the configuration of each device (or each processing unit). Furthermore, if the configuration and operation of the entire system are substantially the same, a part of the configuration of a certain device (or processing unit) may be included in the configuration of another device (or other processing unit). . That is, the embodiments of the present technology are not limited to the above-described embodiments, and various modifications can be made without departing from the gist of the present technology.
上述した実施形態に係る画像符号化装置100(図1)および画像復号装置200(図14)は、衛星放送、ケーブルTVなどの有線放送、インターネット上での配信、及びセルラー通信による端末への配信などにおける送信機若しくは受信機、光ディスク、磁気ディスク及びフラッシュメモリなどの媒体に画像を記録する記録装置、又は、これら記憶媒体から画像を再生する再生装置などの様々な電子機器に応用され得る。以下、4つの応用例について説明する。
The image encoding device 100 (FIG. 1) and the image decoding device 200 (FIG. 14) according to the above-described embodiment are distributed to satellite broadcasting, cable broadcasting such as cable TV, distribution on the Internet, and terminals by cellular communication. The present invention can be applied to various electronic devices such as a transmitter or receiver, a recording device that records an image on a medium such as an optical disk, a magnetic disk, and a flash memory, or a reproducing device that reproduces an image from these storage media. Hereinafter, four application examples will be described.
<4.第4の実施の形態>
[テレビジョン装置]
図21は、上述した実施形態を適用したテレビジョン装置の概略的な構成の一例を示している。テレビジョン装置900は、アンテナ901、チューナ902、デマルチプレクサ903、デコーダ904、映像信号処理部905、表示部906、音声信号処理部907、スピーカ908、外部インタフェース909、制御部910、ユーザインタフェース911、及びバス912を備える。 <4. Fourth Embodiment>
[TV equipment]
FIG. 21 illustrates an example of a schematic configuration of a television device to which the above-described embodiment is applied. Thetelevision apparatus 900 includes an antenna 901, a tuner 902, a demultiplexer 903, a decoder 904, a video signal processing unit 905, a display unit 906, an audio signal processing unit 907, a speaker 908, an external interface 909, a control unit 910, a user interface 911, And a bus 912.
[テレビジョン装置]
図21は、上述した実施形態を適用したテレビジョン装置の概略的な構成の一例を示している。テレビジョン装置900は、アンテナ901、チューナ902、デマルチプレクサ903、デコーダ904、映像信号処理部905、表示部906、音声信号処理部907、スピーカ908、外部インタフェース909、制御部910、ユーザインタフェース911、及びバス912を備える。 <4. Fourth Embodiment>
[TV equipment]
FIG. 21 illustrates an example of a schematic configuration of a television device to which the above-described embodiment is applied. The
チューナ902は、アンテナ901を介して受信される放送信号から所望のチャンネルの信号を抽出し、抽出した信号を復調する。そして、チューナ902は、復調により得られた符号化ビットストリームをデマルチプレクサ903へ出力する。即ち、チューナ902は、画像が符号化されている符号化ストリームを受信する、テレビジョン装置900における伝送部としての役割を有する。
Tuner 902 extracts a signal of a desired channel from a broadcast signal received via antenna 901, and demodulates the extracted signal. Then, the tuner 902 outputs the encoded bit stream obtained by the demodulation to the demultiplexer 903. That is, the tuner 902 has a role as a transmission unit in the television device 900 that receives an encoded stream in which an image is encoded.
デマルチプレクサ903は、符号化ビットストリームから視聴対象の番組の映像ストリーム及び音声ストリームを分離し、分離した各ストリームをデコーダ904へ出力する。また、デマルチプレクサ903は、符号化ビットストリームからEPG(Electronic Program Guide)などの補助的なデータを抽出し、抽出したデータを制御部910に供給する。なお、デマルチプレクサ903は、符号化ビットストリームがスクランブルされている場合には、デスクランブルを行ってもよい。
The demultiplexer 903 separates the video stream and audio stream of the viewing target program from the encoded bit stream, and outputs each separated stream to the decoder 904. Further, the demultiplexer 903 extracts auxiliary data such as EPG (Electronic Program Guide) from the encoded bit stream, and supplies the extracted data to the control unit 910. Note that the demultiplexer 903 may perform descrambling when the encoded bit stream is scrambled.
デコーダ904は、デマルチプレクサ903から入力される映像ストリーム及び音声ストリームを復号する。そして、デコーダ904は、復号処理により生成される映像データを映像信号処理部905へ出力する。また、デコーダ904は、復号処理により生成される音声データを音声信号処理部907へ出力する。
The decoder 904 decodes the video stream and audio stream input from the demultiplexer 903. Then, the decoder 904 outputs the video data generated by the decoding process to the video signal processing unit 905. In addition, the decoder 904 outputs audio data generated by the decoding process to the audio signal processing unit 907.
映像信号処理部905は、デコーダ904から入力される映像データを再生し、表示部906に映像を表示させる。また、映像信号処理部905は、ネットワークを介して供給されるアプリケーション画面を表示部906に表示させてもよい。また、映像信号処理部905は、映像データについて、設定に応じて、例えばノイズ除去などの追加的な処理を行ってもよい。さらに、映像信号処理部905は、例えばメニュー、ボタン又はカーソルなどのGUI(Graphical User Interface)の画像を生成し、生成した画像を出力画像に重畳してもよい。
The video signal processing unit 905 reproduces the video data input from the decoder 904 and causes the display unit 906 to display the video. In addition, the video signal processing unit 905 may cause the display unit 906 to display an application screen supplied via a network. Further, the video signal processing unit 905 may perform additional processing such as noise removal on the video data according to the setting. Furthermore, the video signal processing unit 905 may generate a GUI (Graphical User Interface) image such as a menu, a button, or a cursor, and superimpose the generated image on the output image.
表示部906は、映像信号処理部905から供給される駆動信号により駆動され、表示デバイス(例えば、液晶ディスプレイ、プラズマディスプレイ又はOELD(Organic ElectroLuminescence Display)(有機ELディスプレイ)など)の映像面上に映像又は画像を表示する。
The display unit 906 is driven by a drive signal supplied from the video signal processing unit 905, and displays an image on a video screen of a display device (for example, a liquid crystal display, a plasma display, or an OELD (Organic ElectroLuminescence Display) (organic EL display)). Or an image is displayed.
音声信号処理部907は、デコーダ904から入力される音声データについてD/A変換及び増幅などの再生処理を行い、スピーカ908から音声を出力させる。また、音声信号処理部907は、音声データについてノイズ除去などの追加的な処理を行ってもよい。
The audio signal processing unit 907 performs reproduction processing such as D / A conversion and amplification on the audio data input from the decoder 904, and outputs audio from the speaker 908. The audio signal processing unit 907 may perform additional processing such as noise removal on the audio data.
外部インタフェース909は、テレビジョン装置900と外部機器又はネットワークとを接続するためのインタフェースである。例えば、外部インタフェース909を介して受信される映像ストリーム又は音声ストリームが、デコーダ904により復号されてもよい。即ち、外部インタフェース909もまた、画像が符号化されている符号化ストリームを受信する、テレビジョン装置900における伝送部としての役割を有する。
The external interface 909 is an interface for connecting the television apparatus 900 to an external device or a network. For example, a video stream or an audio stream received via the external interface 909 may be decoded by the decoder 904. That is, the external interface 909 also has a role as a transmission unit in the television apparatus 900 that receives an encoded stream in which an image is encoded.
制御部910は、CPUなどのプロセッサ、並びにRAM及びROMなどのメモリを有する。メモリは、CPUにより実行されるプログラム、プログラムデータ、EPGデータ、及びネットワークを介して取得されるデータなどを記憶する。メモリにより記憶されるプログラムは、例えば、テレビジョン装置900の起動時にCPUにより読み込まれ、実行される。CPUは、プログラムを実行することにより、例えばユーザインタフェース911から入力される操作信号に応じて、テレビジョン装置900の動作を制御する。
The control unit 910 includes a processor such as a CPU and memories such as a RAM and a ROM. The memory stores a program executed by the CPU, program data, EPG data, data acquired via a network, and the like. For example, the program stored in the memory is read and executed by the CPU when the television apparatus 900 is activated. The CPU executes the program to control the operation of the television device 900 according to an operation signal input from the user interface 911, for example.
ユーザインタフェース911は、制御部910と接続される。ユーザインタフェース911は、例えば、ユーザがテレビジョン装置900を操作するためのボタン及びスイッチ、並びに遠隔制御信号の受信部などを有する。ユーザインタフェース911は、これら構成要素を介してユーザによる操作を検出して操作信号を生成し、生成した操作信号を制御部910へ出力する。
The user interface 911 is connected to the control unit 910. The user interface 911 includes, for example, buttons and switches for the user to operate the television device 900, a remote control signal receiving unit, and the like. The user interface 911 detects an operation by the user via these components, generates an operation signal, and outputs the generated operation signal to the control unit 910.
バス912は、チューナ902、デマルチプレクサ903、デコーダ904、映像信号処理部905、音声信号処理部907、外部インタフェース909及び制御部910を相互に接続する。
The bus 912 connects the tuner 902, the demultiplexer 903, the decoder 904, the video signal processing unit 905, the audio signal processing unit 907, the external interface 909, and the control unit 910 to each other.
このように構成されたテレビジョン装置900において、デコーダ904は、上述した実施形態に係る画像復号装置200(図14)の機能を有する。従って、デコーダ904は、動き予測の制限において動きベクトルの精度が考慮された符号化により生成された符号化データのみ復号するようにすることができる。したがってデコーダ904は、より確実に符号化効率の低減が抑制された符号化データを復号することができる。したがって、テレビジョン装置900は、より確実に、符号化効率の低減の抑制を実現することができる。
In the television apparatus 900 configured as described above, the decoder 904 has the function of the image decoding apparatus 200 (FIG. 14) according to the above-described embodiment. Therefore, the decoder 904 can decode only the encoded data generated by the encoding in which the accuracy of the motion vector is considered in the limitation of motion prediction. Therefore, the decoder 904 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the television apparatus 900 can more reliably suppress the reduction in encoding efficiency.
<5.第5の実施の形態>
[携帯電話機]
図22は、上述した実施形態を適用した携帯電話機の概略的な構成の一例を示している。携帯電話機920は、アンテナ921、通信部922、音声コーデック923、スピーカ924、マイクロホン925、カメラ部926、画像処理部927、多重分離部928、記録再生部929、表示部930、制御部931、操作部932、及びバス933を備える。 <5. Fifth embodiment>
[Mobile phone]
FIG. 22 shows an example of a schematic configuration of a mobile phone to which the above-described embodiment is applied. Amobile phone 920 includes an antenna 921, a communication unit 922, an audio codec 923, a speaker 924, a microphone 925, a camera unit 926, an image processing unit 927, a demultiplexing unit 928, a recording / reproducing unit 929, a display unit 930, a control unit 931, an operation A portion 932 and a bus 933.
[携帯電話機]
図22は、上述した実施形態を適用した携帯電話機の概略的な構成の一例を示している。携帯電話機920は、アンテナ921、通信部922、音声コーデック923、スピーカ924、マイクロホン925、カメラ部926、画像処理部927、多重分離部928、記録再生部929、表示部930、制御部931、操作部932、及びバス933を備える。 <5. Fifth embodiment>
[Mobile phone]
FIG. 22 shows an example of a schematic configuration of a mobile phone to which the above-described embodiment is applied. A
アンテナ921は、通信部922に接続される。スピーカ924及びマイクロホン925は、音声コーデック923に接続される。操作部932は、制御部931に接続される。バス933は、通信部922、音声コーデック923、カメラ部926、画像処理部927、多重分離部928、記録再生部929、表示部930、及び制御部931を相互に接続する。
The antenna 921 is connected to the communication unit 922. The speaker 924 and the microphone 925 are connected to the audio codec 923. The operation unit 932 is connected to the control unit 931. The bus 933 connects the communication unit 922, the audio codec 923, the camera unit 926, the image processing unit 927, the demultiplexing unit 928, the recording / reproducing unit 929, the display unit 930, and the control unit 931 to each other.
携帯電話機920は、音声通話モード、データ通信モード、撮影モード及びテレビ電話モードを含む様々な動作モードで、音声信号の送受信、電子メール又は画像データの送受信、画像の撮像、及びデータの記録などの動作を行う。
The mobile phone 920 has various operation modes including a voice call mode, a data communication mode, a shooting mode, and a videophone mode, and is used for sending and receiving voice signals, sending and receiving e-mail or image data, taking images, and recording data. Perform the action.
音声通話モードにおいて、マイクロホン925により生成されるアナログ音声信号は、音声コーデック923に供給される。音声コーデック923は、アナログ音声信号を音声データへ変換し、変換された音声データをA/D変換し圧縮する。そして、音声コーデック923は、圧縮後の音声データを通信部922へ出力する。通信部922は、音声データを符号化及び変調し、送信信号を生成する。そして、通信部922は、生成した送信信号をアンテナ921を介して基地局(図示せず)へ送信する。また、通信部922は、アンテナ921を介して受信される無線信号を増幅し及び周波数変換し、受信信号を取得する。そして、通信部922は、受信信号を復調及び復号して音声データを生成し、生成した音声データを音声コーデック923へ出力する。音声コーデック923は、音声データを伸張し及びD/A変換し、アナログ音声信号を生成する。そして、音声コーデック923は、生成した音声信号をスピーカ924に供給して音声を出力させる。
In the voice call mode, the analog voice signal generated by the microphone 925 is supplied to the voice codec 923. The audio codec 923 converts an analog audio signal into audio data, A / D converts the compressed audio data, and compresses it. Then, the audio codec 923 outputs the compressed audio data to the communication unit 922. The communication unit 922 encodes and modulates the audio data and generates a transmission signal. Then, the communication unit 922 transmits the generated transmission signal to a base station (not shown) via the antenna 921. In addition, the communication unit 922 amplifies a radio signal received via the antenna 921 and performs frequency conversion to acquire a received signal. Then, the communication unit 922 demodulates and decodes the received signal to generate audio data, and outputs the generated audio data to the audio codec 923. The audio codec 923 decompresses the audio data and performs D / A conversion to generate an analog audio signal. Then, the audio codec 923 supplies the generated audio signal to the speaker 924 to output audio.
また、データ通信モードにおいて、例えば、制御部931は、操作部932を介するユーザによる操作に応じて、電子メールを構成する文字データを生成する。また、制御部931は、文字を表示部930に表示させる。また、制御部931は、操作部932を介するユーザからの送信指示に応じて電子メールデータを生成し、生成した電子メールデータを通信部922へ出力する。通信部922は、電子メールデータを符号化及び変調し、送信信号を生成する。そして、通信部922は、生成した送信信号を、アンテナ921を介して基地局(図示せず)へ送信する。また、通信部922は、アンテナ921を介して受信される無線信号を増幅し及び周波数変換し、受信信号を取得する。そして、通信部922は、受信信号を復調及び復号して電子メールデータを復元し、復元した電子メールデータを制御部931へ出力する。制御部931は、表示部930に電子メールの内容を表示させると共に、電子メールデータを記録再生部929の記憶媒体に記憶させる。
Further, in the data communication mode, for example, the control unit 931 generates character data constituting the e-mail in response to an operation by the user via the operation unit 932. In addition, the control unit 931 causes the display unit 930 to display characters. In addition, the control unit 931 generates e-mail data in response to a transmission instruction from the user via the operation unit 932, and outputs the generated e-mail data to the communication unit 922. The communication unit 922 encodes and modulates email data and generates a transmission signal. Then, the communication unit 922 transmits the generated transmission signal to a base station (not shown) via the antenna 921. In addition, the communication unit 922 amplifies a radio signal received via the antenna 921 and performs frequency conversion to acquire a received signal. Then, the communication unit 922 demodulates and decodes the received signal to restore the email data, and outputs the restored email data to the control unit 931. The control unit 931 displays the content of the electronic mail on the display unit 930 and stores the electronic mail data in the storage medium of the recording / reproducing unit 929.
記録再生部929は、読み書き可能な任意の記憶媒体を有する。例えば、記憶媒体は、RAM又はフラッシュメモリなどの内蔵型の記憶媒体であってもよく、ハードディスク、磁気ディスク、光磁気ディスク、光ディスク、USBメモリ、又はメモリカードなどの外部装着型の記憶媒体であってもよい。
The recording / reproducing unit 929 has an arbitrary readable / writable storage medium. For example, the storage medium may be a built-in storage medium such as a RAM or a flash memory, or an externally mounted storage medium such as a hard disk, a magnetic disk, a magneto-optical disk, an optical disk, a USB memory, or a memory card. May be.
また、撮影モードにおいて、例えば、カメラ部926は、被写体を撮像して画像データを生成し、生成した画像データを画像処理部927へ出力する。画像処理部927は、カメラ部926から入力される画像データを符号化し、符号化ストリームを記録再生部929の記憶媒体に記憶させる。
In the shooting mode, for example, the camera unit 926 images a subject to generate image data, and outputs the generated image data to the image processing unit 927. The image processing unit 927 encodes the image data input from the camera unit 926 and stores the encoded stream in the storage medium of the recording / playback unit 929.
また、テレビ電話モードにおいて、例えば、多重分離部928は、画像処理部927により符号化された映像ストリームと、音声コーデック923から入力される音声ストリームとを多重化し、多重化したストリームを通信部922へ出力する。通信部922は、ストリームを符号化及び変調し、送信信号を生成する。そして、通信部922は、生成した送信信号をアンテナ921を介して基地局(図示せず)へ送信する。また、通信部922は、アンテナ921を介して受信される無線信号を増幅し及び周波数変換し、受信信号を取得する。これら送信信号及び受信信号には、符号化ビットストリームが含まれ得る。そして、通信部922は、受信信号を復調及び復号してストリームを復元し、復元したストリームを多重分離部928へ出力する。多重分離部928は、入力されるストリームから映像ストリーム及び音声ストリームを分離し、映像ストリームを画像処理部927、音声ストリームを音声コーデック923へ出力する。画像処理部927は、映像ストリームを復号し、映像データを生成する。映像データは、表示部930に供給され、表示部930により一連の画像が表示される。音声コーデック923は、音声ストリームを伸張し及びD/A変換し、アナログ音声信号を生成する。そして、音声コーデック923は、生成した音声信号をスピーカ924に供給して音声を出力させる。
Further, in the videophone mode, for example, the demultiplexing unit 928 multiplexes the video stream encoded by the image processing unit 927 and the audio stream input from the audio codec 923, and the multiplexed stream is the communication unit 922. Output to. The communication unit 922 encodes and modulates the stream and generates a transmission signal. Then, the communication unit 922 transmits the generated transmission signal to a base station (not shown) via the antenna 921. In addition, the communication unit 922 amplifies a radio signal received via the antenna 921 and performs frequency conversion to acquire a received signal. These transmission signal and reception signal may include an encoded bit stream. Then, the communication unit 922 demodulates and decodes the received signal to restore the stream, and outputs the restored stream to the demultiplexing unit 928. The demultiplexing unit 928 separates the video stream and the audio stream from the input stream, and outputs the video stream to the image processing unit 927 and the audio stream to the audio codec 923. The image processing unit 927 decodes the video stream and generates video data. The video data is supplied to the display unit 930, and a series of images is displayed on the display unit 930. The audio codec 923 decompresses the audio stream and performs D / A conversion to generate an analog audio signal. Then, the audio codec 923 supplies the generated audio signal to the speaker 924 to output audio.
このように構成された携帯電話機920において、画像処理部927は、上述した実施形態に係る画像符号化装置100(図1)の機能を有する。したがって、画像処理部927は、動きベクトルの精度を考慮して動き予測を制限することができる。したがって、画像処理部927は、符号化効率の低減を抑制しながら負荷の増大を抑制し、画像を符号化することができる。したがって、携帯電話機920は、符号化効率の抑制を低減させることができる。
In the mobile phone 920 configured as described above, the image processing unit 927 has the function of the image encoding device 100 (FIG. 1) according to the above-described embodiment. Therefore, the image processing unit 927 can limit motion prediction in consideration of the accuracy of the motion vector. Therefore, the image processing unit 927 can encode an image while suppressing an increase in load while suppressing a decrease in encoding efficiency. Therefore, the mobile phone 920 can reduce suppression of encoding efficiency.
なお、画像処理部927は、上述した実施形態に係る画像復号装置200(図14)の機能も有する。従って、画像処理部927は、動き予測の制限において動きベクトルの精度が考慮された符号化により生成された符号化データのみ復号するようにすることができる。したがって、画像処理部927は、より確実に符号化効率の低減が抑制された符号化データを復号することができる。したがって、携帯電話機920は、より確実に、符号化効率の低減の抑制を実現することができる。
Note that the image processing unit 927 also has the function of the image decoding device 200 (FIG. 14) according to the above-described embodiment. Therefore, the image processing unit 927 can decode only the encoded data generated by the encoding in which the accuracy of the motion vector is considered in the limitation of motion prediction. Therefore, the image processing unit 927 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Accordingly, the mobile phone 920 can more reliably suppress the reduction in encoding efficiency.
また、以上においては携帯電話機920として説明したが、例えば、PDA(Personal Digital Assistants)、スマートフォン、UMPC(Ultra Mobile Personal Computer)、ネットブック、ノート型パーソナルコンピュータ等、この携帯電話機920と同様の撮像機能や通信機能を有する装置であれば、どのような装置であっても携帯電話機920の場合と同様に、本技術を適用した画像符号化装置および画像復号装置を適用することができる。
In the above description, the mobile phone 920 has been described. For example, an imaging function similar to that of the mobile phone 920 such as a PDA (Personal Digital Assistant), a smartphone, an UMPC (Ultra Mobile Personal Computer), a netbook, a notebook personal computer, or the like. As long as the device has a communication function, the image encoding device and the image decoding device to which the present technology is applied can be applied to any device as in the case of the mobile phone 920.
<6.第6の実施の形態>
[記録再生装置]
図23は、上述した実施形態を適用した記録再生装置の概略的な構成の一例を示している。記録再生装置940は、例えば、受信した放送番組の音声データ及び映像データを符号化して記録媒体に記録する。また、記録再生装置940は、例えば、他の装置から取得される音声データ及び映像データを符号化して記録媒体に記録してもよい。また、記録再生装置940は、例えば、ユーザの指示に応じて、記録媒体に記録されているデータをモニタ及びスピーカ上で再生する。このとき、記録再生装置940は、音声データ及び映像データを復号する。 <6. Sixth Embodiment>
[Recording and playback device]
FIG. 23 shows an example of a schematic configuration of a recording / reproducing apparatus to which the above-described embodiment is applied. For example, the recording / reproducingdevice 940 encodes audio data and video data of a received broadcast program and records the encoded data on a recording medium. In addition, the recording / reproducing device 940 may encode audio data and video data acquired from another device and record them on a recording medium, for example. In addition, the recording / reproducing device 940 reproduces data recorded on the recording medium on a monitor and a speaker, for example, in accordance with a user instruction. At this time, the recording / reproducing device 940 decodes the audio data and the video data.
[記録再生装置]
図23は、上述した実施形態を適用した記録再生装置の概略的な構成の一例を示している。記録再生装置940は、例えば、受信した放送番組の音声データ及び映像データを符号化して記録媒体に記録する。また、記録再生装置940は、例えば、他の装置から取得される音声データ及び映像データを符号化して記録媒体に記録してもよい。また、記録再生装置940は、例えば、ユーザの指示に応じて、記録媒体に記録されているデータをモニタ及びスピーカ上で再生する。このとき、記録再生装置940は、音声データ及び映像データを復号する。 <6. Sixth Embodiment>
[Recording and playback device]
FIG. 23 shows an example of a schematic configuration of a recording / reproducing apparatus to which the above-described embodiment is applied. For example, the recording / reproducing
記録再生装置940は、チューナ941、外部インタフェース942、エンコーダ943、HDD(Hard Disk Drive)944、ディスクドライブ945、セレクタ946、デコーダ947、OSD(On-Screen Display)948、制御部949、及びユーザインタフェース950を備える。
The recording / reproducing apparatus 940 includes a tuner 941, an external interface 942, an encoder 943, an HDD (Hard Disk Drive) 944, a disk drive 945, a selector 946, a decoder 947, an OSD (On-Screen Display) 948, a control unit 949, and a user interface. 950.
チューナ941は、アンテナ(図示せず)を介して受信される放送信号から所望のチャンネルの信号を抽出し、抽出した信号を復調する。そして、チューナ941は、復調により得られた符号化ビットストリームをセレクタ946へ出力する。即ち、チューナ941は、記録再生装置940における伝送部としての役割を有する。
Tuner 941 extracts a signal of a desired channel from a broadcast signal received via an antenna (not shown), and demodulates the extracted signal. Then, the tuner 941 outputs the encoded bit stream obtained by the demodulation to the selector 946. That is, the tuner 941 serves as a transmission unit in the recording / reproducing apparatus 940.
外部インタフェース942は、記録再生装置940と外部機器又はネットワークとを接続するためのインタフェースである。外部インタフェース942は、例えば、IEEE1394インタフェース、ネットワークインタフェース、USBインタフェース、又はフラッシュメモリインタフェースなどであってよい。例えば、外部インタフェース942を介して受信される映像データ及び音声データは、エンコーダ943へ入力される。即ち、外部インタフェース942は、記録再生装置940における伝送部としての役割を有する。
The external interface 942 is an interface for connecting the recording / reproducing apparatus 940 to an external device or a network. The external interface 942 may be, for example, an IEEE1394 interface, a network interface, a USB interface, or a flash memory interface. For example, video data and audio data received via the external interface 942 are input to the encoder 943. That is, the external interface 942 serves as a transmission unit in the recording / reproducing device 940.
エンコーダ943は、外部インタフェース942から入力される映像データ及び音声データが符号化されていない場合に、映像データ及び音声データを符号化する。そして、エンコーダ943は、符号化ビットストリームをセレクタ946へ出力する。
The encoder 943 encodes video data and audio data when the video data and audio data input from the external interface 942 are not encoded. Then, the encoder 943 outputs the encoded bit stream to the selector 946.
HDD944は、映像及び音声などのコンテンツデータが圧縮された符号化ビットストリーム、各種プログラムおよびその他のデータを内部のハードディスクに記録する。また、HDD944は、映像及び音声の再生時に、これらデータをハードディスクから読み出す。
The HDD 944 records an encoded bit stream in which content data such as video and audio is compressed, various programs, and other data on an internal hard disk. Further, the HDD 944 reads out these data from the hard disk when reproducing video and audio.
ディスクドライブ945は、装着されている記録媒体へのデータの記録及び読み出しを行う。ディスクドライブ945に装着される記録媒体は、例えばDVDディスク(DVD-Video、DVD-RAM、DVD-R、DVD-RW、DVD+R、DVD+RW等)又はBlu-ray(登録商標)ディスクなどであってよい。
The disk drive 945 performs recording and reading of data to and from the mounted recording medium. The recording medium mounted on the disk drive 945 is, for example, a DVD disk (DVD-Video, DVD-RAM, DVD-R, DVD-RW, DVD + R, DVD + RW, etc.) or a Blu-ray (registered trademark) disk. It may be.
セレクタ946は、映像及び音声の記録時には、チューナ941又はエンコーダ943から入力される符号化ビットストリームを選択し、選択した符号化ビットストリームをHDD944又はディスクドライブ945へ出力する。また、セレクタ946は、映像及び音声の再生時には、HDD944又はディスクドライブ945から入力される符号化ビットストリームをデコーダ947へ出力する。
The selector 946 selects an encoded bit stream input from the tuner 941 or the encoder 943 when recording video and audio, and outputs the selected encoded bit stream to the HDD 944 or the disk drive 945. In addition, the selector 946 outputs the encoded bit stream input from the HDD 944 or the disk drive 945 to the decoder 947 during video and audio reproduction.
デコーダ947は、符号化ビットストリームを復号し、映像データ及び音声データを生成する。そして、デコーダ947は、生成した映像データをOSD948へ出力する。また、デコーダ904は、生成した音声データを外部のスピーカへ出力する。
The decoder 947 decodes the encoded bit stream and generates video data and audio data. Then, the decoder 947 outputs the generated video data to the OSD 948. The decoder 904 outputs the generated audio data to an external speaker.
OSD948は、デコーダ947から入力される映像データを再生し、映像を表示する。また、OSD948は、表示する映像に、例えばメニュー、ボタン又はカーソルなどのGUIの画像を重畳してもよい。
OSD 948 reproduces the video data input from the decoder 947 and displays the video. Further, the OSD 948 may superimpose a GUI image such as a menu, a button, or a cursor on the video to be displayed.
制御部949は、CPUなどのプロセッサ、並びにRAM及びROMなどのメモリを有する。メモリは、CPUにより実行されるプログラム、及びプログラムデータなどを記憶する。メモリにより記憶されるプログラムは、例えば、記録再生装置940の起動時にCPUにより読み込まれ、実行される。CPUは、プログラムを実行することにより、例えばユーザインタフェース950から入力される操作信号に応じて、記録再生装置940の動作を制御する。
The control unit 949 includes a processor such as a CPU and memories such as a RAM and a ROM. The memory stores a program executed by the CPU, program data, and the like. The program stored in the memory is read and executed by the CPU when the recording / reproducing apparatus 940 is activated, for example. The CPU controls the operation of the recording / reproducing apparatus 940 in accordance with an operation signal input from the user interface 950, for example, by executing the program.
ユーザインタフェース950は、制御部949と接続される。ユーザインタフェース950は、例えば、ユーザが記録再生装置940を操作するためのボタン及びスイッチ、並びに遠隔制御信号の受信部などを有する。ユーザインタフェース950は、これら構成要素を介してユーザによる操作を検出して操作信号を生成し、生成した操作信号を制御部949へ出力する。
The user interface 950 is connected to the control unit 949. The user interface 950 includes, for example, buttons and switches for the user to operate the recording / reproducing device 940, a remote control signal receiving unit, and the like. The user interface 950 detects an operation by the user via these components, generates an operation signal, and outputs the generated operation signal to the control unit 949.
このように構成された記録再生装置940において、エンコーダ943は、上述した実施形態に係る画像符号化装置100(図1)の機能を有する。したがって、エンコーダ943は、動きベクトルの精度を考慮して動き予測を制限することができる。したがって、エンコーダ943は、符号化効率の低減を抑制しながら負荷の増大を抑制し、画像を符号化することができる。したがって、記録再生装置940は、符号化効率の抑制を低減させることができる。
In the thus configured recording / reproducing apparatus 940, the encoder 943 has the function of the image encoding apparatus 100 (FIG. 1) according to the above-described embodiment. Therefore, the encoder 943 can limit motion prediction in consideration of the accuracy of the motion vector. Therefore, the encoder 943 can encode an image while suppressing an increase in load while suppressing a decrease in encoding efficiency. Therefore, the recording / reproducing apparatus 940 can reduce suppression of encoding efficiency.
また、このように構成された記録再生装置940において、デコーダ947は、上述した実施形態に係る画像復号装置200(図14)の機能を有する。したがって、デコーダ947は、動き予測の制限において動きベクトルの精度が考慮された符号化により生成された符号化データのみ復号するようにすることができる。したがって、デコーダ947は、より確実に符号化効率の低減が抑制された符号化データを復号することができる。したがって、記録再生装置940は、より確実に、符号化効率の低減の抑制を実現することができる。
Further, in the recording / reproducing apparatus 940 configured as described above, the decoder 947 has the function of the image decoding apparatus 200 (FIG. 14) according to the above-described embodiment. Therefore, the decoder 947 can decode only the encoded data generated by the encoding in which the accuracy of the motion vector is considered in the limitation of motion prediction. Therefore, the decoder 947 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the recording / reproducing apparatus 940 can more reliably suppress the reduction in encoding efficiency.
<7.第7の実施の形態>
[撮像装置]
図24は、上述した実施形態を適用した撮像装置の概略的な構成の一例を示している。撮像装置960は、被写体を撮像して画像を生成し、画像データを符号化して記録媒体に記録する。 <7. Seventh Embodiment>
[Imaging device]
FIG. 24 illustrates an example of a schematic configuration of an imaging apparatus to which the above-described embodiment is applied. Theimaging device 960 images a subject to generate an image, encodes the image data, and records it on a recording medium.
[撮像装置]
図24は、上述した実施形態を適用した撮像装置の概略的な構成の一例を示している。撮像装置960は、被写体を撮像して画像を生成し、画像データを符号化して記録媒体に記録する。 <7. Seventh Embodiment>
[Imaging device]
FIG. 24 illustrates an example of a schematic configuration of an imaging apparatus to which the above-described embodiment is applied. The
撮像装置960は、光学ブロック961、撮像部962、信号処理部963、画像処理部964、表示部965、外部インタフェース966、メモリ967、メディアドライブ968、OSD969、制御部970、ユーザインタフェース971、及びバス972を備える。
The imaging device 960 includes an optical block 961, an imaging unit 962, a signal processing unit 963, an image processing unit 964, a display unit 965, an external interface 966, a memory 967, a media drive 968, an OSD 969, a control unit 970, a user interface 971, and a bus. 972.
光学ブロック961は、撮像部962に接続される。撮像部962は、信号処理部963に接続される。表示部965は、画像処理部964に接続される。ユーザインタフェース971は、制御部970に接続される。バス972は、画像処理部964、外部インタフェース966、メモリ967、メディアドライブ968、OSD969、及び制御部970を相互に接続する。
The optical block 961 is connected to the imaging unit 962. The imaging unit 962 is connected to the signal processing unit 963. The display unit 965 is connected to the image processing unit 964. The user interface 971 is connected to the control unit 970. The bus 972 connects the image processing unit 964, the external interface 966, the memory 967, the media drive 968, the OSD 969, and the control unit 970 to each other.
光学ブロック961は、フォーカスレンズ及び絞り機構などを有する。光学ブロック961は、被写体の光学像を撮像部962の撮像面に結像させる。撮像部962は、CCD又はCMOSなどのイメージセンサを有し、撮像面に結像した光学像を光電変換によって電気信号としての画像信号に変換する。そして、撮像部962は、画像信号を信号処理部963へ出力する。
The optical block 961 includes a focus lens and a diaphragm mechanism. The optical block 961 forms an optical image of the subject on the imaging surface of the imaging unit 962. The imaging unit 962 includes an image sensor such as a CCD or a CMOS, and converts an optical image formed on the imaging surface into an image signal as an electrical signal by photoelectric conversion. Then, the imaging unit 962 outputs the image signal to the signal processing unit 963.
信号処理部963は、撮像部962から入力される画像信号に対してニー補正、ガンマ補正、色補正などの種々のカメラ信号処理を行う。信号処理部963は、カメラ信号処理後の画像データを画像処理部964へ出力する。
The signal processing unit 963 performs various camera signal processing such as knee correction, gamma correction, and color correction on the image signal input from the imaging unit 962. The signal processing unit 963 outputs the image data after the camera signal processing to the image processing unit 964.
画像処理部964は、信号処理部963から入力される画像データを符号化し、符号化データを生成する。そして、画像処理部964は、生成した符号化データを外部インタフェース966又はメディアドライブ968へ出力する。また、画像処理部964は、外部インタフェース966又はメディアドライブ968から入力される符号化データを復号し、画像データを生成する。そして、画像処理部964は、生成した画像データを表示部965へ出力する。また、画像処理部964は、信号処理部963から入力される画像データを表示部965へ出力して画像を表示させてもよい。また、画像処理部964は、OSD969から取得される表示用データを、表示部965へ出力する画像に重畳してもよい。
The image processing unit 964 encodes the image data input from the signal processing unit 963 and generates encoded data. Then, the image processing unit 964 outputs the generated encoded data to the external interface 966 or the media drive 968. The image processing unit 964 also decodes encoded data input from the external interface 966 or the media drive 968 to generate image data. Then, the image processing unit 964 outputs the generated image data to the display unit 965. In addition, the image processing unit 964 may display the image by outputting the image data input from the signal processing unit 963 to the display unit 965. Further, the image processing unit 964 may superimpose display data acquired from the OSD 969 on an image output to the display unit 965.
OSD969は、例えばメニュー、ボタン又はカーソルなどのGUIの画像を生成して、生成した画像を画像処理部964へ出力する。
The OSD 969 generates a GUI image such as a menu, a button, or a cursor, and outputs the generated image to the image processing unit 964.
外部インタフェース966は、例えばUSB入出力端子として構成される。外部インタフェース966は、例えば、画像の印刷時に、撮像装置960とプリンタとを接続する。また、外部インタフェース966には、必要に応じてドライブが接続される。ドライブには、例えば、磁気ディスク又は光ディスクなどのリムーバブルメディアが装着され、リムーバブルメディアから読み出されるプログラムが、撮像装置960にインストールされ得る。さらに、外部インタフェース966は、LAN又はインターネットなどのネットワークに接続されるネットワークインタフェースとして構成されてもよい。即ち、外部インタフェース966は、撮像装置960における伝送部としての役割を有する。
The external interface 966 is configured as a USB input / output terminal, for example. The external interface 966 connects the imaging device 960 and a printer, for example, when printing an image. Further, a drive is connected to the external interface 966 as necessary. For example, a removable medium such as a magnetic disk or an optical disk is attached to the drive, and a program read from the removable medium can be installed in the imaging device 960. Further, the external interface 966 may be configured as a network interface connected to a network such as a LAN or the Internet. That is, the external interface 966 has a role as a transmission unit in the imaging device 960.
メディアドライブ968に装着される記録媒体は、例えば、磁気ディスク、光磁気ディスク、光ディスク、又は半導体メモリなどの、読み書き可能な任意のリムーバブルメディアであってよい。また、メディアドライブ968に記録媒体が固定的に装着され、例えば、内蔵型ハードディスクドライブ又はSSD(Solid State Drive)のような非可搬性の記憶部が構成されてもよい。
The recording medium mounted on the media drive 968 may be any readable / writable removable medium such as a magnetic disk, a magneto-optical disk, an optical disk, or a semiconductor memory. In addition, a recording medium may be fixedly mounted on the media drive 968, and a non-portable storage unit such as an internal hard disk drive or an SSD (Solid State Drive) may be configured.
制御部970は、CPUなどのプロセッサ、並びにRAM及びROMなどのメモリを有する。メモリは、CPUにより実行されるプログラム、及びプログラムデータなどを記憶する。メモリにより記憶されるプログラムは、例えば、撮像装置960の起動時にCPUにより読み込まれ、実行される。CPUは、プログラムを実行することにより、例えばユーザインタフェース971から入力される操作信号に応じて、撮像装置960の動作を制御する。
The control unit 970 includes a processor such as a CPU and memories such as a RAM and a ROM. The memory stores a program executed by the CPU, program data, and the like. The program stored in the memory is read and executed by the CPU when the imaging device 960 is activated, for example. For example, the CPU controls the operation of the imaging device 960 according to an operation signal input from the user interface 971 by executing the program.
ユーザインタフェース971は、制御部970と接続される。ユーザインタフェース971は、例えば、ユーザが撮像装置960を操作するためのボタン及びスイッチなどを有する。ユーザインタフェース971は、これら構成要素を介してユーザによる操作を検出して操作信号を生成し、生成した操作信号を制御部970へ出力する。
The user interface 971 is connected to the control unit 970. The user interface 971 includes, for example, buttons and switches for the user to operate the imaging device 960. The user interface 971 detects an operation by the user via these components, generates an operation signal, and outputs the generated operation signal to the control unit 970.
このように構成された撮像装置960において、画像処理部964は、上述した実施形態に係る画像符号化装置100(図1)の機能を有する。したがって、画像処理部964は、動きベクトルの精度を考慮して動き予測を制限することができる。したがって、画像処理部964は、符号化効率の低減を抑制しながら負荷の増大を抑制し、画像を符号化することができる。したがって、撮像装置960は、符号化効率の抑制を低減させることができる。
In the imaging device 960 configured as described above, the image processing unit 964 has the function of the image encoding device 100 (FIG. 1) according to the above-described embodiment. Therefore, the image processing unit 964 can limit the motion prediction in consideration of the accuracy of the motion vector. Accordingly, the image processing unit 964 can encode an image while suppressing an increase in load while suppressing a decrease in encoding efficiency. Therefore, the imaging device 960 can reduce suppression of encoding efficiency.
なお、画像処理部964は、上述した実施形態に係る画像復号装置200(図14)の機能も有する。したがって、画像処理部964は、動き予測の制限において動きベクトルの精度が考慮された符号化により生成された符号化データのみ復号するようにすることができる。したがって、画像処理部964は、より確実に符号化効率の低減が抑制された符号化データを復号することができる。したがって、撮像装置960は、より確実に、符号化効率の低減の抑制を実現することができる。
Note that the image processing unit 964 also has the function of the image decoding device 200 (FIG. 14) according to the above-described embodiment. Therefore, the image processing unit 964 can decode only the encoded data generated by the encoding in which the accuracy of the motion vector is considered in the limitation of motion prediction. Therefore, the image processing unit 964 can more reliably decode the encoded data in which the reduction in encoding efficiency is suppressed. Therefore, the imaging device 960 can more reliably suppress the reduction in encoding efficiency.
もちろん、本技術を適用した画像符号化装置および画像復号装置は、上述した装置以外の装置やシステムにも適用可能である。
Of course, the image encoding device and the image decoding device to which the present technology is applied can be applied to devices and systems other than the above-described devices.
なお、本明細書では、量子化パラメータが、符号化側から復号側へ伝送される例について説明した。量子化行列パラメータを伝送する手法は、符号化ビットストリームに多重化されることなく、符号化ビットストリームと関連付けられた別個のデータとして伝送され又は記録されてもよい。ここで、「関連付ける」という用語は、ビットストリームに含まれる画像(スライス若しくはブロックなど、画像の一部であってもよい)と当該画像に対応する情報とを復号時にリンクさせ得るようにすることを意味する。即ち、情報は、画像(又はビットストリーム)とは別の伝送路上で伝送されてもよい。また、情報は、画像(又はビットストリーム)とは別の記録媒体(又は同一の記録媒体の別の記録エリア)に記録されてもよい。さらに、情報と画像(又はビットストリーム)とは、例えば、複数フレーム、1フレーム、又はフレーム内の一部分などの任意の単位で互いに関連付けられてよい。
In this specification, the example in which the quantization parameter is transmitted from the encoding side to the decoding side has been described. Techniques for transmitting quantization matrix parameters may be transmitted or recorded as separate data associated with the encoded bitstream without being multiplexed into the encoded bitstream. Here, the term “associate” means that an image (which may be a part of an image such as a slice or a block) included in the bitstream and information corresponding to the image can be linked at the time of decoding. Means. That is, information may be transmitted on a transmission path different from that of the image (or bit stream). Information may be recorded on a recording medium (or another recording area of the same recording medium) different from the image (or bit stream). Furthermore, the information and the image (or bit stream) may be associated with each other in an arbitrary unit such as a plurality of frames, one frame, or a part of the frame.
以上、添付図面を参照しながら本開示の好適な実施形態について詳細に説明したが、本開示の技術的範囲はかかる例に限定されない。本開示の技術分野における通常の知識を有する者であれば、請求の範囲に記載された技術的思想の範疇内において、各種の変更例または修正例に想到し得ることは明らかであり、これらについても、当然に本開示の技術的範囲に属するものと了解される。
The preferred embodiments of the present disclosure have been described in detail above with reference to the accompanying drawings, but the technical scope of the present disclosure is not limited to such examples. It is obvious that a person having ordinary knowledge in the technical field of the present disclosure can come up with various changes or modifications within the scope of the technical idea described in the claims. Of course, it is understood that it belongs to the technical scope of the present disclosure.
なお、本技術は以下のような構成も取ることができる。
(1) 動き補償を行うインター予測モードの候補を、動きベクトルの精度に基づいて制御する制御部と、
前記制御部により前記候補とすることを許可されたインター予測モードでのみ動き探索を行い、動きベクトルを生成する動き探索部と、
前記動き探索部により動きベクトルが生成された各候補の中から選択されたインター予測モードで前記動き補償を行い、予測画像を生成する動き補償部と
を備える画像処理装置。
(2) 前記制御部は、前記動きベクトルの精度が整数精度であるか分数精度であるかによって、インター予測モードを前記候補とするか否かを制御する
前記(1)に記載の画像処理装置。
(3) 前記制御部は、前記動きベクトルの縦と横の各方向の精度が整数精度であるか分数精度であるかによって、インター予測モードを前記候補とするか否かを制御する
前記(2)に記載の画像処理装置。
(4) 前記制御部は、さらに、予測モードが単予測であるか双予測であるかによって、インター予測モードを前記候補とするか否かを制御する
前記(3)に記載の画像処理装置。
(5) 前記制御部は、さらに、予測処理単位とする領域の大きさによって、インター予測モードを前記候補とするか否かを制御する
前記(4)に記載の画像処理装置。
(6) 前記制御部は、前記動きベクトルの精度に応じて決定されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、インター予測モードを前記候補とするか否かを制御する
前記(1)乃至(5)のいずれかに記載の画像処理装置。
(7) 前記制御部による制御の内容を示す伝送情報を生成し、伝送させる伝送部をさらに備える
前記(1)乃至(6)のいずれかに記載の画像処理装置。
(8) 前記伝送部は、前記伝送情報として、プロファイルとレベルを定義し、前記プロファイルおよび前記レベルを示す情報をシーケンスパラメータセットに含めて伝送させる
前記(7)に記載の画像処理装置。
(9) 前記動き探索部により動きベクトルが生成された各候補について、コスト関数値を算出するコスト関数算出部と、
前記コスト関す算出部により算出された前記コスト関数値に基づいて、前記動き補償を行うインター予測モードを前記候補の中から選択するモード選択部と
をさらに備え、
前記動き補償部は、前記モード選択部により選択されたインター予測モードで前記動き補償を行う
前記(1)乃至(8)のいずれかに記載の画像処理装置。
(10) 画像処理装置の画像処理方法であって、
制御部が、動き補償を行うインター予測モードの候補を、動きベクトルの精度に基づいて制御し、
動き探索部が、前記候補とすることを許可されたインター予測モードでのみ動き探索を行い、動きベクトルを生成し、
動き補償部が、動きベクトルが生成された各候補の中から選択されたインター予測モードで前記動き補償を行い、予測画像を生成する
画像処理方法。
(11) 画像データを符号化する際のインター予測において用いられた、動き補償を行うインター予測モードの候補を動きベクトルの精度に基づいて制御する制御情報を取得する取得部と、
前記取得部により取得された前記制御情報に基づいて、前記画像データが符号化されて生成された符号化データの復号の可否判定を行う判定部と、
前記判定部による判定結果に基づいて、前記符号化データの復号を制御する制御部と
を備える画像処理装置。
(12) 前記判定部は、前記取得部により取得された前記制御情報において指定されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、前記可否判定を行う
前記(11)に記載の画像処理装置。
(13) 前記判定部は、プロファイルとレベルによって定義されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、前記可否判定を行う
前記(12)に記載の画像処理装置。
(14) 前記制御部は、前記判定部により復号可能と判定された場合のみ、前記符号化データの復号を実行させる
前記(11)乃至(13)のいずれかに記載の画像処理装置。
(15) 前記判定部により復号不可能と判定された場合、前記制御部に制御されて、エラー処理を行うエラー処理部をさらに備える
前記(11)乃至(14)のいずれかに記載の画像処理装置。
(16) 画像処理装置の画像処理方法であって、
取得部が、画像データを符号化する際のインター予測において用いられた、動き補償を行うインター予測モードの候補を動きベクトルの精度に基づいて制御する制御情報を取得し、
判定部が、取得された前記制御情報に基づいて、前記画像データが符号化されて生成された符号化データの復号の可否判定を行い、
制御部が、その判定結果に基づいて、前記符号化データの復号を制御する
画像処理方法。 In addition, this technique can also take the following structures.
(1) A control unit that controls inter prediction mode candidates for performing motion compensation based on the accuracy of motion vectors;
A motion search unit that performs a motion search only in the inter prediction mode permitted to be the candidate by the control unit, and generates a motion vector;
An image processing apparatus comprising: a motion compensation unit that performs the motion compensation in an inter prediction mode selected from the candidates for which motion vectors are generated by the motion search unit, and generates a predicted image.
(2) The image processing apparatus according to (1), wherein the control unit controls whether or not an inter prediction mode is set as the candidate depending on whether the accuracy of the motion vector is integer accuracy or fractional accuracy. .
(3) The control unit controls whether or not the inter prediction mode is set as the candidate depending on whether the vertical and horizontal accuracy of the motion vector is integer accuracy or fractional accuracy. ).
(4) The image processing device according to (3), wherein the control unit further controls whether or not the inter prediction mode is set as the candidate depending on whether the prediction mode is uni-prediction or bi-prediction.
(5) The image processing device according to (4), wherein the control unit further controls whether or not an inter prediction mode is set as the candidate according to a size of a region serving as a prediction processing unit.
(6) The control unit determines the inter prediction mode depending on whether or not the information regarding the amount of data read from the memory determined according to the accuracy of the motion vector exceeds an upper limit due to a resource, a user instruction, or the like. The image processing apparatus according to any one of (1) to (5), wherein the image processing apparatus controls whether or not the candidate is used.
(7) The image processing apparatus according to any one of (1) to (6), further including: a transmission unit that generates and transmits transmission information indicating the content of control by the control unit.
(8) The image processing device according to (7), wherein the transmission unit defines a profile and a level as the transmission information, and transmits information indicating the profile and the level in a sequence parameter set.
(9) a cost function calculation unit that calculates a cost function value for each candidate for which a motion vector is generated by the motion search unit;
A mode selection unit that selects an inter prediction mode for performing the motion compensation from the candidates based on the cost function value calculated by the cost calculation unit;
The image processing device according to any one of (1) to (8), wherein the motion compensation unit performs the motion compensation in an inter prediction mode selected by the mode selection unit.
(10) An image processing method for an image processing apparatus,
The control unit controls inter prediction mode candidates for motion compensation based on the accuracy of the motion vector,
The motion search unit performs a motion search only in the inter prediction mode permitted to be the candidate, generates a motion vector,
An image processing method in which a motion compensation unit performs the motion compensation in an inter prediction mode selected from the candidates for which motion vectors are generated, and generates a predicted image.
(11) An acquisition unit that acquires control information that is used in inter prediction when encoding image data and controls inter prediction mode candidates for motion compensation based on the accuracy of motion vectors;
A determination unit that determines whether or not decoding of encoded data generated by encoding the image data is based on the control information acquired by the acquisition unit;
An image processing apparatus comprising: a control unit that controls decoding of the encoded data based on a determination result by the determination unit.
(12) The determination unit determines whether the information on the amount of data read from the memory specified in the control information acquired by the acquisition unit exceeds an upper limit due to a resource, a user instruction, or the like. The image processing apparatus according to (11), wherein determination is performed.
(13) The determination unit performs the determination as to whether the information on the amount of data read from the memory defined by the profile and the level exceeds an upper limit based on a resource, a user instruction, or the like. ).
(14) The image processing device according to any one of (11) to (13), wherein the control unit executes decoding of the encoded data only when the determination unit determines that decoding is possible.
(15) The image processing according to any one of (11) to (14), further including an error processing unit that is controlled by the control unit and performs error processing when the determination unit determines that decoding is impossible. apparatus.
(16) An image processing method for an image processing apparatus,
The acquisition unit acquires control information used in inter prediction when encoding image data, and controls inter prediction mode candidates for motion compensation based on the accuracy of a motion vector,
The determination unit determines whether or not to decode the encoded data generated by encoding the image data based on the acquired control information,
An image processing method in which a control unit controls decoding of the encoded data based on the determination result.
(1) 動き補償を行うインター予測モードの候補を、動きベクトルの精度に基づいて制御する制御部と、
前記制御部により前記候補とすることを許可されたインター予測モードでのみ動き探索を行い、動きベクトルを生成する動き探索部と、
前記動き探索部により動きベクトルが生成された各候補の中から選択されたインター予測モードで前記動き補償を行い、予測画像を生成する動き補償部と
を備える画像処理装置。
(2) 前記制御部は、前記動きベクトルの精度が整数精度であるか分数精度であるかによって、インター予測モードを前記候補とするか否かを制御する
前記(1)に記載の画像処理装置。
(3) 前記制御部は、前記動きベクトルの縦と横の各方向の精度が整数精度であるか分数精度であるかによって、インター予測モードを前記候補とするか否かを制御する
前記(2)に記載の画像処理装置。
(4) 前記制御部は、さらに、予測モードが単予測であるか双予測であるかによって、インター予測モードを前記候補とするか否かを制御する
前記(3)に記載の画像処理装置。
(5) 前記制御部は、さらに、予測処理単位とする領域の大きさによって、インター予測モードを前記候補とするか否かを制御する
前記(4)に記載の画像処理装置。
(6) 前記制御部は、前記動きベクトルの精度に応じて決定されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、インター予測モードを前記候補とするか否かを制御する
前記(1)乃至(5)のいずれかに記載の画像処理装置。
(7) 前記制御部による制御の内容を示す伝送情報を生成し、伝送させる伝送部をさらに備える
前記(1)乃至(6)のいずれかに記載の画像処理装置。
(8) 前記伝送部は、前記伝送情報として、プロファイルとレベルを定義し、前記プロファイルおよび前記レベルを示す情報をシーケンスパラメータセットに含めて伝送させる
前記(7)に記載の画像処理装置。
(9) 前記動き探索部により動きベクトルが生成された各候補について、コスト関数値を算出するコスト関数算出部と、
前記コスト関す算出部により算出された前記コスト関数値に基づいて、前記動き補償を行うインター予測モードを前記候補の中から選択するモード選択部と
をさらに備え、
前記動き補償部は、前記モード選択部により選択されたインター予測モードで前記動き補償を行う
前記(1)乃至(8)のいずれかに記載の画像処理装置。
(10) 画像処理装置の画像処理方法であって、
制御部が、動き補償を行うインター予測モードの候補を、動きベクトルの精度に基づいて制御し、
動き探索部が、前記候補とすることを許可されたインター予測モードでのみ動き探索を行い、動きベクトルを生成し、
動き補償部が、動きベクトルが生成された各候補の中から選択されたインター予測モードで前記動き補償を行い、予測画像を生成する
画像処理方法。
(11) 画像データを符号化する際のインター予測において用いられた、動き補償を行うインター予測モードの候補を動きベクトルの精度に基づいて制御する制御情報を取得する取得部と、
前記取得部により取得された前記制御情報に基づいて、前記画像データが符号化されて生成された符号化データの復号の可否判定を行う判定部と、
前記判定部による判定結果に基づいて、前記符号化データの復号を制御する制御部と
を備える画像処理装置。
(12) 前記判定部は、前記取得部により取得された前記制御情報において指定されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、前記可否判定を行う
前記(11)に記載の画像処理装置。
(13) 前記判定部は、プロファイルとレベルによって定義されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、前記可否判定を行う
前記(12)に記載の画像処理装置。
(14) 前記制御部は、前記判定部により復号可能と判定された場合のみ、前記符号化データの復号を実行させる
前記(11)乃至(13)のいずれかに記載の画像処理装置。
(15) 前記判定部により復号不可能と判定された場合、前記制御部に制御されて、エラー処理を行うエラー処理部をさらに備える
前記(11)乃至(14)のいずれかに記載の画像処理装置。
(16) 画像処理装置の画像処理方法であって、
取得部が、画像データを符号化する際のインター予測において用いられた、動き補償を行うインター予測モードの候補を動きベクトルの精度に基づいて制御する制御情報を取得し、
判定部が、取得された前記制御情報に基づいて、前記画像データが符号化されて生成された符号化データの復号の可否判定を行い、
制御部が、その判定結果に基づいて、前記符号化データの復号を制御する
画像処理方法。 In addition, this technique can also take the following structures.
(1) A control unit that controls inter prediction mode candidates for performing motion compensation based on the accuracy of motion vectors;
A motion search unit that performs a motion search only in the inter prediction mode permitted to be the candidate by the control unit, and generates a motion vector;
An image processing apparatus comprising: a motion compensation unit that performs the motion compensation in an inter prediction mode selected from the candidates for which motion vectors are generated by the motion search unit, and generates a predicted image.
(2) The image processing apparatus according to (1), wherein the control unit controls whether or not an inter prediction mode is set as the candidate depending on whether the accuracy of the motion vector is integer accuracy or fractional accuracy. .
(3) The control unit controls whether or not the inter prediction mode is set as the candidate depending on whether the vertical and horizontal accuracy of the motion vector is integer accuracy or fractional accuracy. ).
(4) The image processing device according to (3), wherein the control unit further controls whether or not the inter prediction mode is set as the candidate depending on whether the prediction mode is uni-prediction or bi-prediction.
(5) The image processing device according to (4), wherein the control unit further controls whether or not an inter prediction mode is set as the candidate according to a size of a region serving as a prediction processing unit.
(6) The control unit determines the inter prediction mode depending on whether or not the information regarding the amount of data read from the memory determined according to the accuracy of the motion vector exceeds an upper limit due to a resource, a user instruction, or the like. The image processing apparatus according to any one of (1) to (5), wherein the image processing apparatus controls whether or not the candidate is used.
(7) The image processing apparatus according to any one of (1) to (6), further including: a transmission unit that generates and transmits transmission information indicating the content of control by the control unit.
(8) The image processing device according to (7), wherein the transmission unit defines a profile and a level as the transmission information, and transmits information indicating the profile and the level in a sequence parameter set.
(9) a cost function calculation unit that calculates a cost function value for each candidate for which a motion vector is generated by the motion search unit;
A mode selection unit that selects an inter prediction mode for performing the motion compensation from the candidates based on the cost function value calculated by the cost calculation unit;
The image processing device according to any one of (1) to (8), wherein the motion compensation unit performs the motion compensation in an inter prediction mode selected by the mode selection unit.
(10) An image processing method for an image processing apparatus,
The control unit controls inter prediction mode candidates for motion compensation based on the accuracy of the motion vector,
The motion search unit performs a motion search only in the inter prediction mode permitted to be the candidate, generates a motion vector,
An image processing method in which a motion compensation unit performs the motion compensation in an inter prediction mode selected from the candidates for which motion vectors are generated, and generates a predicted image.
(11) An acquisition unit that acquires control information that is used in inter prediction when encoding image data and controls inter prediction mode candidates for motion compensation based on the accuracy of motion vectors;
A determination unit that determines whether or not decoding of encoded data generated by encoding the image data is based on the control information acquired by the acquisition unit;
An image processing apparatus comprising: a control unit that controls decoding of the encoded data based on a determination result by the determination unit.
(12) The determination unit determines whether the information on the amount of data read from the memory specified in the control information acquired by the acquisition unit exceeds an upper limit due to a resource, a user instruction, or the like. The image processing apparatus according to (11), wherein determination is performed.
(13) The determination unit performs the determination as to whether the information on the amount of data read from the memory defined by the profile and the level exceeds an upper limit based on a resource, a user instruction, or the like. ).
(14) The image processing device according to any one of (11) to (13), wherein the control unit executes decoding of the encoded data only when the determination unit determines that decoding is possible.
(15) The image processing according to any one of (11) to (14), further including an error processing unit that is controlled by the control unit and performs error processing when the determination unit determines that decoding is impossible. apparatus.
(16) An image processing method for an image processing apparatus,
The acquisition unit acquires control information used in inter prediction when encoding image data, and controls inter prediction mode candidates for motion compensation based on the accuracy of a motion vector,
The determination unit determines whether or not to decode the encoded data generated by encoding the image data based on the acquired control information,
An image processing method in which a control unit controls decoding of the encoded data based on the determination result.
100 画像符号化装置, 115 動き予測・補償部, 121 動きベクトル制御部, 151 動き探索部, 152 コスト関数算出部, 153 モード判定部, 154 動き補償部, 155 動き情報バッファ, 161 制限情報記憶部, 162 制限制御部, 163 伝送情報提供部, 171 条件取得部, 172 制限制御部, 200 画像復号装置, 212 動き予測・補償部, 221 復号制御部, 251 伝送情報取得部, 252 制限情報記憶部, 253 可否判定部, 254 制御部, 255 エラー処理部, 271 条件取得部, 273 可否判定部
100 image encoding device, 115 motion prediction / compensation unit, 121 motion vector control unit, 151 motion search unit, 152 cost function calculation unit, 153 mode determination unit, 154 motion compensation unit, 155 motion information buffer, 161 restriction information storage unit , 162 restriction control unit, 163 transmission information provision unit, 171 condition acquisition unit, 172 restriction control unit, 200 image decoding device, 212 motion prediction / compensation unit, 221 decoding control unit, 251 transmission information acquisition unit, 252 restriction information storage unit , 253 availability determination unit, 254 control unit, 255 error processing unit, 271 condition acquisition unit, 273 availability determination unit
Claims (16)
- 動き補償を行うインター予測モードの候補を、動きベクトルの精度に基づいて制御する制御部と、
前記制御部により前記候補とすることを許可されたインター予測モードでのみ動き探索を行い、動きベクトルを生成する動き探索部と、
前記動き探索部により動きベクトルが生成された各候補の中から選択されたインター予測モードで前記動き補償を行い、予測画像を生成する動き補償部と
を備える画像処理装置。 A control unit that controls inter prediction mode candidates for performing motion compensation based on the accuracy of motion vectors;
A motion search unit that performs a motion search only in the inter prediction mode permitted to be the candidate by the control unit, and generates a motion vector;
An image processing apparatus comprising: a motion compensation unit that performs the motion compensation in an inter prediction mode selected from the candidates for which motion vectors are generated by the motion search unit, and generates a predicted image. - 前記制御部は、前記動きベクトルの精度が整数精度であるか分数精度であるかによって、インター予測モードを前記候補とするか否かを制御する
請求項1に記載の画像処理装置。 The image processing apparatus according to claim 1, wherein the control unit controls whether or not an inter prediction mode is set as the candidate depending on whether the accuracy of the motion vector is integer precision or fractional precision. - 前記制御部は、前記動きベクトルの縦と横の各方向の精度が整数精度であるか分数精度であるかによって、インター予測モードを前記候補とするか否かを制御する
請求項2に記載の画像処理装置。 The control unit according to claim 2, wherein the control unit controls whether or not the inter prediction mode is set as the candidate depending on whether the vertical and horizontal accuracy of the motion vector is integer accuracy or fractional accuracy. Image processing device. - 前記制御部は、さらに、予測モードが単予測であるか双予測であるかによって、インター予測モードを前記候補とするか否かを制御する
請求項3に記載の画像処理装置。 The image processing apparatus according to claim 3, wherein the control unit further controls whether or not the inter prediction mode is set as the candidate depending on whether the prediction mode is uni-prediction or bi-prediction. - 前記制御部は、さらに、予測処理単位とする領域の大きさによって、インター予測モードを前記候補とするか否かを制御する
請求項4に記載の画像処理装置。 The image processing apparatus according to claim 4, wherein the control unit further controls whether or not an inter prediction mode is set as the candidate, depending on a size of a region serving as a prediction processing unit. - 前記制御部は、前記動きベクトルの精度に応じて決定されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、インター予測モードを前記候補とするか否かを制御する
請求項1に記載の画像処理装置。 The control unit determines the inter prediction mode as the candidate depending on whether information on the amount of data read from the memory determined according to the accuracy of the motion vector exceeds an upper limit due to a resource, a user instruction, or the like. The image processing apparatus according to claim 1, wherein whether or not to perform control is controlled. - 前記制御部による制御の内容を示す伝送情報を生成し、伝送させる伝送部をさらに備える
請求項1に記載の画像処理装置。 The image processing apparatus according to claim 1, further comprising: a transmission unit that generates and transmits transmission information indicating the content of control by the control unit. - 前記伝送部は、前記伝送情報として、プロファイルとレベルを定義し、前記プロファイルおよび前記レベルを示す情報をシーケンスパラメータセットに含めて伝送させる
請求項7に記載の画像処理装置。 The image processing apparatus according to claim 7, wherein the transmission unit defines a profile and a level as the transmission information, and transmits the profile and the information indicating the level in a sequence parameter set. - 前記動き探索部により動きベクトルが生成された各候補について、コスト関数値を算出するコスト関数算出部と、
前記コスト関す算出部により算出された前記コスト関数値に基づいて、前記動き補償を行うインター予測モードを前記候補の中から選択するモード選択部と
をさらに備え、
前記動き補償部は、前記モード選択部により選択されたインター予測モードで前記動き補償を行う
請求項1に記載の画像処理装置。 A cost function calculation unit that calculates a cost function value for each candidate for which a motion vector is generated by the motion search unit;
A mode selection unit that selects an inter prediction mode for performing the motion compensation from the candidates based on the cost function value calculated by the cost calculation unit;
The image processing apparatus according to claim 1, wherein the motion compensation unit performs the motion compensation in an inter prediction mode selected by the mode selection unit. - 画像処理装置の画像処理方法であって、
制御部が、動き補償を行うインター予測モードの候補を、動きベクトルの精度に基づいて制御し、
動き探索部が、前記候補とすることを許可されたインター予測モードでのみ動き探索を行い、動きベクトルを生成し、
動き補償部が、動きベクトルが生成された各候補の中から選択されたインター予測モードで前記動き補償を行い、予測画像を生成する
画像処理方法。 An image processing method of an image processing apparatus,
The control unit controls inter prediction mode candidates for motion compensation based on the accuracy of the motion vector,
The motion search unit performs a motion search only in the inter prediction mode permitted to be the candidate, generates a motion vector,
An image processing method in which a motion compensation unit performs the motion compensation in an inter prediction mode selected from the candidates for which motion vectors are generated, and generates a predicted image. - 画像データを符号化する際のインター予測において用いられた、動き補償を行うインター予測モードの候補を動きベクトルの精度に基づいて制御する制御情報を取得する取得部と、
前記取得部により取得された前記制御情報に基づいて、前記画像データが符号化されて生成された符号化データの復号の可否判定を行う判定部と、
前記判定部による判定結果に基づいて、前記符号化データの復号を制御する制御部と
を備える画像処理装置。 An acquisition unit that acquires control information used in inter prediction when encoding image data to control inter prediction mode candidates for motion compensation based on the accuracy of a motion vector;
A determination unit that determines whether or not decoding of encoded data generated by encoding the image data is based on the control information acquired by the acquisition unit;
An image processing apparatus comprising: a control unit that controls decoding of the encoded data based on a determination result by the determination unit. - 前記判定部は、前記取得部により取得された前記制御情報において指定されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、前記可否判定を行う
請求項11に記載の画像処理装置。 The determination unit performs the determination based on whether or not the information regarding the amount of data read from the memory specified in the control information acquired by the acquisition unit exceeds an upper limit based on a resource or a user instruction. The image processing apparatus according to claim 11. - 前記判定部は、プロファイルとレベルによって定義されるメモリからのデータ読み出し量の大きさに関する情報が、リソースやユーザ指示等による上限を超えるか否かによって、前記可否判定を行う
請求項12に記載の画像処理装置。 13. The determination unit according to claim 12, wherein the determination unit performs the determination of whether or not the information on the amount of data read from the memory defined by the profile and the level exceeds an upper limit based on a resource, a user instruction, or the like. Image processing device. - 前記制御部は、前記判定部により復号可能と判定された場合のみ、前記符号化データの復号を実行させる
請求項11に記載の画像処理装置。 The image processing apparatus according to claim 11, wherein the control unit causes the encoded data to be decoded only when the determination unit determines that decoding is possible. - 前記判定部により復号不可能と判定された場合、前記制御部に制御されて、エラー処理を行うエラー処理部をさらに備える
請求項11に記載の画像処理装置。 The image processing apparatus according to claim 11, further comprising: an error processing unit that performs error processing under the control of the control unit when the determination unit determines that decoding is impossible. - 画像処理装置の画像処理方法であって、
取得部が、画像データを符号化する際のインター予測において用いられた、動き補償を行うインター予測モードの候補を動きベクトルの精度に基づいて制御する制御情報を取得し、
判定部が、取得された前記制御情報に基づいて、前記画像データが符号化されて生成された符号化データの復号の可否判定を行い、
制御部が、その判定結果に基づいて、前記符号化データの復号を制御する
画像処理方法。 An image processing method of an image processing apparatus,
The acquisition unit acquires control information used in inter prediction when encoding image data, and controls inter prediction mode candidates for motion compensation based on the accuracy of a motion vector,
The determination unit determines whether or not to decode the encoded data generated by encoding the image data based on the acquired control information,
An image processing method in which a control unit controls decoding of the encoded data based on the determination result.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/344,239 US10075705B2 (en) | 2011-11-04 | 2012-10-25 | Image processing apparatus and method |
CN201280052450.4A CN103907353A (en) | 2011-11-04 | 2012-10-25 | Image processing device and method |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2011242671A JP2013098933A (en) | 2011-11-04 | 2011-11-04 | Image processing device and method |
JP2011-242671 | 2011-11-04 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013065570A1 true WO2013065570A1 (en) | 2013-05-10 |
Family
ID=48191919
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2012/077581 WO2013065570A1 (en) | 2011-11-04 | 2012-10-25 | Image processing device and method |
Country Status (4)
Country | Link |
---|---|
US (1) | US10075705B2 (en) |
JP (1) | JP2013098933A (en) |
CN (1) | CN103907353A (en) |
WO (1) | WO2013065570A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2014531153A (en) * | 2011-09-14 | 2014-11-20 | サムスン エレクトロニクスカンパニー リミテッド | Video encoding method, video decoding method and apparatus |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017156669A1 (en) * | 2016-03-14 | 2017-09-21 | Mediatek Singapore Pte. Ltd. | Methods for motion vector storage in video coding |
KR102017027B1 (en) * | 2016-05-10 | 2019-09-02 | 한양대학교 산학협력단 | Fast video encoding method and apparatus for the same |
GB2588004B (en) | 2018-06-05 | 2023-03-01 | Beijing Bytedance Network Tech Co Ltd | Interaction between IBC and affine |
JP7096374B2 (en) | 2018-06-21 | 2022-07-05 | 北京字節跳動網絡技術有限公司 | Subblock motion vector inheritance between color components |
TWI739120B (en) | 2018-06-21 | 2021-09-11 | 大陸商北京字節跳動網絡技術有限公司 | Unified constrains for the merge affine mode and the non-merge affine mode |
CN110944170B (en) | 2018-09-24 | 2023-05-02 | 北京字节跳动网络技术有限公司 | Extended Merge Prediction |
CN112970262B (en) | 2018-11-10 | 2024-02-20 | 北京字节跳动网络技术有限公司 | Rounding in Triangular Forecast Mode |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2006174014A (en) * | 2004-12-15 | 2006-06-29 | Nippon Telegr & Teleph Corp <Ntt> | Motion vector search method, motion vector search apparatus, motion vector search program, and computer-readable recording medium storing the program |
JP2010050833A (en) * | 2008-08-22 | 2010-03-04 | Canon Inc | Video coding device |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7327791B1 (en) * | 1999-02-22 | 2008-02-05 | Mitsubishi Denki Kabushiki Kaisha | Video decoding method performing selective error concealment and resynchronization |
US7110459B2 (en) * | 2002-04-10 | 2006-09-19 | Microsoft Corporation | Approximate bicubic filter |
WO2006082690A1 (en) * | 2005-02-01 | 2006-08-10 | Matsushita Electric Industrial Co., Ltd. | Image encoding method and image encoding device |
US8116576B2 (en) * | 2006-03-03 | 2012-02-14 | Panasonic Corporation | Image processing method and image processing device for reconstructing a high-resolution picture from a captured low-resolution picture |
US8160150B2 (en) * | 2007-04-10 | 2012-04-17 | Texas Instruments Incorporated | Method and system for rate distortion optimization |
KR20090004659A (en) * | 2007-07-02 | 2009-01-12 | 엘지전자 주식회사 | Digital broadcasting system and data processing method |
US20120051431A1 (en) * | 2010-08-25 | 2012-03-01 | Qualcomm Incorporated | Motion direction based adaptive motion vector resolution signaling for video coding |
US9049455B2 (en) * | 2010-12-28 | 2015-06-02 | Panasonic Intellectual Property Corporation Of America | Image coding method of coding a current picture with prediction using one or both of a first reference picture list including a first current reference picture for a current block and a second reference picture list including a second current reference picture for the current block |
US9532066B2 (en) * | 2011-01-21 | 2016-12-27 | Qualcomm Incorporated | Motion vector prediction |
-
2011
- 2011-11-04 JP JP2011242671A patent/JP2013098933A/en active Pending
-
2012
- 2012-10-25 US US14/344,239 patent/US10075705B2/en active Active
- 2012-10-25 CN CN201280052450.4A patent/CN103907353A/en active Pending
- 2012-10-25 WO PCT/JP2012/077581 patent/WO2013065570A1/en active Application Filing
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2006174014A (en) * | 2004-12-15 | 2006-06-29 | Nippon Telegr & Teleph Corp <Ntt> | Motion vector search method, motion vector search apparatus, motion vector search program, and computer-readable recording medium storing the program |
JP2010050833A (en) * | 2008-08-22 | 2010-03-04 | Canon Inc | Video coding device |
Non-Patent Citations (1)
Title |
---|
KENJI KONDO ET AL.: "Non-CE3: Report on a restriction for small block", JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG16 WP3 AND ISO/IEC JTC1/SC29/WG11, 7TH MEETING, 21 November 2011 (2011-11-21), GENEVA, CH * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2014531153A (en) * | 2011-09-14 | 2014-11-20 | サムスン エレクトロニクスカンパニー リミテッド | Video encoding method, video decoding method and apparatus |
US9538188B2 (en) | 2011-09-14 | 2017-01-03 | Samsung Electronics Co., Ltd. | Method and device for encoding and decoding video |
US9538184B2 (en) | 2011-09-14 | 2017-01-03 | Samsung Electronics Co., Ltd. | Method and device for encoding and decoding video |
US9538187B2 (en) | 2011-09-14 | 2017-01-03 | Samsung Electronics Co., Ltd. | Method and device for encoding and decoding video |
US9544600B2 (en) | 2011-09-14 | 2017-01-10 | Samsung Electronics Co., Ltd. | Method and device for encoding and decoding video |
US9578332B2 (en) | 2011-09-14 | 2017-02-21 | Samsung Electronics Co., Ltd. | Method and device for encoding and decoding video |
JP2017063468A (en) * | 2011-09-14 | 2017-03-30 | サムスン エレクトロニクス カンパニー リミテッド | Video encoding method, video decoding method and device |
Also Published As
Publication number | Publication date |
---|---|
US10075705B2 (en) | 2018-09-11 |
CN103907353A (en) | 2014-07-02 |
JP2013098933A (en) | 2013-05-20 |
US20140294066A1 (en) | 2014-10-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6419113B2 (en) | Image processing apparatus and method, and program | |
JP5942990B2 (en) | Image processing apparatus and method | |
US10419756B2 (en) | Image processing device and method | |
JP2013150173A (en) | Image processing apparatus and method | |
WO2014013880A1 (en) | Image processing device and method | |
WO2013065570A1 (en) | Image processing device and method | |
WO2013002110A1 (en) | Image processing device and method | |
JP5982734B2 (en) | Image processing apparatus and method | |
WO2012157538A1 (en) | Image processing device and method | |
WO2013108688A1 (en) | Image processing device and method | |
WO2013002108A1 (en) | Image processing device and method | |
WO2012173022A1 (en) | Image processing device and method | |
US20140294312A1 (en) | Image processing device and method | |
US10397583B2 (en) | Image processing apparatus and method | |
WO2013065567A1 (en) | Image processing device and method | |
WO2013002106A1 (en) | Image processing device and method | |
WO2012173063A1 (en) | Image processing device and method | |
US10218969B2 (en) | Image processing device and method using adjusted motion vector accuracy between sub-pixels of reference frames | |
JP2018029347A (en) | Image processing apparatus and method | |
WO2013002111A1 (en) | Image processing device and method | |
JP6217997B2 (en) | Image processing apparatus and method | |
JP2013150124A (en) | Image processing apparatus and method | |
JP2019146225A (en) | Image processing device and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12846681 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14344239 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 12846681 Country of ref document: EP Kind code of ref document: A1 |