JP2004032300A

JP2004032300A - Video compression encoding device

Info

Publication number: JP2004032300A
Application number: JP2002184801A
Authority: JP
Inventors: Hiroshi Nakayama; 中山　寛
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2002-06-25
Filing date: 2002-06-25
Publication date: 2004-01-29

Abstract

【課題】従来、構成を複雑化することなく符号化ストリームを一定の転送レートに制御することのできる動画圧縮符号化装置の提供は困難であった。
【解決手段】映像データの各フレームをブロック単位に分割し該ブロック毎に現フレームと前フレームとの予測される動き差分を検出する手段２０２と、該予測される動き差分の結果または元の画像データに対して規定された直交変換および量子化を実施して可変長符号化を行う手段２０３，　２０４と、該可変長符号化されたデータに対して規定された逆量子化、逆直交変換および動き補償処理を実施して復号画像を生成する手段２０９，　２１０，　２１１と、前記可変長符号化の処理時に、各ブロックの符号化を途中で打ち切る符号化打ち切り手段２０７１と、該符号化打ち切りが通知されることにより、次フレーム画像の符号化において動き予測を用いない符号化を実施するように制御するフレーム管理手段２１３と、を備えるように構成する。
【選択図】　　　図３Conventionally, it has been difficult to provide a moving picture compression encoding apparatus capable of controlling an encoded stream at a constant transfer rate without complicating the configuration.
A means for dividing each frame of video data into blocks and detecting a predicted motion difference between a current frame and a previous frame for each block, a result of the predicted motion difference or an original image Means 203 and 204 for performing variable-length coding by performing orthogonal transform and quantization specified for the data, and performing inverse quantization, inverse orthogonal transform, and inverse quantization specified for the variable-length coded data. Means 209, 210, 211 for performing a motion compensation process to generate a decoded image; coding terminating means 2071 for terminating the encoding of each block during the variable length encoding process; In response to the notification, there is provided a frame management unit 213 that controls so as to perform encoding without using motion prediction in encoding of the next frame image. To.
[Selection diagram] FIG.

Description

【０００１】
【発明の属する技術分野】
本発明は、動画圧縮符号化装置に関し、特に、映像信号に対してディジタル圧縮符号化を行うエンコード装置における発生符号量の制御技術に関する。
【０００２】
近年、動画像、静止画像、音声およびテキスト等の各種データを統合的に取り扱うマルチメディアシステムの研究開発が各方面で活発に行われている。ところで、映像信号は、重要な情報源の１つではあるが、その莫大なデータ量を効率良く圧縮することが必要とされ、また、国際標準化が進んでいる。映像信号の圧縮符号化国際標準として、ＩＳＯ−ＩＥＣ　１４４９６−２　にビジュアル符号化ストリームのシンタックスおよびその復号化方式が示されており、通称ＭＰＥＧ−４（ＩＳＯ（Ｉｎｔｅｒｎａｔｉｏｎａｌ　Ｓｔａｎｄａｒｄｉｚａｔｉｏｎ　Ｏｒｇａｎｉｚａｔｉｏｎ）で審議が行われているＭＰＥＧ（Ｍｏｖｉｎｇ　Ｐｉｃｔｕｒｅ　Ｅｘｐｅｒｔｓ　Ｇｒｏｕｐ）のうち、特に、マルチメディア対応の標準化技術を目指しているＰｈａｓｅ　４）と呼ばれている。
【０００３】
通常、低ビットレートで画面サイズの小さい映像を扱うアプリケーションでは、ＭＰＥＧ−４のシンプルプロファイルと呼ばれる規格が採用される。シンプルプロファイルでは、動画の構成要素である各フレーム画像に対して、Ｉ−ＶＯＰ（フレーム内符号化ＶＯＰ（Ｉｎｔｒａ　Ｖｉｄｅｏ　Ｏｂｊｅｃｔ　Ｐｌａｎｅ））と呼ばれる現フレーム画像の情報のみから圧縮符号化を行う画像符号化タイプと、Ｐ−ＶＯＰ（フレーム間順方向予測符号化ＶＯＰ（Ｐｒｅｄｉｃｔｅｄ　Ｖｉｄｅｏ　Ｏｂｊｅｃｔ　Ｐｌａｎｅ））と呼ばれる前フレーム画像を予測値として現フレーム画像との動き差分を圧縮符号化する画像符号化タイプの２種類のフレーム符号化タイプが定義されている。
【０００４】
一般的に、動画像の符号化では映像の動きをベースに符号化効率を向上させるため、１フレーム画像のＩ−ＶＯＰ符号化に対して複数フレーム画像のＰ−ＶＯＰ符号化が連続して実施される。また、このような符号化を行うエンコード装置（動画圧縮符号化装置）では、符号化された映像ストリームを伝送する通信路の帯域が限られているため、符号化ストリームを一定の転送レートになるように制御する必要がある。そこで、構成を複雑化することなく符号化ストリームを一定の転送レートに制御することのできる動画圧縮符号化装置の提供が要望されている。
【０００５】
【従来の技術】
図１は従来の動画圧縮符号化装置の一例を示すブロック図であり、ＭＰＥＧ−４の圧縮技術に用いられるエンコード装置の構成を示すものである。図１において、参照符号１０１は現画像メモリ、１０２は動き差分検出部、１０３はＤＣＴ部、１０４は量子化部、１０５はＤＣ・ＡＣ予測部、１０６はレート制御部、そして、１０７は可変長符号化部を示している。また、参照符号１０８は発生符号バッファ、１０９は逆量子化部、１１０はＩＤＣＴ部、１１１は動き差分補償部、そして、１１２は復号画像メモリを示している。
【０００６】
現画像メモリ１０１は、映像信号の１フレームをディジタル化した画像データを記憶するメモリである。ここで、ＭＰＥＧ−４において、１フレームの画像は、１６×１６画素のマクロブロックと呼ばれる処理単位に分割して扱われ、さらに、１マクロブロックは、８×８画素の輝度信号ブロック４個と８×８画素の色差信号ブロック２個、つまり６個のブロックに分割して処理される。なお、色差信号は、輝度信号に比べて、水平方向および垂直方向ともに１／２の解像度で処理される。
【０００７】
動き差分検出部１０２は、Ｐ−ＶＯＰタイプで符号化される現画像の各ブロック画素データを現画像メモリ１０１から入力し、該ブロックに対して、復号画像メモリ１１２に記憶されている前フレームの復号画像との動き差分値が最小となるような動きベクトルを検出し、その動き差分値をＤＣＴ部１０３に出力する。ここで、動きベクトルとは、現フレーム画像と直前のフレーム画像（前フレーム画像）との水平方向および垂直方向における位置の変位を示す２次元ベクトルであり、また、動き差分値とは、現画像ブロックの各画素データから、動きベクトル分だけ変位した前フレーム画像の８×８ブロックの相対する画素データを差し引いたものである。一般的に、動き差分値が最小とは、各画素の差分絶対値の総和が最小であることを意味する。
【０００８】
Ｉ−ＶＯＰタイプのフレーム画像については、動き差分検出部１０２における動き差分検出処理は行われず、現画像メモリ１０１から読み出されたブロック画素データがそのままＤＣＴ部１０３に出力される。また、ＭＰＥＧ−４の仕様では、Ｐ−ＶＯＰ内のマクロブロックに対して動き差分検出を行わない指定が可能である。これは、フレーム画像間で動きが激しく、動き予測ブロックと現画像ブロックの動き差分が大きいブロックに対して有効である。なお、動き予測を行わないマクロブロックをイントラマクロブロックと呼び、また、動き予測を行うマクロブロックをインターマクロブロックと呼ぶ。
【０００９】
図２は動き差分検出部における処理を説明するための図であり、図２（ａ）はビデオシーケンスを示し、また、図２（ｂ）はブロックマッチング処理を示している。
【００１０】
図２（ａ）に示されるように、動画圧縮符号化装置には、時間に伴って複数のフレームの画像データが入力され、前述したように、例えば、１６×１６画素のマクロブロックＭＢにより処理される。図２（ａ）において、参照符号ＣＦは現フレーム（カレントフレーム）を示し、ＲＦは直前のフレーム（参照フレーム）を示し、そして、ＳＷは直前のフレームＲＦにおけるサーチウィンドウ（探索領域）を示している。
【００１１】
図２（ｂ）に示されるように、現フレームＣＦのマクロブロックＭＢは、直前のフレームＲＦにおけるサーチウィンドウＳＷにおいて、動き差分値が最小となるような動きベクトルの検出が行われる（Ｐ−ＶＯＰタイプの符号化処理）。
【００１２】
ＤＣＴ部１０３は、８×８画素のブロック画素データに対して、離散コサイン変換（Ｄｉｓｃｒｅｔｅ　Ｃｏｓｉｎｅ　Ｔｒａｎｓｌａｔｉｏｎ）と呼ばれる周波数変換を施すものであり、この離散コサイン変換された結果は８×８、つまり、６４個のＤＣＴ係数データとなる。このうち、第１の係数をＤＣ（直流）成分と呼び、それ以外の６３個の係数をＡＣ（交流）成分と呼ぶ。
【００１３】
量子化部１０４は、入力される各ＤＣＴ係数データを量子化スケールと呼ばれる値で除算することにより、ＤＣＴ係数の精度を粗くし、量子化ＤＣＴ係数を生成する。生成された量子化ＤＣＴ係数は、ＤＣ・ＡＣ予測部１０５および逆量子化部１０９の入力となる。この量子化部１０４における処理において、算術的に量子化スケールを大きくすれば量子化ＤＣＴ係数の値は小さくなり、結果として発生符号量を抑えることができるが、復号化する時に、元のＤＣＴ係数の再生精度が低下するため、復号画像の画質は劣化することになる。
【００１４】
ＤＣ・ＡＣ予測部１０５は、イントラマクロブロック内の各ブロックにおいて、その周辺ブロックとのＤＣ・ＡＣ係数予測差分を計算して出力する。予測値として使用されるブロックは、現ブロックの空間的に左または上のブロックが候補となり、現ブロックとＤＣ係数の差分が小さいブロックが選択される。ＡＣ予測は、選択された予測ブロックと現ブロックの一部のＡＣ係数について差分計算を行う処理である。ここで、ＤＣ予測はＭＰＥＧ−４仕様で必須処理であるが、ＡＣ予測は選択的処理である。
【００１５】
ＡＣ予測処理を行うか否かは、マクロブロック内の６個のブロックについてＡＣ差分計算を行い、その結果の絶対値総和がＡＣ予測を行わない場合に比べて小さければ、ＡＣ予測結果を使用するという判定手法が推奨されている。また、ＤＣ・ＡＣ予測部１０５から出力されるブロック内係数データの並びは、
（１）ＡＣ予測処理を行わない。
【００１６】
（２）ＡＣ予測処理を行い、左側のブロックを予測データとして使用する。
【００１７】
（３）ＡＣ予測処理を行い、上側のブロックを予測データとして使用する。
【００１８】
という３つの場合によって異なる。
【００１９】
このように、ＤＣ・ＡＣ予測部１０５では、マクロブロック入力データの統計量に依存して出力データの特性が決定されるため、量子化部１０４からのマクロブロック入力データを内部にバファリングする必要がある。
【００２０】
可変長符号化部１０７は、ＤＣ・ＡＣ予測部１０５から入力される量子化ＤＣＴ係数をランレングス符号化し、各０ラン列（『０』が連続する列）に可変長符号を割り当てて、符号化ストリームを出力する。発生符号バッファ１０８は、可変長符号化部１０７の符号化結果を一時的に蓄積するバッファで、符号化ストリームの生成性能と外部への出力性能の性能差を吸収する。
【００２１】
レート制御部１０６は、可変長符号化部１０７によって発生した符号量に基づき、発生符号量が目標とする符号量より大きくなっている場合は、量子化部１０４に対してより大きな量子化スケール値で量子化するように指示し、逆に、発生符号量が目標符号量より小さくなっている場合は、量子化部１０４に対してより小さな量子化スケール値で量子化するように指示する。
【００２２】
逆量子化部１０９、ＩＤＣＴ部１１０および動き差分補償部１１１は、次フレームの動き差分検出処理用の復号画像を生成して復号画像メモリ１１２に蓄積するために、一連の処理の逆変換を行う処理系である。
【００２３】
逆量子化部１０９は、量子化部１０４から出力される量子化ＤＣＴ係数に量子化スケール値を乗じて、誤差を含んだＤＣＴ係数を復元する。ＩＤＣＴ部１１０は、ＤＣＴ係数に逆離散コサイン変換（Ｉｎｖｅｒｓｅ　Ｄｉｓｃｒｅｔｅ　Ｃｏｓｉｎｅ　Ｔｒａｎｓｌａｔｉｏｎ）を施して、８×８画素の動き差分画素データを復元する。動き差分予測部１１１は、動きベクトルに基づいて、復号画像メモリ１１２から前フレームの８×８画素データを読み出し、入力される動き差分画素データを足し込み、現フレームの復号画像を再生し、そして、復号画像メモリ１１２に記憶する。復号画像メモリ１１２は、再生された復号画像を記憶するメモリであり、前フレームと現フレームの２フレーム分の復号画像データをフレーム単位で切り替えながら記憶する。
【００２４】
【発明が解決しようとする課題】
上述した図１の構成を有する従来の動画圧縮符号化装置において、レート制御部１０６は、発生した符号量の結果に基づくフィードバック制御を行うため、画像によってはフィードバックの効果が現れる前に多大な符号量を生成し、発生符号バッファ１０８がオーバーフローしてしまうという問題がある。
【００２５】
このような問題を解決するものとして、従来、例えば、特開平９−２６１６４２号公報が知られている。この特開平９−２６１６４２号公報においては、発生符号量が所定値を超えた時点で、そのブロックのＤＣＴ部の出力を停止する機能を持つエンコード装置（動画圧縮符号化装置）が提案されている。
【００２６】
しかしながら、ＭＰＥＧ−４の仕様においては、上述したように、可変長符号化の前にＤＣ・ＡＣ予測部１０５において量子化ＤＣＴ係数を一度バッファリングする必要がある。つまり、１マクロブロックに対する可変長符号化処理は、そのマクロブロック内のＤＣＴ処理が完全に終了するのを待たなければならず、可変長符号化部の発生符号量によってＤＣＴ係数の処理を途中で打ち切ることは不可能である。さらに、発生符号量による打ち切り処理をＤＣ・ＡＣ予測部１０５の出力に対して行ったとしても、エンコーダ装置内部で復号画像を生成するためには、ＤＣ・ＡＣ予測処理後に打ち切り処理を行った量子化ＤＣＴ係数データを元の順序に戻して、逆量子化部１０９に入力しなければならない。これは、量子化部１０４から逆量子化部１０９に直接量子化ＤＣＴ係数データを出力する場合に比べて、時間遅延を著しく大きくするだけでなく、内部回路を複雑化させることにもなる。
【００２７】
本発明は、上述した従来の動画圧縮符号化装置が有する課題に鑑み、構成を複雑化することなく符号化ストリームを一定の転送レートに制御することのできる動画圧縮符号化装置の提供を目的とする。
【００２８】
【課題を解決するための手段】
本発明の第１の形態によれば、映像データの各フレームをブロック単位に分割し、該ブロック毎に現フレームと前フレームとの予測される動き差分を検出する動き差分検出手段と、該予測される動き差分の結果または元の画像データに対して、規定された直交変換および量子化を実施して可変長符号化を行う可変長符号化手段と、該直交変換および量子化が実施されたデータに対して、規定された逆量子化、逆直交変換および動き補償処理を実施して復号画像を生成する復号画像生成手段と、前記可変長符号化の処理時に、各ブロックの符号化を途中で打ち切る符号化打ち切り手段と、該符号化打ち切り手段からの符号化打ち切りが通知されることにより、次フレーム画像の符号化において、動き予測を用いない符号化を実施するように制御するフレーム管理手段と、を備えることを特徴とする動画像圧縮符号化装置が提供される。
【００２９】
本発明の第１の形態に係る動画圧縮符号化装置によれば、符号化打ち切り手段は、可変長符号化の処理時に、各ブロックの符号化を途中で打ち切り、そして、フレーム管理手段は、符号化打ち切り手段からの符号化打ち切りが通知されることにより、次フレーム画像の符号化において、動き予測を用いない符号化を実施するように制御する。
【００３０】
本発明の第２の形態によれば、映像データの各フレームをブロック単位に分割し、該ブロック毎に現フレームと前フレームとの予測される動き差分を検出する動き差分検出手段と、該予測される動き差分の結果または元の画像データに対して、規定された直交変換および量子化を実施して可変長符号化を行う可変長符号化手段と、該直交変換および量子化が実施されたデータに対して、規定された逆量子化、逆直交変換および動き補償処理を実施して復号画像を生成する復号画像生成手段と、前記可変長符号化の処理時に、各ブロックの符号化を途中で打ち切る符号化打ち切り手段と、該符号化打ち切り手段により符号化打ち切りが発生したブロックの位置を記憶するブロック位置記憶手段と、を備え、前記符号化打ち切りが発生した次フレーム画像のブロックの動き予測において、該符号化打ち切りが発生したブロックを動き予測ブロックの候補としないことを特徴とする動画像圧縮符号化装置が提供される。
【００３１】
本発明の第２の形態に係る動画圧縮符号化装置によれば、符号化打ち切り手段は、可変長符号化の処理時に、各ブロックの符号化を途中で打ち切り、また、ブロック位置記憶手段は、符号化打ち切り手段により符号化打ち切りが発生したブロックの位置を記憶し、そして、符号化打ち切りが発生した次フレーム画像のブロックの動き予測において、該符号化打ち切りが発生したブロックを動き予測ブロックの候補としないようになっている。
【００３２】
これにより、構成を複雑化することなく符号化ストリームを一定の転送レートに制御することのできる動画圧縮符号化装置を提供することができる。なお、本発明に係る動画圧縮符号化装置は、ＭＰＥＧ−４の適用に限定されるものではなく、ＭＰＥＧ−１およびＭＰＥＧ−２、さらには、他の方式の動画圧縮符号化装置としても適用することができる。
【００３３】
【発明の実施の形態】
以下、本発明に係る動画圧縮符号化装置の実施例を、添付図面を参照して詳述する。
【００３４】
図３は本発明に係る動画圧縮符号化装置の第１実施例を示すブロック図であり、ＭＰＥＧ−４の圧縮技術に用いられるエンコード装置の構成を示すものである。図３において、参照符号２０１は現画像メモリ、２０２は動き差分検出部、２０３はＤＣＴ部、２０４は量子化部、２０５はＤＣ・ＡＣ予測部、２０６はレート制御部、２０７は可変長符号化部、そして、２０７１は符号化打ち切り手段を示している。また、参照符号２０８は発生符号バッファ、２０９は逆量子化部、２１０はＩＤＣＴ部、２１１は動き差分補償部、２１２は復号画像メモリ、そして、２１３はフレーム管理部を示している。
【００３５】
図３と前述した図１との比較から明らかなように、本第１実施例の動画圧縮符号化装置（エンコード装置）では、可変長符号化部２０７内に符号化打ち切り手段２０７１が設けられている。符号化打ち切り手段２０７１は、ブロック内の量子化係数データを可変長符号化する時、発生符号量がある値を超えた場合はブロック内の残りの量子化ＤＣＴ係数を無視し、符号化形式を遵守し、そして、最小の発生符号量となる形式で残りのブロックデータを充填して出力する。さらに、可変長符号化部２０７は、ブロック内で符号化を打ち切りが発生したというイベントをフレーム管理部２１３に通知する。フレーム管理部２１３は、符号化打ち切りのイベントを受けると次フレームの符号化において、動き予測を行わない画像タイプ、つまりＭＰＥＧ−４の場合はＩ−ＶＯＰ、として圧縮符号化するようにエンコード処理を制御する。
【００３６】
符号化打ち切り手段２０７１は、ブロックの発生符号量がある値を超えたとき、ブロック内の残りの量子化ＤＣＴ係数をすべてゼロとみなして符号化する。これは、ブロック内の非ゼロである量子化ＤＣＴ係数を符号化する毎に発生符合量が閾値を超えるか否かをチェックし、閾値を超えた場合は、その非ゼロ係数に対して、ブロックの最後であることを示すフラグを付加して符号化出力すればよい。ここで、符号化打ち切り手段２０７１は、符号化打ち切りによる強制ゼロ係数発生を実施した場合、そのイベントをフレーム管理部２１３およびレート制御部２０６に通知する。
【００３７】
フレーム管理部２１３では、現在の（カレントな）処理フレームにおいて、１ブロックでも強制ゼロ係数が発生した場合、次のフレームをＩ−ＶＯＰとして扱うようにエンコード処理を制御する。また、レート制御部２０６では、ＶＯＰタイプによって割り当て符号量の目標値を調整する必要があるので、符号化打ち切りによるゼロ係数発生を受けて、次フレームに対してＩ−ＶＯＰに応じた目標符号量を割り当てる。
【００３８】
図４は図３の動画圧縮符号化装置におけるレート制御部の処理を説明するための図であり、図４（ａ）は通常の（符号化打ち切りが発生していない）処理を行っている様子を示し、図４（ｂ）は符号化打ち切りが発生してそのままの処理を継続している様子を示し、そして、図４（ｃ）は符号化打ち切りが発生して本第１実施例の処理を行っている様子を示している。
【００３９】
まず、符号化打ち切りが発生していない通常の場合、図４（ａ）に示されるように、レート制御部２０６は、１つのＩ−ＶＯＰと次のＩ−ＶＯＰまでの複数のＰ−ＶＯＰとを１つのグループ単位（レート制御グループ）として扱い、各グループがほぼ一定の符号量になるように各ＶＯＰの発生符合量を制御する。ただし、１グループ内では、Ｐ−ＶＯＰに比べてＩ−ＶＯＰに多めの符号量を割り当てるのが一般的である。また、計算を単純化するため、２つのＩ−ＶＯＰ間のＰ−ＶＯＰ数はエンコード前に予め一定の枚数に決定しておく。図４においては、１グループがＩ−ＶＯＰ１枚とＰ−ＶＯＰ５枚で構成される。
【００４０】
ところで、図４（ｂ）に示されるように、ブロックの発生符号量がある値を超えて符号化打ち切りが発生してそのままの処理を継続した場合、グループの途中で強制的にＩ−ＶＯＰを生成し、１グループ内に比較的符号量が多いＩ−ＶＯＰが２枚以上存在することになる。これは、もともとグループ内のＰ−ＶＯＰ数が多い場合は問題ないが、グループ内のＰ−ＶＯＰ数が少ない場合は、強制的に発生したＩ−ＶＯＰにより、残りのＰ−ＶＯＰに割り当てる符号量が少なくなって画質が劣化する可能性がある。（図４（ｂ）のＩＰ参照）
そこで、本第１実施例のレート制御部２０６は、図４（ｃ）に示されるように、符号化打ち切りブロックを含むフレームの次のフレームが、レート制御グループの先頭のＩ−ＶＯＰになるようにグループビット量を割り当て直す。これにより、強制的なＩ−ＶＯＰの挿入によって発生したビット量の増加をグループ単位の中で緩やかに吸収することが可能になり、画質の劣化を防ぐことができる。
【００４１】
具体的な計算法を図４（ｃ）を参照して説明する。
【００４２】
まず、目標とする転送レートによって与えられた１フレームあたりの平均目標ビット量をＲｐｖとし、予め定義されているグループ内のフレーム数をＮｇｒｐとすると、グループに割り当てられる目標符号量は、　Ｒｐｖ×Ｎｇｒｐ　となる。
【００４３】
グループ内で符号化打ち切りが発生した場合、グループの先頭から符号化打ち切り発生フレームまでの発生符号量をＢａ、生成したフレーム数をＮａ（図４（ｃ）の場合は３）とすると、Ｎａ枚までに平均的に割り当てられるべき符号量は、　Ｒｐｖ×Ｎａなので、Ｂａとの差分Ｄａ（すなわち、Ｄａ＝Ｂａ−Ｒｐｖ×Ｎａ）は、転送レートに対して余分に発生した符合量を意味する。そこで、次のレート制御のグループに割り与える目標符号量から余分に発生した符合量を減じる。つまり、　Ｒｐｖ×Ｎｇｒｐ−Ｄａ　を新たな目標符号量として次のレートグループ単位に割り当てる。これにより、例えば、図４（ｂ）の参照符号ＩＰで示されるような割り当てる符号量が少なくないＰ−ＶＯＰの発生をなくして画質の劣化を避けることが可能になる。
【００４４】
図５は本発明に係る動画圧縮符号化装置の第２実施例を示すブロック図である。図５において、参照符号３０１は現画像メモリ、３０２は動き差分検出部、３０３はＤＣＴ部、３０４は量子化部、３０５はＤＣ・ＡＣ予測部、３０６はレート制御部、３０７は可変長符号化部、そして、３０７１は符号化打ち切り手段を示している。また、参照符号３０８は発生符号バッファ、３０９は逆量子化部、３１０はＩＤＣＴ部、３１１は動き差分補償部、３１２は復号画像メモリ、そして、３１４はブロック位置記憶部を示している。
【００４５】
図５と図３との比較から明らかなように、本第２実施例の動画圧縮符号化装置は、図３に示す第１実施例におけるフレーム管理部２１３の代わりにブロック位置記憶部３１４を設けるようになっている。
【００４６】
ブロック位置記憶部３１４は、符号化打ち切りを発生したブロックの位置を記憶するもので、記憶すべき情報量は、各ブロック位置に対して打ち切りが発生したか否かの１ビットの情報でよく、小規模な回路で実現することができる。すなわち、符号化打ち切りを発生したフレームの次フレームがＰ−ＶＯＰの場合、強制的にＩ−ＶＯＰ化する（図４（ｂ）参照）ことなく、動き予測検出処理において、動き予測検出部３０２がブロック位置記憶部３１４のブロック打ち切り情報を参照して、後述する図６に示ような、動き予測ブロック候補の一部に符号化打ち切りブロックを含む場合は、その動き予測ブロック候補を動き予測候補から外すように制御する。
【００４７】
図６は図５に示す動画圧縮符号化装置の特徴的な動作を説明するための図である。図６では、任意の８×８画素の動き予測ブロック候補に対して、３×４画素（黒丸で示す画素）が前フレームにおいて符号化打ち切りされたことを意味する。
【００４８】
図６に示されるように、符号化打ち切りを発生したブロックの位置は、ブロック位置記憶部３１４に記憶され、その記憶された位置情報に従って、動き予測検出部３０２は、動き予測ブロック候補の一部に符号化打ち切りブロックを含む場合は、その動き予測ブロック候補を動き予測候補から外すように制御する。
【００４９】
図７は本発明に係る第３実施例としての動画圧縮符号化装置の特徴的な動作を説明するための図である。図７では、本来の探索領域ＢＬ０は横方向が４ブロックで縦方向が３ブロックであるが、その矩形領域ＢＬ０を、横方向が３ブロックで縦方向が１ブロックの第１の矩形領域ＢＬ１、横方向が４ブロックで縦方向が１ブロックの第２の矩形領域ＢＬ２および横方向が３ブロックで縦方向が１ブロックの第３の矩形領域ＢＬ３の３つの矩形領域に分割したことを示している。
【００５０】
動き予測ブロックを検出する場合、探索する予測ブロックの範囲は予め限定され、例えば、カレントなブロック位置を囲む矩形領域内で探索が行われる。ここで、動き予測検出処理は、２つの矩形領域を含むような（例えば、第１の矩形領域ＢＬ１と第２の矩形領域ＢＬ２に跨がるようなもの（Ｒ０）に対しても行われる。
【００５１】
本第３実施例では、図７に示されるように、動き予測検出処理において、動き予測検出部３０２が動き予測を検出する前にブロック位置記憶部３０４を参照して、本来の探索すべき矩形領域を符号化打ち切りブロックを含まない複数の矩形領域に分割する。これにより、動き検出処理は、各矩形領域に対して探索処理を行う必要があるが、動き予測ブロック候補単位に符号化打ち切り発生ブロックが含まれるか否かを検出する必要がなく、動き予測検出処理を単純化することが可能になる。
【００５２】
このように、本発明の各実施例に係る動画圧縮符号化装置によれば、符号化打ち切りが発生した画像の次のフレームにおいて、符号化打ち切りが発生した画像を動き予測差分検出のため予測画像として使う必要がない。従って、符号化打ち切り発生ブロックを含むフレームにおいて、エンコーダ内部で復号化する画像と、受信装置において符号化ストリームを復号化する画像が異なっても、次フレームはＩ−ＶＯＰとしてともに同一の画像が再生される。その結果、量子化部２０５（３０５）から逆量子化部２０９（３０９）へのデータの流れを遮ることなく、発生符号量のオーバーフローを制御することができる。
【００５３】
【発明の効果】
以上、詳述したように、本発明によれば、構成を複雑化することなく符号化ストリームを一定の転送レートに制御することのできる動画圧縮符号化装置を提供することができる。
【図面の簡単な説明】
【図１】従来の動画圧縮符号化装置の一例を示すブロック図である。
【図２】動き差分検出部における処理を説明するための図である。
【図３】本発明に係る動画圧縮符号化装置の第１実施例を示すブロック図である。
【図４】図３の動画圧縮符号化装置におけるレート制御部の処理を説明するための図である。
【図５】本発明に係る動画圧縮符号化装置の第２実施例を示すブロック図である。
【図６】図５に示す動画圧縮符号化装置の特徴的な動作を説明するための図である。
【図７】本発明に係る第３実施例としての動画圧縮符号化装置の特徴的な動作を説明するための図である。
【符号の説明】
１０１，２０１，３０１…現画像メモリ
１０２，２０２，３０２…動き差分検出部
１０３，２０３，３０３…ＤＣＴ部
１０４，２０４，３０４…量子化部
１０５，２０５，３０５…ＤＣ・ＡＣ予測部
１０６，２０６，３０６…レート制御部
１０７，２０７，３０７…可変長符号化部
１０８，２０８，３０８…発生符号バッファ
１０９，２０９，３０９…逆量子化部
１１０，２１０，３１０…ＩＤＣＴ部
１１１，２１１，３１１…動き差分補償部
１１２，２１２，３１２…復号画像メモリを示している。
２１３…フレーム管理部
３１４…ブロック位置記憶部
２０７１，３０７１…符号化打ち切り手段[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a moving image compression encoding apparatus, and more particularly, to a technique for controlling a generated code amount in an encoding apparatus that performs digital compression encoding on a video signal.
[0002]
2. Description of the Related Art In recent years, research and development of multimedia systems that integrally handle various types of data such as moving images, still images, sounds, and texts have been actively conducted in various fields. By the way, a video signal is one of important information sources, but it is necessary to efficiently compress a huge amount of data, and international standardization is progressing. ISO-IEC 14496-2 describes the syntax of a visual coded stream and its decoding method as an international standard for compression coding of video signals, and is deliberated by MPEG-4 (International Standardization Organization (ISO)). Of the MPEG (Moving Picture Experts Group), it is called Phase 4), which aims at a standardization technology for multimedia.
[0003]
Usually, in an application that handles a video having a small screen size at a low bit rate, a standard called a simple profile of MPEG-4 is adopted. In the simple profile, image coding in which each frame image that is a component of a moving image is compression-coded only from information of a current frame image called an I-VOP (Intra-Video Object Plane). And a picture coding type for compressing and coding a motion difference from the current frame picture using a previous frame picture called a P-VOP (inter-frame forward predictive coding VOP (Predicted Video Object Plane)) as a prediction value. Are defined.
[0004]
Generally, in moving picture coding, P-VOP coding of a plurality of frame images is continuously performed with respect to I-VOP coding of one frame image in order to improve coding efficiency based on video motion. Is done. Further, in an encoding device (moving image compression encoding device) that performs such encoding, the bandwidth of a communication path for transmitting an encoded video stream is limited, and thus the encoded stream has a constant transfer rate. Need to be controlled. Therefore, there is a demand for a moving image compression encoding apparatus capable of controlling an encoded stream at a constant transfer rate without complicating the configuration.
[0005]
[Prior art]
FIG. 1 is a block diagram showing an example of a conventional moving image compression encoding apparatus, and shows a configuration of an encoding apparatus used for MPEG-4 compression technology. In FIG. 1, reference numeral 101 denotes a current image memory, 102 denotes a motion difference detection unit, 103 denotes a DCT unit, 104 denotes a quantization unit, 105 denotes a DC / AC prediction unit, 106 denotes a rate control unit, and 107 denotes a variable length. 2 shows an encoding unit. Reference numeral 108 denotes a generated code buffer, 109 denotes an inverse quantization unit, 110 denotes an IDCT unit, 111 denotes a motion difference compensating unit, and 112 denotes a decoded image memory.
[0006]
The current image memory 101 is a memory for storing image data obtained by digitizing one frame of a video signal. Here, in MPEG-4, one frame image is handled by being divided into processing units called macroblocks of 16 × 16 pixels, and one macroblock is composed of four luminance signal blocks of 8 × 8 pixels. The color difference signal block of 8 × 8 pixels is divided into two blocks, that is, six blocks, and processed. Note that the color difference signal is processed at half the resolution in the horizontal and vertical directions as compared to the luminance signal.
[0007]
The motion difference detection unit 102 receives, from the current image memory 101, each block pixel data of the current image coded in the P-VOP type, and inputs the block pixel data of the previous frame stored in the decoded image memory 112. A motion vector that minimizes the motion difference value from the decoded image is detected, and the motion difference value is output to DCT section 103. Here, the motion vector is a two-dimensional vector indicating the displacement of the position between the current frame image and the immediately preceding frame image (previous frame image) in the horizontal and vertical directions, and the motion difference value is the current image. This is obtained by subtracting the corresponding pixel data of the 8 × 8 block of the previous frame image displaced by the motion vector from each pixel data of the block. Generally, the minimum motion difference value means that the sum of the absolute difference values of the pixels is minimum.
[0008]
For the I-VOP type frame image, the motion difference detection processing in the motion difference detection unit 102 is not performed, and the block pixel data read from the current image memory 101 is output to the DCT unit 103 as it is. Further, according to the MPEG-4 specification, it is possible to specify not to perform the motion difference detection on the macroblock in the P-VOP. This is effective for a block in which the motion between frame images is severe and the motion difference between the motion prediction block and the current image block is large. Note that a macroblock on which motion prediction is not performed is called an intra macroblock, and a macroblock on which motion prediction is performed is called an inter macroblock.
[0009]
FIG. 2 is a diagram for explaining processing in the motion difference detection unit. FIG. 2A shows a video sequence, and FIG. 2B shows block matching processing.
[0010]
As shown in FIG. 2A, image data of a plurality of frames is input to the moving image compression encoding apparatus with time, and is processed by a macroblock MB of 16 × 16 pixels as described above, for example. Is done. In FIG. 2A, reference numeral CF indicates a current frame (current frame), RF indicates an immediately preceding frame (reference frame), and SW indicates a search window (search area) in the immediately preceding frame RF. I have.
[0011]
As shown in FIG. 2B, in the macroblock MB of the current frame CF, in the search window SW in the immediately preceding frame RF, a motion vector is detected such that the motion difference value becomes minimum (P-VOP). Type encoding process).
[0012]
The DCT unit 103 performs a frequency transform called discrete cosine transform (discrete cosine transform) on the block pixel data of 8 × 8 pixels, and the result of the discrete cosine transform is 8 × 8, that is, 64. DCT coefficient data. Of these, the first coefficient is called a DC (direct current) component, and the other 63 coefficients are called AC (alternating current) components.
[0013]
The quantization unit 104 divides each input DCT coefficient data by a value called a quantization scale, thereby reducing the accuracy of the DCT coefficient and generating a quantized DCT coefficient. The generated quantized DCT coefficients are input to the DC / AC prediction unit 105 and the inverse quantization unit 109. In the processing in the quantization unit 104, if the quantization scale is arithmetically increased, the value of the quantized DCT coefficient decreases, and as a result, the generated code amount can be suppressed. Since the reproduction accuracy of the decoded image is reduced, the image quality of the decoded image is deteriorated.
[0014]
The DC / AC prediction unit 105 calculates and outputs a DC / AC coefficient prediction difference between each block in the intra macroblock and its neighboring blocks. As a block used as a prediction value, a block spatially left or above the current block is a candidate, and a block having a small difference between the current block and the DC coefficient is selected. The AC prediction is a process of calculating a difference between a selected prediction block and some AC coefficients of the current block. Here, DC prediction is an essential process in the MPEG-4 specification, whereas AC prediction is a selective process.
[0015]
Whether or not to perform the AC prediction processing is determined by performing the AC difference calculation on the six blocks in the macroblock and using the AC prediction result if the sum of the absolute values of the results is smaller than the case where the AC prediction is not performed. Is recommended. The arrangement of the coefficient data in the block output from the DC / AC prediction unit 105 is as follows.
(1) No AC prediction processing is performed.
[0016]
(2) Perform AC prediction processing and use the left block as prediction data.
[0017]
(3) Perform AC prediction processing and use the upper block as prediction data.
[0018]
It depends on the three cases.
[0019]
As described above, in the DC / AC prediction unit 105, the characteristics of the output data are determined depending on the statistics of the macroblock input data. Therefore, it is necessary to buffer the macroblock input data from the quantization unit 104 internally. There is.
[0020]
The variable-length coding unit 107 performs run-length coding on the quantized DCT coefficient input from the DC / AC prediction unit 105, assigns a variable-length code to each 0-run sequence (a sequence in which “0” is continuous), and performs coding. Output the stream. The generation code buffer 108 is a buffer for temporarily storing the coding result of the variable length coding unit 107, and absorbs a difference in performance between the generation performance of the coded stream and the output performance to the outside.
[0021]
If the generated code amount is larger than the target code amount based on the code amount generated by the variable length coding unit 107, the rate control unit 106 gives the quantization unit 104 a larger quantization scale value. In the case where the generated code amount is smaller than the target code amount, the quantization unit 104 is instructed to perform quantization with a smaller quantization scale value.
[0022]
The inverse quantization unit 109, the IDCT unit 110, and the motion difference compensating unit 111 perform a series of inverse transforms in order to generate a decoded image for the motion difference detection process of the next frame and store the decoded image in the decoded image memory 112. It is a processing system.
[0023]
The inverse quantization unit 109 multiplies the quantized DCT coefficient output from the quantization unit 104 by the quantization scale value to restore the DCT coefficient including the error. The IDCT unit 110 performs inverse discrete cosine transformation (DCT) on the DCT coefficients to restore motion difference pixel data of 8 × 8 pixels. The motion difference prediction unit 111 reads out the 8 × 8 pixel data of the previous frame from the decoded image memory 112 based on the motion vector, adds the input motion difference pixel data, reproduces the decoded image of the current frame, and , Stored in the decoded image memory 112. The decoded image memory 112 is a memory for storing the reproduced decoded image, and stores the decoded image data of two frames of the previous frame and the current frame while switching in units of frames.
[0024]
[Problems to be solved by the invention]
In the conventional moving image compression encoding apparatus having the configuration shown in FIG. 1, the rate control unit 106 performs feedback control based on the result of the generated code amount. There is a problem that the amount is generated and the generated code buffer 108 overflows.
[0025]
As a solution to such a problem, for example, Japanese Patent Application Laid-Open No. 9-261624 is known. Japanese Patent Application Laid-Open No. 9-261652 proposes an encoding device (moving image compression encoding device) having a function of stopping the output of the DCT section of a block when the generated code amount exceeds a predetermined value. .
[0026]
However, in the MPEG-4 specification, as described above, it is necessary to buffer the quantized DCT coefficients once in the DC / AC prediction unit 105 before performing variable-length coding. That is, in the variable length coding process for one macroblock, it is necessary to wait for the DCT process in the macroblock to be completely completed. It is impossible to abort. Further, even if the truncation process based on the generated code amount is performed on the output of the DC / AC prediction unit 105, in order to generate a decoded image inside the encoder device, the quantum that is subjected to the truncation process after the DC / AC prediction process is generated. The transformed DCT coefficient data must be returned to the original order and input to the inverse quantization unit 109. This not only significantly increases the time delay but also complicates the internal circuit as compared with the case where the quantized DCT coefficient data is directly output from the quantizer 104 to the inverse quantizer 109.
[0027]
SUMMARY OF THE INVENTION The present invention has been made in view of the above-described problems of the conventional moving image compression encoding apparatus, and has as its object to provide a moving image compression encoding apparatus capable of controlling an encoded stream to a constant transfer rate without complicating the configuration. I do.
[0028]
[Means for Solving the Problems]
According to the first aspect of the present invention, a motion difference detecting means for dividing each frame of video data into blocks and detecting a predicted motion difference between a current frame and a previous frame for each block; A variable-length encoding unit that performs a specified orthogonal transformation and quantization on the result of the motion difference or the original image data to perform variable-length encoding, and the orthogonal transformation and quantization are performed. Decoding image generating means for performing a specified inverse quantization, inverse orthogonal transform, and motion compensation processing on the data to generate a decoded image, and performing encoding of each block during the variable length encoding processing. Control to perform coding without using motion prediction in coding of the next frame image by being notified of coding cut-off means to be cut off by the coding stop means. Moving picture compression encoding apparatus is provided, characterized in that it comprises a frame management unit that, the.
[0029]
According to the moving image compression encoding apparatus according to the first embodiment of the present invention, the encoding discontinuation unit discontinues encoding of each block in the middle of the variable length encoding process, and the frame management unit In response to the notification of the coding cut-off from the coding cut-off means, control is performed so as to perform coding without using motion prediction in coding the next frame image.
[0030]
According to the second aspect of the present invention, a motion difference detecting means for dividing each frame of video data into blocks and detecting a predicted motion difference between a current frame and a previous frame for each block; A variable-length encoding unit that performs a specified orthogonal transformation and quantization on the result of the motion difference or the original image data to perform variable-length encoding, and the orthogonal transformation and quantization are performed. Decoding image generating means for performing a specified inverse quantization, inverse orthogonal transform, and motion compensation processing on the data to generate a decoded image, and performing encoding of each block during the variable length encoding processing. And a block position storing means for storing a position of a block in which encoding has been aborted by the encoding aborting means. In the motion prediction block in beam image, moving picture compression encoding apparatus, characterized in that no blocks the encoded truncation occurs as candidates for the motion prediction block is provided.
[0031]
According to the moving image compression encoding apparatus according to the second embodiment of the present invention, the encoding discontinuation unit discontinues the encoding of each block in the middle of the variable length encoding process, and the block position storage unit The position of the block in which the coding abortion has occurred is stored by the coding aborting means, and in the motion prediction of the block of the next frame image in which the coding abortion has occurred, the block in which the coding abortion has occurred is a candidate for a motion prediction block. And not to.
[0032]
Accordingly, it is possible to provide a moving image compression encoding device capable of controlling an encoded stream to a constant transfer rate without complicating the configuration. It should be noted that the moving picture compression / encoding device according to the present invention is not limited to the application of MPEG-4, but is also applicable to MPEG-1 and MPEG-2, and also to other types of moving picture compression / encoding device. be able to.
[0033]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, embodiments of the moving image compression encoding apparatus according to the present invention will be described in detail with reference to the accompanying drawings.
[0034]
FIG. 3 is a block diagram showing a first embodiment of the moving picture compression encoding apparatus according to the present invention, and shows the configuration of an encoding apparatus used for MPEG-4 compression technology. 3, reference numeral 201 denotes a current image memory, 202 denotes a motion difference detection unit, 203 denotes a DCT unit, 204 denotes a quantization unit, 205 denotes a DC / AC prediction unit, 206 denotes a rate control unit, and 207 denotes variable length coding. Reference numeral 2071 denotes a coding termination unit. Reference numeral 208 denotes a generated code buffer, 209 denotes an inverse quantization unit, 210 denotes an IDCT unit, 211 denotes a motion difference compensation unit, 212 denotes a decoded image memory, and 213 denotes a frame management unit.
[0035]
As is clear from the comparison between FIG. 3 and FIG. 1 described above, in the moving image compression encoding apparatus (encoding apparatus) of the first embodiment, the encoding cutoff means 2071 is provided in the variable length encoding unit 207. I have. When the quantized coefficient data in the block is subjected to variable-length encoding, if the amount of generated code exceeds a certain value, the coding terminating unit 2071 ignores the remaining quantized DCT coefficients in the block and changes the coding format. Then, the remaining block data is filled and output in a format that minimizes the amount of generated codes. Further, the variable-length coding unit 207 notifies the frame management unit 213 of an event that coding has been discontinued in the block. The frame management unit 213 performs an encoding process so as to perform compression encoding as an image type for which motion prediction is not performed, that is, an I-VOP in the case of MPEG-4, in the encoding of the next frame upon receiving the event of the encoding termination. Control.
[0036]
When the generated code amount of the block exceeds a certain value, the coding termination unit 2071 performs coding by regarding all the remaining quantized DCT coefficients in the block as zero. This means that every time a non-zero quantized DCT coefficient in a block is encoded, it is checked whether or not the amount of generated code exceeds a threshold. May be coded and output with a flag indicating that it is the last. Here, when the forced zero coefficient generation by the coding termination is performed, the coding termination unit 2071 notifies the frame management unit 213 and the rate control unit 206 of the event.
[0037]
The frame management unit 213 controls the encoding process so as to treat the next frame as an I-VOP when a forced zero coefficient occurs even in one block in the current (current) processing frame. In addition, since the rate control unit 206 needs to adjust the target value of the allocated code amount according to the VOP type, the rate control unit 206 receives the zero coefficient generation due to the coding termination, and receives the target code amount corresponding to the I-VOP for the next frame. Assign.
[0038]
FIG. 4 is a diagram for explaining the processing of the rate control unit in the moving picture compression encoding apparatus of FIG. 3, and FIG. 4 (a) shows a state in which normal processing (no encoding termination occurs) is performed. FIG. 4B shows a state in which the coding is discontinued and the processing is continued as it is, and FIG. 4C shows a state in which the coding is discontinued and the processing of the first embodiment is performed. Is shown.
[0039]
First, in a normal case in which coding termination does not occur, as shown in FIG. 4A, the rate control unit 206 transmits one I-VOP and a plurality of P-VOPs up to the next I-VOP. Are handled as one group unit (rate control group), and the generated code amount of each VOP is controlled so that each group has a substantially constant code amount. However, within one group, it is common to assign a larger amount of code to the I-VOP than to the P-VOP. Further, in order to simplify the calculation, the number of P-VOPs between two I-VOPs is determined to be a fixed number before encoding. In FIG. 4, one group includes one I-VOP and five P-VOPs.
[0040]
By the way, as shown in FIG. 4 (b), when the code generation of the block exceeds a certain value and the coding is discontinued and the processing is continued as it is, the I-VOP is forcibly stopped in the middle of the group. Thus, one or more I-VOPs having a relatively large code amount exist in one group. This is not a problem if the number of P-VOPs in the group is large, but if the number of P-VOPs in the group is small, the code amount to be allocated to the remaining P-VOPs due to the forcibly generated I-VOPs And the image quality may be degraded. (See IP in FIG. 4B)
Therefore, as shown in FIG. 4C, the rate control unit 206 according to the first embodiment sets the next frame after the frame including the coding cutoff block to be the first I-VOP of the rate control group. Is reassigned to the group bit amount. As a result, the increase in the bit amount caused by the forced insertion of the I-VOP can be gradually absorbed in the group unit, and the deterioration of the image quality can be prevented.
[0041]
A specific calculation method will be described with reference to FIG.
[0042]
First, assuming that the average target bit amount per frame given by the target transfer rate is Rpv, and the number of frames in a group defined in advance is Ngrp, the target code amount allocated to the group is Rpv × Ngrp It becomes.
[0043]
When the coding is discontinued in the group, if the generated code amount from the head of the group to the coding-discontinued frame is Ba, and the number of generated frames is Na (3 in FIG. 4C), Na sheets are generated. Since the code amount to be allocated on average is Rpv × Na, the difference Da from Ba (that is, Da = Ba−Rpv × Na) means the code amount generated extra for the transfer rate. Therefore, the extra code amount is subtracted from the target code amount assigned to the next rate control group. That is, Rpv × Ngrp-Da is assigned to the next rate group unit as a new target code amount. Accordingly, for example, it is possible to avoid the occurrence of a P-VOP in which the code amount to be allocated is small as shown by the reference code IP in FIG.
[0044]
FIG. 5 is a block diagram showing a second embodiment of the moving picture compression encoding apparatus according to the present invention. 5, reference numeral 301 denotes a current image memory, 302 denotes a motion difference detection unit, 303 denotes a DCT unit, 304 denotes a quantization unit, 305 denotes a DC / AC prediction unit, 306 denotes a rate control unit, and 307 denotes variable-length coding. Reference numeral 3071 denotes a coding terminating means. Reference numeral 308 denotes a generated code buffer, 309 denotes an inverse quantization unit, 310 denotes an IDCT unit, 311 denotes a motion difference compensation unit, 312 denotes a decoded image memory, and 314 denotes a block position storage unit.
[0045]
As is clear from the comparison between FIG. 5 and FIG. 3, the moving image compression encoding apparatus of the second embodiment includes a block position storage unit 314 instead of the frame management unit 213 in the first embodiment shown in FIG. It has become.
[0046]
The block position storage unit 314 stores the position of a block in which encoding has been discontinued, and the amount of information to be stored may be 1-bit information indicating whether or not truncation has occurred for each block position. It can be realized with a small-scale circuit. That is, when the frame next to the frame in which encoding has been discontinued is a P-VOP, the motion prediction detection unit 302 does not forcibly convert the frame into an I-VOP (see FIG. 4B). Referring to the block truncation information of the block position storage unit 314, when a motion prediction block candidate includes a coding truncation block as shown in FIG. 6 described later, the motion prediction block candidate is converted from the motion prediction candidate. Control to remove.
[0047]
FIG. 6 is a diagram for explaining the characteristic operation of the moving picture compression encoding apparatus shown in FIG. In FIG. 6, it means that 3 × 4 pixels (pixels indicated by black circles) for an arbitrary 8 × 8 pixel motion prediction block candidate have been coded and terminated in the previous frame.
[0048]
As shown in FIG. 6, the position of the block in which the coding is discontinued is stored in the block position storage unit 314, and the motion prediction detection unit 302 determines a part of the motion prediction block candidate in accordance with the stored position information. When the coding prediction block includes a coding cutoff block, control is performed such that the motion prediction block candidate is excluded from the motion prediction candidates.
[0049]
FIG. 7 is a diagram for explaining the characteristic operation of the moving picture compression encoding apparatus as the third embodiment according to the present invention. In FIG. 7, the original search area BL0 has four blocks in the horizontal direction and three blocks in the vertical direction. The rectangular area BL0 is defined as a first rectangular area BL1 having three blocks in the horizontal direction and one block in the vertical direction. It shows that the rectangular area is divided into three rectangular areas, that is, a second rectangular area BL2 having four blocks in the horizontal direction and one block in the vertical direction, and a third rectangular area BL3 having three blocks in the horizontal direction and one block in the vertical direction. .
[0050]
When detecting a motion prediction block, the range of the prediction block to be searched is limited in advance. For example, a search is performed in a rectangular area surrounding the current block position. Here, the motion prediction detection process is also performed on a region including two rectangular regions (for example, a region (R0) extending over the first rectangular region BL1 and the second rectangular region BL2).
[0051]
In the third embodiment, as shown in FIG. 7, in the motion prediction detection process, before the motion prediction detection unit 302 detects the motion prediction, the block position storage unit 304 is referred to and the rectangle to be searched originally is determined. The region is divided into a plurality of rectangular regions that do not include the coding cutoff block. As a result, in the motion detection process, it is necessary to perform a search process for each rectangular area, but there is no need to detect whether or not a coding discontinuation occurrence block is included in a motion prediction block candidate unit. Processing can be simplified.
[0052]
As described above, according to the moving image compression encoding apparatus according to each embodiment of the present invention, in a frame next to an image in which encoding has been discontinued, an image in which encoding has been discontinued is subjected to a prediction image for motion prediction difference detection. No need to use as Therefore, in the frame including the coding discontinuation occurrence block, even if the image decoded inside the encoder and the image decoded from the encoded stream in the receiving device are different, the same image is reproduced as the I-VOP in the next frame. Is done. As a result, the overflow of the generated code amount can be controlled without interrupting the flow of data from the quantization unit 205 (305) to the inverse quantization unit 209 (309).
[0053]
【The invention's effect】
As described above in detail, according to the present invention, it is possible to provide a moving image compression encoding apparatus capable of controlling an encoded stream at a constant transfer rate without complicating the configuration.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating an example of a conventional moving image compression encoding device.
FIG. 2 is a diagram for explaining processing in a motion difference detection unit.
FIG. 3 is a block diagram showing a first embodiment of the moving picture compression encoding apparatus according to the present invention.
FIG. 4 is a diagram for explaining processing of a rate control unit in the moving picture compression encoding apparatus of FIG. 3;
FIG. 5 is a block diagram showing a second embodiment of the moving picture compression encoding apparatus according to the present invention.
6 is a diagram for explaining a characteristic operation of the moving picture compression encoding device shown in FIG. 5;
FIG. 7 is a diagram for explaining a characteristic operation of a moving image compression encoding apparatus as a third embodiment according to the present invention.
[Explanation of symbols]
101, 201, 301 ... current image memory
102, 202, 302... Motion difference detection unit
103, 203, 303 ... DCT unit
104, 204, 304 ... quantization unit
105, 205, 305 DC / AC prediction unit
106, 206, 306 ... rate control unit
107, 207, 307 ... variable length coding unit
108, 208, 308... Generated code buffer
109, 209, 309: inverse quantization unit
110, 210, 310 ... IDCT unit
111, 211, 311... Motion difference compensator
112, 212, 312... Show decoded image memories.
213: Frame management unit
314: Block position storage unit
2071, 3071 ... coding termination means

Claims

A motion difference detection unit that divides each frame of the video data into blocks and detects a predicted motion difference between the current frame and the previous frame for each block;
Variable-length encoding means for performing variable-length encoding by performing a prescribed orthogonal transformation and quantization on the result of the predicted motion difference or the original image data,
Decoded image generating means for generating a decoded image by performing a prescribed inverse quantization, inverse orthogonal transform, and motion compensation processing on the data on which the orthogonal transform and quantization have been performed;
At the time of the variable-length encoding process, encoding discontinuing means for discontinuing the encoding of each block in the middle,
A frame management unit configured to control to perform encoding without using motion prediction in encoding the next frame image by being notified of the encoding termination from the encoding termination unit. Moving image compression encoding apparatus.

2. The moving image compression encoding apparatus according to claim 1, further comprising a rate control unit configured to control a generated encoding amount of a group including a plurality of frames to be equal to a target encoded amount,
The moving picture compression method, wherein the rate control means re-assigns a target code amount by treating the next frame as the first frame of the rate control unit group by being notified of the coding cut-off from the coding cut-off means. Encoding device.

A motion difference detection unit that divides each frame of the video data into blocks and detects a predicted motion difference between the current frame and the previous frame for each block;
Variable-length encoding means for performing variable-length encoding by performing a prescribed orthogonal transformation and quantization on the result of the predicted motion difference or the original image data,
Decoded image generating means for generating a decoded image by performing a prescribed inverse quantization, inverse orthogonal transform, and motion compensation processing on the data on which the orthogonal transform and quantization have been performed;
At the time of the variable-length encoding process, encoding discontinuing means for discontinuing the encoding of each block in the middle,
Block position storing means for storing a position of a block in which the coding abort has occurred by the coding abort means, wherein the coding abort occurs in a motion prediction of a block of a next frame image in which the coding abort occurs. A moving image compression encoding apparatus characterized in that the selected block is not used as a motion prediction block candidate.

4. The moving image compression encoding apparatus according to claim 3, wherein, based on the coding cutoff block information stored in the block position storage unit, the motion prediction of the block of the next frame image in which the coding cutoff occurs. Is divided into one or more areas that do not include any coding cutoff blocks.

5. The moving image compression encoding apparatus according to claim 1, wherein the encoding cutoff unit is configured such that data on which orthogonal transform and quantization have been performed by the variable length encoding unit are predetermined. 6. A moving image compression encoding apparatus characterized in that when a value is exceeded, encoding of the block is discontinued halfway.