JP6157436B2

JP6157436B2 - Alignment apparatus and program

Info

Publication number: JP6157436B2
Application number: JP2014226966A
Authority: JP
Inventors: 裕二中沢
Original assignee: Secom Co Ltd
Current assignee: Secom Co Ltd
Priority date: 2014-11-07
Filing date: 2014-11-07
Publication date: 2017-07-05
Anticipated expiration: 2034-11-07
Also published as: JP2016091407A

Description

The present invention relates to an alignment apparatus and a program that align a part model, which imitates the shape of a part of an object, with an object region extracted from an image, for purposes such as estimating the posture of a person.

To estimate the posture of a person in an image, part models imitating the shapes of the head, torso, upper arms, forearms, upper limbs, and lower limbs are aligned with and fitted to a person region extracted from the image. Conventionally, this alignment has been achieved by iterating the following process: each part model is rotated and translated on the image by small amounts in a number of different ways to generate a posture model of the whole person, and the overlap of that posture model with the entire person region is then evaluated.

Japanese Patent Laid-Open No. 07-302341

In the prior art as disclosed in Patent Document 1, the amount of computation required to align the part models for posture estimation is enormous, which makes real-time processing difficult to achieve.

One reason for the large amount of computation in the prior art is that a plurality of part models are aligned with the entire person region all at once. If the part models are instead aligned with the person region one at a time in sequence, a part aligned earlier can impose a constraint on the arrangement of a part aligned later, so the amount of computation can be reduced effectively. However, the prior art has no mechanism for aligning part models one at a time in sequence.

Another reason for the large amount of computation in the prior art is that, in each iteration, the part model is rotated and translated by small amounts in a number of different ways, and the rotation amount and translation amount that maximize the overlap with the person region are found by trial and error. If the rotation amount and translation amount that maximize the overlap with the person region could be calculated deterministically for the arrangement of the part model in each iteration, trial and error would be eliminated and the number of iterations could be reduced effectively. However, there has been no effective algorithm for calculating the rotation amount deterministically.

The present invention has been made in view of the above problems. Its object is to provide an alignment apparatus and a program that can align a part model, which imitates the shape of a part of an object, with an object region extracted from an image with high accuracy and a small amount of computation.

An alignment apparatus according to the present invention aligns a part model, which imitates the shape of a part constituting a predetermined object, with an object region obtained by extracting the predetermined object in a two-dimensional or three-dimensional processing target space. The apparatus includes: storage means for storing the shape and the principal axes of inertia of the part model; initial arrangement means for initially arranging the part model at a position where the part model and the object region produce an overlapping region; movement amount calculation means for estimating a principal axis of inertia of the overlapping region, calculating a rotation amount of the part model that brings the direction of the principal axis of inertia of the part model closer to the principal axis of inertia of the overlapping region, and calculating a translation amount of the part model that increases the overlapping region; arrangement update means for rotating the part model by the rotation amount and translating it by the translation amount; and iterative control means for repeating the processing by the movement amount calculation means and the arrangement update means until a predetermined end condition is satisfied.

In another alignment apparatus according to the present invention, the processing target space is three-dimensional. The movement amount calculation means projects the principal axis of inertia of the overlapping region onto each of three planes, each of which has two of the three principal axes of inertia of the three-dimensional part model as its coordinate axes. For each of the three planes, it calculates the component of the rotation amount about the axis orthogonal to that plane according to the angle between one of the principal axes of inertia of the part model set as a coordinate axis of that plane and the principal axis of inertia of the overlapping region projected onto that plane.

In still another alignment apparatus according to the present invention, the movement amount calculation means divides the overlapping region into a plurality of partial regions arranged along the direction of a principal axis of inertia of the part model, and estimates, as the principal axis of inertia of the overlapping region, a straight line that approximates the sequence of the centers of gravity of the partial regions and passes through the center of gravity of the overlapping region.

In still another alignment apparatus according to the present invention, the movement amount calculation means divides the overlapping region into two partial regions by a straight line or plane orthogonal to a principal axis of inertia of the part model, and estimates, as the principal axis of inertia of the overlapping region, a straight line passing through the centers of gravity of the two partial regions and of the overlapping region.

A program according to the present invention causes a computer to perform processing that aligns a part model, predetermined to imitate the shape of a part constituting a predetermined object, with an object region obtained by extracting the predetermined object in a two-dimensional or three-dimensional processing target space. The program causes the computer to initially arrange the part model at a position where the part model and the object region produce an overlapping region, and then to repeat the following two processes until a predetermined end condition is satisfied: a process of estimating a principal axis of inertia of the overlapping region, calculating a rotation amount of the part model that brings the direction of the principal axis of inertia of the part model closer to the principal axis of inertia of the overlapping region, and rotating the part model by that rotation amount; and a process of calculating a translation amount of the part model that increases the overlapping region and translating the part model by that translation amount.

According to the present invention, an alignment apparatus is obtained that can align a part model, which imitates the shape of a part of an object, with an object region extracted from an image with high accuracy and a small amount of computation.

A schematic block configuration diagram of a posture estimation apparatus according to an embodiment of the present invention.
A schematic functional block diagram of the posture estimation apparatus according to the embodiment of the present invention.
A schematic diagram showing a state in which a part model exists in the world coordinate system.
A schematic diagram explaining the condition on the height of the initial position of the part model.
A schematic diagram explaining the method of calculating the translation amount by the mean shift method.
A schematic diagram explaining the method by which the movement amount calculation means calculates the rotation amount of the local coordinate system.
A flowchart showing the general operation of the posture estimation apparatus according to the first embodiment of the present invention.
A schematic diagram showing a state in which a part model exists in the world coordinate system.
A schematic diagram showing, in tabular form, the rules for setting the reference axis, dividing plane, and divided regions for each rotation axis.
A schematic diagram explaining how to obtain the rotation amount when the x axis is the rotation axis.
A schematic diagram explaining how to obtain the rotation amount when the y axis is the rotation axis.
A schematic diagram explaining how to obtain the rotation amount when the z axis is the rotation axis.
A flowchart showing the general operation of the posture estimation apparatus according to the second embodiment of the present invention.

Hereinafter, a posture estimation apparatus that is a mode for carrying out the present invention (hereinafter referred to as an embodiment) will be described with reference to the drawings. The posture estimation apparatus according to the embodiment treats a person as the object of posture estimation and aligns a part model, which imitates the shape of a part of the person, with a person region extracted from a monitoring image of a monitored space. It then estimates the posture of the person based on the degree of coincidence and the arrangement of the aligned part model.

[First embodiment]
For posture estimation, the posture estimation apparatus 1 according to the first embodiment of the present invention aligns a two-dimensional part model with a two-dimensional person region extracted from a monitoring image.

FIG. 1 is a schematic block configuration diagram of the posture estimation apparatus 1. The posture estimation apparatus 1 includes an imaging unit 2, a storage unit 3, an image processing unit 4, and an output unit 5. The imaging unit 2, the storage unit 3, and the output unit 5 are connected to the image processing unit 4.

The imaging unit 2 is a monitoring camera that captures images of the monitored space, and it inputs the captured monitoring images to the image processing unit 4.

The storage unit 3 is a storage device such as a ROM (Read Only Memory), a RAM (Random Access Memory), or a hard disk. It stores the programs and data used by the image processing unit 4 as well as various data generated by the image processing unit 4, and it exchanges these programs and data with the image processing unit 4.

The image processing unit 4 consists of a processor such as a CPU (Central Processing Unit), a DSP (Digital Signal Processor), or an MCU (Micro Control Unit) and its peripheral circuits. For example, the image processing unit 4 is a computer that operates according to a program stored in the storage unit 3. The image processing unit 4 operates as each of the means described below, estimates the posture of a person from the monitoring image, and outputs the result to the output unit 5.

The output unit 5 is, for example, a display device that presents the estimated posture of the person to the user.

FIG. 2 is a schematic functional block diagram of the posture estimation apparatus 1. The imaging unit 2 and the output unit 5 are omitted from FIG. 2. The image processing unit 4 operates as object region extraction means 40, initial arrangement means 41, movement amount calculation means 42, arrangement update means 43, iterative control means 44, and posture estimation means 45. Of these, the initial arrangement means 41, the movement amount calculation means 42, the arrangement update means 43, and the iterative control means 44 function as the alignment apparatus 10 according to the present invention. The storage unit 3 functions as storage means 30 that stores information used by the alignment apparatus 10. Specifically, the storage means 30 stores part model information 300.

The object region extraction means 40 extracts the region of a person from the monitoring image as the object region, for example by difference processing between the monitoring image and a background image prepared in advance. In the difference image obtained by the extraction, the person region is represented, for example, by setting pixel values inside the person region to "1" and pixel values outside it to "0".
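The difference processing described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes grayscale images stored as nested lists, and the function name `extract_person_region` and the `threshold` value are hypothetical.

```python
def extract_person_region(frame, background, threshold=30):
    """Return a binary image: 1 where the frame differs from the background, else 0.

    `threshold` is an assumed tuning parameter; the source only says
    "difference processing" without specifying how differences are binarized.
    """
    return [
        [1 if abs(f - b) > threshold else 0 for f, b in zip(frame_row, bg_row)]
        for frame_row, bg_row in zip(frame, background)
    ]


# Tiny 3x4 example: three bright pixels stand in for the person.
background = [[10, 10, 10, 10],
              [10, 10, 10, 10],
              [10, 10, 10, 10]]
frame = [[10, 200, 10, 10],
         [10, 210, 190, 10],
         [10, 12, 10, 10]]
mask = extract_person_region(frame, background)
```

Small differences such as the pixel with value 12 fall below the assumed threshold and stay at 0, so only the three bright pixels are marked as person region.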

The alignment apparatus 10 treats the difference image representing the person region as a two-dimensional processing target space and aligns a part model with a target part of the person region in that space. Hereinafter, the two-dimensional processing target space of this embodiment is referred to as the processing target image. In this embodiment, the head and torso of the person are together treated as a single target part, and the posture estimation apparatus 1 estimates the posture of the upper body by having the alignment apparatus 10 align an elliptical part model with the head and torso of the person region on the processing target image. In this alignment process, the part model is placed at an initial position, and arrangement updates that rotate and translate the part model are performed repeatedly so that the arrangement of the part model converges on the target part. The alignment thus aims to make one end of the major axis of the ellipse coincide with the top of the head and the other end coincide with the buttocks.

The part model information 300 defines the part model and includes information representing its shape and orientation. The part model is a two-dimensional shape that models the shape of the part to be aligned. In principle the shape of the part model is not restricted to, for example, axial or central symmetry, and it can be any two-dimensional shape suited to the target part. In this embodiment it is an ellipse.

Here, the coordinate system of the processing target image is defined, for example, as a right-handed orthogonal coordinate system XY. The processing target image and the monitoring image can basically share the same coordinate system. The coordinate system fixed to the part model is called the local coordinate system and is defined, for example, as a right-handed orthogonal coordinate system xy.

The local coordinate system is set so that the ellipse serving as the part model 60 is expressed by the following equation.
(x^2 / Rx^2) + (y^2 / Ry^2) = 1

Here, Rx < Ry. That is, the part model 60 is an ellipse with semi-major axis Ry in the y-axis direction and semi-minor axis Rx in the x-axis direction. The center c of the ellipse is the origin of the local coordinate system. Accordingly, the y axis and the x axis are the principal axes of inertia of the part model 60, and the origin is its center of gravity.

When the part model 60 has an arbitrary shape, the axes of the first and second principal components obtained by principal component analysis of that shape can be taken as the y axis and the x axis, respectively.
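As a hedged sketch of that principal component analysis, the code below computes the local axes of an arbitrary shape given as a list of (X, Y) pixel coordinates. The function name `principal_axes` is hypothetical, and the closed-form angle of the 2x2 covariance matrix is used here instead of a general eigen-solver.

```python
import math


def principal_axes(pixels):
    """Return (center, y_axis, x_axis) for a shape given as (X, Y) pixels.

    y_axis is the first principal component (direction of largest variance),
    x_axis the second, oriented so that (x_axis, y_axis) is right-handed.
    """
    n = len(pixels)
    mx = sum(p[0] for p in pixels) / n
    my = sum(p[1] for p in pixels) / n
    sxx = sum((p[0] - mx) ** 2 for p in pixels) / n
    syy = sum((p[1] - my) ** 2 for p in pixels) / n
    sxy = sum((p[0] - mx) * (p[1] - my) for p in pixels) / n
    # Orientation of the largest-variance eigenvector of [[sxx, sxy], [sxy, syy]].
    angle = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    y_axis = (math.cos(angle), math.sin(angle))
    x_axis = (y_axis[1], -y_axis[0])  # y_axis rotated by -90 degrees
    return (mx, my), y_axis, x_axis


# A 2-pixel-wide, 6-pixel-tall bar: the long direction should become the y axis.
shape = [(X, Y) for X in range(2) for Y in range(6)]
center, y_axis, x_axis = principal_axes(shape)
```

The sign of each axis is arbitrary in principal component analysis, so a real implementation would still need a convention for which end of the y axis counts as the top.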

In this embodiment, where the processing target space is two-dimensional, the apparent shape of the target part in the person region changes with the positional relationship between the monitoring camera and the target part. To handle this, part model information 300 representing a plurality (M variants) of part models 60 with different shapes is prepared as candidates. For example, a plurality of ellipses with different combinations of Rx and Ry can be prepared in the part model information 300 as part models 60. Alternatively, the image processing unit 4 generates a plurality of part models 60 according to, for example, the size of the person region extracted by the object region extraction means 40 and stores them in the storage means 30.

FIG. 3 is a schematic diagram showing a state in which the part model 60 exists in the coordinate system of the processing target image. The pose of the part model 60 is described by the coordinate axes and origin of the local coordinate system, expressed in the coordinate system of the processing target image. That is, a translation of the part model 60 is described by a movement of the origin of the local coordinate system (point c) in the coordinate system of the processing target image, and a rotation of the part model 60 is described by a change in the directions of the x and y axes of the local coordinate system in the coordinate system of the processing target image.
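The relationship between the two coordinate systems can be illustrated with a short sketch. The class name `EllipsePose` and the choice of a single angle `phi` to encode the axis directions are assumptions made for this example; the source only states that the pose is described by the origin and the axis directions of the local coordinate system.

```python
import math
from dataclasses import dataclass


@dataclass
class EllipsePose:
    cx: float   # X coordinate of the local origin c in the image
    cy: float   # Y coordinate of the local origin c in the image
    phi: float  # angle of the local x axis relative to the image X axis (radians)
    rx: float   # semi-minor axis Rx (along local x)
    ry: float   # semi-major axis Ry (along local y)

    def to_local(self, X, Y):
        """Convert image coordinates (X, Y) to local coordinates (x, y)."""
        dx, dy = X - self.cx, Y - self.cy
        c, s = math.cos(self.phi), math.sin(self.phi)
        return (dx * c + dy * s, -dx * s + dy * c)

    def contains(self, X, Y):
        """True if the image point lies inside x^2/Rx^2 + y^2/Ry^2 <= 1."""
        x, y = self.to_local(X, Y)
        return (x / self.rx) ** 2 + (y / self.ry) ** 2 <= 1.0


pose = EllipsePose(cx=10.0, cy=20.0, phi=0.0, rx=2.0, ry=5.0)
turned = EllipsePose(cx=10.0, cy=20.0, phi=math.pi / 2, rx=2.0, ry=5.0)
```

With this representation, `pose.contains(14, 20)` is false because the point lies 4 pixels from the center along the short axis, while `turned.contains(14, 20)` is true because the long axis now points along the image X axis.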

The initial arrangement means 41 determines the initial position of the ellipse that is placed on the processing target image as the part model. For example, taking an upright posture as the basic posture of a person, the orientation of the ellipse at the initial position is set so that the y axis points in the vertical direction of the monitoring image. In this embodiment the processing target image has the same coordinate system as the monitoring image, so the y axis is set to point in the direction of the Y axis. The position of the ellipse can be set based on the position of the person region in the processing target image. For example, the X coordinate of the center of the ellipse (that is, the origin of the local coordinate system) in the coordinate system of the processing target image can be matched to the position of the person region in the X-axis direction. The Y coordinate of the center of the ellipse, on the other hand, is set so that the ellipse and the person region produce an overlapping region and the top of the ellipse is slightly higher than the top of the person region.
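One possible reading of this initial arrangement rule is sketched below. Several details are assumptions not stated in the source: image Y is taken to increase downward (so "higher" means a smaller Y), the X position of the person region is taken as the mean X of its pixels, and `margin` is a hypothetical parameter for "slightly higher".

```python
def initial_center(person_pixels, ry, margin=2.0):
    """Return an initial ellipse center (cx, cy) for an upright ellipse.

    Assumes image Y grows downward, so the top of the person region is min(Y).
    The top of the ellipse (cy - ry) is placed `margin` pixels above that top.
    """
    xs = [p[0] for p in person_pixels]
    top = min(p[1] for p in person_pixels)
    cx = sum(xs) / len(xs)   # assumed: horizontal position = mean X of the region
    cy = top - margin + ry   # so that cy - ry == top - margin
    return (cx, cy)


# A person region 3 pixels wide spanning Y = 10 .. 30.
person = [(X, Y) for X in range(4, 7) for Y in range(10, 31)]
cx, cy = initial_center(person, ry=8.0)
```

As long as `margin` is smaller than the ellipse height 2 * Ry and the region has pixels near `cx`, the ellipse still overlaps the person region, which the later movement calculation relies on.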

The ellipse is initially placed so that it overlaps the person region because, as described later, the movement amount calculation means 42 uses the center of gravity of the overlapping region to obtain the rotation amount and translation amount of the part model 60. The part model 60 is initially placed higher than the person region, that is, higher than the height of the person, because in this embodiment the part model 60 is aligned with the head and torso that form the upper body of the person. Convergence of the iterative rotation and translation of the part model 60 in the alignment apparatus 10 is judged from the size of the overlapping region between the part model 60 and the person region, so convergence can be declared regardless of whether the overlapping region lies in the upper body or the lower body. Consequently, if the initial position of the part model 60 is set toward the bottom of the person region, the process is more likely to converge with the vertex of the ellipse not coinciding with the top of the head. To reduce this likelihood, the top of the part model 60 is initially set higher than the top of the person region.

FIG. 4 is a schematic diagram explaining the above condition on the height of the initial position of the part model 60. FIG. 4(a) shows the case where the initial position is set so that the top of the part model 60 is higher than the height of the person. State (a-1) shows the initial setting, state (a-2) shows the convergence process, and state (a-3) shows the final converged result. In state (a-3), one end of the ellipse in its longitudinal direction is located at the top of the head and the other end at the buttocks, so the part model 60 is well aligned with the head and torso, which are the target part of the person region 62. FIG. 4(b), on the other hand, shows the case where the initial position is set so that the top of the part model 60 is lower than the height of the person. State (b-1) shows the initial setting, state (b-2) shows the convergence process, and state (b-3) shows the final converged result. In this case the vertices of the part model 60 do not converge on the top of the head or the buttocks, and the part model 60 converges on an unintended part of the person region 62.

The movement amount calculation means 42 calculates the translation amount of the origin of the local coordinate system of the part model and the rotation amount of the x axis or y axis. In this embodiment the translation amount is obtained by the mean shift method. FIG. 5 is a schematic diagram explaining how the translation amount is calculated by the mean shift method. In this embodiment the movement amount calculation means 42 obtains, for example, the center of gravity g of the person region lying inside the part model 60, that is, of the overlapping region between the part model 60 and the person region 62. In FIG. 5, the center of gravity g in the state of FIG. 5(a) is marked with "●". The movement amount calculation means 42 then calculates, as the translation amount, the vector from the ellipse center c, where the origin of the local coordinate system is located, to the center of gravity g. FIG. 5(b) shows the state after the part model 60 has been moved from the state of FIG. 5(a) by the calculated translation amount: the "×" mark indicating the ellipse center c has moved to the position of the "●" mark.
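The translation step can be sketched as below, assuming the person region is available as a list of (X, Y) pixel coordinates and the part model as an inside/outside predicate. The names `translation_amount` and `inside` are hypothetical, and the handling of an empty overlap is an added safeguard that the source does not discuss (its initial arrangement is meant to guarantee an overlap).

```python
def centroid(pixels):
    """Center of gravity of a non-empty list of (X, Y) pixels."""
    n = len(pixels)
    return (sum(p[0] for p in pixels) / n, sum(p[1] for p in pixels) / n)


def translation_amount(person_pixels, center, inside):
    """Vector from the ellipse center c to the center of gravity g of the overlap."""
    overlap = [p for p in person_pixels if inside(p)]
    if not overlap:
        return (0.0, 0.0)  # assumed fallback; not specified in the source
    g = centroid(overlap)
    return (g[0] - center[0], g[1] - center[1])


# Upright ellipse at the origin with Rx = 2, Ry = 4.
center = (0.0, 0.0)
rx, ry = 2.0, 4.0


def inside(p):
    return ((p[0] - center[0]) / rx) ** 2 + ((p[1] - center[1]) / ry) ** 2 <= 1.0


# Person region lying to the right of the ellipse center.
person = [(X, Y) for X in range(1, 6) for Y in range(-3, 4)]
shift = translation_amount(person, center, inside)
```

In this example only the pixels with X = 1 and the single pixel (2, 0) fall inside the ellipse, so the computed shift points to the right, toward the person region.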

FIG. 6 is a schematic diagram explaining how the movement amount calculation means 42 calculates the rotation amount of the local coordinate system. FIG. 6(a) shows an example arrangement of the person region 62 and the part model 60 in a state for which the rotation amount is to be calculated. In this embodiment the rotation amount θ is determined so that a principal axis of inertia of the part model 60 approaches the principal axis of inertia of the overlapping region between the part model 60 and the person region 62. The movement amount calculation means 42 selects one of the principal axes of inertia of the part model as the reference axis for the rotation operation, estimates the principal axis of inertia of the overlapping region that corresponds to the reference axis, and uses it as the target axis for the reference axis. It then calculates the rotation amount θ of the part model 60 that brings the direction of the reference axis closer to the target axis.

In this embodiment, of the two orthogonal principal axes of inertia, the one with the smallest moment of inertia is used as the reference axis L. For the elliptical part model 60, this principal axis coincides with the major axis of the ellipse. The y axis of the local coordinate system is also set to coincide with the major axis.

In this embodiment, the principal axis of inertia of the overlapping region is estimated, and a straight line approximating it is used as the target axis. Specifically, the movement amount calculating means 42 obtains the center of gravity g of the overlapping region between the part model 60 and the person region 62, and divides the overlapping region into two regions with a dividing line P that passes through g and is orthogonal to the reference axis L. These two regions are called divided regions A and B. The movement amount calculating means 42 then obtains the centers of gravity ga and gb (called partial centroids) of the divided regions A and B. In FIG. 6(b), ga and gb are marked with "■" and "▲", respectively. In this embodiment, the straight line passing through ga, g, and gb is used as an approximation of the principal axis of inertia of the overlapping region. Aligning the reference axis of the part model 60 with the direction of the line through the centroids of the two partial regions A and B, which are arranged along the reference axis L, can be expected to enlarge the overlapping region. The movement amount calculating means 42 therefore takes this line as the target axis and calculates the angle between it and the axis L as the rotation amount θ. Let L' be the straight line that is parallel to L and passes through g. The movement amount calculating means 42 obtains the angle θa between L' and the segment connecting g and ga, and the angle θb between L' and the segment connecting g and gb, and takes the average of θa and θb as the rotation amount θ.
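The two-region rotation estimate described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation of the embodiment: the function name `rotation_amount`, the use of NumPy, the representation of the overlapping region as an array of pixel coordinates, and the counter-clockwise-positive sign convention are all choices made for this example.

```python
import numpy as np

def rotation_amount(overlap_xy, axis_dir):
    """Estimate the rotation amount (radians) from the overlap pixels.

    overlap_xy: (N, 2) coordinates of the pixels in the overlapping region.
    axis_dir:   direction vector of the reference axis L.
    """
    pts = np.asarray(overlap_xy, dtype=float)
    u = np.asarray(axis_dir, dtype=float)
    u = u / np.linalg.norm(u)

    g = pts.mean(axis=0)            # center of gravity g of the overlap
    s = (pts - g) @ u               # signed position along L; s = 0 is line P
    a, b = pts[s > 0], pts[s < 0]   # divided regions A and B
    if len(a) == 0 or len(b) == 0:
        return 0.0                  # assumed fallback: no rotation

    ga, gb = a.mean(axis=0), b.mean(axis=0)   # partial centroids

    def signed_angle(v):
        # Angle from the direction of L' to v (assumed sign convention).
        return np.arctan2(u[0] * v[1] - u[1] * v[0], u @ v)

    theta_a = signed_angle(ga - g)
    theta_b = signed_angle(g - gb)  # measured against the same direction of L'
    return 0.5 * (theta_a + theta_b)
```

Both angles are measured against the same direction of L', so each lies within ±90 degrees and the average is well defined.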

Here, θa and θb are averaged to reduce the influence of calculation error, for example when the overlapping region contains few pixels. In principle, however, ga, g, and gb lie on a single straight line, and θa and θb are the vertical angles formed by that line and the line L', so they are equal; either θa or θb alone can therefore be used as the rotation amount θ. In addition, to cope with the reduced accuracy of the centroid coordinates when the overlapping region contains few pixels, the rotation amount may be reduced according to the number of pixels in the overlapping region, which prevents oscillation or divergence caused by excessive rotation.

図６（ｃ）は、部位モデル６０を角度θだけ回転させ主軸Ｌの向きを重複領域の主軸の近似方向に変えた状態を示している。ここでは重心ｇを中心に部位モデル６０を回転させた状態を示している。 FIG. 6C shows a state where the part model 60 is rotated by an angle θ and the direction of the main axis L is changed to the approximate direction of the main axis of the overlapping region. Here, a state where the part model 60 is rotated around the center of gravity g is shown.

The arrangement updating means 43 updates the arrangement of the part model 60 by translating it by the translation amount and rotating it by the rotation amount obtained by the movement amount calculating means 42. That is, the origin of the local coordinate system is translated by the obtained translation amount, and the local coordinate system is rotated by the obtained rotation amount. If the center of rotation is taken to be the origin of the local coordinate system (the ellipse center c), the translation and the rotation become independent operations, so the result no longer depends on the processing order: whichever is executed first, the model is updated to the same arrangement.
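The order independence noted above can be checked with a small sketch. The pose representation (a center c plus an orientation angle) and the helper names `rot`, `translate`, and `rotate` are assumptions for illustration only.

```python
import numpy as np

def rot(theta):
    """2D rotation matrix for a counter-clockwise angle theta (radians)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s], [s, c]])

def translate(pose, t):
    """Move the local origin c by t; the orientation is unchanged."""
    c, phi = pose
    return (c + np.asarray(t, dtype=float), phi)

def rotate(pose, theta, pivot=None):
    """Rotate the pose by theta about pivot (default: its own origin c)."""
    c, phi = pose
    pivot = c if pivot is None else np.asarray(pivot, dtype=float)
    return (pivot + rot(theta) @ (c - pivot), phi + theta)

# Rotation about the local origin leaves c unchanged, so the two orders agree.
pose = (np.array([2.0, 3.0]), 0.1)
a = rotate(translate(pose, (1.0, -2.0)), 0.3)
b = translate(rotate(pose, 0.3), (1.0, -2.0))
same_result = np.allclose(a[0], b[0]) and np.isclose(a[1], b[1])
```

With any other pivot, for example the center of gravity g of the overlapping region, the two orders generally give different centers.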

Each time the movement amount calculating means 42 and the arrangement updating means 43 complete one arrangement update, the iterative control means 44 determines whether the part model 60 satisfies the termination condition of the alignment process. For example, the termination condition can be that the translation amount and the rotation amount of a single update have converged to or below predetermined thresholds. The iterative control means 44 has the movement amount calculating means 42 and the arrangement updating means 43 repeat their processing until the termination condition is satisfied; once it is satisfied, the arrangement of the aligned part model 60 is stored in the part model information 300.

The termination condition is not limited to the above example. For example, the termination condition may be that the ratio R_IN of the area of the person region contained in the part model 60 is at least a predetermined value β. Here β is set to a value, for example 0.95, at which a good alignment can be expected. In addition, to guard against the iteration falling into an infinite loop, the termination condition can be defined so that the iteration is aborted when a predetermined number of iterations is exceeded.

姿勢推定手段４５は位置合わせされた部位モデル６０の配置に基づいて監視画像に写る人物の姿勢を推定する。 The posture estimation means 45 estimates the posture of the person shown in the monitoring image based on the position of the aligned part model 60.

図７は本実施形態の姿勢推定装置１の概略の動作を示すフロー図である。姿勢推定装置１は監視空間に人がいない状態にて起動され、当該状態にて撮影部２により監視空間を撮影し背景画像を取得する（ステップＳ２）。なお、姿勢推定装置１の動作中において背景画像を適宜更新することもできる。取得した背景画像は記憶部３に保存する。 FIG. 7 is a flowchart showing a schematic operation of the posture estimation apparatus 1 of the present embodiment. The posture estimation apparatus 1 is activated in a state where there is no person in the monitoring space, and in this state, the monitoring space is imaged by the imaging unit 2 to acquire a background image (step S2). Note that the background image can be appropriately updated during the operation of the posture estimation apparatus 1. The acquired background image is stored in the storage unit 3.

撮影部２は監視空間を例えば、一定時間ごとに撮影する（ステップＳ４）。撮影された画像は監視画像として画像処理部４に入力される。 The imaging unit 2 images the monitoring space at regular intervals, for example (step S4). The captured image is input to the image processing unit 4 as a monitoring image.

In the image processing unit 4, the object region extraction means 40 performs background subtraction on the monitoring image using the background image stored in the storage unit 3, and generates a difference image in which the person region has been extracted, as the processing target image for the alignment apparatus 10 (step S6). The difference image is, for example, a binarized image in which the person region in the monitoring image has the pixel value "1" and all other regions have the pixel value "0".
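A minimal sketch of the background subtraction in step S6 is shown below. The fixed threshold `thresh` and the function name `extract_person_region` are assumptions; the description only specifies that the result is a binary image with the person region set to 1.

```python
import numpy as np

def extract_person_region(frame, background, thresh=30):
    """Return a binary mask: 1 where frame differs from background, else 0.

    frame, background: uint8 images of equal shape (grayscale or color).
    thresh: assumed per-pixel difference threshold.
    """
    # Widen the type first so the subtraction cannot wrap around.
    diff = np.abs(frame.astype(np.int32) - background.astype(np.int32))
    if diff.ndim == 3:
        diff = diff.max(axis=2)   # largest difference over the color channels
    return (diff > thresh).astype(np.uint8)
```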

The alignment apparatus 10 performs the alignment process for each of the M kinds of part model information 300 stored in the storage means 30. The image processing unit 4 sets an index I, which indicates the kind of part model information 300, to the initial value "1" (step S8), and then increments I by one at a time so that the alignment apparatus 10 executes the alignment process for each of the M part models 60.

The alignment apparatus 10 reads the I-th part model information 300 from the storage means 30 (step S10). The initial placement means 41 places the part model 60 represented by the I-th part model information 300 at the initial position, which is set based on the person region as described above (step S12), and the apparatus then starts the process of iteratively updating the arrangement of the part model 60 to align it with the target part of the person region (steps S14 to S20).

In this iterative process, the movement amount calculating means 42 calculates the translation amount and the rotation amount of the part model 60, as described above, based on the overlapping region between the current part model 60 and the person region (step S14), and the arrangement updating means 43 updates the part model 60 to a new arrangement based on that result (step S16). Specifically, the ellipse center c, which indicates the position of the part model 60, and the direction of the y-axis in the coordinate system of the processing target image, which indicates the posture (orientation) of the part model 60, are updated and stored in the part model information 300.

Each time the arrangement updating means 43 completes one update, the iterative control means 44 performs processing to control the iteration (steps S18 and S20). For example, it compares the translation amount and the rotation amount calculated by the movement amount calculating means 42 with their respective convergence thresholds (step S18). If either exceeds its threshold, the iteration continues ("No" in step S20); if both are at or below their thresholds, the iteration is regarded as converged and ends ("Yes" in step S20). An upper limit on the number of iterations may also be set in advance, with the iteration ending when that limit is exceeded.
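The control flow of steps S14 to S20 can be sketched as a generic loop. The function name `align`, the callback signatures, and the default threshold values are hypothetical; the description does not give concrete threshold values.

```python
import math

def align(pose, compute_move, apply_move, eps_t=0.5, eps_r=0.01, max_iter=100):
    """Iterate until both movement amounts fall to or below their thresholds.

    compute_move(pose) -> ((dx, dy), theta)        # step S14
    apply_move(pose, (dx, dy), theta) -> new pose  # step S16
    Returns (pose, iterations_used, converged).
    """
    for it in range(1, max_iter + 1):
        shift, theta = compute_move(pose)
        pose = apply_move(pose, shift, theta)
        # Steps S18/S20: converged only if BOTH amounts are small enough.
        if math.hypot(shift[0], shift[1]) <= eps_t and abs(theta) <= eps_r:
            return pose, it, True
    # Upper limit reached: stop anyway so the loop cannot run forever.
    return pose, max_iter, False
```

Returning the converged flag lets a caller distinguish a true convergence from a cut-off at the iteration limit.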

When the iteration for the I-th part model 60 ends, the iterative control means 44 calculates the degree of coincidence between the part model 60 and the person region (step S22). For example, the value R_IN, obtained by dividing the area S_IN of the person region contained in the part model 60 by the area of the part model 60, can be used as the degree of coincidence. Alternatively, the ratio R_OUT of the residual area outside the part model 60, (S - S_IN), to the total area S of the person region can be used as an index whose smaller values indicate a higher degree of coincidence. The calculated value is stored in the part model information 300.
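The two indices can be computed directly from binary masks, as in the following sketch. The function name `coincidence` and the mask representation are assumptions; the formulas follow the definitions of R_IN and R_OUT given above.

```python
import numpy as np

def coincidence(model_mask, person_mask):
    """Return (R_IN, R_OUT) for a part model mask and a person region mask.

    Both masks are assumed to be non-empty boolean images of equal shape.
    """
    model = np.asarray(model_mask, dtype=bool)
    person = np.asarray(person_mask, dtype=bool)
    s_in = np.count_nonzero(model & person)   # person area inside the model
    s = np.count_nonzero(person)              # total person area S
    r_in = s_in / np.count_nonzero(model)     # higher means better fit
    r_out = (s - s_in) / s                    # lower means better fit
    return r_in, r_out
```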

位置合わせ装置１０は位置合わせ処理が完了した部位モデル６０が最後の、つまりＭ番目のものでなければ（ステップＳ２４にて「Ｎｏ」の場合）、インデックスＩをインクリメントして（ステップＳ２６）次の部位モデル６０についての位置合わせ処理（ステップＳ１０〜Ｓ２２）を行う。 The alignment apparatus 10 increments the index I (step S26) if the part model 60 for which the alignment process has been completed is not the last, that is, the Mth one (in the case of “No” in step S24). The alignment process (steps S10 to S22) for the part model 60 is performed.

On the other hand, when the alignment process has been completed for all M part models 60 ("Yes" in step S24), the posture estimation means 45 compares the degrees of coincidence stored in the part model information 300 for the respective part models 60, selects the part model 60 that best fits the target part of the person region, and estimates the posture of the person (step S28).

画像処理部４は監視画像が得られるたびに（ステップＳ４）、上述したステップＳ６〜Ｓ２８の処理を行う。 Every time a monitoring image is obtained (step S4), the image processing unit 4 performs the processes of steps S6 to S28 described above.

In this embodiment, the mean shift method is used to calculate the translation amount. A suitable translation amount is thereby uniquely determined from the positional relationship between the part model 60 and the person region 62, so the part model 60 is aligned with the person region 62 more efficiently than, for example, by translating it in fixed small increments.
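One mean shift step for the translation can be sketched as below. It assumes a flat (uniform) kernel whose support is the model ellipse, so the shift is the vector from the model center to the centroid of the person pixels inside the model; the kernel actually used in the embodiment may differ. The names `ellipse_mask` and `mean_shift_translation` are hypothetical.

```python
import numpy as np

def ellipse_mask(shape, center, rx, ry, phi=0.0):
    """Boolean mask of an ellipse with center (x, y) and semi-axes rx, ry.

    phi rotates the ellipse's local axes relative to the image axes.
    """
    yy, xx = np.mgrid[:shape[0], :shape[1]]
    dx, dy = xx - center[0], yy - center[1]
    lx = np.cos(phi) * dx + np.sin(phi) * dy     # image -> local x
    ly = -np.sin(phi) * dx + np.cos(phi) * dy    # image -> local y
    return (lx / rx) ** 2 + (ly / ry) ** 2 <= 1.0

def mean_shift_translation(person_mask, center, rx, ry, phi=0.0):
    """Shift (dx, dy) from the model center to the overlap centroid."""
    overlap = ellipse_mask(person_mask.shape, center, rx, ry, phi)
    overlap &= np.asarray(person_mask, dtype=bool)
    ys, xs = np.nonzero(overlap)
    if xs.size == 0:
        return np.zeros(2)        # assumed fallback: no overlap, no movement
    return np.array([xs.mean(), ys.mean()]) - np.asarray(center, dtype=float)
```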

Furthermore, in this embodiment the rotation amount is determined in each iteration so that the principal axis of inertia of the part model 60 is aligned with the straight line estimated to be the principal axis of inertia of the overlapping region. As with the mean shift method for translation, a suitable rotation amount is uniquely determined from the positional relationship between the part model 60 and the person region 62, so the part model 60 is aligned with the person region 62 efficiently.

部位モデル６０と人物領域６２との重複領域の慣性主軸を推定する方法は上記実施形態以外のやり方とすることもできる。 The method of estimating the inertial principal axis of the overlapping region between the part model 60 and the person region 62 may be a method other than the above embodiment.

例えば、重複領域について主成分分析を行って慣性主軸を求めることができる。具体的には、処理対象画像における重複領域のピクセルの分布について得られる主成分分析の第一成分が重複領域の慣性主軸となる。そこで、上記実施形態で位置合わせの基準軸に採用している部位モデル６０の最小慣性モーメントの慣性主軸を、重複領域について主成分分析で求めた最小慣性モーメントの慣性主軸に合わせるように部位モデル６０の回転量を算出する構成とすることができる。 For example, a principal component analysis can be performed on the overlapping region to obtain the principal axis of inertia. Specifically, the first component of the principal component analysis obtained for the pixel distribution of the overlapping region in the processing target image becomes the inertial principal axis of the overlapping region. Therefore, the part model 60 is adapted so that the inertia principal axis of the minimum moment of inertia of the part model 60 adopted as the reference axis for alignment in the above embodiment matches the inertia principal axis of the minimum moment of inertia obtained by principal component analysis for the overlapping region. The amount of rotation can be calculated.
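The principal component variant can be sketched as follows. The direction of largest variance of the overlap pixels is the axis about which the moment of inertia is smallest, so the eigenvector of the covariance matrix with the largest eigenvalue is used. The function names and the sign handling are assumptions for illustration.

```python
import numpy as np

def principal_axis(points):
    """Unit vector along the first principal component of (N, 2) points."""
    pts = np.asarray(points, dtype=float)
    centered = pts - pts.mean(axis=0)
    cov = centered.T @ centered / len(pts)
    _, vecs = np.linalg.eigh(cov)   # eigenvalues are returned in ascending order
    return vecs[:, -1]              # eigenvector of the largest eigenvalue

def rotation_to_axis(ref_dir, target_dir):
    """Signed angle (radians) rotating ref_dir onto the line along target_dir."""
    u = np.asarray(ref_dir, dtype=float)
    v = np.asarray(target_dir, dtype=float)
    u, v = u / np.linalg.norm(u), v / np.linalg.norm(v)
    if u @ v < 0:
        v = -v                      # an axis has no sign; take the nearer direction
    return np.arctan2(u[0] * v[1] - u[1] * v[0], u @ v)
```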

In the above embodiment, the overlapping region is divided into two regions by the dividing line P orthogonal to the principal axis of inertia L of the part model 60 when the principal axis of inertia of the overlapping region is estimated. Alternatively, the overlapping region can be divided into three or more partial regions arranged along the axis L, and a straight line Q that approximates the arrangement of the centroids of these partial regions and passes through the center of gravity g of the overlapping region can be taken as the estimated principal axis of inertia of the overlapping region. Specifically, the overlapping region is divided into three or more regions by a plurality of straight lines orthogonal to L, and the partial centroid of each divided region is obtained. Then, under the constraint that the line passes through the center of gravity g of the overlapping region, a straight line approximating the distribution of the partial centroids is obtained by, for example, the least squares method, and this line is used as the straight line Q. When this line is obtained, a weighted least squares method is used in which the size of the divided region corresponding to each partial centroid serves as its weight.
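The multi-strip variant can be sketched as below. As one possible reading of "least squares", the sketch minimizes the weighted sum of squared perpendicular distances from the partial centroids to a line through g (orthogonal least squares); the description does not say which residual is used. Equal-width strips, the strip count `k`, and the function name `fit_axis_from_strips` are likewise assumptions.

```python
import numpy as np

def fit_axis_from_strips(points, axis_dir, k=4):
    """Fit line Q through g to the centroids of k strips orthogonal to L.

    Returns (g, q): the center of gravity and a unit direction vector of Q.
    """
    pts = np.asarray(points, dtype=float)
    u = np.asarray(axis_dir, dtype=float)
    u = u / np.linalg.norm(u)
    g = pts.mean(axis=0)

    s = (pts - g) @ u                          # position along L
    edges = np.linspace(s.min(), s.max(), k + 1)
    idx = np.digitize(s, edges[1:-1])          # strip index 0 .. k-1

    scatter = np.zeros((2, 2))
    for i in range(k):
        strip = pts[idx == i]
        if len(strip) == 0:
            continue
        d = strip.mean(axis=0) - g             # partial centroid relative to g
        scatter += len(strip) * np.outer(d, d) # weight = strip size in pixels

    _, vecs = np.linalg.eigh(scatter)
    q = vecs[:, -1]                            # direction of best-fit line
    return g, (q if q @ u >= 0 else -q)        # orient Q like L
```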

In the above embodiment, on the assumption that the person's upper body is upright, the major axis (y-axis) of the elliptical part model 60 at the initial position is oriented along the Y-axis. The overlapping region therefore also tends to be relatively elongated vertically, and the axis obtained for it by the above method of dividing the overlapping region with the straight line P orthogonal to the y-axis can be expected to approximate the principal axis of inertia with the minimum moment of inertia, just like the principal axis of inertia set for the part model 60. The part model 60 can thus be suitably aligned with the head and torso of the target person.

On the other hand, the overlapping region may not be vertically elongated, for example when the person's upper body is strongly inclined. In that case, the axis obtained by the above method of dividing the overlapping region with the straight line P orthogonal to the y-axis is oriented close to a principal axis of inertia whose moment of inertia is not the minimum, and the part model 60 may converge to an unintended posture. To avoid this, a plurality of initial orientations may be set for the same part model 60, the iterative process may be performed for each to obtain its degree of coincidence, and the posture giving the maximum degree of coincidence may be stored together with that degree of coincidence in the part model information 300 for use by the posture estimation means 45.

[Second Embodiment]
In the following description of the second embodiment of the present invention, components identical to those of the first embodiment are denoted by the same reference numerals, explanations of matters in common are omitted, and the differences from the first embodiment are mainly described.

第二の実施形態に係る姿勢推定装置１は姿勢推定に際し、監視画像から求めた三次元の人物領域に三次元の部位モデルを位置合わせする。すなわち、第一の実施形態は二次元空間での位置合わせであったのに対し、本実施形態は三次元空間での位置合わせである点が基本的な相違点である。本実施形態では処理対象空間の座標系をワールド座標系とし、右手系の直交座標系ＸＹＺで定義する。また部位モデル６０に固定されたローカル座標系を右手系の直交座標系ｘｙｚで定義する。 At the time of posture estimation, the posture estimation device 1 according to the second embodiment aligns a three-dimensional part model with a three-dimensional human region obtained from a monitoring image. That is, the first embodiment is the alignment in the two-dimensional space, whereas the present embodiment is the basic difference in that the alignment is in the three-dimensional space. In the present embodiment, the coordinate system of the processing target space is defined as a world coordinate system, which is defined by a right-handed orthogonal coordinate system XYZ. A local coordinate system fixed to the part model 60 is defined by a right-handed orthogonal coordinate system xyz.

本実施形態の構成は、第一の実施形態の図１及び図２に示すものと基本的に同じであり、これらの図を援用する。 The configuration of the present embodiment is basically the same as that shown in FIGS. 1 and 2 of the first embodiment, and these figures are incorporated.

撮影部２は複数のカメラであり、それらが撮影する監視画像から監視空間にいる人物の立体形状を視体積交差法などにより求めることができるように配置される。例えば、各カメラは魚眼レンズを装着し、天井から真下向きに設置される。 The photographing unit 2 is a plurality of cameras, and is arranged so that a three-dimensional shape of a person in the surveillance space can be obtained from a surveillance image photographed by them by a visual volume intersection method or the like. For example, each camera is equipped with a fisheye lens and is installed directly downward from the ceiling.

これら複数のカメラの内部パラメータ及び外部パラメータは予めのキャリブレーションにより計測され、カメラパラメータとして記憶部３に記憶される。なお、外部パラメータによりワールド座標系における各カメラの位置・姿勢が表され、ワールド座標系ＸＹＺの位置を各カメラの撮影面座標系ｕｖに変換（投影）したり、各カメラの撮影面座標系ｕｖの位置をワールド座標系ＸＹＺに変換（逆投影）したりすることが可能となる。 The internal parameters and external parameters of the plurality of cameras are measured by calibration in advance and stored in the storage unit 3 as camera parameters. The position / orientation of each camera in the world coordinate system is represented by an external parameter, and the position of the world coordinate system XYZ is converted (projected) into the shooting plane coordinate system uv of each camera, or the shooting plane coordinate system uv of each camera. Can be converted (backprojected) into the world coordinate system XYZ.

対象物領域抽出手段４０は、各カメラの監視画像について背景差分処理を行い人物のシルエットを表す差分画像を生成する。対象物領域抽出手段４０は記憶部３に記憶されたカメラパラメータを用いて差分画像をワールド座標系に逆投影する。この逆投影により、カメラ位置を頂点とする錐体の空間である視体積が定義される。対象物領域抽出手段４０は全カメラの視体積が交差する空間（積空間）を算出する。この積空間は人物の三次元の概略形状を表し、これが本実施形態における人物領域となる。なお、第一の実施形態では人物領域はピクセルの集合として扱うことができた。これと同様、本実施形態では人物領域はボクセルの集合として扱うことができる。 The object region extraction unit 40 performs background difference processing on the monitoring images of the cameras and generates a difference image representing the silhouette of the person. The object area extraction unit 40 backprojects the difference image onto the world coordinate system using the camera parameters stored in the storage unit 3. By this back projection, a viewing volume, which is a space of a cone with the camera position as a vertex, is defined. The object region extraction means 40 calculates a space (product space) where the viewing volumes of all the cameras intersect. This product space represents a three-dimensional schematic shape of a person, and this is a person area in this embodiment. In the first embodiment, the person area can be handled as a set of pixels. Similarly, in this embodiment, a person area can be handled as a set of voxels.
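The intersection of viewing volumes can be sketched on a voxel grid: a voxel belongs to the person region only if it projects into the silhouette of every camera. The function name `visual_hull` and the `projectors` callbacks, which stand in for the calibrated camera models, are hypothetical; real fisheye projection would replace them.

```python
import numpy as np

def visual_hull(voxel_centers, silhouettes, projectors):
    """Return a boolean array marking voxels inside all viewing volumes.

    voxel_centers: (N, 3) world coordinates XYZ.
    silhouettes:   list of binary images (rows = v, columns = u).
    projectors:    list of functions mapping (N, 3) XYZ to (N, 2) pixel (u, v).
    """
    pts = np.asarray(voxel_centers, dtype=float)
    inside = np.ones(len(pts), dtype=bool)
    for sil, project in zip(silhouettes, projectors):
        uv = np.rint(project(pts)).astype(int)
        u, v = uv[:, 0], uv[:, 1]
        h, w = sil.shape
        visible = (u >= 0) & (u < w) & (v >= 0) & (v < h)
        hit = np.zeros(len(pts), dtype=bool)
        hit[visible] = sil[v[visible], u[visible]] > 0
        inside &= hit               # keep only voxels seen in every silhouette
    return inside
```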

本実施形態では処理対象空間として、監視空間に対応して設定され三次元の人物領域が配置された仮想的な空間を定義する。処理対象空間と監視空間とは共通のワールド座標系ＸＹＺで定義される。 In the present embodiment, a virtual space in which a three-dimensional person area set corresponding to the monitoring space is arranged is defined as the processing target space. The processing target space and the monitoring space are defined by a common world coordinate system XYZ.

The alignment apparatus 10 estimates the posture of a person by aligning a three-dimensional part model with the target part of the person region in the processing target space in which the person region is placed. In this alignment process, the part model is placed at an initial position, and arrangement updates that rotate and translate the part model are repeated so that its arrangement converges on the target part. As in the first embodiment, the head and torso of the person are together treated as one target part, and a part model in the form of a flattened ellipsoid (hereinafter simply called an ellipsoid) is aligned with it. The alignment therefore aims to bring one end of the ellipsoid's major axis to the top of the head and the other end to the buttocks.

The shape of the part model can basically be any three-dimensional shape suited to the target part; in this embodiment it is an ellipsoid expressed in the local coordinate system by the following equation.
\[ \frac{x^2}{R_x^2} + \frac{y^2}{R_y^2} + \frac{z^2}{R_z^2} = 1 \]

ここで、Ｒｘ＜Ｒｙ＜Ｒｚとする。すなわち部位モデル６０はｚ軸方向に長半径Ｒｚを有する三軸不等楕円体である。ちなみに当該楕円体のｘ軸方向が人の前後方向、ｙ軸方向が胴体の脇方向、ｚ軸方向が身長方向に対応する。また楕円体の中心ｃがローカル座標系の原点となる。 Here, Rx <Ry <Rz. That is, the part model 60 is a three-axis unequal ellipsoid having a major radius Rz in the z-axis direction. Incidentally, the x-axis direction of the ellipsoid corresponds to the human front-rear direction, the y-axis direction corresponds to the torso side direction, and the z-axis direction corresponds to the height direction. The center c of the ellipsoid is the origin of the local coordinate system.

The storage means 30 stores part model information 300 for the ellipsoid serving as the part model 60. Unlike the first embodiment, this embodiment performs the alignment in three-dimensional space, so the shape of the target part does not change with the person's posture as it does when a three-dimensional object is projected onto a two-dimensional image. A single part model 60 representing the typical shape of a person's head and torso therefore suffices.

図８は部位モデル６０がワールド座標系に存在する状態を示す模式図である。部位モデル６０の姿勢は、ワールド座標系で記述されたローカル座標系の座標軸及び原点により記述される。すなわち、部位モデル６０の平行移動はワールド座標系でのローカル座標系の原点（点ｃ）の移動によって記述され、部位モデル６０の回転はワールド座標系でのローカル座標系のｘ軸，ｙ軸，ｚ軸の向きの変化によって記述される。 FIG. 8 is a schematic diagram showing a state in which the part model 60 exists in the world coordinate system. The posture of the part model 60 is described by the coordinate axes and the origin of the local coordinate system described in the world coordinate system. That is, the parallel movement of the part model 60 is described by the movement of the origin (point c) of the local coordinate system in the world coordinate system, and the rotation of the part model 60 is the x-axis, y-axis, It is described by a change in the orientation of the z-axis.
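The pose description above, combined with the ellipsoid equation, gives a simple test of whether a voxel lies inside the part model. In this sketch the local axes are stored as the columns of a 3x3 matrix expressed in world coordinates; that storage layout and the function name `inside_ellipsoid` are assumptions.

```python
import numpy as np

def inside_ellipsoid(points_world, center, axes, radii):
    """Return a boolean array: True where a world point lies in the ellipsoid.

    center: origin c of the local coordinate system, in world coordinates.
    axes:   3x3 matrix whose columns are the local x, y, z unit vectors.
    radii:  (Rx, Ry, Rz).
    """
    p = np.asarray(points_world, dtype=float) - np.asarray(center, dtype=float)
    local = p @ np.asarray(axes, dtype=float)   # components along local x, y, z
    return np.sum((local / np.asarray(radii, dtype=float)) ** 2, axis=1) <= 1.0
```

Translating the model changes only `center`, and rotating it changes only `axes`, which mirrors the description of translation and rotation in the world coordinate system.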

初期配置手段４１は、処理対象空間に部位モデルとして配置される楕円体の初期位置を決定する。例えば、人物の直立姿勢を基本姿勢と考え、初期位置における楕円体の向きはｚ軸の向きが処理対象空間における鉛直方向、つまりＺ軸方向となるように設定する。また、楕円体のｘ軸、ｙ軸は人物領域の水平断面形状、つまりＸＹ平面に沿った形状に応じて設定することができる。具体的には部位モデルのｘ軸、ｙ軸はそれぞれおおよそ人物領域の短手方向、長手方向になるように設定することができる。 The initial placement means 41 determines an initial position of an ellipsoid placed as a part model in the processing target space. For example, the upright posture of the person is considered as the basic posture, and the orientation of the ellipsoid at the initial position is set so that the direction of the z axis is the vertical direction in the processing target space, that is, the Z axis direction. Further, the x-axis and y-axis of the ellipsoid can be set according to the horizontal cross-sectional shape of the person area, that is, the shape along the XY plane. Specifically, the x-axis and y-axis of the part model can be set so as to be approximately in the short direction and the long direction of the person region, respectively.

また、楕円体の位置は、処理対象空間における人物領域の位置に基づいて設定することができる。例えば、楕円体の中心（つまりローカル座標系の原点）のワールド座標系におけるＸ，Ｙ座標は人物領域のＸ，Ｙ軸方向における位置に対応させることができる。一方、楕円体の中心のワールド座標系におけるＺ座標は、第一の実施形態と同じ理由から、楕円体と人物領域とが重複領域を生じ、かつ楕円体の最上部が人物領域の最上部よりも少し高い位置になるように設定する。 The position of the ellipsoid can be set based on the position of the person area in the processing target space. For example, the X and Y coordinates in the world coordinate system at the center of the ellipsoid (that is, the origin of the local coordinate system) can be made to correspond to the positions of the person region in the X and Y axis directions. On the other hand, the Z coordinate in the world coordinate system at the center of the ellipsoid is the same as in the first embodiment, and the ellipsoid and the person area overlap with each other, and the uppermost part of the ellipsoid is higher than the uppermost part of the person area. Also set it to a slightly higher position.

移動量算出手段４２は、部位モデルのローカル座標系の原点の平行移動量とｘ軸、ｙ軸及びｚ軸それぞれを中心とした回転量θｘ，θｙ，θｚとを算出する。平行移動量はミーンシフト法により求める。 The movement amount calculation means 42 calculates the parallel movement amount of the origin of the local coordinate system of the part model and the rotation amounts θx, θy, θz about the x axis, the y axis, and the z axis, respectively. The amount of translation is determined by the mean shift method.

The rotation amounts θx, θy, and θz are each calculated by extending to three dimensions the method of calculating the rotation amount of the two-dimensional part model 60 described in the first embodiment. In the first embodiment, the axis orthogonal to the xy-plane is the rotation axis, and the rotation amount θ about it is obtained. To do so, the major axis of the ellipse representing the part model 60 is taken as the reference axis, the overlapping region is divided into two divided regions A and B by the straight line P that passes through the center of gravity g of the overlapping region and is orthogonal to the reference axis, and θ is obtained as the angle between the reference axis and the direction along which the centroids ga and gb of the divided regions A and B and the center of gravity g of the whole overlapping region are arranged. In this embodiment, by contrast, each of the x-, y-, and z-axes is selected in turn as the rotation axis, and the rotation amount within the plane orthogonal to that rotation axis (the projection plane) is obtained. For each projection plane, the major axis of the ellipse seen when the ellipsoidal part model 60 is viewed along the rotation axis is taken as the reference axis, and the overlapping region is divided into two divided regions A and B by a plane P that passes through the center of gravity g of the overlapping region and is orthogonal to the reference axis. The centroids ga and gb of the divided regions A and B and the center of gravity g of the whole overlapping region are projected onto the projection plane to give the points ga', gb', and g', and the angle between the reference axis and the direction along which these points are arranged is taken as the rotation amount about the selected rotation axis. The notable differences from the first embodiment are that the overlapping region and the divided regions are three-dimensional regions, that the overlapping region is divided by a dividing plane P instead of the dividing line P of the first embodiment, and that the center of gravity g and the partial centroids ga and gb are points defined in three-dimensional space.

FIG. 9 is a schematic diagram showing, in table form, the rules for setting the reference axis, the dividing plane P, and the divided regions A and B for each rotation axis when Rx < Ry < Rz. FIG. 10 is a schematic diagram explaining how the rotation amount is obtained when the x-axis is the rotation axis: FIG. 10(a) is a perspective view of the part model 60 and the dividing plane P, and FIG. 10(b) is a projection viewed from the x-axis direction. Similarly, FIG. 11 and FIG. 12 are schematic diagrams explaining how the rotation amount is obtained when the y-axis and the z-axis, respectively, are the rotation axis. In FIGS. 10 to 12, the person region is not shown, and the overlapping region between the part model 60 and the person region is hatched.

For example, when the x-axis is the rotation axis, as shown in FIG. 10, the movement amount calculating means 42 sets the z-axis as the reference axis and sets, as the dividing plane P, a plane parallel to the xy-plane at the position passing through the center of gravity g. The movement amount calculating means 42 obtains the centroids (partial centroids) ga and gb of the divided regions A and B defined by the dividing plane P, and obtains the points ga' and gb' by projecting ga and gb onto the yz-plane. As an approximation of the line obtained by projecting the principal axis of inertia of the overlapping region onto the yz-plane, it uses the straight line passing through ga', g', and gb', and takes the angle between this line and the z-axis, which is the reference axis, as the rotation amount θx. For example, with L' denoting the straight line that is parallel to the reference axis and passes through the projected center of gravity g', it obtains the angle θa between L' and the segment connecting g' and ga', and the angle θb between L' and the segment connecting g' and gb', and takes the average of θa and θb as the rotation amount θx.
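The calculation of θx can be sketched as follows, with the overlap voxels expressed in the local xyz frame of the part model. The function name `rotation_about_x` and the right-handed sign convention (a positive rotation about +x turns the z-axis toward -y) are assumptions for this example.

```python
import numpy as np

def rotation_about_x(overlap_local):
    """Estimate theta_x (radians) from (N, 3) overlap voxels in local xyz."""
    pts = np.asarray(overlap_local, dtype=float)
    g = pts.mean(axis=0)                  # center of gravity of the overlap
    a = pts[pts[:, 2] > g[2]]             # region A: above dividing plane P
    b = pts[pts[:, 2] < g[2]]             # region B: below dividing plane P
    if len(a) == 0 or len(b) == 0:
        return 0.0                        # assumed fallback: no rotation
    ga, gb = a.mean(axis=0), b.mean(axis=0)

    def angle_from_z(v):
        # Projecting onto the yz-plane drops the x component. A rotation by
        # theta about +x maps the z-axis to (0, -sin(theta), cos(theta)).
        return np.arctan2(-v[1], v[2])

    theta_a = angle_from_z(ga - g)
    theta_b = angle_from_z(g - gb)        # measured against the same direction
    return 0.5 * (theta_a + theta_b)
```

The rotations about the y-axis and the z-axis follow the same pattern with the reference axis, dividing plane, and projection plane exchanged according to the rules in FIG. 9.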

The methods by which the movement amount calculation means 42 calculates the rotation amount θy about the y-axis and the rotation amount θz about the z-axis can be inferred from the method for θx described above, so their description is omitted.

Here, the z-axis, which is the reference axis when the x-axis or the y-axis is the rotation axis, is the principal axis of inertia that gives the minimum moment of inertia of the part model 60; in this respect it is the same as in the first embodiment. The y-axis, which is the reference axis when the z-axis is the rotation axis, is the major axis of the elliptical cross section of the part model 60 in the xy-plane, which it has in common with the first embodiment, but it differs from the first embodiment in that it is not the principal axis giving the minimum moment of inertia of the part model 60. In the ellipsoid serving as the part model 60, the z-axis is the principal axis of minimum moment of inertia, the x-axis is the principal axis of maximum moment of inertia, and the y-axis is the principal axis giving the intermediate moment of inertia. Basically, the rotation amounts θx and θy about the x- and y-axes determine the height direction of the person represented by the part model 60, and the rotation amount θz about the z-axis determines the front-back or left-right direction of that person.

In this way, for each of the three planes whose coordinate axes are two of the three principal axes of inertia of the three-dimensional part model 60, the movement amount calculation means 42 takes one of the coordinate axes as the reference axis, projects onto that plane the principal axis of inertia of the overlapping region estimated for that reference axis, and calculates the component of the rotation amount about the axis orthogonal to the plane according to the angle between the reference axis and the projected principal axis of inertia of the overlapping region. The rotation amount can thus be calculated deterministically with a small amount of computation.

Note that the modifications of the method of calculating θ described in the first embodiment can also be applied to this embodiment.

As in the first embodiment, this embodiment also achieves the effect that a suitable pose of the part model 60 with respect to the person region 62 is searched for efficiently. Incidentally, a suitable position of the part model can also be found by iterating the conventional steepest descent method. That method, however, obtains the correction for each pose parameter of the part model, one for each of the six degrees of freedom combining translation and rotation in three-dimensional space, by computing the gradient of the degree of match between the part model and the person region with respect to that parameter, and computing the gradient is costly. The present invention, as described in this embodiment and the first embodiment, does not require such costly gradient computation, so alignment can be faster than with that conventional method.

The arrangement updating means 43 corrects the position and orientation of the part model 60 according to the translation amount and rotation amount obtained by the movement amount calculation means 42. Here, decomposing the rotation that aligns the principal axis of inertia of the part model 60 with the target axis set in the overlapping region, such as the line ga-gb, into rotation amounts θx, θy, and θz about the x-, y-, and z-axes as described above can introduce error. That is, even after rotating by θx, θy, and θz about the x-, y-, and z-axes, the principal axis of inertia of the part model 60 does not in general coincide exactly with the target axis of the overlapping region. The iterative processing nevertheless makes the orientation of the part model 60 converge to a suitable direction. Because the final result of the rotation can differ depending on the order of the axes, the order is kept fixed throughout the iterative processing. For example, rotation is performed in the order x-axis, y-axis, z-axis. That is, the model is first rotated about the x-axis (the y- and z-axes of the local coordinate system rotate about the x-axis), then rotated about the y-axis of the resulting coordinate system (the local x- and z-axes rotate about the y-axis), and then rotated about the z-axis of that resulting coordinate system (the local x- and y-axes rotate about the z-axis).
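The fixed x, y, z rotation order about successively updated local axes can be illustrated as follows. This is a sketch under assumptions: the orientation is represented as a 3x3 matrix whose columns are the local axes in world coordinates, so rotating about the current local axis is a post-multiplication. The function names are hypothetical.

```python
import math

def matmul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def rot_x(t):
    c, s = math.cos(t), math.sin(t)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def rot_y(t):
    c, s = math.cos(t), math.sin(t)
    return [[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]]

def rot_z(t):
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def update_orientation(r, theta_x, theta_y, theta_z):
    """Rotate about local x, then the new local y, then the new local z.

    Post-multiplying applies each rotation in the current local frame.
    """
    r = matmul(r, rot_x(theta_x))
    r = matmul(r, rot_y(theta_y))
    return matmul(r, rot_z(theta_z))
```

Since `matmul(rot_x(a), rot_y(b))` and `matmul(rot_y(b), rot_x(a))` differ in general, swapping the axis order between iterations would change the result, which is why the order is held fixed.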

As for the order of the translation and the rotation, if the center of rotation is the origin of the local coordinate system (the ellipsoid center c), the model is corrected to the same position and orientation whichever is performed first.
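A short derivation makes this order independence explicit. The notation is introduced here for illustration and is not from the source: a model point with local coordinates q sits at world position p = c + Rq, t is the translation expressed in world coordinates, and ΔR is the rotation applied about the model center.

```latex
\begin{align*}
\text{translate, then rotate about the moved center:}\quad
  p = c + Rq &\;\mapsto\; (c + t) + Rq \;\mapsto\; (c + t) + \Delta R\,Rq,\\
\text{rotate about } c\text{, then translate:}\quad
  p = c + Rq &\;\mapsto\; c + \Delta R\,Rq \;\mapsto\; (c + t) + \Delta R\,Rq.
\end{align*}
```

Both orders end at the same center c + t and the same orientation ΔR R. The equality relies on the rotation center moving with the model; rotating about a fixed world point would not commute with the translation.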

FIG. 13 is a flowchart showing the general operation of the posture estimation apparatus 1 of this embodiment. Background image capture (step S50) and image capture (step S52) are performed for each of the plurality of cameras. The processing for each camera is basically the same as steps S2 and S4 of the first embodiment.

Whereas the person region extraction processing of the first embodiment (step S6) generates a difference image between the monitoring image and the background image and sets it as a two-dimensional processing target space, the person region extraction processing of this embodiment (step S54) sets a three-dimensional processing target space. Specifically, for each camera, the object region extraction means 40 generates a difference image from the background image captured in step S50 and stored in the storage unit 3 and the monitoring image captured in step S52, back-projects the per-camera difference images to generate a three-dimensional person region by the visual volume intersection (visual hull) method, and sets the three-dimensional processing target space in which that person region is placed (step S54).
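The visual volume intersection step can be sketched as follows: a voxel is kept only if it projects inside the foreground silhouette of every camera. This is a toy illustration, not the apparatus's implementation. The `visual_hull` function, the representation of silhouettes as sets of pixels, and the orthographic projections are all assumptions; a real system would use each camera's calibrated projection.

```python
from itertools import product

def visual_hull(voxels, cameras):
    """Keep voxels whose projection is foreground in every camera.

    cameras: list of (project, silhouette) pairs, where project maps a voxel
    (x, y, z) to a pixel and silhouette is the set of foreground pixels taken
    from that camera's difference image.
    """
    return [v for v in voxels
            if all(project(v) in silhouette for project, silhouette in cameras)]

# Toy scene: a 3x3x3 voxel grid seen by two orthographic cameras.
grid = list(product(range(3), repeat=3))
front = (lambda v: (v[0], v[1]), {(1, 1)})          # looks along the z-axis
side = (lambda v: (v[1], v[2]), {(1, 0), (1, 1)})   # looks along the x-axis
person_region = visual_hull(grid, [front, side])
```

In this toy scene only the voxels (1, 1, 0) and (1, 1, 1) are consistent with both silhouettes.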

The alignment apparatus 10 reads the part model information 300 from the storage means 30, uses the initial placement means 41 to place the part model 60 represented by that information at an initial position set based on the person region (step S56), and performs alignment processing that iteratively updates the placement of the part model 60 (steps S58 to S64).

In this iterative processing, the movement amount calculation means 42 calculates the translation amount and rotation amount of the part model 60 as described above, based on the current overlapping region of the part model 60 and the person region (step S58), and the arrangement updating means 43 updates the center c and orientation of the part model 60 based on that result and stores them in the part model information 300 (step S60). In this embodiment, the orientation of the part model 60 is changed about the x-, y-, and z-axes by the calculated rotation amounts θx, θy, and θz.

Each time the arrangement updating means 43 completes one update, the iteration control means 44 determines whether the termination condition of the iterative processing is satisfied (step S62), and causes the processing by the movement amount calculation means 42 and the arrangement updating means 43 (steps S58 and S60) to be repeated until it is satisfied ("No" in step S64).
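The control flow of steps S58 to S64 can be sketched as a generic loop. The text here does not spell out the termination condition, so the `is_converged` callback and the `max_iters` cap are assumptions, as are all function names. The one-dimensional usage is only a stand-in for the real movement computation.

```python
def run_alignment(pose, compute_move, apply_move, is_converged, max_iters=100):
    """Repeat move computation and placement update until the end condition."""
    iterations = 0
    while iterations < max_iters:
        move = compute_move(pose)        # step S58: translation and rotation amounts
        pose = apply_move(pose, move)    # step S60: update the placement
        iterations += 1
        if is_converged(move):           # steps S62/S64: end condition met?
            break
    return pose, iterations

# Toy usage: a 1-D "pose" that moves halfway to a target on each pass.
target = 8.0
pose, n = run_alignment(
    0.0,
    compute_move=lambda p: 0.5 * (target - p),
    apply_move=lambda p, m: p + m,
    is_converged=lambda m: abs(m) < 1e-6,
)
```

Passing the steps in as callbacks keeps the loop independent of whether the processing target space is two- or three-dimensional.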

When the iterative processing ends ("Yes" in step S64), the iteration control means 44 calculates the degree of match of the part model 60 with the person region (step S66). For example, the value R_IN, obtained by dividing the volume (number of voxels) V_IN of the person region contained in the part model 60 by the volume (number of voxels) of the part model 60, can be used as the degree of match. Alternatively, the ratio R_OUT of the residual volume outside the part model 60, (V − V_IN), to the total volume V of the person region can be used as an index for which a smaller value indicates a higher degree of match. The calculated value is stored in the part model information 300.
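The two indices can be computed directly from voxel sets. The sketch below assumes that both the part model and the person region are available as collections of integer voxel coordinates; the function name is hypothetical.

```python
def match_scores(model_voxels, person_voxels):
    """Return (R_IN, R_OUT) for non-empty voxel collections.

    R_IN  = V_IN / (model volume): higher means a better fit.
    R_OUT = (V - V_IN) / V:        lower means a better fit.
    """
    model = set(model_voxels)
    person = set(person_voxels)
    v_in = len(model & person)             # person voxels inside the model
    r_in = v_in / len(model)
    r_out = (len(person) - v_in) / len(person)
    return r_in, r_out

# Toy usage: 4 model voxels, 5 person voxels, 2 shared.
model = [(x, 0, 0) for x in range(0, 4)]
person = [(x, 0, 0) for x in range(2, 7)]
r_in, r_out = match_scores(model, person)
```

Here V_IN = 2, so R_IN = 2/4 and R_OUT = 3/5.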

The posture estimation means 45 estimates the posture of the person based on the placement of the part model 60 fitted to the target part of the person region.

Each time a monitoring image is obtained (step S52), the image processing unit 4 performs the processing of steps S54 to S66 described above.

In each of the above embodiments the posture of a person is estimated, but the object is not limited to a person; the invention can be applied to various objects made up of multiple parts, such as dogs, cats, and chairs. Nor is it limited to posture estimation: it can also be applied to detecting the position of a part, for example by taking "a person carrying a bag" as the object and "the bag" as the part and locating the bag.

Description of symbols: 1 posture estimation apparatus, 2 imaging unit, 3 storage unit, 4 image processing unit, 5 output unit, 10 alignment apparatus, 30 storage means, 40 object region extraction means, 41 initial placement means, 42 movement amount calculation means, 43 arrangement updating means, 44 iteration control means, 45 posture estimation means, 60 part model, 62 person region.

Claims

An alignment apparatus that, in a two-dimensional or three-dimensional processing target space, aligns a part model imitating the shape of a part of a predetermined object with an object region in which the predetermined object has been extracted, the apparatus comprising:
storage means for storing the shape and the principal axis of inertia of the part model;
initial placement means for initially placing the part model at a position where the part model and the object region form an overlapping region;
movement amount calculation means for estimating the principal axis of inertia of the overlapping region, calculating a rotation amount of the part model that brings the direction of the principal axis of inertia of the part model closer to the principal axis of inertia of the overlapping region, and calculating a translation amount of the part model that increases the overlapping region;
arrangement updating means for rotating the part model by the rotation amount and translating it by the translation amount; and
iteration control means for repeating the processing by the movement amount calculation means and the arrangement updating means until a predetermined termination condition is satisfied.

The alignment apparatus according to claim 1, wherein
the processing target space is three-dimensional, and
the movement amount calculation means projects the principal axis of inertia of the overlapping region onto each of three planes whose coordinate axes are two of the three principal axes of inertia of the three-dimensional part model, and, for each of the three planes, calculates the component of the rotation amount about the axis orthogonal to that plane according to the angle between the principal axis of inertia of the part model set as a coordinate axis of that plane and the principal axis of inertia of the overlapping region projected onto that plane.

The alignment apparatus according to claim 1, wherein the movement amount calculation means divides the overlapping region into a plurality of partial regions arranged along the direction of the principal axis of inertia of the part model, and estimates, as the principal axis of inertia of the overlapping region, a straight line that approximates the arrangement of the centroids of the partial regions and passes through the centroid of the overlapping region.

The alignment apparatus according to claim 1, wherein the movement amount calculation means divides the overlapping region into two partial regions by a straight line or a plane orthogonal to the principal axis of inertia of the part model, and estimates, as the principal axis of inertia of the overlapping region, a straight line passing through the centroids of the two partial regions and the centroid of the overlapping region.

A program for causing a computer to perform processing that, in a two-dimensional or three-dimensional processing target space, aligns a part model imitating the shape of a part of a predetermined object with an object region in which the predetermined object has been extracted, the program causing the computer to:
initially place the part model at a position where the part model and the object region form an overlapping region; and
repeat, until a predetermined termination condition is satisfied, processing of estimating the principal axis of inertia of the overlapping region, calculating a rotation amount of the part model that brings the direction of the principal axis of inertia of the part model closer to the principal axis of inertia of the overlapping region, and rotating the part model by the rotation amount, and processing of calculating a translation amount of the part model that increases the overlapping region and translating the part model by the translation amount.