[go: up one dir, main page]

CN109870669B - Two-dimensional multi-snapshot meshless compressed beam forming sound source identification method - Google Patents

Two-dimensional multi-snapshot meshless compressed beam forming sound source identification method Download PDF

Info

Publication number
CN109870669B
CN109870669B CN201910121202.4A CN201910121202A CN109870669B CN 109870669 B CN109870669 B CN 109870669B CN 201910121202 A CN201910121202 A CN 201910121202A CN 109870669 B CN109870669 B CN 109870669B
Authority
CN
China
Prior art keywords
sound source
matrix
sound
iteration
snapshot
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910121202.4A
Other languages
Chinese (zh)
Other versions
CN109870669A (en
Inventor
杨洋
刘宴利
褚志刚
张晋源
张永祥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing Industrial Vocational and Technical University
Chongqing University
Original Assignee
Chongqing University
Chongqing Industry Polytechnic College
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing University, Chongqing Industry Polytechnic College filed Critical Chongqing University
Priority to CN201910121202.4A priority Critical patent/CN109870669B/en
Publication of CN109870669A publication Critical patent/CN109870669A/en
Application granted granted Critical
Publication of CN109870669B publication Critical patent/CN109870669B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

本发明公开了一种二维多快拍无网格压缩波束形成声源识别方法,包括以下步骤:步骤1、获取测量声压矩阵P;步骤2、迭代求声压P:1、使用MATLAB的CVX工具箱中的SDPT3求解器求解:

Figure DDA0001971925020000011
2、确定第l+1次迭代规则化参数κl+1;3、第l+1次迭代确定的权矩阵Wl+1;当连续两次迭代的
Figure DDA0001971925020000012
间的相对变化量小于等于10‑3或者最大迭代次数被完成时,迭代终止;步骤3、估计声源DOA;步骤4、估计声源强度。本发明能准确估计声源彼此间距较小的DOA,并能定量获得声源的强度,提高了分辨率、去噪声能力和声源识别精度。

Figure 201910121202

The invention discloses a two-dimensional multi-snapshot gridless compression beamforming sound source identification method, comprising the following steps: step 1, obtaining the measured sound pressure matrix P ; step 2, iteratively seeking the sound pressure P: 1, using MATLAB The SDPT3 solver in the CVX toolbox solves:

Figure DDA0001971925020000011
2. Determine the regularization parameter κ l+1 of the l+1 iteration; 3. Determine the weight matrix W l+1 of the l+1 iteration; when two consecutive iterations
Figure DDA0001971925020000012
When the relative variation between is less than or equal to 10-3 or the maximum number of iterations is completed, the iteration is terminated; step 3, estimating the DOA of the sound source; step 4, estimating the intensity of the sound source. The invention can accurately estimate the DOA with small distances between sound sources, and can quantitatively obtain the intensity of the sound sources, thereby improving resolution, noise removal ability and sound source recognition accuracy.

Figure 201910121202

Description

一种二维多快拍无网格压缩波束形成声源识别方法A two-dimensional multi-snapshot gridless compression beamforming method for sound source identification

技术领域technical field

本发明属于声场识别技术领域。The invention belongs to the technical field of sound field identification.

背景技术Background technique

基于平面传声器阵列测量的压缩波束形成是实现声源二维波达方向(Direction-of-arrival,DOA)估计和强度量化的有效途径。传统压缩波束形成假设声源分布在一组预先设置的离散网格点上,每个网格点代表一个观测方向,施加稀疏约束体现为最小化声源分布向量的

Figure BDA0001971923000000015
范数。上述假设不成立时,其重构结果不准确,该问题称作基不匹配,在实际应用中经常出现。Compressed beamforming based on planar microphone array measurement is an effective way to realize two-dimensional direction-of-arrival (DOA) estimation and intensity quantification of sound sources. Traditional compressed beamforming assumes that the sound source is distributed on a set of preset discrete grid points, each grid point represents an observation direction, and the sparse constraint is reflected by minimizing the distribution vector of the sound source
Figure BDA0001971923000000015
norm. When the above assumptions are not established, the reconstruction results will be inaccurate. This problem is called base mismatch, which often occurs in practical applications.

为从根本上解决该问题,先后发展了二维单快拍和多快拍无网格压缩波束形成策略,相比于单快拍,多快拍方法更稳健,但现有的基于原子范数最小化(Atomic NormMinimization,ANM)的二维多快拍无网格压缩波束形成声源识别存在对声源彼此间距较小产生失效的缺陷。In order to fundamentally solve this problem, two-dimensional single-snapshot and multi-snapshot gridless compression beamforming strategies have been developed successively. Compared with single-snapshot, multi-snapshot method is more robust, but the existing atomic norm-based Atomic NormMinimization (ANM) two-dimensional multi-snapshot gridless compression beamforming sound source identification has the defect that the distance between sound sources is small.

发明内容Contents of the invention

针对现有技术存在的问题,本发明所要解决的技术问题就是提供一种二维多快拍无网格压缩波束形成声源识别方法,它使用迭代重加权原子范数最小化,能提高分辨率、去噪声能力和声源识别精度。Aiming at the problems existing in the prior art, the technical problem to be solved by the present invention is to provide a two-dimensional multi-snapshot gridless compression beamforming sound source identification method, which uses iterative re-weighted atomic norm minimization, which can improve the resolution , Noise removal ability and sound source identification accuracy.

本发明所要解决的技术问题是通过这样的技术方案实现的,它包括步骤:The technical problem to be solved in the present invention is realized by such technical scheme, and it comprises steps:

步骤1、获取测量声压矩阵PStep 1. Obtain the measured sound pressure matrix P ;

测量声压矩阵

Figure BDA0001971923000000011
为:Measuring Sound Pressure Matrix
Figure BDA0001971923000000011
for:

P=P+NP =P+N

测量声压矩阵P通过传声器阵列测量得到,

Figure BDA0001971923000000012
为噪声干扰,
Figure BDA0001971923000000013
为复数集,A为矩形传声器阵列的行数,B为矩形传声器阵列的列数,L为快拍次数;The measured sound pressure matrix P is obtained through the measurement of the microphone array,
Figure BDA0001971923000000012
for noise interference,
Figure BDA0001971923000000013
is a set of complex numbers, A is the number of rows of the rectangular microphone array, B is the number of columns of the rectangular microphone array, and L is the number of snapshots;

步骤2、重构声源在阵列传声器处产生的声压P;Step 2, reconstructing the sound pressure P generated by the sound source at the array microphone;

步骤1)、建立重构P的数学模型Step 1), establish the mathematical model of reconstruction P

Figure BDA0001971923000000014
Figure BDA0001971923000000014

式中,

Figure BDA0001971923000000021
是该式计算结果,||·||F表示Frobenius范数,P为待求声压矩阵,定义Tb(·)为二重Toeplitz算子,对任意给定向量u映射为A×A维的块Toeplitz型Hermitian矩阵:In the formula,
Figure BDA0001971923000000021
is the calculation result of this formula, ||·|| F represents the Frobenius norm, P is the sound pressure matrix to be sought, define T b (·) as a double Toeplitz operator, and map any given vector u into A×A dimension The block Toeplitz-type Hermitian matrix:

Figure BDA0001971923000000022
Figure BDA0001971923000000022

Tb(u)中,每个块Ta(0≤a≤A-1)都是一个B×B维的Toeplitz矩阵:In T b (u), each block T a (0≤a≤A-1) is a B×B-dimensional Toeplitz matrix:

Figure BDA0001971923000000023
Figure BDA0001971923000000023

Figure BDA0001971923000000024
都是辅助量,κ>0为规则化参数,
Figure BDA0001971923000000025
为单位矩阵,“tr(·)”表示求矩阵的迹,上标“H”表示共轭转置,“≥0”表示半正定;
Figure BDA0001971923000000024
are auxiliary quantities, κ>0 is a regularization parameter,
Figure BDA0001971923000000025
is the identity matrix, "tr( )" means to seek the trace of the matrix, the superscript "H" means conjugate transpose, and "≥0" means positive semi-definite;

步骤2)、求解PStep 2), solve for P

初始化

Figure BDA0001971923000000026
κ0=1,则W0=I,基于第l次迭代的结果
Figure BDA0001971923000000027
κl,第l+1次迭代的步骤为:initialization
Figure BDA0001971923000000026
κ 0 =1, then W 0 =I, based on the result of the lth iteration
Figure BDA0001971923000000027
κ l , the steps of the l+1th iteration are:

1.使用MATLAB的CVX工具箱中的SDPT3求解器求解:1. Use the SDPT3 solver in the CVX toolbox of MATLAB to solve:

Figure BDA0001971923000000028
Figure BDA0001971923000000028

2.确定规则化参数κl+1

Figure BDA0001971923000000029
2. Determine the regularization parameter κ l+1 :
Figure BDA0001971923000000029

Figure BDA00019719230000000210
Figure BDA00019719230000000211
的最大特征值;
Figure BDA00019719230000000210
for
Figure BDA00019719230000000211
The largest eigenvalue of ;

3.第l+1次迭代确定的权矩阵Wl+1

Figure BDA00019719230000000212
Figure BDA00019719230000000213
小于等于10-3或者最大迭代次数被完成时,迭代终止;3. The weight matrix W l+1 determined in the l+1 iteration:
Figure BDA00019719230000000212
when
Figure BDA00019719230000000213
When less than or equal to 10 -3 or the maximum number of iterations is completed, the iteration terminates;

步骤3、估计声源DOA;Step 3, estimating the sound source DOA;

步骤4、估计声源强度。Step 4. Estimate the intensity of the sound source.

本发明的技术效果是:Technical effect of the present invention is:

本发明能准确估计声源彼此间距较小的DOA,并能定量获得声源的强度,克服了现有ANM的二维多快拍无网格压缩波束形成的缺陷,提高了分辨率、去噪声能力和声源识别精度。The present invention can accurately estimate the DOA with small distance between sound sources, and can quantitatively obtain the intensity of the sound source, overcomes the defects of existing ANM's two-dimensional multi-snapshot gridless compression beamforming, improves the resolution and denoises ability and accuracy of sound source identification.

附图说明Description of drawings

本发明的附图说明如下:The accompanying drawings of the present invention are as follows:

图1为传声器阵列测量布局;Figure 1 is the measurement layout of the microphone array;

图2为声源频率为2000Hz、3000Hz、4000Hz和4900Hz时的仿真重构结果对比图;Figure 2 is a comparison chart of simulation reconstruction results when the sound source frequency is 2000Hz, 3000Hz, 4000Hz and 4900Hz;

图2(a)、(c)、(e)、(g)为ANM;图2(b)、(d)、(f)、(h)为本发明;Fig. 2 (a), (c), (e), (g) are ANM; Fig. 2 (b), (d), (f), (h) are the present invention;

图3为本发明与现有的ANM的声压P的重构精度、声源DOA的估计精度和声源强度的量化精度曲线对比图;Fig. 3 is the comparison chart of the reconstruction accuracy of the sound pressure P, the estimation accuracy of the sound source DOA and the quantization accuracy of the sound source intensity between the present invention and the existing ANM;

图3(a)

Figure BDA0001971923000000031
随Δmin的变化曲线;Figure 3(a)
Figure BDA0001971923000000031
Variation curve with Δ min ;

图3(b)

Figure BDA0001971923000000032
随Δmin的变化曲线;Figure 3(b)
Figure BDA0001971923000000032
Variation curve with Δ min ;

图3(c)

Figure BDA0001971923000000033
随Δmin的变化曲线;Figure 3(c)
Figure BDA0001971923000000033
Variation curve with Δ min ;

图4为试验布局图;Figure 4 is a test layout;

图5为声源频率为2000Hz、3000Hz、4000Hz和4900Hz时的实验声场重构结果对比图;Figure 5 is a comparison chart of the experimental sound field reconstruction results when the sound source frequency is 2000Hz, 3000Hz, 4000Hz and 4900Hz;

图5(a)、(c)、(e)、(g)为ANM;图5(b)、(d)、(f)、(h)为本发明。Fig. 5 (a), (c), (e), (g) are ANM; Fig. 5 (b), (d), (f), (h) are the present invention.

具体实施方式Detailed ways

下面结合附图和实施例对本发明作进一步说明:Below in conjunction with accompanying drawing and embodiment the present invention will be further described:

本发明包括以下步骤:The present invention comprises the following steps:

步骤1、获取测量声压矩阵P Step 1. Obtain the measured sound pressure matrix P

二维无网格压缩波束形成声源识别是利用矩形传声器阵列测量声信号。如图1所示的传声器阵列测量布局,符号“●”表示传声器,a=0,1,…,A-1,b=0,1,…,B-1分别为x、y维的传声器索引,Δx、Δy分别为x、y维的传声器间隔,θi、φi分别为i号声源DOA的仰角和方位角(0°≤θ≤90°、0°≤φ≤360°)。记

Figure BDA0001971923000000041
为各快拍下i号声源的强度(声源在(0,0)号传声器处产生的声压)组成的行向量,l=1,2,…,L为快拍索引,si,l为第l快拍下i号声源的强度,
Figure BDA0001971923000000042
为复数集。Two-dimensional gridless compressed beamforming sound source identification is to use a rectangular microphone array to measure the sound signal. The measurement layout of the microphone array is shown in Figure 1, the symbol "●" indicates the microphone, a=0,1,...,A-1, b=0,1,...,B-1 are the microphone indexes of x and y dimensions respectively , Δx, Δy are the microphone spacing in x and y dimensions, respectively, θ i , φ i are the elevation angle and azimuth angle of sound source i DOA (0°≤θ≤90°, 0°≤φ≤360°), respectively. remember
Figure BDA0001971923000000041
It is a row vector composed of the intensity of the sound source i in each snapshot (the sound pressure produced by the sound source at the (0,0) microphone), l=1,2,...,L is the index of the snapshot, si, l is the intensity of the sound source i in the first snapshot,
Figure BDA0001971923000000042
is a complex set.

假设声源辐射平面声波,波长为λ,t1i≡sinθi cosφiΔx/λ,t2i≡sinθi sinφiΔy/λ,则各快拍下声源在(a,b)号传声器处产生声压组成的行向量

Figure BDA0001971923000000043
可表示为:Assuming that the sound source radiates a plane sound wave with a wavelength of λ, t 1i ≡sinθ i cosφ i Δx/λ, t 2i ≡sinθ i sinφ i Δy/λ, then each snapshot of the sound source is generated at the microphones (a,b) row vector of sound pressures
Figure BDA0001971923000000043
Can be expressed as:

Figure BDA0001971923000000044
Figure BDA0001971923000000044

式(1)中,k为声源总数,

Figure BDA0001971923000000045
为虚数单位。In formula (1), k is the total number of sound sources,
Figure BDA0001971923000000045
is an imaginary unit.

构建矩阵:Build the matrix:

Figure BDA0001971923000000046
Figure BDA0001971923000000046

列向量

Figure BDA0001971923000000047
标量
Figure BDA0001971923000000048
行向量
Figure BDA0001971923000000049
其中,上标“T”表示转置,符号
Figure BDA00019719230000000410
表示Kronecker乘积,“||·||2”表示
Figure BDA00019719230000000416
范数,
Figure BDA00019719230000000411
为正实数集,||ψi||2=1,与式(1)对应,有:Column vector
Figure BDA0001971923000000047
scalar
Figure BDA0001971923000000048
row vector
Figure BDA0001971923000000049
Among them, the superscript "T" means transpose, and the symbol
Figure BDA00019719230000000410
Indicates the Kronecker product, "||·|| 2 "indicates
Figure BDA00019719230000000416
norm,
Figure BDA00019719230000000411
is a set of positive real numbers, ||ψ i || 2 = 1, corresponding to formula (1), we have:

Figure BDA00019719230000000412
Figure BDA00019719230000000412

存在噪声干扰

Figure BDA00019719230000000413
时,测量声压矩阵
Figure BDA00019719230000000414
可表示为:There is noise interference
Figure BDA00019719230000000413
When the sound pressure matrix is measured
Figure BDA00019719230000000414
Can be expressed as:

P=P+N (3)P =P+N (3)

在仿真试验中,添加噪声为独立同分布高斯白噪声,信噪比(Signal-to-noiseRatio,SNR)定义为SNR=20log10(||P||F/||N||F),由此可确定||N||F=||P||F10-SNR/20,其中,“||·||F”表示Frobenius范数。In the simulation experiment, the added noise is independent and identically distributed Gaussian white noise, and the Signal-to-noise Ratio (SNR) is defined as SNR=20log 10 (||P|| F /||N|| F ), given by This can determine ||N|| F =||P|| F 10 −SNR/20 , where “||·|| F ” denotes the Frobenius norm.

试验测试时,

Figure BDA00019719230000000415
由传声器阵列测量得到。When testing the test,
Figure BDA00019719230000000415
Measured by the microphone array.

步骤2、重构声源在阵列传声器处产生的声压PStep 2. Reconstruct the sound pressure P generated by the sound source at the array microphone

步骤1)、建立重构P的数学模型Step 1), establish the mathematical model of reconstruction P

二维多快拍无网格压缩波束形成后处理的第一步是滤除测量声压P中的噪声、重构声源在阵列传声器处产生的声压P,该步骤通过对声源施加稀疏约束来实现。建立源稀疏性的度量:The first step in the post-processing of 2D multi-snapshot gridless compression beamforming is to filter out the noise in the measured sound pressure P and reconstruct the sound pressure P produced by the sound source at the array microphone. This step is applied to the sound source Sparse constraints are implemented. Build a measure of source sparsity:

Figure BDA0001971923000000051
Figure BDA0001971923000000051

式(4)中,

Figure BDA0001971923000000052
是关于P的函数,
Figure BDA0001971923000000053
都是辅助量,κ>0为规则化参数,
Figure BDA0001971923000000054
为单位矩阵,“tr(·)”表示求矩阵的迹,上标“H”表示共轭转置,“≥0”表示半正定。In formula (4),
Figure BDA0001971923000000052
is a function of P,
Figure BDA0001971923000000053
are auxiliary quantities, κ>0 is a regularization parameter,
Figure BDA0001971923000000054
is the identity matrix, "tr(·)" means to seek the trace of the matrix, the superscript "H" means conjugate transpose, and "≥0" means positive semi-definite.

定义Tb(·)为二重Toeplitz算子,对任意给定向量u映射为A×A维的块Toeplitz型Hermitian矩阵:Define T b ( ) as a double Toeplitz operator, and any given vector u is mapped to an A×A-dimensional block Toeplitz-type Hermitian matrix:

Figure BDA0001971923000000055
Figure BDA0001971923000000055

式(5)中,每个块Ta(0≤a≤A-1)都是一个B×B维的Toeplitz矩阵:In formula (5), each block T a (0≤a≤A-1) is a B×B-dimensional Toeplitz matrix:

Figure BDA0001971923000000056
Figure BDA0001971923000000056

P的重构问题可写为:The reconstruction problem of P can be written as:

Figure BDA0001971923000000057
Figure BDA0001971923000000057

式(7)中,ε为噪声控制参数,通常取为||N||FIn formula (7), ε is the noise control parameter, usually taken as ||N|| F .

联立式(4)、(7)可得:Simultaneous formula (4), (7) can get:

Figure BDA0001971923000000058
Figure BDA0001971923000000058

式(8)中,

Figure BDA0001971923000000059
表示计算结果,P为待求声压矩阵。In formula (8),
Figure BDA0001971923000000059
Indicates the calculation result, and P is the sound pressure matrix to be sought.

现有的ANM二维多快拍无网格压缩波束形成声源识别方法采用P的原子范数

Figure BDA00019719230000000510
度量声源稀疏性,P的重构问题可写为:The existing ANM two-dimensional multi-snapshot gridless compressed beamforming sound source identification method adopts the atomic norm of P
Figure BDA00019719230000000510
To measure the sparsity of the sound source, the reconstruction problem of P can be written as:

Figure BDA00019719230000000511
Figure BDA00019719230000000511

Figure BDA0001971923000000061
为凸函数,该式为凸优化问题,可转化为如下半正定规划后利用CVX工具箱中的SDPT3求解器求解:
Figure BDA0001971923000000061
is a convex function, and this formula is a convex optimization problem, which can be transformed into the following semi-positive definite programming and solved by the SDPT3 solver in the CVX toolbox:

Figure BDA0001971923000000062
Figure BDA0001971923000000062

当声源间分离较小时,现有的ANM二维多快拍无网格压缩波束形成会因上式获得的

Figure BDA0001971923000000063
不能高概率准确包含声源DOA信息或对声源数目的预测不稳健可靠而失效。When the separation between sound sources is small, the existing ANM two-dimensional multi-snapshot gridless compression beamforming will be obtained by the above formula
Figure BDA0001971923000000063
Failure to accurately contain sound source DOA information with a high probability, or the prediction of the number of sound sources is not robust and reliable.

步骤2)、使用本方法发明求解PStep 2), use this method invention to solve P

Figure BDA0001971923000000064
κl为第l次迭代确定的最优解和规则化参数,
Figure BDA0001971923000000065
第l次迭代确定的权矩阵,第l+1次迭代的替代函数式为:remember
Figure BDA0001971923000000064
κ l is the optimal solution and regularization parameters determined in the lth iteration,
Figure BDA0001971923000000065
The weight matrix determined in the lth iteration, the substitution function of the l+1th iteration is:

Figure BDA0001971923000000066
Figure BDA0001971923000000066

式(9)中,

Figure BDA0001971923000000067
Figure BDA0001971923000000068
处的切平面,cl为与变量无关的常数。忽略cl,第l+1次迭代的最小化问题可写为:In formula (9),
Figure BDA0001971923000000067
Yes
Figure BDA0001971923000000068
The tangent plane at , c l is a constant that has nothing to do with variables. Neglecting c l , the minimization problem of the l+1th iteration can be written as:

Figure BDA0001971923000000069
Figure BDA0001971923000000069

式(10)为凸优化问题,可用CVX工具箱中的SDPT3求解器求解。Equation (10) is a convex optimization problem, which can be solved by the SDPT3 solver in the CVX toolbox.

初始化

Figure BDA00019719230000000610
κ0=1,则W0=I,为增强稀疏,令κ在迭代过程中逐渐减小,按下式变化:initialization
Figure BDA00019719230000000610
κ 0 =1, then W 0 =I, in order to enhance the sparseness, let κ gradually decrease in the iterative process, and change according to the following formula:

Figure BDA00019719230000000611
Figure BDA00019719230000000611

式(11)中,

Figure BDA00019719230000000612
Figure BDA00019719230000000613
的最大特征值,l为迭代次数。In formula (11),
Figure BDA00019719230000000612
for
Figure BDA00019719230000000613
The largest eigenvalue of , l is the number of iterations.

因此,初始化

Figure BDA00019719230000000614
κ0=1,则W0=I,基于第l次迭代的结果
Figure BDA00019719230000000615
κl,第l+1次迭代的步骤为:Therefore, initializing
Figure BDA00019719230000000614
κ 0 =1, then W 0 =I, based on the result of the lth iteration
Figure BDA00019719230000000615
κ l , the steps of the l+1th iteration are:

1.使用MATLAB的CVX工具箱中的SDPT3求解器求解:1. Use the SDPT3 solver in the CVX toolbox of MATLAB to solve:

Figure BDA0001971923000000071
Figure BDA0001971923000000071

2.确定规则化参数κl+1

Figure BDA0001971923000000072
2. Determine the regularization parameter κ l+1 :
Figure BDA0001971923000000072

Figure BDA0001971923000000073
Figure BDA0001971923000000074
的最大特征值;
Figure BDA0001971923000000073
for
Figure BDA0001971923000000074
The largest eigenvalue of ;

3.第l+1次迭代确定的权矩阵Wl+1

Figure BDA0001971923000000075
当连续两次迭代的
Figure BDA0001971923000000076
的相对变化量,即
Figure BDA0001971923000000077
小于等于10-3或者最大迭代次数被完成时,迭代终止,最大迭代次数选为20。3. The weight matrix W l+1 determined in the l+1 iteration:
Figure BDA0001971923000000075
When two consecutive iterations of
Figure BDA0001971923000000076
The relative change of
Figure BDA0001971923000000077
When it is less than or equal to 10 -3 or the maximum number of iterations is completed, the iteration terminates, and the maximum number of iterations is selected as 20.

步骤3、估计声源DOA;Step 3, estimating the sound source DOA;

该步骤的运算与现有技术ANM二维多快拍无网格压缩波束形成声源识别方法相同。参考文献Z.Yang,L.Xie,P.Stoica.Vandermonde decomposition of multilevelToeplitz matrices with application to multidimensional super-resolution[J].IEEE Transactions on Information Theory,2016,62(6):3685-3701.(“多重Toeplitz矩阵的范德蒙德分解及其在多维超分辨率中的应用[J].”,杨在,谢利华,P.Stoica,《IEEE信息理论学报》,2016,62(6):3685-3701)。The operation of this step is the same as the prior art ANM two-dimensional multi-snapshot gridless compression beamforming sound source identification method. References Z.Yang, L.Xie, P.Stoica.Vandermonde decomposition of multilevelToeplitz matrices with application to multidimensional super-resolution[J].IEEE Transactions on Information Theory,2016,62(6):3685-3701. (“Multiple Vandermonde decomposition of Toeplitz matrix and its application in multidimensional super-resolution[J].”, Yang Zai, Xie Lihua, P.Stoica, "IEEE Transactions on Information Theory", 2016,62(6):3685-3701).

获得声源个数

Figure BDA0001971923000000078
及声源
Figure BDA0001971923000000079
Get the number of sound sources
Figure BDA0001971923000000078
and sound source
Figure BDA0001971923000000079

步骤4、估计声源强度;Step 4, estimating the intensity of the sound source;

Figure BDA00019719230000000710
为根据估计的声源DOA计算的感知矩阵,
Figure BDA00019719230000000711
为量化的各快拍下各声源的强度构成的矩阵,根据式(2)得到声源强度的计算式:remember
Figure BDA00019719230000000710
is the perceptual matrix calculated from the estimated sound source DOA,
Figure BDA00019719230000000711
To quantify the matrix formed by the intensity of each sound source under each snapshot, the formula for calculating the intensity of the sound source is obtained according to formula (2):

Figure BDA00019719230000000712
Figure BDA00019719230000000712

式中,A为感知矩阵,上标“+”表示伪逆,

Figure BDA00019719230000000713
为使用本发明迭代求解所得声压。In the formula, A is the perceptual matrix, and the superscript "+" indicates the pseudo-inverse,
Figure BDA00019719230000000713
The resulting sound pressure is iteratively solved for using the present invention.

仿真模拟试验simulation test

为验证建立本发明的准确性和对比其性能的提高,进行声源识别仿真模拟。In order to verify and establish the accuracy of the present invention and compare its performance improvement, a sound source identification simulation is carried out.

仿真设置如下:在特定位置假设具有特定强度辐射特定频率声波的点声源(假设六个声源,仰角依次为60°、60°、50°、40°、30°、70°,方位角依次为180°、190°、90°、90°、200°、300°,强度(均方根:

Figure BDA0001971923000000081
)依次为100dB、97dB、97dB、94dB、94dB、90dB(参考2×10-5Pa))。The simulation settings are as follows: assume a point sound source with a specific intensity radiating sound waves of a specific frequency at a specific position (assuming six sound sources, the elevation angles are 60°, 60°, 50°, 40°, 30°, 70°, and the azimuth angles are sequentially 180°, 190°, 90°, 90°, 200°, 300°, intensity (RMS:
Figure BDA0001971923000000081
) are 100dB, 97dB, 97dB, 94dB, 94dB, 90dB in turn (refer to 2×10 -5 Pa)).

仿真具体流程为:The specific flow of the simulation is:

1、根据式(2)及式(3)计算各传声器接收声压信号(用A=B=8、Δx=Δy=0.035m的传声器阵列测量声信号,信噪比SNR取20dB,快拍总数设为10);1. Calculate the sound pressure signal received by each microphone according to formula (2) and formula (3) (measure the sound signal with a microphone array of A=B=8, Δx=Δy=0.035m, the signal-to-noise ratio SNR is 20dB, and the total number of snapshots set to 10);

2、根据步骤2的本方法发明迭代求解声压

Figure BDA0001971923000000082
2. According to the method invention of step 2, iteratively solve the sound pressure
Figure BDA0001971923000000082

3、根据步骤3估计声源DOA;3. Estimate the sound source DOA according to step 3;

4、根据步骤4估计声源强度。4. Estimate the sound source intensity according to step 4.

本方法发明与用ANM方法的结果进行对比,如图2所示,声源辐射声波的频率:图2(a)和图2(b)为2000Hz,图2(c)和图(d)为3000Hz,图2(e)和图2(f)为4000Hz;图2(g)和图2(h)为4900Hz;每幅图中,“*”指示重构声源分布,“○”指示真实声源分布,并为便于对比,各图中均参考最大输出值进行dB缩放,显示动态范围均为0~-20dB,同时,每幅图的上方还列出了以标准声压大小2.0×10-5Pa为参考的最大输出值。2000Hz、3000Hz、4000Hz、4900Hz下,声源间的最小分离

Figure BDA0001971923000000083
依次为0.025、0.038、0.050、0.062。This method invention compares with the result of ANM method, as shown in Figure 2, the frequency of sound source radiation sound wave: Fig. 2 (a) and Fig. 2 (b) are 2000Hz, and Fig. 2 (c) and Fig. (d) are 3000Hz, Figure 2(e) and Figure 2(f) are 4000Hz; Figure 2(g) and Figure 2(h) are 4900Hz; in each figure, "*" indicates the reconstructed sound source distribution, and "○" indicates the real The distribution of sound sources, and for the convenience of comparison, each figure is scaled in dB with reference to the maximum output value, and the displayed dynamic range is 0~-20dB. At the same time, the standard sound pressure of 2.0×10 -5 Pa is the maximum output value for reference. Minimum separation between sound sources at 2000Hz, 3000Hz, 4000Hz, 4900Hz
Figure BDA0001971923000000083
They are 0.025, 0.038, 0.050, 0.062 in turn.

由图2可以看出:使用ANM的二维多快拍无网格压缩波束形成(图2(a)、图2(c)、图2(e)、图2(g))仅准确重构了4900Hz时的声源分布,而使用本发明的二维多快拍无网格压缩波束形成(图2(b)、图2(d)、图2(f)、图2(h))在四个频率下的重构结果均准确,所以证明:本发明克服了ANM的缺陷,具有更高分辨率。It can be seen from Figure 2 that the two-dimensional multi-snapshot gridless compression beamforming using ANM (Figure 2(a), Figure 2(c), Figure 2(e), Figure 2(g)) only accurately reconstructs The sound source distribution at 4900Hz is obtained, and the two-dimensional multi-snapshot gridless compression beamforming (Fig. 2(b), Fig. 2(d), Fig. 2(f), Fig. 2(h)) of the present invention is The reconstruction results at the four frequencies are all accurate, so it is proved that the present invention overcomes the defects of ANM and has higher resolution.

定义标准Frobenius范数误差

Figure BDA0001971923000000084
平均Frobenius范数误差
Figure BDA0001971923000000085
和标准
Figure BDA00019719230000000810
范数误差
Figure BDA0001971923000000086
来衡量P的重构精度、声源DOA的估计精度和声源强度的量化精度,其中,
Figure BDA0001971923000000087
为估计的和真实的声源仰角构成的列向量,
Figure BDA0001971923000000088
为估计的和真实的声源方位角构成的列向量,
Figure BDA0001971923000000089
为量化的和真实的声源强度的均方根构成的列向量。误差值越小,精度越高,计算结果越准确Define the standard Frobenius norm error
Figure BDA0001971923000000084
Average Frobenius norm error
Figure BDA0001971923000000085
and standard
Figure BDA00019719230000000810
Norm error
Figure BDA0001971923000000086
To measure the reconstruction accuracy of P, the estimation accuracy of sound source DOA and the quantization accuracy of sound source intensity, where,
Figure BDA0001971923000000087
is a column vector of estimated and true source elevation angles,
Figure BDA0001971923000000088
is a column vector of estimated and true source azimuths,
Figure BDA0001971923000000089
Column vector of quantized and true source intensities rms. The smaller the error value, the higher the precision and the more accurate the calculation result

由图3可以看出:与使用ANM的二维多快拍无网格压缩波束形成相比,本发明具有更强的去噪声能力和更好的分辨率,能更准确地重构声源在阵列传声器处产生的声压、更准确地估计彼此间分离较小的声源的DOA和量化其强度。It can be seen from Fig. 3 that compared with the two-dimensional multi-snapshot gridless compression beamforming using ANM, the present invention has stronger denoising ability and better resolution, and can more accurately reconstruct the sound source in Sound pressure generated at array microphones, more accurately estimating the DOA and quantifying the intensity of sound sources that are less separated from each other.

试验验证Test verification

为检验仿真结论的正确性和使用本发明的二维多快拍无网格压缩波束形成在实际应用中的有效性和优越性,进行验证试验。In order to check the correctness of the simulation conclusion and the effectiveness and superiority of using the two-dimensional multi-snapshot gridless compression beamforming of the present invention in practical applications, a verification test is carried out.

图4为试验布局图,测量阵列为

Figure BDA0001971923000000091
公司的64个4958型传声器均匀分布的矩形阵列(A=B=8,Δx=Δy=0.035m),声源为稳态白噪声信号激励的3个扬声器,从左至右,笛卡尔坐标依次为(2.24,0,5)m、(-1.24,0,5)m、(-2.24,0,5)m,仰角依次为24.13°、13.93°、24.13°,方位角依次为0°、180°、180°,由于试验在半消声室内进行,地面存在反射,扬声器关于地面还存在3个镜像声源,笛卡尔坐标依次为(2.24,-2.2,5)m、(-1.24,-2.2,5)m、(-2.24,-2.2,5)m,仰角依次为32.13°、26.80°、32.13°,方位角依次为315.52°、240.59°、224.48°。各传声器测得的声压经PULSE 3560D型数据采集系统同时采集并传输到PULSELABSHOP中进行快速傅里叶变换,得声压频谱,采样频率为16384Hz,每个快拍长1s、包含214个采样点,10个快拍被采用。进一步,使用ANM(基于矩阵束与配对)和本发明后处理声压频谱来重构声源分布,其余设置均与仿真一致。Figure 4 is the test layout diagram, the measurement array is
Figure BDA0001971923000000091
The company's 64 4958-type microphones are uniformly distributed rectangular arrays (A=B=8, Δx=Δy=0.035m), and the sound source is 3 loudspeakers excited by steady-state white noise signals. From left to right, the Cartesian coordinates are sequential (2.24,0,5)m, (-1.24,0,5)m, (-2.24,0,5)m, the elevation angles are 24.13°, 13.93°, 24.13°, the azimuth angles are 0°, 180 °, 180°, because the test is carried out in a semi-anechoic chamber, there are reflections on the ground, and there are still three image sound sources of the speaker on the ground, and the Cartesian coordinates are (2.24,-2.2,5)m, (-1.24,-2.2 ,5)m, (-2.24,-2.2,5)m, the elevation angles are 32.13°, 26.80°, 32.13°, and the azimuth angles are 315.52°, 240.59°, 224.48°. The sound pressure measured by each microphone is simultaneously collected by the PULSE 3560D data acquisition system and transmitted to PULSELABSHOP for fast Fourier transformation to obtain the sound pressure spectrum. The sampling frequency is 16384Hz. Each snapshot is 1s long and contains 214 samples. point, 10 snapshots are taken. Further, ANM (based on matrix beam and pairing) and the post-processing sound pressure spectrum of the present invention are used to reconstruct the sound source distribution, and the rest of the settings are consistent with the simulation.

图5给出了声源频率为2000Hz、3000Hz、4000Hz和4900Hz的结果,每幅图中,“*”指示重构声源分布,参考最大输出值进行dB缩放,“○”指示真实声源DOA,不包含强度信息。四个频率下,声源间的最小分离依次为0.032、0.048、0.065、0.079。Figure 5 shows the results of sound source frequencies of 2000Hz, 3000Hz, 4000Hz and 4900Hz. In each figure, "*" indicates the reconstructed sound source distribution, with reference to the maximum output value for dB scaling, and "○" indicates the real sound source DOA , without intensity information. Under the four frequencies, the minimum separation between sound sources is 0.032, 0.048, 0.065, 0.079 in turn.

由图5可以看出:使用ANM,在2000Hz(图5(a))和3000Hz(图5(c))重构的声源分布严重偏离真实声源分布,仅在4000Hz(图5(e))和4900Hz(图5(g))准确估计出了六个主要声源的DOA。使用本发明,图5(b)、5(d)、5(f)、5(h)在四个频率下的声源DOA估计结果均准确。It can be seen from Fig. 5 that using ANM, the reconstructed sound source distribution at 2000Hz (Fig. 5(a)) and 3000Hz (Fig. 5(c)) seriously deviates from the real sound source distribution, only at 4000Hz (Fig. 5(e) ) and 4900Hz (Fig. 5(g)) accurately estimated the DOA of the six main sources. Using the present invention, the sound source DOA estimation results at the four frequencies in Figs. 5(b), 5(d), 5(f) and 5(h) are all accurate.

试验结论与仿真结论一致,证明仿真结论正确,使用本发明的二维多快拍无网格压缩波束形成在实际应用中更有效。The test conclusion is consistent with the simulation conclusion, which proves that the simulation conclusion is correct, and the two-dimensional multi-snapshot gridless compression beamforming of the present invention is more effective in practical application.

Claims (2)

1.一种二维多快拍无网格压缩波束形成声源识别方法,其特征是,包括以下步骤:1. a two-dimensional multi-snapshot gridless compression beamforming sound source identification method, is characterized in that, comprises the following steps: 步骤1、获取测量声压矩阵PStep 1. Obtain the measured sound pressure matrix P ; 测量声压矩阵P∈CAB×L为:The measured sound pressure matrix P ∈ C AB × L is: P=P+NP =P+N 测量声压矩阵P通过传声器阵列测量得到,N∈CAB×L为噪声干扰,C为复数集,A为矩形传声器阵列的行数,B为矩形传声器阵列的列数,L为快拍次数;Measured sound pressure matrix P Obtained by microphone array measurement, N∈C AB×L is noise interference, C is complex number set, A is the number of rows of rectangular microphone array, B is the number of columns of rectangular microphone array, L is the number of snapshots ; 步骤2、重构声源在阵列传声器处产生的声压P;Step 2, reconstructing the sound pressure P generated by the sound source at the array microphone; 步骤1)、建立重构P的数学模型Step 1), establish the mathematical model of reconstruction P
Figure FDA0001971922990000011
Figure FDA0001971922990000011
式中,
Figure FDA0001971922990000012
是该式计算结果,||·||F表示Frobenius范数,P为待求声压矩阵,定义Tb(·)为二重Toeplitz算子,对任意给定向量u映射为A×A维的块Toeplitz型Hermitian矩阵:
In the formula,
Figure FDA0001971922990000012
is the calculation result of this formula, ||·|| F represents the Frobenius norm, P is the sound pressure matrix to be sought, define T b (·) as a double Toeplitz operator, and map any given vector u into A×A dimension The block Toeplitz-type Hermitian matrix:
Figure FDA0001971922990000013
Figure FDA0001971922990000013
Tb(u)中,每个块Ta,0≤a≤A-1,都是一个B×B维的Toeplitz矩阵:In T b (u), each block T a , 0≤a≤A-1, is a B×B-dimensional Toeplitz matrix:
Figure FDA0001971922990000014
Figure FDA0001971922990000014
Figure DEST_PATH_IMAGE002
都是辅助量,κ>0为规则化参数,I∈ΡAB×AB为单位矩阵,“tr(·)”表示求矩阵的迹,上标“H”表示共轭转置,
Figure FDA0001971922990000016
表示半正定;
Figure DEST_PATH_IMAGE002
Both are auxiliary quantities, κ>0 is the regularization parameter, I∈ΡAB ×AB is the unit matrix, "tr( )" means the trace of the matrix, and the superscript " H " means the conjugate transpose,
Figure FDA0001971922990000016
Indicates positive semi-definite;
步骤2)、求解PStep 2), solve for P 初始化
Figure FDA0001971922990000021
κ0=1,则W0=I,基于第l次迭代的结果
Figure FDA0001971922990000022
κl,Wl,第l+1次迭代的步骤为:
initialization
Figure FDA0001971922990000021
κ 0 =1, then W 0 =I, based on the result of the lth iteration
Figure FDA0001971922990000022
κ l , W l , the steps of the l+1th iteration are:
1.使用MATLAB的CVX工具箱中的SDPT3求解器求解:1. Use the SDPT3 solver in the CVX toolbox of MATLAB to solve:
Figure FDA0001971922990000023
Figure FDA0001971922990000023
2.确定规则化参数κl+1
Figure FDA0001971922990000024
Figure FDA0001971922990000025
Figure FDA0001971922990000026
的最大特征值;
2. Determine the regularization parameter κ l+1 :
Figure FDA0001971922990000024
Figure FDA0001971922990000025
for
Figure FDA0001971922990000026
The largest eigenvalue of ;
3.第l+1次迭代确定的权矩阵Wl+1
Figure FDA0001971922990000027
,当
Figure FDA0001971922990000028
小于等于10-3或者最大迭代次数被完成时,迭代终止;
3. The weight matrix W l+1 determined in the l+1 iteration:
Figure FDA0001971922990000027
,when
Figure FDA0001971922990000028
When less than or equal to 10 -3 or the maximum number of iterations is completed, the iteration terminates;
步骤3、估计声源DOA;Step 3, estimating the sound source DOA; 步骤4、估计声源强度。Step 4. Estimate the intensity of the sound source.
2.根据权利要求1所述的一种二维多快拍无网格压缩波束形成声源识别方法,其特征是,所述的最大迭代次数为20。2. A method for identifying sound sources with two-dimensional multi-snapshot gridless compression beamforming according to claim 1, characterized in that the maximum number of iterations is 20.
CN201910121202.4A 2019-02-19 2019-02-19 Two-dimensional multi-snapshot meshless compressed beam forming sound source identification method Active CN109870669B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910121202.4A CN109870669B (en) 2019-02-19 2019-02-19 Two-dimensional multi-snapshot meshless compressed beam forming sound source identification method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910121202.4A CN109870669B (en) 2019-02-19 2019-02-19 Two-dimensional multi-snapshot meshless compressed beam forming sound source identification method

Publications (2)

Publication Number Publication Date
CN109870669A CN109870669A (en) 2019-06-11
CN109870669B true CN109870669B (en) 2022-11-15

Family

ID=66918811

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910121202.4A Active CN109870669B (en) 2019-02-19 2019-02-19 Two-dimensional multi-snapshot meshless compressed beam forming sound source identification method

Country Status (1)

Country Link
CN (1) CN109870669B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111965599B (en) * 2020-07-03 2024-07-19 重庆大学 Sound source identification method for two-dimensional dynamic grid compression beam forming
CN112710990B (en) * 2020-12-10 2023-07-11 重庆大学 A two-dimensional gridless compressed beamforming method for arbitrary planar array forms
CN113687306B (en) * 2021-05-26 2023-07-21 重庆大学 Multi-frequency synchronous two-dimensional off-grid compressed beamforming sound source identification method
CN113567913B (en) * 2021-06-21 2023-07-21 电子科技大学 Two-dimensional planar DOA estimation method based on iterative reweighting for dimensionality reduction

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4379916B2 (en) * 2004-04-14 2009-12-09 横河電機株式会社 Waveform display device
CN101487888B (en) * 2009-02-23 2011-01-26 重庆大学 A Spatial Spectrum Peak Search Method
US8861311B2 (en) * 2010-04-23 2014-10-14 Vanderbilt University System and method for estimating projectile trajectory and source location
CN106168942B (en) * 2016-07-12 2018-08-21 河海大学 A kind of fluctuation types dynamic data reconstructing method based on singular boundary method
CN107422295B (en) * 2017-08-30 2019-09-13 浙江大学 Direction of Arrival Estimation Method Based on Coprime Array Virtual Domain Equivalent Signal Atom Norm Representation
CN107817465B (en) * 2017-10-12 2019-11-15 中国人民解放军陆军工程大学 DOA Estimation Method Based on Meshless Compressive Sensing under Super Gaussian Noise Background
CN108051076A (en) * 2017-11-28 2018-05-18 南昌工程学院 A kind of enclosure space panel-acoustic contribution degree recognition methods
CN108445462B (en) * 2018-02-05 2019-10-01 江苏大学 A kind of DOD and DOA estimation method of the bistatic MIMO radar based on management loading
CN109343003B (en) * 2018-11-29 2022-11-11 重庆大学 Method for identifying sound source formed by fast iterative shrinking wave beams

Also Published As

Publication number Publication date
CN109870669A (en) 2019-06-11

Similar Documents

Publication Publication Date Title
CN109870669B (en) Two-dimensional multi-snapshot meshless compressed beam forming sound source identification method
CN107817465B (en) DOA Estimation Method Based on Meshless Compressive Sensing under Super Gaussian Noise Background
CN103353595B (en) Meter Wave Radar Altimetry Method Based on Array Interpolation Compressed Sensing
CN103954950B (en) A kind of Wave arrival direction estimating method openness based on sample covariance matrix
CN107450047B (en) Compressed Sensing DOA Estimation Method Based on Unknown Mutual Coupling Information in Nested Arrays
CN108872926B (en) An Amplitude and Phase Error Correction and DOA Estimation Method Based on Convex Optimization
CN106125041B (en) The wideband source localization method of sparse recovery is weighted based on subspace
CN106646376A (en) P-norm noise source positioning identification method based on weight correction parameter
CN108181557B (en) A Method for Determining the Orientation of UHF Partial Discharge Signals
CN103353596A (en) Wave beam space domain meter wave radar height measurement method based on compressed sensing
CN110837076A (en) Tensor decomposition-based vector hydrophone array orientation estimation method
CN112379327A (en) Two-dimensional DOA estimation and cross coupling correction method based on rank loss estimation
CN108761394A (en) A kind of high-resolution low sidelobe based on space-time processing deconvolutes Power estimation method
CN114814830A (en) Meter-wave radar low elevation height measurement method based on robust principal component analysis noise reduction
CN113050075A (en) Underwater sound source matching field positioning method based on diffusion mapping
CN112285647A (en) Signal orientation high-resolution estimation method based on sparse representation and reconstruction
CN111812581B (en) Spherical array sound source direction-of-arrival estimation method based on atomic norms
CN108761380B (en) A Target Direction of Arrival Estimation Method for Improving Accuracy
CN114779236A (en) Improved meter-wave radar low-elevation height measurement method based on spatial smoothing MUSIC
CN113721184A (en) Near-field signal source positioning method based on improved MUSIC algorithm
CN109298382A (en) A Method for Estimating Direction of Arrival Angle of Non-Uniform Linear Array Based on Expectation Maximum Algorithm
CN109932682B (en) Two-dimensional multi-snapshot non-grid compressed beam forming sound source identification method
CN108594165B (en) A Method for Estimating Direction of Arrival for Narrowband Signals Based on Expectation-Maximization Algorithm
CN110031796B (en) Three-dimensional multi-snapshot non-grid compressed beam forming sound source identification method
Yang et al. Two‐Dimensional Multiple‐Snapshot Grid‐Free Compressive Beamforming Using Alternating Direction Method of Multipliers

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address

Address after: 1000 Chongqing City No. 401120 Taoyuan Avenue Yubei District Airport

Patentee after: Chongqing Industrial Vocational and Technical University

Country or region after: China

Patentee after: Chongqing University

Address before: 1000 Chongqing City No. 401120 Taoyuan Avenue Yubei District Airport

Patentee before: CHONGQING INDUSTRY POLYTECHNIC College

Country or region before: China

Patentee before: Chongqing University

CP03 Change of name, title or address