KR102012522B1

KR102012522B1 - Apparatus for processing directional sound

Info

Publication number: KR102012522B1
Application number: KR1020130037586A
Authority: KR
Inventors: 고한석; 윤종성; 금민석; 박진수; 문성규
Original assignee: 고려대학교 산학협력단
Priority date: 2013-04-05
Filing date: 2013-04-05
Publication date: 2019-08-20
Anticipated expiration: 2033-04-05
Also published as: KR20140121168A

Abstract

본 발명은 방향성 음향 처리 장치를 개시한다. 본 발명에 따르면, 주변 음향을 수신하는 복수의 마이크; 상기 복수의 마이크에서 입력된 음향 신호의 각 주파수 대역별 코히어런스를 계산하는 코히어런스 계산부; 상기 복수의 마이크 각각에서 입력된 음향 신호를 통해 음성 신호의 존재 확률을 추정하는 추정부; 상기 추정된 음성 신호 존재 확률에 따라 각 주파수 대역별로 서로 다른 가중치를 부여하며, 상기 부여된 가중치를 상기 코히어런스에 반영하여 결과값을 출력하는 가중치 계산부; 및 상기 결과값과 미리 설정된 임계치를 비교하여 상기 복수의 마이크로 입력된 음향 신호의 처리 여부를 결정하는 제어부를 포함하는 방향성 음향 신호 처리 장치가 제공된다. The present invention discloses a directional sound processing apparatus. According to the present invention, a plurality of microphones for receiving ambient sound; A coherence calculator for calculating coherence for each frequency band of the sound signals input from the plurality of microphones; An estimator estimating the existence probability of the speech signal based on the acoustic signal input from each of the plurality of microphones; A weight calculator configured to assign different weights to respective frequency bands according to the estimated probability of existence of the speech signal, and output a result value by reflecting the weighted weights to the coherence; And a controller which compares the result value with a preset threshold and determines whether to process the plurality of micro-input sound signals.

Description

Directional acoustic signal processing device {APPARATUS FOR PROCESSING DIRECTIONAL SOUND}

본 발명은 방향성 음향 신호 처리 장치에 관한 것으로서, 더욱 상세하게는 소음 환경에서 사용자가 원하는 방향의 소리만을 선택적으로 획득하여 처리할 수 있도록 하는 장치에 관한 것이다. The present invention relates to an apparatus for processing directional acoustic signals, and more particularly, to an apparatus for selectively acquiring and processing a sound in a desired direction in a noise environment.

마이크는 음파 또는 초음파를 받아서 그 진동에 따른 전기신호를 발생하는 장치를 의미한다. The microphone refers to a device that receives sound waves or ultrasonic waves and generates electric signals according to the vibrations.

최근 로봇 관련 기술의 발전에 따라, 로봇과 사용자의 의사 소통을 위한 인터페이스로 마이크가 사용되고 있으며, 로봇은 마이크을 통해 입력되는 음향 신호를 전기신호로 변환하여 사용자의 음향을 인식하게 된다. Recently, with the development of robot-related technology, a microphone is used as an interface for communication between a robot and a user, and the robot recognizes a user's sound by converting a sound signal input through the microphone into an electrical signal.

이러한 인식 과정에는 마이크의 주변 환경에서 발생되는 소음(잡음)이 항상 수반되며, 보다 다양한 명령을 정확하게 인식하여 사용자의 편의성을 향상시키기 위해서는 소음을 제외한 음향 신호만을 수신하는 것이 필수적이다. This recognition process is always accompanied by noise (noise) generated in the surrounding environment of the microphone, and in order to recognize more various commands accurately and improve user convenience, it is essential to receive only sound signals except noise.

또한, 이동 통신 단말기 및 네비게이션 장치 등 많은 수의 장치들이 사용자의 음성 인식 기능을 구비하고 있으며, 이러한 장치에서도 정확한 음성 인식에 기반한 동작을 위해서는 원하는 방향에서의 음성 신호만을 수신 처리하는 것이 필수적이다. In addition, a large number of devices, such as a mobile communication terminal and a navigation device, have a user's voice recognition function, and in such a device, it is essential to process only a voice signal in a desired direction for an operation based on accurate voice recognition.

종래의 다수의 음향기기 업체에서 능동형 소음제거(Active Noise Cancelling) 기술을 적용하여 주변의 소음을 차단하는 장치를 제안하고 있다. A number of conventional sound equipment companies have proposed an apparatus for blocking ambient noise by applying active noise canceling technology.

그러나 종래의 장치는 주변의 모든 소음을 제거하는 정숙한 환경을 조성하기 위한 목적으로 사용된다. Conventional devices, however, are used for the purpose of creating a quiet environment that eliminates all ambient noise.

또한 멀티 채널 마이크를 사용하여 특정 방향에서 들어오는 소리만을 획득하는 종래의 기술이 헤드폰 및 보청기에 적용된 사례가 있으나, 사람의 시선이 향하는 방향의 소리만을 받아들이기 때문에 활용성에 제약이 있다. In addition, although the conventional technology of obtaining only a sound coming from a specific direction by using a multi-channel microphone has been applied to headphones and hearing aids, there is a limitation in utilization because it only accepts sound in a direction in which a person's eyes are directed.

보다 상세하게, 종래의 잡음 제거 방식으로는 GSC(Generalized Sidelobe Canceller)를 이용한 방법과 두 개의 마이크으로 입력되는 신호의 위상 차를 이용하는 방법이 있다. More specifically, conventional noise canceling methods include a method using a generalized sidelobe canceller (GSC) and a method using a phase difference between signals input to two microphones.

전자의 방법은 GSC 내부의 적응 필터 수렴 문제로 인해 출력이 발산하거나 수렴 속도가 느려지는 문제점을 갖는다. The former method has a problem of output divergence or slowing down of convergence due to adaptive filter convergence problem in the GSC.

그리고, 후자의 방법은 적응 필터를 사용하지 않아 수렴 문제는 갖지 않으나, 두 개의 마이크간의 위상 정보만을 사용하는 한계로 인하여 위상의 작은 변화에도 시스템이 민감하게 반응하여 일시적인 오류가 자주 발생하는 문제점을 갖는다. In addition, the latter method does not have a convergence problem because it does not use an adaptive filter. However, due to the limitation of using only phase information between two microphones, the system reacts sensitively to small changes in phase so that a temporary error occurs frequently. .

대한민국 공개특허 제2008-0000478호(발명명칭: 휴대 단말기에서 복수의 마이크들로 입력된 신호들의 잡음을 제거하는 방법 및 장치)는 휴대 단말기로 입력되는 위상 차가 있는 신호로부터 잡음을 제거하는 방법 및 장치를 개시하고 있다. Korean Patent Laid-Open No. 2008-0000478 (Invention name: Method and apparatus for removing noise of signals input by a plurality of microphones in a portable terminal) is a method and apparatus for removing noise from a phase difference signal input to the portable terminal It is starting.

그러나, 앞서 설명한 바와 같이, 상기의 종래특허는 방향성 신호를 제거하기 위해 위상 차만을 이용함으로써 시스템의 안정성이 낮아 음성 신호에서 잡음을 효과적으로 제거할 수 없게 된다. However, as described above, the conventional patent uses only the phase difference to remove the directional signal, so that the stability of the system is low, so that the noise cannot be effectively removed from the voice signal.

상기한 종래기술의 문제점을 해결하기 위해, 본 발명은 사용자가 원하는 특정 방향의 음향만을 획득할 수 있는 방향성 음향 신호 처리 장치를 제안하고자 한다. In order to solve the above problems of the prior art, the present invention is to propose a directional acoustic signal processing apparatus that can obtain only the sound of a specific direction desired by the user.

상기한 기술적 과제를 해결하기 위해, 본 발명의 바람직한 일 실시예에 따르면, 방향성 음향 신호 처리 장치로서, 주변 음향을 수신하는 복수의 마이크; 상기 복수의 마이크에서 입력된 음향 신호의 각 주파수 대역별 코히어런스를 계산하는 코히어런스 계산부; 상기 복수의 마이크 각각에서 입력된 음향 신호를 통해 음성 신호의 존재 확률을 추정하는 추정부; 상기 추정된 음성 신호 존재 확률에 따라 각 주파수 대역별로 서로 다른 가중치를 부여하며, 상기 부여된 가중치를 상기 코히어런스에 반영하여 결과값을 출력하는 가중치 계산부; 및 상기 결과값과 미리 설정된 임계치를 비교하여 상기 복수의 마이크로 입력된 음향 신호의 처리 여부를 결정하는 제어부를 포함하는 방향성 음향 신호 처리 장치가 제공된다. In order to solve the above technical problem, according to an embodiment of the present invention, a directional acoustic signal processing device, a plurality of microphones for receiving the ambient sound; A coherence calculator for calculating coherence for each frequency band of the sound signals input from the plurality of microphones; An estimator estimating the existence probability of the speech signal based on the acoustic signal input from each of the plurality of microphones; A weight calculator configured to assign different weights to respective frequency bands according to the estimated probability of existence of the speech signal, and output a result value by reflecting the weighted weights to the coherence; And a controller which compares the result value with a preset threshold and determines whether to process the plurality of micro-input sound signals.

사용자가 청취를 원하는 각도를 입력 받는 인터페이스부; 및 상기 인터페이스부를 통해 입력된 청취 각도에 따라 상기 복수의 마이크에서 입력된 음향 신호의 위상을 변환하는 위상 변환부를 더 포함할 수 있다. An interface unit for receiving an angle at which the user wants to listen; And a phase converting unit converting phases of sound signals input from the plurality of microphones according to a listening angle input through the interface unit.

상기 추정부는, 상기 복수의 마이크 각각에서 입력된 음향 신호를 통해 배경 잡음을 추정하는 배경 잡음 추정부; 및 상기 배경 잡음에 기초하여 상기 각 주파수 대역별 음성 신호 존재 확률을 추정하는 신호 존재 확률 추정부를 포함할 수 있다. The estimator may include a background noise estimator for estimating background noise through an acoustic signal input from each of the plurality of microphones; And a signal existence probability estimator estimating a speech signal existence probability of each frequency band based on the background noise.

상기 가중치 계산부는, 상기 신호 존재 확률이 미리 설정된 값 이상인 경우에는 가중치를 1로 부여하고, 미리 설정된 값 미만인 경우에는 상기 신호 존재 확률을 가중치로 부여할 수 있다. The weight calculator may assign a weight to 1 when the signal existence probability is equal to or greater than a preset value, and may assign the signal existence probability as a weight when less than the preset value.

상기 결과값은 각 주파수 대역별 코히어런스에 가중치를 곱한 값을 모두 합산한 값일 수 있다. The result value may be a value obtained by adding up a value obtained by multiplying the coherence for each frequency band by a weight.

상기 제어부의 판단에 따라 상기 복수의 마이크로 입력된 신호를 스피커, 음성 인식부 및 전송부 중 적어도 하나로 출력되도록 선택하는 음향 출력 선택부를 더 포함할 수 있다. The apparatus may further include a sound output selection unit configured to select the plurality of micro-input signals to be output to at least one of a speaker, a voice recognition unit, and a transmission unit according to the determination of the controller.

본 발명에 따르면, 배경 잡음을 추정하고, 이를 통해 신호 존재 확률을 추정하여 음향 신호의 처리 여부를 결정하기 때문에 소음을 제외한 실제 사용자의 음성만을 선택적으로 처리할 수 있는 장점이 있다. According to the present invention, since the background noise is estimated and the signal existence probability is determined to determine whether to process the acoustic signal, only the voice of the actual user except the noise can be selectively processed.

도 1은 본 발명의 바람직한 일 실시예에 따른 방향성 음향 처리 장치의 구성을 도시한 도면.
도 2는 본 발명에 따른 공간 음향 활성 감지부의 상세 구성을 도시한 도면.
도 3은 본 발명에 따른 음향 출력 선택부의 온/오프 과정을 도시한 도면.
도 4는 본 발명에 따른 음향 감쇠 및 처리 상태를 도시한 도면. 1 is a diagram showing the configuration of a directional sound processing apparatus according to an embodiment of the present invention.
2 is a diagram showing a detailed configuration of the spatial acoustic activity detection unit according to the present invention.
3 is a diagram illustrating an on / off process of the sound output selection unit according to the present invention;
4 illustrates acoustic attenuation and processing conditions in accordance with the present invention.

본 발명은 다양한 변경을 가할 수 있고 여러 가지 실시예를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 실시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 각 도면을 설명하면서 유사한 참조부호를 유사한 구성요소에 대해 사용하였다. As the present invention allows for various changes and numerous embodiments, particular embodiments will be illustrated in the drawings and described in detail in the written description. However, this is not intended to limit the present invention to specific embodiments, it should be understood to include all modifications, equivalents, and substitutes included in the spirit and scope of the present invention. In describing the drawings, similar reference numerals are used for similar elements.

이하, 본 발명의 바람직한 실시예를 첨부한 도면들을 참조하여 상세히 설명하기로 한다. 본 발명을 설명함에 있어 전체적인 이해를 용이하게 하기 위하여 도면 번호에 상관없이 동일한 수단에 대해서는 동일한 참조 번호를 사용하기로 한다.
Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings. In the following description of the present invention, the same reference numerals will be used for the same means regardless of the reference numerals in order to facilitate the overall understanding.

도 1은 본 발명의 바람직한 일 실시예에 따른 방향성 음향 처리 장치의 블록도이다. 1 is a block diagram of a directional sound processing apparatus according to a preferred embodiment of the present invention.

도 1에 도시된 바와 같이, 본 발명에 따른 방향성 음향 처리 장치는 복수의 마이크(100-1 내지 100-n), 공간 음향 활성 감지부(Spacial Voice Activity Detection, 102), 인터페이스부(104) 및 음향 출력 선택부(106)를 포함할 수 있다. As shown in FIG. 1, the directional sound processing apparatus according to the present invention includes a plurality of microphones 100-1 to 100-n, a spatial voice activity detection unit 102, an interface unit 104, and the like. Sound output selection unit 106 may be included.

도 1에 도시된 바와 같이, 본 발명에 따른 방향성 음향 처리 장치에는 정면 소리 및 측면 소리와 같이 서로 다른 방향으로부터의 음향이 복수의 마이크(100)를 통해 입력된다. As shown in FIG. 1, in the directional sound processing apparatus according to the present invention, sounds from different directions, such as front and side sounds, are input through a plurality of microphones 100.

공간 음향 활성 감지부(102)는 미리 설정된 알고리즘에 따라 마이크(100)를 통해 입력된 음향을 스피커로 출력할지 또는 상대방 단말로 전송할지 여부를 결정한다. The spatial acoustic activity detecting unit 102 determines whether to output the sound input through the microphone 100 to the speaker or transmit the signal to the counterpart terminal according to a preset algorithm.

바람직하게, 본 발명에 따른 공간 음향 활성 감지부(102)는 배경 잡음을 추정하고, 추정된 배경 잡음에 따라 음성 신호 존재 확률을 추정하며, 추정된 신호 존재 확률에 기초하여 최종적으로 마이크를 통해 입력된 신호를 처리할지 여부를 결정한다. Preferably, the spatial acoustic activity detection unit 102 according to the present invention estimates the background noise, estimates the voice signal presence probability according to the estimated background noise, and finally inputs the microphone through the microphone based on the estimated signal presence probability. Determine whether to process the generated signal.

이때, 공간 음향 활성 감지부(102)는 미리 설정된 시간 간격(예를 들어, 20ms 또는 30ms 간격)으로 입력된 음향의 신호 처리 여부를 결정한다. At this time, the spatial acoustic activity detection unit 102 determines whether to process the signal of the input sound at a predetermined time interval (for example, 20 ms or 30 ms interval).

여기서, 마이크를 통해 입력된 신호의 처리는 입력된 음향 신호를 스피커를 통해 출력할지 여부, 장치가 음성 인식 장치일 경우, 음성 인식 처리부(미도시)로 출력할지 여부, 장치가 통화 장치인 경우, 상대방 장치에 전달할지 여부를 결정하는 것을 의미한다. Herein, the processing of the signal input through the microphone may include outputting the input sound signal through a speaker, whether the device is a voice recognition device, whether to output the voice recognition processing unit (not shown), or when the device is a call device. Determining whether to deliver to the counterpart device.

바람직하게, 음향 출력 선택부(106)는 스피커, 음성 인식 처리부 또는 전송부와 연결되며, 공간 음향 활성 감지부(102)의 판단에 따라 음향의 출력 여부가 결정된다. Preferably, the sound output selection unit 106 is connected to a speaker, a voice recognition processing unit, or a transmission unit, and it is determined whether to output sound according to the determination of the spatial sound activity detection unit 102.

도 2는 본 발명에 따른 공간 음향 활성 감지부의 상세한 구성을 도시한 도면이다. 2 is a view showing a detailed configuration of the spatial acoustic activity detection unit according to the present invention.

도 2에 도시된 바와 같이, 본 발명에 따른 공간 음향 활성 감지부(102)는 FFT부(200), 위상 변환부(202), 코히어런스 계산부(204), 추정부(206), 가중치 계산부(208) 및 제어부(210)를 포함할 수 있다. As shown in FIG. 2, the spatial acoustic activity detection unit 102 according to the present invention includes an FFT unit 200, a phase shifter 202, a coherence calculator 204, an estimator 206, and a weight. The calculator 208 and the controller 210 may be included.

도 2에서는 설명의 편의를 위해 제1 마이크(100-1) 내지 제2 마이크(100-2)가 구비되는 것으로 설명하나, 반드시 이에 한정되지 않는다는 점을 당업자는 이해하여야 할 것이다. In FIG. 2, the first microphone 100-1 to the second microphone 100-2 are provided for convenience of description, but it should be understood by those skilled in the art that the present invention is not limited thereto.

제1 마이크(100-1)를 통해 입력되는 음향 신호를

, 제2 마이크(100-2)를 통해 입력되는 음향 신호를

라 가정한다. The sound signal input through the first microphone 100-1

, The sound signal input through the second microphone (100-2)

Assume

여기서, t는 시간(프레임)이며, w는 주파수 대역 인덱스를 의미한다. Here, t is time (frame) and w means frequency band index.

FFT부(200)는 제1 및 제2 마이크(100-1 내지 100-2)를 통해 입력된 음향 신호를 주파수 도메인으로 변환한다. 바람직하게, FFT부(200)는 Short Time Fast Fourier Transform(STFFT)를 수행하여 마이크를 통해 입력된 음향 신호를 주파수 도메인으로 변환한다. The FFT unit 200 converts the sound signals input through the first and second microphones 100-1 to 100-2 into the frequency domain. Preferably, the FFT unit 200 performs a Short Time Fast Fourier Transform (STFFT) to convert the sound signal input through the microphone into the frequency domain.

위상 변환부(202)는 인터페이스부(104)를 통해 사용자가 미리 설정한 청취 각도에 따라 제1 마이크 및 제2 마이크의 주파수별 위상(phase) 값을 변환한다. The phase converting unit 202 converts phase values for frequencies of the first microphone and the second microphone according to a listening angle preset by the user through the interface unit 104.

여기서, 청취 각도는 사용자가 청취를 원하는 각도로 정의되며, 예를 들어, 마이크(100-1,100-2)를 기준으로 정면에서 입력되는 음향의 각도를 0°라 가정할 때, 일반적으로 청취 각도는 0°로 설정될 수 있다. 그러나 경우에 따라 사용자가 측면에서 입력되는 음향을 청취하기를 원하는 경우 청취 각도는 예를 들어, 90°로 설정될 수 있으며, 위상 변환부(202)는 90°각도에서 입력되는 음향의 위상을 변환하여 0°, 즉 정면에서 입력되는 것과 마찬가지로 위상 값을 변환할 수 있다. Here, the listening angle is defined as the angle that the user wants to hear. For example, assuming that the angle of the sound input from the front with respect to the microphones 100-1 and 100-2 is 0 °, the listening angle is generally Can be set to 0 °. However, in some cases, when the user wants to hear the sound input from the side, the listening angle may be set to, for example, 90 °, and the phase shifter 202 converts the phase of the sound input at the 90 ° angle. 0 °, i.e., the phase value can be converted as if input from the front.

위상 변환 후 코히어런스 계산부(204)는 아래의 수학식 1을 통해 주파수 대역별 코히어런스를 계산한다. After the phase shift, the coherence calculator 204 calculates coherence for each frequency band through Equation 1 below.

여기서,

는 주파수 대역별 코히어런스,

는 제1 마이크(100-1) 및 제2 마이크(100-2) 입력 X,Y의 상호상관(cross correlation) 값이며,

와

는 자기상관(auto correlation) 값이다. here,

Is frequency coherence,

Is a cross correlation value of the first microphone 100-1 and the second microphone 100-2 inputs X and Y,

Wow

Is an auto correlation value.

본 발명의 바람직한 일 실시예에 따르면, 추정부(206)는 복수의 마이크 각각에서 입력된 신호를 통해 음성 신호 존재 확률을 추정한다. According to an exemplary embodiment of the present invention, the estimator 206 estimates a voice signal presence probability based on signals input from each of the plurality of microphones.

보다 상세하게 추정부(206)는 배경 잡음 추정부(220) 및 신호 존재 확률 추정부(222)를 포함할 수 있다. In more detail, the estimator 206 may include a background noise estimator 220 and a signal existence probability estimator 222.

배경 잡음 추정부(220)에서는 MCRA(Minima Controlled Recursive Averaging) 방법 등을 이용하여 마이크 입력 신호로부터 배경 잡음 성분

을 추정한다.The background noise estimator 220 uses the Minima Controlled Recursive Averaging (MCRA) method to generate the background noise from the microphone input signal.

Estimate

신호 존재 확률 추정부(222)는 추정된 배경 잡음 성분에 기초하여 각 주파수 대역의 음성 존재 확률인 p(t,w)를 아래의 수학식 2를 통해 계산한다. The signal existence probability estimator 222 calculates p (t, w), which is a speech existence probability of each frequency band, based on the estimated background noise component through Equation 2 below.

여기서,

here,

: Prior Speech Absent Probability,

Prior Speech Absent Probability,

,이다.

,to be.

상기한 수학식 2를 통해 계산된 p(t,w)를 바탕으로 본 발명에 따른 가중치 계산부(208)는 가중치를 계산한다. Based on p (t, w) calculated through Equation 2, the weight calculator 208 calculates the weight.

본 발명에 일 실시예에 따르면, p(t,w)가 0.8 이상이면 가중치

에 1을 부여하고, 나머지 경우에는 p(t,w) 값을

으로 부여한다. According to an embodiment of the present invention, if p (t, w) is 0.8 or greater, the weight

To 1, and in other cases p (t, w)

To give.

여기서, 0.8은 실험적으로 결정된 수치이며, 이에 한정됨이 없이 가중치 1에 부여되는 수치는 다양하게 적용될 수 있다. Herein, 0.8 is an experimentally determined value, and the value given to the weight 1 is not limited thereto and may be variously applied.

가중치 계산부(208)는 아래의 수학식 3과 같이 소정 주파수 대역마다 앞서 계산된 코히어런스에 상기와 같이 부여된 가중치를 곱한다. The weight calculator 208 multiplies the previously given coherence by the above-mentioned weight for each predetermined frequency band as shown in Equation 3 below.

또한, 가중치 계산부(208)는 각 주파수 대역의 코히어런스에 가중치가 곱해진 결과를 아래의 수학식 4와 같이 더한 후 이를 제어부(210)로 출력한다. In addition, the weight calculator 208 adds the result of the weight multiplied by the coherence of each frequency band, as shown in Equation 4 below, and outputs the result to the controller 210.

본 발명에 따른 제어부(210)는 수학식 5와 같이 가중치 계산부(208)에서 출력된 결과값이 미리 설정된 임계치를 넘는지 여부를 판단하며, 임계치를 넘으면 사용자가 청취를 원하는 각도에서 음성이 입력된 것으로 판단한다. 즉, 제어부(210)는 결과값이 임계치 이상인 경우 현재 입력된 음향 신호를 처리하는 것으로 결정한다. The controller 210 according to the present invention determines whether the result value output from the weight calculator 208 exceeds a preset threshold value as shown in Equation 5, and when the threshold value is exceeded, the voice is input at an angle desired by the user to listen. I think it is. That is, the controller 210 determines to process the currently input sound signal when the result value is greater than or equal to the threshold.

반면, 제어부(210)는 가중치 계산부(208)에서의 결과값이 임계치를 넘지 못할 경우 현재 입력된 음향 신호를 감쇠시키는 것으로 결정한다. On the other hand, if the result of the weight calculator 208 does not exceed the threshold, the controller 210 determines to attenuate the currently input sound signal.

본 발명에 따르면, 제어부(210)는 상기한 판단 결과에 따른 제어 신호를 음향 출력 선택부(106)로 출력한다. According to the present invention, the controller 210 outputs a control signal according to the determination result to the sound output selector 106.

도 3에 도시된 바와 같이, 제어부(210)의 온/오프 제어 신호에 따라 음향 출력 선택부(106)는 마이크를 통해 입력된 음향 신호의 출력 여부를 선택한다. As shown in FIG. 3, the sound output selector 106 selects whether to output the sound signal input through the microphone according to the on / off control signal of the controller 210.

바람직하게 음향 출력 선택부(106)는 스위칭 소자로 구현될 수 있다. Preferably, the sound output selector 106 may be implemented as a switching element.

보다 상세하게, 도 3b와 같이 제어부(210)는 가중치 계산부(208)의 결과값이 임계치를 초과하는 경우에만 마이크로부터 입력된 음향 신호가 스피커, 음성 인식부 또는 전송부로 입력되도록 제어한다. In more detail, as illustrated in FIG. 3B, the controller 210 controls the sound signal input from the microphone to be input to the speaker, the voice recognition unit, or the transmitter only when the result value of the weight calculator 208 exceeds the threshold.

도 4에 도시된 바와 같이, 사용자가 원하지 않는 방향에서 음향이 입력될 때, 이러한 신호는 감쇠시키고, 사용자가 원하는 방향에서 입력된 음향만을 출력하기 때문에 소음 환경에서 효율적으로 사용될 수 있다. As shown in FIG. 4, when sound is input in a direction not desired by the user, such a signal is attenuated and outputs only sound input in the direction desired by the user, and thus can be efficiently used in a noisy environment.

본 발명의 실시예들은 다양한 컴퓨터 수단을 통하여 수행될 수 있는 프로그램 명령 형태로 구현되어 컴퓨터 판독 가능 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능 매체는 프로그램 명령, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 매체에 기록되는 프로그램 명령은 본 발명을 위하여 특별히 설계되고 구성된 것들이거나 컴퓨터 소프트웨어 당업자에게 공지되어 사용 가능한 것일 수도 있다. 컴퓨터 판독 가능 기록 매체의 예에는 하드 디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체(magnetic media), CD-ROM, DVD와 같은 광기록 매체(optical media), 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical), 및 롬(ROM), 램(RAM), 플래시 메모리 등과 같은 프로그램 명령의 예에는 컴파일러에 의해 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용해서 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드를 포함한다. 상기된 하드웨어 장치는 본 발명의 일 실시예들의 동작을 수행하기 위해 적어도 하나의 소프트웨어 모듈로서 동작하도록 구성될 수 있으며, 그 역도 마찬가지이다.Embodiments of the present invention may be implemented in the form of program instructions that can be executed by various computer means and recorded in a computer readable medium. The computer readable medium may include program instructions, data files, data structures, etc. alone or in combination. Program instructions recorded on the media may be those specially designed and constructed for the purposes of the present invention, or they may be of the kind well-known and available to those having skill in the computer software arts. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tape, optical media such as CD-ROMs, DVDs, and magnetic disks, such as floppy disks. Examples of program instructions such as magneto-optical, ROM, RAM, flash memory, etc. may be executed by a computer using an interpreter as well as machine code such as produced by a compiler. Contains high-level language codes. The hardware device described above may be configured to operate as at least one software module to perform the operations of one embodiment of the present invention, and vice versa.

상기한 본 발명의 바람직한 실시예는 예시의 목적을 위해 개시된 것이고, 본 발명에 대해 통상의 지식을 가진 당업자라면 본 발명의 사상과 범위 안에서 다양한 수정, 변경, 부가가 가능할 것이며, 이러한 수정, 변경 및 부가는 하기의 특허청구범위에 속하는 것으로 보아야 할 것이다.Preferred embodiments of the present invention described above are disclosed for purposes of illustration, and those skilled in the art will be able to make various modifications, changes, and additions within the spirit and scope of the present invention. Additions should be considered to be within the scope of the following claims.

Claims

A directional acoustic signal processing device,
A plurality of microphones for receiving ambient sound; And
A coherence calculator for calculating coherence for each frequency band of the sound signals input from the plurality of microphones;
An estimator estimating the existence probability of the speech signal based on the acoustic signals input from the plurality of microphones;
A weight calculator configured to assign different weights to respective frequency bands according to the estimated probability of existence of the speech signal, and output a result value by reflecting the weighted weights to the coherence;
A control unit which determines whether to process the plurality of micro-input sound signals by comparing the result value with a preset threshold value; And
And a sound output selector configured to select the plurality of micro-input signals to be output to at least one of a speaker, a voice recognizer, and a transmitter according to a determination of the controller.

The method of claim 1,
An interface unit for receiving an angle at which the user wants to listen; And
And a phase shifter for converting phases of sound signals input from the plurality of microphones according to a listening angle input through the interface unit.

The method of claim 1,
The estimating unit,
A background noise estimator for estimating background noise through an acoustic signal input from each of the plurality of microphones; And
And a signal existence probability estimator for estimating a speech signal existence probability of each frequency band based on the background noise.

The method of claim 1,
The weight calculation unit,
And a weight of 1 when the signal existence probability is equal to or greater than a preset value, and a weight of the signal existence probability as a weight when less than the preset value.

The method of claim 1,
The result value is a directional sound signal processing apparatus is a sum of the sum of the product of the weighted multiplied coherence for each frequency band.

delete