The classification of microphone arrays was introduced from the perspective of hardware. Then, combined with the front-end algorithm of the microphone array, let's see what the function of the microphone array is. 1. Sound source localization Humans have two ears and executive email list can judge the direction of the sound by sound, and robots can do the same. This function is sound source localization, which senses the direction of the person through the sound, so as to realize executive email list the tracking of the direction of the target sound source. This also lays the groundwork for the subsequent beamforming technology.
For example, in a robot scene, we call it on the left side of the robot, and the robot turns its head to the left when it hears the sound. When we call it behind the robot, the robot turns over when it hears the sound. This is the most typical application of sound source executive email list localization. Usually sound source localization is used in the wake-up phase of speech, which can detect a general direction. The commonly used technology is TDOA (Time Difference Of Arrival executive email list , time difference of arrival). The simple understanding is to calculate the position coordinates of the sound source by calculating the time difference between the signal arriving at the microphone, which requires millisecond-level response and calculation. 2.
Noise suppression / vocal enhancement In speech recognition, the speech information is often mixed with noise, such as ambient noise and human voice interference, which usually do not cover up normal speech, but only affect the clarity of the sound. The microphone array executive email list mainly uses beamforming technology to suppress noise and enhance human voice. It can be understood that only the sound of a certain angle is recognized (generally the angle can be adjusted), executive email list and the sound of other angles will be suppressed, so as to achieve the purpose of suppressing noise. Conversely, it can also enhance the vocals within the angle, which is to enhance the vocals. For example, in a family scene, if we turn on the