Wednesday, April 22, 15:00 - 17:30, Location: Show and Tell Area A
Tim Fingscheidt, Suhadi Suhadi
Quality assessment of speech enhancement systems is generally conducted via subjective listening tests or objective measurements. The most important parameters are speech distortion, residual noise level, and residual echo level. Usually, the performance is evaluated directly on the enhanced signal, which however makes it difficult to judge these parameters independently and precisely, especially during double talk.
As a solution, it is reasonable to split the resulting enhanced signal into its signal components. For this purpose, all additive components of the microphone signal (i.e., clean speech, noise and echo signals) are separately processed through the sending direction (i.e., uplink) of the enhancement system, whereby the enhancement system's behaviour has to be logged beforehand based on noisy and echoic input speech. There are a couple of assumptions taken with such an approach, the most problematic may be that internal processing of the speech enhancement system must be known beforehand (“white box” test).
The new ITU-T P.1100 Recommendation "Narrowband Hands Free Communication in Motor Vehicles" offers an interesting instrumental test option that can be applied to unkown ("black box") systems such as hardware or compiled software realizations. It is also attractive for research on speech enhancement systems since some intuitive white box system assumptions and restrictions do not apply. Based on some prior work on signal separation we developed a MATLAB-GUI-based tool that allows to experience the following at the demo booth: