Why Mel cepstral coefficients, PLP?

Shweta_S · Feb 24, 2012

I have a huge doubt!!

When I looked for types of features to extract from the speech signal for speech recognition, many suggested using MFCC (in non-noisy conditions) and PLP (in noisy conditions).

In both feature extraction processes, we have to warp the power spectrum of the signal according to the auditory response of the ear (i.e. the mel scale or the Bark scale).

Why do we have to take into account the auditory response when we're doing speech recognition? Please Help!!

manimail2007 · Feb 24, 2012

The main reason we consider the response of the ear is that, in the real world, signals such as speech and sound are analog in nature, but in electronics digital processing is much more convenient than analog. In digital processing, especially for real-time signals, approximation can lose a lot of information. Hence, in speech processing we are more concerned with how the human ear perceives a particular signal, and we process it accordingly: the ear resolves low frequencies more finely than high ones, so warping the spectrum onto the mel or Bark scale keeps the detail that matters most for perceiving speech. That is why algorithms like MFCC and PLP were developed. Hope it's helpful.
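To make the mel warping concrete, here is a minimal sketch in Python. It assumes the commonly used formula mel = 2595 * log10(1 + f / 700); other variants of the mel scale exist, and nothing in this thread says which one a given MFCC implementation uses. The function names and the choice of 10 filters between 0 and 8000 Hz are hypothetical, picked only for illustration.

```python
import math


def hz_to_mel(f_hz):
    """Convert frequency in Hz to mels (assumed 2595/700 variant of the scale)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m):
    """Inverse of hz_to_mel: convert mels back to Hz."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_center_frequencies(n_filters, f_min_hz, f_max_hz):
    """Return n_filters center frequencies (Hz) equally spaced on the mel scale."""
    m_min = hz_to_mel(f_min_hz)
    m_max = hz_to_mel(f_max_hz)
    step = (m_max - m_min) / (n_filters + 1)
    return [mel_to_hz(m_min + step * (i + 1)) for i in range(n_filters)]


# Hypothetical example values: 10 filters covering 0 to 8000 Hz.
centers = mel_center_frequencies(10, 0.0, 8000.0)
for c in centers:
    print(round(c, 1))
```

The centers are equally spaced in mels, so in Hz they sit close together at low frequencies and spread out at high frequencies. That is the sense in which the power spectrum is "warped" toward the resolution of the ear before the cepstral coefficients are computed.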
