Shweta_S
Member level 3
I am working on speech recognition, using MFCCs (mel-frequency cepstral coefficients) as speech features.
I generated a filter bank of overlapping triangular filters in the frequency domain.
Since the filters are defined in the frequency domain, I multiplied the power spectrum of the signal by each filter.
Now what do I do? Should I sum the band-passed spectrum within each band to get the filter bank outputs?
Then I take the log and the DCT of those outputs, right?
Is summing the band-passed signal (in the frequency domain) correct?
Please help me out...
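
For reference, the steps described above can be sketched as follows. In the usual MFCC pipeline, each filter bank output is a single number: the sum over frequency bins of the power spectrum weighted by that triangular filter. The log and DCT are then applied to the vector of those outputs. This is only a minimal illustration under stated assumptions: the function names are hypothetical, the toy filters are spaced linearly on bin indices rather than on the mel scale, and the DCT is an unnormalized DCT-II.

```python
import math


def triangular_filter(n_bins, left, center, right):
    """Triangular weights over bin indices: 0 at left/right, 1 at center.

    Assumes left < center < right < n_bins (hypothetical helper).
    """
    weights = [0.0] * n_bins
    for k in range(left, right + 1):
        if k <= center:
            weights[k] = (k - left) / (center - left)
        else:
            weights[k] = (right - k) / (right - center)
    return weights


def filterbank_energies(power_spectrum, filters):
    """Weight the power spectrum by each filter and sum over bins.

    Returns one energy value per filter.
    """
    return [
        sum(w * p for w, p in zip(filt, power_spectrum))
        for filt in filters
    ]


def dct_ii(values, n_coeffs):
    """Unnormalized DCT-II of a list, keeping the first n_coeffs terms."""
    n = len(values)
    return [
        sum(v * math.cos(math.pi * k * (m + 0.5) / n)
            for m, v in enumerate(values))
        for k in range(n_coeffs)
    ]


def mfcc_from_power_spectrum(power_spectrum, filters, n_coeffs, floor=1e-10):
    """Filter bank energies -> log -> DCT for one frame."""
    energies = filterbank_energies(power_spectrum, filters)
    # Floor the energies so that log() never receives zero.
    log_energies = [math.log(max(e, floor)) for e in energies]
    return dct_ii(log_energies, n_coeffs)


# Toy example: 9 spectrum bins, 3 overlapping filters (linear spacing
# chosen only for illustration; real MFCCs use mel-spaced filters).
n_bins = 9
filters = [
    triangular_filter(n_bins, 0, 2, 4),
    triangular_filter(n_bins, 2, 4, 6),
    triangular_filter(n_bins, 4, 6, 8),
]
frame_power = [1.0] * n_bins  # flat power spectrum for one frame
coeffs = mfcc_from_power_spectrum(frame_power, filters, n_coeffs=3)
```

With a real signal, `frame_power` would be the squared magnitude of the FFT of one windowed frame, and the whole computation would be repeated for every frame.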