MPC - MP2

 

MPC is only distantly related with the ancient MP2 encoders.
It's a completely self written encoder, based on subband compression which makes use of filterbanks.

There are many difference between MPC and MP2 codecs.
MPC introduces the following features:

- VBR
- M/S Coding
- ANS (Adaptive Noise Shaping)
- Huffman (lossless compression) but the main difference is the psychoacoustic model with its extensions.

"- because the resolution in freq.domain is much less than the resolution with transform codecs, mpc uses so-called "adaptive noise shaping" to do a spectral shaping of the quantization noise in the subbands. This proceeding saves bitrate especially within the lower subbands".

- to get a much higher lossless compression, mpc uses
huffman-coding and more sophisticated coding-schemes like differential coding.


psychoacoustics (from old Buschmann's post but still good):
=============
also the psychoacoustic model has been tuned a lot. it was based on the "psychoacoustic model 2" that was introduced with the MPEG1-standard. But i implemented a lot of new features like "clear voice detection" (cvd), exploitation of temporal postmasking, detection of fast transient signals, nonlinear spreading-function, new threshold in quiet and so on... also the
psychoacoustic model takes filterbank-specific attributes (e.g. aliasing) into consideration.
CVD is able to detect voice-like signals to give a higher quality with voices or sounds with harmonic spectra. it uses special analysis to detect harmonics with varying base frequency - the "normal" psychoacoustics are not able to
detect such signals and will add audible noise to these signals.
The exploitation of temporal postmasking saves a few kbit/s because the human hearing has to "relax" after a sound event, so that the encoder can put a bit more distortion to the signal during this time.
Another thing that had to be changed was the threshold in quiet.
The threshold in quiet is the loudness below noone can perceive a sine-tone or a narrowband-noise. The threshold in quiet (ltq: "l"evel "t"reshold in "q"uiet) described in the ISO-MPEG-standard is the ltq that was measured with average listeners (me, my mom, my grandma and so on...), so that also
listeners were included that have not the best hearing. the new ltq was measured by listening tests of three (mine and my major listening testers) persons using very precise headphones.
The results of the psychoacoustics are much better than with the use of the iso-ltq.
Another change in the psychoacoustics is the use of nonlinear spreading-function. In fact the masking effect the spreading-function describes is nonlinear - it varies with the frequency and the loudness of the masker.
The "normal" psychoacoustic model does not exploit this fact.
===============

HOME02.gif (9867 byte)     ritorna alla pagina Principale

                      HOME02.gif (9867 byte)    ritorna alla pagina Muse PaCk