4.3. Audio Compression

Simple Audio Compression Methods
Psychoacoustics
MPEG Audio Compression

Reference: Chapter 6 of Steinmetz and Nahrstedt

  

4.3.1 Simple Audio Compression Methods


The following are some of the Lossy methods:

  

4.3.2 Psychoacoustics


Human hearing and voice

Question: How sensitive is human hearing?

Frequency Masking

Question: Do receptors interfere with each other?

Critical Bands

Barks

Temporal masking

Summary

  

4.3.3 MPEG Audio Compression


Reference: Davis Pan, "A Tutorial on MPEG/Audio Compression", IEEE Multimedia, pp. 60-74, 1995.

Some facts

Steps in algorithm:

  1. Use convolution filters to divide the audio signal (e.g., 48 kHz sound) into frequency subbands that approximate the 32 critical bands --> sub-band filtering.

  2. Determine amount of masking for each band caused by nearby band using the results shown above (this is called the psychoacoustic model).

  3. If the power in a band is below the masking threshold, don't encode it.

  4. Otherwise, determine number of bits needed to represent the coefficient such that noise introduced by quantization is below the masking effect (Recall that 1 bit of quantization introduces about 6 dB of noise).

  5. Format bitstream

Example:

MPEG Layers

Effectiveness of MPEG audio

--------------------------------------------------------------------
Layer      Target     Ratio    Quality @    Quality @    Theoretical
           bitrate             64 kbits     128 kbits    Min. Delay
--------------------------------------------------------------------
Layer 1   192 kbit     4:1       ---          ---          19 ms  
Layer 2   128 kbit     6:1    2.1 to 2.6       4+          35 ms   
Layer 3    64 kbit    12:1    3.6 to 3.8       4+          59 ms    
--------------------------------------------------------------------

Further Exploration

MPEG Resources on the Web.
Last Updated: 7/8/96
  
Top | Chap 4 | CMPT 365 Home Page | CS