Digital Audio Compression - Multimedia Computing - Lecture Slides, Slides of Multimedia Applications

Amity University - Bihar Multimedia Applications

Multimedia Computing: in this short course we study the basic concepts and principles of computer architecture. The key points covered in these lecture slides are: Digital Audio Compression, Speech Compression, General Audio Compression, Psychoacoustics, Equal-Loudness Relations, Threshold of Hearing, Frequency Masking, Critical Bands, Human Hearing Range, Temporal Masking

Typology: Slides

2012/2013

Uploaded on 04/23/2013

sarasvatir 🇮🇳


Digital Audio Compression

Speech Compression

Compression of voice data
- We have previously mentioned several methods that are used to compress voice data
  - mu-law and A-law companding
  - ADPCM and delta modulation
- These are examples of methods which work in the time domain (as opposed to the frequency domain)
  - Often they are not even considered compression methods
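
As a reminder of how time-domain companding works, here is a minimal Python sketch of mu-law compression and expansion. It assumes samples normalized to [-1, 1] and mu = 255; the function names are illustrative and are not taken from the slides.

```python
import math

MU = 255  # assumed companding constant (the value commonly used for 8-bit telephony)

def mu_law_compress(x, mu=MU):
    """Map a sample in [-1, 1] to [-1, 1] on a logarithmic scale."""
    return math.copysign(math.log1p(mu * abs(x)) / math.log1p(mu), x)

def mu_law_expand(y, mu=MU):
    """Invert mu_law_compress."""
    return math.copysign(math.expm1(abs(y) * math.log1p(mu)) / mu, y)

if __name__ == "__main__":
    for sample in (0.01, 0.1, 0.5, 1.0):
        compressed = mu_law_compress(sample)
        print(f"{sample:5.2f} -> {compressed:.3f} -> {mu_law_expand(compressed):.3f}")
```

Quiet samples are stretched over a larger share of the output range, so a uniform quantizer applied after compression spends more of its levels on soft sounds.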

General Audio Compression

If we want to compress general audio (not just speech), different techniques are needed
- In particular, music compression is a more general form of audio compression
We make use of psychoacoustical modeling
- Enables perceptual encoding based upon an analysis of how the ear and brain perceive sound
- Perceptual encoding exploits audio elements that the human ear cannot hear well

Psychoacoustics

If you have been listening to very loud music, you may have trouble afterwards hearing soft sounds (that normally you could hear)
- Temporal masking
A loud sound at one frequency (a lead guitar) may drown out a sound at another frequency (the singer)
- Frequency masking

Equal-Loudness Relations

Threshold of Hearing

The following image is a plot of the threshold of human hearing for pure tones – at loudness levels below the curve, we don’t hear a tone
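
The shape of such a curve can be sketched numerically. The Python example below uses a widely quoted analytic approximation of the threshold in quiet (attributed to Terhardt). The formula and the function names are assumptions brought in for illustration; they do not appear in the slides.

```python
import math

def threshold_in_quiet_db(freq_hz):
    """Approximate threshold of hearing in dB SPL for a pure tone (assumed formula)."""
    f = freq_hz / 1000.0  # frequency in kHz
    return (3.64 * f ** -0.8
            - 6.5 * math.exp(-0.6 * (f - 3.3) ** 2)
            + 1e-3 * f ** 4)

def is_audible(freq_hz, level_db_spl):
    """A tone below the curve is not heard; one above it is."""
    return level_db_spl > threshold_in_quiet_db(freq_hz)

if __name__ == "__main__":
    for freq in (50, 100, 1000, 3300, 10000, 16000):
        print(f"{freq:6d} Hz: {threshold_in_quiet_db(freq):7.2f} dB SPL")
```

Under this approximation the ear is most sensitive around 3 to 4 kHz, and the threshold rises steeply toward both ends of the audible range.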

Frequency masking

We can determine how a pure tone at a particular frequency affects our ability to hear tones at nearby frequencies
Then, if a signal can be decomposed into frequencies, for those frequencies that are only partially masked, only the audible part will be used to set the quantization noise thresholds
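
One way to picture how a masking threshold sets the quantization noise level is the rough rule that each bit of a uniform quantizer buys about 6 dB of signal-to-noise ratio. The Python sketch below rests on that rule; the function name and the example levels are hypothetical, and real encoders use a more elaborate allocation loop.

```python
import math

DB_PER_BIT = 6.02  # approximate SNR gained per quantizer bit (rule of thumb)

def bits_needed(signal_db, mask_db):
    """Bits required to push quantization noise below the masking threshold.

    signal_db: level of the frequency component
    mask_db:   masking threshold at that frequency (hypothetical input)
    """
    signal_to_mask = signal_db - mask_db
    if signal_to_mask <= 0:
        return 0  # fully masked: the component need not be coded at all
    return math.ceil(signal_to_mask / DB_PER_BIT)

if __name__ == "__main__":
    for signal, mask in ((60, 65), (60, 48), (60, 30)):
        print(f"signal {signal} dB, mask {mask} dB -> {bits_needed(signal, mask)} bits")
```

A component that is only partially masked needs just enough bits to cover the part that rises above the threshold, which is where the bit savings come from.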

Critical Bands

Human hearing range divides into critical bands
- Human auditory system cannot resolve sounds better than within about one critical band when other sounds are present
- Critical bandwidth represents the ear’s resolving power for simultaneous tones
- At lower frequencies the bands are narrower than at higher frequencies
- The band is the section of the inner ear which responds to a particular frequency

Critical Bands

Generally, the audio frequency range for hearing (20 Hz – 20 kHz) can be partitioned into about 24 critical bands (25 are typically used for coding applications)
- The previous slide does not show several of the highest frequency critical bands
- The critical band at the highest audible frequency is over 4000 Hz wide
- The ear is not very discriminating within a critical band
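
The band count and bandwidth figures above can be checked against Zwicker's commonly cited approximations for the Bark scale and the critical bandwidth. Both formulas are assumptions introduced here, not part of the slides, and the function names are illustrative.

```python
import math

def hz_to_bark(freq_hz):
    """Approximate critical-band number (Bark) for a frequency in Hz."""
    return (13.0 * math.atan(0.00076 * freq_hz)
            + 3.5 * math.atan((freq_hz / 7500.0) ** 2))

def critical_bandwidth_hz(freq_hz):
    """Approximate width in Hz of the critical band centred at freq_hz."""
    return 25.0 + 75.0 * (1.0 + 1.4 * (freq_hz / 1000.0) ** 2) ** 0.69

if __name__ == "__main__":
    for freq in (100, 500, 1000, 5000, 10000, 20000):
        print(f"{freq:6d} Hz: {hz_to_bark(freq):5.2f} Bark, "
              f"bandwidth about {critical_bandwidth_hz(freq):7.1f} Hz")
```

Under these approximations 20 kHz falls between 24 and 25 Bark, and the bandwidth grows from roughly 100 Hz at low frequencies to several kilohertz at the top of the range.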

Temporal Masking

A loud tone causes the hearing receptors in the inner ear to become saturated, and they require time to recover
- This leads to the temporal masking effect
- After the loud tone we cannot immediately hear another tone – post-masking
  - The length of the masking depends on the duration of the masking tone
- A masking tone can also block sounds played just before – pre-masking (shorter time)

MPEG Audio Compression

MPEG (Moving Picture Experts Group) is a family of standards for compression of both audio and video data
- MPEG-1 (1991) CD quality audio
- MPEG-2 (1994) Multi-channel surround sound
- MPEG-4 (1998) Also includes MIDI, speech, etc.
- MPEG-7 (2003) Not compression – searching
- MPEG-21 (2004) Not compression – digital rights management

MPEG Audio Compression

MPEG-1 defined three downward compatible layers of audio compression
- Each layer offers more complexity in the psychoacoustic model used and hence better compression
- Increased complexity leads to increased delay
- Compatibility achieved by shared file header information
- Layer 1 – used for Digital Audio Tape
- Layer 2 – proposed for digital audio broadcasting
- Layer 3 – music (MPEG-1 layer 3 == mp3)

MPEG Audio Compression

PCM input is filtered into 32 sub-bands
PCM input is also FFT-transformed for the psychoacoustic (PA) model
Windows of samples (384, 576, 1152) are coded at a time
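
The arithmetic behind these numbers is easy to work through. The Python sketch below assumes a 44.1 kHz sampling rate (CD audio) and 32 equal-width sub-bands; the rate is an assumption and the names are illustrative.

```python
SAMPLE_RATE_HZ = 44_100  # assumed CD sampling rate
NUM_SUBBANDS = 32

def window_duration_ms(num_samples, sample_rate_hz=SAMPLE_RATE_HZ):
    """Time span covered by one window of PCM samples."""
    return 1000.0 * num_samples / sample_rate_hz

def subband_width_hz(sample_rate_hz=SAMPLE_RATE_HZ, num_subbands=NUM_SUBBANDS):
    """Width of each sub-band when 0..fs/2 is split into equal bands."""
    return sample_rate_hz / 2 / num_subbands

def samples_per_subband(num_samples, num_subbands=NUM_SUBBANDS):
    """Samples each sub-band contributes to one window."""
    return num_samples // num_subbands

if __name__ == "__main__":
    print(f"sub-band width: {subband_width_hz():.1f} Hz")
    for window in (384, 576, 1152):
        print(f"{window:5d} samples: {window_duration_ms(window):6.2f} ms, "
              f"{samples_per_subband(window)} samples per sub-band")
```

Longer windows give the psychoacoustic model more signal to analyze at once, at the cost of more delay, which matches the complexity and delay trade-off between the layers.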

MPEG Audio Compression

Since the sub-bands overlap, aliasing may occur
- This is overcome by the use of a quadrature mirror filter bank
  - Attenuation slopes of adjacent bands are mirror images
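
The mirror-image property can be illustrated with the simplest possible case: a two-band quadrature mirror filter pair built from the Haar filters. This is a toy sketch, not the 32-band filter bank used in MPEG audio, and all names in it are illustrative.

```python
import cmath
import math

S = 1 / math.sqrt(2)
H0 = [S, S]    # low-pass filter
H1 = [S, -S]   # high-pass filter: h1[n] = (-1)^n * h0[n]

def magnitude(h, w):
    """Magnitude of the frequency response of FIR filter h at w rad/sample."""
    return abs(sum(c * cmath.exp(-1j * w * n) for n, c in enumerate(h)))

def analyze(x):
    """Split an even-length signal into two half-rate sub-band signals."""
    low = [(x[i] + x[i + 1]) * S for i in range(0, len(x) - 1, 2)]
    high = [(x[i] - x[i + 1]) * S for i in range(0, len(x) - 1, 2)]
    return low, high

def synthesize(low, high):
    """Recombine the sub-bands; the aliasing terms cancel."""
    out = []
    for lo, hi in zip(low, high):
        out += [(lo + hi) * S, (lo - hi) * S]
    return out

if __name__ == "__main__":
    w = 0.3 * math.pi
    print(magnitude(H1, w), magnitude(H0, math.pi - w))  # mirror images
    signal = [1.0, 4.0, -2.0, 0.5]
    print(synthesize(*analyze(signal)))  # matches signal up to rounding
```

Each sub-band is downsampled and therefore aliased on its own, but because the two responses mirror each other around a quarter of the sampling rate, the aliasing cancels when the bands are recombined.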