Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Data Compressor - Data Structures - Lecture Slides, Slides of Data Structures and Algorithms

Indian Institute of Management (IIM)Data Structures and Algorithms

Some concept of Data Structures are Abstract, Balance Factor, Complete Binary Tree, Dynamically, Storage, Implementation, Sequential Search, Advanced Data Structures, Graph Coloring Two, Insertion Sort. Main points of this lecture are: Data Compressor, Encoding and Decoding, Huffman, Compression, Files and Messages, Wasting, Smallest Number, Arbitrary Piece, Frequency, Short Bit Strings

Typology: Slides

2012/2013

Uploaded on 04/30/2013

dinpal 🇮🇳

3.6

(12)

73 documents

1 / 44

This page cannot be seen from the preview

Don't miss anything!

Data Compressor---Huffman

Encoding and Decoding

Docsity.com

Discover Slides of Data Structures and Algorithms Indian Institute of Management (IIM)

Partial preview of the text

Download Data Compressor - Data Structures - Lecture Slides and more Slides Data Structures and Algorithms in PDF only on Docsity!

Data Compressor---Huffman

Encoding and Decoding

Huffman Encoding

Compression
- Typically, in files and messages,
  - Each character requires 1 byte or 8 bits
  - Already wasting 1 bit for most purposes!
Question
- What’s the smallest number of bits that can be used to store an arbitrary piece of text?
Idea
- Find the frequency of occurrence of each character
- Encode Frequent characters short bit strings
- Rarer characters longer bit strings

Huffman's Algorithm

• Repeatedly merges trees - maintains a forest

• Tree weight - the sum of its leaves frequencies

• For C characters to code, start with C single

node trees

• Select two trees, T 1 and T 2 , of smallest weights

and merge them

• C - 1 merge operations

Huffman Encoding

Encoding
- Use a tree
  - Inefficient in practice
- Use a direct-addressed lookup table

? Finding the optimal encoding

Smallest number of bits to represent arbitrary text

A 010

E 00

B : : N : S T

A divide-and-conquer approach might have us

asking which characters should appear in the

left and right subtrees and trying to build the

tree from the top down.

A greedy approach places our n characters in

n sub-trees and starts by combining the two

least weight nodes into a tree which is

assigned the sum of the two leaf node weights

as the weight for its root node.

Data Compressor - Data Structures - Lecture Slides, Slides of Data Structures and Algorithms

Related documents

Partial preview of the text

Download Data Compressor - Data Structures - Lecture Slides and more Slides Data Structures and Algorithms in PDF only on Docsity!

Data Compressor---Huffman

Encoding and Decoding

Huffman Encoding

Huffman's Algorithm

• Repeatedly merges trees - maintains a forest

• Tree weight - the sum of its leaves frequencies

• For C characters to code, start with C single

node trees

• Select two trees, T 1 and T 2 , of smallest weights

and merge them

• C - 1 merge operations

Huffman Encoding

? Finding the optimal encoding

A 010

E 00

B : : N : S T

Standard Coding Scheme

Binary Tree Representation

• For the character set of C characters, the

standard fixed-length coding needs ┌log C┐^ bits

• Fixed-length code can be represented by a

binary tree where characters are stored only

in leaf nodes - binary trie

• Each character path - start at the root, follow

the branches, record 0 for the left branch and

1 for the right branch

• Optimal code is always a full tree - all nodes

are either leaves or have two children

Improved Binary Trie

Prefix Code

• The fixed-length character code that has

characters places only at the leaves

guarantees that any bit sequence can be

decoded unambiguously

• Prefix code - characters may have varying

lengths as long as no character code is a prefix

of another code

• That means that characters can be only in

leafs

Optimal Prefix Code Tree

Optimal Prefix Code Cost