Data Compression


Criteria

Survey Formats

Basics

Basic Terms

Symbol

Set of Symbols

Alphabet

Code

Coding

Redundancy

Information Theory

Message

Probability

Information

Entropy

Calculation

Characteristics

Extreme Values

Diagram

Redundancy Reduction

Irrelevance Reduction

Entropy Coding

Variable Length Codes

Code Trees

Compression Methods

Data Formats


Glossary

Index


Download


www.BinaryEssence.com

Diagram: Entropy of a binary Information Source


The diagram below shall demonstrate the characteristics of a binary information source providing two different messages (A and B):



The probability of the message A is always

P(A) = 1 - P(B),

because the sum of both probabilities must be 1. Other messages than A or B did not occur.


The maximum entropy (H = 1) will be reached if both messages have the same probability (each 0.5). In this case the information source must be coded with an average code length of 1 bit. Only if the probabilities differ, i.e. one of the messages appears more frequent, there would be an option to get a better code efficiency.


Values:

P(A)P(B)H(A,B)
0,01,00,00
0,10,90,47
0,20,80,72
0,30,70,88
0,40,60,97
0,50,51,00
0,60,40,97
0,70,30,88
0,80,20,72
0,90,10,47
1,00,00,00

 <   ^   > 

Entropy Extreme Values Redundancy Reduction