UNIT-5 Entropy Encoding
UNIT-5 Entropy Encoding
Entropy encoding
Advantages
● Reduces memory and storage usage.
● Improves data transmission speed for repetitive or sparse datasets.
Disadvantages
● Encoding and decoding can add computational overhead.
● Not effective for dense datasets without a high prevalence of blanks/zeros.
Statistical Encoding is a data compression technique that leverages the statistical
properties of data—such as the frequency of occurrence of symbols—to achieve
efficient encoding. The fundamental idea is to assign shorter codes to more frequently
occurring symbols and longer codes to less frequent ones, reducing the overall storage
or transmission cost.
Advantages
● Optimized for datasets with skewed distributions, reducing average code length.
● Can achieve near-optimal compression ratios for well-understood distributions.
Disadvantages
● Ineffective for uniformly distributed data (no significant frequency difference).
● Requires preprocessing to calculate symbol frequencies, which can be
computationally expensive.
Source Encoding
Source Encoding, also known as data compression or source coding, is the process
of converting data from its original format into a compressed representation to reduce
redundancy and optimize storage or transmission. It is widely used in
telecommunications, multimedia, and data processing.
Advantages
● Saves storage space.
● Reduces transmission time and bandwidth costs.
● Optimizes performance in systems with limited resources.
Disadvantages
● Encoding/decoding can be computationally expensive.
● Lossy methods may degrade data quality.