
Data compression

In information theory, data compression, source coding,[1] or bit-rate reduction is the process of
encoding information using fewer bits than the original representation.[2] Any particular compression is
either lossy or lossless. Lossless compression reduces bits by identifying and eliminating statistical
redundancy. No information is lost in lossless compression. Lossy compression reduces bits by removing
unnecessary or less important information.[3] Typically, a device that performs data compression is referred
to as an encoder, and one that performs the reversal of the process (decompression) as a decoder.

The process of reducing the size of a data file is often referred to as data compression. In the context of data
transmission, it is called source coding: encoding is done at the source of the data before it is stored or
transmitted.[4] Source coding should not be confused with channel coding, which provides error detection and
correction, or line coding, the means for mapping data onto a signal.

Compression is useful because it reduces the resources required to store and transmit data. Computational
resources are consumed in the compression and decompression processes. Data compression is subject to a
space-time complexity trade-off. For instance, a compression scheme for video may require expensive
hardware for the video to be decompressed fast enough to be viewed as it is being decompressed, and the
option to decompress the video in full before watching it may be inconvenient or require additional storage.
The design of data compression schemes involves trade-offs among various factors, including the degree of
compression, the amount of distortion introduced (when using lossy data compression), and the
computational resources required to compress and decompress the data.[5]

Lossless
Lossless data compression algorithms usually exploit statistical redundancy to represent data without losing
any information, so that the process is reversible. Lossless compression is possible because most real-world
data exhibits statistical redundancy. For example, an image may have areas of color that do not change over
several pixels; instead of coding "red pixel, red pixel, ..." the data may be encoded as "279 red pixels". This
is a basic example of run-length encoding; there are many schemes to reduce file size by eliminating
redundancy.
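
To make the example concrete, here is a minimal run-length coder in Python (an illustrative sketch of the idea, not the format used by any real image codec; the function names are ours):

    # Toy run-length encoding: collapse runs of identical symbols into (count, symbol) pairs.
    def rle_encode(data):
        runs = []
        i = 0
        while i < len(data):
            j = i
            while j < len(data) and data[j] == data[i]:
                j += 1
            runs.append((j - i, data[i]))  # (count, symbol)
            i = j
        return runs

    def rle_decode(runs):
        return [sym for count, sym in runs for _ in range(count)]

    pixels = ["red"] * 279 + ["blue"] * 3
    encoded = rle_encode(pixels)          # [(279, 'red'), (3, 'blue')]
    assert rle_decode(encoded) == pixels  # lossless: the round trip is exact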

The Lempel–Ziv (LZ) compression methods are among the most popular algorithms for lossless storage.[6]
DEFLATE is a variation on LZ optimized for decompression speed and compression ratio, but
compression can be slow. In the mid-1980s, following work by Terry Welch, the Lempel–Ziv–Welch
(LZW) algorithm rapidly became the method of choice for most general-purpose compression systems.
LZW is used in GIF images, programs such as PKZIP, and hardware devices such as modems.[7] LZ
methods use a table-based compression model where table entries are substituted for repeated strings of
data. For most LZ methods, this table is generated dynamically from earlier data in the input. The table
itself is often Huffman encoded. Grammar-based codes can compress highly repetitive input
extremely effectively, for instance, a biological data collection of the same or closely related species, a huge
versioned document collection, internet archives, etc. The basic task of grammar-based codes is
constructing a context-free grammar deriving a single string. Practical grammar compression
algorithms include Sequitur and Re-Pair.
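
A minimal sketch of the table-based LZ idea described above, in the style of LZW (dictionary seeded with all single bytes and grown dynamically from earlier input; simplified, with an unbounded table and no bit-packing):

    def lzw_encode(data: bytes):
        table = {bytes([i]): i for i in range(256)}  # seed with all single bytes
        w = b""
        out = []
        for byte in data:
            wc = w + bytes([byte])
            if wc in table:
                w = wc                     # keep extending the current match
            else:
                out.append(table[w])       # emit code for the longest known prefix
                table[wc] = len(table)     # add the new string to the table
                w = bytes([byte])
        if w:
            out.append(table[w])
        return out

    codes = lzw_encode(b"TOBEORNOTTOBEORTOBEORNOT")
    print(len(codes), "codes for 24 bytes")  # repeated substrings become single table indices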
The strongest modern lossless compressors use probabilistic models, such as prediction by partial matching.
The Burrows–Wheeler transform can also be viewed as an indirect form of statistical modelling.[8] In a
further refinement of the direct use of probabilistic modelling, statistical estimates can be coupled to an
algorithm called arithmetic coding. Arithmetic coding is a more modern coding technique that uses the
mathematical calculations of a finite-state machine to produce a string of encoded bits from a series of input
data symbols. It can achieve superior compression compared to other techniques such as the better-known
Huffman algorithm. It uses an internal memory state to avoid the need to perform a one-to-one mapping of
individual input symbols to distinct representations that use an integer number of bits, and it clears out the
internal memory only after encoding the entire string of data symbols. Arithmetic coding applies especially
well to adaptive data compression tasks where the statistics vary and are context-dependent, as it can be
easily coupled with an adaptive model of the probability distribution of the input data. An early example of
the use of arithmetic coding was in an optional (but not widely used) feature of the JPEG image coding
standard.[9] It has since been applied in various other designs including H.263, H.264/MPEG-4 AVC and
HEVC for video coding.[10]
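
The interval-narrowing mechanism behind arithmetic coding can be sketched in a few lines. This is a didactic floating-point version with a fixed symbol model; real coders use integer arithmetic with renormalization and, as noted above, usually an adaptive model:

    def arith_encode(symbols, probs):
        # Build each symbol's cumulative sub-interval of [0, 1).
        cum, edges = 0.0, {}
        for s, p in probs.items():
            edges[s] = (cum, cum + p)
            cum += p
        low, high = 0.0, 1.0
        for s in symbols:
            s_lo, s_hi = edges[s]
            span = high - low
            high = low + span * s_hi   # narrow the interval to the
            low = low + span * s_lo    # current symbol's sub-range
        return (low + high) / 2        # any number in [low, high) identifies the message

    x = arith_encode("aaab", {"a": 0.8, "b": 0.2})
    print(x)  # one number in [0, 1) encodes the whole sequence; a decoder with the
              # same model recovers the symbols by re-walking the intervals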

Archive software typically has the ability to adjust the "dictionary size", where a larger size demands more
random-access memory during compression and decompression but achieves stronger compression, especially on
repeating patterns in files' content.[11][12]

Lossy
In the late 1980s, digital images became more common, and
standards for lossless image compression emerged. In the early
1990s, lossy compression methods began to be widely used.[13] In
these schemes, some loss of information is accepted as dropping
nonessential detail can save storage space. There is a corresponding
trade-off between preserving information and reducing size. Lossy
data compression schemes are designed by research on how people
perceive the data in question. For example, the human eye is more
sensitive to subtle variations in luminance than it is to the variations
in color. JPEG image compression works in part by rounding off
nonessential bits of information.[14] A number of popular compression formats exploit these perceptual
differences, including psychoacoustics for sound and psychovisuals for images and video.

[Figure: MP3, an example of a lossy file format, compared to WAV.]

Most forms of lossy compression are based on transform coding, especially the discrete cosine transform
(DCT). It was first proposed in 1972 by Nasir Ahmed, who then developed a working algorithm with T.
Natarajan and K. R. Rao in 1973, before introducing it in January 1974.[15][16] DCT is the most widely
used lossy compression method, and is used in multimedia formats for images (such as JPEG and
HEIF),[17] video (such as MPEG, AVC and HEVC) and audio (such as MP3, AAC and Vorbis).
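
The transform-coding idea can be sketched on a single 8×8 block: take the DCT, then quantize, which zeroes most high-frequency coefficients. This sketch uses SciPy's dctn/idctn; the flat quantization step of 16 is an arbitrary illustrative choice, not a real JPEG quantization table:

    import numpy as np
    from scipy.fft import dctn, idctn

    block = np.tile(np.linspace(0, 255, 8), (8, 1))   # a smooth 8x8 gradient
    coeffs = dctn(block, norm="ortho")                # energy compacts into few coefficients
    quantized = np.round(coeffs / 16)                 # the lossy step: most entries become 0
    restored = idctn(quantized * 16, norm="ortho")
    print(np.count_nonzero(quantized), "of 64 coefficients kept")
    print(np.abs(restored - block).max())             # small error relative to the 0-255 range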

Lossy image compression is used in digital cameras to increase storage capacities. Similarly, DVDs, Blu-
ray and streaming video use lossy video coding formats. Lossy compression is extensively used in video.

In lossy audio compression, methods of psychoacoustics are used to remove non-audible (or less audible)
components of the audio signal. Compression of human speech is often performed with even more
specialized techniques; speech coding is distinguished as a separate discipline from general-purpose audio
compression. Speech coding is used in internet telephony; for example, audio compression is used for CD
ripping and is decoded by the audio players.[8]
Lossy compression can cause generation loss.

Theory
The theoretical basis for compression is provided by information theory and, more specifically, Shannon's
source coding theorem; domain-specific theories include algorithmic information theory for lossless
compression and rate–distortion theory for lossy compression. These areas of study were essentially created
by Claude Shannon, who published fundamental papers on the topic in the late 1940s and early 1950s.
Other topics associated with compression include coding theory and statistical inference.[18]

Machine learning

There is a close connection between machine learning and compression. A system that predicts the posterior
probabilities of a sequence given its entire history can be used for optimal data compression (by using
arithmetic coding on the output distribution). Conversely, an optimal compressor can be used for prediction
(by finding the symbol that compresses best, given the previous history). This equivalence has been used as
a justification for using data compression as a benchmark for "general intelligence".[19][20][21]

In an alternative view, compression algorithms implicitly map strings into implicit feature space
vectors, and compression-based similarity measures compute similarity within these feature spaces. For
each compressor C(.) we define an associated vector space ℵ, such that C(.) maps an input string x
to the vector norm ||~x||. An exhaustive examination of the feature spaces underlying all
compression algorithms is precluded by space; instead, one study examines three
representative lossless compression methods, LZW, LZ77, and PPM.[22]
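
A common concrete instance of such a compression-based similarity measure is the normalized compression distance (NCD); here is a sketch using zlib as the compressor C(.) (the choice of zlib is ours, for illustration):

    import zlib

    def C(x: bytes) -> int:
        return len(zlib.compress(x, 9))   # compressed length as a complexity proxy

    def ncd(x: bytes, y: bytes) -> float:
        cx, cy, cxy = C(x), C(y), C(x + y)
        return (cxy - min(cx, cy)) / max(cx, cy)

    a = b"the quick brown fox jumps over the lazy dog " * 20
    b2 = b"the quick brown fox leaps over the lazy cat " * 20
    c = b"qwxzkj 190 vmpl 4423 entirely different text " * 20
    print(ncd(a, b2), ncd(a, c))  # similar strings score lower than unrelated ones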

According to AIXI theory, a connection more directly explained in Hutter Prize, the best possible
compression of x is the smallest possible software that generates x. For example, in that model, a zip file's
compressed size includes both the zip file and the unzipping software, since one cannot unzip it without
both, but there may be an even smaller combined form.

Examples of AI-powered audio/video compression software include VP9, NVIDIA Maxine, AIVC,
AccMPEG.[23] Examples of software that can perform AI-powered image compression include OpenCV,
TensorFlow, MATLAB's Image Processing Toolbox (IPT) and High-Fidelity Generative Image
Compression.[24]

Data differencing

Data compression can be viewed as a special case of data differencing.[25][26] Data differencing consists of
producing a difference given a source and a target, with patching reproducing the target given a source and
a difference. Since there is no separate source and target in data compression, one can consider data
compression as data differencing with empty source data, the compressed file corresponding to a difference
from nothing. This is the same as considering absolute entropy (corresponding to data compression) as a
special case of relative entropy (corresponding to data differencing) with no initial data.
The term differential compression is used to emphasize the data differencing connection.
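
A sketch of the differencing view using Python's standard difflib (any delta encoder would do; VCDIFF, cited above, is a real-world format):

    import difflib

    source = ["the cat sat on the mat\n", "and looked around\n"]
    target = ["the cat sat on the mat\n", "and fell asleep\n"]

    # The "difference": small when source and target are similar.
    print("".join(difflib.unified_diff(source, target)))

    # "Patching": the target is reproduced from the source plus the delta.
    delta = difflib.ndiff(source, target)
    assert list(difflib.restore(delta, 2)) == target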

Uses

Image

Entropy coding originated in the 1940s with the introduction of Shannon–Fano coding,[27] the basis for
Huffman coding which was developed in 1950.[28] Transform coding dates back to the late 1960s, with the
introduction of fast Fourier transform (FFT) coding in 1968 and the Hadamard transform in 1969.[29]

An important image compression technique is the discrete cosine transform (DCT), a technique developed
in the early 1970s.[15] DCT is the basis for JPEG, a lossy compression format which was introduced by the
Joint Photographic Experts Group (JPEG) in 1992.[30] JPEG greatly reduces the amount of data required to
represent an image at the cost of a relatively small reduction in image quality and has become the most
widely used image file format.[31][32] Its highly efficient DCT-based compression algorithm was largely
responsible for the wide proliferation of digital images and digital photos.[33]

Lempel–Ziv–Welch (LZW) is a lossless compression algorithm developed in 1984. It is used in the GIF
format, introduced in 1987.[34] DEFLATE, a lossless compression algorithm specified in 1996, is used in
the Portable Network Graphics (PNG) format.[35]

Wavelet compression, the use of wavelets in image compression, began after the development of DCT
coding.[36] The JPEG 2000 standard was introduced in 2000.[37] In contrast to the DCT algorithm used by
the original JPEG format, JPEG 2000 instead uses discrete wavelet transform (DWT) algorithms.[38][39][40]
JPEG 2000 technology, which includes the Motion JPEG 2000 extension, was selected as the video coding
standard for digital cinema in 2004.[41]

Audio

Audio data compression, not to be confused with dynamic range compression, has the potential to reduce
the transmission bandwidth and storage requirements of audio data. Audio compression algorithms are
implemented in software as audio codecs. In both lossy and lossless compression, information redundancy
is reduced, using methods such as coding, quantization, DCT and linear prediction to reduce the amount of
information used to represent the uncompressed data.

Lossy audio compression algorithms provide higher compression and are used in numerous audio
applications including Vorbis and MP3. These algorithms almost all rely on psychoacoustics to eliminate or
reduce fidelity of less audible sounds, thereby reducing the space required to store or transmit them.[2][42]

The acceptable trade-off between loss of audio quality and transmission or storage size depends upon the
application. For example, one 640 MB compact disc (CD) holds approximately one hour of uncompressed
high fidelity music, less than 2 hours of music compressed losslessly, or 7 hours of music compressed in the
MP3 format at a medium bit rate. A digital sound recorder can typically store around 200 hours of clearly
intelligible speech in 640 MB.[43]
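
The figures above can be sanity-checked with a little arithmetic, assuming 16-bit stereo PCM at 44.1 kHz, a roughly 2:1 lossless ratio, and about 192 kbit/s as a medium MP3 bitrate:

    bytes_per_second = 44_100 * 2 * 2        # sample rate x channels x bytes per sample
    cd = 640e6                               # 640 MB in bytes
    print(cd / bytes_per_second / 3600)      # ~1.0 hour uncompressed
    print(cd * 2 / bytes_per_second / 3600)  # ~2 hours at a 2:1 lossless ratio
    print(cd * 8 / 192_000 / 3600)           # ~7.4 hours of 192 kbit/s MP3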

Lossless audio compression produces a representation of digital data that can be decoded to an exact digital
duplicate of the original. Compression ratios are around 50–60% of the original size,[44] which is similar to
those for generic lossless data compression. Lossless codecs use curve fitting or linear prediction as a basis
for estimating the signal. Parameters describing the estimation and the difference between the estimation
and the actual signal are coded separately.[45]
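
The predict-and-code-residual structure can be sketched with the simplest possible predictor, the previous sample; real codecs such as FLAC fit higher-order linear predictors per block, but the principle is the same:

    import numpy as np

    signal = np.cumsum(np.random.randint(-3, 4, 1000))  # a slowly varying "waveform"
    prediction = np.concatenate(([0], signal[:-1]))     # predict each sample as the previous one
    residual = signal - prediction                      # small values: cheap to entropy-code

    restored = np.cumsum(residual)                      # the decoder inverts the prediction
    assert np.array_equal(restored, signal)             # lossless round trip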

A number of lossless audio compression formats exist. See list of lossless codecs for a listing. Some formats
are associated with a distinct system, such as Direct Stream Transfer, used in Super Audio CD, and
Meridian Lossless Packing, used in DVD-Audio, Dolby TrueHD, Blu-ray and HD DVD.

Some audio file formats feature a combination of a lossy format and a lossless correction; this allows
stripping the correction to easily obtain a lossy file. Such formats include MPEG-4 SLS (Scalable to
Lossless), WavPack, and OptimFROG DualStream.

When audio files are to be processed, either by further compression or for editing, it is desirable to work
from an unchanged original (uncompressed or losslessly compressed). Processing of a lossily compressed
file for some purpose usually produces a final result inferior to the creation of the same compressed file
from an uncompressed original. In addition to sound editing or mixing, lossless audio compression is often
used for archival storage, or as master copies.

Lossy audio compression

Lossy audio compression is used in a wide range of applications. In
addition to standalone audio-only applications of file playback in
MP3 players or computers, digitally compressed audio streams are
used in most video DVDs, digital television, streaming media on
the Internet, satellite and cable radio, and increasingly in terrestrial
radio broadcasts. Lossy compression typically achieves far greater
compression than lossless compression, by discarding less-critical
data based on psychoacoustic optimizations.[46]

Psychoacoustics recognizes that not all data in an audio stream can
be perceived by the human auditory system. Most lossy
compression reduces redundancy by first identifying perceptually
irrelevant sounds, that is, sounds that are very hard to hear. Typical
examples include high frequencies or sounds that occur at the same
time as louder sounds. Those irrelevant sounds are coded with
decreased accuracy or not at all.

[Figure: Comparison of spectrograms of audio in an uncompressed format and several lossy formats. The lossy spectrograms show bandlimiting of higher frequencies, a common technique associated with lossy audio compression.]

Due to the nature of lossy algorithms, audio quality suffers a
generation loss when a file is decompressed and recompressed. This
makes lossy compression unsuitable for storing the intermediate
results in professional audio engineering applications, such as sound
editing and multitrack recording. However, lossy formats such as
MP3 are very popular with end-users as the file size is reduced to
5–20% of the original size and a megabyte can store about a minute's
worth of music at adequate quality.

Coding methods

To determine what information in an audio signal is perceptually irrelevant, most lossy compression
algorithms use transforms such as the modified discrete cosine transform (MDCT) to convert time domain
sampled waveforms into a transform domain, typically the frequency domain. Once transformed,
component frequencies can be prioritized according to how audible they are. Audibility of spectral
components is assessed using the absolute threshold of hearing and the principles of simultaneous masking
—the phenomenon wherein a signal is masked by another signal separated by frequency—and, in some
cases, temporal masking—where a signal is masked by another signal separated by time. Equal-loudness
contours may also be used to weigh the perceptual importance of components. Models of the human ear-
brain combination incorporating such effects are often called psychoacoustic models.[47]
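
A direct (O(N²)) sketch of the MDCT that maps a frame of 2N samples to N coefficients; production codecs use windowed, overlapped, FFT-based implementations:

    import numpy as np

    def mdct(x):
        n = len(x) // 2
        k = np.arange(n)[:, None]        # output coefficient index
        m = np.arange(2 * n)[None, :]    # input sample index
        basis = np.cos(np.pi / n * (m + 0.5 + n / 2) * (k + 0.5))
        return basis @ x

    frame = np.sin(2 * np.pi * 5 * np.arange(64) / 64)  # a pure tone, 5 cycles per frame
    coeffs = mdct(frame)
    print(np.argmax(np.abs(coeffs)))                    # energy concentrates near bins 4 and 5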

Other types of lossy compressors, such as the linear predictive coding (LPC) used with speech, are source-
based coders. LPC uses a model of the human vocal tract to analyze speech sounds and infer the
parameters used by the model to produce them moment to moment. These changing parameters are
transmitted or stored and used to drive another model in the decoder which reproduces the sound.

Lossy formats are often used for the distribution of streaming audio or interactive communication (such as
in cell phone networks). In such applications, the data must be decompressed as the data flows, rather than
after the entire data stream has been transmitted. Not all audio codecs can be used for streaming
applications.[46]

Latency is introduced by the methods used to encode and decode the data. Some codecs will analyze a
longer segment, called a frame, of the data to optimize efficiency, and then code it in a manner that requires
a larger segment of data at one time to decode. The inherent latency of the coding algorithm can be critical;
for example, when there is a two-way transmission of data, such as with a telephone conversation,
significant delays may seriously degrade the perceived quality.

In contrast to the speed of compression, which is proportional to the number of operations required by the
algorithm, here latency refers to the number of samples that must be analyzed before a block of audio is
processed. In the minimum case, latency is zero samples (e.g., if the coder/decoder simply reduces the
number of bits used to quantize the signal). Time domain algorithms such as LPC also often have low
latencies, hence their popularity in speech coding for telephony. In algorithms such as MP3, however, a
large number of samples have to be analyzed to implement a psychoacoustic model in the frequency
domain, and latency is on the order of 23 ms.
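
The buffering component of that latency is simply frame length divided by sample rate; the numbers below assume an MP3-style 1152-sample frame (total codec delay also depends on filter-bank overlap, which is why the figure cited above differs slightly):

    frame_samples = 1152
    for rate in (44_100, 48_000):
        print(rate, "Hz:", 1000 * frame_samples / rate, "ms")  # ~26.1 ms and 24.0 ms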

Speech encoding

Speech encoding is an important category of audio data compression. The perceptual models used to
estimate what aspects of speech a human ear can hear are generally somewhat different from those used for
music. The range of frequencies needed to convey the sounds of a human voice is normally far narrower
than that needed for music, and the sound is normally less complex. As a result, speech can be encoded at
high quality using a relatively low bit rate.

This is accomplished, in general, by some combination of two approaches:

Only encoding sounds that could be made by a single human voice.

Throwing away more of the data in the signal—keeping just enough to reconstruct an
"intelligible" voice rather than the full frequency range of human hearing.

The earliest algorithms used in speech encoding (and audio data compression in general) were the A-law
algorithm and the μ-law algorithm.
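
A sketch of μ-law companding (G.711 standardizes a piecewise-linear approximation of this curve): the logarithm spends more of the 8-bit code space on quiet samples, where the ear is most sensitive:

    import numpy as np

    MU = 255.0

    def mulaw_compress(x):   # x in [-1, 1]
        return np.sign(x) * np.log1p(MU * np.abs(x)) / np.log1p(MU)

    def mulaw_expand(y):
        return np.sign(y) * np.expm1(np.abs(y) * np.log1p(MU)) / MU

    x = np.linspace(-1.0, 1.0, 9)
    y = np.round(mulaw_compress(x) * 127) / 127  # 8-bit-style quantization
    print(mulaw_expand(y) - x)                   # absolute error grows with amplitude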

History
Early audio research was conducted at Bell Labs. There, in 1950,
C. Chapin Cutler filed the patent on differential pulse-code
modulation (DPCM).[48] In 1973, Adaptive DPCM (ADPCM) was
introduced by P. Cummiskey, Nikil S. Jayant and James L.
Flanagan.[49][50]

[Figure: Solidyne 922: The world's first commercial audio bit compression sound card for PC, 1990.]

Perceptual coding was first used for speech coding compression,
with linear predictive coding (LPC).[51] Initial concepts for LPC
date back to the work of Fumitada Itakura (Nagoya University) and
Shuzo Saito (Nippon Telegraph and Telephone) in 1966.[52]
During the 1970s, Bishnu S. Atal and Manfred R. Schroeder at Bell
Labs developed a form of LPC called adaptive predictive coding
(APC), a perceptual coding algorithm that exploited the masking
properties of the human ear, followed in the early 1980s with the
code-excited linear prediction (CELP) algorithm which achieved a significant compression ratio for its
time.[51] Perceptual coding is used by modern audio compression formats such as MP3[51] and AAC.

Discrete cosine transform (DCT), developed by Nasir Ahmed, T. Natarajan and K. R. Rao in 1974,[16]
provided the basis for the modified discrete cosine transform (MDCT) used by modern audio compression
formats such as MP3,[53] Dolby Digital,[54][55] and AAC.[56] MDCT was proposed by J. P. Princen, A. W.
Johnson and A. B. Bradley in 1987,[57] following earlier work by Princen and Bradley in 1986.[58]

The world's first commercial broadcast automation audio compression system was developed by Oscar
Bonello, an engineering professor at the University of Buenos Aires.[59] In 1983, using the psychoacoustic
principle of the masking of critical bands first published in 1967,[60] he started developing a practical
application based on the recently developed IBM PC computer, and the broadcast automation system was
launched in 1987 under the name Audicom.[61] Thirty-five years later, almost all the radio stations in the
world were using this technology, manufactured by a number of companies, because the inventor refused
to patent his invention, preferring instead to publish it and declare it in the public domain.[62]

A literature compendium for a large variety of audio coding systems was published in the IEEE's Journal
on Selected Areas in Communications (JSAC), in February 1988. While there were some papers from
before that time, this collection documented an entire variety of finished, working audio coders, nearly all
of them using perceptual techniques and some kind of frequency analysis and back-end noiseless
coding.[63]

Video

Uncompressed video requires a very high data rate. Although lossless video compression codecs perform at
a compression factor of 5 to 12, a typical H.264 lossy compression video has a compression factor between
20 and 200.[64]

The two key video compression techniques used in video coding standards are the DCT and motion
compensation (MC). Most video coding standards, such as the H.26x and MPEG formats, typically use
motion-compensated DCT video coding (block motion compensation).[65][66]
Most video codecs are used alongside audio compression techniques to store the separate but
complementary data streams as one combined package using so-called container formats.[67]

Encoding theory

Video data may be represented as a series of still image frames. Such data usually contains abundant
amounts of spatial and temporal redundancy. Video compression algorithms attempt to reduce redundancy
and store information more compactly.

Most video compression formats and codecs exploit both spatial and temporal redundancy (e.g. through
difference coding with motion compensation). Similarities can be encoded by only storing differences
between e.g. temporally adjacent frames (inter-frame coding) or spatially adjacent pixels (intra-frame
coding). Inter-frame compression (a temporal delta encoding) (re)uses data from one or more earlier or later
frames in a sequence to describe the current frame. Intra-frame coding, on the other hand, uses only data
from within the current frame, effectively being still-image compression.[47]
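
Temporal delta encoding in miniature: a static scene makes the inter-frame difference almost entirely zeros, which the entropy coder then represents very cheaply (toy frames, our own construction):

    import numpy as np

    frame1 = np.zeros((4, 4), dtype=np.int16)
    frame2 = frame1.copy()
    frame2[1, 2] = 90                       # a single pixel changed between frames

    delta = frame2 - frame1                 # inter-frame coding stores only this
    assert np.array_equal(frame1 + delta, frame2)    # decoder: previous frame + delta
    print(np.count_nonzero(delta), "of", delta.size, "values are nonzero")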

The intra-frame video coding formats used in camcorders and video editing employ simpler compression
that uses only intra-frame prediction. This simplifies video editing software, as it prevents a situation in
which a compressed frame refers to data that the editor has deleted.

Usually, video compression additionally employs lossy compression techniques like quantization that
reduce aspects of the source data that are (more or less) irrelevant to the human visual perception by
exploiting perceptual features of human vision. For example, small differences in color are more difficult to
perceive than are changes in brightness. Compression algorithms can average a color across these similar
areas in a manner similar to those used in JPEG image compression.[9] As in all lossy compression, there is
a trade-off between video quality and bit rate, cost of processing the compression and decompression, and
system requirements. Highly compressed video may present visible or distracting artifacts.

Methods other than the prevalent DCT-based transform formats, such as fractal compression,
matching pursuit and the use of a discrete wavelet transform (DWT), have been the subject of some
research, but are typically not used in practical products. Wavelet compression is used in still-image coders
and video coders without motion compensation. Interest in fractal compression seems to be waning, due to
recent theoretical analysis showing a comparative lack of effectiveness of such methods.[47]

Inter-frame coding

In inter-frame coding, individual frames of a video sequence are compared from one frame to the next, and
the video compression codec records the differences to the reference frame. If the frame contains areas
where nothing has moved, the system can simply issue a short command that copies that part of the
previous frame into the next one. If sections of the frame move in a simple manner, the compressor can emit
a (slightly longer) command that tells the decompressor to shift, rotate, lighten, or darken the copy. This
longer command still remains much shorter than data generated by intra-frame compression. Usually, the
encoder will also transmit a residue signal which describes the remaining more subtle differences to the
reference imagery. Using entropy coding, these residue signals have a more compact representation than the
full signal. In areas of video with more motion, the compression must encode more data to keep up with the
larger number of pixels that are changing. Commonly during explosions, flames, flocks of animals, and in
some panning shots, the high-frequency detail leads to quality decreases or to increases in the variable
bitrate.
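
The "shift the copy" command corresponds to a motion vector found by block matching; here is a brute-force sketch using the sum of absolute differences (SAD) as the match criterion (frame sizes and search range are arbitrary illustrative choices):

    import numpy as np

    def best_match(prev, block, ref_y=8, ref_x=8, search=4):
        h, w = block.shape
        best = (0, 0, np.inf)
        for dy in range(-search, search + 1):
            for dx in range(-search, search + 1):
                y, x = ref_y + dy, ref_x + dx
                if y < 0 or x < 0 or y + h > prev.shape[0] or x + w > prev.shape[1]:
                    continue
                sad = np.abs(prev[y:y+h, x:x+w] - block).sum()  # match cost
                if sad < best[2]:
                    best = (dy, dx, sad)
        return best

    prev = np.random.randint(0, 256, (24, 24))
    block = prev[10:18, 6:14]          # a block that "moved" by (dy=2, dx=-2)
    print(best_match(prev, block))     # recovers the motion vector with SAD == 0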

Hybrid block-based transform formats


Today, nearly all commonly used video compression methods (e.g., those in standards approved by the
ITU-T or ISO) share the same basic architecture that dates back to H.261, which was standardized in 1988
by the ITU-T. They mostly rely on the DCT, applied to rectangular blocks of neighboring pixels, and
temporal prediction using motion vectors, as well as, nowadays, an in-loop filtering step.

[Figure: Processing stages of a typical video encoder.]

In the prediction stage, various deduplication and difference-coding techniques are applied that help
decorrelate data and describe new data based on already transmitted data.

Then rectangular blocks of remaining pixel data are transformed to the frequency domain. In the main lossy
processing stage, frequency domain data gets quantized in order to reduce information that is irrelevant to
human visual perception.

In the last stage statistical redundancy gets largely eliminated by an entropy coder which often applies some
form of arithmetic coding.

In an additional in-loop filtering stage various filters can be applied to the reconstructed image signal. By
computing these filters also inside the encoding loop they can help compression because they can be
applied to reference material before it gets used in the prediction process and they can be guided using the
original signal. The most popular examples are deblocking filters that blur out blocking artifacts from
quantization discontinuities at transform block boundaries.

History

In 1967, A.H. Robinson and C. Cherry proposed a run-length encoding bandwidth compression scheme
for the transmission of analog television signals.[68] The DCT, which is fundamental to modern video
compression,[69] was introduced by Nasir Ahmed, T. Natarajan and K. R. Rao in 1974.[16][70]

H.261, which debuted in 1988, commercially introduced the prevalent basic architecture of video
compression technology.[71] It was the first video coding format based on DCT compression.[69] H.261
was developed by a number of companies, including Hitachi, PictureTel, NTT, BT and Toshiba.[72]

The most popular video coding standards used for codecs have been the MPEG standards. MPEG-1 was
developed by the Moving Picture Experts Group (MPEG) in 1991, and it was designed to compress VHS-
quality video. It was succeeded in 1994 by MPEG-2/H.262,[71] which was developed by a number of
companies, primarily Sony, Thomson and Mitsubishi Electric.[73] MPEG-2 became the standard video
format for DVD and SD digital television.[71] In 1999, it was followed by MPEG-4/H.263.[71] It was also
developed by a number of companies, primarily Mitsubishi Electric, Hitachi and Panasonic.[74]

H.264/MPEG-4 AVC was developed in 2003 by a number of organizations, primarily Panasonic, Godo
Kaisha IP Bridge and LG Electronics.[75] AVC commercially introduced the modern context-adaptive
binary arithmetic coding (CABAC) and context-adaptive variable-length coding (CAVLC) algorithms.
AVC is the main video encoding standard for Blu-ray Discs, and is widely used by video sharing websites
and streaming internet services such as YouTube, Netflix, Vimeo, and iTunes Store, web software such as
Adobe Flash Player and Microsoft Silverlight, and various HDTV broadcasts over terrestrial and satellite
television.

Genetics

Genetics compression algorithms are the latest generation of lossless algorithms that compress data
(typically sequences of nucleotides) using both conventional compression algorithms and genetic
algorithms adapted to the specific datatype. In 2012, a team of scientists from Johns Hopkins University
published a genetic compression algorithm that does not use a reference genome for compression.
HAPZIPPER was tailored for HapMap data and achieves over 20-fold compression (95% reduction in file
size), providing 2- to 4-fold better compression than the leading general-purpose compression utilities
while being less computationally intensive. For this, Chanda, Elhaik, and Bader introduced MAF-based
encoding (MAFE), which reduces the heterogeneity of the dataset by sorting SNPs by their minor allele
frequency, thus homogenizing the dataset.[76] Other algorithms developed in 2009 and 2013 (DNAZip and
GenomeZip) have compression ratios of up to 1200-fold—allowing 6 billion basepair diploid human
genomes to be stored in 2.5 megabytes (relative to a reference genome or averaged over many
genomes).[77][78] For a benchmark of genetics/genomics data compressors, see [79].

Outlook and currently unused potential


It is estimated that the total amount of data that is stored on the world's storage devices could be further
compressed with existing compression algorithms by a remaining average factor of 4.5:1.[80] The
combined technological capacity of the world to store information was estimated to provide 1,300
exabytes of hardware digits in 2007; when the corresponding content is optimally compressed, this
represents only 295 exabytes of Shannon information.[81]

See also
HTTP compression
Kolmogorov complexity
Lhasa (computing)
Minimum description length
Modulo-N code
Motion coding
Range coding
Set redundancy compression
Sub-band coding
Universal code (data compression)
Vector quantization

References
1. Wade, Graham (1994). Signal coding and processing (https://books.google.com/books?id=C
JswCy7_W8YC) (2 ed.). Cambridge University Press. p. 34. ISBN 978-0-521-42336-6.
Retrieved 2011-12-22. "The broad objective of source coding is to exploit or remove
'inefficient' redundancy in the PCM source and thereby achieve a reduction in the overall
source rate R."
2. Mahdi, O.A.; Mohammed, M.A.; Mohamed, A.J. (November 2012). "Implementing a Novel
Approach an Convert Audio Compression to Text Coding via Hybrid Technique" (http://ijcsi.o
rg/papers/IJCSI-9-6-3-53-59.pdf) (PDF). International Journal of Computer Science Issues. 9
(6, No. 3): 53–59. Archived (https://web.archive.org/web/20130320103924/http://ijcsi.org/pap
ers/IJCSI-9-6-3-53-59.pdf) (PDF) from the original on 2013-03-20. Retrieved 6 March 2013.
3. Pujar, J.H.; Kadlaskar, L.M. (May 2010). "A New Lossless Method of Image Compression
and Decompression Using Huffman Coding Techniques" (http://www.jatit.org/volumes/resea
rch-papers/Vol15No1/3Vol15No1.pdf) (PDF). Journal of Theoretical and Applied Information
Technology. 15 (1): 18–23. Archived (https://web.archive.org/web/20100524105217/http://w
ww.jatit.org/volumes/research-papers/Vol15No1/3Vol15No1.pdf) (PDF) from the original on
2010-05-24.
4. Salomon, David (2008). A Concise Introduction to Data Compression. Berlin: Springer.
ISBN 9781848000728.
5. Tank, M.K. (2011). "Implementation of Lempel-ZIV algorithm for lossless compression using
VHDL". Implementation of Limpel-Ziv algorithm for lossless compression using VHDL.
Thinkquest 2010: Proceedings of the First International Conference on Contours of
Computing Technology. Berlin: Springer. pp. 275–283. doi:10.1007/978-81-8489-989-4_51
(https://doi.org/10.1007%2F978-81-8489-989-4_51). ISBN 978-81-8489-988-7.
6. Navqi, Saud; Naqvi, R.; Riaz, R.A.; Siddiqui, F. (April 2011). "Optimized RTL design and
implementation of LZW algorithm for high bandwidth applications" (http://pe.org.pl/articles/20
11/4/68.pdf) (PDF). Electrical Review. 2011 (4): 279–285. Archived (https://web.archive.org/
web/20130520105146/http://pe.org.pl/articles/2011/4/68.pdf) (PDF) from the original on
2013-05-20.
7. Wolfram, Stephen (2002). A New Kind of Science (https://www.wolframscience.com/nks/p1069
--data-compression/). Champaign, IL. p. 1069. ISBN 1-57955-008-8.
8. Mahmud, Salauddin (March 2012). "An Improved Data Compression Method for General
Data" (http://www.ijser.org/researchpaper%5CAn-Improved-Data-Compression-Method-for-
General-Data.pdf) (PDF). International Journal of Scientific & Engineering Research. 3 (3):
2. Archived (https://web.archive.org/web/20131102022116/http://www.ijser.org/researchpape
r%5CAn-Improved-Data-Compression-Method-for-General-Data.pdf) (PDF) from the original
on 2013-11-02. Retrieved 6 March 2013.
9. Lane, Tom. "JPEG Image Compression FAQ, Part 1" (http://www.faqs.org/faqs/jpeg-faq/part
1/). Internet FAQ Archives. Independent JPEG Group. Retrieved 6 March 2013.
10. G. J. Sullivan; J.-R. Ohm; W.-J. Han; T. Wiegand (December 2012). "Overview of the High
Efficiency Video Coding (HEVC) Standard" (https://doi.org/10.1109%2FTCSVT.2012.22211
91). IEEE Transactions on Circuits and Systems for Video Technology. IEEE. 22 (12): 1649–
1668. doi:10.1109/TCSVT.2012.2221191 (https://doi.org/10.1109%2FTCSVT.2012.222119
1).
11. "How to choose optimal archiving settings – WinRAR" (https://www.winrar-france.fr/winrar_i
nstructions_for_use/source/html/HELPArcOptimal.htm).
12. "(Set compression Method) switch – 7zip" (https://sevenzip.osdn.jp/chm/cmdline/switches/m
ethod.htm).
13. Wolfram, Stephen (2002). A New Kind of Science (https://archive.org/details/newkindofscien
ce00wolf/page/1069). Wolfram Media, Inc. p. 1069 (https://archive.org/details/newkindofscie
nce00wolf/page/1069). ISBN 978-1-57955-008-0.
14. Arcangel, Cory. "On Compression" (http://www.coryarcangel.com/downloads/Cory-Arcangel-
OnC.pdf) (PDF). Archived (https://web.archive.org/web/20130728082920/http://www.coryarc
angel.com/downloads/Cory-Arcangel-OnC.pdf) (PDF) from the original on 2013-07-28.
Retrieved 6 March 2013.
15. Ahmed, Nasir (January 1991). "How I Came Up With the Discrete Cosine Transform" (https://
www.scribd.com/doc/52879771/DCT-History-How-I-Came-Up-with-the-Discrete-Cosine-Tra
nsform). Digital Signal Processing. 1 (1): 4–5. doi:10.1016/1051-2004(91)90086-Z (https://do
i.org/10.1016%2F1051-2004%2891%2990086-Z).
16. Nasir Ahmed; T. Natarajan; Kamisetty Ramamohan Rao (January 1974). "Discrete Cosine
Transform" (https://www.ic.tu-berlin.de/fileadmin/fg121/Source-Coding_WS12/selected-readi
ngs/Ahmed_et_al.__1974.pdf) (PDF). IEEE Transactions on Computers. C-23 (1): 90–93.
doi:10.1109/T-C.1974.223784 (https://doi.org/10.1109%2FT-C.1974.223784).
S2CID 149806273 (https://api.semanticscholar.org/CorpusID:149806273). Archived (https://
web.archive.org/web/20161208075733/https://www.ic.tu-berlin.de/fileadmin/fg121/Source-C
oding_WS12/selected-readings/Ahmed_et_al.__1974.pdf) (PDF) from the original on 2016-
12-08.
17. CCITT Study Group VIII und die Joint Photographic Experts Group (JPEG) von ISO/IEC
Joint Technical Committee 1/Subcommittee 29/Working Group 10 (1993), "Annex D –
Arithmetic coding", Recommendation T.81: Digital Compression and Coding of Continuous-
tone Still images – Requirements and guidelines (https://www.w3.org/Graphics/JPEG/itu-t8
1.pdf) (PDF), pp. 54 ff, retrieved 2009-11-07
18. Marak, Laszlo. "On image compression" (https://web.archive.org/web/20150528012028/htt
p://www.ujoimro.com/resources/Laszlo_Marak_image_compression.pdf) (PDF). University
of Marne la Vallee. Archived from the original (http://www.ujoimro.com/resources/Laszlo_Mar
ak_image_compression.pdf) (PDF) on 28 May 2015. Retrieved 6 March 2013.
19. Mahoney, Matt. "Rationale for a Large Text Compression Benchmark" (http://cs.fit.edu/~mma
honey/compression/rationale.html). Florida Institute of Technology. Retrieved 5 March 2013.
20. Shmilovici A.; Kahiri Y.; Ben-Gal I.; Hauser S. (2009). "Measuring the Efficiency of the
Intraday Forex Market with a Universal Data Compression Algorithm" (http://www.eng.tau.ac.
il/~bengal/28.pdf) (PDF). Computational Economics. 33 (2): 131–154.
CiteSeerX 10.1.1.627.3751 (https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.627.
3751). doi:10.1007/s10614-008-9153-3 (https://doi.org/10.1007%2Fs10614-008-9153-3).
S2CID 17234503 (https://api.semanticscholar.org/CorpusID:17234503). Archived (https://we
b.archive.org/web/20090709143601/http://www.eng.tau.ac.il/~bengal/28.pdf) (PDF) from the
original on 2009-07-09.
21. I. Ben-Gal (2008). "On the Use of Data Compression Measures to Analyze Robust Designs"
(http://www.eng.tau.ac.il/~bengal/Journal%20Paper.pdf) (PDF). IEEE Transactions on
Reliability. 54 (3): 381–388. doi:10.1109/TR.2005.853280 (https://doi.org/10.1109%2FTR.20
05.853280). S2CID 9376086 (https://api.semanticscholar.org/CorpusID:9376086).
22. D. Scully; Carla E. Brodley (2006). "Compression and machine learning: A new perspective
on feature space vectors". Data Compression Conference, 2006: 332.
doi:10.1109/DCC.2006.13 (https://doi.org/10.1109%2FDCC.2006.13). ISBN 0-7695-2545-8.
S2CID 12311412 (https://api.semanticscholar.org/CorpusID:12311412).
23. Gary Adcock (January 5, 2023). "What Is AI Video Compression?" (https://massive.io/file-tran
sfer/what-is-ai-video-compression/). massive.io. Retrieved 6 April 2023.
24. Gilad David Maayan (Nov 24, 2021). "AI-Based Image Compression: The State of the Art" (h
ttps://towardsdatascience.com/ai-based-image-compression-the-state-of-the-art-fb5aa6042b
fa). Towards Data Science. Retrieved 6 April 2023.
25. Korn, D.; et al. (July 2002). "RFC 3284: The VCDIFF Generic Differencing and Compression
Data Format" (http://tools.ietf.org/html/rfc3284). Internet Engineering Task Force. Retrieved
5 March 2013.
26. Korn, D.G.; Vo, K.P. (1995). B. Krishnamurthy (ed.). Vdelta: Differencing and Compression.
Practical Reusable Unix Software. New York: John Wiley & Sons, Inc.
27. Claude Elwood Shannon (1948). Alcatel-Lucent (ed.). "A Mathematical Theory of
Communication" (http://www.math.harvard.edu/~ctm/home/text/others/shannon/entropy/entro
py.pdf) (PDF). Bell System Technical Journal. 27 (3–4): 379–423, 623–656.
doi:10.1002/j.1538-7305.1948.tb01338.x (https://doi.org/10.1002%2Fj.1538-7305.1948.tb01
338.x). hdl:11858/00-001M-0000-002C-4314-2 (https://hdl.handle.net/11858%2F00-001M-0
000-002C-4314-2). Archived (https://web.archive.org/web/20110524064232/http://math.harv
ard.edu/~ctm/home/text/others/shannon/entropy/entropy.pdf) (PDF) from the original on
2011-05-24. Retrieved 2019-04-21.
28. David Albert Huffman (September 1952), "A method for the construction of minimum-
redundancy codes" (http://compression.ru/download/articles/huff/huffman_1952_minimum-re
dundancy-codes.pdf) (PDF), Proceedings of the IRE, vol. 40, no. 9, pp. 1098–1101,
doi:10.1109/JRPROC.1952.273898 (https://doi.org/10.1109%2FJRPROC.1952.273898),
archived (https://web.archive.org/web/20051008115257/http://compression.ru/download/arti
cles/huff/huffman_1952_minimum-redundancy-codes.pdf) (PDF) from the original on 2005-
10-08
29. Pratt, W.K.; Kane, J.; Andrews, H.C. (1969). "Hadamard transform image coding".
Proceedings of the IEEE. 57: 58–68. doi:10.1109/PROC.1969.6869 (https://doi.org/10.110
9%2FPROC.1969.6869).
30. "T.81 – DIGITAL COMPRESSION AND CODING OF CONTINUOUS-TONE STILL IMAGES
– REQUIREMENTS AND GUIDELINES" (https://www.w3.org/Graphics/JPEG/itu-t81.pdf)
(PDF). CCITT. September 1992. Retrieved 12 July 2019.
31. "The JPEG image format explained" (https://web.archive.org/web/20190805194553/https://h
ome.bt.com/tech-gadgets/photography/what-is-a-jpeg-11364206889349). BT.com. BT
Group. 31 May 2018. Archived from the original (https://home.bt.com/tech-gadgets/photograp
hy/what-is-a-jpeg-11364206889349) on 5 August 2019. Retrieved 5 August 2019.
32. Baraniuk, Chris (15 October 2015). "Copy protections could come to JPEGs" (https://www.bb
c.co.uk/news/technology-34538705). BBC News. BBC. Retrieved 13 September 2019.
33. "What Is a JPEG? The Invisible Object You See Every Day" (https://www.theatlantic.com/tec
hnology/archive/2013/09/what-is-a-jpeg-the-invisible-object-you-see-every-day/279954/).
The Atlantic. 24 September 2013. Retrieved 13 September 2019.
34. "The GIF Controversy: A Software Developer's Perspective" (https://mike.pub/19950127-gif-l
zw). 27 January 1995. Retrieved 26 May 2015.
35. L. Peter Deutsch (May 1996). DEFLATE Compressed Data Format Specification version 1.3
(https://datatracker.ietf.org/doc/html/rfc1951#section-Abstract). IETF. p. 1. sec. Abstract.
doi:10.17487/RFC1951 (https://doi.org/10.17487%2FRFC1951). RFC 1951 (https://datatrac
ker.ietf.org/doc/html/rfc1951). Retrieved 2014-04-23.
36. Hoffman, Roy (2012). Data Compression in Digital Systems (https://books.google.com/book
s?id=FOfTBwAAQBAJ). Springer Science & Business Media. p. 124.
ISBN 9781461560319. "Basically, wavelet coding is a variant on DCT-based transform
coding that reduces or eliminates some of its limitations. (...) Another advantage is that rather
than working with 8 × 8 blocks of pixels, as do JPEG and other block-based DCT
techniques, wavelet coding can simultaneously compress the entire image."
37. Taubman, David; Marcellin, Michael (2012). JPEG2000 Image Compression Fundamentals,
Standards and Practice: Image Compression Fundamentals, Standards and Practice (http
s://books.google.com/books?id=y7HeBwAAQBAJ&pg=PA402). Springer Science &
Business Media. ISBN 9781461507994.
38. Unser, M.; Blu, T. (2003). "Mathematical properties of the JPEG2000 wavelet filters" (http://inf
oscience.epfl.ch/record/63104). IEEE Transactions on Image Processing. 12 (9): 1080–
1090. Bibcode:2003ITIP...12.1080U (https://ui.adsabs.harvard.edu/abs/2003ITIP...12.1080
U). doi:10.1109/TIP.2003.812329 (https://doi.org/10.1109%2FTIP.2003.812329).
PMID 18237979 (https://pubmed.ncbi.nlm.nih.gov/18237979). S2CID 2765169 (https://api.se
manticscholar.org/CorpusID:2765169).
39. Sullivan, Gary (8–12 December 2003). "General characteristics and design considerations
for temporal subband video coding" (https://www.itu.int/wftp3/av-arch/video-site/0312_Wai/V
CEG-U06.doc). ITU-T. Video Coding Experts Group. Retrieved 13 September 2019.
40. Bovik, Alan C. (2009). The Essential Guide to Video Processing (https://books.google.com/b
ooks?id=wXmSPPB_c_0C&pg=PA355). Academic Press. p. 355. ISBN 9780080922508.
41. Swartz, Charles S. (2005). Understanding Digital Cinema: A Professional Handbook (https://
books.google.com/books?id=tYw3ehoBnjkC&pg=PA147). Taylor & Francis. p. 147.
ISBN 9780240806174.
42. Cunningham, Stuart; McGregor, Iain (2019). "Subjective Evaluation of Music Compressed
with the ACER Codec Compared to AAC, MP3, and Uncompressed PCM" (https://doi.org/1
0.1155%2F2019%2F8265301). International Journal of Digital Multimedia Broadcasting.
2019: 1–16. doi:10.1155/2019/8265301 (https://doi.org/10.1155%2F2019%2F8265301).
43. The Olympus WS-120 digital speech recorder, according to its manual, can store about 178
hours of speech-quality audio in .WMA format in 500 MB of flash memory.
44. Coalson, Josh. "FLAC Comparison" (https://xiph.org/flac/comparison.html). Retrieved
2020-08-23.
45. "Format overview" (https://xiph.org/flac/documentation_format_overview.html). Retrieved
2020-08-23.
46. Jaiswal, R.C. (2009). Audio-Video Engineering. Pune, Maharashtra: Nirali Prakashan.
p. 3.41. ISBN 9788190639675.
47. Faxin Yu; Hao Luo; Zheming Lu (2010). Three-Dimensional Model Analysis and Processing
(https://archive.org/details/threedimensional00yufa). Berlin: Springer. p. 47 (https://archive.or
g/details/threedimensional00yufa/page/n62). ISBN 9783642126512.
48. US patent 2605361 (https://worldwide.espacenet.com/textdoc?DB=EPODOC&IDX=US2605
361), C. Chapin Cutler, "Differential Quantization of Communication Signals", issued 1952-
07-29
49. Cummiskey, P.; Jayant, N. S.; Flanagan, J. L. (1973). "Adaptive Quantization in Differential
PCM Coding of Speech". Bell System Technical Journal. 52 (7): 1105–1118.
doi:10.1002/j.1538-7305.1973.tb02007.x (https://doi.org/10.1002%2Fj.1538-7305.1973.tb02
007.x).
50. Cummiskey, P.; Jayant, Nikil S.; Flanagan, J. L. (1973). "Adaptive quantization in differential
PCM coding of speech". The Bell System Technical Journal. 52 (7): 1105–1118.
doi:10.1002/j.1538-7305.1973.tb02007.x (https://doi.org/10.1002%2Fj.1538-7305.1973.tb02
007.x). ISSN 0005-8580 (https://www.worldcat.org/issn/0005-8580).
51. Schroeder, Manfred R. (2014). "Bell Laboratories" (https://books.google.com/books?id=d9Ik
BAAAQBAJ&pg=PA388). Acoustics, Information, and Communication: Memorial Volume in
Honor of Manfred R. Schroeder. Springer. p. 388. ISBN 9783319056609.
52. Gray, Robert M. (2010). "A History of Realtime Digital Speech on Packet Networks: Part II of
Linear Predictive Coding and the Internet Protocol" (https://ee.stanford.edu/~gray/lpcip.pdf)
(PDF). Found. Trends Signal Process. 3 (4): 203–303. doi:10.1561/2000000036 (https://doi.
org/10.1561%2F2000000036). ISSN 1932-8346 (https://www.worldcat.org/issn/1932-8346).
Archived (https://web.archive.org/web/20100704113551/http://ee.stanford.edu/~gray/lpcip.pd
f) (PDF) from the original on 2010-07-04.
53. Guckert, John (Spring 2012). "The Use of FFT and MDCT in MP3 Audio Compression" (htt
p://www.math.utah.edu/~gustafso/s2012/2270/web-projects/Guckert-audio-compression-svd
-mdct-MP3.pdf) (PDF). University of Utah. Archived (https://web.archive.org/web/201401241
52337/http://www.math.utah.edu/~gustafso/s2012/2270/web-projects/Guckert-audio-compre
ssion-svd-mdct-MP3.pdf) (PDF) from the original on 2014-01-24. Retrieved 14 July 2019.
54. Luo, Fa-Long (2008). Mobile Multimedia Broadcasting Standards: Technology and Practice
(https://books.google.com/books?id=l6PovWat8SMC&pg=PA590). Springer Science &
Business Media. p. 590. ISBN 9780387782638.
55. Britanak, V. (2011). "On Properties, Relations, and Simplified Implementation of Filter Banks
in the Dolby Digital (Plus) AC-3 Audio Coding Standards". IEEE Transactions on Audio,
Speech, and Language Processing. 19 (5): 1231–1241. doi:10.1109/TASL.2010.2087755 (h
ttps://doi.org/10.1109%2FTASL.2010.2087755). S2CID 897622 (https://api.semanticscholar.
org/CorpusID:897622).
56. Brandenburg, Karlheinz (1999). "MP3 and AAC Explained" (http://graphics.ethz.ch/teaching/
mmcom12/slides/mp3_and_aac_brandenburg.pdf) (PDF). Archived (https://web.archive.org/
web/20170213191747/https://graphics.ethz.ch/teaching/mmcom12/slides/mp3_and_aac_br
andenburg.pdf) (PDF) from the original on 2017-02-13.
57. Princen, J.; Johnson, A.; Bradley, A. (1987). "Subband/Transform coding using filter bank
designs based on time domain aliasing cancellation". ICASSP '87. IEEE International
Conference on Acoustics, Speech, and Signal Processing. Vol. 12. pp. 2161–2164.
doi:10.1109/ICASSP.1987.1169405 (https://doi.org/10.1109%2FICASSP.1987.1169405).
S2CID 58446992 (https://api.semanticscholar.org/CorpusID:58446992).
58. Princen, J.; Bradley, A. (1986). "Analysis/Synthesis filter bank design based on time domain
aliasing cancellation". IEEE Transactions on Acoustics, Speech, and Signal Processing. 34
(5): 1153–1161. doi:10.1109/TASSP.1986.1164954 (https://doi.org/10.1109%2FTASSP.198
6.1164954).
59. "Ricardo Sametband, La Nación Newspaper "Historia de un pionero en audio digital" " (http
s://www.lanacion.com.ar/tecnologia/la-historia-de-un-pionero-del-audio-digital-nid187775)
(in Spanish).
60. Zwicker, Eberhard; et al. (1967). The Ear As A Communication Receiver (https://web.archive.
org/web/20000914080525/http://asa.aip.org/books/ear.html). Melville, NY: Acoustical Society
of America. Archived from the original (http://asa.aip.org/books/ear.html) on 2000-09-14.
Retrieved 2011-11-11.
61. "Summary of some of Solidyne's contributions to Broadcast Engineering" (https://web.archiv
e.org/web/20130308063719/http://www.solidynepro.com/indexahtmlp_Hist-ENG%2Ct.htm).
Brief History of Solidyne. Buenos Aires: Solidyne. Archived from the original (http://www.soli
dynepro.com/nosotros-breve-historia/) on 8 March 2013. Retrieved 6 March 2013.
62. "Anuncio del Audicom, AES Journal,July-August 1992, Vol 40, # 7/8, pag 647" (http://www.a
es.org/e-lib/browse.cfm?elib=19076).
63. "File Compression Possibilities" (https://www.gadgetcouncil.com/compress-1GB-files-into-1
0-mb/). A Brief guide to compress a file in 4 different ways. 17 February 2017.
64. Dmitriy Vatolin; et al. (Graphics & Media Lab Video Group) (March 2007). Lossless Video Codecs Comparison '2007 (http://compression.ru/video/codec_comparison/pdf/msu_lossless_codecs_comparison_2007_eng.pdf) (PDF) (Report). Moscow State University. Archived (https://web.archive.org/web/20080515091507/http://www.compression.ru/video/codec_comparison/pdf/msu_lossless_codecs_comparison_2007_eng.pdf) (PDF) from the original on 2008-05-15.
65. Chen, Jie; Koc, Ut-Va; Liu, KJ Ray (2001). Design of Digital Video Coding Systems: A Complete Compressed Domain Approach (https://books.google.com/books?id=LUzFKU3HeegC&pg=PA71). CRC Press. p. 71. ISBN 9780203904183.
66. Li, Jian Ping (2006). Proceedings of the International Computer Conference 2006 on Wavelet Active Media Technology and Information Processing: Chongqing, China, 29–31 August 2006 (https://books.google.com/books?id=FZiK3zXdK7sC&pg=PA847). World Scientific. p. 847. ISBN 9789812709998.
67. "Video Coding" (https://web.archive.org/web/20130523194345/http://csip.ece.gatech.edu/drupal7/?q=technical-area%2Fvideo-coding). CSIP website. Center for Signal and Information Processing, Georgia Institute of Technology. Archived from the original (http://csip.ece.gatech.edu/drupal7/?q=technical-area/video-coding) on 23 May 2013. Retrieved 6 March 2013.
68. Robinson, A. H.; Cherry, C. (1967). "Results of a prototype television bandwidth compression scheme". Proceedings of the IEEE. IEEE. 55 (3): 356–364. doi:10.1109/PROC.1967.5493 (https://doi.org/10.1109%2FPROC.1967.5493).
69. Ghanbari, Mohammed (2003). Standard Codecs: Image Compression to Advanced Video Coding (https://books.google.com/books?id=7XuU8T3ooOAC&pg=PA1). Institution of Engineering and Technology. pp. 1–2. ISBN 9780852967102.
70. Reader, Cliff (2016-08-31). "Patent landscape for royalty-free video coding" (https://web.archive.org/web/20161208075738/https://www.youtube.com/watch?v=wi1BefrfTos). In Tescher, Andrew G (ed.). Applications of Digital Image Processing XXXIX. Vol. 9971. San Diego, California: Society of Photo-Optical Instrumentation Engineers. p. 99711B. Bibcode:2016SPIE.9971E..1BR (https://ui.adsabs.harvard.edu/abs/2016SPIE.9971E..1BR). doi:10.1117/12.2239493 (https://doi.org/10.1117%2F12.2239493). Archived from the original (https://www.youtube.com/watch?v=wi1BefrfTos) on 2016-12-08. Lecture recording, from 3:05:10.
71. "The History of Video File Formats Infographic — RealPlayer" (http://www.real.com/resource
s/digital-video-file-formats/). 22 April 2012.
72. "Patent statement declaration registered as H261-07" (https://www.itu.int/net4/ipr/details_ps.
aspx?sector=ITU-T&id=H261-07). ITU. Retrieved 11 July 2019.
73. "MPEG-2 Patent List" (https://www.mpegla.com/wp-content/uploads/m2-att1.pdf) (PDF).
MPEG LA. Archived (https://web.archive.org/web/20190529164140/https://www.mpegla.co
m/wp-content/uploads/m2-att1.pdf) (PDF) from the original on 2019-05-29. Retrieved 7 July
2019.
74. "MPEG-4 Visual - Patent List" (https://www.mpegla.com/wp-content/uploads/m4v-att1.pdf)
(PDF). MPEG LA. Archived (https://web.archive.org/web/20190706184528/https://www.mpe
gla.com/wp-content/uploads/m4v-att1.pdf) (PDF) from the original on 2019-07-06. Retrieved
6 July 2019.
75. "AVC/H.264 – Patent List" (https://www.mpegla.com/wp-content/uploads/avc-att1.pdf) (PDF).
MPEG LA. Retrieved 6 July 2019.
76. Chanda P, Bader JS, Elhaik E (27 Jul 2012). "HapZipper: sharing HapMap populations just got easier" (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3488212). Nucleic Acids Research. 40 (20): e159. doi:10.1093/nar/gks709 (https://doi.org/10.1093%2Fnar%2Fgks709). PMC 3488212 (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3488212). PMID 22844100 (https://pubmed.ncbi.nlm.nih.gov/22844100).
77. Christley S, Lu Y, Li C, Xie X (Jan 15, 2009). "Human genomes as email attachments" (https://doi.org/10.1093%2Fbioinformatics%2Fbtn582). Bioinformatics. 25 (2): 274–5. doi:10.1093/bioinformatics/btn582 (https://doi.org/10.1093%2Fbioinformatics%2Fbtn582). PMID 18996942 (https://pubmed.ncbi.nlm.nih.gov/18996942).
78. Pavlichin DS, Weissman T, Yona G (September 2013). "The human genome contracts again" (https://doi.org/10.1093%2Fbioinformatics%2Fbtt362). Bioinformatics. 29 (17): 2199–202. doi:10.1093/bioinformatics/btt362 (https://doi.org/10.1093%2Fbioinformatics%2Fbtt362). PMID 23793748 (https://pubmed.ncbi.nlm.nih.gov/23793748).
79. Hosseini, Morteza; Pratas, Diogo; Pinho, Armando (2016). "A Survey on Data Compression Methods for Biological Sequences" (https://doi.org/10.3390%2Finfo7040056). Information. 7 (4): 56. doi:10.3390/info7040056 (https://doi.org/10.3390%2Finfo7040056).
80. "Data Compression via Logic Synthesis" (https://infoscience.epfl.ch/record/188645/files/06742961.pdf) (PDF).
81. Hilbert, Martin; López, Priscila (1 April 2011). "The World's Technological Capacity to Store, Communicate, and Compute Information". Science. 332 (6025): 60–65. Bibcode:2011Sci...332...60H (https://ui.adsabs.harvard.edu/abs/2011Sci...332...60H). doi:10.1126/science.1200970 (https://doi.org/10.1126%2Fscience.1200970). PMID 21310967 (https://pubmed.ncbi.nlm.nih.gov/21310967). S2CID 206531385 (https://api.semanticscholar.org/CorpusID:206531385).
External links
"Part 3: Video compression" (http://dvd-hq.info/data_compression_3.php), Data
Compression Basics
Pierre Larbier, Using 10-bit AVC/H.264 Encoding with 4:2:2 for Broadcast Contribution (http
s://web.archive.org/web/20090905092232/http://extranet.ateme.com/download.php?file=111
4), Ateme, archived from the original (http://extranet.ateme.com/download.php?file=1114) on
2009-09-05
Why does 10-bit save bandwidth (even when content is 8-bit)? (https://web.archive.org/web/
20170830224011/http://extranet.ateme.com/download.php?file=1194) at the Wayback
Machine (archived 2017-08-30)
Which compression technology should be used? (https://web.archive.org/web/20170830224
021/http://extranet.ateme.com/download.php?file=1196) at the Wayback Machine (archived
2017-08-30)
Introduction to Compression Theory (http://media.wiley.com/product_data/excerpt/99/047051
84/0470518499.pdf) (PDF), Wiley, archived (https://web.archive.org/web/20070928023157/h
ttp://media.wiley.com/product_data/excerpt/99/04705184/0470518499.pdf) (PDF) from the
original on 2007-09-28
EBU subjective listening tests on low-bitrate audio codecs (http://tech.ebu.ch/docs/tech/tech
3296.pdf)
Audio Archiving Guide: Music Formats (http://techgage.com/article/audio_archiving_guide_p
art_1_-_music_formats/) (Guide for helping a user pick out the right codec)
MPEG 1&2 video compression intro (pdf format) (https://web.archive.org/web/200709280231
57/http://mia.ece.uic.edu/~papers/WWW/MultimediaStandards/chapter7.pdf) at the Wayback
Machine (archived September 28, 2007)
hydrogenaudio wiki comparison (http://wiki.hydrogenaud.io/index.php?title=Lossless_comp
arison)
Introduction to Data Compression (https://www.cs.cmu.edu/afs/cs/project/pscico-guyb/realwo
rld/www/compression.pdf) by Guy E Blelloch from CMU
Explanation of lossless signal compression method used by most codecs (http://www.monke
ysaudio.com/theory.html)
Videsignline – Intro to Video Compression (https://web.archive.org/web/20100315021124/ht
tp://www.videsignline.com/howto/showArticle.jhtml?articleID=185301351) at the Wayback
Machine (archived 2010-03-15)
Data Footprint Reduction Technology (https://web.archive.org/web/20130527124650/http://p
ublic.dhe.ibm.com/common/ssi/ecm/en/tsu12345usen/TSU12345USEN.PDF) at the
Wayback Machine (archived 2013-05-27)
What is Run length Coding in video compression (http://siliconmentor.blogspot.in/2014/12/w
hat-is-run-length-coding-in-video.html)