0% found this document useful (0 votes)
11 views7 pages

Audio Coding Format

An audio coding format is a representation format for storing or transmitting digital audio, with examples including MP3, AAC, and FLAC. These formats can be categorized as lossless, lossy, or uncompressed, with lossy formats being the most common for streaming due to their smaller size. The document also discusses the history and technical specifications of various audio coding formats and codecs.

Uploaded by

Ranjith
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views7 pages

Audio Coding Format

An audio coding format is a representation format for storing or transmitting digital audio, with examples including MP3, AAC, and FLAC. These formats can be categorized as lossless, lossy, or uncompressed, with lossy formats being the most common for streaming due to their smaller size. The document also discusses the history and technical specifications of various audio coding formats and codecs.

Uploaded by

Ranjith
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Audio coding format

An audio coding format[1] (or sometimes audio


compression format) is a content representation format for
storage or transmission of digital audio (such as in digital
television, digital radio and in audio and video files).
Examples of audio coding formats include MP3, AAC,
Vorbis, FLAC, and Opus. A specific software or hardware
implementation capable of audio compression and
decompression to/from a specific audio coding format is
called an audio codec; an example of an audio codec is
LAME, which is one of several different codecs which Comparison of coding efficiency between
implements encoding and decoding audio in the MP3 audio popular audio formats
coding format in software.

Some audio coding formats are documented by a detailed technical specification document known as an
audio coding specification. Some such specifications are written and approved by standardization
organizations as technical standards, and are thus known as an audio coding standard. The term
"standard" is also sometimes used for de facto standards as well as formal standards.

Audio content encoded in a particular audio coding format is normally encapsulated within a container
format. As such, the user normally doesn't have a raw AAC file, but instead has a .m4a audio file, which
is a MPEG-4 Part 14 container containing AAC-encoded audio. The container also contains metadata
such as title and other tags, and perhaps an index for fast seeking.[2] A notable exception is MP3 files,
which are raw audio coding without a container format. De facto standards for adding metadata tags such
as title and artist to MP3s, such as ID3, are hacks which work by appending the tags to the MP3, and then
relying on the MP3 player to recognize the chunk as malformed audio coding and therefore skip it. In
video files with audio, the encoded audio content is bundled with video (in a video coding format) inside
a multimedia container format.

An audio coding format does not dictate all algorithms used by a codec implementing the format. An
important part of how lossy audio compression works is by removing data in ways humans can't hear,
according to a psychoacoustic model; the implementer of an encoder has some freedom of choice in
which data to remove (according to their psychoacoustic model).

Lossless, lossy, and uncompressed audio coding formats


A lossless audio coding format reduces the total data needed to represent a sound but can be de-coded to
its original, uncompressed form. A lossy audio coding format additionally reduces the bit resolution of the
sound on top of compression, which results in far less data at the cost of irretrievably lost information.
Transmitted (streamed) audio is most often compressed using lossy audio codecs as the smaller size is far
more convenient for distribution. The most widely used audio coding formats are MP3 and Advanced
Audio Coding (AAC), both of which are lossy formats based on modified discrete cosine transform
(MDCT) and perceptual coding algorithms.

Lossless audio coding formats such as FLAC and Apple Lossless are sometimes available, though at the
cost of larger files.

Uncompressed audio formats, such as pulse-code modulation (PCM, or .wav), are also sometimes used.
PCM was the standard format for Compact Disc Digital Audio (CDDA).

History
In 1950, Bell Labs filed the patent on differential pulse-code
modulation (DPCM).[3] Adaptive DPCM (ADPCM) was
introduced by P. Cummiskey, Nikil S. Jayant and James L.
Flanagan at Bell Labs in 1973.[4][5]

Perceptual coding was first used for speech coding


compression, with linear predictive coding (LPC).[6] Initial
concepts for LPC date back to the work of Fumitada Itakura
(Nagoya University) and Shuzo Saito (Nippon Telegraph and
Telephone) in 1966.[7] During the 1970s, Bishnu S. Atal and
Solidyne 922: The world's first
Manfred R. Schroeder at Bell Labs developed a form of LPC
commercial audio bit compression sound
called adaptive predictive coding (APC), a perceptual coding card for PC, 1990
algorithm that exploited the masking properties of the human
ear, followed in the early 1980s with the code-excited linear
prediction (CELP) algorithm which achieved a significant compression ratio for its time.[6] Perceptual
coding is used by modern audio compression formats such as MP3[6] and AAC.

Discrete cosine transform (DCT), developed by Nasir Ahmed, T. Natarajan and K. R. Rao in 1974,[8]
provided the basis for the modified discrete cosine transform (MDCT) used by modern audio
compression formats such as MP3[9] and AAC. MDCT was proposed by J. P. Princen, A. W. Johnson and
A. B. Bradley in 1987,[10] following earlier work by Princen and Bradley in 1986.[11] The MDCT is used
by modern audio compression formats such as Dolby Digital,[12][13] MP3,[9] and Advanced Audio
Coding (AAC).[14]
List of lossy formats

General

Basic
Audio coding Market share (2023)[15] Ref
compression Abbreviation Introduction
standard
algorithm Production Streaming

Dolby Digital (AC- [12][18]


AC3 1991
3) 37–
36–54%[n 1]
Dolby Digital Plus 61%[n 1]
EAC3 2004 [19][20]
(E-AC-3)
Adaptive
Transform ATRAC 1992 Un­known Un­known [12]
Acoustic Coding

MPEG Layer III MP3 1993 15% 19% [9][21]

Advanced Audio
Coding (MPEG-2 / AAC 1997 83% 87% [14][12]
Modified discrete
MPEG-4)
cosine transform
(MDCT) Windows Media
WMA 1999 Un­known Un­known [12]
Audio

Ogg Vorbis Ogg 2000 6% 4% [22][12]

Constrained
Energy Lapped CELT 2011 — — [23]
Transform

Opus Opus 2012 12% 9% [24]

Dolby AC-4 AC4 2014 Un­known Un­known [25]

LDAC LDAC 2015 Un­known Un­known [26][27]

aptX / aptX-HD aptX 1989 Un­known Un­known [28]

Adaptive differential
Digital Theater [29][30]
pulse-code DTS 1990 8% 6%
Systems
modulation
(ADPCM) Master Quality
MQA 2014 Un­known Un­known
Authenticated

MPEG-1 Audio [31]


MP2 1993
Layer II Un­known Un­known
Sub-band coding
(SBC) Musepack MPC 1997

SBC SBC 2003 Un­known Un­known [32]

Speech
Linear predictive coding (LPC)
Adaptive predictive coding (APC)
Code-excited linear prediction (CELP)
Algebraic code-excited linear prediction (ACELP)
Relaxed code-excited linear prediction (RCELP)
Low-delay CELP (LD-CELP)
Adaptive Multi-Rate (used in GSM and 3GPP)
Codec 2 (noted for its lack of patent restrictions)
Speex (noted for its lack of patent restrictions)
Modified discrete cosine transform (MDCT)
AAC-LD
Constrained Energy Lapped Transform (CELT)
Opus (mostly for real-time applications)

List of lossless formats


Apple Lossless (ALAC – Apple Lossless Audio Codec)
Adaptive Transform Acoustic Coding (ATRAC)
Audio Lossless Coding (also known as MPEG-4 ALS)
Direct Stream Transfer (DST)
Dolby TrueHD
DTS-HD Master Audio
Free Lossless Audio Codec (FLAC)
Lossless discrete cosine transform (LDCT)
Meridian Lossless Packing (MLP)
Monkey's Audio (Monkey's Audio APE)
MPEG-4 SLS (also known as HD-AAC)
OptimFROG
Original Sound Quality (OSQ)
RealPlayer (RealAudio Lossless)
Shorten (SHN)
TTA (True Audio Lossless)
WavPack (WavPack lossless)
WMA Lossless (Windows Media Lossless)

See also
Comparison of audio coding formats
Data compression#Audio
Audio file format
List of audio compression formats

Notes
1. The report combines AC-3 & E-AC-3 and separates Dolby Atmos from its market share
calculation. Dolby Atmos can be encoded either lossily with E-AC-3/AC-4[16] or losslessly
with Dolby TrueHD. Music and video streaming providers typically use Dolby Digital Plus
augmented with Dolby Atmos, whereas digital downloads and Blu-ray discs typically use
Dolby TrueHD augmented with Dolby Atmos.[17]

References
1. The term "audio coding" can be seen in e.g. the name Advanced Audio Coding, and is
analogous to the term video coding
2. "Video – Where is synchronization information stored in container formats?" ([Link]
[Link]/questions/357686/where-is-synchronization-information-stored-in-container-formats).
3. US patent 2605361 ([Link]
361), C. Chapin Cutler, "Differential Quantization of Communication Signals", issued 1952-
07-29
4. Cummiskey, P.; Jayant, N. S.; Flanagan, J. L. (1973). "Adaptive Quantization in Differential
PCM Coding of Speech" ([Link] Bell System
Technical Journal. 52 (7): 1105–1118. doi:10.1002/j.1538-7305.1973.tb02007.x ([Link]
rg/10.1002%2Fj.1538-7305.1973.tb02007.x).
5. Cummiskey, P.; Jayant, Nikil S.; Flanagan, J. L. (1973). "Adaptive quantization in differential
PCM coding of speech". The Bell System Technical Journal. 52 (7): 1105–1118.
doi:10.1002/j.1538-7305.1973.tb02007.x ([Link]
007.x). ISSN 0005-8580 ([Link]
6. Schroeder, Manfred R. (2014). "Bell Laboratories" ([Link]
BAAAQBAJ&pg=PA388). Acoustics, Information, and Communication: Memorial Volume in
Honor of Manfred R. Schroeder. Springer. p. 388. ISBN 9783319056609.
7. Gray, Robert M. (2010). "A History of Realtime Digital Speech on Packet Networks: Part II of
Linear Predictive Coding and the Internet Protocol" ([Link]
(PDF). Found. Trends Signal Process. 3 (4): 203–303. doi:10.1561/2000000036 ([Link]
org/10.1561%2F2000000036). ISSN 1932-8346 ([Link]
6).
8. Nasir Ahmed; T. Natarajan; Kamisetty Ramamohan Rao (January 1974). "Discrete Cosine
Transform" ([Link]
min/fg121/Source-Coding_WS12/selected-readings/Ahmed_et_al.__1974.pdf) (PDF). IEEE
Transactions on Computers. C-23 (1): 90–93. doi:10.1109/T-C.1974.223784 ([Link]
10.1109%2FT-C.1974.223784). S2CID 149806273 ([Link]
D:149806273). Archived from the original ([Link]
-Coding_WS12/selected-readings/Ahmed_et_al.__1974.pdf) (PDF) on 2016-12-08.
Retrieved 2019-10-20.
9. Guckert, John (Spring 2012). "The Use of FFT and MDCT in MP3 Audio Compression" (htt
p://[Link]/~gustafso/s2012/2270/web-projects/Guckert-audio-compression-svd
-[Link]) (PDF). University of Utah. Retrieved 14 July 2019.
10. Princen, J.; Johnson, A.; Bradley, A. (1987). "Subband/Transform coding using filter bank
designs based on time domain aliasing cancellation" ([Link]
169405). ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal
Processing. Vol. 12. pp. 2161–2164. doi:10.1109/ICASSP.1987.1169405 ([Link]
1109%2FICASSP.1987.1169405). S2CID 58446992 ([Link]
ID:58446992).
11. Princen, J.; Bradley, A. (1986). "Analysis/Synthesis filter bank design based on time domain
aliasing cancellation" ([Link] IEEE Transactions on
Acoustics, Speech, and Signal Processing. 34 (5): 1153–1161.
doi:10.1109/TASSP.1986.1164954 ([Link]
12. Luo, Fa-Long (2008). Mobile Multimedia Broadcasting Standards: Technology and Practice
([Link] Springer Science &
Business Media. p. 590. ISBN 9780387782638.
13. Britanak, V. (2011). "On Properties, Relations, and Simplified Implementation of Filter Banks
in the Dolby Digital (Plus) AC-3 Audio Coding Standards". IEEE Transactions on Audio,
Speech, and Language Processing. 19 (5): 1231–1241. doi:10.1109/TASL.2010.2087755 (h
ttps://[Link]/10.1109%2FTASL.2010.2087755). S2CID 897622 ([Link]
org/CorpusID:897622).
14. Brandenburg, Karlheinz (1999). "MP3 and AAC Explained" ([Link]
mmcom12/slides/mp3_and_aac_brandenburg.pdf) (PDF). Archived ([Link]
web/20170213191747/[Link]
[Link]) (PDF) from the original on 2017-02-13.
15. "Video Developer Report 2019" ([Link]
zine/Video%20Developer%20Report%202019/[Link])
(PDF). Bitmovin. 2019. Retrieved 5 November 2019.
16. "Does Dolby AC-4 support Dolby Atmos?" ([Link]
es-Dolby-AC-4-support-Dolby-Atmos). Dolby Professional Support. 2023-05-23. Retrieved
2024-11-08.
17. "Just wait until you hear lossless Dolby Atmos Music" ([Link]
heater/lossless-spatial-audio-dolby-atmos-music/). Digital Trends. 2023-05-03. Retrieved
2024-11-08.
18. Britanak, V. (2011). "On Properties, Relations, and Simplified Implementation of Filter Banks
in the Dolby Digital (Plus) AC-3 Audio Coding Standards". IEEE Transactions on Audio,
Speech, and Language Processing. 19 (5): 1231–1241. doi:10.1109/TASL.2010.2087755 (h
ttps://[Link]/10.1109%2FTASL.2010.2087755). S2CID 897622 ([Link]
org/CorpusID:897622).
19. Andersen, Robert Loring; Crockett, B.; Davidson, G.; Davis, Mark; Fielder, L.; Turner,
Stephen C.; Vinton, M.; Williams, P. (1 October 2004). "Introduction to Dolby Digital Plus, an
Enhancement to the Dolby Digital Coding System" ([Link]
92949/[Link]
-[Link]) (PDF). Journal of The Audio Engineering Society. Archived from the original (http
s://[Link]/us/en/technologies/[Link])
(PDF) on 2016-11-19.
20. Digital Audio Compression (AC-3, Enhanced AC-3) Standard ([Link]
si_ts/102300_102399/102366/01.04.01_60/ts_102366v010401p.pdf) (PDF), European
Telecommunications Standards Institute, 20 September 2017, ETSI TS 102 366 V1.4.1
(2017-09, retrieved 21 September 2023
21. Stanković, Radomir S.; Astola, Jaakko T. (2012). "Reminiscences of the Early Work in DCT:
Interview with K.R. Rao" ([Link]
f) (PDF). Reprints from the Early Days of Information Sciences. 60. Retrieved 13 October
2019.
22. [Link] Foundation (2009-06-02). "Vorbis I specification - 1.1.2 Classification" ([Link]
[Link]/vorbis/doc/Vorbis_I_spec.html#x1-50001.1.2). [Link] Foundation. Retrieved
2009-09-22.
23. Terriberry, Timothy B. Presentation of the CELT codec ([Link]
ux_conf_au_CELT_2.ogv). Presentation ([Link]
[Link]) (PDF).
24. Valin, Jean-Marc; Maxwell, Gregory; Terriberry, Timothy B.; Vos, Koen (October 2013). High-
Quality, Low-Delay Music Coding in the Opus Codec. 135th AES Convention. Audio
Engineering Society. arXiv:1602.04845 ([Link]
25. "Dolby AC-4: Audio Delivery for Next-Generation Entertainment Services" ([Link]
[Link]/web/20151204095813/[Link]
[Link]) (PDF). Dolby Laboratories. 2015-06-01. Archived from the
original ([Link]
[Link]) (PDF) on 2015-12-04. Retrieved 2016-04-26.
26. Darko, John H. (2017-03-29). "The inconvenient truth about Bluetooth audio" ([Link]
[Link]/web/20180114020200/[Link]
truth-about-bluetooth-audio/). DAR__KO. Archived from the original ([Link]
[Link]/2017/03/the-inconvenient-truth-about-bluetooth-audio/) on 2018-01-14. Retrieved
2018-01-13.
27. Ford, Jez (2015-08-24). "What is Sony LDAC, and how does it do it?" ([Link]
[Link]/news/sound-image/what-is-sony-ldac-and-how-does-it-do-it-408285). AVHub.
Retrieved 2018-01-13.
28. Ford, Jez (2016-11-22). "aptX HD - lossless or lossy?" ([Link]
d-image/aptx-hd---lossless-or-lossy-442124). AVHub. Retrieved 2018-01-13.
29. "Digital Theater Systems Audio Formats" ([Link]
d/[Link]). Library of Congress. 27 December 2011. Retrieved 10 November 2019.
30. Spanias, Andreas; Painter, Ted; Atti, Venkatraman (2006). Audio Signal Processing and
Coding ([Link] John Wiley &
Sons. p. 338. ISBN 9780470041963.
31. "ISO/IEC 11172-3:1993 – Information technology — Coding of moving pictures and
associated audio for digital storage media at up to about 1,5 Mbit/s — Part 3: Audio" (http://
[Link]/iso/iso_catalogue/catalogue_tc/catalogue_detail.htm?csnumber=22412). ISO.
1993. Retrieved 2010-07-14.
32. Bluetooth SIG, Specification of the Bluetooth System, Profiles, Advanced Audio Distribution
Profile version 1.3. [Link]
doc_id=260859&vId=290074

Retrieved from "[Link]

You might also like