Revision as of 23:47, 29 November 2020 by Monkbot (bot, minor edit): Task 18 (cosmetic): eval 23 templates: del empty params (4×); hyphenate params (4×); del \|url-status= (2×); Tag: AWB
Revision as of 14:20, 29 December 2020 by Monkbot (bot, minor edit): Task 18 (cosmetic): eval 23 templates: hyphenate params (7×); Tag: AWB
Line 25:
[[Perceptual coding]] was first used for [[speech coding]] compression, with [[linear predictive coding]] (LPC).<ref name="Schroeder2014">{{cite book \|last1=Schroeder \|first=Manfred R. \|title=Acoustics, Information, and Communication: Memorial Volume in Honor of Manfred R. Schroeder \|date=2014 \|publisher=Springer \|isbn=9783319056609 \|chapter=Bell Laboratories \|page=388 \|chapter-url=https://books.google.com/books?id=d9IkBAAAQBAJ&pg=PA388}}</ref> Initial concepts for LPC date back to the work of [[Fumitada Itakura]] ([[Nagoya University]]) and Shuzo Saito ([[Nippon Telegraph and Telephone]]) in 1966.<ref>{{cite journal \|last1=Gray \|first1=Robert M. \|title=A History of Realtime Digital Speech on Packet Networks: Part II of Linear Predictive Coding and the Internet Protocol \|journal=Found. Trends Signal Process. \|date=2010 \|volume=3 \|issue=4 \|pages=203–303 \|doi=10.1561/2000000036 \|url=https://ee.stanford.edu/~gray/lpcip.pdf \|issn=1932-8346}}</ref> During the 1970s, [[Bishnu S. Atal]] and [[Manfred R. Schroeder]] at [[Bell Labs]] developed a form of LPC called [[adaptive predictive coding]] (APC), a perceptual coding algorithm that exploited the masking properties of the human ear, followed in the early 1980s with the [[code-excited linear prediction]] (CELP) algorithm which achieved a significant compression ratio for its time.<ref name="Schroeder2014"/> Perceptual coding is used by modern audio compression formats such as [[MP3]]<ref name="Schroeder2014"/> and [[Advanced Audio Codec\|AAC]]. [[Discrete cosine transform]] (DCT), developed by [[N. Ahmed\|Nasir Ahmed]], T. Natarajan and [[K. R. Rao]] in 1974,<ref name="DCT">{{cite journal \|author1=Nasir Ahmed \|author1-link=N. Ahmed \|author2=T. 
Natarajan \|author3=Kamisetty Ramamohan Rao \|journal=IEEE Transactions on Computers\|title=Discrete Cosine Transform\|volume=C-23\|issue=1\|pages=90–93\|date=January 1974 \|doi=10.1109/T-C.1974.223784 \|url=https://www.ic.tu-berlin.de/fileadmin/fg121/Source-Coding_WS12/selected-readings/Ahmed_et_al.__1974.pdf}}</ref> provided the basis for the [[modified discrete cosine transform]] (MDCT) used by modern audio compression formats such as MP3<ref name="Guckert">{{cite web \|last1=Guckert \|first1=John \|title=The Use of FFT and MDCT in MP3 Audio Compression \|url=http://www.math.utah.edu/~gustafso/s2012/2270/web-projects/Guckert-audio-compression-svd-mdct-MP3.pdf \|website=[[University of Utah]] \|date=Spring 2012 \|~~accessdate~~access-date=14 July 2019}}</ref> and AAC. MDCT was proposed by J. P. Princen, A. W. Johnson and A. B. Bradley in 1987,<ref>J. P. Princen, A. W. Johnson and A. B. Bradley: ''Subband/transform coding using filter bank designs based on time domain aliasing cancellation'', IEEE Proc. Intl. Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2161–2164, 1987.</ref> following earlier work by Princen and Bradley in 1986.<ref>John P. Princen, Alan B. Bradley: ''Analysis/synthesis filter bank design based on time domain aliasing cancellation'', IEEE Trans. Acoust. Speech Signal Processing, ''ASSP-34'' (5), 1153–1161, 1986.</ref> The MDCT is used by modern audio compression formats such as [[Dolby Digital]],<ref name="Luo">{{cite book \|last1=Luo \|first1=Fa-Long \|title=Mobile Multimedia Broadcasting Standards: Technology and Practice \|date=2008 \|publisher=[[Springer Science & Business Media]] \|isbn=9780387782638 \|page=590 \|url=https://books.google.com/?id=l6PovWat8SMC&pg=PA590}}</ref><ref>{{cite journal \|last1=Britanak \|first1=V. 
\|title=On Properties, Relations, and Simplified Implementation of Filter Banks in the Dolby Digital (Plus) AC-3 Audio Coding Standards \|journal=IEEE Transactions on Audio, Speech, and Language Processing \|date=2011 \|volume=19 \|issue=5 \|pages=1231–1241 \|doi=10.1109/TASL.2010.2087755}}</ref> [[MP3]],<ref name="Guckert">{{cite web \|last1=Guckert \|first1=John \|title=The Use of FFT and MDCT in MP3 Audio Compression \|url=http://www.math.utah.edu/~gustafso/s2012/2270/web-projects/Guckert-audio-compression-svd-mdct-MP3.pdf \|website=[[University of Utah]] \|date=Spring 2012 \|~~accessdate~~access-date=14 July 2019}}</ref> and [[Advanced Audio Coding]] (AAC).<ref name=brandenburg>{{cite web\|url=http://graphics.ethz.ch/teaching/mmcom12/slides/mp3_and_aac_brandenburg.pdf\|title=MP3 and AAC Explained\|last=Brandenburg\|first=Karlheinz\|year=1999\|url-status=live\|archive-url=https://web.archive.org/web/20170213191747/https://graphics.ethz.ch/teaching/mmcom12/slides/mp3_and_aac_brandenburg.pdf\|archive-date=2017-02-13}}</ref>
==List of lossy formats==
Line 35:
! Abbreviation
! Introduction
! Market share {{small\|(2019)}}<ref name="Bitmovin">{{cite web \|url=https://cdn2.hubspot.net/hubfs/3411032/Bitmovin%20Magazine/Video%20Developer%20Report%202019/bitmovin-video-developer-report-2019.pdf \|title=Video Developer Report 2019 \|website=[[Bitmovin]] \|year=2019 \|~~accessdate~~access-date=5 November 2019}}</ref>
! {{Abbr\|Ref\|Reference(s)}}
\|-
Line 55:
\| 1993
\| 49%
\| <ref name="Guckert">{{cite web \|last1=Guckert \|first1=John \|title=The Use of FFT and MDCT in MP3 Audio Compression \|url=http://www.math.utah.edu/~gustafso/s2012/2270/web-projects/Guckert-audio-compression-svd-mdct-MP3.pdf \|website=[[University of Utah]] \|date=Spring 2012 \|~~accessdate~~access-date=14 July 2019}}</ref><ref name="Stankovic">{{cite journal \|last1=Stanković \|first1=Radomir S. \|last2=Astola \|first2=Jaakko T. \|title=Reminiscences of the Early Work in DCT: Interview with K.R. Rao \|journal=Reprints from the Early Days of Information Sciences \|date=2012 \|volume=60 \|url=http://ticsp.cs.tut.fi/reports/ticsp-report-60-reprint-rao-corrected.pdf \|~~accessdate~~access-date=13 October 2019}}</ref>
\|-
\| [[Advanced Audio Coding]] ([[MPEG-2]] / [[MPEG-4]])
Line 73:
\| 2000
\| 7%
\| <ref name="vorbis-mdct">{{cite web \|author=Xiph.Org Foundation \|publisher=Xiph.Org Foundation \|url=http://www.xiph.org/vorbis/doc/Vorbis_I_spec.html#x1-50001.1.2 \|title=Vorbis I specification - 1.1.2 Classification \|date=2009-06-02 \|~~accessdate~~access-date=2009-09-22}}</ref><ref name="Luo"/>
\|-
\| [[Constrained Energy Lapped Transform]]
Line 104:
\| 1990
\| 14%
\| <ref>{{cite web \|title=Digital Theater Systems Audio Formats \|url=https://www.loc.gov/preservation/digital/formats/fdd/fdd000232.shtml \|website=[[Library of Congress]] \|~~accessdate~~access-date=10 November 2019 \|date=27 December 2011}}</ref><ref>{{cite book \|last1=Spanias \|first1=Andreas \|last2=Painter \|first2=Ted \|last3=Atti \|first3=Venkatraman \|title=Audio Signal Processing and Coding \|date=2006 \|publisher=[[John Wiley & Sons]] \|isbn=9780470041963 \|page=338 \|url=https://books.google.com/?id=a1RULRErhOYC&pg=PA338}}</ref>
\|-
\| [[Master Quality Authenticated]]

Audio coding format: Difference between revisions