Speech coding: Difference between revisions

Browse history interactively

← Previous edit

Content deleted Content added

VisualWikitext

Revision as of 21:16, 30 October 2024 edit Kvng (talk \| contribs) Extended confirmed users, New page reviewers 115,948 edits m unpiped links using script ← Previous edit		Latest revision as of 22:11, 17 December 2024 edit undo Kvng (talk \| contribs) Extended confirmed users, New page reviewers 115,948 edits m avoid bit/s wrap at slash
(One intermediate revision by one other user not shown)
Line 37: [[Opus (audio format)\|Opus]] is a [[free software]] audio coder. It combines the speech-oriented LPC-based [[SILK]] algorithm and the lower-latency MDCT-based CELT algorithm, switching between or combining them as needed for maximal efficiency.<ref name="homepage">{{cite web \|url = https://opus-codec.org/ \|title=Opus Codec \|work=Opus \|publisher=Xiph.org Foundation \|type=Home page \|access-date=July 31, 2012 }}</ref><ref>{{cite conference \|last1=Valin \|first1=Jean-Marc \|last2=Maxwell \|first2=Gregory \|last3=Terriberry \|first3=Timothy B. \|last4=Vos \|first4=Koen \|title=High-Quality, Low-Delay Music Coding in the Opus Codec \|conference=135th AES Convention \|publisher=[[Audio Engineering Society]] \|date=October 2013 \|arxiv=1602.04845 }}</ref> It is widely used for VoIP calls in [[WhatsApp]].<ref name="Register">{{cite news \|last1=Leyden \|first1=John \|title=WhatsApp laid bare: Info-sucking app's innards probed \|url=https://www.theregister.co.uk/2015/10/27/whatsapp_forensic_analysis/ \|access-date=19 October 2019 \|work=[[The Register]] \|date=27 October 2015}}</ref><ref name="Hazra">{{cite book \|last1=Hazra \|first1=Sudip \|last2=Mateti \|first2=Prabhaker \|chapter=Challenges in Android Forensics \|editor-last1=Thampi \|editor-first1=Sabu M. \|editor-last2=Pérez \|editor-first2=Gregorio Martínez \|editor-last3=Westphall \|editor-first3=Carlos Becker \|editor-last4=Hu \|editor-first4=Jiankun \|editor-last5=Fan \|editor-first5=Chun I. \|editor-last6=Mármol \|editor-first6=Félix Gómez \|title=Security in Computing and Communications: 5th International Symposium, SSCC 2017 \|date=September 13–16, 2017 \|publisher=Springer \|isbn=9789811068980 \|pages=286–299 (290) \|doi=10.1007/978-981-10-6898-0_24 \|chapter-url=https://books.google.com/books?id=1u09DwAAQBAJ&pg=PA290}}</ref><ref name="Srivastava">{{cite book \|last1=Srivastava \|first1=Saurabh Ranjan \|last2=Dube \|first2=Sachin \|last3=Shrivastaya \|first3=Gulshan \|last4=Sharma \|first4=Kavita \|chapter=Smartphone Triggered Security Challenges: Issues, Case Studies and Prevention \|editor-last1=Le \|editor-first1=Dac-Nhuong \|editor-last2=Kumar \|editor-first2=Raghvendra \|editor-last3=Mishra \|editor-first3=Brojo Kishore \|editor-last4=Chatterjee \|editor-first4=Jyotir Moy \|editor-last5=Khari \|editor-first5=Manju \|title=Cyber Security in Parallel and Distributed Computing: Concepts, Techniques, Applications and Case Studies \|date=2019 \|publisher=John Wiley & Sons \|isbn=9781119488057 \|pages=187–206 (200) \|doi=10.1002/9781119488330.ch12 \|s2cid=214034702 \|chapter-url=https://books.google.com/books?id=FzGtDwAAQBAJ&pg=PA200}}</ref> The [[PlayStation 4]] video game console also uses Opus for its [[PlayStation Network]] system party chat.<ref name="playstation">{{cite web\|url=https://doc.dl.playstation.net/doc/ps4-oss/ \|title=Open Source Software used in PlayStation4 \|publisher=Sony Interactive Entertainment Inc. \|access-date=2017-12-11}}{{failed verification\|reason=Source does not indicate how Opus is used\|date=September 2022}}</ref> A number of codecs with even lower [[bit rate]]s have been demonstrated. [[Codec2]], which operates at bit rates as low as {{nowrap\|450~~ ~~ bit/s}}, sees use in amateur radio.<ref>{{cite web \|title=GitHub - Codec2 \|website=[[GitHub]] \|date=November 2019 \|url=https://github.com/x893/codec2}}</ref> NATO currently uses [[MELPe]], offering intelligible speech at {{nowrap\|600~~ ~~ bit/s}} and below.<ref>Alan McCree, “A scalable phonetic vocoder framework using joint predictive vector quantization of MELP parameters,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, 2006, pp. I 705–708, Toulouse, France</ref> Neural vocoder approaches have also emerged: [[Lyra (codec)\|Lyra]] by Google gives an "almost eerie" quality at {{nowrap\|3~~ ~~ kbit/s}}.<ref name=":0">{{Cite web \|last=Buckley \|first=Ian \|date=2021-04-08 \|title=Google Makes Its Lyra Low Bitrate Speech Codec Public \|url=https://www.makeuseof.com/google-lyra-speech-codec-public/ \|access-date=2022-07-21 \|website=MakeUseOf \|language=en-US}}</ref> Microsoft's [[Satin (codec)\|Satin]] also uses machine learning, but uses a higher tunable bitrate and is wideband.<ref name=":3">{{Cite web \|last=Levent-Levi \|first=Tsahi \|date=2021-04-19 \|title=Lyra, Satin and the future of voice codecs in WebRTC \|url=https://bloggeek.me/lyra-satin-webrtc-voice-codecs/ \|access-date=2022-07-21 \|website=BlogGeek.me \|language=en-US}}</ref> ===Sub-fields=== Line 83: [[Category:Speech codecs\| ]] [[Category:Data compression]]