Revision as of 16:58, 8 May 2025 edit 81.252.62.13 (talk) Link updated Tag: Visual edit ← Previous edit		Revision as of 17:06, 8 May 2025 edit undo 81.252.62.13 (talk) Link update Tag: Visual edit Next edit →
Line 29: === State of the art === TDNN-based phoneme recognizers compared favourably in early comparisons with HMM-based phone models.<ref name="phoneme detection" /><ref name=":3" /> Modern deep TDNN architectures include many more hidden layers and sub-sample or pool connections over broader contexts at higher layers. They achieve up to 50% word error reduction over [[Mixture model\|GMM]]-based acoustic models.<ref name=":4">Vijayaditya Peddinti, Daniel Povey, Sanjeev Khudanpur, ''[https://web.archive.org/web/20180306041537/https://pdfs.semanticscholar.org/ced2/11de5412580885279090f44968a428f1710b.pdf A time delay neural network architecture for efficient modeling of long temporal contexts]'', Proceedings of Interspeech 2015</ref><ref name=":5">David Snyder, Daniel Garcia-Romero, Daniel Povey, ''[http://danielpovey.com/files/2015_asru_tdnn_ubm.pdf A Time-Delay Deep Neural Network-Based Universal Background Models for Speaker Recognition]'', Proceedings of ASRU 2015.</ref> While the different layers of TDNNs are intended to learn features of increasing context width, they do model local contexts. When longer-distance relationships and pattern sequences have to be processed, learning states and state-sequences is important and TDNNs can be combined with other modelling techniques.<ref name=":6">~~Patrick~~{{Cite web \|last=Haffner, ~~Alexander~~\|first=Patrick \|last2=Waibel, ~~''[http://papers.nips.cc/paper/580-multi-state-time-delay-networks-for-continuous-speech-recognition.pdf~~\|first2=Alex \|date=1991 \|title=Multi-State Time Delay ~~Neural~~ Networks for Continuous Speech Recognition] ~~{{Webarchive~~\|url=https://~~web~~proceedings.~~archive~~neurips.~~org~~cc/~~web~~paper_files/~~20160411090850~~paper/~~http:~~1991/hash/~~papers~~069d3bb002acd8d7dd095917f9efe4cb-Abstract.~~nips~~html \|url-status=live \|website=proceedings.neurips.cc~~/paper/580-multi-state-time-delay-networks-for-continuous-speech-recognition.pdf~~ \|~~date~~publisher=~~2016~~NIPS \|pages=135-~~04-11~~ 142}}~~'', Advances in Neural Information Processing Systems, 1992, Morgan Kaufmann.~~</ref><ref name=":1" /><ref name=":2" /> == Applications ==

Time delay neural network: Difference between revisions