Revision as of 20:55, 23 May 2025 edit OAbot (talk \| contribs) Bots 643,717 edits m Open access bot: url-access updated in citation with #oabot. ← Previous edit		Revision as of 07:43, 24 May 2025 edit undo Cosmia Nebula (talk \| contribs) Extended confirmed users 11,305 edits →top: CNN Tag: Visual edit Next edit →
Line 1: [[File:TDNN Diagram.png\|thumb\|right\|TDNN diagram]] '''Time delay neural network''' ('''TDNN''')<ref name="phoneme detection">{{cite journal \|doi=10.1109/29.21701 \|url=http://www.inf.ufrgs.br/~engel/data/media/file/cmp121/waibel89_TDNN.pdf \|title=Phoneme recognition using time-delay neural networks \|date=1989 \|last1=Waibel \|first1=A. \|last2=Hanazawa \|first2=T. \|last3=Hinton \|first3=G. \|last4=Shikano \|first4=K. \|last5=Lang \|first5=K.J. \|journal=IEEE Transactions on Acoustics, Speech, and Signal Processing \|volume=37 \|issue=3 \|pages=328–339 }}</ref> is a multilayer [[artificial neural network]] architecture whose purpose is to 1) classify patterns with shift-invariance, and 2) model context at each layer of the network. It is essentially a 1-d [[convolutional neural network]] (CNN) without pooling or dilations. Shift-invariant classification means that the classifier does not require explicit segmentation prior to classification. For the classification of a temporal pattern (such as speech), the TDNN thus avoids having to determine the beginning and end points of sounds before classifying them.

Time delay neural network: Difference between revisions