Revision as of 15:19, 15 November 2024 edit Hooman Mallahzadeh (talk \| contribs) Extended confirmed users 4,638 edits →Stacked RNN ← Previous edit		Revision as of 07:32, 16 November 2024 edit undo Hooman Mallahzadeh (talk \| contribs) Extended confirmed users 4,638 edits m →Encoder-decoder Next edit →
Line 82: Two ~~RNN~~RNNs can be run front-to-back in an '''encoder-decoder''' configuration. The encoder RNN processes an input sequence into a sequence of hidden vectors, and the decoder RNN processes the sequence of hidden vectors to an output sequence, with an optional [[Attention (machine learning)\|attention mechanism]]. This was used to construct state of the art [[Neural machine translation\|neural machine translators]] during the 2014–2017 period. This was an instrumental step towards the development of [[Transformer (deep learning architecture)\|Transformers]].<ref>{{Cite journal \|last1=Vaswani \|first1=Ashish \|last2=Shazeer \|first2=Noam \|last3=Parmar \|first3=Niki \|last4=Uszkoreit \|first4=Jakob \|last5=Jones \|first5=Llion \|last6=Gomez \|first6=Aidan N \|last7=Kaiser \|first7=Ł ukasz \|last8=Polosukhin \|first8=Illia \|date=2017 \|title=Attention is All you Need \|url=https://proceedings.neurips.cc/paper/2017/hash/3f5ee243547dee91fbd053c1c4a845aa-Abstract.html \|journal=Advances in Neural Information Processing Systems \|publisher=Curran Associates, Inc. \|volume=30}}</ref> === PixelRNN ===

Recurrent neural network: Difference between revisions