== Neural models ==
=== Recurrent neural network ===
Continuous representations or [[Word embedding|embeddings of words]] are produced in [[recurrent neural network]]-based language models (known also as ''continuous space language models'').<ref>{{cite web |last1=Karpathy |first1=Andrej |title=The Unreasonable Effectiveness of Recurrent Neural Networks |url=https://karpathy.github.io/2015/05/21/rnn-effectiveness/ |access-date=27 January 2019 |archive-date=1 November 2020 |archive-url=https://web.archive.org/web/20201101215448/http://karpathy.github.io/2015/05/21/rnn-effectiveness/ |url-status=live }}</ref> Such continuous space embeddings help to alleviate the [[curse of dimensionality]]: because the number of possible word sequences grows [[Exponential growth|exponentially]] with vocabulary size, count-based models suffer from data sparsity. Neural networks avoid this problem by representing words as non-linear combinations of weights in a neural net.<ref name="bengio">{{cite encyclopedia|title=Neural net language models|first=Yoshua|last=Bengio|year=2008|encyclopedia=[[Scholarpedia]]|volume=3|issue=1|page=3881|url=http://www.scholarpedia.org/article/Neural_net_language_models|doi=10.4249/scholarpedia.3881}}</ref>
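The idea above can be sketched in a few lines of code: a minimal, untrained vanilla RNN language model in which each word is a dense embedding vector rather than a one-hot symbol, so parameters scale with vocabulary size instead of with the number of possible sequences. The toy vocabulary, dimensions, and function name here are illustrative assumptions, not from any particular implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setup (illustrative): a 5-word vocabulary, small dimensions.
vocab = ["the", "cat", "sat", "on", "mat"]
V, d, n_hidden = len(vocab), 8, 16  # vocab size, embedding dim, hidden dim

# Continuous word representations: each word is a dense d-dimensional
# vector, so the model has O(V) embedding parameters rather than a
# parameter per possible word sequence.
E = rng.normal(0, 0.1, (V, d))            # embedding matrix
W_xh = rng.normal(0, 0.1, (d, n_hidden))  # input-to-hidden weights
W_hh = rng.normal(0, 0.1, (n_hidden, n_hidden))  # recurrent weights
W_hy = rng.normal(0, 0.1, (n_hidden, V))  # hidden-to-vocab logits

def next_word_probs(sentence):
    """One forward pass of an (untrained) RNN LM over a word sequence,
    returning a probability distribution over the next word."""
    h = np.zeros(n_hidden)
    for w in sentence:
        x = E[vocab.index(w)]              # embedding lookup
        h = np.tanh(x @ W_xh + h @ W_hh)   # recurrent state update
    logits = h @ W_hy
    p = np.exp(logits - logits.max())      # numerically stable softmax
    return p / p.sum()

p = next_word_probs(["the", "cat", "sat"])
```

With random weights the distribution is near-uniform; training would adjust `E` and the weight matrices so that semantically similar words receive nearby embeddings, which is what lets the model generalize to word sequences it has never seen.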
=== Large language models ===