Talk:Large language model: Difference between revisions

Content deleted Content added
Cewbot (talk | contribs)
m Maintain {{WPBS}} and vital articles: 5 WikiProject templates. The article is listed in the level 5 page: Artificial intelligence concepts.
Yoderj (talk | contribs)
Cut Mamba and RNN mention at start of article?
Line 180:
:*:Nice catch, the article appears to be about 900 words below the 6k readable prose threshold. However, the list in table format feels long and unnecessary here. Several items appear to be sourced to corporate blog posts or preprints. If the list is to remain here, it could be reduced to a non-table list of notable links. [[User:WeyerStudentOfAgrippa|WeyerStudentOfAgrippa]] ([[User talk:WeyerStudentOfAgrippa|talk]]) 17:51, 1 February 2024 (UTC)
:I had actually been thinking about the list table recently. I would have suggested creating a modified, chatbot-listing version of the table at [[List of chatbots]], to which [[Comparison of user features of chatbots]] could also probably be merged. –<span style="box-shadow: 0px 0px 12px red;border-radius:9em;padding:0 2px;background:#D00">[[User:Gluonz|<span style="color:#AFF">'''Gluonz'''</span>]]<sup>''' [[User talk:Gluonz|talk]] [[Special:Contributions/Gluonz|contribs]]'''</sup></span> 17:15, 1 February 2024 (UTC)
 
== Reduce emphasis on non-transformer LLMs? ==
 
The opening paragraph includes the text, "Some recent implementations are based on other architectures, such as recurrent neural network variants and Mamba (a state space model).[2][3][4]". I believe this text should be moved MUCH later in the article, if it is mentioned at all. I don't think the citations included are sufficient to demonstrate the notability of these alternatives to the dominant architecture. Is there agreement on this? --[[User:Yoderj|Yoderj]] ([[User talk:Yoderj|talk]]) 21:15, 21 February 2024 (UTC)