Revision as of 19:19, 1 January 2025 edit Maxeto0910 (talk \| contribs) Extended confirmed users 117,104 edits period after sentence Tag: Visual edit ← Previous edit		Revision as of 15:24, 8 January 2025 edit undo SunDawn (talk \| contribs) Extended confirmed users, Page movers, New page reviewers, Pending changes reviewers, Rollbackers, Temporary account IP viewers 69,215 edits →Using Wikimedia projects for artificial intelligence: ce Tag: Visual edit Next edit →
Line 31: Content in Wikimedia projects is useful as a dataset in advancing artificial intelligence research and applications. For instance, in the development of the Google's [[Perspective API]] that identifies toxic comments in online forums, a dataset containing hundreds of thousands of Wikipedia talk page comments with human-labelled toxicity levels was used.<ref>{{Cite news\|url=https://www.engadget.com/2017/09/01/google-perspective-comment-ranking-system/\|title=Google's comment-ranking system will be a hit with the alt-right\|work=Engadget\|date=2017-09-01}}</ref> Subsets of the Wikipedia corpus are considered the largest well-curated data sets available for AI training.<ref name="nyt180724"/><ref name="considerations"/> A 2012 paper reported that more than ~~1000~~1,000 academic articles, including those using artificial intelligence, examine Wikipedia, reuse information from Wikipedia, use technical extensions linked to Wikipedia, or research communication about Wikipedia.<ref>{{cite journal \|last1=Nielsen \|first1=Finn Årup \|title=Wikipedia Research and Tools: Review and Comments \|journal=SSRN Working Paper Series \|date=2012 \|doi=10.2139/ssrn.2129874 \|language=en \|issn=1556-5068}}</ref> A 2017 paper described Wikipedia as the [[mother lode]] for human-generated text available for machine learning.<ref>{{cite journal \|last1=Mehdi \|first1=Mohamad \|last2=Okoli \|first2=Chitu \|last3=Mesgari \|first3=Mostafa \|last4=Nielsen \|first4=Finn Årup \|last5=Lanamäki \|first5=Arto \|title=Excavating the mother lode of human-generated text: A systematic review of research that uses the wikipedia corpus \|journal=Information Processing & Management \|volume=53 \|issue=2 \|pages=505–529 \|doi=10.1016/j.ipm.2016.07.003 \|date=March 2017\|s2cid=217265814 \|url=http://urn.fi/urn:nbn:fi-fe202003057304 }}</ref> A 2016 research project called "One Hundred Year Study on Artificial Intelligence" named Wikipedia as a key early project for understanding the interplay between artificial intelligence applications and human engagement.<ref>{{cite web \|title=AI Research Trends - One Hundred Year Study on Artificial Intelligence (AI100) \|url=https://ai100.stanford.edu/2016-report/section-i-what-artificial-intelligence/ai-research-trends \|website=ai100.stanford.edu \|language=en}}</ref>

Artificial intelligence in Wikimedia projects: Difference between revisions