Wikipedia talk:WikiProject AI Cleanup
This is the talk page for discussing WikiProject AI Cleanup and anything related to its purposes and tasks. |
|
Archives: 1, 2, 3Auto-archiving period: 30 days ![]() |
![]() | This project page does not require a rating on Wikipedia's content assessment scale. It is of interest to the following WikiProjects: | |||||||
|
![]() | To help centralize discussions and keep related topics together, all non-archive subpages of this talk page redirect here. |
![]() | This page has been mentioned by multiple media organizations:
|
AI-generated and article for deletion?
editHi, everyone. According to discussions on fr-WP, the article Farm Management is generated by AI and uses a title which was already used by Farm management (without a capital letter for "management") redirecting to Agricultural science. On the French Wiki, a discussion for deletion is underway and what would be the procedure here? Is using Template:AI-generated and Template:Article_for_deletion ok ? Fabius Lector (talk) 10:56, 30 July 2025 (UTC)
- I've WP:BLARed it. Sohom (talk) 11:12, 30 July 2025 (UTC)
Request to review articles for AI hallucination issues
editI work for Arabella Advisors, a D.C.-based consulting company, and I just posted a long message on the AA Talk page outlining glaring errors on the AA and New Venture Fund articles. Some of these errors seem to be indicative of AI hallucinations, as there are numerous instances where the cited sources don't support the footnoted claim. Is this something that experienced editors here could review? Any help would be appreciated. JJ for Arabella (talk) 19:25, 30 July 2025 (UTC)
- I see no evidence that the concerns raised at Talk:Arabella Advisors stem from LLM usage. The 990 claim currently in the article is from 2020 [1] and appears based on an earlier incarnation of the claim that existed in the initial version of the page [2], which was removed [3] after a different coi editreq brought attention to the unreliable sources supporting it [4].
- That is one raised issue, a review of the others does not indicate LLM usage either. Stating that the use of the term "Subsidiaries" vs "Clients" may be a hallucination is quite a leap, and the New Venture Fund lede sourcing problem can be easily attributed to WP:SYNTH (see the entry for Eric Kessler [5]). I see that you asked about the latter at the Teahouse without providing specific examples, but Cullen328 still advised that such errors can stem from original research [6].
- People have been getting things wrong for as long as they've existed, no model use required, and no model use is evident here. fifteen thousand two hundred twenty four (talk) 21:05, 30 July 2025 (UTC)
- Thank you for the quick response, Fifteen thousand two hundred twenty four. Your argument that the factual inaccuracies and citation errors that I flagged don't stem from LLM usage makes sense and is honestly reassuring. If these issues are simply a reflection of sloppy research then hopefully they can be addressed by reviewing editors. Thank you again for your response and sorry for the false alarm! JJ for Arabella (talk) 14:31, 31 July 2025 (UTC)
I am concerned about the edits by User:EncycloSphere, and left a talk page message for them. That editor's response stated that AI was used in the drafting, but "content quality and sources matter more than method". I also discussed this here with User:Chaotic Enby. My concern is the unencyclopedic tone of the enormous edits being made. Thank you! --Magnolia677 (talk) 10:14, 2 August 2025 (UTC)
- To note, "content quality and sources matter more than method" is a weak argument when, as I pointed out, some sources like Mexico Meets Paris: The Rise of Haute Taquerías in Special:Diff/1302780528 don't seem to exist at all (even after looking for an archived version). Chaotic Enby (talk · contribs) 11:01, 2 August 2025 (UTC)
- Spot checking their edit history it seems all AI. Unreviewed too, the diff Chaotic posted is all dead links and fails WP:V. Jumpytoo Talk 05:49, 3 August 2025 (UTC)
AI images of historical figures
editHey all. Theres currently a discusion (at Talk:James McQueen (writer)) about adding an AI generated image to an article on a named histotrical figure, which in my view goes very much against WP:AIIMAGE. However, the uploader is insisting that they is some exception becasue there's no free image (which again is covered by WP:AIIMAGE). If somebody with more expertise in this area wants to take a look and tell me if I'm off base here that would be very helpful. Cakelot1 ☞️ talk 11:47, 2 August 2025 (UTC)
- If there was one thing with overwhelming consensus in that RFC, it was that AI generated images should not be used to depict actual real-world people under any circumstances. -- LWG talk 20:09, 2 August 2025 (UTC)
Could someone look at the contributions of MaineMax04843 and confirm/infirm my suspicions?
editI found clear evidence of AI slop in their recent contributions, and I suspect many if not most of their older ones also including AI slop in them.
Could someone look that that contribution history and sanity check me here? I don't want to escalate prematurely. Headbomb {t · c · p · b} 00:04, 4 August 2025 (UTC)
- @Magnolia677: this may interest you. Headbomb {t · c · p · b} 00:05, 4 August 2025 (UTC)
- I looked at one of their earliest edits, to third culture kid. The doi for new reference Tan, Koh, & Lim 2021 goes to a different paper by different authors on a related topic, and the reference appears not to exist. New reference "The global nomad experience: Living in liminality" exists offline but is dated 2009 when the actual publication date appears to be 1999. New reference Doyen, Dhaene, et al 2016 is given with a doi that goes to an unrelated paper and has a title from a paper by different authors with a different publication year. New reference Lee & Bain 2007 has a doi that goes to an unrelated paper and does not appear to exist. New reference Cottrell 2002 duplicates an existing reference but with a different book title that does not appear to exist, different page numbers, and malformatted citation template. New reference Cariola 2020 has a doi that goes to an unrelated paper and does not appear to exist. At this point I gave up checking the rest, as I was already convinced that this is unchecked AI slop. —David Eppstein (talk) 00:47, 4 August 2025 (UTC)
- I reviewed the one article they've created, Midcoast Villager, and did not find any signs of LLM use via faulty references like above. However the History section was copied closely from provided sources, and so I have removed and tagged it for revdel. There is also a WP:CRYSTAL issue since the article relies heavily on a source that predates events that are asserted to have happened, I've elected to draftify it to allow for corrections before reintroduction into articlespace. fifteen thousand two hundred twenty four (talk) 01:14, 4 August 2025 (UTC)
- The timing of the edits strongly suggests LLM use, especially considering that they are tagged as mobile web edits. It is highly unlikely that an editor would manually create these seven edits within the span of an hour using the mobile website: Special:Diff/1298369455, Special:Diff/1298368748, Special:Diff/1298367530, Special:Diff/1298367022, Special:Diff/1298365691, Special:Diff/1298364779, Special:Diff/1298363103.I have partially blocked MaineMax04843 (talk · contribs · count) from article space, and invited them to participate in the discussion here. — Newslinger talk 20:25, 8 August 2025 (UTC)
I think this might be of interest to this WikiProject?
While checking if there was an article for Citizen developer, I found this draft that was declined at AfC for being LLM-generated. I think it's salvageable so I'm rewriting it. I'd appreciate some help with this! Rosaece ♡ talk ♡ contributions 22:20, 8 August 2025 (UTC)
- My main advice for rewriting this would be to take the sections that are bulleted lists and rewrite them as prose (non-list text), since prose is better understood in most cases. SuperPianoMan9167 (talk) 23:11, 8 August 2025 (UTC)
- I'm not sure the topic warrants a separate article and would suggest considering expanding Software development#Workers, Programmer, or Software development process#Examples and making a redirect instead. fifteen thousand two hundred twenty four (talk) 23:19, 8 August 2025 (UTC)
Where to request LLM generated reference checks?
editHi -- Priyanshu.sage has made a lot of edits to various articles in 2024 that are almost certainly AI generated, based on this diff. A lot of these contributions have been revdelled so I can't check them all, but that in and of itself is a warning sign.
Where would I request a reference check here? Apologies for not doing it myself, unfortunately I am unfamiliar with this subject area and am not the best person for that. Gnomingstuff (talk) 19:38, 9 August 2025 (UTC)
- Kind of piggybacking off this, how would I do this in general for editors who have created dozens/hundreds of AI edits in the past? I am finding a great deal of users who appear to be serial adders of AI content, but who have been inactive for a year or so so contacting them for confirmation is unlikely to work. User:Vallee is the most recent one whose (massive amounts of) edits I am working through -- see the userpage in this case.
- I've been tagging these edits and adding a talk page message, but there are a lot of them to tag, and the pages and talk pages don't seem to be super active. I have not been deleting these sections since I have no real proof besides the userpage and AI writing tells.
- Let me know if I should be doing something else. I am sorry for creating more work for people -- although arguably these editors are the ones who created the work and I am just flagging it. Gnomingstuff (talk) 20:26, 9 August 2025 (UTC)
- I wonder if we need an AI noticeboard to centralized efforts, like we have for fringe stuff. Headbomb {t · c · p · b} 22:45, 9 August 2025 (UTC)
- This talk page was marked as a noticeboard, as per Wikipedia talk:WikiProject AI Cleanup/Archive 2 § Wikipedia talk:WikiProject AI Cleanup/Archive 2#WP:LLMN?, again. As noted in that section, it wasn't based on very much discussion, though. While personally I think it would be helpful to keep project discussion separate from discussion of specific situations, whatever works best for those actively involved is fine. isaacl (talk) 23:11, 9 August 2025 (UTC)
- Apologies, thought it would be OK based on the other topics here about individual users. Gnomingstuff (talk) 02:23, 10 August 2025 (UTC)
- I think posting about cleaning up instances of AI use to WikiProject AI Cleanup makes perfect sense. This talk page is, in my view, the de facto LLM noticeboard. fifteen thousand two hundred twenty four (talk) 02:41, 10 August 2025 (UTC)
- OK, thanks.
- Expectation setting: There's probably going to be a lot. The way I'm doing this is searching for combinations of AI tell phrases and then checking the sources/contribution history on the diffs. The current search I am working through has 260 results. And obviously a lot of these will be false positives or inconclusive, but that's just one search. Gnomingstuff (talk) 14:42, 10 August 2025 (UTC)
- I think posting about cleaning up instances of AI use to WikiProject AI Cleanup makes perfect sense. This talk page is, in my view, the de facto LLM noticeboard. fifteen thousand two hundred twenty four (talk) 02:41, 10 August 2025 (UTC)
- Apologies, thought it would be OK based on the other topics here about individual users. Gnomingstuff (talk) 02:23, 10 August 2025 (UTC)
- This talk page was marked as a noticeboard, as per Wikipedia talk:WikiProject AI Cleanup/Archive 2 § Wikipedia talk:WikiProject AI Cleanup/Archive 2#WP:LLMN?, again. As noted in that section, it wasn't based on very much discussion, though. While personally I think it would be helpful to keep project discussion separate from discussion of specific situations, whatever works best for those actively involved is fine. isaacl (talk) 23:11, 9 August 2025 (UTC)
- I wonder if we need an AI noticeboard to centralized efforts, like we have for fringe stuff. Headbomb {t · c · p · b} 22:45, 9 August 2025 (UTC)
New logo?
editThe main page for the project seems to have a new logo (one resembling a brain), but I checked the wikitext and it still seems to be using the same file: File:WikiProject AI Cleanup.svg (the logo with a robot and a magnifying glass). The new logo even appears in old page revisions for some reason. I can't find the new logo image here or on Commons. What is going on? SuperPianoMan9167 (talk) 15:25, 10 August 2025 (UTC)
- The logo is actually set in Wikipedia:WikiProject AI Cleanup/style.css as the
background-image
property of theheader_image
class. In Special:Diff/1189363698/1305145654, Waddie96 changed the logo from the robot to the brain. In Wikipedia:WikiProject AI Cleanup, the codetitle="File:WikiProject AI Cleanup.svg"
provides advisory information and does not set the background image. Waddie96, would you like to comment on the logo change? — Newslinger talk 15:46, 10 August 2025 (UTC) Fixed class name. — Newslinger talk 15:51, 10 August 2025 (UTC)- I thought it looked better, what do you think? The other icon is from Codex and it represents bots in currently in Wikimedia UI production and in future. So better not to overlap. waddie96 ★ (talk) 15:49, 10 August 2025 (UTC)
- I like it! Thanks to Newslinger for explaining how the image works (I didn't know it was set through the page CSS). The old logo is still used in a bunch of places though if you want to change them. SuperPianoMan9167 (talk) 15:54, 10 August 2025 (UTC)
- I like it as well, tho I feel like it might give the wrong vibes for some (if you look at it long enough it feels like it is encouraging Cyborg behavior, not necessarily stopping it -- Maybe we need a mop somewhere in the mix?) Sohom (talk) 16:30, 10 August 2025 (UTC)
- Yeh sorry about that, I should have tidied up after myself when I was happy how it looked. waddie96 ★ (talk) 19:33, 10 August 2025 (UTC)
- Maybe we could add a magnifying glass like the one in the old logo. SuperPianoMan9167 (talk) 20:58, 10 August 2025 (UTC)
- Read the icon style guidelines at Codex. And then let me know! waddie96 ★ (talk) 21:12, 10 August 2025 (UTC)
- I like it as well, tho I feel like it might give the wrong vibes for some (if you look at it long enough it feels like it is encouraging Cyborg behavior, not necessarily stopping it -- Maybe we need a mop somewhere in the mix?) Sohom (talk) 16:30, 10 August 2025 (UTC)
- I don't have a strong preference about the WikiProject logo, but I should say that I picked File:OOjs UI icon robot.svg for the icon in {{Collapse AI top}} because it appears to be the image that the robot logo File:WikiProject AI Cleanup.svg was derived from. If the WikiProject logo is changed, then it might make sense to change the icon of {{Collapse AI top}} as well. — Newslinger talk 15:59, 10 August 2025 (UTC)
- Yep exactly. Sorry I haven't had time to change.; waddie96 ★ (talk) 19:36, 10 August 2025 (UTC)
- I like it! Thanks to Newslinger for explaining how the image works (I didn't know it was set through the page CSS). The old logo is still used in a bunch of places though if you want to change them. SuperPianoMan9167 (talk) 15:54, 10 August 2025 (UTC)
- I thought it looked better, what do you think? The other icon is from Codex and it represents bots in currently in Wikimedia UI production and in future. So better not to overlap. waddie96 ★ (talk) 15:49, 10 August 2025 (UTC)
Which you all interested in?
waddie96 ★ (talk) 21:07, 10 August 2025 (UTC)
- None, I prefer the old robot. A brain is a symbol of intelligence, while the current state of "AI" are unintelligent predictive models. I don't think we should conflate the two and further feed into the misconception that these models are intelligent systems. fifteen thousand two hundred twenty four (talk) 21:15, 10 August 2025 (UTC)
- My main issue with the new logo is that it doesn't convey the idea of cleanup, only the "AI" part, and that adding a magnifying glass on top would make it look more crowded.In terms of colors, going for a blue color scheme could lead to confusion with blue links, although having something a bit more vibrant than the current black-and-grey tones would be neat. Maybe purple/magenta? Chaotic Enby (talk · contribs) 21:17, 10 August 2025 (UTC)
- I wonder if we could remix https://thenounproject.com/icon/cleaning-7652032/ overlayed with the AI image above and have a icon that way? Sohom (talk) 21:22, 10 August 2025 (UTC)
- The details in that image makes it a bit too much in my opinion, especially with the proposed logo above. Maybe a magnifying glass like before? (Even with a magnifying glass, we might need something like a color distinction between them to make it visually readable) Chaotic Enby (talk · contribs) 21:25, 10 August 2025 (UTC)
- I'm thinking, if we decide on a color palette for the whole project, assuming the base color is used for the title text around the logo, then we could have the main part of the logo (either the brain or robot) use the highlight color, and the magnifying glass use the base color for contrast. Chaotic Enby (talk · contribs) 21:40, 10 August 2025 (UTC)
- I wonder if we could remix https://thenounproject.com/icon/cleaning-7652032/ overlayed with the AI image above and have a icon that way? Sohom (talk) 21:22, 10 August 2025 (UTC)
- Now that I think about it, in addition to the points mentioned above, this logo is very similar to that of WikiProject Artificial Intelligence; to avoid confusion, I would actually prefer the old logo. SuperPianoMan9167 (talk) 21:23, 10 August 2025 (UTC)
- After reading Fifteen thousand two hundred twenty four's comment, I also believe that the brain icon has a mildly positive connotation (representing brainpower), while the robot icon has a mildly negative connotation (representing a failure
ofin the Turing test). Because of this, I now prefer for {{Collapse AI top}} to retain the robot icon, and I am concerned that the brain icon would project a message that is contrary to the goals of this WikiProject.{{WikiProject Artificial Intelligence}} was changed to use a blue brain icon (File:Icon AI brain blueshaded.svg) instead of its previous nodes icon (File:Hey Machine Learning Logo.png) in Waddie96's edit Special:Diff/1305146173. This change makes more sense for WikiProject Artificial Intelligence, which focuses on coverage of AI in article space, a very different focus than that of WikiProject AI Cleanup. — Newslinger talk 08:44, 11 August 2025 (UTC) Edited — Newslinger talk 09:14, 11 August 2025 (UTC)- Fully agree with that analysis. Chaotic Enby (talk · contribs) 10:43, 11 August 2025 (UTC)
- @Chaotic Enby Please be aware that I spent 10 minutes fixing up the file's licensing. It was about to be tagged for copyright infringement deletion. Please read Commons:Licensing before making any other derivative work of images licensed with free use tags but still require attribution (as they are not public ___domain. It's tricky wording, and frustrating I know. waddie96 ★ (talk) 13:26, 15 August 2025 (UTC)
- Thanks. To clarify, I did not upload the file or write the license myself (it was done by @Queen of Hearts). Additionally, the changes you made to the license were incorrect. Both File:Codex icon robot.svg and File:Codex_icon_search.svg were licensed under CC BY-SA 4.0, and so was File:WikiProject AI Cleanup.svg, so there was no need to switch it to MIT. If the license on the original files was incorrect, please change it there instead of just making changes to derivative files.Finally, this is not how copyright infringement works. If the file has the wrong license, then it will be usually tagged with something like {{Wrong license}} and fixed. There is no speedy deletion criterion for "forgot to properly give attribution". The closest are F3 (for derivative works of non-free content, which is obviously not the case here), and F5 (if the content is missing a source entirely, and with a warning and a grace period of seven days). Chaotic Enby (talk · contribs) 13:50, 15 August 2025 (UTC)
- @Chaotic Enby, @Queen of Hearts, Technically speaking @Waddie96 is not wrong, the onwiki files are marked under the wrong license (not sure why), the original files of the codex icons are indeed under the MIT license as per the LICENSE file for the source code. However, I see this as a simple fixable mistake and not a issue to ask for a copyright infringement deletion. That being said, @Waddie96, please mind your tone, at the moment you are coming off as condescending and combative, in suggesting copyright deletion and implying a inability to understand copyright. Sohom (talk) 14:03, 15 August 2025 (UTC)
- Thanks for the additional explanation. This is a bit of a confusing situation, as the README file indicates that the icons are under CC BY-SA 4.0. Should we conclude that they are automatically dual licensed? As I mentioned above, if that is the case, it could have been helpful to also make the change on the original icons to clarify the situation. Chaotic Enby (talk · contribs) 14:10, 15 August 2025 (UTC)
- Hmm, I did some digging around, and found phab:T383077#10433947, I think dual licensing it on wiki is the best way forward (since the package containing the icons is MIT, but the icons are also under CC-BY-SA (fun and confusing)). There is part of me that is freaking out about TheDJ's last comment since I agree, that by not linking to the icons (as they are used in our interfaces) we are kinda-sorta violating CC-BY-SA, but that's for the Codex team to figure out :) Sohom (talk) 14:23, 15 August 2025 (UTC)
- Edit: And I (kind of) misread, it's CC BY and not CC BY-SA. Chaotic Enby (talk · contribs) 14:29, 15 August 2025 (UTC)
- Thanks for spotting that @Sohom Datta waddie96 ★ (talk) 14:41, 15 August 2025 (UTC)
- I've request on Commons for a bot to reapply new license tag to all Codex icons. And to sort out the author and source fields, suggestions welcome. waddie96 ★ (talk) 20:42, 15 August 2025 (UTC)
- Hmm, I did some digging around, and found phab:T383077#10433947, I think dual licensing it on wiki is the best way forward (since the package containing the icons is MIT, but the icons are also under CC-BY-SA (fun and confusing)). There is part of me that is freaking out about TheDJ's last comment since I agree, that by not linking to the icons (as they are used in our interfaces) we are kinda-sorta violating CC-BY-SA, but that's for the Codex team to figure out :) Sohom (talk) 14:23, 15 August 2025 (UTC)
- Thanks for the additional explanation. This is a bit of a confusing situation, as the README file indicates that the icons are under CC BY-SA 4.0. Should we conclude that they are automatically dual licensed? As I mentioned above, if that is the case, it could have been helpful to also make the change on the original icons to clarify the situation. Chaotic Enby (talk · contribs) 14:10, 15 August 2025 (UTC)
- @Chaotic Enby, @Queen of Hearts, Technically speaking @Waddie96 is not wrong, the onwiki files are marked under the wrong license (not sure why), the original files of the codex icons are indeed under the MIT license as per the LICENSE file for the source code. However, I see this as a simple fixable mistake and not a issue to ask for a copyright infringement deletion. That being said, @Waddie96, please mind your tone, at the moment you are coming off as condescending and combative, in suggesting copyright deletion and implying a inability to understand copyright. Sohom (talk) 14:03, 15 August 2025 (UTC)
- Thanks. To clarify, I did not upload the file or write the license myself (it was done by @Queen of Hearts). Additionally, the changes you made to the license were incorrect. Both File:Codex icon robot.svg and File:Codex_icon_search.svg were licensed under CC BY-SA 4.0, and so was File:WikiProject AI Cleanup.svg, so there was no need to switch it to MIT. If the license on the original files was incorrect, please change it there instead of just making changes to derivative files.Finally, this is not how copyright infringement works. If the file has the wrong license, then it will be usually tagged with something like {{Wrong license}} and fixed. There is no speedy deletion criterion for "forgot to properly give attribution". The closest are F3 (for derivative works of non-free content, which is obviously not the case here), and F5 (if the content is missing a source entirely, and with a warning and a grace period of seven days). Chaotic Enby (talk · contribs) 13:50, 15 August 2025 (UTC)
- @Chaotic Enby Please be aware that I spent 10 minutes fixing up the file's licensing. It was about to be tagged for copyright infringement deletion. Please read Commons:Licensing before making any other derivative work of images licensed with free use tags but still require attribution (as they are not public ___domain. It's tricky wording, and frustrating I know. waddie96 ★ (talk) 13:26, 15 August 2025 (UTC)
- Fully agree with that analysis. Chaotic Enby (talk · contribs) 10:43, 11 August 2025 (UTC)
- Hmmm, listen it wouldn't be the end times at all if we reverted back. I made a WP:BOLD decision. If anyone independent wants to close the discussion with the outcome when it's reached its end, I'm happy either way per WP:BRD. waddie96 ★ (talk) 15:38, 11 August 2025 (UTC)
- I've undone the change pending discussion and consensus here. fifteen thousand two hundred twenty four (talk) 03:50, 12 August 2025 (UTC)
- After reading Fifteen thousand two hundred twenty four's comment, I also believe that the brain icon has a mildly positive connotation (representing brainpower), while the robot icon has a mildly negative connotation (representing a failure
- Prefer the old logo. Sorry - but the magnifying glass over a robot perfectly encapsulates the point of this Project, which is to scrutinise AI-generated text. qcne (talk) 09:00, 11 August 2025 (UTC)
- Emphatically prefer old logo. I won't repeat my last rant, but as others have said, we should absolutely not be giving in to identifying these speculative autocomplete technologies as intelligent, because that misunderstanding being made by other users is why this project has to exist in the first place.
- Also, at an aesthetic level, I'm not a huge fan of this half-brain half-positronics design. Not to be rude, but does this logo describe linear algebra algorithms that emit text, or would it be a good Cyborg t-shirt? Altoids0 (talk) 19:32, 14 August 2025 (UTC)
- @Altoids0 Lol this made me laugh because I'm not sure if you're actually being serious, or if you just joke really well over the Internet. waddie96 ★ (talk) 01:32, 16 August 2025 (UTC)
- I think I did mean it as a joke, apologies if I sounded a little testy. For what it's worth I definitely prefer these brainy designs over the typical butthole logos. Altoids0 (talk) 00:18, 17 August 2025 (UTC)
- No i thought you were seirously upset at AI. I was concerned haha. No stress waddie96 ★ (talk) 00:23, 17 August 2025 (UTC)
- I think I did mean it as a joke, apologies if I sounded a little testy. For what it's worth I definitely prefer these brainy designs over the typical butthole logos. Altoids0 (talk) 00:18, 17 August 2025 (UTC)
- @Altoids0 Lol this made me laugh because I'm not sure if you're actually being serious, or if you just joke really well over the Internet. waddie96 ★ (talk) 01:32, 16 August 2025 (UTC)
Possible new indicator of LLM usage? (broken markup)
editA draft article that I nominated for G15 speedy deletion has a very strange markup feature in it. The draft, Draft:Aleftina Evdokimova, was obviously generated by ChatGPT because of the "oai_citation" and "utm_source=chatgpt.com" codes, but it also has this strange markup in it attached to every reference, like the other codes:
({"attribution":{"attributableIndex":"1009-1"}})
The four-digit index increases going down the page. Are there any editors that are able to tell what this is? It seems like a possible sign of LLM output, but I'm not so sure of it yet. SuperPianoMan9167 (talk) 22:55, 10 August 2025 (UTC)
- I know Reddit isn't a reliable source since it is user-generated, but this post gives a pretty strong confirmation that this is another strange ChatGPT bug: [7] (Also, I realize now that this is just JSON.) SuperPianoMan9167 (talk) 23:00, 10 August 2025 (UTC)
- Looking for other pages with "attributableIndex", I couldn't find any, but, given your research, it is pretty likely to be a ChatGPT bug (although the post makes it a bit too early for GPT-5). It should probably be incorporated into Special:AbuseFilter/1346 sooner rather than later. Chaotic Enby (talk · contribs) 23:04, 10 August 2025 (UTC)
- I searched for it (as a "find this exact text" search) on Google and pretty much every result that has it also has Markdown and/or "oai_citation" in it. It definitely appears to be a ChatGPT quirk. SuperPianoMan9167 (talk) 23:08, 10 August 2025 (UTC)
- I just added it to WP:AISIGNS. Pinging @Sohom Datta to undelete the draft so it can be used as a documentation example there. Chaotic Enby (talk · contribs) 23:11, 10 August 2025 (UTC)
- User:Sohom Datta/attributeIndex should be a dump of the text! Sohom (talk) 23:12, 10 August 2025 (UTC)
- Thanks a lot! Also finding it used in this fr.wiki diff, although it isn't clear whether it was new text or an en.wiki translation. Chaotic Enby (talk · contribs) 23:14, 10 August 2025 (UTC)
- User:Sohom Datta/attributeIndex should be a dump of the text! Sohom (talk) 23:12, 10 August 2025 (UTC)
- I just added it to WP:AISIGNS. Pinging @Sohom Datta to undelete the draft so it can be used as a documentation example there. Chaotic Enby (talk · contribs) 23:11, 10 August 2025 (UTC)
- Added "attributableIndex" to 1346. Sam Walton (talk) 05:46, 11 August 2025 (UTC)
- I searched for it (as a "find this exact text" search) on Google and pretty much every result that has it also has Markdown and/or "oai_citation" in it. It definitely appears to be a ChatGPT quirk. SuperPianoMan9167 (talk) 23:08, 10 August 2025 (UTC)
- Looking for other pages with "attributableIndex", I couldn't find any, but, given your research, it is pretty likely to be a ChatGPT bug (although the post makes it a bit too early for GPT-5). It should probably be incorporated into Special:AbuseFilter/1346 sooner rather than later. Chaotic Enby (talk · contribs) 23:04, 10 August 2025 (UTC)
Complex and multifaceted
editThe search for "is a complex and multifaceted", a common AI turn of phrase, is giving a lot of hits. 2601:182:B00:2B10:4078:A7BF:64F:2B35 (talk) 20:08, 11 August 2025 (UTC)
- Hi -- for that you'll want to include quotes in the search query, "complex and multifaceted." Skimming the ~100 search results from that, I don't see anything that immediately jumps out to investigate, but it's always good to have people looking for this stuff! Gnomingstuff (talk) 20:29, 11 August 2025 (UTC)
Project design
editI've been trying out a small revamp of the top menu's design at User:Chaotic Enby/AI Cleanup, to give it a more polished style. Beyond that, I've been wondering if it could be a good thing to work on a common design language to give the project's pages a cleaner look, if anyone is interested. It will probably just be a color palette and maybe a few templates and page layouts, but the current project pages are a bit of a mess and it could be really worth it to make them visually cleaner. Chaotic Enby (talk · contribs) 14:19, 12 August 2025 (UTC)
An onslaught of seeming AI-generated additions to species articles
editApologies if you see this elsewhere; I'm also crossposting it to species-related projects.
I've been tagging a huge amount of seemingly AI-generated additions to articles about species. Some are "sourced," some not. I suspect that they are due to AI tools that will "write" an article based on provided sources and/or search results provided in a prompt. What seems to happen is that the AI, unable to generate text on a topic, speculates on what may be "likely." At first I considered that maybe it's a copy-paste template because there are a few users who are prolific with these, or perhaps a sockpuppet situation, but I've noticed similar text pattern in other topics as well.
Some examples:
- Diff 1:
While specific distribution data for *Amethysa basalis* is limited, members of the genus are generally found in tropical and subtropical zones.
This user has added many many edits like this (though they're not the only one). The asterisks indicate markdown formatting, a common AI tell. - Diff 2:
Shell Characteristics: While specific morphological details are limited, as a member of the Modiolus genus, it likely...
. A separate AI tell here is this puffery: "This inclusion highlights its relevance in studies of marine biodiversity in the South Atlantic region." Several drafts by the user have been declined for sources not matching text. - Diff 3:
Although specific conservation assessments for Halystina globulus are not available, deep-sea species in general are considered...
. Not also the "is essential" editorializing. - Diff 4:
this remains unconfirmed without direct access to the original description
andSpecific details about its depth range or precise localities within the Philippine region are not well-documented in available literature, suggesting a need for further research.
The "further research" editorializing is common. Note that this user's userpage also shows AI signs, like markdown link formatting. - Diff 5:
Specific morphological details about C. bialata are limited in the provided sources.
- Diff 6:
While specific measurements are not widely detailed, it shares general characteristics with other species...
. This user was blocked for ongoing LLM use. Note the plaintext "footnotes," also.
I could list a lot more but I really don't want to be here all day. Basically, we've been getting swamped with these edits for almost a year, it's worse than we thought, it shows no signs of stopping, and it is way too big for one person. and I'm not a biology expert by any means so I am of limited help doing anything but finding this stuff and flagging it to experts.
Anyway, wanted to bring this to your attention, hopefully people have bandwidth to help take it on. Please tag me if you have questions or remarks, or else I won't see it (because I am busy excavating slop).
Note to the future: I do not want a mass deletion campaign to be kicked off due to this and do not approve of any insults toward the authors involved. Please don't make me regret flagging this. Gnomingstuff (talk) 19:11, 12 August 2025 (UTC)
- Thanks for bringing this up. I've also seen it in a few articles recently, and it is very good that you flagged it for attention. I'm guessing we should add Specific details are limited/not available in WP:AISIGNS, and maybe to Special:AbuseFilter/1325 (although the sentence structure might be a bit too variable for that).If the syntax is too vague for the edit filter, we could unironically train a (very small) language model to learn these sentence structures and run it on recent changes. That could possibly be a more flexible tool than an edit filter to look for "tells" of AI-generated content, assuming we train it on specific tells like this (rather than something like GPTZero which compares AI-generated editorial prose with human-generated editorial prose, and completely misses the baseline of Wikipedia's writing style). Chaotic Enby (talk · contribs) 19:32, 12 August 2025 (UTC)
- Yep, that's one of the AI prose tells I've noted. There are a few more non-species examples at that link, like this ("provided search results") and this (contains a chatbot response)
- As far as an edit filter, this pattern isn't unheard of in older text (example from 2009, example from 2010) so I don't know. Gnomingstuff (talk) 19:46, 12 August 2025 (UTC)
- Regarding the edit filter part, it's about spotting them, not blocking them, so it should be fine – especially since this is bad prose either way. Chaotic Enby (talk · contribs) 19:48, 12 August 2025 (UTC)
- Here's another pattern, looks like chatbot output (and the user's other edits all but confirm it). Gnomingstuff (talk) 21:41, 12 August 2025 (UTC)
- I reverted that edit for two reasons, since is indeed chatbot output:
- It added prose to the "References" section, which should usually only contain {{reflist}} or bibliography-style references and nothing else
- Communication intended for the user (the chatbot is literally saying it searched for sources)
- SuperPianoMan9167 (talk) 22:29, 12 August 2025 (UTC)
- Another distinct pattern of possible chatbot output. Sorry for spam, I can take this to a separate page.
- I reverted that edit for two reasons, since is indeed chatbot output:
- I recently spent many hours cleaning up this kind of stuff on mosquito species articles. The pattern I saw was that the AI generates citations to legitimate publications, but those publications don't contain the claims made in the AI text, which appears to take characteristics that are generally true of all mosquitos and phrase them as though they were specific distinguishing features of a specific species ("A. Mosquito is distinguished by its biting behavior, making it a nuisance to humans and pets."). -- LWG talk 19:38, 12 August 2025 (UTC)
Yet another huge network of unreviewed LLM text
editUgh.
Earlier this year there was a sockpuppet investigation into a couple of users. The investigation brought up some suspected AI use and at least one hallucination. Turns out that combined, these users have made hundreds of edits, most of which are to extremely high profile articles (up to WP:VITAL level 3), all of them seeming to be LLM generated. Some of them are article text, a lot are image captions.
I've gone through and done a quick scan of the most important-seeming edits, and have tagged a lot of articles as a result, but I haven't reviewed every single edit because there are just so many of them. So if anyone else has time to take a look feel free (the ones I have reviewed are mainly the large diffs). In some cases the edits are pretty small, but I feel like having (justified) AI tags at the top of major articles is maybe not the worst idea in the world for awareness raising.... Gnomingstuff (talk) 18:54, 13 August 2025 (UTC)
- Could it be possible to ask editors adding {{AI-generated}} to at the very least verify if the added content qualifies as AI before flagging articles as such? For example, ignoring the paragraph that starts with "In emulation of...", how does a whole article warrants a warn because a person just added two images? (CC) Tbhotch™ 19:08, 13 August 2025 (UTC)
- Because they still contain facts that need verification, and can contain hallucinations (the "especially in 1946" that is inserted out of nowhere). Here, with the first image, several factual assertions are made in a short space: the image is indeed the Teatro de los Insurgentes, that it is specifically the facade, and that the mural on the facade is in fact a visual history. In this case the additions do seem to be factually accurate, but any use of AI essentially poisons the whole well of the edits. The AI-generated template does have a parameter to restrict it to specific sections, but with images that's not all that simple to do -- IMO it's probably more disruptive to have a bunch of section tags than one article tag. Gnomingstuff (talk) 22:36, 13 August 2025 (UTC)
- I agree. Any article that has been maliciously modified to become filled with unverified or dubious claims can reasonably get a banner, even if it's embarrassing for Wikipedia. How I even became aware of articles like Blues being contaminated was through your maintenance tag additions, as I ritualistically flip through the associated category.
- If Gnoming's work is demonstrative of anything, it's of a need for a specific maintenance template for generated captions. Altoids0 (talk) 07:16, 16 August 2025 (UTC)
- I find discussions like these very frustrating because they're often shooting the messenger. I (or anyone else adding templates) didn't suddenly put hundreds of new instances of LLM content into articles. They were there before. Now, they might actually get fixed instead of sitting around undetected for 5 or 10 or 19 years, getting cited in books, etc. It's especially frustrating when the edits were made by someone who already got blocked, for LLM use, yet no one took the time to go back and even tag (let alone fix) the contributions that they already decided were worth blocking over.
- Any embarrassment to Wikipedia is a feature, not a bug. The AI slop will remain whether we spot it or not, so readers might as well know about it. Gnomingstuff (talk) 17:36, 17 August 2025 (UTC)
- Because they still contain facts that need verification, and can contain hallucinations (the "especially in 1946" that is inserted out of nowhere). Here, with the first image, several factual assertions are made in a short space: the image is indeed the Teatro de los Insurgentes, that it is specifically the facade, and that the mural on the facade is in fact a visual history. In this case the additions do seem to be factually accurate, but any use of AI essentially poisons the whole well of the edits. The AI-generated template does have a parameter to restrict it to specific sections, but with images that's not all that simple to do -- IMO it's probably more disruptive to have a bunch of section tags than one article tag. Gnomingstuff (talk) 22:36, 13 August 2025 (UTC)
Georgia (country)
editLooking for input at Talk:Georgia (country)#AI additions Moxy🍁 20:13, 13 August 2025 (UTC)
Misuse of collapse template
editPlease see discussion at Wikipedia:Village pump (miscellaneous)#LLM accusations and non-native speakers and share your thoughts there. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 11:55, 16 August 2025 (UTC)
You are invited to join the discussion at Wikipedia:Village pump (idea lab) § Working towards a policy on generative AI, which is within the scope of this WikiProject. Chaotic Enby (talk · contribs) 00:50, 17 August 2025 (UTC)
ANI thread
editWikipedia:Administrators'_noticeboard/Incidents#Likely_undisclosed_AI/LLM_use_to_expand_articles_by_Covnantay is probably of interest to members of this Wikiproject. Potentially hundreds of articles affected by unvetted AI additions. Hemiauchenia (talk) 00:55, 17 August 2025 (UTC)
Discussion at Talk:Mark Karpelès § AI-generated frog portrait
edit You are invited to join the discussion at Talk:Mark Karpelès § AI-generated frog portrait, which is within the scope of this WikiProject. An editor has added an AI-generated cartoon portrait of a BLP, sourced from the website of the subject's current employer. I removed the image from the article citing WP:AIIMGBLP and WP:AIGI; the editor restored it and defended its inclusion on the basis that AI-generated images should be used when the subject is known to use them (i.e. not generated for the sake of having a photo in an article)
. Requesting input from uninvolved editors as to whether this constitutes a marginal case in which AI-generated imagery depicting BLPs is permissible. DefaultFree (talk) 04:01, 17 August 2025 (UTC)
Signs of AI writing but with a little more "je ne sais quoi"
editHi folks,
I am one of those folks who keeps wanting to participate in Wikipedia but I struggle to feel like I make meaningful contributions, stuff gets done wrong, I spiral and disappear. I'm not giving up though and I feel like I can genuinely help bring a French adaptation to this: https://en.wikipedia.org/wiki/Wikipedia:Signs_of_AI_writing
Is there anyone currently working on this? Should I proceed? Pinkythank (talk) 07:04, 17 August 2025 (UTC)
- No worries, even experienced users often get stuff wrong, you don't have to worry about this! Regarding the French version, French Wikipedia has a sister project to this one, fr:Projet:Observatoire des IA, which has been working on fr:Aide:Identifier l'usage d'une IA générative. That page is still much shorter than ours, and you can definitely help contribute to it! Chaotic Enby (talk · contribs) 11:58, 17 August 2025 (UTC)
Publifye AS
editI wasn't sure if this would be better off here or at Wikipedia:Reliable sources/Noticeboard, but...
Publifye AS (https://publifye.com/) is a self-publishing platform which makes extensive use of AI. This is explained in a disclaimer in the books I've viewed, though it is typically not part of the preview and is only noticeable if you search for "AI" within the book. Some of the authors are even listed as AI, eg: Corbin Shepherd, AI.
Search results for the authors are littered with links to vendors which don't work (eg), so I assume the likes of Amazon, Barnes & Noble, and Everand realised the works were AI generated and removed them.
However, Publifye AS's output is still on Google Books and from there has ended up in at least a dozen Wikipedia article (current search result is 11, and I removed a few earlier). Does anyone have any bright ideas to prevent the use of sources by this publisher? Richard Nevell (talk) 12:45, 24 August 2025 (UTC)
- I've removed what was left. My idea would be to create an edit filter for this and when that matches, it would display a warning to the user regarding this, and it would tag the edit so we can look at edits that have ignored this warning (similar to Special:AbuseFilter/869). Kovcszaln6 (talk) 13:48, 24 August 2025 (UTC)
- Good removals. An edit filter seems like a reasonable step to take here. Stepwise Continuous Dysfunction (talk) 04:50, 25 August 2025 (UTC)
- I've requested one. Kovcszaln6 (talk) 11:43, 27 August 2025 (UTC)
- Good removals. An edit filter seems like a reasonable step to take here. Stepwise Continuous Dysfunction (talk) 04:50, 25 August 2025 (UTC)
If my suspicions are correct, the edits made by user @Ivanisrael06 (Special:Contributions/Ivanisrael06) seem to be, entirely or in large part, generated by AI. This is, to me, somewhat apparent in the wording and tone of their contributions. What is more concerning than that is the very odd page formatting and citation style employed in almost all of their submissions here. While these style issues in and of themselves merit page revisions in most cases, I would definitely appreciate second opinions on the potential use of an LLM. ElooB (talk) 18:05, 24 August 2025 (UTC)
- Was coming here to ask for help for the exact same person..after seeing this that even has sources to Wikipedia. I have revered a few of these changes and I'm not seeing anything that's not AI generated..... We'll need people with some time on their hands to take a look at this. Moxy🍁 18:56, 24 August 2025 (UTC)
- Unusual citation style, plus the near-total absence of wikilinks in the text they add, to me are strongly suggestive of genAI. Worse, these weird references are to Wikipedia itself or to social media, see e.g. this diff. And some of the references are simply made-up, such as "Dhaka Tribune, 2025" in this diff, despite the absence of any Dhaka Tribune articles in the reference list. I think reversion of all these edits is in order. WeirdNAnnoyed (talk) 19:07, 24 August 2025 (UTC)
- The offending edits are all rolled back by now. I guess that should settle that. Ivan, if you read this, please refer to Moxy's message on your talk page for your future contributions on Wikipedia. ElooB (talk) 19:28, 24 August 2025 (UTC)
- thank you for your time and effort.Moxy🍁 19:57, 24 August 2025 (UTC)
a whole class worth of AI edits
editThis fall 2024 course seems to have outright encouraged students to use AI for their edits -- the students' userpages seem to have a lot of essays like this suggesting it's an actual assignment (and the page edits display the usual signs). I know AI edits have been an issue with student edits in general but this seems to be a much more centralized thing, so wanted to post it here in case other classes had something similar around this time.
There's a discussion on the education noticeboard about the current Wiki Ed AI guidelines, which this class predates. Gnomingstuff (talk) 20:01, 25 August 2025 (UTC)
Electrical injury
edit- Electrical injury (edit | talk | history | protect | delete | links | watch | logs | views)
- Johnknollpec (talk · contribs · deleted contribs · logs · filter log · block user · block log)
There is a persistent editor at Electrical injury who has been adding LLM content to this article. He was been warned multiple times on his talk page, and reverted at least twice: [8] [9]. The last time I reverted him he had added quite a few citations that were downright fakes, right down to made-up pubmed and doi numbers.
He just re-applied his edits again, and I don't have time or inclination to look at them. The changes are extensive. Some of them are ok, but he also removed a lot of sourced content, and has many MOS violations. I can't deal with this. Can someone else take a look? GA-RT-22 (talk) 10:12, 26 August 2025 (UTC)
- @GA-RT-22 This is probably Wikipedia:Administrators' noticeboard/Incidents worthy, especially with the fake citations. qcne (talk) 11:17, 26 August 2025 (UTC)
- After finding more examples of problematic content in the recent edits, I went ahead and restored the last good version of the article. -- LWG talk 11:29, 26 August 2025 (UTC)
- I would agree this is WP:AN/I material, because for an article such as this MEDRS sources would be expected. Posting LLM hallucinations is one thing when you're writing about pop music, and something else entirely when you're writing about medical topics. WeirdNAnnoyed (talk) 23:12, 26 August 2025 (UTC)