Wikipedia:Bots/Requests for approval/Staeckerbot
Operator: Staecker
Automatic or Manually Assisted: Eventually automatic, manual during testing
Programming Language(s): python (pywikipedia)
Function Summary: nominate duplicate files for speedy deletion
Edit period(s) (e.g. Continuous, daily, one time run): Probably a daily run
Edit rate requested: I estimate no more than 100 edits per day. A few for each dupe, which seem to pop up a few times per hour (my own informal estimate).
Function Details: The bot will use Special:Newimages to find duplicate files. It seems that most dupes are uploaded within the same 20 minutes or so by inexperienced users who forget the name of their first upload, or want to change the name for other reasons. So the bot will just search the new images since its last run. The bot will first compare the file sizes (which are given in the images gallery, so require no download), and if two files with identical file size are found, the files are downloaded and compared directly.
When a duplicate is found, the bot will choose an orphaned copy and nominate it for speedy deletion, requiring a few edits. These will be the only edits made by the bot.
The bot does not detect scaled copies of files, or two "same" images in different file formats. Also no effort is made to root out old duplicate files which have existed on WP for a long time. Only new files are searched.
Since we only download size-duplicates, and will make very few edits, the server load should be negligible.
- This is my first bot, so I'd appreciate any constructive feedback- Staecker