Interwiki bot

This is an archived version of this page, as edited by Shizhao (talk | contribs) at 12:31, 6 January 2004. It may differ significantly from the current version.

This is a place to discuss about updating interwiki links by using a bot.

There is an interwiki bot coded in python by Rob W.W. Hooft (c) 2003. It request an article, parse interwikis and check if the other wikis are up to date.

From the .py file:

Script to check language links for general pages. This works by downloading the page, and using existing translations plus hints from the command line to download the equivalent pages from other languages. All of such pages are downloaded as well and checked for interwiki links recursively until there are no more links that are encountered. A rationalization process then selects the right interwiki links, and if this is unambiguous, the interwiki links in the original page will be automatically updated and the modified page uploaded.

The robot tries to be soft on the wikipedia server: it uses the new Special:Export feature to get more than one page in one request, and it waits between requests. The robot will (unless changed) not modify more than one page per minute.

This robot has a growing operator community.

Before being used on a wikipedia, its community should authorise such a bot to update the wiki. it can be tested on a small number of pages to demonstrate its purpose.

The best thing is to have an operator for each language that wants to run the robot.

Some statistics:

        articles     interwiki links   average number
 en:     189k             71k             0.4
 de:      43k             75k             1.7
 fr:      22k             60k             2.7
 ja:      23k             33k             1.4
 nl:      17k             51k             3.0
 pl:      18k             21k             1.2
 sv:      19k             18k             0.9
 da:      14k             31k             2.2
 es:      14k             53k             3.8
 eo:      10k             41k             4.1
 zh:       4k             12k             3.0

operators

According to the warning logs generated, eo, es need operators as well!

See Interwiki bot/Getting started for an attempt to get new operators started.

Operators of the bot, and in fact all people that want to discuss the implementation or the usage of this code, are welcome to join the mailing list pywikipediabot-users@lists.sourceforge.net by subscribing at the mailinglist information page

restricted usage

  • en: Andre Engels under username Robbot - only solve_disambiguation.py, not interwiki.py

completed

2004.1.2

I have completed a run of interwiki.py all pages on zh.wikipedia. The result is zipped up and available from:

http://www.cnblog.org/blog/lang040102.zip

Please run the appropriate file through.

af:169 Fixed
ar:98 Fixed
bs:73 Fixed
ca:3102 Fixed
cs:234 Fixed
cy:375 Fixed
da:1681
de:5045
el:172 Fixed
en:3637
eo:4609
es:3439
et:460 Fixed
fi:3743
fr:3463 Fixed
fy:833
gl:96 Fixed
he:612
hi:112 Fixed
hr:1504
hu:114
ia:316
it:4754
ja:3324
ko:444
la:564
lt:65 Fixed
ml:48 Fixed
ms:154
nd:342
nl:777
no:1548
oc:2618
pl:5547
pt:1219
ro:3908
ru:1223
si:1289
sl:3563
sq:4 Fixed
sr:386
sv:6393
tr:22 Fixed
vi:22 Fixed

old software:

li:3
lv:20 
sw:19
tt:40

--Shizhao 14:35, 2 Jan 2004 (UTC)