Wikipedia:Bots/Requests for approval/ImageResizeBot: Difference between revisions
Content deleted Content added
q |
→Discussion: expand |
||
Line 40:
* Question: What do you think about an image such as [[:Image:Cannibalised.jpg]]? As it stands, it is just above your mark (600 × 601) (I agree, it's much too big in its current state), but shrinking it to thumbnail size makes it just below your mark (599 × 600), which seems like a minor difference. And then there are images like [[:Image:Aroundtheworld.jpg]] where the thumbnail seems to be the full size of the image. These are really just test cases to me, as I am just wondering how you plan on handling them (not an argument to ignore them in any way). - [[User:AWeenieMan|AWeenieMan]] ([[User talk:AWeenieMan|talk]]) 21:31, 17 March 2008 (UTC)
**There is probably some marginal difference in size where the costs exceed the minimal benefits. That particular example would shrink 1 pixel in each dimension, and that is a 0.33% change in total pixel count. If the images are to be reviewed for deletion of older revisions, that marginal difference that is appropriate is higher than otherwise. Does the software to be used indicate any level of noise/deterioration that it might produce by resizing? I'll be shocked if it does (certainly not in the marketing materials) but if so that could inform a benchmark. What are the other costs - data storage of additional versions (deleted versions presumably remaining in the deleted history), download/upload bandwith, bot edits in history, possible human review... Being over the 360K limit by 25% would when shrunk to 360K pixels shrink each dimension 10.56%. Being over by 10% would when shrunk to 360K pixels shrink each dimension 4.65%. <small>(1 - 1/sqrt(1+%over360K))</small> So 10% to 25% pixel count margins seem plausible to me - and the bot could flag these as "current version too large, please shrink to X by Y when the image is edited for another reason". [[User:GRBerry|GRBerry]] 22:14, 17 March 2008 (UTC)
***(ec) Well seeing those, I'll probably increase the minimal size to 400,000 square pixels. Basically that puts a bit of leeyway, and gets rid of the problems you mention. Now, the idea from here would be to do the resizing of the obvious cases, get those down to at least thumbnail size. As far as the costs, this bot is actually operating from the wikimedia toolservers, so all the queries are direct database queries, as far as getting that list. The actual resize requires that the bot download the image (to memory, not to hard disk), change the size, and upload the new image size. This is all done inside of [[RAM]]. The point at this stage is really to get the obvious violators, not split hairs, I did not realize that the smaller ones would result in a 1 pixel change. —— '''[[user:Eagle 101|<font color="navy">Eagle</font><font color="red">101]]'''</font><sup>[[user_talk:Eagle 101|Need help?]]</sup> 23:20, 17 March 2008 (UTC)
*From [[User:David Shankbone]], I understand that deleting the oversize version doesn't actually save any space on the servers, since the oversize version is still kept around (like a deleted article). If this theory is correct, has it been factored into the calculation of benefits? [[User:EdJohnston|EdJohnston]] ([[User talk:EdJohnston|talk]]) 23:16, 17 March 2008 (UTC)
**(ec x2)Correct, we know that, the point is to remove the high resolution version of the image. Using our non-free content policies means we use the smallest version we can use. —— '''[[user:Eagle 101|<font color="navy">Eagle</font><font color="red">101]]'''</font><sup>[[user_talk:Eagle 101|Need help?]]</sup> 23:20, 17 March 2008 (UTC)
====Logos====
I've recently discovered that when the Cat:Logos was converted to Cat:Non-Free Logos, the appropriate counter-cat Cat:Free Logos, for simple geometric shapes and words that can't be copyrighted, was not created. One estimate is that about 10% of the 70,000 logo images are actually free. Is there some what you could exclude this cat at first while I try and figure out how to sort and re-tag the logos? Could you give me an intersection of Large Images and Non-Free Logos to see how big an issue this is? '''[[User:MBisanz|<span style='color: #FFFF00;background-color: #0000FF;'>MBisanz</span>]]''' <sup>[[User talk:MBisanz|<span style='color: #FFA500;'>talk</span>]]</sup> 23:18, 17 March 2008 (UTC)
|