This is a list of pages with popup previews which are not as Nature intended. Please feel free to add more pages you find, and suggest fixes to my regexes if you're into that sort of thing.
Currently broken
- Escherichia coli - ideally,
close brackets ortreat brackets as one sentence - Heroin (not first sentence)
- Isis (h2/h3/par should get either all of that or h3/par; it just gets h3)
- Japan (italic intro text included)
- Now fixed in article source - the "—" was out of italics. Maybe paragraphs that include nothing but italicised text and/or interpunction should be excluded.
- Wikipedia:Browse - odd page, but just gives |}
- Great Googly Moogly - blockquotes are problematic (currently they're just deleted).
- Ichthys - what should we do with SVG images?
- User:Wiki_brah - I only get the first two words previewed while I'm set to receive the first 6 sentences. Maybe a problem with the exclamation marks?
- [Thanks for being the first person other than me to add to this page!] This is because we extract the first two pars only. Here, the second "sentence" doesn't end until the third par.
- Singular they - livepreview bug, methinks
- Tamil language - removing ^| lines is a bad idea.
- Whose line is it anyway? - question marks give in main link
- Japanese bobtail - redirect code doesn't like pipes
- Malaysia
- The Book of the Courtier
- University of Wisconsin
- Wikipedia:Wikipedia_Signpost/2005-09-12/Anti-vandal_tools - very long thin image needs to be tamed!
Special:Log and all other Special: pages- Israel Defense Forces - the 1st & 2nd images are located in a table ( {{Template:IDFTable}} ) and are skipped - the image displayed is actually the third, which is less relevant. I assume this happens with all similar pages.
Should work now
- Jelly fungi - taxobox detection needs refinement (ignore "taxobox begin/end placement" templates)
- User_talk:Lupin - my oh my. this is odd.
- Kingdom of Ireland - multiple italic indented paragraphs, which some non-italic and bold bits
- Singer (2 pars? why?)
- SGR - uses TLAdisambig, but not recognized as a disambig
- Transformer gives a table fragment
- England, Wales (table problems)
- Roe v. Wade small, problematic table
- User:Dbenbenn - image tag with square brackets
- American English - over-zealous pruning of italic pages
- Raja Yoga - doubtless another italics-induced calamity
- Osiris - maybe the italics hacks should come after html tag removal
- Politics
- Philosophy - looks like section headings needn't begin lines. or something
- 1968 in film - should kill divs
- Satan - i.e. is not a sentence ending. i.?e.|e.?g.|n.?b.|sic.|etc.| any more?
- United Kingdom
- Goat - taxobox :-( fix idea: delete blocks of lines
- that won't work - eg Deer. just do a special taxobox case (yuck)
- Mohammad Reza Shah Pahlavi
- Alexander the Great
(livepreview bug? not sure)'''{{template}}''' is a problem. hm. could replace templates with spaces instead of deleting? - Genetically modified food - blank. category tag gets in the way - delete not just images, but also category "links"
- Cult figure (linkus interruptus) - Dr., Mr. Mrs. Ms. don't usually end sentences
- Benedict Canyon - real paragraphs don't end in colons. delete blank lines after the first following after non-sentence-endings
- Roman Polański - a new section should break paragraphs too if there's interesting text preceding it. idea: "normalize" section breaks before paragraph split by insert blank line(s) before and removing line breaks afterwards
- Zoology - leading simple table
- Fahd of Saudi Arabia tables again - Template:Xxx begin...Template:Xxx end regex is SLOW.