Revision as of 14:39, 17 March 2024 edit Citation bot (talk \| contribs) Bots 5,871,850 edits Alter: title, template type. Add: chapter-url, chapter, pmid, doi, authors 1-1. Removed or converted URL. Removed parameters. Some additions/deletions were parameter name changes. \| Use this bot. Report bugs. \| Suggested by Headbomb \| Linked from Wikipedia:WikiProject_Academic_Journals/Journals_cited_by_Wikipedia/Sandbox2 \| #UCB_webform_linked 133/686 ← Previous edit		Revision as of 16:03, 20 March 2024 edit undo OAbot (talk \| contribs) Bots 646,409 edits m Open access bot: hdl updated in citation with #oabot. Next edit →
Line 12: Data scraping is generally considered an ''[[ad hoc]]'', inelegant technique, often used only as a "last resort" when no other mechanism for data interchange is available. Aside from the higher [[computer programming\|programming]] and processing overhead, output displays intended for human consumption often change structure frequently. Humans can cope with this easily, but a computer program will fail. Depending on the quality and the extent of [[error handling]] logic present in the [[computer]], this failure can result in error messages, corrupted output or even [[program crash]]es. However, setting up a data scraping pipeline nowadays is straightforward, requiring minimal programming effort to meet practical needs (especially in biomedical data integration).<ref>{{Cite journal \|last=Glez-Peña \|first=Daniel \|date=April 30, 2013 \|title=Web scraping technologies in an API world \|url=https://academic.oup.com/bib/article/15/5/788/2422275 \|journal=Briefings in Bioinformatics \|volume=15 \|issue=5 \|pages=788–797\|doi=10.1093/bib/bbt026 \|pmid=23632294 \|hdl=1822/32460 \|hdl-access=free }}</ref> ==Technical variants<!--'Screen scraping' redirects here-->==

Data scraping: Difference between revisions