Content deleted Content added
No edit summary |
spam |
||
Line 69:
== Legal ==
When scraping websites and services the legal part is often a big concern for companies, for web scraping it greatly depends on the country a scraping user/company is from as well as which data or website is being scraped. With many different court rulings all over the world.<ref>{{cite web|url=http://blog.icreon.us/advise/web-scraping-legality|title=Is Web Scraping Legal? |publisher=Icreon (blog)}}</ref><ref>{{cite web|url=https://arstechnica.com/tech-policy/2014/04/appeals-court-reverses-hackertroll-weev-conviction-and-sentence/|title=Appeals court reverses hacker/troll "weev" conviction and sentence [Updated]|website=arstechnica.com|date=11 April 2014
However, when it comes to scraping search engines the situation is different, search engines usually do not list intellectual property as they just repeat or summarize information they scraped from other websites.
Line 83:
==External links==
{{toomanylinks}}
* [https://scrapy.org/ Scrapy] Open source python framework, not dedicated to search engine scraping but regularly used as base and with a large number of users.
* [http://scraping.compunect.com Compunect scraping sourcecode] - A range of well known open source PHP scraping scripts including a regularly maintained Google Search scraper for scraping advertisements and organic resultpages.
Line 89 ⟶ 90:
* [http://simplehtmldom.sourceforge.net/ PHP Simpledom] A widespread open source PHP DOM parser to interpret HTML code into variables.
*[https://serpapi.com/ SerpApi] Third party service based in the United States allowing you to scrape search engines legally.
[[Category:Search engine software]]
|