Blog scraping

"''Scraping''" means copying, or in the case of copyrighted material, stealing, content from a [[blog]] that the person running the scraping process does not own. The scraped content is often republished on [[Splog|spam blogs, or splogs]].
 
== Dangers ==
 
Copying copyrighted material without permission can violate copyright law. Legal issues aside, blog scraping also causes a number of practical problems for the person or business whose blog is scraped. It is a particular concern for business owners and business bloggers.
 
Sometimes a blog scraper copies an entire post from an independent or business blog. The duplicate then includes the author's tag, along with a link back to the author's site if the tag contains one.
 
Often, though, blog scrapers copy only the portion of the content that is keyword-relevant to their splog's topic.
 
The reason the more 'advanced' blog scrapers do this is simple. First, by copying only the content that is relevant to their splog's topic, they can increase the keyword relevancy of their site(s). Second, by not scraping the entire post, they eliminate any outbound links, which would reduce their search engine ranking.
 
Additionally, scraped content can appear on virtually any type of splog or [[RSS (file format)|RSS]]-fed spam site. An unsuspecting author could therefore find their creative or even copyrighted material on a site promoting pornography or other content that is offensive to the author or their audience. This can damage the original author's reputation.