Blog scraping: Difference between revisions

Dangers: linked "splog"
Defense: remove section. not a howto. links contain information on defense
Additionally, scraped content can appear on any type of splog or [[RSS (file format)|RSS]]-fed spam site. An unsuspecting author could therefore find their creative or even copyrighted material showing up on a site promoting pornography or other content that would offend the author or their audience. This can damage the original author's reputation.
 
== Defense ==
 
Blog scraping software is becoming more "intelligent" over time. The "''smarter''" programs can bypass even determined countermeasures, but that should not stop you from taking a few simple steps to discourage the majority of scrapers, which are "''average/dumb''".
 
'''#1. Include a strongly worded copyright tag'''
 
The bottom of each of your original posts or articles should include a brief copyright notice. This is your first line of defense: it deters casual copiers, though not the most unscrupulous. It does little to stop the blog scraping software that spam marketers use. Copyright in most countries exists automatically, so the notice does not itself create your rights, but it makes your claim explicit if you go after the offending party.
 
'''#2. Use a summary feed for your business blog'''
 
Instead of sending the full content of your posts via RSS, change the setting of your blog software to use "summary" or "truncated" feeds. This may not stop the few "smart" scrapers out there, but it will help reduce the incidence of blog scraping, at least until scraping software makes its next evolutionary jump. Truncate with caution, however: summary feeds are repellent to many readers, who dislike being forced to load an additional webpage instead of reading the story in their news aggregator.
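
Most blog software builds the summary feed for you. As a rough illustration of what a "truncated" entry amounts to, the following Python sketch cuts a post down to its first few words. The function name <code>summarize</code>, the 50-word default and the <code>[...]</code> marker are assumptions made for this example, not settings of any particular blogging package.

```python
import html
import re


def summarize(post_html: str, max_words: int = 50) -> str:
    """Return a plain-text excerpt of at most max_words words.

    Hypothetical helper: real blog software exposes this as a feed setting.
    """
    # Replace tags with spaces so adjacent words do not run together.
    text = re.sub(r"<[^>]+>", " ", post_html)
    # Decode entities such as &amp; after the tags are gone.
    text = html.unescape(text)
    words = text.split()
    if len(words) <= max_words:
        return " ".join(words)
    # Mark the cut so readers know the entry continues on the site.
    return " ".join(words[:max_words]) + " [...]"
```

A scraper that republishes only the feed then gets the excerpt rather than the whole article.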
 
'''#3. If you ‘must’ use a full text feed, copyright protect it!'''
 
You can add a copyright footer to your RSS feed, which is easy to do with most major blogging software. You probably will not be able to do this with free blogging software such as Blogger, however.
 
WordPress users can install the "Feed Copywriter Plugin", which makes this process quick and easy.
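
For software without such a plugin, the idea can be sketched in a few lines of Python. The sketch assumes a plain RSS 2.0 document whose entries are <code>item</code> elements with an un-namespaced <code>description</code>. The function name <code>add_copyright_footer</code> and the notice text are made up for illustration and are not taken from any plugin.

```python
import xml.etree.ElementTree as ET


def add_copyright_footer(rss_xml: str, notice: str) -> str:
    """Append a copyright notice to the description of every feed item.

    Hypothetical helper; assumes un-namespaced RSS 2.0 <item>/<description>.
    """
    root = ET.fromstring(rss_xml)
    for item in root.iter("item"):
        desc = item.find("description")
        if desc is None:
            # Items without a description get one holding only the notice.
            desc = ET.SubElement(item, "description")
        existing = (desc.text or "").rstrip()
        desc.text = f"{existing}\n\n{notice}" if existing else notice
    # encoding="unicode" returns a str instead of bytes.
    return ET.tostring(root, encoding="unicode")
```

Because the notice becomes part of each entry, a scraper that copies the feed verbatim also copies the notice.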
 
'''#4. Place a hidden image in each entry'''
 
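Another strategy is to embed a hidden image, served from your own site, in each entry. When a scraped copy of the entry is displayed elsewhere, the reader's browser still requests the image from your server, and the referrer recorded in your logs points to the page that republished it. Scrapers are unlikely to notice the hidden image, so you can use your referrer logs to track offenders.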
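
The referrer check can be automated. The following Python sketch counts requests for the hidden image whose referrer is a host other than your own. It assumes the web server writes the common "combined" access log format, in which the referrer is the quoted field after the status code and response size. The function name <code>foreign_referrers</code>, the image path and the host names in the usage note are placeholders.

```python
import re
from collections import Counter
from urllib.parse import urlparse

# Matches the request, status, size and referrer fields of a
# combined-format access log line (an assumption about the server setup).
LOG_RE = re.compile(
    r'"(?:GET|HEAD) (?P<path>\S+)[^"]*" \d{3} \S+ "(?P<referrer>[^"]*)"'
)


def foreign_referrers(log_lines, image_path, own_hosts):
    """Count hits on image_path per referring host not listed in own_hosts."""
    hits = Counter()
    for line in log_lines:
        match = LOG_RE.search(line)
        if match is None or match.group("path") != image_path:
            continue
        # An absent referrer is logged as "-", which has no hostname.
        host = urlparse(match.group("referrer")).hostname
        if host and host not in own_hosts:
            hits[host] += 1
    return hits
```

Calling <code>foreign_referrers(open("access.log"), "/img/pixel.gif", {"myblog.example"})</code> would return a counter of outside hosts that displayed the image, each of which is a candidate for closer inspection rather than proof of scraping.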
 
== Helpful Links ==