|
© 2003-2007 Googlerankings.com
Googlerankings.com is in no way affiliated with or is the property of Google Inc. |
| Introduction About this guide Common issues Duplicate content in Google
Tools and services |
In order to battle off plagiarism and scraper sites, and also to provide higher quality search results, the Google index is applying a filter to sort out duplicates of web pages and other documents found on the web. The URLs that are judged to point to content that can be found on another URL as well, are being lowered in their importance, and eventually are turned into supplemental results or are dropped out of the index. Known issues Case 1, + Resolution: The immediate shutdown of the mirror site, and all copies of the content you have control of. Redirect visitors to the single copy that you wish to keep.
+ Resolution: You should not have an identical copy of any single web page, nor an entire web site on the web simultaneously to the original. In case you notice your web pages being plagiarized by a 3rd party, contact the webmaster and request its deletion. If the webmaster does not respond, contact the hosting company, the Internet Service Provider, or the Registrar directly, and report the problem to Google representatives through the Google Webmaster Tools control panel.
+ Resolution: Google does its best
to identify the patterns of good-faith duplicate content issues, such
as the www.example.com vs. the example.com versions of the same URL pointing
to a single web page. In certain cases however the algorithm can not decide
whether the duplicate content is spam, the result of erroneous inbound
links or of inconsistent navigation / parameters for the same URL. Case 4, + Resolution: To prevent such issues taking websites by surprise, you may set up a Google Alert at http://www.google.com/alerts for the domain name and inspect reports of any suspicious URLs that use its domain name as a part of the address, or bits of its unique content. Either way, you will need to identify the bot that requests the pages from the website and disallow any further copying of the content through your .htaccess settings. Read more on Hijacking.
Resources Google Webmaster Guidelines http://www.google.com/support/webmasters/bin/answer.py?answer=35769 Avoiding Duplicate Content penalties ( Elixir Systems ) Duplicate Content Issues (Yahoo & Google) ( Search Engine
Roundtable ) Duplicate Content - Get it right or perish ( Webmasterworld ) How do I prevent Googlebot from following links on my pages? (
Google Webmaster Help Center ) How do I tell Googlebot not to crawl a single outgoing link on a page?
( Google Webmaster Help Center ) |
Web site diagnostics Banned from Google
|