View Single Post
Old 07-22-2010, 10:28 PM  
yabate
Confirmed User
 
Industry Role:
Join Date: Jan 2010
Posts: 296
>>> JFGIT <<<

Lets see...

Quote:
Q: Do I have duplicate content? Am I being penalized for it? What should I do about it?
A: In most cases, having duplicate content on your site does not mean your site is penalized. Read this article right now. If you're still concerned or want to know more, read these articles (you're not the first person to ask about duplicate content!):
Sites.Google

Then....

Quote:
What is duplicate content?
Duplicate content generally refers to substantive blocks of content within or across domains that either completely match other content or are appreciably similar. Most of the time when we see this, it's unintentional or at least not malicious in origin: forums that generate both regular and stripped-down mobile-targeted pages, store items shown (and -- worse yet -- linked) via multiple distinct URLs, and so on. In some cases, content is duplicated across domains in an attempt to manipulate search engine rankings or garner more traffic via popular or long-tail queries.

What isn't duplicate content?
Though we do offer a handy translation utility, our algorithms won't view the same article written in English and Spanish as duplicate content. Similarly, you shouldn't worry about occasional snippets (quotes and otherwise) being flagged as duplicate content.

Why does Google care about duplicate content?
Our users typically want to see a diverse cross-section of unique content when they do searches. In contrast, they're understandably annoyed when they see substantially the same content within a set of search results. Also, webmasters become sad when we show a complex URL (example.com/contentredir?value=shorty-george〈=en) instead of the pretty URL they prefer (example.com/en/shorty-george.htm).

What does Google do about it?
During our crawling and when serving search results, we try hard to index and show pages with distinct information. This filtering means, for instance, that if your site has articles in "regular" and "printer" versions and neither set is blocked in robots.txt or via a noindex meta tag, we'll choose one version to list. In the rare cases in which we perceive that duplicate content may be shown with intent to manipulate our rankings and deceive our users, we'll also make appropriate adjustments in the indexing and ranking of the sites involved. However, we prefer to focus on filtering rather than ranking adjustments ... so in the vast majority of cases, the worst thing that'll befall webmasters is to see the "less desired" version of a page shown in our index.
Link to post and preventing methods

Then...

Quote:
Duplicate Content & Multiple Site Issues
More and more site owners are concerned that they might get penalized accidentally or overtly because of duplicate content. If you run mirror sites, will search engines ban you? If you have listings that are similar in nature, is that an issue? What happens if you syndicate content through RSS and feeds? Will other sites be considered the "real" site and rob you of a rightful place in the search results? This session looks at the issues and explores solutions.
Read about Agenda day and listen to speakers.

Then...

Read this:
Duplicate Content & Multiple Site Issues
by Greg Grothaus
Engineer, Google Search Quality
August 12, 2009 - SES San Jose

Quote:
The Duplicate Content "Penalty" Myth
In order to show you the most relevant results, we have omitted some entries very similar to the 20 already displayed.
If you like, you can repeat the search with the omitted results included.
"my site is not returned unless I click on the "In order to show you the most relevant results...."

"sounds like you've triggered a filter which has caused Google to believe your site/page is of low quality"

---------

"I get this [message] when I do a site: search ... I'm worried that they [google] have omitted [my pages] as they see them as duplicate content ... and thus will penalize."
Duplicate content generally refers to... Mostly, this is not deceptive in origin. Examples of non-malicious duplicate content could include:

* Discussion forums that can generate both regular and stripped-down pages targeted at mobile devices
* Store items shown or linked via multiple distinct URLs
* Printer-only versions of web pages

Google tries hard to index and show pages with distinct information. This filtering means, for instance, that if your site has a "regular" and "printer" version of each article, and neither of these is blocked in robots.txt or with a noindex meta tag, we'll choose one of them to list*.
*per query
Read slides! And listen about duplicate content And read whole post

Then...
Little or no original content
Quote:
One of the most important steps in improving your site's ranking in Google search results is to ensure that it contains plenty of rich information that includes relevant keywords, used appropriately, that indicate the subject matter of your content.

However, some webmasters attempt to improve their page's ranking and attract visitors by creating pages with many words but little or no authentic content. Google will take action against domains that try to rank more highly by just showing scraped or other auto-generated pages that don't add any value to users. Examples include:

* Thin affiliate sites: These sites collect pay-per-click (PPC) revenue by sending visitors to the sites of affiliate programs, while providing little or no value-added content or service to the user. These sites usually have no original content and may be cookie-cutter sites or templates with no unique content.
* Doorway pages: Pages created just for search engines
* Auto-generated content: Content generated programatically. Often this will consist of random paragraphs of text that make no sense to the reader but that may contain search keywords.
* Scraped content: Some webmasters make use of content taken from other, more reputable sites on the assumption that increasing the volume of web pages with random, irrelevant content is a good long-term strategy. Purely scraped content, even from high-quality sources, may not provide any added value to your users without additional useful services or content provided by your site. It's worthwhile to take the time to create original content that sets your site apart. This will keep your visitors coming back and will provide useful search results.
Post url

Then...

Handling legitimate cross-domain content duplication<<<--

Then...
__________________
Twistys Hot Babes
yabate is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote