Definition of Duplicate Content

September 29, 2020

What is Duplicate Content?

Duplicate content refers to the content of a web page or website that is reproduced identically, or almost identically, elsewhere on the Web. It is a problem for SEO because search engines detect and penalize the pages or sites affected by duplicate content.

Duplicate content amounts to copying and pasting content across different URLs

It can be textual content alone, such as a single paragraph, or textual content together with other elements across an entire page. When such content is republished on another URL, with little or no modification, it is considered duplicate content, and it is the search engine that makes this "judgment". There are two types of duplicate content.

The first concerns duplicate pages within the same site, and therefore on different URLs, caused either by the need to build separate desktop and mobile versions of the site or by a technical error or a mistake on the webmaster's part. In this case the contents are perfectly identical. This often happens on e-commerce sites with their product pages. The second concerns duplicate pages on different sites.

This second type can result from the redistribution of RSS feeds, from nearly identical descriptions of a similar product, or quite simply from plagiarism. It is a phenomenon that website owners greatly fear. Sometimes, however, duplicate content is intentional because it is necessary. In that case it is enough to point Google's crawler to the source content with the rel="canonical" tag, and the page designated as the original is the one that will be indexed.
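As a rough illustration of the canonical tag, the sketch below reads the rel="canonical" link out of a page's HTML using only Python's standard library. The page markup, the example.com URL, and the names CanonicalFinder and find_canonical are hypothetical and chosen for this example; it shows how the tag can be read, not how Google processes it.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collects the href of the first <link rel="canonical"> tag."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        # Only <link> tags matter, and only the first canonical one is kept.
        if tag != "link" or self.canonical is not None:
            return
        attr_map = dict(attrs)
        # rel may hold several space-separated values; a missing rel is None.
        rel_values = (attr_map.get("rel") or "").lower().split()
        if "canonical" in rel_values:
            self.canonical = attr_map.get("href")


def find_canonical(html):
    """Return the canonical URL declared in the HTML, or None if absent."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


# Hypothetical duplicate page that points at its original.
duplicate_page = """
<html><head>
  <title>Blue widget (print version)</title>
  <link rel="canonical" href="https://example.com/products/blue-widget">
</head><body>Same product text as the original page.</body></html>
"""

print(find_canonical(duplicate_page))
```

A check like this could be run over intentionally duplicated pages to confirm that each one declares the URL of its original.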

Duplicate content harms a page's ranking on a search engine

You should first know that, except in the most severe cases, duplicate content does not prevent Google from indexing the pages concerned. Google is simply trying not to over-rank a site by counting the same content multiple times.

Pages classified as duplicate content lose positions in the SERPs or are even removed from search results. An original page can also be pushed down in favor of the content thief when the latter's PageRank is stronger. Beyond the two types of duplicate content, there are three cases.

The first concerns strictly identical pages. In this case, only the one with the highest PageRank will be indexed.

The second concerns pages that are similar but differ in their Title and Description tags. In this case, all the pages will be indexed, but those not considered the original will appear in the SERPs only after clicking "repeat the search with the omitted results included".

The third concerns different pages that share the same Title and Description tags. In this case, Google can go so far as not to index the pages it considers duplicates. The rules to remember are: one page = one distinct URL, and, for intentionally duplicated content, put the URL of the original page in the canonical tag.
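The three cases above can be sketched as a small check that a site owner might run on pairs of their own pages. This is only an illustration under stated assumptions: the Page structure, the classify_pair function, the sample pages, and the 0.9 similarity threshold are all hypothetical, and the comparison uses Python's difflib rather than anything Google is known to use.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class Page:
    url: str
    title: str
    description: str
    body: str


def normalize(text):
    """Lowercase the text and collapse runs of whitespace."""
    return " ".join(text.lower().split())


def body_similarity(a, b):
    """Return a 0..1 similarity ratio between two normalized page bodies."""
    matcher = SequenceMatcher(None, normalize(a.body), normalize(b.body),
                              autojunk=False)
    return matcher.ratio()


def classify_pair(a, b, threshold=0.9):
    """Map a pair of pages to one of the three cases, or 'distinct'.

    The threshold is an arbitrary assumption made for this sketch.
    """
    same_tags = (normalize(a.title) == normalize(b.title)
                 and normalize(a.description) == normalize(b.description))
    same_body = normalize(a.body) == normalize(b.body)
    if same_tags and same_body:
        return "case 1: strictly identical pages"
    if same_tags:
        return "case 3: same Title and Description on different pages"
    if body_similarity(a, b) >= threshold:
        return "case 2: similar pages with different Title and Description"
    return "distinct"


# Hypothetical pages used only to exercise the function.
original = Page("https://example.com/a", "Blue widget",
                "A sturdy blue widget.",
                "This blue widget is sturdy and ships in two days.")
copy = Page("https://example.com/b", "Blue widget",
            "A sturdy blue widget.",
            "This  blue widget is sturdy and ships in two days.")
retitled = Page("https://example.com/c", "Widget in blue",
                "Buy the blue widget.",
                "This blue widget is sturdy and ships in two days!")
same_tags_only = Page("https://example.com/d", "Blue widget",
                      "A sturdy blue widget.",
                      "Our returns policy covers thirty days from delivery.")

for other in (copy, retitled, same_tags_only):
    print(other.url, "->", classify_pair(original, other))
```

Pairs that land in case 1 or case 3 are the ones where the rule "one page = one distinct URL" or a canonical tag would most plausibly apply.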

Muqaddas IT Solution