Manage category pages and duplicate content issues
-
Hi everybody,
I am now auditing this website www.disfracessimon.com
this website has some issues with canonicals and other things. But right now I have found something that I would like to know your opinion.When I was checking parts of the content in google to find duplicate content issues I found this:
I google I searched: "Chaleco de streck decorado con botones" and found
First result: "Hombre trovador" is the one I was checking -> Correct
The following results are category pages where the product is listed in.I was wondering if this could cause any problem related with duplicated content.
Should I no index category pages or should I keep it?
The first result in google was the product page. And category pages I think are good for link juice transfer and to capture some searchs from Google.Any advice?
Thank you
-
Didn't you notice that the paginated categories do not have the rel prev/next implemented? That could explain why page 2 and similar are indexed.
In the specific case of a site like yours, I would recommend to implement the rel prev/next (http://googlewebmastercentral.blogspot.com.es/2011/09/pagination-with-relnext-and-relprev.html), because the only indexable paginated lists are main categories (eg: Adultos, Infantiles, Halloween...), while the subcategories and facets cannot be indexed because of the use of the hashbang ("#") in their URLs.
In general, though, if a site presents hundreds if not thousand of URLs as consequence of pagination (category > subcategory > sub-subcategory / indexable facets), then I would look first the logs so to see if the bots are spending big part of their time crawling those paginated URLs and not so much the product pages.
If that was the case, I usually suggest to go for a mixed strategy:
- Let index the main categories, implementing the rel/prev tags;
- Let index only the first page (the organic landing page) of all the others paginated lists, not using the rel prev/next and using the "noindex,nofollow" meta robots from page 2+
-
Thank you Gianluca for your time and answer!
