Google crawler showing cache of another page
-
For the page http://www.thinkdigit.com/top-products/Laptops-and-PCs/top-10-laptops-124.php google is showing another page in cache (http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php). Please let me know how this happened and how to correct it.
-
Hi,
First things first, the page, http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php is not in Google's index.
Secondly, for both the phrases, 'top 10 laptops' and 'top 10 ultrabooks', the page,
http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php, ranks in the first position from your website, thinkdigit.com
So when you try to look-up the cache for a non-existing page in the index, Google tries to return the closest match and which is, http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php
I see a problem with the Sitemap.xml file for your site. Its not comprehensive and if you look at the cache of it in Google, you will see, the page, http://www.thinkdigit.com/top-products/Laptops-and-PCs/top-10-laptops-124.php is in there but its missing in the current Sitemap.xml file.
Here are three things you might do to make http://www.thinkdigit.com/top-products/Laptops-and-PCs/top-10-laptops-124.php in to the Google's index.
1. From Google webmaster tools account, Fetch as Google the above page and submit.
2. Come up with a comprehensive Sitemap.xml file
3. There is no reference to the Sitemap.xml file from Robots.txt file. You can add it as follows:
Sitemap: http://www.thinkdigit.com/sitemap.xml
You should be good after that. All the best to you my friend.
Regards,
Devanur Rafi
-
Your slightly actually incorrect Devanur, the reason the wrong page is cached is because the page previously had a canonical tag referencing the other page.
If you look at the cache of http://www.thinkdigit.com/top-products/Laptops-and-PCs/top-10-laptops-124.php :::: Click Me
You will see in the source code a canonical tag for the other page:
http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php" />
And the info at the top of cache page confirms Google is counting the one page as the other (see attachment)
-
Hi my friend, if you look at the cache of the URL you gave:
http://www.thinkdigit.com/top-products/Laptops-and-PCs/top-10-laptops-124.php :::: Click Me
You are actually looking at the source code of the following page:
http://www.thinkdigit.com/top-products/Ultrabooks/top-10-ultrabooks-153.php
To confirm this, look at the meta data in the source, it says Ultrabooks.
Now comes the issue where the rel=canonical implementation is incorrect on both the pages as they both point to themselves. Check out the source code of both the pages. Their rel=canonical attributes point to themselves. So as per my original explanation, Google is showing the cache of /top-10-ultrabooks-153.php for top-10-laptops-124.php which is the actual issue at hand. So when you look at the source code of cached page, you are actually looking at the source code of /top-10-ultrabooks-153.php page.
Best,
Devanur Rafi