The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Moz Pro
    4. Crawl Diagnostics - unexpected results

    Crawl Diagnostics - unexpected results

    Moz Pro
    6 2 427
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • nordichouse
      nordichouse last edited by

      I received my first Crawl Diagnostics report last night on my dynamic ecommerce site.

      It showed errors on generated URLs which simply are not produced anywhere when running on my live site. Only when running on my  local development server.

      It appears that the Crawler doesn't think that it's running on the live site.

      For example

      http://www.nordichouse.co.uk/candlestick-centrepiece-p-1140.html

      will go to  a Product Not Found page, and therefore Duplicate Content errors are produced.

      Running

      http://www.nhlocal.co.uk/candlestick-centrepiece-p-1140.html

      produces the correct product page and not a Product Not Found page

      Any thoughts?

      1 Reply Last reply Reply Quote 0
      • donford
        donford last edited by

        Hi nordichouse,

        You may want to check with your CMS provider. The urls are similar to Oscommerce which I'm experienced with, but I can see that isn't an Oscommerce setup. The system should have some sort of URL re-writer to deal with this problem.

        The issue that I see is the system actually doesn't care what you type in between .co.uk/ and -p-1140.html

        For example try this url to get a valid product..

        http://www.nordichouse.co.uk/nipple-clips-p-1000.html
        which is the same as
        http://www.nordichouse.co.uk/-p-1000.html
        But should 301 redirect to: http://www.nordichouse.co.uk/linen-style-collection-p-1000.html

        Oscommerce has a URL 301 re-writer that prevents the system for using incorrect URL's I would hope your system does as well.

        I'm not trying to avoid helping you, but the without an exact knowledge of how the system handles URL's it generates it is hard to troubleshoot, however since it is a CMS somebody who works on it should already have this knowledge.

        My best,

        Don

        nordichouse 1 Reply Last reply Reply Quote 0
        • nordichouse
          nordichouse @donford last edited by

          Thanks, Don

          You are right in your analysis - it is osC, but highly modified by myself. Yes, it does redirect.

          That, however, is not the point.  On the live site, the URL containing 1140 (for example) is never generated.

          The mystery is how the Crawler can find something that isn't there! Magic.

          Alan

          donford nordichouse 3 Replies Last reply Reply Quote 0
          • donford
            donford @nordichouse last edited by

            Did you ever have a product with the id of 1140? If you look at your products table just check the auto number in the product_id column..

            If you did and it was live at some point it could be finding the old product based on the old url it used to have.

            If you never made that product live then I don't know how a crawler could of found a product that doesn't exist unless they starting using some technology that I'm unaware of.

            Since you said you use OSC this what we use to deal with the problem I outlined above..

            Begin Ultimate SEO V2.2d

            Options +FollowSymLinks
            RewriteEngine On

            # RewriteBase instructions
              # Change RewriteBase dependent on how your shop is accessed as below.
              # http://www.mysite.com = RewriteBase /
              # http://www.mysite.com/catalog/ = RewriteBase /catalog/ 
              # http://www.mysite.com/catalog/shop/ = RewriteBase /catalog/shop/

            # Change the following line using the instructions above 
             
            RewriteBase /catalog/

            RewriteRule ^(.)-p-(.).html$ product_info.php?products_id=$2&%{QUERY_STRING}
            RewriteRule ^(.)-c-(.).html$ index.php?cPath=$2&%{QUERY_STRING}
            RewriteRule ^(.)-m-(.).html$ index.php?manufacturers_id=$2&%{QUERY_STRING}
            RewriteRule ^(.)-pi-(.).html$ popup_image.php?pID=$2&%{QUERY_STRING}
            RewriteRule ^(.)-by-(.).html$ all-products.php?fl=$2&%{QUERY_STRING}
            RewriteRule ^(.)-t-(.).html$ articles.php?tPath=$2&%{QUERY_STRING}
            RewriteRule ^(.)-a-(.).html$ article_info.php?articles_id=$2&%{QUERY_STRING}
            RewriteRule ^(.)-au-(.).html$ articles.php?authors_id=$2&%{QUERY_STRING}
            #RewriteRule ^(.)-pr-(.).html$ product_reviews.php?products_id=$2&%{QUERY_STRING}
            RewriteRule ^(.)-pri-(.).html$ product_reviews_info.php?products_id=$2&%{QUERY_STRING}
            RewriteRule ^(.)-f-(.).html$ faqdesk_info.php?faqdesk_id=$2&%{QUERY_STRING}
            RewriteRule ^(.)-fc-(.).html$ faqdesk_index.php?faqPath=$2&%{QUERY_STRING}
            RewriteRule ^(.)-fri-(.).html$ faqdesk_reviews_info.php?faqdesk_id=$2&%{QUERY_STRING}
            RewriteRule ^(.)-fra-(.).html$ faqdesk_reviews_article.php?faqdesk_id=$2&%{QUERY_STRING}
            RewriteRule ^(.)-i-(.).html$ information.php?info_id=$2&%{QUERY_STRING}
            RewriteRule ^(.)-links-(.).html$ links.php?lPath=$2&%{QUERY_STRING}
            RewriteRule ^(.)-pm-([0-9]+).html$ info_pages.php?pages_id=$2&%{QUERY_STRING}
            RewriteRule ^(.
            )-n-(.).html$ newsdesk_info.php?newsdesk_id=$2&%{QUERY_STRING}
            RewriteRule ^(.
            )-nc-(.).html$ newsdesk_index.php?newsPath=$2&%{QUERY_STRING}
            RewriteRule ^(.
            )-nri-(.).html$ newsdesk_reviews_info.php?newsdesk_id=$2&%{QUERY_STRING}
            RewriteRule ^(.
            )-nra-(.).html$ newsdesk_reviews_article.php?newsdesk_id=$2&%{QUERY_STRING}
            RewriteRule ^(.
            )-po-([0-9]+).html$ pollbooth.php?pollid=$2&%{QUERY_STRING}

            End Ultimate SEO V2.2d

            You may try it to see if it helps fix your issue.

            1 Reply Last reply Reply Quote 0
            • nordichouse
              nordichouse @nordichouse last edited by

              Don

              Yes, that is how it is done and there is no problem with that. The above is just how inbound URLs get processed.

              The issue here is how the crawler works. The only possible way for this particular URL to be generated is for a certain parameter to be appended to the URL - and that would be unusual (unless SEOmoz techies tell me different)

              Alan

              1 Reply Last reply Reply Quote 0
              • donford
                donford @nordichouse last edited by

                Hi Nordichouse,

                Sorry it took awhile for me to get back to you on this.

                I agree with  the SEOmoz techs, it doesn't matter if it is a crawler or a actual person, if you go to an invalid url you should be redirected as 301 to the actual page. If the product doesn't exist it should not allow for superfluous urls.

                So basically what you should have is if the product exist then the site redirects to the correct URL. If it doesn't exist then send any query for that product to the same page and display the oscommerce product not found message. By doing this you prevent the system from creating upteenthousand urls for each product.

                If you want to test what I mean you can visit our store a www.rubberstore.com/catalog and try a few urls like:

                catalog/nipple-clips-p-1000.html
                we don't have a product with the id of 1000 so you'll get redirected to the not found message and the root page
                -p-1000.html

                however if you try:
                catalog/a-fake-url-p-29.html
                you'll get redirected to our actual product page matching this product id.

                Hope that makes since. All this is done with the .htaccess url re-writter I posted above.

                1 Reply Last reply Reply Quote 0
                • 1 / 1
                • First post
                  Last post
                • 404 Crawl Diagnostics with void(0) appended to URL
                  Et.helAliceAmalina
                  Et.helAliceAmalina
                  0
                  7
                  709

                • 7,608 High Priority Crawl Diagnostic problems
                  emanbee
                  emanbee
                  0
                  5
                  277

                • Is it normal for Moz to report on nofollow pages in crawl diagnostics?
                  Chris.Menke
                  Chris.Menke
                  0
                  2
                  135

                • How to find page with the link that returns a 404 error indicated in my crawl diagnostics?
                  Tetruss
                  Tetruss
                  0
                  3
                  2.2k

                • Crawl Diagnostics - Crawling way more pages than my site has?
                  LodestoneGen
                  LodestoneGen
                  0
                  5
                  410

                • Crawl Diagnostics returning duplicate content based on session id
                  csingsaas
                  csingsaas
                  0
                  3
                  386

                • How to remove Duplicate content due to url parameters from SEOMoz Crawl Diagnostics
                  dfeg
                  dfeg
                  0
                  7
                  2.3k

                • SEOmoz crawl diagnostics report - what are the duplicate pages urls?
                  SteveOllington
                  SteveOllington
                  0
                  4
                  959

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy