The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Moz Tools
    4. Crawl Diagnostics 2261 Issues with Our Blog

    Crawl Diagnostics 2261 Issues with Our Blog

    Moz Tools
    9 4 179
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Girlstuff
      Girlstuff last edited by

      I just recently signed up for MOZ, so much information.  I've done the walk through and will continue learning how to us the tools.  But I need your help.

      Our first moz crawl indicated 2261 issues (447 404's, 803 duplicate content, 11 502's, etc).  I've reviewed all of the crawls issues and they are linked to our Yahoo hosted WordPress blog.  Our blog is over 9 years old. The only issue that I'm able to find is our categories are not set up correctly.  I've searched for WordPress assistance on this topic and cant find any issues with our current category set up. Every category link that I click returns Nothing Found Apologies, but no results were found for the requested archive. Perhaps searching will help find a related post.

      http://site.labellaflorachildrensboutique.com/blog/

      Any assistance is greatly appreciated.

      1 Reply Last reply Reply Quote 0
      • MattRoney
        MattRoney last edited by

        Category pages actually turn up as duplicate content in Crawl Diagnostics _really _often. It just means that those categories are linked somewhere on your site, and the resulting category pages look almost exactly like all the others.

        Generally, I recommend you use robots.txt to block crawlers from accessing pages in the category directory. Once that's done and your campaign has re-crawled your site, then you can see how much of the problem was resolved by that one change, and consider what to do to take care of the rest.

        Does that make sense?

        CleverPhD 1 Reply Last reply Reply Quote 2
        • CleverPhD
          CleverPhD @MattRoney last edited by

          One wrinkle.  If the category pages are in Google and potentially ranking well - you may want to 301 them to consolidate them into a more appropriate page (if this makes sense) or if you want to get them out of the index, use a meta noindex robots tag on the page(s) to have them removed from the index, then block them in robots.txt.

          Likewise, you have to remove the links on the site that are pointing to the category pages to prevent Google from recrawling and reindexing etc.

          MattRoney 1 Reply Last reply Reply Quote 1
          • MattRoney
            MattRoney @CleverPhD last edited by

            Oh yeah, that's a great point! I've found that the category pages rarely rank directly, but you'll definitely want to double-check before outright blocking crawlers.

            Just to check my own understanding, CleverPhD, wouldn't crawlers avoid the category pages if they were disallowed by robots.txt (presuming they obey robots.txt), even if the links were still on the site?

            CleverPhD 1 Reply Last reply Reply Quote 0
            • CleverPhD
              CleverPhD @MattRoney last edited by

              Yes, the crawler will avoid the category pages if they are in robots.txt.  It sounded like from the question that this person was going to remove or change the category organization and so you would have to do something with the old URLs (301 or noindex) and that is why I would not use robots.txt in this case so that those directives can be seen.

              If these category pages had always been blocked using robots.txt, then this whole conversation is moo as the pages never got in the index. It is when unwanted pages get in the index that you potentially want to get rid of that things get a little tricky, but workable.

              I have seen issues where there are pages on sites that got into the index and ranking but they were the wrong pages and so the person just blocked with robots.txt.  Those URLs continued to rank and cause problems with the canonical pages that should be ranking.  We had to unblock, let Google see the 301, rank the new pages then put the old URLs back into robots to prevent the old URLs from getting back into the index.

              Cheers!

              MattRoney 1 Reply Last reply Reply Quote 1
              • MattRoney
                MattRoney @CleverPhD last edited by

                Awesome! Thanks for straightening it out. 🙂

                CleverPhD 1 Reply Last reply Reply Quote 0
                • CleverPhD
                  CleverPhD @MattRoney last edited by

                  One other thing I forgot.  This video by Matt Cutts

                  It explains why Google might show a link even though the page was blocked by robots.txt

                  https://www.youtube.com/watch?v=KBdEwpRQRD0

                  Google really tries not to forget URLs and this video reminds us that Google uses links not just for ranking, but discovery so  you really have to pay attention to how you link internally.  This is especially important for large sites.

                  1 Reply Last reply Reply Quote 1
                  • evolvingSEO
                    evolvingSEO last edited by

                    While what Matt and CleverPHD (Hi Paul!) have said is correct - here's your specific issue:

                    Your categories are loading with "ugly" permalinks like this: http://site.labellaflorachildrensboutique.com/blog/?cat=175 (that loads fine)

                    But you are linking to them from the bottom of posts with the "clean" URLs --> http://screencast.com/t/RIOtqVCrs

                    The fix is that Catgory URLs need to load with "clean" URLs and the ugly one should redirect to the clean one.

                    Possible fixes:

                    • Try updating wordpress (I see you're on a slightly older version)
                    • See if you .htaccess file has been modified (ask a developer or your hosting for help with this perhaps)

                    Found another linking issue:

                    This link to Facebook in your left sidebar --> http://screencast.com/t/EqltiBpM it's just coded incorrectly. It adds the current page URL so you get a link like this http://site.labellaflorachildrensboutique.com/blog/category/unique-baby-girl-gifts/www.facebook.com/LaBellaFloraChildrensBoutique instead of your Facebook page: http://www.facebook.com/LaBellaFloraChildrensBoutique

                    You can fix that Facebook link probably in Appearance->Widgets.

                    That one issue is causes about 200 of your broken URLs 🙂

                    CleverPhD 1 Reply Last reply Reply Quote 1
                    • CleverPhD
                      CleverPhD @evolvingSEO last edited by

                      Go Dan!

                      1 Reply Last reply Reply Quote 1
                      • 1 / 1
                      • First post
                        Last post
                      • Shopify crawl issues
                        SilentGorilla
                        SilentGorilla
                        0
                        5
                        629

                      • Crawl diagnostics up to date after Magento ecommerce site crawl?
                        Whebb
                        Whebb
                        0
                        2
                        218

                      • Crawl Diagnostics
                        Peterli
                        Peterli
                        0
                        2
                        242

                      • I have corrected the Problems in Crawl Diagnostics. When would it refresh/ re-crawl my site ?
                        VarunBansal
                        VarunBansal
                        0
                        3
                        475

                      • Crawl Diagnostic Errors
                        rosstaylor
                        rosstaylor
                        0
                        9
                        923

                      • Crawl Diagnostics Summary
                        RikkiD22
                        RikkiD22
                        0
                        3
                        769

                      • Crawl Diagnostics and missing meta tags on noindex blog pages
                        ShaMenz
                        ShaMenz
                        0
                        2
                        835

                      • "Issue: Duplicate Page Content " in Crawl Diagnostics - but these pages are noindex
                        AaronWheeler
                        AaronWheeler
                        0
                        2
                        1.1k

                      Get started with Moz Pro!

                      Unlock the power of advanced SEO tools and data-driven insights.

                      Start my free trial
                      Products
                      • Moz Pro
                      • Moz Local
                      • Moz API
                      • Moz Data
                      • STAT
                      • Product Updates
                      Moz Solutions
                      • SMB Solutions
                      • Agency Solutions
                      • Enterprise Solutions
                      • Digital Marketers
                      Free SEO Tools
                      • Domain Authority Checker
                      • Link Explorer
                      • Keyword Explorer
                      • Competitive Research
                      • Brand Authority Checker
                      • Local Citation Checker
                      • MozBar Extension
                      • MozCast
                      Resources
                      • Blog
                      • SEO Learning Center
                      • Help Hub
                      • Beginner's Guide to SEO
                      • How-to Guides
                      • Moz Academy
                      • API Docs
                      About Moz
                      • About
                      • Team
                      • Careers
                      • Contact
                      Why Moz
                      • Case Studies
                      • Testimonials
                      Get Involved
                      • Become an Affiliate
                      • MozCon
                      • Webinars
                      • Practical Marketer Series
                      • MozPod
                      Connect with us

                      Contact the Help team

                      Join our newsletter
                      Moz logo
                      © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                      • Accessibility
                      • Terms of Use
                      • Privacy