The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Status Code 404: But why?

    Status Code 404: But why?

    Intermediate & Advanced SEO
    18 4 1.4k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Greenman
      Greenman last edited by

      Google Web Master Tool reported me that I have several 404 staus code.,

      First they were 2, after 4..6 and 10, right now. Every time I add a new page.

      I've got a no CMS managed website. After old website was been deleted, I installed Wordpress, created new page and deleted and blocked (robots.txt) old page.

      Infact all page not found don't exist!!!  (Pic: Page not found).

      The strange thing is that no pages link to those 404 pages (All Wordpress Created page are new!!!). Seomoz doesn't  report me any 404 error (Pic 3)

      I controlled all my pages:

      • No "strange" link in any pages
      • No link reported by Seomoz tool

      Bu why GWMT reports me that one? How can I risolve that problem?
      I'm going crazy!!!

      Regards
      Antonio

      BgelG.png eCaDU.png ZIi2f.jpg

      1 Reply Last reply Reply Quote 0
      • AlanMosley
        AlanMosley last edited by

        If they were in a SE index, they will try them for some time before removing from index., i would not worry

        Greenman 1 Reply Last reply Reply Quote 0
        • Greenman
          Greenman @AlanMosley last edited by

          Hi Alan,

          404 not found pages are not indexed. My big problem is that I don't now where (and How) GWMT found source link (pages that link to not found page)

          AlanMosley Greenman 5 Replies Last reply Reply Quote 0
          • AlanMosley
            AlanMosley @Greenman last edited by

            From you own web page i would asume.

            i would suuggest that even that they are not in index, google is till trying, and that WMT is a bit behind. i have simular for links that i took down moths ago.

            1 Reply Last reply Reply Quote 0
            • Greenman
              Greenman @Greenman last edited by

              Yes, link come from my page. Bu I created new page by Wordpress (and deleted OLD website). So, there are NO link beetwen OLD and NEW pages. How GWMT can find a connection? Webpage Source Code HTML doesn't show any link to those page.

              1 Reply Last reply Reply Quote 0
              • AlanMosley
                AlanMosley @Greenman last edited by

                The 404's are from pages that used to be linked in your old site correct?, if so I suggest that google is still looking for them. Unless you changed your domain name this would be the reason

                1 Reply Last reply Reply Quote 0
                • Greenman
                  Greenman @Greenman last edited by

                  Sorry Alan,
                  but I think that Google can looking for old page yet. This is the reason:

                  I deleted old page form index by GMWT "remove url request"
                  I dissalowed old page by robots.txt

                  The problem is why Google find in NEW page links to OLD page.

                  1 Reply Last reply Reply Quote 0
                  • AlanMosley
                    AlanMosley @Greenman last edited by

                    I dont think they are, i thing they found them long ago, and no matterr if you block them, remove them of whatever, google take for ever to sort itsself out

                    1 Reply Last reply Reply Quote 0
                    • RyanKent
                      RyanKent last edited by

                      When did you change over to the WP site?

                      Today is October 1st and the most recent 404 error shared in your image is from 9/27. If you have made the changes after 9/27, then no new errors have been found since you made the change.

                      Since the moz report shows no crawl errors, your current site is clean assuming your site navigation allowed your website to be fully crawled.

                      The Google errors can be from any website. The next step is to determine the source of the link causing the 404 error. Using the 2nd image you shared, click on each link in the left column of your WMT report. For example, http://www.mangotano.eu/ge/doc/tryit.php shows 3 pages. Click on it and you should see a list of those 3 pages so you can further troubleshoot.

                      Greenman 1 Reply Last reply Reply Quote 0
                      • ShaMenz
                        ShaMenz last edited by

                        Hi Greenman,

                        I would guess that your problem is most likely caused by the fact that you have used the robots.txt method to block the pages you removed.

                        robots.txt is not the most reliable method of ensuring that pages are not indexed. Even though robots.txt tells bots not to crawl a page, Google has openly stated that if a page is found through an external link from another site, they can be crawled and indexed.

                        The most effective way to block pages is to use the noindex meta tag.

                        So, the simple answer is that there are links out there which still point to your old pages. Just because links are not highlighted in OSE or even Google WMT, does not mean that they don't exist. WMT should provide you with the most accurate link information, but even that is not necessarily complete according to Google.

                        Don't forget that there may also be "links" out there in the form of bookmarks or favorites that people keep in their browsers. When clicked these will also generate a 404 response from your server.

                        If you know that you have removed pages you should add 301 redirects to send any traffic to another relevant page. If you do not know the URL's of the pages that have been removed the best way to stop them from returning 404's is to add a catch-all 301 redirect so that any request for a page that does not exist is redirected to a single page. Some people send all of this traffic to the home page, but my preference would be to send it to a custom designed 404 or a relevant category page.

                        Hope that helps,

                        Sha

                        AlanMosley Greenman 2 Replies Last reply Reply Quote 2
                        • AlanMosley
                          AlanMosley @ShaMenz last edited by

                          My gut feeling is that a catch al 301, is not a good thing, I cant give you any evidence, just a bit of reasoning and gut feeling.

                          I always try to put myself in the SE shoes, would i think a lot of 301's pointing to one not relavant page is natual? and would it be hard to detect? I would answer No and No. Although i used to do it to my home page a while ago, I guess i had a different gut feeling back then

                          1 Reply Last reply Reply Quote 1
                          • Greenman
                            Greenman @RyanKent last edited by

                            Hi Ryan,

                            I installed WP in last days of august.

                            Yes... you can see the screenshot about Link Source Pages.
                            All new pages don't have /it/ - /en/ - /fr/ etc. So, all source page are old page.
                            Even if new homepage is there.

                            E8cSI.jpg cZYWB.jpg u8Tqb.jpg MmsYA.jpg

                            RyanKent 1 Reply Last reply Reply Quote 0
                            • Greenman
                              Greenman @ShaMenz last edited by

                              Hi Sha, thanks for your answer.

                              1.** robots.txt is not the most reliable method of ensuring that pages are not indexed**

                              If you use tag noindex, spider will acces to your page but it will not get enough information. So, page will be semi-indexed.

                              My old pages ware been removed, no indexed (by robots) and I sent remove request to Google. No problem with that, no result on the SERP.

                              2.  So, the simple answer is that there are links out there which still point to your old pages...does not mean that they don't exist.

                              You can see by screenshot the link's source: just my old "ghost" pages. No other sources.

                              3. If you know that you have removed pages you should add 301 redirects to send any traffic to another relevant page.

                              How can I add a 301 redirect to a page that doesn't exit? 
                              Old page -> 301 -> New page (Home?). But Old page doesn't exit in Wordpress!!!

                              **I don't want stop 404, I want remove link that bring to deleted pages. **

                              RyanKent Greenman ShaMenz 4 Replies Last reply Reply Quote 0
                              • RyanKent
                                RyanKent @Greenman last edited by

                                Greenman,

                                As I suspected many of the dates of the bad URLs are old, some even being from 2010. I took a look at your home page specifically checking for the URL you highlighted in red on the 4th image. It is not present.

                                My belief is your issue has been resolved by the changes you made. I recommend you continue to monitor WMT for any NEW errors. If you see any fresh dates with 404, that would be a concern which should be investigated. Otherwise the problem appears to be resolved.

                                I also very much support Sha's reply above.

                                1 Reply Last reply Reply Quote 1
                                • RyanKent
                                  RyanKent @Greenman last edited by

                                  Greenman,

                                  I have a simple rule I learned over time. NEVER EVER EVER EVER use robots.txt unless there is absolutely no other method possible to achieve the required result. It is simply bad SEO and will cause problems. The best robots.txt file is a blank one.

                                  When you use CMS software like WP, then it is required for some areas but it's use should be minimized.

                                  How can I add a 301 redirect to a page that doesn't exit?

                                  You have presented 404 errors. Those errors are links TO pages which don't exist, correct? If so, I believe Sha was recommending you can create a 301 redirect from the page which does not exist, to the most relevant page that does exist.

                                  It's a bit of semantics but if you chose to do such, you can create 301s from or to pages that don't exist.

                                  1 Reply Last reply Reply Quote 1
                                  • Greenman
                                    Greenman @Greenman last edited by

                                    Ryan,

                                    what you say is right: The best robots.txt file is a blank one. But I think, tell me I'm right, that robots.txt is better than noindex tag.
                                    You have presented 404 errors. Those errors are links TO pages which don't exist, correct? Yes.

                                    If so, I believe Sha was recommending you can create a 301 redirect from the page which does not exist...

                                    **Ok. But Do I need "real" page to create a 301 redirect?
                                    I deleted those one.

                                    So, to resolve my problem must i redirect old page to most relevant page?**

                                    1 Reply Last reply Reply Quote 0
                                    • RyanKent
                                      RyanKent @Greenman last edited by

                                      But I think, tell me I'm right, that robots.txt is better than noindex tag.

                                      Definitely not. The opposite is true.

                                      A no-index tag tells search engines not to index the page. The content will not be considered as duplicate anymore. But the search engines can still crawl the page and follow all the links. This allows your PR to flow naturally throughout your site. This also allows search engines to naturally read any changes in meta tags. A robots.txt disallow prevents the search engine from looking at any of the page's code. Think of it as a locked door. The crawler cannot read any meta tags and any PR from your site that flows to the page simply dies.

                                      Do I need "real" page to create a 301 redirect?

                                      No. Let's look at a redirect from both ends.

                                      Example 1 - you delete the /apples page from your site. The /apples page no longer exists. After reviewing your site you decide the best replacement page would be the /granny-smith-apples page. Solution: a 301 redirect from the non-existent /apples page to the /granny-smith-apples page.

                                      Example 2 - you delete the /apples page from your site. You no longer carry any form of apples but you do carry other fruit. After some thought you decide to redirect to the /fruit/ category page. Solution: a 301 redirect from the non-existent /apples page to the /fruit/ category page.

                                      Example 3 - you delete the /apples page from your site but you no longer carry anything similar. You can decide to let the page 404. A 404 error is a natural part of the internet. Examine your 404 page to ensure it is helpful. Ideally it should contain your normal site navigation, a site search field and a friendly "sorry the page you are looking for is no longer available" message.

                                      Since you asked about existence of redirected pages, you can actually redirect to a page that does not exist. You could perform a 301 from /apples to a non-existent /apples2 page. When this happens it is almost always due to user error by the person who added the redirect. When that happens anyone who tries to reach the /apples page will be redirected to the non-existent /apples2 page and therefore receive a 404 error.

                                      1 Reply Last reply Reply Quote 2
                                      • ShaMenz
                                        ShaMenz @Greenman last edited by

                                        Antonio,

                                        Ryan has explained this perfectly.

                                        For a more detailed explanation of methods for controlling page indexing, you could read this post on Restricting Robot Access for Improved SEO

                                        It seems from your comments and questions about 301 redirects, that there is some confusion on how they work and why we use them.

                                        A 301 redirect is an instruction to the server which is most commonly done by adding a .htaccess file (if you are using an Apache server).

                                        The .htaccess file is read by the server when it receives a request to serve any page on the site. The server reads each rule in the file and checks to see if the rule matches the existing situation. When a rule matches, the server carries out the action required. If no rule matches, then the server proceeds to serve the reqested page.

                                        So, in Ryan's first example above, there would be a line of code in the .htaccess file that basically says to the server IF the page requested is /apples, send the request to /granny-smith-apples using a 301 (Permanent) Redirect.

                                        The intent of using a 301 Redirect is to achieve two things:

                                        1. To prevent loss of traffic and offer the visitor an alternative landing page.
                                        2. To send a signal to Search Engines that the old page should be removed from the index and replaced with the new page.

                                        The 301 Redirect is referred to as Permanent for this reason. Once the 301 Redirect is recognized and acted upon by the search engine, the page will be permanently removed from the index.

                                        In contrast, the request to remove a page via Google WMT is a "moment in time" option. The page can possibly be re-indexed because it is accessible to crawlers via an external link from another site (unless you use the noindex meta tag instead of robots.txt). Then you would need to resubmit a removal request.

                                        I hope this makes clearer the reasons for my response - basically, the methods you have used are not "closing the door" on the issue, but leaving the possibility open for it to occur again.

                                        Sha

                                        1 Reply Last reply Reply Quote 0
                                        • 1 / 1
                                        • First post
                                          Last post
                                        • 404 or 410 status code after deleting a real estate listing
                                          MikeGracia
                                          MikeGracia
                                          0
                                          4
                                          145

                                        • Of the two examples of markup (microdata, schema) code below, which of the two is better designed for its purpose of Q&A, and what might be suggested to improve upon these lines of code (context: questions and answers within article content.
                                          RedFrog
                                          RedFrog
                                          0
                                          3
                                          1.8k

                                        • Confused about SEO for Forums (Status Code 200)
                                          kentien93
                                          kentien93
                                          0
                                          3
                                          156

                                        • Is it a problem to use a 301 redirect to a 404 error page, instead of serving directly a 404 page?
                                          lcourse
                                          lcourse
                                          0
                                          5
                                          2.0k

                                        • Why would our server return a 301 status code when Googlebot visits from one IP, but a 200 from a different IP?
                                          danatanseo
                                          danatanseo
                                          0
                                          16
                                          1.4k

                                        • HTTP Status Bad Request - 404, but also, add a 400 HTTP Status in certain circumstances?
                                          OrionGroup
                                          OrionGroup
                                          0
                                          4
                                          312

                                        • Wordpress No 404
                                          LukeHutchinson
                                          LukeHutchinson
                                          0
                                          5
                                          231

                                        • Experience with 307 HTTP status code
                                          sesertin
                                          sesertin
                                          0
                                          2
                                          522

                                        Get started with Moz Pro!

                                        Unlock the power of advanced SEO tools and data-driven insights.

                                        Start my free trial
                                        Products
                                        • Moz Pro
                                        • Moz Local
                                        • Moz API
                                        • Moz Data
                                        • STAT
                                        • Product Updates
                                        Moz Solutions
                                        • SMB Solutions
                                        • Agency Solutions
                                        • Enterprise Solutions
                                        • Digital Marketers
                                        Free SEO Tools
                                        • Domain Authority Checker
                                        • Link Explorer
                                        • Keyword Explorer
                                        • Competitive Research
                                        • Brand Authority Checker
                                        • Local Citation Checker
                                        • MozBar Extension
                                        • MozCast
                                        Resources
                                        • Blog
                                        • SEO Learning Center
                                        • Help Hub
                                        • Beginner's Guide to SEO
                                        • How-to Guides
                                        • Moz Academy
                                        • API Docs
                                        About Moz
                                        • About
                                        • Team
                                        • Careers
                                        • Contact
                                        Why Moz
                                        • Case Studies
                                        • Testimonials
                                        Get Involved
                                        • Become an Affiliate
                                        • MozCon
                                        • Webinars
                                        • Practical Marketer Series
                                        • MozPod
                                        Connect with us

                                        Contact the Help team

                                        Join our newsletter
                                        Moz logo
                                        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                                        • Accessibility
                                        • Terms of Use
                                        • Privacy