Long Url but makes no sense
-
Hi Just joined.
Crawl states that I am getting a lot of errors, looks like the spider is getting confused and looping back on itself ?
Is there a way to see where the crawl was formulated (ie where from) ?
It is generating urls like:
http://www.wickman.net.au/wineauction/wine_auction_alert.aspx/auction/auction/auction/auction/auction/auction/Default.aspx from http://www.wickman.net.au/wineauction/wine_auction_alert.aspx
-
Hi Mark,
Within you Crawl Diagnostics you're able to export your data to CSV (on the topright side of the overview). By doing this you can find the links to the page you think is incorrect.
Hope this helps!
-
Hi Mark,
What I would recommend, is open this page in your browser:
http://www.wickman.net.au/wineauction/wine_auction_alert.aspx
Then view the source page and try to search for the url:
http://www.wickman.net.au/wineauction/wine_auction_alert.aspx/auction/auction/auction/auction/auction/auction/Default.aspx
Are you using a CMS or ecommerce platform for your site like wordpress or x-cart? If so it might be something wrong with the configuration which produces the pages and links automatically
-
Hi Igor,
If you have a look at the page the URL is not found on this page.
-
Hi Martijn,
I did take a look at the source code, did not find that link anywhere, however that page is live:
Not sure if its a dynamically created or a static one...
Mark:
take a look at your physical directories on your server using your favorite FTP software, see if you can follow the same navigation and see if you have these folders and file
-
Did launch a quick crawl on your site, these url were not found (but you have several broken links!). These strange URL usually come from bad robots, and may also come from the interpretation of javascript URL or redirects. GoogleBot gets really confused with them.
If you want to dig, check the servers logs.
-
Thank you. Thats exactly what I needed. Downloaded the csv and was able to find the referring URLS and track back to the offending page.. Searched my site and voila - silly me put a trailing slash after the file prefix in the sitemap.. must have really messed up the robot.
so I had:
<loc>http://www.wickman.net.au/wineauction/wine_auction_alert.aspx/</loc>
Ooops.. I think I like this SEOMoz place already

-
Welcome! We're happy to have you here, and glad to have helped you solve your problem.