The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Moz Tools
    4. Tons of Crappy links in new OSE (Open Site Explorer)

    Tons of Crappy links in new OSE (Open Site Explorer)

    Moz Tools
    10 3 1.3k
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • znotes
      znotes last edited by

      I am starting to miss the old OSE. I've found that for a lot of the pages on our site, the new OSE is showing WAY more links and most of them are garbage nonsense links from China, Russia, and the rest of the internet Wild West.

      For instance, in the old OSE, this page used to show 9 linking domains:

      http://www.uncommongoods.com/gifts/by-recipient/gifts-for-him

      It now shows 454 links. Some of the new links (about 5 of them) are legitimate. The other 400+ are garbage. Some are porn sites, most of them don't even open a web page, they just initiate some shady download. I've seen this for other sites as well (like Urban Outfitters) This is making it much harder for me to do backlink analysis on bc I have no clue how many "Normal" links they have. Is anyone else having this problem ? Any way to filter all this crap out ? See attached screenshot of the list of links I'm getting from OSE.

      NHXnn

      1 Reply Last reply Reply Quote 1
      • sferrino
        sferrino last edited by

        Hello Zack,

        That is an issue that they are working on, I know this because I already discussed this with one of their help desk people. Here is the page that describes the changes: http://www.seomoz.org/blog/brand-new-open-site-explorer-is-here

        In addition to that, here is some additional information I can share with you:

        you may see “questionable” links with weird file extensions. This is due to the crawler reaching much deeper into sites where there are more download links. We are looking into fixing this bug as soon as we can so these won’t be counted as links.

        znotes 1 Reply Last reply Reply Quote 1
        • znotes
          znotes @sferrino last edited by

          OK cool good info, hope they fix it soon!! Any good ideas on how you can filter this crap[ out ?

          sferrino 1 Reply Last reply Reply Quote 0
          • sferrino
            sferrino @znotes last edited by

            2 ways:

            1. Get as CSV and spend the time going through it
            2. Wait it out
            1 Reply Last reply Reply Quote 0
            • carinoverturf
              carinoverturf last edited by

              Hey Zack, I saw the ticket you filed was answered by Aaron, but I just wanted to follow up with you as well. We have made some really exciting changes to the crawler, but, unfortunately, there is a pretty obvious bug as well...

              The reason for the “questionable” links coming from the Internet Wild West is due to the crawler reaching much deeper into sites where there are more download (i.e. binary) links. The first issue is the crawler is counting a binary file as a link, but the larger issue, is that the crawler doesn’t really know how to handle these types of files. This bug is causing some links to be improperly associated with certain domains. This is probably what you're seeing with all the crazy links from China and Russia which don't actually link to the site you're researching.

              There are two steps to addressing this issue: changing how the crawler sees these file types and then fixing how the crawler handles these file types. We have made improvements to our algorithm so that we will be handle the majority of these files correctly, however, this update will need about a month to propagate. The fix for this issue probably won’t be seen for two more updates, meaning late September. Our improvements should catch most of the issues, but there still could be a few cases we haven't addressed. If this happens, don't hesitate to let us know; we love feedback since it helps us improve and make our index even better!

              The next step is to fix how our crawlers handle binary file links and prevent them from being improperly associated with certain domains. We are in the process of working through that issue right now. We’re doing everything we can to resolve this bug as we know it is alarming to see these “questionable” links associated with your sites.I hope this helps and thanks so much for being patient :)Thanks,Carin

              znotes 1 Reply Last reply Reply Quote 3
              • znotes
                znotes @carinoverturf last edited by

                Hey Carin-

                Thank you so much for this in-depth response. Glad to hear that you guys are aware of it and trying to sort it out.  Very interesting info...I'd never hear of "binary" links before but I hope you guys can figure out how to handle these. Seems like a tough task to tackle, just by looking at my CSV it looks like these come in several different forms and they could be hard to identify..I have a few questions:

                1. Is there by chance a URL you could give me that points to the old OSE ?

                2. How often does OSE crawl? Is it a constant process or are there scheduled crawls?

                Thanks!!

                -Zack

                carinoverturf znotes 4 Replies Last reply Reply Quote 0
                • carinoverturf
                  carinoverturf @znotes last edited by

                  Hey Zack,

                  Thanks so much for understanding! We are doing everything we can to get the bug resolved. Binary files are the downloadable files you see as links - .pdf, .exe, .img, etc.

                  I'm really sorry, but we don't have a URL to the old OSE. I saw Steven's response as a workaround - is that possible or are there too many file types to filter out?

                  Our crawlers that provide the metrics to OSE are always crawling, but will take about a month for our fix to propagate through to all the pages we crawl. Once we have removed these links from our crawlers, then we'll have to process the metrics. This is why it's looking like late September for the fix to show up.

                  I really appreciate your patience and understanding, we're doing everything we can to fix it!!

                  Thanks,

                  Carin

                  1 Reply Last reply Reply Quote 0
                  • znotes
                    znotes @znotes last edited by

                    Hey Carin,

                    I just wanted to follow up on this...I'm still seeing these spammy binary files show up as links. Unfortunately it makes OSE quite useless for me in regards to exploring our own backlinks.

                    What is the status of this problem? Has there been any headway ? Why does our site have problems but most others don't?

                    Thanks!

                    -Zack

                    1 Reply Last reply Reply Quote 0
                    • carinoverturf
                      carinoverturf @znotes last edited by

                      Hey Zack,

                      Sorry to hear you're still having problems - we've seen an improvement on most sites at this point. Would you want to send me info on the site you're searching and any filters you are using?

                      If you don't feel comfortable posting that info on this thread, feel free to email me directly: carin@seomoz.org.

                      Thanks!

                      Carin

                      1 Reply Last reply Reply Quote 0
                      • znotes
                        znotes @znotes last edited by

                        Ok thank you. I will email directly.

                        1 Reply Last reply Reply Quote 0
                        • 1 / 1
                        • First post
                          Last post
                        • I am trying to find inbound links for one of my site urls. My question is does SEOMoz able to track all internal links as the Open Site Explorer shows 0 internal links?
                          Jeepster
                          Jeepster
                          0
                          4
                          492

                        • Open Site Explorer and link numbers
                          SamWeber
                          SamWeber
                          0
                          2
                          439

                        • I have a client with a bit over 100 inbound links but Open Site Explorer shows total links on Subdomain as over 66,000, how can this be?
                          KeriMorgret
                          KeriMorgret
                          0
                          4
                          563

                        • Links not appearing on Open Site Explorer
                          atticus7
                          atticus7
                          0
                          3
                          822

                        • No Local Directory links in Open Site Explorer?
                          EricaMcGillivray
                          EricaMcGillivray
                          1
                          4
                          860

                        • Links in Open Site Explorer turning into downloads
                          KeriMorgret
                          KeriMorgret
                          0
                          5
                          784

                        • BOTW links not recognized by Open Site Explorer
                          RyanKent
                          RyanKent
                          0
                          4
                          999

                        • Open Site Explorer Question- Link Value?
                          0
                          2
                          677

                        Get started with Moz Pro!

                        Unlock the power of advanced SEO tools and data-driven insights.

                        Start my free trial
                        Products
                        • Moz Pro
                        • Moz Local
                        • Moz API
                        • Moz Data
                        • STAT
                        • Product Updates
                        Moz Solutions
                        • SMB Solutions
                        • Agency Solutions
                        • Enterprise Solutions
                        • Digital Marketers
                        Free SEO Tools
                        • Domain Authority Checker
                        • Link Explorer
                        • Keyword Explorer
                        • Competitive Research
                        • Brand Authority Checker
                        • Local Citation Checker
                        • MozBar Extension
                        • MozCast
                        Resources
                        • Blog
                        • SEO Learning Center
                        • Help Hub
                        • Beginner's Guide to SEO
                        • How-to Guides
                        • Moz Academy
                        • API Docs
                        About Moz
                        • About
                        • Team
                        • Careers
                        • Contact
                        Why Moz
                        • Case Studies
                        • Testimonials
                        Get Involved
                        • Become an Affiliate
                        • MozCon
                        • Webinars
                        • Practical Marketer Series
                        • MozPod
                        Connect with us

                        Contact the Help team

                        Join our newsletter
                        Moz logo
                        © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                        • Accessibility
                        • Terms of Use
                        • Privacy