The Moz Q&A Forum

    Not getting foreign characters in crawl diagnostics .csv

    • trainSEM

      The crawl diagnostics .csv file shows garbled high-ASCII characters instead of the correct text for foreign-language websites, e.g. Vietnamese, Chinese (both kinds), etc. Is there a way to get this right?

      • LynnPatchett

        Hi Ash,

        I had this problem too and here is how I solved it (there might be better ways).

        If the characters are in the page titles, meta tags, etc., you can open the CSV file in OpenOffice, choose Save As and pick the XLS format, then open the resulting file in Excel, where the UTF-8 characters will display correctly. This method works well for titles and similar fields but does not decode foreign characters in the URLs themselves.
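As an alternative to the OpenOffice round trip (this is not from the thread, just a sketch): Excel generally detects UTF-8 when a CSV file starts with a byte-order mark (BOM). Assuming the export is plain UTF-8 without a BOM, the following copies it to a new file with one added. The function name `add_utf8_bom` and the file names in the usage note are made up for illustration.

```python
def add_utf8_bom(src_path, dst_path):
    """Copy a UTF-8 CSV to dst_path, prefixing it with a UTF-8 byte-order mark."""
    # "utf-8-sig" strips an existing BOM on read, so the output never gets two.
    # newline="" keeps the original line endings untouched.
    with open(src_path, "r", encoding="utf-8-sig", newline="") as src:
        text = src.read()
    # On write, "utf-8-sig" emits the BOM (EF BB BF) before the content.
    with open(dst_path, "w", encoding="utf-8-sig", newline="") as dst:
        dst.write(text)
```

For example, `add_utf8_bom("crawl_diagnostics.csv", "crawl_diagnostics_bom.csv")` would write a copy that can then be opened in Excel. Like the OpenOffice trick, this only affects how the text is displayed; percent-encoded URLs stay encoded.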

        If the characters are in the URL, the approach I found is to download this pretty awesome Excel add-in (the site is in German; I used Google Translate to figure out what was going on). It adds some new functions to Excel, so you can create a second column next to the URL column, apply the URL decode function to the first column, and get readable URLs in the second. This add-in saved me so much time and trouble! It works for Greek, which is what I need it for, and I assume it will work for Chinese as well. Let me know if you need more detailed instructions; it took a bit of trial and error to figure out the exact steps needed to get the results you want.
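The same "second column with decoded URLs" idea can be sketched outside Excel with Python's standard library. This is not the add-in described above, only a rough equivalent. It assumes the export is UTF-8 and has a header row; the column name `URL`, the new column name `Decoded URL`, and the function name are hypothetical and would need adjusting to the actual file.

```python
import csv
from urllib.parse import unquote

def add_decoded_url_column(src_path, dst_path, url_column="URL"):
    """Write a copy of the CSV with an extra column holding percent-decoded URLs."""
    with open(src_path, "r", encoding="utf-8-sig", newline="") as src, \
         open(dst_path, "w", encoding="utf-8-sig", newline="") as dst:
        reader = csv.DictReader(src)
        # Keep the original columns and append the new one at the end.
        fieldnames = list(reader.fieldnames or []) + ["Decoded URL"]
        writer = csv.DictWriter(dst, fieldnames=fieldnames, extrasaction="ignore")
        writer.writeheader()
        for row in reader:
            # unquote turns %XX escapes back into characters (UTF-8 by default).
            row["Decoded URL"] = unquote(row.get(url_column) or "")
            writer.writerow(row)
```

The output is written with a byte-order mark so that Excel is more likely to display the decoded characters correctly. `unquote` replaces byte sequences that are not valid UTF-8 with a placeholder character rather than raising an error, so URLs encoded in some other charset will come out partly unreadable.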

        Hope that helps!

          • trainSEM @LynnPatchett

          OpenOffice did the trick! Thank you. It would be nice if the Moz app could handle UTF-8 natively.

          • LynnPatchett

            Glad it helped! I think the issue might be with Excel more than Moz; its handling of UTF-8 CSVs has been terrible since day one! I think there is a way to use Excel's import data function to get the same result, but I never had much luck with it, and the OpenOffice trick seemed less painful.
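To illustrate why the spreadsheet rather than the export may be at fault, here is a small sketch of what happens when correct UTF-8 bytes are read with a single-byte legacy encoding. It assumes the file really is UTF-8 and uses Latin-1 as a stand-in for whatever code page the spreadsheet falls back to; both function names are invented for this example.

```python
def misread_as_legacy(text, legacy="latin-1"):
    """Simulate opening UTF-8 data as if it were a single-byte legacy encoding."""
    # Each UTF-8 byte is shown as its own character, producing "high-ASCII" junk.
    return text.encode("utf-8").decode(legacy)

def repair(garbled, legacy="latin-1"):
    """Undo the misreading by recovering the bytes and decoding them as UTF-8."""
    return garbled.encode(legacy).decode("utf-8")

garbled = misread_as_legacy("é")            # "Ã©": two junk characters for one letter
restored = repair(misread_as_legacy("Ελληνικά"))  # back to the original Greek text
```

The point is that the bytes in the file are fine in this scenario; the garbling only appears when they are interpreted with the wrong encoding, which is why opening the file in a tool that reads it as UTF-8 fixes the display.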
