The Moz Q&A Forum

    • Forum
    • Questions
    • My Q&A
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. SEO and Digital Marketing Q&A Forum
    2. Categories
    3. Intermediate & Advanced SEO
    4. Steps you can take to ensure your content is indexed and registered to your site before a scraper gets to it?

    Steps you can take to ensure your content is indexed and registered to your site before a scraper gets to it?

    Intermediate & Advanced SEO
    6 3 862
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with topic management privileges can see it.
    • Qasim_IMG
      Qasim_IMG last edited by

      Hi,

      A clients site has significant amounts of original content that has blatantly been copied and pasted in various other competitor and article sites.

      I'm working with the client to rejig lots of this content and to publish new content.

      What steps would you recommend to undertake when the new, updated site is launched to ensure Google clearly attributes the content to the clients site first?

      One thing I will be doing is submitting a new xml + html sitemap.

      Thankyou

      1 Reply Last reply Reply Quote 0
      • AlanBleiweiss
        AlanBleiweiss last edited by

        Always have a sitemap.xml file with all the URLs you want indexed included in it.  Right after publishing, submit the sitemap.xml file (or files if there are tens of thousands of pages) through Google Webmaster Tools and Bing Webmaster Tools.  Include the Meta "original-source" tag in your page headers.

        Include a Copyright line at the bottom of each page with the site or company name, and have that link to the home page.

        This does not guarantee with 100% certainty that you'll get proper attribution, however these are the best steps you can take in that regard.

        EGOL 1 Reply Last reply Reply Quote 0
        • EGOL
          EGOL @AlanBleiweiss last edited by

          Thanks Alan...  I am surprised to learn about this "original source" information.   There must not have been a lot of talk about it when it was released or I would have seen it.

          Google recently started encouraging people to use the rel="author" attribute.  I am going to use that on my site... now I am wondering if I should be using "original source" too.

          Are you recommending rel="author"?

          Also, reading that full post there is a section added at the end recommending rel="canonical"

          AlanBleiweiss Qasim_IMG 3 Replies Last reply Reply Quote 0
          • AlanBleiweiss
            AlanBleiweiss @EGOL last edited by

            Google continually tries to find new ways to encourage solutions for helping them understand intent, relevance, ownership and authority. It's why Schema.org finally hit this year.  None of their previous attempts have been good enough, and each has served a specific individual purpose.

            So with Schema, the theory is there's a new, unified framework that can grow and evolve, without having to come up with individual solutions.

            The "original source" concept was supposed to address the scraper issue, and there's been some value in that, though it's far from perfect. A good scraper script can find it, strip it out or replace the contents.

            rel="author" is yet one more thing that can be used in the overall mix, though Schema.org takes authorship and publisher identity to a whole new, complex, and so far confused level :-).

            Since Schema.org is most likely not going to be widely adopted til at least early next year, Google's encouraging use of the rel="author" tag as the primary method for assigning authorship at this point, and will continue to support it even as Schema rolls out.

            So if you're looking at a best practices solution, yes, rel="author" is advisable.  Until it's not.  🙂

            1 Reply Last reply Reply Quote 0
            • Qasim_IMG
              Qasim_IMG @EGOL last edited by

              Thanks Alan.

              Guess there's no magic trick that will give you 100% attribution.

              Regarding this tag, do you recommend I add this to EVERY page of the clients website including the homepage? So even the usual about us/contact etc pages?

              Cheers

              Hash

              1 Reply Last reply Reply Quote 0
              • AlanBleiweiss
                AlanBleiweiss @EGOL last edited by

                There are no "best practices" established for the tags' usage at this point.  On the one hand, it could technically be used for every page, and on the other, should only be used when it's an article, blog post, or other individual person's writing.

                1 Reply Last reply Reply Quote 0
                • 1 / 1
                • First post
                  Last post
                • If I put a piece of content on an external site can I syndicate to my site later using a rel=canonical link?
                  EGOL
                  EGOL
                  1
                  6
                  221

                • My site shows 503 error to Google bot, but can see the site fine. Not indexing in Google. Help
                  Everett
                  Everett
                  0
                  3
                  733

                • How can I get Bing to index my subdomain correctly?
                  cos2030
                  cos2030
                  0
                  5
                  764

                • How can I get a list of every url of a site in Google's index?
                  KaneJamison
                  KaneJamison
                  0
                  8
                  1.1k

                • If a website trades internationally and simply translates its online content from English to French, German, etc how can we ensure no duplicate content penalisations and still maintain SEO performance in each territory?
                  Martijn_Scheijbeler
                  Martijn_Scheijbeler
                  0
                  2
                  46

                • Can a website be punished by panda if content scrapers have duplicated content?
                  RG_SEO
                  RG_SEO
                  0
                  5
                  173

                • Large site rel=can or no-index?
                  XNUMERIK
                  XNUMERIK
                  0
                  2
                  190

                • How can we get a site reconsidered for Google indexing?
                  d25kart
                  d25kart
                  0
                  3
                  301

                Get started with Moz Pro!

                Unlock the power of advanced SEO tools and data-driven insights.

                Start my free trial
                Products
                • Moz Pro
                • Moz Local
                • Moz API
                • Moz Data
                • STAT
                • Product Updates
                Moz Solutions
                • SMB Solutions
                • Agency Solutions
                • Enterprise Solutions
                • Digital Marketers
                Free SEO Tools
                • Domain Authority Checker
                • Link Explorer
                • Keyword Explorer
                • Competitive Research
                • Brand Authority Checker
                • Local Citation Checker
                • MozBar Extension
                • MozCast
                Resources
                • Blog
                • SEO Learning Center
                • Help Hub
                • Beginner's Guide to SEO
                • How-to Guides
                • Moz Academy
                • API Docs
                About Moz
                • About
                • Team
                • Careers
                • Contact
                Why Moz
                • Case Studies
                • Testimonials
                Get Involved
                • Become an Affiliate
                • MozCon
                • Webinars
                • Practical Marketer Series
                • MozPod
                Connect with us

                Contact the Help team

                Join our newsletter
                Moz logo
                © 2021 - 2026 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                • Accessibility
                • Terms of Use
                • Privacy