Spurious entries in report
-
duplicate content report shows many errors that are simply forms on every page for user to register for a newsletter and such. Is there a way to filter out these spurious errors from the reporting view? Or better, remove that from being reported in the first place? I wouldn't think calls to action like this are going to be penalized by search engines as duplicate content, or are they?
-
Which type of report is this? Many websites have contact us forms on every page, including major providers of SEO here in the UK. I highly doubt that a form repeated on every web page would be penalized as "duplicate content". I think the issues with duplicate content refer more to:
1. Stolen content, scraper sites etc. being penalized
2. Having multiple URLs for Google to access the same page in your website, and Google not knowing which page to credit (hence rel="canonical" usage).
HTH
-
We use a threshold of either 90% or 95% similar on a page to call it duplicate content. Do you have the same signup form with a lot of different URLs? Or are there a lot of pages that have just a little bit of unique content and much of it is a template with the same information?
-
if i open the links they are just email subscriber forms. So yes, its the same signup form with different URLs
When i open the links up in i see no content at all, just the sign up forms.
And they are all the same form.
I would not want to see that on a report, since its too distracting from real page duplicate content issues..
-
Is there any identifiable information here? First a quick check to see if there's a privacy issue -- if you click on these URLs is there any info in the URL giving away the email address, name, etc of people in the signup form?
Right now there's not a way to hide it on a report, but you might see if there's a way to just exclude that via a robots.txt just to save your crawl budget and make sure that it isn't a problem with the search engines.