Categories: Webmaster Tools :

404 Errors showing from incomplete URLs displayed on scraper sites (not active links)

Showing 1-17 of 17 messages
404 Errors showing from incomplete URLs displayed on scraper sites (not active links) cory_morepro 10/4/11 9:23 AM
I have read the FAQs and checked for similar issues: YES
My site's URL (web address) is: http://www.contractedge.com/
Description (including timeline of any changes made):
Within our Webmaster Tools Crawl Errors report, we've seen a large increase in the number of 404 (Not Found) pages that don't even really exist. When reviewing some of the pages that link to these supposed 404 pages, we see that the pages have partial URLs only. Most of the references are from scraper sites, but we've also seen them on Ask.com. Some of these pages are already gone/deleted, but others are still online and showing 404 errors as late as 10/3. The earliest date for these errors is 9/5/11.

Example:
http://www.contractedge.com/staffingagreement.h..
http://www.contractedge.com/networkinstallationagreemen..
http://www.contractedge.com/websitehost..
http://www.contractedge.com/websiteh...agreement.html

Note: These partial URLs are not actively linked on the source site, but Google is still crawling them and determining 404 (Not Found).

Here are some of the source sites:
http://web-hosting-info1.blogspot.com/2009/04/i-need-sample-contract-form-between.html  (http://www.contractedge.com URL displayed only; not active hyperlink)
http://webhosting.talkwhat.com/view/QgAsGlsGlEmNBxLaTk.html   (http://www.contractedge.com URL displayed only; not active hyperlink)

We believe these could potentially be problematic and cast a negative light on the site in general.

Why is Google including these? Are they impacting the overall rating of the website?

Please advise.

Thanks!
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) cory_morepro 10/11/11 9:09 AM
We are still seeing these errors reported in Webmaster Tools reports. Is anyone else seeing an influx of crawl errors resulting from these display-only (partial) links?
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) SheilaMary 10/13/11 3:05 AM
Hello -- yes, I've suddenly started getting them too - links with "...." either in the middle or the end. I've been removing them manually from crawler access but it's a bit of a nuisance. Anyone got any ideas why this is happening and what to do about it?
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) Steven Lockey 10/13/11 3:11 AM
Don't worry about it, its just Google letting you know that there are links to your site that don't work. It won't affect your rankings. Its just in case its a real link so you can put in a 301 redirect to make it go to the correct page or get the webmaster to correct it. For the scraper sites it doesn't matter too much, althrough its nice it let you know about them so you can block em ;)
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) SheilaMary 10/13/11 6:19 AM
Cool, thank you -- yes, the links do appear to be from scrapers. :-) So I won't worry too much. Though, when there are loads of them, it makes it harder to spot any "real" broken links.
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) Steven Lockey 10/13/11 6:56 AM
Block the scrapers if you can and file a DMCA against them with their hosting company (you can find that via a 'whois') and with any luck the scrapers should get removed.
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) SheilaMary 10/13/11 8:42 AM
OK, will do. :-) What's the best way of blocking the scrapers, though? (sorry for the basic question).
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) Steven Lockey 10/13/11 9:00 AM
That would depend on the hosting, if its apache or IIS hosted is the main factor.

Either way you'll need to look through the server logs for the website, find the IP or user-agent of the scrapper and (what I do to be evil) redirect it to something nasty (aka an image saying 'anyone viewing this is a dirty content thief and should be shot')

Works for me ;)
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) SheilaMary 10/14/11 12:46 AM
LOL, thank you :-)) Actually I reckon if a scraper keeps posting a load of duff links then people will stop visiting his site anyway.
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) Steven Lockey 10/14/11 1:31 AM
Generally they are MFA (made for adsense) sites, they only get hit once by people who happen to find them in the search results.

As panda is wiping out the low-grade content ones they are looking to avoid panda by stealing other people's higher quality content.
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) JohnMu 11/9/11 2:43 AM
It's possible that these URLs are coming from text on those pages -- I posted some more information about this at http://www.google.com/support/forum/p/Webmasters/thread?tid=78f2440992f9dcdf&hl=en

Cheers
John
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) cory_morepro 11/9/11 7:27 AM
Thanks for the direct response John. We figured that something like that was the reason. Hopefully the reporting system can be configured to hide URLs that are truncated and/or otherwise incomplete.

Regards.
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) Bill_H 4/20/12 6:09 PM
John:
Five months later and we are still seeing hundreds of incomplete links, all ending in ... being reported as crawl errors in our webmaster report. Is there any solution?

Thanks in advance,
Bill

Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) ITCF 5/28/12 8:19 AM
Glad to see this thread. I'm seeing a ton of these partial URLs (ending in "..") in GWT too. Just to clarify, is it the opinion of this forum that we can ignore these and they won't effect rankings? Thanks!
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) Bobby_Hunter 6/14/12 8:02 AM
I really wish Google would do something about all these truncated links in GWT. Sure they don't affect rankings but when there are 1000's of them showing up in our list of crawl errors, it makes it difficult to sift through them and find actual errors that we should be fixing/redirecting/etc. It's been 8 months since this issue was first brought up...
Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) g1smd 6/14/12 12:41 PM
Do NOT adopt these duff URLs in any way, and especially do not redirect those requests elsewhere.

I tick the "Fixed" box in WMT, and they are removed from the list. Some re-appear days or weeks later, and I simply remove them from the list again.


Re: 404 Errors showing from incomplete URLs displayed on scraper sites (not active links) ITCF 6/14/12 6:22 PM
Thanks g1.. I marked em "fixed".. lets see what happens. dG