I have read the FAQs and checked for similar issues: YES
My site's URL is: priacta.com/trog
We had a strange 404 error in our logs today. The user agent was Googlebot, and the IP checks out:
Type : 404
Page : www.priacta.com/downloads/download.com
Referred from : Unknown
Time : 20/05/2010 03:34:57
From IP : 18.104.22.168
User Agent : Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
Some of our pages contain the following (in a script tag), which I think is how Googlebot got confused:
If you're unhappy with Google crawling your site or pages, I'm sure you are aware of the methods for preventing the bot from crawling.
Then you won't have a problem.
Well, I admit this is a bit funny ... :D ... yet it doesn't really do any harm, and it isn't wrong on Google's side either. I mean, "/downloads/download.com" definitely is a URL (a relative address), and as long as it's there in the script for some reason, Google will try to crawl it. If you don't like your logs being polluted with this kind of entry, either filter the log or perhaps exclude any (pseudo-)address of this kind from crawling in your robots.txt file.
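For the robots.txt route, here is a rough sketch of how you could check a rule before deploying it. It uses Python's standard urllib.robotparser. The rule text, the is_allowed helper and the second test path are my own assumptions for illustration. Only "/downloads/download.com" comes from the log above.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content: blocks only the pseudo-address from the 404 log.
ROBOTS_TXT = """\
User-agent: *
Disallow: /downloads/download.com
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    base = "http://www.priacta.com"
    # The pseudo-address picked up from the script should be blocked.
    print(is_allowed(ROBOTS_TXT, "Googlebot", base + "/downloads/download.com"))
    # An example path elsewhere on the site (assumed) should stay crawlable.
    print(is_allowed(ROBOTS_TXT, "Googlebot", base + "/trog"))
```

Keep in mind that Python's parser does simple prefix matching, so this only approximates how Googlebot reads the file. A Disallow line is also a prefix rule, so it would block any real URL that starts with the same path.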