Google Product Forums

Re: Canonical or Google indexing issues


Robbo Jul 30, 2010 3:34 PM
Posted in group: Webmaster Central Help Forum

Categories: Crawling, indexing & ranking :



<meta http-equiv="Content-Type" content="application/xhtml+xml; charset=UTF-8" />

The above line is fine and correct, I believe, and certainly not the cause of the BOM issue.

If your page is being generated by a CMS (Content Management System), I am not sure how you would remove the BOM.

To remove the BOM if you are able to edit the HTML document at code level:

0.  Make a backup copy of the HTML page just in case things go wrong.

1.  Using a plain-text editor such as Dreamweaver or Notepad, make sure that you have "UTF-8" selected for character encoding and create a blank document.

2.  Highlight  all the page coding (the entire HTML document) and copy/paste to the newly created blank document.

3.  Save the new document using the correct filename (eg index.html or whatever) thus overwriting the previous HTML document of that same name.

4.  Upload the new document to your server as normal.

5.  View the page on the web and ensure that everything looks OK.

6.  Check that the BOM has gone.  Easiest way is validator.w3.org  - for the time being ignore any other error report it might give - just look for the BOM warning.

Actually, it would be a good idea to also do step 6. before you start on the  other steps so that you can see the before and after difference.