Jan 4, 2011 113 reads by Navneet Kaushal

In a Google Webmaster Help thread, JohnMu from Google advised a webmaster that there should be no duplicate URLs or session ID based URLs listed in a Sitemap file. It is quite obvious a fact that in case if there is any duplicate content on your site which has not been blocked from Google, then Google can detect it based on crawling your site against seeing it in a Sitemap file.

Here is an excerpt from the discussion at the help forum,


“If you are not submitting clean URLs in your Sitemap file, you'd be better off not using a Sitemap file. With session-IDs in there, it'll cause more problems (with us crawling and indexing those URLs) than if you just let us crawl your website normally (especially if you really have a clean URL structure). So my advice would be to either delete the Sitemap file, or make sure that the submitted URLs are really exactly the same, clean ones that we find while crawling.”

Although this is not new to webmasters, it comes as a good reminder and for those who find it new should always remember it. Thanks to John for the reminder and also to Barry Schwartz from seroundtable.com who brought it out for more people to see.

The message is clear, send Google only clean URLs in your Sitemap file.

Navneet Kaushal

Navneet Kaushal

Navneet Kaushal is the founder and CEO of PageTraffic, an SEO Agency in India with offices in Chicago, Mumbai and London. A leading search strategist, Navneet helps clients maintain an edge in search engines and the online media. Navneet's expertise has established PageTraffic as one of the most awarded and successful search marketing agencies.
Navneet Kaushal
Navneet Kaushal
Most popular Posts
Upcoming Events
Events are coming soon, stay tuned!More