Dec 5, 2008 116 reads by Navneet Kaushal

Google Blog Search used to index blog posts via their RSS feeds. That was a problem for blogs that published partial feeds, because Blog Search indexed only the partial portion that appeared in the feed. And the rest of the post? The links and text in that portion weren’t available through Blog Search!

Now the news spreading like wildfire in the web world is that Google Blog Search is using the full text of posts (in a lot of cases), rather than only the feed content!

The annoying part was that Google Blog Search depended solely on what was put out in a feed, rather than indexing the entire text of a blog post. But now, it seems that the problem has been solved…

Vanessa Fox’s recent “Google Blog Search Changes How It Indexes” article explains the change clearly.

Here is what Google posted in its Google Groups:

“We have changed the way we index blog posts to include the full content of the page. We’ve had occasional complaints about the use of the feed content, particularly the problem with partial feeds that you mentioned. The indexing change has improved the results for a lot of queries, both because we have the full content of the page and because we extract links that are missing from the feeds.” -Jeremy Hylton

If it works as promised, your full post should be indexed and searchable in Google Blog Search within seconds!

But there are certain drawbacks as well! Barry at SE Roundtable has also blogged about this issue. He explained that blogs often contain blogrolls and links to other blogs. With full-text indexing, those links are picked up along with every post, which could lead a lot of people to think that new blog posts were being written about them.

Google also posted this in its groups about the issue:

“The downside of this change is that we see more results that match only the blogroll and other parts of the page that are common to all of a blog’s posts.

We expected some problems from blogroll matches, but may have underestimated the impact on searches using the link: operator or where the query matches a blog or blogger’s name. We do expect to fix the problem you’re seeing. We’ll use the full page content, but exclude the content that isn’t really part of the post. I’m not sure if we’ll be able to make the change before the end of the year, but we are working on it and are pretty confident that it can be solved. We’ll post an update here when we’ve got a solution.”
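To picture what “exclude the content that isn’t really part of the post” could mean, here is a minimal sketch in Python. It is not Google's method, which the post does not describe. It simply assumes each page has already been split into text blocks, and it drops any block that shows up on every page of the same blog (a blogroll, for example). The function name `strip_shared_blocks` and the sample data are hypothetical.

```python
from collections import Counter


def strip_shared_blocks(pages):
    """Drop text blocks that appear on every page of one blog.

    pages: dict mapping a post identifier to the list of text blocks
    found on that post's page (hypothetical input format).
    """
    # Count on how many pages each block appears (at most once per page).
    counts = Counter()
    for blocks in pages.values():
        counts.update(set(blocks))

    total = len(pages)
    # With a single page there is nothing to compare against, so keep it all.
    if total < 2:
        return {post: list(blocks) for post, blocks in pages.items()}

    # Keep only blocks that are missing from at least one other page.
    return {
        post: [block for block in blocks if counts[block] < total]
        for post, blocks in pages.items()
    }


# Made-up sample data: two posts sharing a blogroll and an author box.
pages = {
    "post-1": ["Blogroll: Example Blog", "Full-text indexing arrives", "About the author"],
    "post-2": ["Blogroll: Example Blog", "A note on partial feeds", "About the author"],
}

for post, blocks in strip_shared_blocks(pages).items():
    print(post, blocks)
```

In this toy example only the post-specific text survives, so a query matching the blogroll entry would no longer match every post on the blog. A real system would need something more forgiving than exact matching across all pages, since sidebars change over time.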

So let us cross our fingers and wait to see what happens next!

Navneet Kaushal

Navneet Kaushal is the founder and CEO of PageTraffic, an SEO Agency in India with offices in Chicago, Mumbai and London. A leading search strategist, Navneet helps clients maintain an edge in search engines and the online media. Navneet's expertise has established PageTraffic as one of the most awarded and successful search marketing agencies.