Friday, June 19, 2009

SharePoint Blog Posts and Searching

Yet again, this was something that I found out the hard way.  I created a simple, out-of-the-box site using the blog site template.  The site was created successfully, and my colleagues and I started posting.  After we had generated several blog entries, I started a full crawl to get the blog posts included in the Search indexes.

Once the crawl had completed, I searched for text from the contents of the blog posts.  Search had not indexed the contents of the posts.  It had crawled the titles of the posts, but each result linked to the page containing the list of posts.  Search would neither index the post bodies nor display links to the actual blog post items themselves.  It always returned me to the entire list.

This just didn’t make any sense at all.  How could SharePoint offer Search, but not be able to crawl or index any of its own content?

I thought that it might have something to do with the fact that blog post items are created as “enhanced rich text” instead of just “rich text” or “plain text”.

As a test, I went into the properties of the ‘body’ column of the post item and changed it from “enhanced rich text” to just “rich text”.  I initiated another full crawl (it’s a small MOSS site) and I got the same results (or rather, lack of results).  I also tried plain text.  No luck.  So, there is something inherently different about blog posts in SharePoint that makes them difficult for SharePoint Search to index.  I’m still investigating why this is.

As an alternative, I discovered the Community Kit for SharePoint (CKS), which has an Enhanced Blog Edition.  This SharePoint Solution Package is pretty straightforward to install.  The blog posts created when using it are searchable, and the search results link directly to the post item itself, not back to the list of posts.  Beyond that, it has many themes to choose from and offers a few more features than the out-of-the-box SharePoint Blog site.  You can find it on CodePlex.