|
When implementing search auto-complete, how do you handle stemming while the user is still typing a word? Background: I am building a search auto-complete feature for the OSQA software, which you are using now. As you type a question title, it searches question and answer bodies (as well as tags and titles) using the title typed so far. However, the search implementation stems the terms in the query. A word that is still being typed is incomplete and cannot be stemmed correctly. What are the common patterns for including this incomplete word in the search query? |
|
In my mind, it would be far easier to have two query parsers: one that does stemming, used when the actual search runs, and one that looks for exact matches, used for auto-completion. A second option would be to write your own query parser that stems only completed words (i.e. tokenize on spaces and skip the last word when stemming). I like this option better, though it would make the implementation a bit harder. |
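A rough sketch of the second option, in Python, might look like the following. It is only an illustration under stated assumptions: `naive_stem` is a toy stand-in for whatever stemmer the search backend really uses, `build_autocomplete_query` is a hypothetical helper name, and the trailing `*` assumes the backend accepts a Lucene-style prefix wildcard.

```python
def naive_stem(word: str) -> str:
    """Toy suffix stripper standing in for a real stemmer (assumption)."""
    for suffix in ("ing", "ed", "es", "s"):
        # Only strip when at least three characters of the word remain.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def build_autocomplete_query(text: str) -> str:
    """Stem completed words; send the word still being typed as a prefix."""
    tokens = text.lower().split()
    if not tokens:
        return ""
    # A trailing space is taken to mean the last word is finished.
    last_is_complete = text[-1].isspace()
    completed = tokens if last_is_complete else tokens[:-1]
    terms = [naive_stem(t) for t in completed]
    if not last_is_complete:
        # Hypothetical prefix syntax: the partial word is left unstemmed.
        terms.append(tokens[-1] + "*")
    return " ".join(terms)


if __name__ == "__main__":
    print(build_autocomplete_query("implementing stemm"))
    print(build_autocomplete_query("searching words "))
```

The point of the design is that the partial word never reaches the stemmer, so the stemmer cannot mangle it; whether the prefix term then matches stemmed index entries depends on how the backend indexes its fields.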
|
A gentleman by the name of Ahmet helped me with stemming issues and live search on the SOLR email list. Here's what he said:
|
|
Just related to Joseph's "aside": we have actually implemented a first version of our search autocomplete system and we will soon release it for website search: http://www.metahint.com |
Aside: auto-complete as a web service was recently proposed by László Kozma: http://www.lkozma.net/seven.html#l2