Revision history
Revision n. 1

Mar 05 '11 at 14:17

Svetoslav Marinov

I'd advise you to read at least the following three books:

1) Speech and Language Processing by Daniel Jurafsky & James Martin (http://www.cs.colorado.edu/~martin/slp.html)
2) Foundations of Statistical Natural Language Processing by Manning & Schuetze (http://nlp.stanford.edu/fsnlp/)
3) Introduction to Information Retrieval by Manning, Raghavan & Schuetze (http://nlp.stanford.edu/IR-book/information-retrieval-book.html)

Your problem is not as trivial as it may seem to you. In addition, you have to take into account, at the very least, the idiosyncrasies of each language you'd like your search algorithm to work with.

Revision n. 2

Mar 06 '11 at 14:38

Svetoslav Marinov

I'd advise you to read at least the following three books:

1) Speech and Language Processing by Daniel Jurafsky & James Martin (http://www.cs.colorado.edu/~martin/slp.html)
2) Foundations of Statistical Natural Language Processing by Manning & Schuetze (http://nlp.stanford.edu/fsnlp/)
3) Introduction to Information Retrieval by Manning, Raghavan & Schuetze (http://nlp.stanford.edu/IR-book/information-retrieval-book.html)

Your problem is not as trivial as it may seem to you. In addition, you have to take into account, at the very least, the idiosyncrasies of each language you'd like your search algorithm to work with.

In Chapter 4 of the book "Programming Collective Intelligence" by Toby Segaran you will find a simple Python implementation of a search engine.
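
To give a rough idea of what the core of such an engine looks like, here is a minimal sketch of an inverted index with boolean AND search. This is not the code from Segaran's book; the function names (`tokenize`, `build_index`, `search`) and the sample documents are made up for illustration, and the tokenizer simply lowercases and splits on non-word characters, which is an English-oriented assumption that will not hold for many other languages.

```python
import re
from collections import defaultdict


def tokenize(text):
    # Naive tokenizer (an assumption for this sketch): lowercase the text
    # and split on runs of non-word characters. Fine for simple English,
    # inadequate for languages without whitespace-delimited words.
    return [t for t in re.split(r"\W+", text.lower()) if t]


def build_index(docs):
    # Inverted index: map each term to the set of document ids containing it.
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def search(index, query):
    # Boolean AND search: return ids of documents containing every query term.
    terms = tokenize(query)
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical sample documents, keyed by document id.
docs = {
    1: "Speech and language processing",
    2: "Statistical natural language processing",
    3: "Introduction to information retrieval",
}
index = build_index(docs)
print(sorted(search(index, "language processing")))  # prints [1, 2]
```

A real engine would add ranking (e.g. term weighting), stemming or lemmatization, and language-specific tokenization on top of this, which is where the books above come in.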

Btw, which language do you want to design your search algorithm for? You should be aware that languages exhibit many kinds of ambiguity, and what holds for English may not hold at all for Inuit, for example.

User submitted content is under Creative Commons: Attribution - Share Alike; Other things copyright (C) 2010, MetaOptimize LLC.