Stemming & Hello!

Mark Wotton mwotton at gmail.com
Fri Dec 28 11:56:30 GMT 2012


On 28/12/2012, at 6:31 PM, "Daniël de Kok" <me at danieldk.eu> wrote:

> On Dec 28, 2012, at 4:59 AM, Mark Wotton <mwotton at gmail.com> wrote:
>> oh, also: I've been playing around with a suggester using levenshtein
>> distance to many possible target strings. This isn't the fastest thing
>> in the world: is there a better algorithm in the literature?
>
> If you are trying to find words at a small edit-distance, you might want to look at Burkhard-Keller trees:
>
> http://blog.notdot.net/2007/4/Damn-Cool-Algorithms-Part-1-BK-Trees
> http://www.kafsemo.org/2010/08/03_bk-tree-performance-notes.html
>
> http://hackage.haskell.org/package/bk-tree

I knew I could rely on the learned ladies and gentlemen of Haskell.
Thank you sir, that will do admirably.

Cheers
Mark



More information about the NLP mailing list