Ann: Chatter - a simple library for language processing

Rogan Creswick creswick at gmail.com
Mon Nov 18 21:53:42 GMT 2013


I've been working on a simple NLP library over the past month or two, and I
think it may finally be useful to others.  I would love to hear comments,
criticisms, contributions, etc... ;)

My main objective was to make it extremely easy to do basic NLP tasks in
Haskell, such as POS tagging and document similarity. (and later, Chunking,
NER, co-ref resolution, etc...).

The best example of this is Part-of-speech tagging with Chatter:

{{{
cabal install chatter
ghci
> :m +NLP.POS
> t <- defaultTagger
> tagStr t "This is a test."
"This/dt is/bez a/at test/nn ./."
}}}

Chatter provides POS tagging (with backoff taggers, and a ~83% accurate
trained default tagger), TF-IDF measures, and cosine document similarity.

It also currently contains an adapted version of the Tokenize library,
because I wanted to tokenize Text.  That's a short-term solution; I haven't
had time to make a patch to the tokenize lib.

Links:
 - Hackage: http://hackage.haskell.org/package/chatter-0.0.0.2
 - Github: http://github.com/creswick/chatter

--Rogan
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://projects.haskell.org/pipermail/nlp/attachments/20131118/382f21a5/attachment.htm>


More information about the NLP mailing list