Ann: Chatter - a simple library for language processing

Greg Matheson drbean at freeshell.org
Fri Nov 22 03:12:05 GMT 2013


On Mon, 18 Nov 2013, Rogan Creswick wrote:

> I've been working on a simple NLP library over the past month or two, and I
> think it may finally be useful to others.  I would love to hear comments,
> criticisms, contributions, etc... ;)

I am on Fedora 19, running haskell-platform-2012. To get it to build,
I had to specify versions for containers, text and bytestring.
GHC.Generics was also in the hidden package ghc-prim.

I submitted a pull request.

> My main objective was to make it extremely easy to do basic NLP tasks in
> Haskell, such as POS tagging and document similarity. (and later, Chunking,
> NER, co-ref resolution, etc...).

> The best example of this is Part-of-speech tagging with Chatter:

> {{{
> cabal install chatter
> ghci
> > :m +NLP.POS
> > t <- defaultTagger
> > tagStr t "This is a test."
> "This/dt is/bez a/at test/nn ./."
> }}}

> Chatter provides POS tagging (with backoff taggers, and a ~83% accurate
> trained default tagger), TF-IDF measures, and cosine document similarity.
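
For anyone on the list who has not met these measures, here is a rough
sketch of TF-IDF weighting and cosine similarity. It uses only base and
containers. The names tokenize, termFreq, idf, tfIdf and cosine are made up
for illustration and are an assumption on my part, not Chatter's API, and
the tokenizer is deliberately naive.

```haskell
import qualified Data.Map.Strict as M
import Data.Char (isAlpha, toLower)

-- | A document is a list of lower-cased word tokens in this sketch.
type Doc = [String]

-- | Naive tokenizer (hypothetical, not Chatter's): non-letters become
-- spaces, letters are lower-cased, then the string is split on whitespace.
tokenize :: String -> Doc
tokenize = words . map (\c -> if isAlpha c then toLower c else ' ')

-- | Raw term frequency: how many times each term occurs in one document.
termFreq :: Doc -> M.Map String Double
termFreq = M.fromListWith (+) . map (\t -> (t, 1))

-- | Inverse document frequency: log (N / df), where N is the corpus size
-- and df is the number of documents containing the term. Terms that occur
-- in no document get weight 0.
idf :: [Doc] -> String -> Double
idf corpus t
  | df == 0   = 0
  | otherwise = log (fromIntegral (length corpus) / fromIntegral df)
  where
    df = length (filter (t `elem`) corpus)

-- | TF-IDF vector of a document relative to a corpus.
tfIdf :: [Doc] -> Doc -> M.Map String Double
tfIdf corpus doc = M.mapWithKey (\t tf -> tf * idf corpus t) (termFreq doc)

-- | Cosine similarity of two sparse vectors; 0 if either has zero length.
cosine :: M.Map String Double -> M.Map String Double -> Double
cosine a b
  | na == 0 || nb == 0 = 0
  | otherwise          = dot / (na * nb)
  where
    dot    = sum (M.elems (M.intersectionWith (*) a b))
    norm v = sqrt (sum [x * x | x <- M.elems v])
    na     = norm a
    nb     = norm b
```

Loaded into ghci, one would tokenize each document, build the vectors with
tfIdf against the whole corpus, and compare pairs with cosine. Documents
that share no terms score 0, and a document compared with itself scores 1
up to rounding.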

> It also currently contains an adapted version of the Tokenize library,
> because I wanted to tokenize Text.  That's a short-term solution; I haven't
> had time to make a patch to the tokenize lib.

> Links:
>  - Hackage: http://hackage.haskell.org/package/chatter-0.0.0.2
>  - Github: http://github.com/creswick/chatter

> --Rogan




