- NLTK, and a list of suggested NLTK projects for further thought
- OpenNLP is an umbrella project for all kinds of NLP open-source projects
- ClearTK is a Java-based NLP library
- LingPipe ditto
- GATE ditto
- Xerox has a finite-state tool (xfst)
OK, so here's the idea, and it's always the same idea. For a given NLP problem, I'd model the data and the toolchain in Decl. You'd state the problem in Decl, then refine your solution progressively, keeping the Decl semantic structure of the problem intact at each step. Used this way, Decl is almost a note-taking or documentation tool: the actual program would be written in Python and/or Java and invoked by Decl. That code could also be embedded, of course, via Inline, but the point is that Decl needn't be seen as an exclusively Perl-based tool. It's also a literate programming tool that can use macros to build anything else.
Ah, well. That's probably not all too clear. I'm tired.
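To make it slightly more concrete, here's a rough Python sketch of the shape of the idea. It is not Decl syntax, and every name in it (`PIPELINE`, `IMPLEMENTATIONS`, `run`, the two toy steps) is made up for illustration. The problem is stated once as plain data, and the code behind each step can be refined or swapped without touching that structure.

```python
# Hypothetical sketch only: none of these names come from Decl.
from typing import Callable, Dict, List

# The declarative part: the problem stated as data.
# This structure is meant to stay intact while the solution is refined.
PIPELINE: List[Dict[str, str]] = [
    {"step": "tokenize", "input": "text", "output": "tokens"},
    {"step": "count", "input": "tokens", "output": "counts"},
]


def tokenize(text: str) -> List[str]:
    # First-pass implementation: lowercase and split on whitespace.
    # A later refinement could swap in a real tokenizer here.
    return text.lower().split()


def count(tokens: List[str]) -> Dict[str, int]:
    # Count how many times each token occurs.
    counts: Dict[str, int] = {}
    for tok in tokens:
        counts[tok] = counts.get(tok, 0) + 1
    return counts


# The imperative part: which code currently backs each declared step.
IMPLEMENTATIONS: Dict[str, Callable] = {"tokenize": tokenize, "count": count}


def run(pipeline: List[Dict[str, str]], data: dict) -> dict:
    # Walk the declared steps in order, feeding each one its named input
    # and storing the result under its named output.
    for stage in pipeline:
        func = IMPLEMENTATIONS[stage["step"]]
        data[stage["output"]] = func(data[stage["input"]])
    return data


result = run(PIPELINE, {"text": "the cat saw the dog"})
```

Refining the solution would mean replacing an entry in `IMPLEMENTATIONS` (say, with a call out to NLTK or a Java tool) while `PIPELINE` stays exactly as written.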
What prompted this flurry of NLP searching was this Yelp blog post about a data set they're releasing to researchers.