0

I've to write a program (in Java) to identify several parts of speech like nouns, adjectives, verbs etc. The program should also identify number (in numeric e.g. 10) and numbers written in plain English (ten, hundred etc) and much more. I'm not sure that what is the way forward. Is there any library available that can help? Can this be done only with regex? Or do I need to learn NLP?

Please suggest a way forward.

4

2 に答える 2

2

(1) OpenNLP

(2) リンパイプ

(3) スタンフォードNLP

上記の 3 つ (Java ベース) はすべて、すぐに使用できる POS の識別に役立ちます。

数値には正規表現を使用します。

于 2013-04-01T20:03:48.327 に答える