特定の単語のシソーラスと分類法を取得するための優れたPythonライブラリをお勧めできますか?
シノニム:
>>> print get_synonym('image')
['picture', 'photo']
分類法:
>>> print get_taxonomy('baseball')
['sports']
そのためにスーパーユーザー権限が必要になる場合があります。
$ pip install nltk
>>> import nltk
>>> from nltk.corpus import wordnet as wn
>>> wn.synsets('dog')
[Synset('dog.n.01'), Synset('frump.n.01'), Synset('dog.n.03'), Synset('cad.n.01'), Synset('frank.n.02'), Synset('pawl.n.01'), Synset('andiron.n.01'), Synset('chase.v.01')]
>>> wn.synset('dog.n.01').definition()
u'a member of the genus Canis (probably descended from the common wolf) that has been domesticated by man since prehistoric times; occurs in many breeds'
>>> wn.synset('dog.n.03').definition()
u'informal term for a man'
>>> baseball = wn.synset('baseball.n.01')
>>> sport = wn.synset('sport.n.01')
>>> picture = wn.synset('picture.n.01')
>>> sport.path_similarity(baseball)
0.16666666666666666
>>> sport.path_similarity(picture)
0.06666666666666667