Problem installing NLTK data
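I am trying to install the NLTK data on Mac OS X 10.9. According to the NLTK 3.0 documentation, the download directory should be set to /usr/share/nltk_data for a central installation. With that path, however, I get this error: OSError: [Errno 13] Permission denied: '/usr/share/nltk_data'
Can I set the download directory to /Users/ananya/nltk_data for a central installation instead?
I have Python 2.7 installed on my machine.
Thanks,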
Ananya
1 Answer
7
Have you tried running the downloader with root privileges, so that it can write to /usr/share/nltk_data?
$ sudo python
>>> import nltk
>>> nltk.download()
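If you would rather not run Python as root, a per-user directory should also work. The sketch below is only an illustration, not an official NLTK recipe: pick_writable_dir and download_wordnet are hypothetical helper names I made up, and the candidate list is an assumption based on the paths in the question. It relies on nltk.download() accepting a download_dir argument and on nltk.data.path being the list of directories NLTK searches. As far as I know, ~/nltk_data (that is, /Users/ananya/nltk_data on your machine) is already on that list by default, so the append is just a precaution.

```python
import os


def pick_writable_dir(candidates):
    """Return the first candidate that is writable, or that could be
    created because its nearest existing ancestor is a writable directory.
    Returns None if no candidate qualifies. (Hypothetical helper.)"""
    for path in candidates:
        path = os.path.expanduser(path)
        probe = path
        # Walk up until we reach something that actually exists.
        while not os.path.exists(probe):
            parent = os.path.dirname(probe)
            if parent == probe:
                break
            probe = parent
        if os.path.isdir(probe) and os.access(probe, os.W_OK):
            return path
    return None


def download_wordnet(candidates=("/usr/share/nltk_data", "~/nltk_data")):
    """Download WordNet into the first usable directory (needs network)."""
    import nltk  # imported lazily so pick_writable_dir works without NLTK

    target = pick_writable_dir(candidates)
    if target is None:
        raise OSError("no writable NLTK data directory among %r" % (candidates,))
    nltk.download("wordnet", download_dir=target)
    # Make sure NLTK will look in the directory we just used.
    if target not in nltk.data.path:
        nltk.data.path.append(target)
    return target
```

Calling download_wordnet() without sudo should fall through to ~/nltk_data when /usr/share is not writable. Keep in mind that data stored there is visible only to your own account, so it is not a system-wide installation.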
To check that the download succeeded, try using one of the datasets you downloaded, for example:
>>> from nltk.corpus import wordnet
>>> wordnet.synsets('dog')
[Synset('dog.n.01'), Synset('frump.n.01'), Synset('dog.n.03'), Synset('cad.n.01'), Synset('frank.n.02'), Synset('pawl.n.01'), Synset('andiron.n.01'), Synset('chase.v.01')]
If the dataset was not installed correctly, you will see something like this:
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python2.7/dist-packages/nltk/corpus/util.py", line 68, in __getattr__
    self.__load()
  File "/usr/local/lib/python2.7/dist-packages/nltk/corpus/util.py", line 56, in __load
    except LookupError: raise e
LookupError:
**********************************************************************
  Resource 'corpora/wordnet' not found. Please use the NLTK
  Downloader to obtain the resource: >>> nltk.download()
  Searched in:
    - '/home/alvas/nltk_data'
    - '/usr/share/nltk_data'
    - '/usr/local/share/nltk_data'
    - '/usr/lib/nltk_data'
    - '/usr/local/lib/nltk_data'
**********************************************************************
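The "Searched in" list in that message shows the directories NLTK looked through. Here is a rough sketch for checking them yourself. locate_resource and report_wordnet are hypothetical helper names, and the directory check only approximates NLTK's real lookup (it looks for either an unpacked resource or a .zip with the same name). The calls nltk.data.find() and nltk.data.path are the parts that come from NLTK itself.

```python
import os


def locate_resource(resource, search_paths):
    """Return the first directory in search_paths that appears to hold
    `resource` (unpacked, or as a .zip archive), else None.
    This is an approximation of NLTK's lookup, not its actual code."""
    for base in search_paths:
        full = os.path.join(base, *resource.split("/"))
        if os.path.exists(full) or os.path.exists(full + ".zip"):
            return base
    return None


def report_wordnet():
    """Print where NLTK finds WordNet, or the paths it searched."""
    import nltk  # imported lazily so locate_resource works without NLTK

    try:
        print(nltk.data.find("corpora/wordnet"))
    except LookupError:
        print("not found; searched: %s" % ", ".join(nltk.data.path))
```

If report_wordnet() prints the list of searched paths, the directory you downloaded into is probably not among them. In that case, either download again into one of the listed directories or append your directory to nltk.data.path before importing the corpus.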