使用 nltk.download() 下载错误 [英] downloading error using nltk.download()
问题描述
我正在使用 Python 试验 NLTK 包.我尝试使用 nltk.download()
下载 NLTK.我收到了这种错误信息.如何解决这个问题呢?谢谢.
我使用的系统是安装在VMware下的Ubuntu.IDE 是 Spyder.
使用nltk.download('all')
后,可以下载一些包,但是下载oanc_masc
要下载特定的数据集/模型,请使用 nltk.download()
函数,例如如果您想下载 punkt
句子标记器,请使用:
$ python3>>>导入 nltk>>>nltk.download('朋克')
如果您不确定您需要哪种数据/模型,您可以从数据 + 模型的基本列表开始:
<预><代码>>>>导入 nltk>>>nltk.download('流行')它将下载流行"资源列表.
确保您拥有最新版本的 NLTK
,因为它一直在改进和不断维护:
$ pip install --upgrade nltk
<小时>
已编辑
如果有人避免从 nltk
下载更大的数据集时出错,来自 https://stackoverflow.com/a/38135306/610569
$ rm/Users//nltk_data/corpora/panlex_lite.zip$ rm -r/Users//nltk_data/corpora/panlex_lite$蟒蛇>>>导入 nltk>>>dler = nltk.downloader.Downloader()>>>dler._update_index()>>>dler._status_cache['panlex_lite'] = 'installed' # 欺骗索引以将 panlex_lite 视为已经安装.>>>dler.download('流行')
如果有人想找到 nltk_data
目录,请参阅 https://stackoverflow.com/a/36383314/610569
并配置 nltk_data
路径,请参阅 https://stackoverflow.com/a/22987374/610569
I am experimenting NLTK package using Python. I tried to downloaded NLTK using nltk.download()
. I got this kind of error message. How to solve this problem? Thanks.
The system I used is Ubuntu installed under VMware. The IDE is Spyder.
After using nltk.download('all')
, it can download some packages, but it gets error message when downloading oanc_masc
To download a particular dataset/models, use the nltk.download()
function, e.g. if you are looking to download the punkt
sentence tokenizer, use:
$ python3
>>> import nltk
>>> nltk.download('punkt')
If you're unsure of which data/model you need, you can start out with the basic list of data + models with:
>>> import nltk
>>> nltk.download('popular')
It will download a list of "popular" resources.
Ensure that you've the latest version of NLTK
because it's always improving and constantly maintain:
$ pip install --upgrade nltk
EDITED
In case anyone is avoiding errors from downloading larger datasets from nltk
, from https://stackoverflow.com/a/38135306/610569
$ rm /Users/<your_username>/nltk_data/corpora/panlex_lite.zip
$ rm -r /Users/<your_username>/nltk_data/corpora/panlex_lite
$ python
>>> import nltk
>>> dler = nltk.downloader.Downloader()
>>> dler._update_index()
>>> dler._status_cache['panlex_lite'] = 'installed' # Trick the index to treat panlex_lite as it's already installed.
>>> dler.download('popular')
And if anyone wants to find nltk_data
directory, see https://stackoverflow.com/a/36383314/610569
And to config nltk_data
path, see https://stackoverflow.com/a/22987374/610569
这篇关于使用 nltk.download() 下载错误的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!