Downloading error using nltk.download()

Problem description

I am experimenting with the NLTK package in Python. I tried to download the NLTK data using nltk.download() and got an error message. How can I solve this problem? Thanks.

The system is Ubuntu running under VMware, and the IDE is Spyder.

After calling nltk.download('all'), some packages download successfully, but an error occurs when downloading oanc_masc.

Solution

To download a particular dataset or model, pass its identifier to the nltk.download() function. For example, to download the punkt sentence tokenizer, use:

$ python3
>>> import nltk
>>> nltk.download('punkt')
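
If the download should only run when the resource is actually missing, a check along the following lines may help. This is a minimal sketch, not part of the original answer: the helper name `ensure_resource` and its `find`/`download` parameters are hypothetical and exist only so the logic can be tried without network access. By default it relies on `nltk.data.find`, which raises `LookupError` when a resource cannot be located, and on `nltk.download`.

```python
def ensure_resource(resource_path, package_id, find=None, download=None):
    """Download package_id only if resource_path cannot be located.

    find and download default to nltk.data.find and nltk.download.
    They are parameters (an assumption of this sketch) so that the
    logic can be exercised with stand-ins instead of real downloads.
    Returns True if the resource was already present or the download
    reported success.
    """
    if find is None or download is None:
        import nltk  # imported lazily; only needed for the defaults
        find = find or nltk.data.find
        download = download or nltk.download
    try:
        find(resource_path)
        return True  # already installed, nothing to download
    except LookupError:
        # nltk.data.find signals a missing resource with LookupError
        return bool(download(package_id))
```

With NLTK installed, `ensure_resource('tokenizers/punkt', 'punkt')` would look for the tokenizer first and fetch it only when it is absent.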

If you're unsure of which data/model you need, you can start out with the basic list of data + models with:

>>> import nltk
>>> nltk.download('popular')

It will download a list of "popular" resources.

Make sure you have the latest version of NLTK, because it is actively maintained and continually improved:

$ pip install --upgrade nltk


EDITED

In case anyone wants to avoid errors when downloading the larger datasets from nltk, here is a workaround from https://stackoverflow.com/a/38135306/610569

$ rm /Users/<your_username>/nltk_data/corpora/panlex_lite.zip
$ rm -r /Users/<your_username>/nltk_data/corpora/panlex_lite
$ python

>>> import nltk
>>> dler = nltk.downloader.Downloader()
>>> dler._update_index()
>>> dler._status_cache['panlex_lite'] = 'installed' # Trick the index into treating panlex_lite as if it's already installed.
>>> dler.download('popular')
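
The workaround above touches underscore-prefixed attributes of the downloader, which may change between NLTK versions. As a possible alternative, the packages can be requested one at a time so that a single failing package (such as oanc_masc in the question) does not stop the rest. The following is only a sketch under assumptions: the helper name `download_individually`, its `skip` argument, and the injectable `download` parameter are hypothetical, and the list of package ids is up to the caller.

```python
def download_individually(package_ids, skip=(), download=None):
    """Download each package on its own and return the ids that failed.

    download defaults to nltk.download, which returns a boolean
    indicating success. Passing another callable (an assumption of
    this sketch) makes the loop testable without network access.
    """
    if download is None:
        import nltk  # imported lazily; only needed for the default
        download = nltk.download
    failed = []
    for package_id in package_ids:
        if package_id in skip:
            continue  # deliberately left out, e.g. a known-bad package
        try:
            ok = download(package_id)
        except Exception:
            # Treat any error (for example a network failure) as a
            # failed download and carry on with the next package.
            ok = False
        if not ok:
            failed.append(package_id)
    return failed
```

For example, `download_individually(['punkt', 'stopwords', 'wordnet'], skip={'panlex_lite'})` would attempt the three packages and return whichever ones could not be fetched, so they can be retried later.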

To find the nltk_data directory, see https://stackoverflow.com/a/36383314/610569

To configure the nltk_data path, see https://stackoverflow.com/a/22987374/610569
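
As a rough illustration of configuring the path, NLTK reads the `NLTK_DATA` environment variable when it is imported and adds the listed directories to its search path, so the variable has to be set before `import nltk`. The sketch below is an assumption-laden example rather than the method from the linked answer: the helper name `add_nltk_data_dir` and the `environ` parameter are hypothetical.

```python
import os


def add_nltk_data_dir(directory, environ=None):
    """Put directory at the front of the NLTK_DATA search list.

    environ defaults to os.environ; a plain dict can be passed
    instead (an assumption of this sketch) to try the logic safely.
    Returns the resulting NLTK_DATA value.
    """
    environ = os.environ if environ is None else environ
    directory = os.path.abspath(os.path.expanduser(directory))
    # NLTK_DATA may hold several directories joined by os.pathsep
    existing = [p for p in environ.get("NLTK_DATA", "").split(os.pathsep) if p]
    if directory not in existing:
        existing.insert(0, directory)
    environ["NLTK_DATA"] = os.pathsep.join(existing)
    return environ["NLTK_DATA"]
```

Calling `add_nltk_data_dir('~/my_nltk_data')` before importing nltk would make that directory part of the search path. In a session where nltk is already imported, appending the directory to the list `nltk.data.path` should have a similar effect.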
