在Python中将NLTK语料库与AWS Lambda函数一起使用 [英] Using NLTK corpora with AWS Lambda functions in Python

查看：74 发布时间：2020/5/18 1:18:13 python nltk aws-lambda

本文介绍了在Python中将NLTK语料库与AWS Lambda函数一起使用的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

在AWS Lambda中使用NLTK语料库(特别是停用词)时遇到了困难.我知道需要下载语料库，并已使用NLTK.download('stopwords')进行了下载，并将其包含在zip文件中，该文件用于上传lambda模块到nltk_data/corpora/stopwords中.

I'm encountering a difficulty when using NLTK corpora (in particular stop words) in AWS Lambda. I'm aware that the corpora need to be downloaded and have done so with NLTK.download('stopwords') and included them in the zip file used to upload the lambda modules in nltk_data/corpora/stopwords.

代码中的用法如下:

from nltk.corpus import stopwords
stopwords = stopwords.words('english')
nltk.data.path.append("/nltk_data")

这将从Lambda日志输出中返回以下错误

This returns the following error from the Lambda log output

module initialization error: 
**********************************************************************
  Resource u'corpora/stopwords' not found.  Please use the NLTK
  Downloader to obtain the resource:  >>> nltk.download()
  Searched in:
    - '/home/sbx_user1062/nltk_data'
    - '/usr/share/nltk_data'
    - '/usr/local/share/nltk_data'
    - '/usr/lib/nltk_data'
    - '/usr/local/lib/nltk_data'
    - '/nltk_data'
**********************************************************************

我还试图通过包含直接加载数据

I have also tried to load the data directly by including

nltk.data.load("/nltk_data/corpora/stopwords/english")

下面会产生不同的错误

module initialization error: Could not determine format for file:///stopwords/english based on its file
extension; use the "format" argument to specify the format explicitly.

从Lambda zip加载数据可能有问题，需要将其存储在外部.例如在S3上说，但这似乎有些奇怪.

It's possible that it has a problem loading the data from the Lambda zip and needs it stored externally.. say on S3, but that seems a bit strange.

任何想法

有人知道我要去哪里错吗?

Does anyone know where I could be going wrong?

在Python中将NLTK语料库与AWS Lambda函数一起使用 [英] Using NLTK corpora with AWS Lambda functions in Python

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

在Python中将NLTK语料库与AWS Lambda函数一起使用 [英] Using NLTK corpora with AWS Lambda functions in Python

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭