在Python 2中，在Python 3中计算加载gensim Word2Vec [英] Load gensim Word2Vec computed in Python 2, in Python 3

查看：1472 发布时间：2017/8/16 23:20:46 python python-3.x encoding gensim word2vec

本文介绍了在Python 2中，在Python 3中计算加载gensim Word2Vec的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个在Python 2中计算出的gensim Word2Vec模型，如下所示：

I have a gensim Word2Vec model computed in Python 2 like that:

from gensim.models import Word2Vec
from gensim.models.word2vec import LineSentence

model = Word2Vec(LineSentence('enwiki.txt'), size=100, 
                 window=5, min_count=5, workers=15)
model.save('w2v.model')

但是，我需要使用它Python 3.如果我尝试加载它，

However, I need to use it in Python 3. If I try to load it,

import gensim
from gensim.models import Word2Vec
model = Word2Vec.load('w2v.model')

它会导致错误：

UnicodeDecodeError: 'ascii' codec can't decode byte 0xf9 in position 0: ordinal not in range(128)

我认为Python2和Python3之间的编码方式存在差异。另外看起来gensim正在使用pickle来保存/加载模型。

I suppose the problem is in differences in encoding between Python2 and Python3. Also it seems like gensim is using pickle to save/load models.

有没有办法设置编码/ pickle选项，以便模型加载正确？或者可以使用一些外部工具来转换模型文件？

Is there a way to set encoding/pickle options so that the model loads properly? Or maybe use some external tool to convert the model file?

在Python 3中重新计算它不是一个选择：它需要太多时间。

Recomputing it in Python 3 is not an option: it takes way too much time.

在Python 2中，在Python 3中计算加载gensim Word2Vec [英] Load gensim Word2Vec computed in Python 2, in Python 3

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

在Python 2中，在Python 3中计算加载gensim Word2Vec [英] Load gensim Word2Vec computed in Python 2, in Python 3

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭