gensim如何计算doc2vec段落向量 [英] How does gensim calculate doc2vec paragraph vectors

查看：228 发布时间：2020/5/18 0:49:41 nlp vectorization gensim word2vec doc2vec

本文介绍了gensim如何计算doc2vec段落向量的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在阅读这篇论文 http://cs.stanford.edu/~quocle/paragraph_vector.pdf

并指出

段落向量和词向量被平均或级联预测上下文中的下一个单词.在实验中，我们使用串联作为合并向量的方法."

" Theparagraph vector and word vectors are averaged or concatenated to predict the next word in a context. In the experiments, we use concatenation as the method to combine the vectors."

串联或求平均如何工作?

How does concatenation or averaging work?

示例(如果第1段包含单词1和单词2):

example (if paragraph 1 contain word1 and word2):

word1 vector =[0.1,0.2,0.3]
word2 vector =[0.4,0.5,0.6]

concat method 
does paragraph vector = [0.1+0.4,0.2+0.5,0.3+0.6] ?

Average method 
does paragraph vector = [(0.1+0.4)/2,(0.2+0.5)/2,(0.3+0.6)/2] ?

也来自这张图片:

据说:

可以将段落标记视为另一个词.它充当记忆当前上下文中缺少的内容的内存–或本段的主题.因此，我们经常称这种模型段向量的分布式存储模型(PV-DM).

The paragraph token can be thought of as another word. It acts as a memory that remembers what is missing from the current context – or the topic of the paragraph. For this reason, we often call this model the Distributed Memory Model of Paragraph Vectors (PV-DM).

段落标记等于等于on的段落向量吗?

Is the paragraph token equal to the paragraph vector which is equal to on?

gensim如何计算doc2vec段落向量 [英] How does gensim calculate doc2vec paragraph vectors

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

gensim如何计算doc2vec段落向量 [英] How does gensim calculate doc2vec paragraph vectors

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭