How are the TokenEmbeddings in BERT created?

Question

In the paper describing BERT, there is this paragraph about WordPiece Embeddings.

We use WordPiece embeddings (Wu et al., 2016) with a 30,000 token vocabulary. The first token of every sequence is always a special classification token ([CLS]). The final hidden state corresponding to this token is used as the aggregate sequence representation for classification tasks. Sentence pairs are packed together into a single sequence. We differentiate the sentences in two ways. First, we separate them with a special token ([SEP]). Second, we add a learned embedding to every token indicating whether it belongs to sentence A or sentence B. As shown in Figure 1, we denote the input embedding as E, the final hidden vector of the special [CLS] token as C ∈ R^H, and the final hidden vector for the i-th input token as T_i ∈ R^H. For a given token, its input representation is constructed by summing the corresponding token, segment, and position embeddings. A visualization of this construction can be seen in Figure 2.

As I understand it, WordPiece splits words into word pieces like #I #like #swim #ing, but it does not generate embeddings. However, I did not find anything in the paper or in other sources about how those token embeddings are generated. Are they pretrained before the actual pre-training? How? Or are they randomly initialized?

Answer

The word pieces are trained separately, such that the most frequent words remain whole while the less frequent words eventually get split down into characters.
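To make this concrete, here is a minimal sketch assuming the Hugging Face transformers package and its pretrained bert-base-uncased vocabulary (neither is part of the original answer). It shows a trained WordPiece vocabulary keeping frequent words whole and splitting a rarer word into '##'-prefixed pieces:

```python
# Minimal sketch, assuming the Hugging Face "transformers" package is installed.
# It loads the WordPiece vocabulary shipped with bert-base-uncased and tokenizes
# two sentences to contrast whole words with sub-word pieces.
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

print(tokenizer.tokenize("I like swimming"))     # frequent words usually stay whole
print(tokenizer.tokenize("I like snorkelling"))  # a rarer word is split into '##'-prefixed pieces
```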

The embeddings are trained jointly with the rest of BERT. Back-propagation runs through all the layers down to the embeddings, which get updated just like any other parameters in the network.
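As an illustration of that joint training, here is a minimal PyTorch sketch (an assumption for illustration, not the actual BERT implementation) in which the token, segment, and position embedding tables are ordinary, randomly initialized nn.Embedding layers that receive gradients through whatever is computed on top of them:

```python
# Minimal PyTorch sketch (not the real BERT code): the input embeddings are just
# learnable parameters, randomly initialized and updated by back-propagation.
import torch
import torch.nn as nn

vocab_size, max_len, hidden = 30000, 512, 768

token_emb = nn.Embedding(vocab_size, hidden)   # randomly initialized WordPiece embeddings
segment_emb = nn.Embedding(2, hidden)          # sentence A / sentence B embeddings
position_emb = nn.Embedding(max_len, hidden)   # learned position embeddings

input_ids = torch.tensor([[101, 2023, 2003, 102]])        # hypothetical token ids
segment_ids = torch.zeros_like(input_ids)                  # all sentence A here
positions = torch.arange(input_ids.size(1)).unsqueeze(0)

# Input representation = token + segment + position embeddings, as in the paper.
x = token_emb(input_ids) + segment_emb(segment_ids) + position_emb(positions)

# Any loss computed on top of x back-propagates into all three embedding tables.
loss = x.sum()
loss.backward()
print(token_emb.weight.grad is not None)  # True: embeddings are updated like any other parameter
```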

Note that only the embeddings of tokens that are actually present in the training batch get updated; the rest remain unchanged. This is also a reason why you need a relatively small word-piece vocabulary, so that all embeddings get updated frequently enough during training.
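This sparse-update behaviour is easy to verify with a toy example. The sketch below (hypothetical sizes and token ids, chosen only for illustration) checks which rows of the embedding table receive a non-zero gradient:

```python
# Small sketch: only the rows of the embedding table whose tokens appear in the
# batch receive gradient; all other rows are left unchanged by the update.
import torch
import torch.nn as nn

emb = nn.Embedding(10, 4)             # tiny vocabulary of 10 tokens
batch = torch.tensor([[1, 3, 3, 7]])  # only token ids 1, 3 and 7 appear

loss = emb(batch).sum()
loss.backward()

updated_rows = (emb.weight.grad.abs().sum(dim=1) != 0).nonzero(as_tuple=True)[0]
print(updated_rows.tolist())          # [1, 3, 7] -- the remaining rows have zero gradient
```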
