What is the connection or difference between lemma and synset in WordNet?


Question

I am a complete beginner to NLP and NLTK.

I was not able to understand the exact difference between lemmas and synsets in WordNet, because both produce nearly the same output. For example, for the word "cake" they produce this output:

lemmas :  [Lemma('cake.n.01.cake'), Lemma('patty.n.01.cake'), Lemma('cake.n.03.cake'), Lemma('coat.v.03.cake')]

synsets :  [Synset('cake.n.01'), Synset('patty.n.01'), Synset('cake.n.03'), Synset('coat.v.03')]

Please help me understand this concept.

Thanks.

Recommended answer

The terms are based on the general sense of the words "lemma" and "synonym".

A lemma is WordNet's version of an entry in a dictionary: a word in canonical form, with a single meaning. For example, if you wanted to look up "banks" in the dictionary, the canonical form would be "bank", and there would be separate lemmas for the nouns meaning "financial institution" and "side of the river", a separate one for the verb "to bank (on)", and so on.

The term synset stands for "set of synonyms". A set of synonyms is a set of words with similar meaning; e.g., ship, skiff, canoe, and kayak might all be synonyms for boat. In NLTK, a synset is in fact a set of lemmas with related meaning. Taking your example (with the results of wn.synsets("cake") and wn.lemmas("cake") stored in synsets and lemmas), we can also write:

>>> synsets[0]
Synset('cake.n.01')
>>> synsets[0].lemmas()
[Lemma('cake.n.01.cake'), Lemma('cake.n.01.bar')]

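These are the lemmas making up the first synset given for "cake". This is also why the two lists in the question look so similar: each Lemma in the first list is the word "cake" as a member of the corresponding Synset in the second list.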
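To make the relationship concrete, here is a minimal toy model in plain Python. It is only a sketch of the idea, not how NLTK or WordNet is implemented: the class names ToySynset and ToyLemma, the helper add_synset, and the short glosses are all made up for illustration. Only the synset and lemma names are taken from the output above, and 'coat.v.03.coat' is assumed from the naming convention.

```python
from dataclasses import dataclass, field


# eq=False keeps identity-based comparison, which avoids recursing through
# the lemma <-> synset back-references.
@dataclass(eq=False)
class ToySynset:
    """One meaning, shared by every lemma attached to it (hypothetical class)."""
    name: str                      # e.g. "cake.n.01"
    gloss: str                     # illustrative placeholder text
    lemmas: list = field(default_factory=list)


@dataclass(eq=False)
class ToyLemma:
    """One word form tied to exactly one synset (hypothetical class)."""
    word: str
    synset: ToySynset

    @property
    def name(self) -> str:
        # Mirrors the "synset.word" naming seen in the output above.
        return f"{self.synset.name}.{self.word}"


def add_synset(index, name, gloss, words):
    """Create a synset with one lemma per word and index the lemmas by word."""
    synset = ToySynset(name, gloss)
    for word in words:
        lemma = ToyLemma(word, synset)
        synset.lemmas.append(lemma)
        index.setdefault(word, []).append(lemma)
    return synset


index = {}  # word -> list of lemmas, one per sense of that word
add_synset(index, "cake.n.01", "a solid block of something", ["cake", "bar"])
add_synset(index, "coat.v.03", "to form a coating", ["coat", "cake"])

cake_lemmas = index["cake"]                              # like wn.lemmas("cake")
cake_synsets = [lemma.synset for lemma in cake_lemmas]   # like wn.synsets("cake")

print([lemma.name for lemma in cake_lemmas])
print([synset.name for synset in cake_synsets])
print([lemma.name for lemma in cake_synsets[0].lemmas])
```

Looking up a word gives one lemma per sense, and each lemma points at exactly one synset, so the two lists line up one to one. Going the other way, a synset lists all the words that share that sense, which is where "bar" comes from.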

WordNet lets you explore relationships such as hypernyms/hyponyms, usage domains, and more. For more information, look directly in the WordNet documentation; NLTK just provides an interface to it. See also the WordNet glossary.
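As a starting point for that exploration, the sketch below wraps a few NLTK calls in two small helpers. The helper names senses and neighbours are made up for this example. It assumes NLTK is installed and the WordNet corpus has been downloaded (for instance with nltk.download("wordnet")).

```python
from nltk.corpus import wordnet as wn  # needs nltk plus the "wordnet" corpus


def senses(word):
    """Return (lemma name, synset name, definition) for each sense of `word`."""
    return [
        (lemma.name(), lemma.synset().name(), lemma.synset().definition())
        for lemma in wn.lemmas(word)
    ]


def neighbours(synset_name):
    """Return the names of the direct hypernyms and hyponyms of one synset."""
    synset = wn.synset(synset_name)
    return {
        "hypernyms": [s.name() for s in synset.hypernyms()],
        "hyponyms": [s.name() for s in synset.hyponyms()],
    }
```

Calling senses("cake") should list each lemma next to the synset it belongs to, and neighbours("cake.n.03") should show the more general and more specific synsets around one of those senses.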
