Wordnet(带词义注释)语料库 [英] Wordnet (Word Sense Annotated) Corpus
问题描述
我一直在利用许多不同的语料库进行自然语言处理,而且我一直在寻找一个用Wordnet Word Senses注释的语料库.
I've been utilizing lots of different corpora for natural language processing, and I've been looking for a corpus that has been annotated with Wordnet Word Senses.
我知道可能没有很大的语料库,因为该语料库需要手动构建,但是必须要解决一些问题.
I understand that there probably is not a big corpus with this information, since the corpus needs to be built up manually, but there has to be something to go off of.
如果还没有语料库,那么至少存在一个带有感官注释的ngram数据库(每个单词的每个定义的时间百分比是多少,或者每个单词网定义的数字计数取决于通用程度)这个词的意思是)?
Also if there isn't a corpus in existence, is there at least a sense annotated ngram database (with what percentage of the time a word is each of its definitions, or a numerical count of each wordnet definition depending on how common the word sense is)?
推荐答案
为WordNet注释的三个著名语料库:
Three prominent corpora annotated for WordNet:
- MASC
- WordNet gloss
- SemCor
这篇关于Wordnet(带词义注释)语料库的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!