Wordnet(带词义注释)语料库 [英] Wordnet (Word Sense Annotated) Corpus

查看:149
本文介绍了Wordnet(带词义注释)语料库的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我一直在利用许多不同的语料库进行自然语言处理,而且我一直在寻找一个用Wordnet Word Senses注释的语料库.

I've been utilizing lots of different corpora for natural language processing, and I've been looking for a corpus that has been annotated with Wordnet Word Senses.

我知道可能没有很大的语料库,因为该语料库需要手动构建,但是必须要解决一些问题.

I understand that there probably is not a big corpus with this information, since the corpus needs to be built up manually, but there has to be something to go off of.

如果还没有语料库,那么至少存在一个带有感官注释的ngram数据库(每个单词的每个定义的时间百分比是多少,或者每个单词网定义的数字计数取决于通用程度)这个词的意思是)?

Also if there isn't a corpus in existence, is there at least a sense annotated ngram database (with what percentage of the time a word is each of its definitions, or a numerical count of each wordnet definition depending on how common the word sense is)?

推荐答案

为WordNet注释的三个著名语料库:

Three prominent corpora annotated for WordNet:

  • MASC
  • WordNet gloss
  • SemCor

这篇关于Wordnet(带词义注释)语料库的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆