是否存在帮助检测“主要话题"的算法?一个英文句子? [英] Does an algorithm exist to help detect the "primary topic" of an English sentence?

查看:19
本文介绍了是否存在帮助检测“主要话题"的算法?一个英文句子?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我想知道是否有一种已知的算法可以检测句子的关键概念".

I'm trying to find out if there is a known algorithm that can detect the "key concept" of a sentence.

用例如下:

  1. 用户输入一个句子作为查询(鸡肉尝起来像火鸡吗?)
  2. 我们的系统识别句子的概念(鸡、火鸡)
  3. 它会搜索我们的语料库内容

我们缺乏的领域是确定句子的核心主题"的真正含义.句子Does chicken taste like turkey"的主要主题是chicken",因为用户询问的是鸡肉的味道.而火鸡"是一个不太重要的辅助话题.

The area that we're lacking in is identifying what the core "topic" of the sentence is really about. The sentence "Does chicken taste like turkey" has a primary topic of "chicken", because the user is asking about the taste of chicken. While "turkey" is a helper topic of less importance.

所以...我正在尝试找出是否有一种算法可以帮助我确定句子的主要主题...如果您知道任何内容,请告诉我!!!

So... I'm trying to find out if there is an algorithm that will help me identify the primary topic of a sentence... Let me know if you are aware of any!!!

推荐答案

我实际上做了一个这方面的研究项目,赢得了两场比赛,并且正在参加全国比赛.

I actually did a research project on this and won two competitions and am competing in nationals.

该方法有两个步骤:

  1. 上下文无关语法
  2. 解析句子
  3. 在生成的解析树中,找出所有只从属于类名词短语成分的名词

例如,I ate pie"有两个名词:I"和pie".查看解析树,pie"位于动词短语内,因此它不能成为主语.然而,I"仅位于类 NP 成分的内部.作为唯一的科目候选人,它是科目.在 http://www.candlemind.com 上找到该程序的早期副本.请注意,词汇仅限于基本的单数词,并且没有动词变位,因此它有man"但没有men",有eat"但没有ate".另外,我使用的CFG是手工制作的限量版.我很快就会更新这个程序.

For example, "I ate pie" has 2 nouns: "I" and "pie". Looking at the parse tree, "pie" is inside of a Verb Phrase, so it cannot be a subject. "I", however, is only inside of NP-like constituents. being the only subject candidate, it is the subject. Find an early copy of this program on http://www.candlemind.com. Note that the vocabulary is limited to basic singular words, and there are no verb conjugations, so it has "man" but not "men", has "eat" but not "ate." Also, the CFG I used was hand-made an limited. I will be updating this program shortly.

无论如何,这个程序有限制.我的导师在当前状态下指出,它无法识别主语为真实"NP(语法实际上称为 NP)的句子.例如,月亮是平的不再是一个争论."题目其实是月亮是平的".但是,该程序会将月亮"识别为主题.我会尽快解决这个问题.

Anyway, there are limitations to this program. My mentor pointed out in its currents state, it cannot recognize sentences with subjects that are "real" NPs (what grammar actually calls NPs). For example, "that the moon is flat is not a debate any longer." The subject is actually "that the moon is flat." However, the program would recognize "moon" as the subject. I will be fixing this shortly.

无论如何,这对于大多数句子来说已经足够了......

Anyway, this is good enough for most sentences...

我的研究论文也可以在那里找到.转到第 11 页阅读方法.

My research paper can be found there too. Go to page 11 of it to read the methods.

希望这会有所帮助.

这篇关于是否存在帮助检测“主要话题"的算法?一个英文句子?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆