NLP的Python框架? [英] Python frameworks for NLP?

查看:89
本文介绍了NLP的Python框架?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在一个项目中,我必须从一组文章中提取以下信息(这些文章可能涉及任何内容):

I am working on a project wherein I have to extract the following information from a set of articles (the articles could be on anything):

  • 查找在场的任何人的名字,例如巴拉克·奥巴马"

  • People Find the names of any people present, like "Barack Obama"

主题或文章的相关标签,例如议会" 世界能源"

Topic or related tags of the article, like "Parliament", "World Energy"

公司/组织,我应该能够获得提及的任何公司或组织的名称,例如"Apple" "Google"

Company/Organisation I should be able to obtain the names of the any companies or organisations mentioned, like "Apple" or "Google"

在Python中是否可以使用这种NLP框架/库来帮助我完成此任务?

Is there an NLP framework/library of this sort available in Python which would help me accomplish this task?

推荐答案

您真正要寻找的东西在文献"命名实体识别"或NER中称为. 您可能想看一下本教程:

What you are actually looking for is called in literature 'Named entity Recognition' or NER. You might like to take a look at this tutorial:

部分解决此问题的一种简单方法是使用正则表达式提取具有以下

One easy way of solving this problem partially this problem is using regular expressions to extract words having the patterns that you can find in this paper to extract peoples names. This of course might lead to extracting all the categories you are looking for i.e. the topics and the campanies names as well.

还有一个可以使用的API,实际上可以提供与您所寻找的结果相同的API,称为

There is also an API that you can use, that actually gives the same results you are looking for, which is called Alchemy. Unfortunatelly no documentation is available to explain the method they use to extract the topics nor the people's names.

希望这会有所帮助.

这篇关于NLP的Python框架?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆