PHP中的简历/简历解析 [英] Resume/CV Parsing in PHP

查看:542
本文介绍了PHP中的简历/简历解析的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我们正在使用LAMP开发一个基于需求的社交媒体网站.

We are developing a requirement base social media site using LAMP.

为此,我们要在PHP中执行Resume/CV解析.

For that we want to do Resume/CV Parsing in PHP.

我们能够解析Email-id和Phone,但不确定如何解析简历中的其他信息,例如全名,地址,学历,工作等.

We were able to parse Email-id and Phone, but not sure how to parse the other information like full name, address, education, employment etc from the resume.

加上简历/简历可以采用doc,html,rtf,txt等多种格式.

Plus resume/CV can be in various formats like doc,html,rtf,txt etc.

任何人都不知道PHP脚本,我们可以在其中获取数据.或任何可以启动的开发思路.

Anyone know abt the PHP script, where we can grab the data. or any development idea to kick start.

谢谢.

推荐答案

我将查看现有的简历解析器是否具有可以使用的API或可以添加到框架的自定义钩子.查看 Sovren TextKernel

I would see if an existing resume parser has an API you can use or a custom hook you can add to your framework. Check out Sovren or TextKernel

根据Sovren的网站,他们引用:

According to Sovren's website, they quote:

一旦您的SovrenConvertAndParse Web服务实例为 运行时,您将通过SOAP访问它.几乎所有编程 环境具有自动创建Web服务客户端或 Web服务的WSDL自动生成Web服务代理.我们也 在某些环境(例如PHP)中有示例客户端.在任何情况下, 创建Web服务客户端应该是一项非常快速的任务:通常是 几分钟,也许几个小时.

Once your instance of the SovrenConvertAndParse Web Service is running, you will access it via SOAP. Almost all programming environments have the ability to auto-create a web service client or web service proxy automatically from the web service’s WSDL. We also have sample clients for some environments such as PHP. In any case, creating the web service client should be a very quick task: usually a few minutes, maybe a few hours.

创建了Web服务客户端后,您可以调用一个 一种在Web服务上将简历转换和解析的方法 操作,以返回HR-XML输出.

Once you have created your web service client, you can call a single method on the web service to convert and parse a resume in one operation, receiving HR-XML output in return.

http://www.sovren.com/sovren-products-parser- Implementation.php

http://www.sovren.com/sovren-products-web- service.php

TextKernel引号:

TextKernel quotes:

处理所有类型的文档(DOC,DOCX,PDF,RTF, HTML,TIFF,TXT,XML,MSG和EML类型文档). Textkernel开箱即用提供以下11种语言:英语,德语,法语,荷兰语,西班牙语,瑞典语,丹麦语,波兰语,罗马尼亚语,意大利语,斯洛伐克语.

Document processing for all types of documents (DOC, DOCX, PDF, RTF, HTML, TIFF, TXT, XML, MSG, and EML type documents). Textkernel offers the following 11 languages out of the box: English, German, French, Dutch, Spanish, Swedish, Danish, Polish, Romanian, Italian, Slovak.

似乎他们的Web界面称为Sourcebox:

It seems their web interface is called Sourcebox:

Sourcebox可以使用Textkernel的CV Extract!进行完全配置 解析软件.

Sourcebox is fully configurable with Extract!, Textkernel's CV Parsing software.

Sourcebox具有多语言的Web界面,供员工管理 CV队列并手动检查并更正异常.

Sourcebox has a multilingual web interface for staff to manage the CV queue and manually check and correct exceptions.

Sourcebox可用作许多领先的CRM,ATS, 匹配引擎,HRMS系统和您自己的网站或招聘 门户.

Sourcebox can be used as an interface to many leading CRM, ATS, Matching engines, HRMS systems and your own website or recruitment portal.

http://www.textkernel.com/hr_solutions.php?nav=sourcebox

它们似乎都很有希望利用,而不必在这里重新发明轮子,尤其是在使用PHP的情况下.

They both seem promising enough to utilize and not have to reinvent the wheel here, especially with PHP.

这篇关于PHP中的简历/简历解析的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆