清理姓名和地址数据的工具? [英] tools for cleaning name and address data?

查看:73
本文介绍了清理姓名和地址数据的工具?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

每个人用什么工具来清理名称和地址数据

(包括识别非直接明显的重复项)与CRM项目或客户维度的

连接数据

仓库?你喜欢/不喜欢你使用的工具是什么?

如何定制你使用的工具?

解决方案

我们在西班牙语国家有客户,所以如果是a

具体问题我猜这个不行。


感谢您的信息。 :)


2004年4月21日星期三15:39:23 -0700,GL

< GL@noSpam.ReplyToNewsgroup.com>写道:

我使用过 www.MelissaData.com 。他们主要为我工作
地址数据,但他们往往有解析西班牙街道名称,邮政信箱和PMB(个人邮箱)的问题。

GL
Ellen K. < 72 ************************ @ compuserve.com>在消息中写道
新闻:ks ******************************** @ 4ax.com .. < blockquote class =post_quotes>每个人用什么工具来清理名称和地址数据
(包括识别非直接明显的重复项)与CRM项目的连接或数据的客户维度<仓库?你喜欢/不喜欢你使用的工具是什么?
如何定制你使用的工具?




这听起来像是值得追求的,我将检查出来。我们现在是一个VB商店,但我当然可以转换为C ++示例,但我们希望不需要编写任何代码,即

购买解决方案的想法是消除它。


非常感谢。 :)


2004年4月22日00:58:57 -0700, ry * *******@hotmail.com (Ryan)写道:

我使用来自QAS的32位API( www.qas.com )并在Delphi 5中构建我自己的解决方案。他们的示例代码在C ++中但很容易转换(提供伪代码)。您可以使用他们的API编写自己的代码,也可以使用他们的实用工具为您清理数据。后端是SQL 7,但这可能是任何东西。

QAS Batch允许您检查批量数据并且非常好。
QAS Pro允许用户选择(并钻取) to)与搜索字符串匹配的数据。

有了这个(以及Experian的一些信息),我们能够准确地获得98%的客户数据。

我们有几个应用程序用于批量清理数据。
API的工作非常好,而且很快。到目前为止,我一直非常感动。我们可以非常轻松地调整输出并且具有一致的结果格式。在QAS Batch中,您会得到一个结果
代码,告诉您检查地址的每个部分以及匹配或失败的位置的结果。你可以确定哪些通过/不通过你的支票。

对于Pro,有检查的模糊逻辑。这令人印象深刻且准确(特别是在威尔士地址上)。帮助台非常好,并且愿意检查Delphi代码(不支持,但有人认识Delphi)并在我遇到困难时给出一些指示。

Ryan

" GL" < GL@noSpam.ReplyToNewsgroup.com>在消息新闻中写道:< 10 ************* @ news.supernews.com> ...

我使用了来自 www.MelissaData.com 。他们主要为我工作
地址数据,但他们往往有解析西班牙街道名称,邮政信箱和PMB(个人邮箱)的问题。

GL
Ellen K. < 72 ************************ @ compuserve.com>在消息中写道
新闻:ks ******************************** @ 4ax.com ... < blockquote class =post_quotes>>每个人都使用什么工具清理姓名和地址数据
>
>(包括识别非直接明显的重复)与CRM项目的连接或数据的客户维度
>仓库?你喜欢/不喜欢你使用的工具是什么?怎么
> customizable是您使用的工具吗?




如果我没记错的话,VB中也有例子。除了QAS

在编码方面真的很有帮助。你可以下载代码的例子(完整的

?)副本,值得一看。我首先使用这些示例构建了一个VB版本

然后转换为Delphi。


或者你可以看看他们自己的解决方案,因为他们真的是

好​​。您甚至可以将它们的数据发送给他们进行清理,或者让他们在现场为您打开并清理它们。您可以将他们的应用程序集成到您的应用程序中,或者使用它们单独清理数据

。我自己编写了部分API用于体验,但

也因为我们需要添加到文件中的其他信息

基于结果地址所以最简单的方法就是一起完成这一切




为了让你对性能有所了解(尽我所能),只需简单的/>
Delphi应用程序,SQL 7后端(2x2.4Ghz Xeon服务器2Gb)和QAS,我们可以在b-30之间批量清理100,000条记录(2遍地址) />
小时。这是对代码的各种其他补充(无论如何,这很快就是
)。它应该可以通过任何检查。

我们每3个月运行一次(ish)。如果我们只做地址,我们可能会敲第三个




希望有所帮助。


Ryan


Ellen K.< 72 ************************ @ compuserve.com>在留言新闻中写道:< 6o ******************************** @ 4ax.com>。 ..

这听起来值得追求,我会检查一下。我们目前是VB商店,但我当然可以转换为C ++示例,
虽然我们希望不需要编写任何代码,但购买解决方案的想法是消除那个。

非常感谢。 :)



What tools has everyone used for cleaning name and address data
(including identifying not-immediately-obvious duplicates) in
connection with a CRM project or the Customer dimension of a data
warehouse? What did you like/dislike about the tool you used? How
customizable was the tool you used?

解决方案

We have customers in Spanish-speaking countries, so if that is a
specific issue I guess this one wouldn''t work.

Thanks for the info. :)

On Wed, 21 Apr 2004 15:39:23 -0700, "GL"
<GL@noSpam.ReplyToNewsgroup.com> wrote:

I have used components from www.MelissaData.com. They mostly worked for me
for address data, however they tended to have issues parsing Spanish street
names, PO boxes and PMB (Personal Mail Box).

GL

"Ellen K." <72************************@compuserve.com> wrote in message
news:ks********************************@4ax.com.. .

What tools has everyone used for cleaning name and address data
(including identifying not-immediately-obvious duplicates) in
connection with a CRM project or the Customer dimension of a data
warehouse? What did you like/dislike about the tool you used? How
customizable was the tool you used?




This sounds like it''s worth pursuing, I will check it out. We are
currently a VB shop but I could certainly convert from C++ examples,
although we were hoping not to have to code much of anything, i.e. the
idea of a purchased solution was to eliminate that.

Thanks very much. :)

On 22 Apr 2004 00:58:57 -0700, ry********@hotmail.com (Ryan) wrote:

I use the 32 Bit API''s from QAS ( www.qas.com ) and build up my own
solutions in Delphi 5. Their example code is in C++ but pretty easy to
convert (pseudo code provided). You can either write your own code
using their API''s or use one of their utilities to cleanse the data
for you. The backend is SQL 7, but this could be anything.

QAS Batch allows you to check batches of data and is very good.
QAS Pro allows you the user to choose (and drill in to) the data
matched to a search string.

With this (and some info from Experian) we have been able to get 98%
of our customer data accurate.

We have a couple of applications we use for cleansing data in batches.
The API stuff works really well and is quick. I have been very
impressed so far. We can tweak the output very easily and have
consistent formatting of the results. In QAS Batch you get a result
code telling you the results of checking each part of the address and
where it matched or failed. You can determine which ones pass/fail
your checks.

With Pro, there is fuzzy logic with the checking. This is impressive
and accurate (especially on Welsh addresses). The helpdesk were really
good and willing to check over the Delphi code (not supported but
someone there knew Delphi) and give a few pointers when I got stuck.

Ryan
"GL" <GL@noSpam.ReplyToNewsgroup.com> wrote in message news:<10*************@news.supernews.com>...

I have used components from www.MelissaData.com. They mostly worked for me
for address data, however they tended to have issues parsing Spanish street
names, PO boxes and PMB (Personal Mail Box).

GL

"Ellen K." <72************************@compuserve.com> wrote in message
news:ks********************************@4ax.com...

> What tools has everyone used for cleaning name and address data
> (including identifying not-immediately-obvious duplicates) in
> connection with a CRM project or the Customer dimension of a data
> warehouse? What did you like/dislike about the tool you used? How
> customizable was the tool you used?




There were examples in VB as well if I remember correctly. Besides QAS
were really helpful on the coding side. You can download example (full
?) copies of the code which may be worth a look. I built a VB version
first using the examples and then converted to Delphi.

Alternatively you could look at their own solutions as they are really
good. You could even send them the data for them to cleanse, or have
them turn up and cleanse it for you on site. You can possibly
integrate their apps into your app, or use them to cleanse the data
seperately. I wrote my own partly for the experience with API''s, but
also as we have additional information we need to add to the file
based on the results of the address so it was easiest to do this all
together.

To give you an idea of performance (as best I can), with a simple
Delphi app, SQL 7 backend (2x2.4Ghz Xeon server 2Gb) and QAS, we can
batch cleanse 100,000 records (2 passes of address) in between 25-30
hours. This is with all sorts of other additions to the code (which is
pretty quick anyway). It should be able to fly through any checking.
We run this every 3 months (ish). We could probably knock a third off
that if we just did the addresses only.

Hope that helps.

Ryan

Ellen K. <72************************@compuserve.com> wrote in message news:<6o********************************@4ax.com>. ..

This sounds like it''s worth pursuing, I will check it out. We are
currently a VB shop but I could certainly convert from C++ examples,
although we were hoping not to have to code much of anything, i.e. the
idea of a purchased solution was to eliminate that.

Thanks very much. :)



这篇关于清理姓名和地址数据的工具?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆