"The filtering process has been terminated" on crawl of .msg files with PDFs


Problem description



When crawling ~260,000 documents in a file share, I get this error for about 2,600 Outlook .msg files:

"The filtering process has been terminated"

 

The message files that I have checked all seem to contain PDF attachments. I have the PDF IFilter installed on the same server as MSS2008E.

Solution

I just checked, and I seem to have exactly the same issue. My site was also crawled AFTER I installed the PDF IFilter, and the error still appeared.

Yet when I search for .msg, I get results that DO have PDFs attached ... Out of 15,181 random files crawled successfully, I have 245 that returned an error. I also found a .doc file with the same error, though that was because the Word document was open for use by someone on the network at the time.

I know my coworkers didn't have 200+ of the e-mail messages saved on our file server open at the time; there are only 7 of us here ....
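One way to test the suspicion that the failing .msg files are the ones carrying PDFs is to scan each failed file for the PDF header marker `%PDF-`. The following is only a rough sketch, not a real .msg parser: it assumes the attachment bytes are stored uncompressed inside the .msg container, so the marker is visible in the raw file, and it can report false positives if those bytes occur elsewhere. The function names and the idea of passing in a list of failed paths (for example, copied from the crawl log) are assumptions for illustration.

```python
PDF_MAGIC = b"%PDF-"  # byte sequence that starts every PDF file


def contains_pdf_signature(path, chunk_size=1 << 20):
    """Heuristic: return True if the raw bytes of `path` contain b"%PDF-".

    The file is read in chunks; a short tail of the previous data is kept
    so that a marker split across two chunks is still found.
    """
    overlap = len(PDF_MAGIC) - 1
    tail = b""
    with open(path, "rb") as fh:
        while True:
            chunk = fh.read(chunk_size)
            if not chunk:
                return False
            window = tail + chunk
            if PDF_MAGIC in window:
                return True
            tail = window[-overlap:]


def summarize(failed_paths):
    """Return (files with a PDF marker, total files checked)."""
    with_pdf = sum(1 for p in failed_paths if contains_pdf_signature(p))
    return with_pdf, len(failed_paths)


# Hypothetical usage, with paths taken from the crawl log:
# hits, total = summarize([r"\\fileserver\share\mail\example.msg"])
# print(f"{hits} of {total} failed .msg files appear to contain a PDF")
```

If nearly all failed files show the marker while successfully crawled .msg files rarely do, that would point at the PDF IFilter rather than at files being locked by other users.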

