“过滤过程已经终止”用pdf抓取.msg文件 [英] "The filtering process has been terminated" on crawl of .msg files with pdf's
本文介绍了“过滤过程已经终止”用pdf抓取.msg文件的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
在文件共享中抓取约260,000个文档时,我在Outlook上收到此错误大约2,600的.msg文件:
"过滤过程已经终止"
< p align = left>我签出的消息文件似乎包含了pdf。我将PDF ifilter安装在与MSS2008E服务器相同的服务器上。
解决方案
我刚检查过,我似乎有完全相同的问题。在我安装了PDF ifilter之后网站被抓取了,它仍然弹出了这个错误。
当我搜索。msg 我得到了已经附加了pdf的结果......在成功爬行的15,181个随机文件中,我有245个回击错误。我还发现了一个具有相同错误的.doc文件,但这是因为当时打开Word文档供网络上的某人使用。
我知道我的同事没有200多个e打开的邮件已保存在我们的文件服务器上 - 我们这里只有7个人.... 结果
On crawling of ~260,000 documents in a file share, I get this error on Outlook .msg files for about 2,600:
"The filtering process has been terminated"
The message files that I have checked out seem to have pdf's contained within. I have the PDF ifilter installed on the same server as the MSS2008E Server.
解决方案
I just checked, I seem to have the exact same issue. The site was crawled AFTER I installed the PDF ifilter as well, and it still popped up this error.
Yet when I do a search on .msg I get results that DO have pdf's attached ... Out of 15,181 random files crawled successfully, I have 245 that shot back an error. I also found a .doc file that had the same error, though that's because the Word document was opened for use bye someone on the network at the time.
I know my coworkers didn't have over 200+ e-mail messages opened that were saved on our file server - there's only 7 of us here ....
这篇关于“过滤过程已经终止”用pdf抓取.msg文件的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文