Nutch - 删除段 [英] Nutch - deleting segments

查看:49
本文介绍了Nutch - 删除段的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我有一个包含 4 个段的 Nutch 爬网,这些段使用 bin/nutch solrindex 命令完全索引.现在我所有的存储空间都用完了,所以我可以删除 4 个段并只保留 crawldb 并从我离开的地方继续爬行吗?

I have a Nutch crawl with 4 segments which are fully indexed using the bin/nutch solrindex command. Now I'm all out of storage on the box, so can I delete the 4 segments and retain only the crawldb and continue crawling from where I left it?

由于所有段都被合并并索引到 Solr,我认为删除段没有问题,还是我错了?

Since all the segments are merged and indexed to Solr I don't see a problem in deleting the segments, or am I wrong there?

推荐答案

感谢 Nutch 邮件列表上的帮助,我发现我可以删除那些段.

Thanks to the help on the Nutch mailing list, I found out that I can delete those segments.

这篇关于Nutch - 删除段的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆