Nutch - deleting segments
Question
I have a Nutch crawl with 4 segments, all fully indexed using the bin/nutch solrindex command. Now I'm all out of storage on the box, so can I delete the 4 segments, keep only the crawldb, and continue crawling from where I left off?
Since all the segments are merged and indexed to Solr I don't see a problem in deleting the segments, or am I wrong there?
Recommended answer
Thanks to the help on the Nutch mailing list, I found out that I can delete those segments.
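
For illustration, the cleanup could look roughly like the sketch below. The directory layout (a crawl directory holding crawldb/ and segments/) and the helper name delete_segments are assumptions made for this example, not details from the question. Keep in mind that segments hold the fetched and parsed content, so once they are gone, rebuilding the Solr index would mean fetching the pages again.

```shell
#!/bin/sh
# Sketch only: remove already-indexed Nutch segments while keeping the crawldb.
# Assumed (hypothetical) layout: <crawl_dir>/crawldb and <crawl_dir>/segments.

delete_segments() {
    crawl_dir="$1"
    # Refuse to delete anything unless the crawldb we want to keep is present.
    if [ ! -d "$crawl_dir/crawldb" ]; then
        echo "no crawldb under '$crawl_dir', refusing to delete" >&2
        return 1
    fi
    # Remove every segment directory; crawldb is left untouched.
    rm -rf "$crawl_dir/segments"/*
}

# Example call (assumes the crawl lives in ./crawl):
#   delete_segments crawl
# The next crawl round would then begin again from the crawldb, e.g.:
#   bin/nutch generate crawl/crawldb crawl/segments
```

The check for crawldb is there only as a guard against pointing the function at the wrong directory.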