Nutch Raw Html Saving

Views: 29 Published: 2021/6/11 18:42:19 nutch


Problem description

I'm trying to save the raw HTML of crawled pages to separate files, each named after the URL of its page. Is it possible with Nutch to save the raw HTML pages to separate files while skipping the indexing step?
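To make the "one file per page, named as the URL" part of the question concrete, here is a minimal sketch of just the naming and writing step. It does not use any Nutch API: it assumes the raw page bytes have already been read out of the crawl data by some other means, and the names `url_to_filename`, `filename_to_url`, and `save_raw_html` are hypothetical helpers introduced only for illustration.

```python
from pathlib import Path
from urllib.parse import quote, unquote


def url_to_filename(url: str) -> str:
    """Percent-encode every reserved character (including '/' and ':')
    so the whole URL becomes one file name that is safe on disk."""
    return quote(url, safe="") + ".html"


def filename_to_url(name: str) -> str:
    """Reverse url_to_filename: drop the '.html' suffix and decode."""
    return unquote(name[: -len(".html")])


def save_raw_html(url: str, raw_html: bytes, out_dir: str) -> Path:
    """Write the untouched page bytes to out_dir/<encoded url>.html.

    Assumption: raw_html is the fetched content exactly as downloaded;
    obtaining it from Nutch is outside the scope of this sketch.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    target = out / url_to_filename(url)
    target.write_bytes(raw_html)  # bytes, so the original encoding is preserved
    return target
```

Percent-encoding keeps the mapping reversible, so the original URL can be recovered from the file name. Very long URLs may exceed the file-name length limit of the file system, in which case a hash of the URL would be a safer name.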
