创建网络爬虫时的关键考虑因素是什么? [英] What are the key considerations when creating a web crawler?

查看：36 发布时间：2021/9/22 20:27:30 web-crawler

本文介绍了创建网络爬虫时的关键考虑因素是什么?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我今天刚开始考虑创建/定制一个网络爬虫，对网络爬虫/机器人礼仪知之甚少.我发现的大多数有关礼仪的文章都显得陈旧而笨拙，因此我想从网络开发者社区中获得一些当前(和实用)的见解.

I just started thinking about creating/customizing a web crawler today, and know very little about web crawler/robot etiquette. A majority of the writings on etiquette I've found seem old and awkward, so I'd like to get some current (and practical) insights from the web developer community.

为了一个超级简单的目的，我想使用爬虫遍历网络"——站点 XYZ 的标记是否满足条件 ABC?".

I want to use a crawler to walk over "the web" for a super simple purpose - "does the markup of site XYZ meet condition ABC?".

这给我带来了很多问题，但我认为我需要首先解决的两个主要问题是:

This raises a lot of questions for me, but I think the two main questions I need to get out of the way first are:

从一开始就感觉有点不确定"——这种事情可以接受吗?
为了不让人们感到不安，抓取工具应采取哪些具体考虑?

创建网络爬虫时的关键考虑因素是什么? [英] What are the key considerations when creating a web crawler?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

创建网络爬虫时的关键考虑因素是什么? [英] What are the key considerations when creating a web crawler?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭