What's the IP address range of Facebook's Open Graph crawler?


Question

In order to test the Open Graph API on our preview environment, we need to poke a hole in our firewall to allow Facebook to scrape our object pages. What IP ranges should we allow?

Recommended answer

Edit

Facebook has been showing some love and is now making its scraper IP blocks public for anyone to use:

http://developers.facebook.com/docs/ApplicationSecurity/#facebook_scraper
https://developers.facebook.com/docs/sharing/best-practices#crawl


Facebook Scraper

A number of Platform services such as Social Plugins and the Open Graph require our systems to be able to reach your Web Pages. We recognize that there are situations where you might not want these pages on the public Internet, during testing or for other security reasons.

To facilitate this, you should make exceptions in your security systems to allow Facebook to scrape these pages by adding the following IP ranges, accurate as of April 2012.

31.13.24.0/21
31.13.64.0/18
66.220.144.0/20
69.63.176.0/20
69.171.224.0/19
74.119.76.0/22
103.4.96.0/22
173.252.64.0/18
204.15.20.0/22
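
If you would rather enforce the allowlist in application code than in the firewall itself, a minimal sketch using Python's standard ipaddress module could look like the following. The constant and function names are made up for illustration, and the ranges are simply the ones quoted above, which the source dates to April 2012, so check them against Facebook's current documentation before relying on them.

```python
import ipaddress

# CIDR blocks copied from the list above (dated April 2012 in the source;
# they may have changed since).
FACEBOOK_SCRAPER_RANGES = [
    ipaddress.ip_network(cidr)
    for cidr in (
        "31.13.24.0/21",
        "31.13.64.0/18",
        "66.220.144.0/20",
        "69.63.176.0/20",
        "69.171.224.0/19",
        "74.119.76.0/22",
        "103.4.96.0/22",
        "173.252.64.0/18",
        "204.15.20.0/22",
    )
]


def is_facebook_scraper_ip(addr: str) -> bool:
    """Return True if addr falls inside one of the listed blocks.

    Hypothetical helper name; strings that are not valid IP addresses
    are treated as not allowed.
    """
    try:
        ip = ipaddress.ip_address(addr)
    except ValueError:
        return False
    return any(ip in network for network in FACEBOOK_SCRAPER_RANGES)
```

For example, is_facebook_scraper_ip("31.13.24.1") is True, while is_facebook_scraper_ip("8.8.8.8") is False.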







Instead of IP ranges, you can also have your firewall match on the scraper's user agent.

http://developers.facebook.com/docs/reference/plugins/like/


When does Facebook scrape my page?

Facebook needs to scrape your page to know how to display it around the site.

Facebook scrapes your page every 24 hours to ensure the properties are up to date. The page is also scraped when an admin for the Open Graph page clicks the Like button and when the URL is entered into the Facebook URL Linter. Facebook observes cache headers on your URLs - it will look at "Expires" and "Cache-Control" in order of preference. However, even if you specify a longer time, Facebook will scrape your page every 24 hours.

The user agent of the scraper is: "facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)"
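
A user-agent check can be sketched in a few lines of Python. The function and constant names below are hypothetical, and the only thing taken from the quoted documentation is the "facebookexternalhit" product token. Keep in mind that any client can send this header, so on its own this is a much weaker gate than the IP ranges.

```python
from typing import Optional

# Product token taken from the user agent string quoted above.
FACEBOOK_UA_TOKEN = "facebookexternalhit"


def is_facebook_scraper_ua(user_agent: Optional[str]) -> bool:
    """Return True if the User-Agent header contains the scraper's token.

    Hypothetical helper: matches case-insensitively on the token only,
    so a change in version number (e.g. 1.1) does not break the check.
    """
    if not user_agent:
        return False
    return FACEBOOK_UA_TOKEN in user_agent.lower()
```

Matching on the token rather than the full string is a design choice made here so that the check tolerates version changes; it is not something the source prescribes.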
