Scraper fails on files over ~390KB


Question

Does Facebook's URL scraper have a size limitation? We have several books available on a website. Those whose HTML file size is under a certain size (~390KB) get scraped and read properly, but the 4 that are larger do not. These larger items get a 200 response code, and the canonical URL opens.

All of these pages are built using the same template; the only differences are the size of the content within each book and the number of links each book makes to other pages on the site.

Steps to reproduce:

  1. Click on the canonical URL.
  2. Open Firebug in Firefox, or the developer tools in Chrome, to the Network tab.
  3. The *.html size is >~390KB for the listed failures and <~390KB for the successes.
  4. Click on "See exactly what our scraper sees for your URL".
  5. A blank page is shown for the failures; HTML is present for the successes.
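The size comparison above can also be reproduced from the command line by fetching a page the same way the scraper does (a sketch assuming curl is available; the helper name is made up here):

```shell
# Fetch a URL with Facebook's crawler user agent and print only the
# HTTP status and the number of bytes downloaded, so the ~390KB
# failures can be compared against the smaller successes.
scrape_like_facebook() {
  curl -s -o /dev/null \
    -A "facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)" \
    -w "status=%{http_code} bytes=%{size_download}\n" \
    "$1"
}
```

For example, `scrape_like_facebook "http://rcg.org/books/tapom.html"` prints a `status=... bytes=...` line for one of the failing books.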

Failures:

  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftapom.html
  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftbgpu.html
  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Fttjc.html
  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftbdse.html

Successes:

  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Fthogtc.html
  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Faabibp.html
  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftww.html
  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftsosw.html
  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Fsyottc.html
  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Fttigtio.html
  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Faadac.html
  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Fsiud.html
  • https://developers.facebook.com/tools/debug/og/object?q=http%3A%2F%2Frcg.org%2Fbooks%2Ftuyc.html

Answer

A solution for your problem might be to check whether a real user or the Facebook bot is visiting your page. If it is the bot, then render only the necessary metadata for it. You can detect the bot via its user agent, which according to the Facebook documentation is:

"facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)"

The code would look something like this (in PHP):

// Returns true when the request comes from Facebook's crawler.
// Matching on the "facebookexternalhit" prefix is more robust than an
// exact string comparison, since the version number in the user agent
// may change, and the isset() guard avoids a notice when no
// User-Agent header is sent.
function userAgentIsFacebookBot() {
    $ua = isset($_SERVER['HTTP_USER_AGENT']) ? $_SERVER['HTTP_USER_AGENT'] : '';
    return strpos($ua, 'facebookexternalhit') === 0;
}
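Building on that check, the bot branch can then return a stripped-down document containing only the Open Graph tags, keeping the scraped page well under the size where the scraper fails. This is a sketch; `pageForUserAgent` and the og: values are illustrative placeholders, not the site's real code:

```php
<?php
// Hypothetical sketch: serve only the scraper metadata to Facebook's
// crawler, and the full (large) book HTML to everyone else.
function pageForUserAgent($userAgent) {
    if (strpos($userAgent, 'facebookexternalhit') === 0) {
        // Bot branch: a tiny page with only the Open Graph tags
        // (placeholder values shown).
        return '<html><head>'
             . '<meta property="og:title" content="Example Book" />'
             . '<meta property="og:type" content="book" />'
             . '</head><body></body></html>';
    }
    // Real visitor: the full book content (stubbed out here).
    return '<html><body><!-- full book HTML --></body></html>';
}

echo pageForUserAgent(isset($_SERVER['HTTP_USER_AGENT']) ? $_SERVER['HTTP_USER_AGENT'] : '');
```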

