无法抓取非基于SharePoint的Java网站? [英] Not able to crawl Non-SharePoint java based sites?
问题描述
大家好,
我们已经能够成功抓取多个非SharePoint网站.
We have been able to crawl multiple non-SharePoint sites successfully.
但是我们在抓取ex的基于Java的站点方面面临挑战. https://isdp.nih.gov/.
But we are having challenge in crawling java based sites for ex. https://isdp.nih.gov/ .
在爬网日志中,它仅显示已爬网的根URL.没有错误,没有警告.
In crawl logs it only shows root URL crawled. No errors, no warnings.
如果您能提供一些意见,那将很棒.
It will be great if you could provide some input.
谢谢
Rahul Babar
Rahul Babar
ASP.NET,C#4.0,Sharepoint 2007/2010,Infopath 2007/2010开发人员http://sharepoint247.wordpress.com/
ASP.NET, C# 4.0, Sharepoint 2007/2010, Infopath 2007/2010 Developer http://sharepoint247.wordpress.com/
推荐答案
嗨Rahul,
Hi Rahul,
什么是基于Java的网站?
What is the java based sites?
创建内容源时,您是否选择网站内容源类型?
When you create content source, do you choose web sites content source type?
更改内容源中的抓取"设置以抓取基于Java的网站:
Change the Crawl settings in your content source to crawl the java based sites:
最好的问候,
Lisa Chen
Lisa Chen
这篇关于无法抓取非基于SharePoint的Java网站?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!