无法抓取非基于SharePoint的Java网站? [英] Not able to crawl Non-SharePoint java based sites?

查看:50
本文介绍了无法抓取非基于SharePoint的Java网站?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

大家好,

我们已经能够成功抓取多个非SharePoint网站.

We have been able to crawl multiple non-SharePoint sites successfully.

但是我们在抓取ex的基于Java的站点方面面临挑战. https://isdp.nih.gov/.

But we are having challenge in crawling java based sites for ex. https://isdp.nih.gov/ .

在爬网日志中,它仅显示已爬网的根URL.没有错误,没有警告.

In crawl logs it only shows root URL crawled. No errors, no warnings.

如果您能提供一些意见,那将很棒.

It will be great if you could provide some input.

谢谢

Rahul Babar

Rahul Babar

ASP.NET,C#4.0,Sharepoint 2007/2010,Infopath 2007/2010开发人员http://sharepoint247.wordpress.com/

ASP.NET, C# 4.0, Sharepoint 2007/2010, Infopath 2007/2010 Developer http://sharepoint247.wordpress.com/

推荐答案

嗨Rahul,

Hi Rahul, 

什么是基于Java的网站?

What is the java based sites? 

创建内容源时,您是否选择网站内容源类型?

When you create content source, do you choose web sites content source type? 

更改内容源中的抓取"设置以抓取基于Java的网站:

Change the Crawl settings in your content source to crawl the java based sites:

最好的问候,

Lisa Chen

Lisa Chen 


这篇关于无法抓取非基于SharePoint的Java网站?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆