奇怪的文件系统爬网行为 [英] Strange file system crawling behavior

查看:104
本文介绍了奇怪的文件系统爬网行为的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我们看到文件系统爬网中缺少一些子文件夹,这对我来说很奇怪.  日志中没有错误.

因此,我们正在索引\\ fileserver \ public \ applications \ sourcesafe \ shadow

当我们在以下路径中搜索文件时,什么都不会出现,\ branches \下也没有任何索引:   \\ fileserver \ public \ applications \ sourcesafe \ shadow \ _svn \ repository \ folder \ branches \

如果我们搜索文件(唯一的区别是主干vs分支)\ n \ fileserver \ public \ applications \ sourcesafe \ shadow \ _svn \ repository \ folder \ trunk \,它显示的很好,因此文件夹深度不足问题.

我们创建了一个新的内容源,其中包含丢失的文件夹(所有文件夹均为分支"文件夹),并具有该索引,并且索引良好,因此在分支文件夹上没有权限问题.

我在Search Admin中搜索了URL,我看到它索引了branchs文件夹,但其下没有任何内容.  完全没有错误消息.

关于为什么它拒绝索引以下内容的任何想法?谢谢!

解决方案

请创建内容源并抓取\\ fileserver \ public \ applications \ sourcesafe \ shadow \ _ svn \ repository \ folder \ branches,比较结果.

提供这些内容源(所有文件共享内容源)的爬网日志的屏幕快照,以供研究

管理中心>管理服务应用程序>搜索服务应用程序>抓取日志.

最好的问候,

Linda Zhang


Hey Guys,

We're seeing some subfolders missing from a file system crawl and it's pretty strange to me.  No errors in the logs.

So we are indexing \\fileserver\public\applications\sourcesafe\shadow

When we search for an file in the following path, nothing comes up, nothing is being indexed below \branches\:   \\fileserver\public\applications\sourcesafe\shadow\_svn\repository\folder\branches\

If we search for a file (only difference is trunk vs branch)  \\fileserver\public\applications\sourcesafe\shadow\_svn\repository\folder\trunk\, it comes up fine, so not a folder depth issue.

We created a new content source containing the missing folders (all of which are "branch" folders), and had that index, and it indexes fine, so not a permissions issue on the branches folder.

I searched for the URL in Search Admin, and I see it indexes the branches folder, but nothing underneath it.  No error messages at all.

Any ideas on why it refuses to index anything below?  Thanks!

解决方案

Hi IEEEGSMDiOrio,

Please create a content source and crawl for \\fileserver\public\applications\sourcesafe\shadow \_svn\repository\folder\branches, compare the results.

Provide the screenshot of the crawl log for these content source (all File Shares content sources) for researching.

Central Administration > Manage service applications > Search service application > Crawl log.

Best Regards,

Linda Zhang


这篇关于奇怪的文件系统爬网行为的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆