有没有办法让机器人忽略某些文字? [英] Is there a way to make robots ignore certain text?
问题描述
我有我的博客(如果你愿意,可以从我的个人资料中看到它),这是新鲜的,以及谷歌机器人解析结果。
结果令我感到震惊。显然,我网站上最常见的两个词是rss和feed,因为我使用Comments RSS,Post Feed等链接文本。这两个词将出现在每个帖子中,而其他词将更加罕见。
有没有办法让这些链接从Google的解析中消失?我不希望技术链接获得索引。我只想要内容,标题,说明进行索引。
我在2007年发现了一些关于Google的旧式讨论(我认为在3年内很多事情可能会发生变化,希望这也是)
这个问题不是关于robots.txt以及如何让Google忽略页面。它是关于使它忽略页面的小部分,或者以人类可以看见的方式来转换部件,并且机器人不可见。
有一种简单的方法可以告诉google不要索引文档的一部分,即使用 googleon
和 googleoff
:
< p>这是正常的(X)HTML内容,将被索引由Google提供。< / p>
<! - googleoff:index - >
< p>此(X)HTML内容不会被Google索引。< / p>
<! - googleon:index - >
在本例中,第二段不会被Google编入索引。请注意 index
参数,该参数可以设置为以下任何一个:
< li index
- 由<
Google
定位 - 定位
googleoff:anchor code>area
将不会与目标页面关联
片段
- 内容被 googleoff:snippet
>包围的内容不会被用来为搜索结果创建片段
I have my blog (you can see it if you want, from my profile), and it's fresh, as well as google robots parsing results are. The results were alarming to me. Apparently the most common 2 words on my site are "rss" and "feed", because I use text for links like "Comments RSS", "Post Feed", etc. These 2 words will be present in every post, while other words will be more rare. Is there a way to make these links disappear from Google's parsing? I don't want technical links getting indexed. I only want content, titles, descriptions to get indexed. I am looking for something other than replacing this text with images. I found some old discussions on Google, back from 2007 (I think in 3 years many things could have changed, hopefully this too) This question is not about robots.txt and how to make Google ignore pages. It is about making it ignore small parts of the page, or transforming the parts in such a way that it will be seen by humans and invisible to robots. There is a simple way to tell google to not index parts of your documents, that is using In this example, the second paragraph will not be indexed by Google. Notice the " 这篇关于有没有办法让机器人忽略某些文字?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋! all
- 由<
googleon
and googleoff
:<p>This is normal (X)HTML content that will be indexed by Google.</p>
<!--googleoff: index-->
<p>This (X)HTML content will NOT be indexed by Google.</p>
<!--googleon: index-->
index
" parameter, which may be set to any of the following:
index
— content surrounded by "googleoff: index
" will not be indexed
by Googleanchor
— anchor text for any links within a "googleoff: anchor
" area
will not be associated with the target pagesnippet
— content surrounded by "googleoff: snippet
" will not be used
to create snippets for search resultsall
— content surrounded by "googleoff: all
" are treated with all