Is there a way to make robots ignore certain text?


Question


I have a blog (you can find it through my profile if you want). It is new, and so are Google's crawl results for it.

The results alarmed me. Apparently the 2 most common words on my site are "rss" and "feed", because I use link text like "Comments RSS", "Post Feed", etc. These 2 words are present in every post, while other words are rarer.

Is there a way to make these links disappear from Google's parsing? I don't want technical links getting indexed. I only want content, titles, descriptions to get indexed. I am looking for something other than replacing this text with images.

I found some old discussions on Google from back in 2007. (In 3 years many things could have changed, hopefully this too.)

This question is not about robots.txt and how to make Google ignore pages. It is about making it ignore small parts of a page, or transforming those parts so that they are visible to humans but invisible to robots.

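Solution

There is a simple way to tell Google not to index parts of your documents: the googleon and googleoff comment tags. (These tags are documented for the Google Search Appliance; Google's public web search is not documented to honor them.)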

<p>This is normal (X)HTML content that will be indexed by Google.</p>

<!--googleoff: index-->

<p>This (X)HTML content will NOT be indexed by Google.</p>

<!--googleon: index-->

In this example, the second paragraph will not be indexed by Google. Notice the "index" parameter, which may be set to any of the following:

  • index: content surrounded by "googleoff: index" will not be indexed by Google

  • anchor: anchor text for any links within a "googleoff: anchor" area will not be associated with the target page

  • snippet: content surrounded by "googleoff: snippet" will not be used to create snippets for search results

  • all: content surrounded by "googleoff: all" is treated with all of the above attributes (index, anchor, and snippet)
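
To make the effect of the index attribute concrete, here is a minimal Python sketch of what an indexer that honors these tags would be left with after dropping the marked regions. It is only an illustration, not Google's implementation: the function name strip_googleoff, the sample markup, and the /feed and /comments/feed URLs are all hypothetical, and it assumes the tags are well-formed, correctly paired, and not nested.

```python
import re


def strip_googleoff(html: str, attribute: str = "index") -> str:
    """Return html with every googleoff..googleon region for `attribute` removed.

    Assumption: each googleoff comment is followed by a matching googleon
    comment for the same attribute. An unpaired googleoff is left untouched.
    """
    attr = re.escape(attribute)
    pattern = re.compile(
        rf"<!--\s*googleoff:\s*{attr}\s*-->.*?<!--\s*googleon:\s*{attr}\s*-->",
        flags=re.DOTALL | re.IGNORECASE,
    )
    return pattern.sub("", html)


# Hypothetical blog post markup: the technical feed links from the question
# are wrapped in googleoff/googleon so only the real content remains.
SAMPLE_POST = """<h2>My first post</h2>
<p>Post content that should be indexed.</p>
<!--googleoff: index-->
<a href="/feed">Post Feed</a>
<a href="/comments/feed">Comments RSS</a>
<!--googleon: index-->
"""

if __name__ == "__main__":
    print(strip_googleoff(SAMPLE_POST))
```

For the blog in the question, the idea would be to wrap only the "Comments RSS" and "Post Feed" links this way, so the post title and body stay outside the marked region.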

