最好的算法以突出在一个HTML文件中给定单词的列表 [英] Best algorithm to highlight a list of given words in an HTML file

查看：139 发布时间：2015/11/30 21:41:41 javascript jquery html algorithm

本文介绍了最好的算法以突出在一个HTML文件中给定单词的列表的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一些HTML文件，在这我管不着。因此，我无法改变自己的结构或标记。

I have some HTML files, upon which I have no control. Thus I can't change their structure or markup.

有关每个这些HTML文件，单词的列表将基于另一种算法找到。这些话应在HTML文本高亮显示。例如，如果HTML标记是：

For each of these HTML files, a list of words would be found based on another algorithm. These words should be highlighted in the text of HTML. For example if the HTML markup is:

<p>
Monkeys are going to die soon, if we don't stop killing them. 
So, we have to try hard to persuade hunters not to hunt monkeys. 
Monkeys are very intelligent, and they should survive. 
In fact, they deserve to survive.
</p>

和的词列表是：

are, we, monkey

的结果应该是这样的：

the result should be something like:

<p>
    <span class='highlight'>Monkeys</span> 
    <span class='highlight'>are</span> 
going to die soon, if 
    <span class='highlight'>we</span> 
don't stop killing them. 
So, 
    <span class='highlight'>we</span> 
have to try hard to persuade hunters 
not to hunt 
    <span class='highlight'>monkeys</span>
. They 
    <span class='highlight'>are</span> 
very intelligent, and they should survive. 
In fact, they deserve to survive.
</p>

高亮显示的算法应该：

The highlighting algorithm should:

在不区分大小写
可以使用JavaScript编写的（这种情况发生的内部浏览器）（jQuery是欢迎）
进行快速（适用于某一本书的文字，几乎800页）
在不显示浏览器的著名的停止脚本对话框
适用于肮脏的HTML文件（如支撑无效的HTML标记，比方说未闭合的
元素）（其中一些文件是微软Word的HTML出口，我觉得你有什么我的意思是肮脏的！）
应该preserve原始的HTML标记

be case-insensitive
be written in JavaScript (this happens inside browser) (jQuery is welcomed)
be fast (be applicable for the text of a given book with almost 800 pages)
not showing browser's famous "stop script" dialog
be applicable for dirty HTML files (like supporting invalid HTML markup, say for example unclosed
elements) (some of these files are HTML export of MS Word, and I think you got what I mean by dirty!!!)
should preserve original HTML markup (no markup deletion, no markup change except wrapping intended words inside an element, no nesting change. HTML should look the same before and after edit except highlighted words)

我所做至今：

我得到在JavaScript中的单词列表在一个数组像 [是，我们，猴子]
我尽量选择在浏览器文本节点（它现在有故障）
在每个文本节点我环路，并为每个文本节点，我遍历列表中的每个单词，并设法找到它，把它包一个元素

I get the list of words in JavaScript in an array like ["are", "we", "monkey"]
I try to select text nodes in the browser (which is faulty now)
I loop over each text node, and for each text node, I loop over each word in the list and try to find it and wrap it inside an element

请注意，您可以在线观看这里（用户名：demo@phis.ir，传：演示）。此外当前的脚本可以在页面的源端可见。

Please note that you can watch it online here (username: demo@phis.ir, pass: demo). Also current script could be seen at the end of the page's source.

最好的算法以突出在一个HTML文件中给定单词的列表 [英] Best algorithm to highlight a list of given words in an HTML file

问题描述

推荐答案

相关文章

前端开发最新文章

热门教程

热门工具

登录关闭

最好的算法以突出在一个HTML文件中给定单词的列表 [英] Best algorithm to highlight a list of given words in an HTML file

问题描述

推荐答案

相关文章

前端开发最新文章

热门教程

热门工具

登录 关闭

登录关闭