剥离HTML标签及其内容 [英] Strip HTML tags and its contents
本文介绍了剥离HTML标签及其内容的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
这是一些包含照片的文本。
< span class ='title'> photobyile< /跨度>
我希望函数返回
这是一些包含照片的文字。
这是我试过的:
$ dom = new domDocument;
$ dom-> loadHTML($ string);
$ dom-> preserveWhiteSpace = false;
$ spans = $ dom-> getElementsByTagName('span');
foreach($ spans as $ span)
{
$ naslov = $ span-> nodeValue;
echo $ naslov;
$ string = preg_replace(/ $ naslov /,,$ string);
}
我知道 $ span-> nodeValue
返回跨标签的值,而不是整个标签,但是我不知道如何获取整个标签以及类名。
谢谢,
Ile
解决方案
尝试从DOM树中直接删除跨度。
$ dom = new DOMDocument();
$ dom-> loadHTML($ string);
$ dom-> preserveWhiteSpace = false;
$ elements = $ dom-> getElementsByTagName('span');
while($ span = $ elements-> item(0)){
$ span-> parentNode-> removeChild($ span);
}
echo $ dom-> saveHTML();
I'm using DOM to parse string. I need function that strips span tags and its contents. For example, if I have:
This is some text that contains photo.
<span class='title'> photobyile</span>
I would like function to return
This is some text that contains photo.
This is what I tried:
$dom = new domDocument;
$dom->loadHTML($string);
$dom->preserveWhiteSpace = false;
$spans = $dom->getElementsByTagName('span');
foreach($spans as $span)
{
$naslov = $span->nodeValue;
echo $naslov;
$string = preg_replace("/$naslov/", " ", $string);
}
I'm aware that $span->nodeValue
returns value of span tag and not whole tag, but I don't know how to get whole tag, together with class name.
Thanks, Ile
解决方案
Try removing the spans directly from the DOM tree.
$dom = new DOMDocument();
$dom->loadHTML($string);
$dom->preserveWhiteSpace = false;
$elements = $dom->getElementsByTagName('span');
while($span = $elements->item(0)) {
$span->parentNode->removeChild($span);
}
echo $dom->saveHTML();
这篇关于剥离HTML标签及其内容的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文