Perl non-greedy problem

Views: 91 | Posted: 2020/7/1 19:52:47 | Tags: regex, perl, non-greedy, regex-greedy


Question

I am having a problem with a non-greedy regular expression. I have seen other questions about non-greedy regexes, but they don't answer my problem.

Problem: I am trying to match the href of the "lol" anchor.

Note: I know this can be done with Perl HTML parsing modules, and my question is not about parsing HTML in Perl. My question is about the regular expression itself; the HTML is just an example.

Test case: I have 4 tests each for .*? and [^"]. The first 2 produce the expected result. However, the 3rd doesn't and the 4th does, but I don't understand why.

Questions:

Why does the 3rd test fail in both sets of tests, for .*? and for [^"]? Shouldn't the non-greedy operator work?
Why does the 4th test work in both sets of tests, for .*? and for [^"]? I don't understand why including a .* in front changes the regex (the 3rd and 4th tests are identical except for the .* in front).

I probably don't understand exactly how these regexes work. A Perl Cookbook recipe mentions something, but I don't think it answers my question.

use strict;

my $content=<<EOF;
<a href="/hoh/hoh/hoh/hoh/hoh" class="hoh">hoh</a>
<a href="/foo/foo/foo/foo/foo" class="foo">foo </a>
<a href="/bar/bar/bar/bar/bar" class="bar">bar</a>
<a href="/lol/lol/lol/lol/lol" class="lol">lol</a>
<a href="/koo/koo/koo/koo/koo" class="koo">koo</a>
EOF

print "| $1 | \n\nThat's ok\n" if $content =~ m~href="(.*?)"~s ;

print "\n---------------------------------------------------\n";

print "| $1 | \n\nThat's ok\n" if $content =~ m~href="(.*?)".*>lol~s ;

print "\n---------------------------------------------------\n";

print "| $1 | \n\nWhy does not the 2nd non-greedy '?' work?\n"
  if $content =~ m~href="(.*?)".*?>lol~s ;

print "\n---------------------------------------------------\n";

print "| $1 | \n\nIt now works if I put the '.*' in the front?\n"
  if $content =~ m~.*href="(.*?)".*?>lol~s ;

print "\n###################################################\n";
print "Let's try now with [^\"]";
print "\n###################################################\n\n";


print "| $1 | \n\nThat's ok\n" if $content =~ m~href="([^"]+?)"~s ;

print "\n---------------------------------------------------\n";

print "| $1 | \n\nThat's ok.\n" if $content =~ m~href="([^"]+?)".*>lol~s ;

print "\n---------------------------------------------------\n";

print "| $1 | \n\nThe 2nd non-greedy still doesn't work?\n"
  if $content =~ m~href="([^"]+?)".*?>lol~s ;

print "\n---------------------------------------------------\n";

print "| $1 | \n\nNow with the '.*' in front it does.\n"
  if $content =~ m~.*href="([^"]+?)".*?>lol~s ;
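
The difference between the 3rd and 4th tests can be reproduced in isolation. The sketch below is not from the original post: the helper first_capture and the variable names are made up for illustration, and the third pattern is an alternative that the question does not contain. It relies on one property of the regex engine: the match that starts leftmost in the string always wins, and a non-greedy quantifier only makes the match end as early as possible from that starting point. It never moves the start to the right. So in the 3rd test the match starts at the first href, and .*? (which also matches newlines under /s) simply grows until it reaches >lol. In the 4th test the greedy .* in front first consumes the whole string and then backtracks, so the href it settles on is the last one that can still be followed by >lol.

```perl
use strict;
use warnings;

# Same five anchors as in the question.
my $content = <<'EOF';
<a href="/hoh/hoh/hoh/hoh/hoh" class="hoh">hoh</a>
<a href="/foo/foo/foo/foo/foo" class="foo">foo </a>
<a href="/bar/bar/bar/bar/bar" class="bar">bar</a>
<a href="/lol/lol/lol/lol/lol" class="lol">lol</a>
<a href="/koo/koo/koo/koo/koo" class="koo">koo</a>
EOF

# Hypothetical helper: return the first capture group, or 'no match'.
sub first_capture {
    my ($re) = @_;
    return $content =~ $re ? $1 : 'no match';
}

# 3rd test: the match starts at the FIRST href; the lazy .*? then
# expands across lines until it finds ">lol".
my $lazy = first_capture(qr~href="(.*?)".*?>lol~s);

# 4th test: the leading greedy .* backtracks from the end of the string,
# so the LAST href that can still be followed by ">lol" is captured.
my $prefixed = first_capture(qr~.*href="(.*?)".*?>lol~s);

# Alternative (an assumption, not in the question): forbid the gap from
# crossing a '>' so that only the tag directly before ">lol" can match.
my $bounded = first_capture(qr~href="([^"]*)"[^>]*>lol~);

print "lazy:     $lazy\n";      # prints /hoh/hoh/hoh/hoh/hoh
print "prefixed: $prefixed\n";  # prints /lol/lol/lol/lol/lol
print "bounded:  $bounded\n";   # prints /lol/lol/lol/lol/lol
```

The bounded pattern is probably the more robust choice for input shaped like this, because it does not depend on backtracking order: with [^>]* the text between the closing quote and >lol cannot leave the current tag, so every href other than the "lol" one fails outright.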
