使用正则表达式解析HTML:为什么不呢? [英] Using regular expressions to parse HTML: why not?

查看：93 发布时间：2020/11/24 20:46:29 regex html-parsing

本文介绍了使用正则表达式解析HTML:为什么不呢?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

在stackoverflow上，每个问询者都在使用正则表达式从HTML中获取某些信息，这似乎不可避免地会有一个答案"，其中说不使用正则表达式来解析HTML.

It seems like every question on stackoverflow where the asker is using regex to grab some information from HTML will inevitably have an "answer" that says not to use regex to parse HTML.

为什么不呢?我知道那里有没有引号的真实" HTML解析器，例如 Beautiful Soup ，而且我确定它们功能强大且有用，但是，如果您只是在做简单，快速或肮脏的事情，那么当一些正则表达式语句可以正常工作时，为什么还要烦恼使用如此复杂的事情呢?

Why not? I'm aware that there are quote-unquote "real" HTML parsers out there like Beautiful Soup, and I'm sure they're powerful and useful, but if you're just doing something simple, quick, or dirty, then why bother using something so complicated when a few regex statements will work just fine?

此外，关于正则表达式，我是否不了解某些基本知识，从而使它们成为一般解析的错误选择?

Moreover, is there just something fundamental that I don't understand about regex that makes them a bad choice for parsing in general?

使用正则表达式解析HTML:为什么不呢? [英] Using regular expressions to parse HTML: why not?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

使用正则表达式解析HTML:为什么不呢? [英] Using regular expressions to parse HTML: why not?

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭