如何使用grep，regex或perl按照模式提取字符串 [英] How to extract string following a pattern with grep, regex or perl

查看：109 发布时间：2020/5/25 18:40:31 regex perl sed html-parsing text-extraction

本文介绍了如何使用grep，regex或perl按照模式提取字符串的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个看起来像这样的文件:

I have a file that looks something like this:

    <table name="content_analyzer" primary-key="id">
      <type="global" />
    </table>
    <table name="content_analyzer2" primary-key="id">
      <type="global" />
    </table>
    <table name="content_analyzer_items" primary-key="id">
      <type="global" />
    </table>

我需要提取在name=后面的引号内的所有内容，即content_analyzer，content_analyzer2和content_analyzer_items.

I need to extract anything within the quotes that follow name=, i.e., content_analyzer, content_analyzer2 and content_analyzer_items.

我正在Linux机器上执行此操作，因此使用sed，perl，grep或bash的解决方案就可以了.

I am doing this on a Linux box, so a solution using sed, perl, grep or bash is fine.

GNU grep

如果您具有改进的grep版本(例如GNU grep)，则可能有 -P选项可用.此选项将启用类似Perl的正则表达式，允许您使用\K，这是后面的简写.它将重置匹配位置，因此它之前的所有内容都是零宽度.

GNU grep

If you have an improved version of grep, such as GNU grep, you may have the -P option available. This option will enable Perl-like regex, allowing you to use \K which is a shorthand lookbehind. It will reset the match position, so anything before it is zero-width.

grep -Po 'name="\K.*?(?=")' filename

o选项使grep仅打印匹配的文本，而不是整行.

The o option makes grep print only the matched text, instead of the whole line.

另一种方法是直接使用文本编辑器.与Vim一起，完成此操作的各种方法是删除行而不 name=，然后从结果行中提取内容:

Another way is to use a text editor directly. With Vim, one of the various ways of accomplishing this would be to delete lines without name= and then extract the content from the resulting lines:

:v/.*name="\v([^"]+).*/d|%s//\1

标准grep

如果由于某些原因您无权使用这些工具，使用标准grep可以实现类似的效果.但是，没有外观周围将需要稍后的清理:

Standard grep

If you don't have access to these tools, for some reason, something similar could be achieved with standard grep. However, without the look around it will require some cleanup later:

grep -o 'name="[^"]*"' filename

关于保存结果的说明

在以上所有命令中，结果将发送到stdout.它是重要的是要记住，您始终可以通过将其通过管道传输到通过附加文件:

A note about saving results

In all of the commands above the results will be sent to stdout. It's important to remember that you can always save them by piping it to a file by appending:

> result

到命令末尾.

这篇关于如何使用grep，regex或perl按照模式提取字符串的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

如何使用grep，regex或perl按照模式提取字符串 [英] How to extract string following a pattern with grep, regex or perl

问题描述

推荐答案

GNU grep

GNU grep

标准grep

Standard grep

关于保存结果的说明

A note about saving results

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

如何使用grep，regex或perl按照模式提取字符串 [英] How to extract string following a pattern with grep, regex or perl

问题描述

推荐答案

GNU grep

GNU grep

标准grep

Standard grep

关于保存结果的说明

A note about saving results

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭