是否可以使用 sed 可靠地转义正则表达式元字符 [英] Is it possible to escape regex metacharacters reliably with sed

查看：56 发布时间：2021/7/6 18:58:27 regex sed

本文介绍了是否可以使用 sed 可靠地转义正则表达式元字符的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我想知道是否可以编写一个 100% 可靠的 sed 命令来转义输入字符串中的任何正则表达式元字符，以便它可以在后续的 sed 命令中使用.像这样:

I'm wondering whether it is possible to write a 100% reliable sed command to escape any regex metacharacters in an input string so that it can be used in a subsequent sed command. Like this:

#!/bin/bash
# Trying to replace one regex by another in an input file with sed

search="/abc\n\t[a-z]\+\([^ ]\)\{2,3\}\3"
replace="/xyz\n\t[0-9]\+\([^ ]\)\{2,3\}\3"

# Sanitize input
search=$(sed 'script to escape' <<< "$search")
replace=$(sed 'script to escape' <<< "$replace")

# Use it in a sed command
sed "s/$search/$replace/" input

我知道有更好的工具来处理固定字符串而不是模式，例如 awk、perl 或 python.我只想证明 sed 是否可行.我想说让我们专注于基本的 POSIX 正则表达式以获得更多乐趣！:)

I know that there are better tools to work with fixed strings instead of patterns, for example awk, perl or python. I would just like to prove whether it is possible or not with sed. I would say let's concentrate on basic POSIX regexes to have even more fun! :)

我尝试了很多东西，但任何时候我都能找到破坏我尝试的输入.我认为将其抽象为 script to escape 不会将任何人引向错误的方向.

I have tried a lot of things but anytime I could find an input which broke my attempt. I thought keeping it abstract as script to escape would not lead anybody into the wrong direction.

顺便说一句，讨论出现了此处.我认为这可能是收集解决方案并可能打破和/或详细阐述它们的好地方.

Btw, the discussion came up here. I thought this could be a good place to collect solutions and probably break and/or elaborate them.

推荐答案

注意:

如果您正在寻找基于此答案中讨论的技术的预打包功能:

bash 函数，即使在多行替换中也能实现健壮的转义可以在这篇文章的底部找到(加上一个 perl 解决方案，它使用 perl 对这种转义的内置支持).
@EdMorton 的回答包含一个工具(bash 脚本) 可以稳健地执行单行替换.
- Ed 的回答现在有 改进版本的 sed 命令，如果您希望 转义字符串，则需要该版本可能与其他正则表达式处理工具一起使用的文字，例如awk和perl.简而言之:用于交叉-tool 使用，\ 必须转义为 \\ 而不是 [\]，这意味着:而不是 \\br/>sed 's/[^^]/[&]/g;下面使用的s/\^/\\^/g'命令，必须使用
  sed 's/[^^\\]/[&]/g;s/\^/\\^/g;s/\\/\\\\/g'
- bash functions that enable robust escaping even in multi-line substitutions can be found at the bottom of this post (plus a perl solution that uses perl's built-in support for such escaping).
- @EdMorton's answer contains a tool (bash script) that robustly performs single-line substitutions.
  - Ed's answer now has an improved version of the sed command used below, which is needed if you want to escape string literals for potential use with other regex-processing tools, such as awk and perl. In short: for cross-tool use, \ must be escaped as \\ rather than as [\], which means: instead of the
    sed 's/[^^]/[&]/g; s/\^/\\^/g' command used below, you must use
    sed 's/[^^\\]/[&]/g; s/\^/\\^/g; s/\\/\\\\/g'
  所有代码片段都假设 bash 作为 shell(符合 POSIX 的重构是可能的):
  
  All snippets assume bash as the shell (POSIX-compliant reformulations are possible):
  
  ^{在信用到期时给予信用:我在这个答案中找到了下面使用的正则表达式.上>}
  
  ^{To give credit where credit is due: I found the regex used below in this answer.}
  
  假设搜索字符串是一个单行字符串:
  
  Assuming that the search string is a single-line string:
```
search='abc\n\t[a-z]\+$[^ ]$\{2,3\}\3'  # sample input containing metachars.

searchEscaped=$(sed 's/[^^]/[&]/g; s/\^/\\^/g' <<<"$search") # escape it.

sed -n "s/$searchEscaped/foo/p" <<<"$search" # if ok, echoes 'foo'
```
  - 除了 ^ 之外的每个字符都放在自己的字符集 [...] 表达式中，以将其视为文字.
    - 注意 ^ 是一个字符.你不能表示为[^]，因为它在那个位置有特殊的意义(否定).

查看全文

是否可以使用 sed 可靠地转义正则表达式元字符 [英] Is it possible to escape regex metacharacters reliably with sed

问题描述

推荐答案

多线解决方案

转义多行字符串文字以用作 `sed` 中的 regex:

MULTI-line Solutions

Escaping a MULTI-LINE string literal for use as a regex in `sed`:

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

是否可以使用 sed 可靠地转义正则表达式元字符 [英] Is it possible to escape regex metacharacters reliably with sed

问题描述

推荐答案

多线解决方案

转义多行字符串文字以用作 sed 中的 regex:

MULTI-line Solutions

Escaping a MULTI-LINE string literal for use as a regex in sed:

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

转义多行字符串文字以用作 `sed` 中的 regex:

Escaping a MULTI-LINE string literal for use as a regex in `sed`:

登录关闭