无法将正则表达式与 sed 匹配 [英] Can't match regex with sed
问题描述
我正在尝试匹配一个模式 (^|~?)(d|x|*)+.(d|x|*)+.(d|x|*)+
与 sed
没有运气.我正在运行的文件是这样的:
I'm trying to match a pattern (^|~?)(d|x|*)+.(d|x|*)+.(d|x|*)+
with sed
without luck.
The file I'm running through is this:
{
"name": "something",
"version": "0.0.1",
"description": "some desc",
"main": "gulpfile.js",
"directories": {
"test": "tests"
},
"dependencies": {
"babel-polyfill": "^6.7.4",
"babel-preset-es2015": "^6.6.0",
"babel-preset-react": "^6.5.0",
"gulp-clean": "^0.3.2",
"jquery": "^2.1.4",
"lodash": "^4.0.0",
"moment": "^2.13.0",
"moment-timezone": "^0.5.0",
"radium": "^0.16.2",
"react": "^15.1.0",
"react-bootstrap-sweetalert": "^1.1.10",
"react-dom": "^15.1.0",
"react-timeago": "^2.2.1",
"sprintf": "^0.1.5",
"smoothscroll": "~0.2.2"
},
"devDependencies": {
"babel": "^6.3.26",
"babelify": "^7.2.0",
"browserify": "~12.0.1",
"console-stamp": "^0.2.0",
"estraverse-fb": "^1.3.1",
"gulp": "^3.9.0",
"gulp-concat": "^2.6.0",
"gulp-sass": "^2.1.1",
"gulp-sourcemaps": "^1.6.0",
"gulp-util": "^3.0.7",
"lodash": "4.5.1",
"lodash.assign": "^3.2.0",
"lodash.isfunction": "^3.0.8",
"lodash.reduce": "^4.3.0",
"node-sass": "3.4.2",
"react-bootstrap": "^0.29.4",
"react-intl": "2.1.0",
"reactify": "1.1.1",
"sweetalert": "^1.1.3",
"vinyl": "^1.1.0",
"vinyl-buffer": "^1.0.0",
"vinyl-source-stream": "^1.1.0",
"watchify": "^3.4.0",
"jsx-to-string": "~0.2.11"
},
"optionalDependencies": {
"pkg-save": "~1.0.2"
},
"scripts": {
"test": "echo "Error: no test specified" && exit 1"
},
"repository": {
"type": "git",
"url": "someurl"
},
"author": "authorname",
"license": "MIT"
}
正如您在 regexr 中看到的,它匹配所需的模式(也匹配版本",但这是我稍后将解决的另一个问题):http://regexr.com/3e324
As you can see in regexr it matches the desired pattern (also matching "version" but that's another issue I'll tackle later): http://regexr.com/3e324
我正在使用以下命令调用 sed:cat package.json |sed 's/(^|~?)(d|x|*)+.(d|x|*)+.(d|x|*)+/Hello/g' -r
I'm invoking invoking sed with the following command:
cat package.json | sed 's/(^|~?)(d|x|*)+.(d|x|*)+.(d|x|*)+/Hello/g' -r
为简洁起见,它输出类似(即未过滤的输入):
For the sake of brevity, it outputs something like (ie. unfiltered input):
...
"dependencies": {
"babel-polyfill": "^6.7.4",
"babel-preset-es2015": "^6.6.0",
"babel-preset-react": "^6.5.0",
"gulp-clean": "^0.3.2",
...
它应该用你好"替换所有数字.
我做错了什么?
与错误标志有关(我试过 /gm
)
或者没有使用正确的正则表达式引擎(我传递了 -r
选项来使用扩展的正则表达式)?
It should replace all digits with "Hello".
What am I doing wrong?
Something to do with bad flags (I've tried /gm
)
or not using the correct regex engine (I'm passing the -r
option to utilize extended regex)?
推荐答案
虽然 POSIX 正则表达式支持一些命名字符类,例如 [[:digit:]]
和 [[:alnum:]]
,它们不支持 d
和 w
等速记类.
While POSIX regular expression support some named character classes, like [[:digit:]]
and [[:alnum:]]
, they do not support shorthand classes such as d
and w
.
一些 GNU 扩展带来了速记类支持,但它们仅限于其中的几个,w
、W
、s
和 S
根据 regular-expressions.info.
Some GNU extensions bring shorthand classes support, but they are restricted to a few of them, w
, W
, s
and S
according to regular-expressions.info.
通过将正则表达式中的 d
替换为 [0-9]
我能够转换您的文档.正则表达式变成(^|~?)([0-9]|x|*)+.([0-9]|x|*)+.([0-9]]|x|*)+
,或者更好 [~^]([0-9x*]+.){2}[0-9x*]
(感谢 Ed Morton!).
By replacing the d
in your regular expression to [0-9]
I was able to transform your document. The regular expression becomes (^|~?)([0-9]|x|*)+.([0-9]|x|*)+.([0-9]|x|*)+
, or better [~^]([0-9x*]+.){2}[0-9x*]
(thanks Ed Morton !).
作为旁注,您的命令可以重写为以下内容,不使用 cat
:
As a side note, your command could be rewritten to the following, which does not use cat
:
sed -E 's/[~^]([0-9x*]+.){2}[0-9x*]/Hello/' package.json
正如 Matt 所指出的,您最好使用 JSON 解析器,例如 jq
.
And as noted by Matt, you'd be better off using a JSON parser such as jq
.
这篇关于无法将正则表达式与 sed 匹配的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!