与 RTL 语言一起使用时的字符串替换函数调用顺序 [英] Order of string replacement function invocations when used with RTL languages

查看：47 发布时间：2021/7/10 18:44:56 javascript regex right-to-left

本文介绍了与 RTL 语言一起使用时的字符串替换函数调用顺序的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

当调用 String.replace 使用替换函数，我们可以检索匹配子字符串的偏移量.

When calling String.replace with a replacement function we're able to retrieve offsets of the matched substrings.

var a = [];
"hello world".replace(/l/g, function (m, i) { a.push(i); });
// a = [2, 3, 9]

在上面的示例中，我们获得了匹配 l 字符的偏移量列表.

In the example above, we're getting a list of offsets for the matching l characters.

我能否指望实现总是按出现的升序调用匹配函数，即使在使用从右到左编写的语言时也是如此?

Can I count on implementations to always invoke the match function in ascending order of occurrence, even when used with languages that are written from right to left?

也就是说:我能确定上面的结果总是 [2,3,9] 而不是 [3,9,2] 或任何其他这些偏移量的排列?

That is: Can I be sure that the result above will always be [2,3,9] and not [3,9,2] or any other permutation of those offsets?

这是对这个问题的跟进，Tomalak 回答:

This is a follow-up on this question that Tomalak answered with:

当然，是的.匹配在源字符串中从左到右处理，因为从左到右是正则表达式引擎处理字符串的方式.

Absolutely, yes. Matches are handled from left to right in the source string because left-to-right is how regular expression engines work their way to a string.

然而，关于 RTL 语言的情况，他也说:

However, regarding the case with RTL languages he also said:

这是个好问题[...] RTL 文本肯定会改变 JavaScript 正则表达式的行为方式.

That's a good question [...] RTL text definitely changes how JavaScript regular expressions behave.

我已经在 Chrome 中使用以下 RTL 代码段进行了测试:

I've tested with the following RTL snippet in Chrome:

var a = [];
"بلوچی مکرانی".replace(/ی/g, function (m, i) { a.push(i); });
// a = [4, 11]

我不会说那种语言，但在查看字符串时，我看到 ی 字符是字符串的第一个字符，也是空格后的第一个字符.但是，由于文本是从右到左书写的，这些位置实际上是 最后一个字符 之前的空白和 字符串中的最后一个字符 - 转换为 [4,11]

I don't speak that language but looking at the string I see the ی character as the first character of the string and as the first character after the white space. However, since the text is written right-to-left those positions are actually the last character before the white space and the last character in the string - which translates into [4,11]

因此，这似乎在 Chrome 中按预期工作.问题是:我可以相信结果在所有兼容的 javascript 实现上都是一样的吗?

So, this seems to work just as expected in Chrome. The question is: Can I trust that the result will be the same on all compliant javascript implementations?

与 RTL 语言一起使用时的字符串替换函数调用顺序 [英] Order of string replacement function invocations when used with RTL languages

问题描述

推荐答案

相关文章

前端开发最新文章

热门教程

热门工具

登录关闭

与 RTL 语言一起使用时的字符串替换函数调用顺序 [英] Order of string replacement function invocations when used with RTL languages

问题描述

推荐答案

相关文章

前端开发最新文章

热门教程

热门工具

登录 关闭

登录关闭