正则表达式无法正确使用土耳其语字符 [英] Regular Expression Doesn't Work Properly With Turkish Characters

查看：259 发布时间：2016/11/18 16:32:01 php regex nlp character turkish

本文介绍了正则表达式无法正确使用土耳其语字符的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

我写一个应该提取以下模式的正则表达式：

I write a regex that should extracts following patterns;

这里是正在尝试的正则表达式;

here is the regular expressions i'm trying;

\ b（c + o + k +）|（ç+ o + k +）\s（g + u + z + e + l）|（g +ü+ z + e + 1 +）\b：无法正常工作

"\b[çc]+o+k+\sg+[üu]+z+e+l+\b" : this works in english but not in turkish characters
"çok": finds "çok" but when i try "ç+o+k+" doesn't work for "çççoookkk", it finds "çoookkk"
"güzel": finds "güzel" but when i try "g+ü+z+e+l+" doesn't work for "gggüüüzzzeeelll"
"\b(c+o+k+)|(ç+o+k+)\s(g+u+z+e+l)|(g+ü+z+e+l+)\b": doesn't work properly
"[çc]ok\sg[uü]zel": I also tried this to get "çok güzel" pattern but doesn't work neither.

我的问题可能是使用土耳其字符的正则表达式运算符。我不知道我该如何解决这个问题。

I thing the problem might be using regex operators with turkish characters. I don't know how can i solve this.

I am using http://www.myregextester.com to check if my regular expressions are correct.

我使用Php编程语言通过Twitter Rest Api从搜索的tweet中获取特定模式。

I am using Php programming language to get a specific pattern from searched tweets via Twitter Rest Api.

感谢，

正则表达式无法正确使用土耳其语字符 [英] Regular Expression Doesn&#39;t Work Properly With Turkish Characters