删除 R 中除撇号以外的所有标点符号 [英] Remove all punctuation except apostrophes in R

查看:101
本文介绍了删除 R 中除撇号以外的所有标点符号的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我想使用 R 的 gsub 从文本中删除除撇号之外的所有标点符号.我对正则表达式相当陌生,但正在学习.

I'd like to use R's gsub to remove all punctuation from a text except for apostrophes. I'm fairly new to regex but am learning.

示例:

x <- "I like %$@to*&, chew;: gum, but don't like|}{[] bubble@#^)( gum!?"
gsub("[[:punct:]]", "", as.character(x))

当前输出(不要中没有撇号)

[1] "I like to chew gum but dont like bubble gum"

期望的输出(我希望不要留下撇号)

[1] "I like to chew gum but don't like bubble gum"

推荐答案

x <- "I like %$@to*&, chew;: gum, but don't like|}{[] bubble@#^)( gum!?"
gsub("[^[:alnum:][:space:]']", "", x)

[1] "I like to chew gum but don't like bubble gum"

上面的正则表达式更加直接.它将所有不是字母数字符号、空格或撇号(插入符号!)的内容替换为空字符串.

The above regex is much more straight forward. It replaces everything that's not alphanumeric signs, space or apostrophe (caret symbol!) with an empty string.

这篇关于删除 R 中除撇号以外的所有标点符号的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆