从UTF-8字符串中删除控制字符 [英] Removing control characters from a UTF-8 string

查看:444
本文介绍了从UTF-8字符串中删除控制字符的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我发现的这个问题但它会删除所有有效的 UTF-8 字符也(我返回一个空字符串,同时有有效的 UTF-8 字符加控制字符)。当我读 UTF-8 ,有没有一个具体的范围控制字符,每个字符集都有自己的控制字符

I found this question but it removes all valid utf-8 characters also (returns me a blank string, while there are valid utf-8 characters plus control characters). As I read about utf-8, there's not a specific range for control characters and each character set has its own control characters.

如何修改的的解决方案上面的>仅删除控制字符

推荐答案

我觉得下面的代码会为你工作:

I think the following code will work for you:

public static string RemoveControlCharacters(string inString)
{
    if (inString == null) return null;
    StringBuilder newString = new StringBuilder();
    char ch;
    for (int i = 0; i < inString.Length; i++)
    {
        ch = inString[i];
        if (!char.IsControl(ch))
        {
            newString.Append(ch);
        }
    }
    return newString.ToString();
}

这篇关于从UTF-8字符串中删除控制字符的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆