获得一个UTF-8 EN codeD的std ::字符串的实际长度？ [英] Getting the actual length of a UTF-8 encoded std::string?

查看：135 发布时间：2015/11/30 13:58:34 c++ algorithm

本文介绍了获得一个UTF-8 EN codeD的std ::字符串的实际长度？的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我的std :: string是UTF-8 EN codeD如此明显，str.length（）返回错误的结果。

我发现这个信息，但我不知道我怎么可以用它来做到这一点：

下面的字节序列用于重新present一个字符。该序列是使用取决于字符的UCS code号：
  00000000  -  0x0000007F：
       0xxxxxxx

   0x00000080  -  0x000007FF：
       110xxxxx 10xxxxxx

   0x00000800  -  0x0000FFFF：
       1110xxxx 10xxxxxx 10xxxxxx

   0x00010000在 -  0x001FFFFF：
       11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
 

我如何能找到一个UTF-8 EN codeD的std ::字符串的实际长度？谢谢

解决方案

其中一个项目我贡献了一个小功能，做的是：

<一个href="http://openlierox.git.sourceforge.net/git/gitweb.cgi?p=openlierox/openlierox;a=blob;f=include/Uni$c$c.h;h=a523b464fc65a7ad875e683cd830b41c9a01934a;hb=HEAD">http://openlierox.git.sourceforge.net/git/gitweb.cgi?p=openlierox/openlierox;a=blob;f=include/Uni$c$c.h;h=a523b464fc65a7ad875e683cd830b41c9a01934a;hb=HEAD

查找 Utf8StringSize 。它依赖于相同的头文件中的另一个微小的作用。

my std::string is utf-8 encoded so obviously, str.length() returns the wrong result.

I found this information but I'm not sure how I can use it to do this:

The following byte sequences are used to represent a character. The sequence to be used depends on the UCS code number of the character:
   0x00000000 - 0x0000007F:
       0xxxxxxx

   0x00000080 - 0x000007FF:
       110xxxxx 10xxxxxx

   0x00000800 - 0x0000FFFF:
       1110xxxx 10xxxxxx 10xxxxxx

   0x00010000 - 0x001FFFFF:
       11110xxx 10xxxxxx 10xxxxxx 10xxxxxx

How can I find the actual length of a UTF-8 encoded std::string? Thanks

解决方案

One of the projects I contribute to has a small function that does that:

http://openlierox.git.sourceforge.net/git/gitweb.cgi?p=openlierox/openlierox;a=blob;f=include/Unicode.h;h=a523b464fc65a7ad875e683cd830b41c9a01934a;hb=HEAD

Look for Utf8StringSize. It depends on another tiny function in the same header file.

这篇关于获得一个UTF-8 EN codeD的std ::字符串的实际长度？的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

获得一个UTF-8 EN codeD的std ::字符串的实际长度？ [英] Getting the actual length of a UTF-8 encoded std::string?

问题描述

相关文章

C/C++开发最新文章

热门教程

热门工具

登录关闭

获得一个UTF-8 EN codeD的std ::字符串的实际长度？ [英] Getting the actual length of a UTF-8 encoded std::string?

问题描述

相关文章

C/C++开发最新文章

热门教程

热门工具

登录 关闭

登录关闭