在字符串中定义4字节的UTF-16字符 [英] Defining 4-byte UTF-16 character in a string

查看：732 发布时间：2016/11/19 15:59:04 c# unicode encoding character-encoding utf-16

本文介绍了在字符串中定义4字节的UTF-16字符的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我阅读了有关UTF的问题-8，UTF-16和UCS-2 ，几乎所有答案都给出了UCS-2已过时和C＃使用UTF-16的说法。

，所有我在C＃中创建4字节字符U + 1D11E的尝试失败了，所以我实际上认为C＃只使用UTF-16的UCS-2子集。

有我的尝试：

  string s =\\\ᴑE; //给出2个字符串ᴑE，因为\\\ᴑ是ᴑ
 string s =（char）0x1D11E; //因为溢出而无法编译
 string s = Encoding.Unicode.GetString（new byte [] {0xD8，0x34，0xDD，0x1E}）; C＃字符串真的是UTF-16还是它们实际上是UCS-2？ 
解决方案
使用大写U代替：
 / p> 
 
 
  string s =\U0001D11E; 
  
你忽略了大多数机器都是小端序：
  string t = Encoding.Unicode.GetString（new byte [] {0x34，0xD8，0x1E，0xDD}）; 
  
 
I have read a question about UTF-8, UTF-16 and UCS-2 and almost all answers give the statement that UCS-2 is obsolete and C# uses UTF-16.

However, all my attempts to create the 4-byte character U+1D11E in C# failed, so I actually think C# uses the UCS-2 subset of UTF-16 only.

There are my tries:
string s = "\u1D11E"; // gives the 2 character string "ᴑE", because \u1D11 is ᴑ
string s = (char) 0x1D11E; // won't compile because of an overflow
string s = Encoding.Unicode.GetString(new byte[] {0xD8, 0x34, 0xDD, 0x1E}); // gives 㓘ờ
Are C# strings really UTF-16 or are they actually UCS-2? If they are UTF-16, how would I get the violin clef into my C# string?
 解决方案 
Use capital U instead:
  string s = "\U0001D11E";
And you overlooked that most machines are little-endian:
  string t = Encoding.Unicode.GetString(new byte[] { 0x34, 0xD8, 0x1E, 0xDD });


                        
这篇关于在字符串中定义4字节的UTF-16字符的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！


                    
                        查看全文

在字符串中定义4字节的UTF-16字符 [英] Defining 4-byte UTF-16 character in a string

问题描述

相关文章

C#/.NET最新文章

热门教程

热门工具

登录关闭

在字符串中定义4字节的UTF-16字符 [英] Defining 4-byte UTF-16 character in a string

问题描述

相关文章

C#/.NET最新文章

热门教程

热门工具

登录 关闭

登录关闭