Java UTF-8奇怪的行为 [英] Java UTF-8 strange behaviour

查看：68 发布时间：2020/7/13 5:17:01 java utf-8

本文介绍了Java UTF-8奇怪的行为的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我正在尝试用Java解码一些UTF-8字符串. 这些字符串包含一些组合的unicode字符，例如CC 88(组合diaresis). 根据 http://www.fileformat，该字符序列似乎还可以. info/info/unicode/char/0308/index.htm

I am trying to decode some UTF-8 strings in Java. These strings contain some combining unicode characters, such as CC 88 (combining diaresis). The character sequence seems ok, according to http://www.fileformat.info/info/unicode/char/0308/index.htm

但是转换为String后的输出无效. 有什么主意吗?

But the output after conversion to String is invalid. Any idea ?

byte[] utf8 = { 105, -52, -120 };
System.out.print("{{");
for(int i = 0; i < utf8.length; ++i)
{
    int value = utf8[i] & 0xFF;
    System.out.print(Integer.toHexString(value));
}
System.out.println("}}");
System.out.println(">" + new String(utf8, "UTF-8"));

输出:


    {{69cc88}}
    >i?

Java UTF-8奇怪的行为 [英] Java UTF-8 strange behaviour

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录关闭

Java UTF-8奇怪的行为 [英] Java UTF-8 strange behaviour

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录 关闭

登录关闭