如何检查Java中字节数组是否包含Unicode字符串? [英] How can I check whether a byte array contains a Unicode string in Java?

查看：175 发布时间：2020/7/13 4:00:11 java regex unicode utf-8

本文介绍了如何检查Java中字节数组是否包含Unicode字符串?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

给出一个字节数组，该数组可以是UTF-8编码的字符串，也可以是任意二进制数据，那么在Java中可以使用哪种方法来确定它是哪个?

Given a byte array that is either a UTF-8 encoded string or arbitrary binary data, what approaches can be used in Java to determine which it is?

该数组可以由类似于以下代码的代码生成:

The array may be generated by code similar to:

byte[] utf8 = "Hello World".getBytes("UTF-8");

或者，它可能是由类似于以下代码的代码生成的:

Alternatively it may have been generated by code similar to:

byte[] messageContent = new byte[256];
for (int i = 0; i < messageContent.length; i++) {
    messageContent[i] = (byte) i;
}

关键点是我们不知道数组包含什么，但是需要找出以便填写以下函数:

The key point is that we don't know what the array contains but need to find out in order to fill in the following function:

public final String getString(final byte[] dataToProcess) {
    // Determine whether dataToProcess contains arbitrary data or a UTF-8 encoded string
    // If dataToProcess contains arbitrary data then we will BASE64 encode it and return.
    // If dataToProcess contains an encoded string then we will decode it and return.
}

如何将其扩展到涵盖UTF-16或其他编码机制?

How would this be extended to also cover UTF-16 or other encoding mechanisms?

如何检查Java中字节数组是否包含Unicode字符串? [英] How can I check whether a byte array contains a Unicode string in Java?

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录关闭

如何检查Java中字节数组是否包含Unicode字符串? [英] How can I check whether a byte array contains a Unicode string in Java?

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录 关闭

登录关闭