为什么 Java 没有真正的多维数组? [英] Why doesn't Java have true multidimensional arrays?

查看：29 发布时间：2021/11/18 3:24:02 java arrays performance multidimensional-array

本文介绍了为什么 Java 没有真正的多维数组?的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

TL;DR 版本，对于那些不想要背景的人，是以下具体问题:

The TL;DR version, for those who don't want the background, is the following specific question:

为什么 Java 没有真正的多维数组的实现?有可靠的技术原因吗?我在这里错过了什么?

Why doesn't Java have an implementation of true multidimensional arrays? Is there a solid technical reason? What am I missing here?

背景

Java 在语法层面有多维数组，可以声明

Background

Java has multidimensional arrays at the syntax level, in that one can declare

int[][] arr = new int[10][10];

但这似乎真的不是人们所期望的.不是让 JVM 分配一个足够大的连续 RAM 块来存储 100 个 int ，而是以 int 的数组的形式出现:所以每一层都是一个连续的 RAM 块，但作为一个整体不是.访问 arr[i][j] 因此相当慢:JVM 必须

but it seems that this is really not what one might have expected. Rather than having the JVM allocate a contiguous block of RAM big enough to store 100 ints, it comes out as an array of arrays of ints: so each layer is a contiguous block of RAM, but the thing as a whole is not. Accessing arr[i][j] is thus rather slow: the JVM has to

找到存储在arr[i]的int[];
索引它以查找存储在 arr[i][j] 中的 int.

find the int[] stored at arr[i];
index this to find the int stored at arr[i][j].

这涉及到查询一个对象从一层到下一层，这是相当昂贵的.

This involves querying an object to go from one layer to the next, which is rather expensive.

在一个层面上，不难看出为什么这不能优化为简单的缩放和添加查找，即使它全部分配在一个固定块中.问题在于 arr[3] 本身就是一个引用，并且可以更改.所以虽然数组的大小是固定的，但我们可以很容易地写

At one level, it's not hard to see why this can't be optimised to a simple scale-and-add lookup even if it were all allocated in one fixed block. The problem is that arr[3] is a reference all of its own, and it can be changed. So although arrays are of fixed size, we could easily write

arr[3] = new int[11];

现在缩放和添加被搞砸了，因为这一层已经增长.您需要在运行时知道所有内容是否仍与以前相同.此外，当然，这将被分配到 RAM 中的其他地方(它必须是，因为它比它要替换的要大)，所以它甚至不在正确的位置进行缩放和添加.

and now the scale-and-add is screwed because this layer has grown. You'd need to know at runtime whether everything is still the same size as it used to be. In addition, of course, this will then get allocated somewhere else in RAM (it'll have to be, since it's bigger than what it's replacing), so it's not even in the right place for scale-and-add.

在我看来这并不理想，原因有二.

It seems to me that this is not ideal, and that for two reasons.

一方面，它慢.我使用这些方法对一维或多维数组的内容进行求和的测试花费了几乎两倍的时间(714 秒对 371 秒)对于多维情况(int[1000000] 和一个 int[100][100][100] 分别填充随机的 int 值，使用热缓存运行 1000000 次).

For one, it's slow. A test I ran with these methods for summing the contents of a single dimensional or multidimensional array took nearly twice as long (714 seconds vs 371 seconds) for the multidimensional case (an int[1000000] and an int[100][100][100] respectively, filled with random int values, run 1000000 times with warm cache).

public static long sumSingle(int[] arr) {
    long total = 0;
    for (int i=0; i<arr.length; i++)
        total+=arr[i];
    return total;
}

public static long sumMulti(int[][][] arr) {
    long total = 0;
    for (int i=0; i<arr.length; i++)
        for (int j=0; j<arr[0].length; j++)
            for (int k=0; k<arr[0][0].length; k++)
                total+=arr[i][j][k];
    return total;
}

其次，因为它很慢，因此鼓励晦涩的编码.如果您遇到一些性能关键的事情，而这些事情可以用多维数组自然完成，您就有动力将其编写为平面数组，即使这会使它变得不自然且难以阅读.您面临着一个令人不快的选择:晦涩的代码或缓慢的代码.

Secondly, because it's slow, it thereby encourages obscure coding. If you encounter something performance-critical that would be naturally done with a multidimensional array, you have an incentive to write it as a flat array, even if that makes the unnatural and hard to read. You're left with an unpalatable choice: obscure code or slow code.

在我看来，基本问题很容易解决.正如我们之前看到的，无法优化的唯一原因是结构可能会发生变化.但是 Java 已经有一种使引用不可更改的机制:将它们声明为 final.

It seems to me that the basic problem could easily enough be fixed. The only reason, as we saw earlier, that it can't be optimised is that the structure might change. But Java already has a mechanism for making references unchangeable: declare them as final.

现在，只需声明它

final int[][] arr = new int[10][10];

还不够好，因为这里只有 arr 是 final:arr[3] 仍然不是，并且可能是改变了，所以结构可能仍然会改变.但是，如果我们有一种声明方式使得它始终是 final，除了在存储 int 值的底层，那么我们将拥有一个完整的不可变的结构，并且可以全部分配为一个块，并通过缩放和添加进行索引.

isn't good enough because it's only arr that is final here: arr[3] still isn't, and could be changed, so the structure might still change. But if we had a way of declaring things so that it was final throughout, except at the bottom layer where the int values are stored, then we'd have an entire immutable structure, and it could all be allocated as one block, and indexed with scale-and-add.

它在语法上看起来如何，我不确定(我不是语言设计师).也许

How it would look syntactically, I'm not sure (I'm not a language designer). Maybe

final int[final][] arr = new int[10][10];

尽管不可否认，这看起来有点奇怪.这意味着: final 在顶层；final 在下一层；不是 final 在底层(否则 int 值本身将是不可变的).

although admittedly that looks a bit weird. This would mean: final at the top layer; final at the next layer; not final at the bottom layer (else the int values themselves would be immutable).

最终性将使 JIT 编译器能够优化这一点，从而将性能提供给一维数组的性能，然后消除以这种方式进行编码的诱惑，只是为了解决多维数组的缓慢问题.

Finality throughout would enable the JIT compiler to optimise this to give performance to that of a single dimensional array, which would then take away the temptation to code that way just to get round the slowness of multidimensional arrays.

(我听到谣言说 C# 做了这样的事情，虽然我也听到另一个谣言说 CLR 实现太糟糕了，不值得拥有......也许他们只是谣言......)

(I hear a rumour that C# does something like this, although I also hear another rumour that the CLR implementation is so bad that it's not worth having... perhaps they're just rumours...)

那么为什么 Java 没有真正的多维数组的实现呢?有可靠的技术原因吗?我在这里错过了什么?

So why doesn't Java have an implementation of true multidimensional arrays? Is there a solid technical reason? What am I missing here?

更新

一个奇怪的旁注:如果您使用 int 而不是 long 作为运行总数，时间上的差异会下降到只有几个百分点.为什么int会有这么小的区别，而long会有这么大的区别?

Update

A bizarre side note: the difference in timings drops away to only a few percent if you use an int for the running total rather than a long. Why would there be such a small difference with an int, and such a big difference with a long?

我用于基准测试的代码，以防有人想尝试重现这些结果:

Code I used for benchmarking, in case anyone wants to try to reproduce these results:

public class Multidimensional {

    public static long sumSingle(final int[] arr) {
        long total = 0;
        for (int i=0; i<arr.length; i++)
            total+=arr[i];
        return total;
    }

    public static long sumMulti(final int[][][] arr) {
        long total = 0;
        for (int i=0; i<arr.length; i++)
            for (int j=0; j<arr[0].length; j++)
                for (int k=0; k<arr[0][0].length; k++)
                    total+=arr[i][j][k];
        return total;
    }   

    public static void main(String[] args) {
        final int iterations = 1000000;

        Random r = new Random();
        int[] arr = new int[1000000];
        for (int i=0; i<arr.length; i++)
            arr[i]=r.nextInt();
        long total = 0;
        System.out.println(sumSingle(arr));
        long time = System.nanoTime();
        for (int i=0; i<iterations; i++)
            total = sumSingle(arr);
        time = System.nanoTime()-time;
        System.out.printf("Took %d ms for single dimension\n", time/1000000, total);

        int[][][] arrMulti = new int[100][100][100];
        for (int i=0; i<arrMulti.length; i++)
            for (int j=0; j<arrMulti[i].length; j++)
                for (int k=0; k<arrMulti[i][j].length; k++)
                    arrMulti[i][j][k]=r.nextInt();
        System.out.println(sumMulti(arrMulti));
        time = System.nanoTime();
        for (int i=0; i<iterations; i++)
            total = sumMulti(arrMulti);
        time = System.nanoTime()-time;
        System.out.printf("Took %d ms for multi dimension\n", time/1000000, total);
    }

}

为什么 Java 没有真正的多维数组? [英] Why doesn't Java have true multidimensional arrays?

问题描述

背景

Background

更新

Update

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录关闭

为什么 Java 没有真正的多维数组? [英] Why doesn&#39;t Java have true multidimensional arrays?

问题描述

背景

Background

更新

Update

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录 关闭

为什么 Java 没有真正的多维数组? [英] Why doesn't Java have true multidimensional arrays?

登录关闭