Finding the k largest elements of a very large file (when k is also very large)

Views: 121 Posted: 2015/11/30 16:20:56 Tags: algorithm large-files


Problem description

Let's assume we have a very large file containing billions of integers, and we want to find the k largest of these values.

The tricky part is that k itself is also very large, which means we cannot keep k elements in memory (for example, the file has 100 billion elements and we want to find the 10 billion largest).

How can we do this in O(n)?

My thought:

We start reading the file and compare each element against a second file that keeps the k largest elements seen so far (sorted in increasing order). If the element just read is larger than the first line of the second file, we delete that first line and insert the new element into the second file. The time complexity would be O(N log k) if we have random access to that file; otherwise it would be O(Nk).
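To make the comparison logic concrete, here is a minimal in-memory sketch of the same idea, where a min-heap stands in for the sorted second file. It assumes k values fit in memory, which is exactly what does not hold in my case, so it only illustrates where the O(N log k) bound comes from. The function name `k_largest_with_heap` is made up for this example.

```python
import heapq
from typing import Iterable, List


def k_largest_with_heap(values: Iterable[int], k: int) -> List[int]:
    """Return the k largest values in increasing order.

    Illustration only: the heap plays the role of the "second file",
    and it is assumed (unlike in the question) that k items fit in memory.
    """
    if k <= 0:
        return []
    heap: List[int] = []  # min-heap; heap[0] is the smallest value kept so far
    for v in values:
        if len(heap) < k:
            heapq.heappush(heap, v)      # still filling up: O(log k)
        elif v > heap[0]:
            heapq.heapreplace(heap, v)   # drop the smallest, insert v: O(log k)
    return sorted(heap)


if __name__ == "__main__":
    print(k_largest_with_heap([5, 1, 9, 3, 7, 9, 2], 3))
```

Each of the N elements costs at most one O(log k) heap operation, which gives O(N log k) overall.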

Any ideas for doing this in O(n)? I guess that with an external-memory version of the selection algorithm (the one based on quicksort's partitioning step), we could do this in O(n), but I couldn't find one anywhere.
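For what it's worth, here is a rough sketch of what such an external selection might look like. It is an assumption-laden illustration, not an established implementation: the file is assumed to hold one decimal integer per line, the pivot is picked at random with an extra streaming pass, and the names `external_k_largest` and `_random_line` are hypothetical. Only a few variables are held in memory; the partitions live in temporary files.

```python
import os
import random
import tempfile


def _random_line(path):
    """Pick one value uniformly at random in a single pass (reservoir sampling)."""
    chosen = None
    with open(path) as f:
        for i, line in enumerate(f, start=1):
            if random.randrange(i) == 0:
                chosen = int(line)
    return chosen


def external_k_largest(src_path, k, out_path, workdir=None):
    """Write the k largest integers of src_path to out_path, in no particular order.

    Assumed layout: one decimal integer per line, no blank lines.
    """
    workdir = workdir or tempfile.mkdtemp()
    current, owned = src_path, False  # never delete the caller's input file
    with open(out_path, "w") as out:
        while k > 0:
            pivot = _random_line(current)
            if pivot is None:  # nothing left: fewer than k values existed
                break
            hi_fd, hi_path = tempfile.mkstemp(dir=workdir)
            lo_fd, lo_path = tempfile.mkstemp(dir=workdir)
            n_hi = n_eq = 0
            # Streaming partition around the pivot, as in quickselect.
            with open(current) as f, \
                    os.fdopen(hi_fd, "w") as hi, os.fdopen(lo_fd, "w") as lo:
                for line in f:
                    v = int(line)
                    if v > pivot:
                        hi.write(f"{v}\n")
                        n_hi += 1
                    elif v < pivot:
                        lo.write(f"{v}\n")
                    else:
                        n_eq += 1
            if owned:
                os.remove(current)
            if n_hi >= k:
                # The whole answer lies in the "greater" partition.
                os.remove(lo_path)
                current, owned = hi_path, True
            else:
                # Everything greater than the pivot belongs to the answer.
                with open(hi_path) as hi:
                    for line in hi:
                        out.write(line)
                os.remove(hi_path)
                k -= n_hi
                take = min(k, n_eq)  # then as many pivot copies as still needed
                out.write(f"{pivot}\n" * take)
                k -= take
                current, owned = lo_path, True
    if owned:
        os.remove(current)
```

As with in-memory quickselect, a random pivot should give an expected linear number of element reads and writes, since each round continues with only one side of the partition. A run of bad pivots can still degrade this to quadratic, so the O(n) here is an expected bound, not a guaranteed one.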
