R could not allocate memory on ff procedure. How come?

Question

I'm working on a 64-bit Windows Server 2008 machine with an Intel Xeon processor and 24 GB of RAM. I'm having trouble trying to read a particular TSV (tab-delimited) file of 11 GB (>24 million rows, 20 columns). My usual companion, read.table, has failed me. I'm currently trying the package ff, through this procedure:

> df <- read.delim.ffdf(file       = "data.tsv",
+                       header     = TRUE,
+                       VERBOSE    = TRUE,
+                       first.rows = 1e3,
+                       next.rows  = 1e6,
+                       na.strings = c("", NA),
+                       colClasses = c("NUMERO_PROCESSO" = "factor"))

Which works fine for about 6 million records, but then I get an error, as you can see:

read.table.ffdf 1..1000 (1000) csv-read=0.14sec ffdf-write=0.2sec
read.table.ffdf 1001..1001000 (1000000) csv-read=240.92sec ffdf-write=67.32sec
read.table.ffdf 1001001..2001000 (1000000) csv-read=179.15sec ffdf-write=94.13sec
read.table.ffdf 2001001..3001000 (1000000) csv-read=792.36sec ffdf-write=68.89sec
read.table.ffdf 3001001..4001000 (1000000) csv-read=192.57sec ffdf-write=83.26sec
read.table.ffdf 4001001..5001000 (1000000) csv-read=187.23sec ffdf-write=78.45sec
read.table.ffdf 5001001..6001000 (1000000) csv-read=193.91sec ffdf-write=94.01sec
read.table.ffdf 6001001..
Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings,  : 
  could not allocate memory (2048 Mb) in C function 'R_AllocStringBuffer'

If I'm not mistaken, R is complaining of lack of memory to read the data, but wasn't the read...ffdf procedure supposed to circumvent heavy memory usage when reading data? What could I be doing wrong here?

Answer

(I realize this is an old question, but I had the same problem and spent two days looking for the solution. This seems as good a place as any to document what I eventually figured out for posterity.)

The problem isn't that you are running out of available memory. The problem is that you've hit the memory limit for a single string. From help('Memory-limits'):

There are also limits on individual objects. The storage space cannot exceed the address limit, and if you try to exceed that limit, the error message begins "cannot allocate vector of length". The number of bytes in a character string is limited to 2^31 - 1 ~ 2*10^9, which is also the limit on each dimension of an array.

In my case (and, it appears, in yours as well), I didn't bother to set the quote character, since I was dealing with tab-separated data and assumed it didn't matter. However, somewhere in the middle of the data set there was a string with an unmatched quote, and read.table happily ran right past the end of the line and on to the next, and the next, and the next... until it hit the limit on the size of a single string and blew up.
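If you want to confirm that this is what happened, one low-memory way is to scan the file for lines containing an odd number of double-quote characters. This is a sketch of my own, not part of the original answer; the function name and chunk size are invented:

```r
# Sketch: return the line numbers whose double-quote count is odd
# (a likely cause of runaway field parsing). Reads the file in
# chunks so an 11 GB file never has to fit in memory at once.
find_odd_quotes <- function(path, chunk = 1e6L) {
  con <- file(path, open = "r")
  on.exit(close(con))
  offset <- 0L
  bad <- integer(0)
  repeat {
    lines <- readLines(con, n = chunk)
    if (length(lines) == 0L) break
    # quotes per line = original length minus length with quotes stripped
    nq <- nchar(lines) - nchar(gsub('"', "", lines, fixed = TRUE))
    bad <- c(bad, offset + which(nq %% 2L == 1L))
    offset <- offset + length(lines)
  }
  bad
}
```

Running this on "data.tsv" should point you straight at the offending record(s).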

The solution was to explicitly set quote = "" in the argument list.
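Here is a minimal base-R demonstration of the failure mode and the fix. The file contents are invented for illustration; read.delim shows the same quote handling because read.delim.ffdf passes extra arguments through to the underlying read function:

```r
# A tiny tab-separated file with an unmatched double quote on one line.
tmp <- tempfile(fileext = ".tsv")
writeLines(c("x\ty",
             "1\ta",
             '2\tb" oops',   # unmatched quote: swallows following lines
             "3\tc"), tmp)

# Default quote = "\"" lets the stray quote merge rows (with a warning).
bad  <- suppressWarnings(read.delim(tmp))
# quote = "" treats quote characters as ordinary text: all rows survive.
good <- read.delim(tmp, quote = "")

nrow(bad)   # fewer than 3 (rows merged past the stray quote)
nrow(good)  # 3
```

The same quote = "" argument added to the read.delim.ffdf call in the question stops the runaway string before it can approach the 2^31 - 1 byte limit.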
