
Comparing speed of fread vs. read.table for reading the first 1M rows out of 100M

Views: 361 | Published: 2020/10/15 19:08:05 | Tags: r, dataframe, data.table


Question

I have a 14GB data.txt file. I compared the speed of fread and read.table by reading only the first 1M rows. fread appears to be much slower, although it is not supposed to be, and it takes some time before the percentage progress counter shows up.

What could be the reason? I thought fread was supposed to be super fast. I am using a Windows computer.

Recommended answer

fread memory-maps (mmaps) the file. This takes some time, and it maps the whole file, even when only the first rows are requested. In return, subsequent read-ins of the same file will be faster.

read.table does not mmap the whole file. It can read the file line by line and stop at line 1000000.

You can see some background on mmap at "mmap() vs. reading blocks". The examples in the help for fread highlight this behaviour.
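To check the behaviour described above on your own machine, a minimal timing sketch along the following lines may help. It is not the asker's benchmark: the temporary file, the column names `id` and `value`, and the row counts are all assumptions chosen so the script runs quickly, and a file this small will not reproduce the gap seen on a 14GB file. Substitute your own path and row count to test the real case.

```r
library(data.table)  # provides fread()

# Hypothetical stand-in for the 14GB data.txt: a small tab-separated temp file.
path <- tempfile(fileext = ".txt")
n_total <- 10000L
write.table(
  data.frame(id = seq_len(n_total), value = rnorm(n_total)),
  file = path, sep = "\t", row.names = FALSE, quote = FALSE
)

n_first <- 1000L  # stands in for "the first 1M rows"

# read.table reads sequentially and stops after n_first rows.
t_base <- system.time(
  df <- read.table(path, header = TRUE, sep = "\t", nrows = n_first)
)

# fread is given the same row limit; time the first call and a repeat call
# separately so the two can be compared.
t_fread_first  <- system.time(dt  <- fread(path, nrows = n_first))
t_fread_second <- system.time(dt2 <- fread(path, nrows = n_first))

cat("read.table elapsed:    ", t_base[["elapsed"]], "\n")
cat("fread elapsed (first): ", t_fread_first[["elapsed"]], "\n")
cat("fread elapsed (second):", t_fread_second[["elapsed"]], "\n")
```

Timing the second fread call separately is a way to look for the "subsequent read-ins will be faster" effect the answer mentions. Whether it shows up will depend on the file size, the operating system's file cache, and the data.table version.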
