关于使用多重处理读取文件 [英] About using multiprocessing to read file

查看：48 发布时间：2020/5/13 20:17:18 python multiprocessing

本文介绍了关于使用多重处理读取文件的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我的文件夹中有很多文件，所以我认为我应该使用多进程，然后我使用多进程读取文件夹中的txt文件，但是我比较是否使用多进程的时间，我发现如果不使用游泳池，速度会更快.我不知道为什么那么我应该在什么情况下使用Pool读取文件(大文件?)

I have many files in the folders,so I think I should use multiprocess , then I use multiprocess to read txt file in the folder, But I compare the time if I used multiprocess or not , I found if I don't use pool is more fast. I don't know why , so what situation should I use Pool to read file( huge files?)

using Pool
time:0.5836s
not using Pool
time:0.0076s

代码是，

import pandas as pd
from multiprocessing import Pool
import glob2,os,time

class PandasReadFile:

    def __init__(self):
        print('123')

    def readFilePool(self,path):
        n,t=0,time.time()
        print(t)
        pp = Pool(processes=1)

        # here is using pool
        df = pd.concat(pp.map(self.read_csv, glob2.iglob(os.path.join(path, "*.txt"))))
        # not using pool
        # df = pd.concat(map(pd.read_csv, glob2.iglob(os.path.join(path, "*.txt"))))
        t = time.time() - t
        print('%.4fs' % (t))
        print(df)

    @staticmethod
    def read_csv(filename):
        return pd.read_csv(filename)

if __name__ == '__main__':
    p = PandasReadFile()
    p.readFilePool('D:/')

关于使用多重处理读取文件 [英] About using multiprocessing to read file

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

关于使用多重处理读取文件 [英] About using multiprocessing to read file

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭