Python Multiprocessing: Handling Child Errors in Parent


Problem Description

I am currently playing around with multiprocessing and queues. I have written a piece of code to export data from mongoDB, map it into a relational (flat) structure, convert all values to string and insert them into mysql.

Each of these steps is submitted as a process and given import/export queues, save for the mongoDB export which is handled in the parent.

As you will see below, I use queues, and child processes terminate themselves when they read "None" from the queue. The problem I currently have is that, if a child process runs into an unhandled Exception, this is not recognized by the parent and the rest just keeps running. What I want to happen is that the whole shebang quits and at best re-raises the child error.
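
For illustration, such a sentinel-driven worker looks roughly like this (a minimal sketch only; the actual mapper/converter/writer classes are not shown in the question, and Worker here is a hypothetical stand-in):

import multiprocessing


class Worker(multiprocessing.Process):
    """Hypothetical worker: reads items from in_q, writes results to out_q,
    and terminates itself when it reads the None sentinel."""

    def __init__(self, in_q, out_q):
        super(Worker, self).__init__()
        self.in_q = in_q
        self.out_q = out_q

    def run(self):
        while True:
            item = self.in_q.get()
            if item is None:  # sentinel: no more work, shut down quietly
                break
            # placeholder for the real mapping / converting / writing step
            self.out_q.put(str(item))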

I have two questions:

  1. How can I detect the child error in the parent?
  2. How do I kill my child processes after detecting the error (best practice)? I realize that putting "None" on the queue to kill the child is pretty dirty.
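
As a side note on the first question: a child killed by an unhandled exception ends with a non-zero exitcode, so the parent can at least detect that something failed (though not recover the exception itself) by checking the Process handles, e.g.:

for p in workers:  # "workers" stands for any list of started child processes
    if not p.is_alive() and p.exitcode != 0:
        print 'child %s died with exitcode %s' % (p.name, p.exitcode)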

I am using python 2.7.

Here are the essential parts of my code:

# Establish communication queues
mongo_input_result_q = multiprocessing.Queue()
mapper_result_q = multiprocessing.Queue()
converter_result_q = multiprocessing.Queue()

[...]

    # create child processes
    # all processes generated here are subclasses of "multiprocessing.Process"

    # create mapper
    mappers = [mongo_relational_mapper.MongoRelationalMapper(mongo_input_result_q, mapper_result_q, columns, 1000)
               for i in range(10)]

    # create datatype converter, converts everything to str
    converters = [datatype_converter.DatatypeConverter(mapper_result_q, converter_result_q, 'str', 1000)
                  for i in range(10)]

    # create mysql writer
    # I create a list of writers. currently only one, 
    # but I have the option to parallelize it further
    writers = [mysql_inserter.MySqlWriter(mysql_host, mysql_user, mysql_passwd, mysql_schema, converter_result_q
               , columns, 'w_'+mysql_table, 1000) for i in range(1)]

    # starting mapper
    for mapper in mappers:
        mapper.start()
    time.sleep(1)

    # starting converter
    for converter in converters:
        converter.start()

    # starting writer
    for writer in writers:
        writer.start()

[... initializing mongo db connection ...]

    # put each dataset read to queue for the mapper
    for row in mongo_collection.find({inc_column: {"$gte": start}}):
        mongo_input_result_q.put(row)
        count += 1
        if count % log_counter == 0:
            print 'Mongo Reader' + " " + str(count)
    print "MongoReader done"

    # Processes are terminated when they read "None" object from queue
    # now that reading is finished, put None for each mapper in the queue so they terminate themselves
    # the same for all followup processes
    for mapper in mappers:
        mongo_input_result_q.put(None)
    for mapper in mappers:
        mapper.join()
    for converter in converters:
        mapper_result_q.put(None)
    for converter in converters:
        converter.join()
    for writer in writers:
        converter_result_q.put(None)
    for writer in writers:
        writer.join()

Recommended Answer

I don't know standard practice but what I've found is that to have reliable multiprocessing I design the methods/class/etc. specifically to work with multiprocessing. Otherwise you never really know what's going on on the other side (unless I've missed some mechanism for this).

Specifically what I do is:

  • Subclass multiprocessing.Process or make functions that specifically support multiprocessing (wrapping functions that you don't have control over if necessary)
  • always provide a shared error multiprocessing.Queue from the main process to each worker process
  • enclose the entire run code in a try: ... except Exception as e. Then when something unexpected happens send an error package with:
    • the process id that died
    • the exception with its original context (check here). The original context is really important if you want to log useful information in the main process.
  • define a stop token in the class or function.
  • when the main process wants the workers to stop, just send the stop token; to stop everyone, send enough for all the processes.
  • the wrapping loop checks the input q for the token or whatever other input you want (see the sketch after this list)
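
A rough sketch of that pattern in Python 2.7 (the names RobustWorker, STOP and work_on are illustrative assumptions, not a fixed API; the traceback is sent as a formatted string because traceback objects don't pickle):

import multiprocessing
import traceback

STOP = 'STOP'  # stop token; any dedicated sentinel value works


class RobustWorker(multiprocessing.Process):
    """Hypothetical worker built for multiprocessing: reports failures
    to the main process through a shared error queue."""

    def __init__(self, in_q, out_q, error_q):
        super(RobustWorker, self).__init__()
        self.in_q = in_q
        self.out_q = out_q
        self.error_q = error_q  # shared error queue back to the main process

    def run(self):
        try:
            while True:
                item = self.in_q.get()
                if item == STOP:  # stop token sent by the main process
                    break
                # expected problems are handled here as part of normal operation
                self.out_q.put(self.work_on(item))
        except Exception:
            # catch-all: report which process died and the original traceback
            self.error_q.put((self.pid, traceback.format_exc()))

    def work_on(self, item):
        return str(item)  # placeholder for the real work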

The end result is worker processes that can survive for a long time and that can let you know what's happening when something goes wrong. They will die quietly since you can handle whatever you need to do after the catch-all exception and you will also know when you need to restart a worker.
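
On the parent side this is usually paired with a small monitoring loop that drains the error queue and reacts, for example (again only a sketch, reusing the hypothetical names from the worker above):

import Queue  # Python 2 module name; it is "queue" in Python 3

STOP = 'STOP'  # same stop token the workers check for


def monitor(workers, error_q, in_q, poll_interval=1.0):
    """Watch the shared error queue; on the first child error, stop
    the remaining workers and re-raise the error in the parent."""
    while any(w.is_alive() for w in workers):
        try:
            pid, tb = error_q.get(timeout=poll_interval)
        except Queue.Empty:
            continue
        # a child reported an error: ask every worker to stop ...
        for _ in workers:
            in_q.put(STOP)
        for w in workers:
            w.join()
        # ... then surface the child's original traceback in the parent
        raise RuntimeError('worker %d failed:\n%s' % (pid, tb))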

Again, I've just come to this pattern through trial and error so I don't know how standard it is. Does that help with what you are asking for?
