pandas -日期范围内每天的新行 [英] Pandas - New Row for Each Day in Date Range

查看：69 发布时间：2020/5/24 2:26:21 python pandas

本文介绍了 pandas -日期范围内每天的新行的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个Pandas df，其中一列(Reservation_Dt_Start)代表日期范围的开始，另一列(Reservation_Dt_End)代表日期范围的结束.

I have a Pandas df with one column (Reservation_Dt_Start) representing the start of a date range and another (Reservation_Dt_End) representing the end of a date range.

我希望将每一行扩展为具有与该日期范围内的日期一样多的记录，而不是每行都有一个日期范围，而每行都代表这些日期之一.

Rather than each row having a date range, I'd like to expand each row to have as many records as there are dates in the date range, with each new row representing one of those dates.

请参见下面的两个图片，以获取示例输入和所需的输出.

See the two pics below for an example input and the desired output.

下面的代码段有效！！但是，对于输入表中的每250行，需要1秒钟才能运行.鉴于我的输入表的大小为120,000,000行，因此此代码将花费大约一周的时间来运行.

The code snippet below works!! However, for every 250 rows in the input table, it takes 1 second to run. Given my input table is 120,000,000 rows in size, this code will take about one week to run.

pd.concat([pd.DataFrame({'Book_Dt': row.Book_Dt,
                         'Day_Of_Reservation': pd.date_range(row.Reservation_Dt_Start, row.Reservation_Dt_End),
                         'Pickup': row.Pickup,
                         'Dropoff' : row.Dropoff,
                         'Price': row.Price}, 

                          columns=['Book_Dt','Day_Of_Reservation', 'Pickup', 'Dropoff' , 'Price']) 
                          for i, row in df.iterrows()], ignore_index=True)

必须有一种更快的方法来执行此操作.有任何想法吗?谢谢！

There has to be a faster way to do this. Any ideas? Thanks!

pandas -日期范围内每天的新行 [英] Pandas - New Row for Each Day in Date Range

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

pandas -日期范围内每天的新行 [英] Pandas - New Row for Each Day in Date Range

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭