使用Pandas将值替换为2列中的条件 [英] Replace value with a condition from 2 columns using pandas

查看：118 发布时间：2020/5/24 1:23:46 python pandas dataframe datetime pandas-groupby

本文介绍了使用Pandas将值替换为2列中的条件的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个如下所示的熊猫数据框

I have a pandas data-frame like as shown below

df1_new = pd.DataFrame({'person_id': [1, 2, 3, 4, 5],
                        'start_date': ['07/23/2377', '05/29/2477', '02/03/2177', '7/27/2277', '7/13/2077'],
                        'start_datetime': ['07/23/2377  12:00:00', '05/29/2477  04:00:00', '02/03/2177  02:00:00', '7/27/2277  05:00:00', '7/13/2077  12:00:00'],
                        'end_date': ['07/25/2377', '06/09/2477', '02/05/2177', '01/01/2000', '01/01/2000'],
                        'end_datetime': ['07/25/2377 02:00:00', '06/09/2477 04:00:00', '02/05/2177 01:00:00', '01/01/2000 00:00:00', '01/01/2000 00:00:00'],
                        'Type' :['IP','IP','OP','OP','IP']})

我想做的是

if ((end_date contains 2000 or end_datetime contains 2000) and (type == IP)) then
     end_date = start_date + 2 days
     end_datetime = start_datetime + 2 days

else ((if end_date contains 2000 or end_datetime contains 2000) and (type == OP)) then 
     end_date = start_date
     end_datetime = start_datetime

这是我尝试过的方法，但未获得准确的输出结果

This is what I tried but it isn't resulting in accurate output

 df['end_date'] = df['start_date'].apply(lambda x: df['start_date'] + pd.DateOffset(days=2) if (x == 'OP' and x == '01/01/2000') else df['start_date'])
 df['end_datetime'] = df['start_datetime'].apply(lambda x: df['start_datetime'] + pd.DateOffset(days=2) if (x == 'OP' and x == '01/01/2000') else df['start_datetime'])

我希望我的输出如下所示

I expect my output to be like as shown below

推荐答案

这里是一个示例.查看评论，我想您会了解基本方法的.

Here is an example. See comments I think you'll understand the basic approach.

from copy import deepcopy
from datetime import datetime
import pandas as pd
from dateutil.relativedelta import relativedelta


df = pd.DataFrame.from_dict({
    'person_id': [1, 2, 3, 4, 5],
    'start_date': ['07/23/2377', '05/29/2477', '02/03/2177', '7/27/2277', '7/13/2077'],
    'start_datetime': ['07/23/2377  12:00:00', '05/29/2477  04:00:00', '02/03/2177  02:00:00', '7/27/2277  05:00:00', '7/13/2077  12:00:00'],
    'end_date': ['07/25/2377', '06/09/2477', '02/05/2177', '01/01/2000', '01/01/2000'],
    'end_datetime': ['07/25/2377 02:00:00', '06/09/2477 04:00:00', '02/05/2177 01:00:00', '01/01/2000 00:00:00', '01/01/2000 00:00:00'],
    'type': ['IP', 'IP', 'OP', 'OP', 'IP']
})


def calculate_days(x):
    # datetime object from string
    x['end_date'] = datetime.strptime(x['end_date'], '%m/%d/%Y')
    x['start_date'] = datetime.strptime(x['start_date'], '%m/%d/%Y')
    x['end_datetime'] = datetime.strptime(x['end_datetime'], '%m/%d/%Y %H:%M:%S')
    x['start_datetime'] = datetime.strptime(x['start_datetime'], '%m/%d/%Y %H:%M:%S')
    # you need only 2000 year...
    if not (x['end_date'].year == 2000 or x['end_datetime'] == 2000):
        return x

    # type conditions and calculations...
    if x['type'] == 'IP':
        x['end_date'] = x['start_date'] + relativedelta(days=2)
        x['end_datetime'] = x['start_datetime'] + relativedelta(days=2)
    elif x['type'] == 'OP':
        x['end_date'] = deepcopy(x['start_date'])
        x['end_datetime'] = deepcopy(x['start_datetime'])
    return x


# apply our custom function
df = df.apply(calculate_days, axis=1)
print(df.head())
#   person_id           start_date  ...         end_datetime type
# 0          1  2377-07-23 00:00:00  ...  2377-07-25 02:00:00   IP
# 1          2  2477-05-29 00:00:00  ...  2477-06-09 04:00:00   IP
# 2          3  2177-02-03 00:00:00  ...  2177-02-05 01:00:00   OP
# 3          4  2277-07-27 00:00:00  ...  2277-07-27 05:00:00   OP
# 4          5  2077-07-13 00:00:00  ...  2077-07-15 12:00:00   IP
# [5 rows x 6 columns]

希望这会有所帮助.

这篇关于使用Pandas将值替换为2列中的条件的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！

查看全文

使用Pandas将值替换为2列中的条件 [英] Replace value with a condition from 2 columns using pandas

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

使用Pandas将值替换为2列中的条件 [英] Replace value with a condition from 2 columns using pandas

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭