用列/系列中的值替换pandas子字符串 [英] Replacing a pandas substring with value from a column/Series

查看:114
本文介绍了用列/系列中的值替换pandas子字符串的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我试图用pandas列中的值替换Pandas子字符串.之前尚未回答过这个问题.

I am trying to replace a Pandas substring with the value from a pandas column. This question has not been answered before.

TypeError跟踪(最近一次调用 最后) ----> 1 df ['name'].str.replace('(C)',df.name2)

TypeError Traceback (most recent call last) in ----> 1 df['name'].str.replace('(C)', df.name2)

〜/opt/anaconda3/lib/python3.7/site-packages/pandas/core/strings.py在 包装器(自己,* args,** kwargs)1841)1842
引发TypeError(msg) -> 1843 return func(self,* args,** kwargs)1844 1845包装器.名称 = func_name

~/opt/anaconda3/lib/python3.7/site-packages/pandas/core/strings.py in wrapper(self, *args, **kwargs) 1841 ) 1842
raise TypeError(msg) -> 1843 return func(self, *args, **kwargs) 1844 1845 wrapper.name = func_name

〜/opt/anaconda3/lib/python3.7/site-packages/pandas/core/strings.py在 replace(self,pat,repl,n,case,flags,regex)2714 def replace(self,pat,repl,n = -1,case = None,flags = 0,regex = True):
第2715章真相大白 -> 2716 self._parent,pat,repl,n = n,case = case,flags = flags,regex = regex 2717)2718返回 self._wrap_result(result)

~/opt/anaconda3/lib/python3.7/site-packages/pandas/core/strings.py in replace(self, pat, repl, n, case, flags, regex) 2714 def replace(self, pat, repl, n=-1, case=None, flags=0, regex=True):
2715 result = str_replace( -> 2716 self._parent, pat, repl, n=n, case=case, flags=flags, regex=regex 2717 ) 2718 return self._wrap_result(result)

〜/opt/anaconda3/lib/python3.7/site-packages/pandas/core/strings.py在 str_replace(arr,pat,repl,n,case,flags,regex) 597#检查repl是否有效(GH 13438,GH 15055) 第598章真相大白 -> 599提高TypeError("repl必须为字符串或可调用") 600 601 is_compiled_re = is_re(pat)

~/opt/anaconda3/lib/python3.7/site-packages/pandas/core/strings.py in str_replace(arr, pat, repl, n, case, flags, regex) 597 # Check whether repl is valid (GH 13438, GH 15055) 598 if not (is_string_like(repl) or callable(repl)): --> 599 raise TypeError("repl must be a string or callable") 600 601 is_compiled_re = is_re(pat)

TypeError:repl必须为字符串或可调用

TypeError: repl must be a string or callable

data = {'id': [1, 2, 3, 4], 'name': ['name1 (C)', 'name2 (B)', 'name3', 'name4'],
        'name2':['Jane','Abbie','Luke','Peter']}


df = pd.DataFrame(data)

df['name'].str.replace('\(C\)', df.name2)

预期结果:

    id  name    name2
0   1   name1 Jane  Jane
1   2   name2 Abbie Abbie
2   3   name3   Luke
3   4   name4   Peter

推荐答案

您需要python re.sub和listcomp

You need python re.sub and listcomp

import re

df['new_name'] = [re.sub(r'\(\w+\)', r, s) for r, s in zip(df.name2, df.name)]

Out[280]:
   id       name  name2     new_name
0   1  name1 (C)   Jane   name1 Jane
1   2  name2 (B)  Abbie  name2 Abbie
2   3      name3   Luke        name3
3   4      name4  Peter        name4

这篇关于用列/系列中的值替换pandas子字符串的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆