如何设置 pandas 数据框的子集的样式? [英] How do I style a subset of a pandas dataframe?

查看:92
本文介绍了如何设置 pandas 数据框的子集的样式?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我之前曾问过如何做我仅对熊猫数据框的最后一行设置样式?,并且对我给的玩具问题有一个完美的答案.

I previously asked How do I style only the last row of a pandas dataframe? and got a perfect answer to the toy problem that I gave.

事实证明,我应该使玩具问题更接近我的实际问题.考虑一个文本框超过一列的数据框(我可以对其应用样式):

Turns out I should have made the toy problem a bit closer to my real problem. Consider a dataframe with more than 1 column of text data (which I can apply styling to):

import pandas as pd
import numpy as np
import seaborn as sns
cm = sns.diverging_palette(-5, 5, as_cmap=True)

df = pd.DataFrame(np.random.randn(3, 4))
df['text_column'] = 'a'
df['second_text_column'] = 'b'
df.style.background_gradient(cmap=cm)

但是,像上一个问题一样,我只希望将此样式应用于最后一行.上一个问题的答案是:

However, like the previous question, I wish to only apply this styling to the last row. The answer to the previous question was:

df.style.background_gradient(cmap=cm, subset=df.index[-1])

在这种情况下会出现错误:

which in this case gives the error:

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
/usr/local/miniconda/lib/python3.7/site-packages/IPython/core/formatters.py in __call__(self, obj)
    343             method = get_real_method(obj, self.print_method)
    344             if method is not None:
--> 345                 return method()
    346             return None
    347         else:

/usr/local/miniconda/lib/python3.7/site-packages/pandas/io/formats/style.py in _repr_html_(self)
    161         Hooks into Jupyter notebook rich display system.
    162         """
--> 163         return self.render()
    164 
    165     @Appender(_shared_docs['to_excel'] % dict(

/usr/local/miniconda/lib/python3.7/site-packages/pandas/io/formats/style.py in render(self, **kwargs)
    457         * table_attributes
    458         """
--> 459         self._compute()
    460         # TODO: namespace all the pandas keys
    461         d = self._translate()

/usr/local/miniconda/lib/python3.7/site-packages/pandas/io/formats/style.py in _compute(self)
    527         r = self
    528         for func, args, kwargs in self._todo:
--> 529             r = func(self)(*args, **kwargs)
    530         return r
    531 

/usr/local/miniconda/lib/python3.7/site-packages/pandas/io/formats/style.py in _apply(self, func, axis, subset, **kwargs)
    536         if axis is not None:
    537             result = data.apply(func, axis=axis,
--> 538                                 result_type='expand', **kwargs)
    539             result.columns = data.columns
    540         else:

/usr/local/miniconda/lib/python3.7/site-packages/pandas/core/frame.py in apply(self, func, axis, broadcast, raw, reduce, result_type, args, **kwds)
   6485                          args=args,
   6486                          kwds=kwds)
-> 6487         return op.get_result()
   6488 
   6489     def applymap(self, func):

/usr/local/miniconda/lib/python3.7/site-packages/pandas/core/apply.py in get_result(self)
    149             return self.apply_raw()
    150 
--> 151         return self.apply_standard()
    152 
    153     def apply_empty_result(self):

/usr/local/miniconda/lib/python3.7/site-packages/pandas/core/apply.py in apply_standard(self)
    255 
    256         # compute the result using the series generator
--> 257         self.apply_series_generator()
    258 
    259         # wrap results

/usr/local/miniconda/lib/python3.7/site-packages/pandas/core/apply.py in apply_series_generator(self)
    284             try:
    285                 for i, v in enumerate(series_gen):
--> 286                     results[i] = self.f(v)
    287                     keys.append(v.name)
    288             except Exception as e:

/usr/local/miniconda/lib/python3.7/site-packages/pandas/core/apply.py in f(x)
     76 
     77             def f(x):
---> 78                 return func(x, *args, **kwds)
     79         else:
     80             f = func

/usr/local/miniconda/lib/python3.7/site-packages/pandas/io/formats/style.py in _background_gradient(s, cmap, low, high, text_color_threshold)
    941             smin = s.values.min()
    942             smax = s.values.max()
--> 943             rng = smax - smin
    944             # extend lower / upper bounds, compresses color range
    945             norm = colors.Normalize(smin - (rng * low), smax + (rng * high))

TypeError: ("unsupported operand type(s) for -: 'str' and 'str'", 'occurred at index text_column')

<pandas.io.formats.style.Styler at 0x7f948dde7278>

似乎来自于它试图对text_column中的字符串进行操作的事实.很公平.我如何告诉它仅适用于所有非文本列的最后一行?我可以给它提供明确的列名来使用或避免,但是我不知道如何将其传递给这个难以理解的subset方法.

which seems to come from the fact that it's trying to do an operation to strings in the text_column. Fair enough. How do I tell it to only apply to the last row for all non-text columns? I'm ok with giving it explicit column names to use or avoid, but I don't know how to pass that into this inscrutable subset method.

我正在跑步:

python version 3.7.3 
pandas version 0.24.2

推荐答案

subset使用tuple对我来说很有效,但是不确定这是否是最优雅的解决方案:

Using a tuple for subset worked for me, but not sure if it is the most elegant solution:

df.style.background_gradient(cmap=cm, 
                             subset=(df.index[-1], df.select_dtypes(float).columns))

输出:

这篇关于如何设置 pandas 数据框的子集的样式?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆