Filtering a Pandas DataFrame Using a Function

Views: 63 Published: 2020/10/17 22:22:38 Tags: python pandas dataframe data-science

Problem description

This question is related to the question I posted yesterday, which can be found here.

So, I went ahead and applied the solution provided by Jan to the entire data set. The solution is as follows:

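import re

# Compile once: characters counted as "English" (ASCII letters, digits, '-', '_', space).
ENGLISH_CHARACTER = re.compile(r'[-a-zA-Z0-9_ ]')

def is_probably_english(row, threshold=0.90):
    """Return True if at least `threshold` of the characters in row['App'] match."""
    name = row['App']
    if not name:  # guard against division by zero on an empty name
        return False
    matching = [character for character in name if ENGLISH_CHARACTER.search(character)]
    return len(matching) / len(name) >= threshold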

google_play_store_is_probably_english = google_play_store_no_duplicates.apply(is_probably_english, axis=1)

google_play_store_english = google_play_store_no_duplicates[google_play_store_is_probably_english]

So, from what I understand, we apply the is_probably_english function to every row of the google_play_store_no_duplicates DataFrame and store the results, one boolean per row, in google_play_store_is_probably_english. That boolean mask is then used to filter the non-English apps out of google_play_store_no_duplicates, and the end result is stored in a new DataFrame (google_play_store_english).
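To make the two steps concrete, here is a minimal sketch on a tiny, made-up DataFrame. The sample rows, the Rating column, and the names apps, mask, and english_apps are assumptions for illustration and are not taken from the real data set. It shows that apply with axis=1 returns a boolean pandas Series with one value per row, and that indexing the DataFrame with that Series keeps only the rows marked True.

```python
import re

import pandas as pd

# Characters counted as "English": ASCII letters, digits, '-', '_' and space.
ENGLISH_CHARACTER = re.compile(r'[-a-zA-Z0-9_ ]')


def is_probably_english(row, threshold=0.90):
    """Return True if at least `threshold` of the characters in row['App'] match."""
    name = row['App']
    matching = [character for character in name if ENGLISH_CHARACTER.search(character)]
    return len(matching) / len(name) >= threshold


# Hypothetical sample data standing in for google_play_store_no_duplicates.
apps = pd.DataFrame({
    'App': ['Photo Editor', '爱奇艺', 'Docs To Go'],
    'Rating': [4.1, 4.5, 4.0],
})

# Step 1: one boolean per row (a Series, aligned on the DataFrame's index).
mask = apps.apply(is_probably_english, axis=1)

# Step 2: boolean indexing keeps only the rows where the mask is True.
english_apps = apps[mask]

print(mask.tolist())
print(english_apps)
```

Because the mask shares the DataFrame's index, the filter stays aligned with the original rows.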

Does this make sense and does it seem like a sound way to approach the problem? Is there a better way to do this?
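One possible alternative, offered as a sketch and not as the accepted answer, is to compute the same ratio with pandas' vectorized string methods instead of a row-wise apply. The helper name probably_english_mask and the sample rows are hypothetical. Series.str.count and Series.str.len are standard pandas string accessors. The sketch assumes the App column holds non-empty strings.

```python
import pandas as pd


def probably_english_mask(names, threshold=0.90):
    """Boolean mask: share of 'English' characters in each name is >= threshold.

    `names` is assumed to be a Series of non-empty strings.
    """
    matching = names.str.count(r'[-a-zA-Z0-9_ ]')  # matching characters per name
    total = names.str.len()                        # total characters per name
    return (matching / total) >= threshold


# Hypothetical sample data; the real data set is not reproduced here.
apps = pd.DataFrame({'App': ['Instagram', 'Docs To Go', '中国語 AQリスニング']})

mask = probably_english_mask(apps['App'])
english_apps = apps[mask]

print(english_apps)
```

This variant works on the whole column at once and never builds a Python list per row, which is usually faster on large frames. The row-wise version may still be preferable when the test needs several columns.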
