如何找到真实数据的概率分布和参数? (Python 3) [英] How to find probability distribution and parameters for real data? (Python 3)

查看：878 发布时间：2020/5/4 8:54:19 python machine-learning statistics distribution data-fitting

本文介绍了如何找到真实数据的概率分布和参数? (Python 3)的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个来自sklearn的数据集，并且绘制了load_diabetes.target数据的分布图(即load_diabetes.data用于预测的回归值).

I have a dataset from sklearn and I plotted the distribution of the load_diabetes.target data (i.e. the values of the regression that the load_diabetes.data are used to predict).

之所以使用它，是因为它具有回归sklearn.datasets的变量/属性最少的数目.

I used this because it has the fewest number of variables/attributes of the regression sklearn.datasets.

使用Python 3，如何获得最相似的分布类型和分布参数?

Using Python 3, How can I get the distribution-type and parameters of the distribution this most closely resembles?

所有我知道的target值都是正偏斜的(正偏斜/右偏斜). . . Python中是否有办法提供一些分布，然后最适合target数据/向量?或者，根据给定的数据实际建议适合度?对于那些具有理论统计知识但很少将其应用于真实数据"的经验的人来说，这将是非常有用的.

All I know the target values are all positive and skewed (positve skew/right skew). . . Is there a way in Python to provide a few distributions and then get the best fit for the target data/vector? OR, to actually suggest a fit based on the data that's given? That would be realllllly useful for people who have theoretical statistical knowledge but little experience with applying it to "real data".

奖金使用这种方法找出真实数据"的后验分布会有意义吗?如果没有，为什么不呢?

Bonus Would it make sense to use this type of approach to figure out what your posterior distribution would be with "real data" ? If no, why not?

from sklearn.datasets import load_diabetes
import matplotlib.pyplot as plt
import seaborn as sns; sns.set()
import pandas as pd

#Get Data
data = load_diabetes()
X, y_ = data.data, data.target

#Organize Data
SR_y = pd.Series(y_, name="y_ (Target Vector Distribution)")

#Plot Data
fig, ax = plt.subplots()
sns.distplot(SR_y, bins=25, color="g", ax=ax)
plt.show()

如何找到真实数据的概率分布和参数? (Python 3) [英] How to find probability distribution and parameters for real data? (Python 3)

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录关闭

如何找到真实数据的概率分布和参数? (Python 3) [英] How to find probability distribution and parameters for real data? (Python 3)

问题描述

推荐答案

相关文章

AI人工智能最新文章

热门教程

热门工具

登录 关闭

登录关闭