Replicate logistic regression model from pyspark in scikit-learn

Views: 74 Published: 2021/5/31 18:37:14 Tags: python machine-learning scikit-learn pyspark


Problem description

Problem: The default implementations (no custom parameters set) of the logistic regression model in pyspark and scikit-learn seem to yield different results given their default parameter values.

I am trying to replicate a result from logistic regression (no custom parameters set) performed with pyspark (see: https://spark.apache.org/docs/latest/api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegression) with the logistic regression model from scikit-learn (see: http://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html).

It appears to me that the two model implementations (in pyspark and scikit-learn) do not share the same parameters, so I can't simply match the parameters in scikit-learn to those in pyspark. Is there a way to make both models match in their default configuration?

Parameters of the scikit-learn model (default parameters):

`LogisticRegression(
C=1.0,
class_weight=None,
dual=False,
fit_intercept=True,
intercept_scaling=1,
max_iter=100,
multi_class='ovr',
n_jobs=1,
penalty='l2',
random_state=None,
solver='liblinear',
tol=0.0001,
verbose=0,
warm_start=False)`

Parameters of the pyspark model (default parameters):

LogisticRegression(self, 
featuresCol="features", 
labelCol="label", 
predictionCol="prediction", 
maxIter=100,
regParam=0.0, 
elasticNetParam=0.0, 
tol=1e-6, 
fitIntercept=True, 
threshold=0.5, 
thresholds=None, 
probabilityCol="probability", 
rawPredictionCol="rawPrediction", 
standardization=True, 
weightCol=None, 
aggregationDepth=2, 
family="auto")

Thank you very much!
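One visible difference in the two default lists is regularization: pyspark defaults to `regParam=0.0` (no penalty), while scikit-learn defaults to `C=1.0` with `penalty='l2'`. The sketch below shows one possible way to translate the pyspark-style settings into scikit-learn keyword arguments. The helper name `sklearn_kwargs_from_spark` is hypothetical, and the relation `regParam = 1 / (n_samples * C)` is an assumption based on pyspark averaging the loss over rows while scikit-learn sums it and scales it by `C`. With a non-zero `regParam` and `standardization=True`, pyspark penalizes coefficients of standardized features, so the match is likely only approximate unless the features are scaled the same way. Convergence criteria also differ between the optimizers, so copying `tol` and `maxIter` across is a starting point rather than a guarantee.

```python
def sklearn_kwargs_from_spark(n_samples, reg_param=0.0, elastic_net_param=0.0,
                              max_iter=100, tol=1e-6, fit_intercept=True):
    """Hypothetical helper: build keyword arguments for
    sklearn.linear_model.LogisticRegression from pyspark-style settings."""
    kwargs = {"max_iter": max_iter, "tol": tol, "fit_intercept": fit_intercept}

    if reg_param == 0.0:
        # pyspark's default applies no penalty. A very large C makes
        # scikit-learn's L2 penalty negligible, which approximates that.
        kwargs.update(penalty="l2", C=1e12, solver="lbfgs")
        return kwargs

    # Assumed mapping between the two objectives: regParam = 1 / (n * C).
    kwargs["C"] = 1.0 / (n_samples * reg_param)

    if elastic_net_param == 0.0:
        # Pure L2 penalty.
        kwargs.update(penalty="l2", solver="lbfgs")
    elif elastic_net_param == 1.0:
        # Pure L1 penalty; saga is one solver that supports it.
        kwargs.update(penalty="l1", solver="saga")
    else:
        # Mixed penalty; elasticNetParam plays the role of l1_ratio.
        kwargs.update(penalty="elasticnet", solver="saga",
                      l1_ratio=elastic_net_param)
    return kwargs


# Settings that mirror the pyspark defaults listed in the question.
default_kwargs = sklearn_kwargs_from_spark(n_samples=1000)
print(default_kwargs)
```

The resulting dictionary could then be passed as `LogisticRegression(**default_kwargs)`. The parameter names follow the scikit-learn signature quoted in the question; other scikit-learn releases may name or handle the penalty options differently, so the result should be checked against the installed version.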
