Spark和ScalaNLP库Breeze可以一起使用吗? [英] Can Spark and the ScalaNLP library Breeze be used together?

查看:213
本文介绍了Spark和ScalaNLP库Breeze可以一起使用吗?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在用Apache Spark开发基于Scala的极限学习机.我的模型必须是Spark Estimator,并使用Spark框架才能适合机器学习管道.有谁知道Breeze是否可以与Spark串联使用?我所有的数据都在Spark数据帧中,可以想象,我可以使用Breeze导入它,使用Breeze DenseVectors作为数据结构,然后转换为Estimator部分的DataFrame. Breeze的优点在于,它对Moore-Penrose伪逆具有函数pinv,对于非方阵是逆的.据我所知,Spark MLlib中没有等效功能.我不知道是否可以将Breeze张量转换为Spark DataFrames,因此,如果有人对此有经验,那将非常有用.谢谢!

I'm developing a Scala-based extreme learning machine, in Apache Spark. My model has to be a Spark Estimator and use the Spark framework in order to fit into the machine learning pipeline. Does anyone know if Breeze can be used in tandem with Spark? All of my data is in Spark data frames and conceivably I could import it using Breeze, use Breeze DenseVectors as the data structure then convert to a DataFrame for the Estimator part. The advantage of Breeze is that it has a function pinv for the Moore-Penrose pseudo-inverse, which is an inverse for a non-square matrix. There is no equivalent function in the Spark MLlib, as far as I can see. I have no idea whether it's possible to convert Breeze tensors to Spark DataFrames so if anyone has experience of this it would be really useful. Thanks!

推荐答案

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆