如何解释神经网络层的权重分布 [英] How to interpret weight distributions of neural net layers

查看:462
本文介绍了如何解释神经网络层的权重分布的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我设计了一个三层神经网络,其输入是来自CNN和RNN的串联特征.网络学习的权重取很小的值.对此的合理解释是什么?以及如何解释Tensorflow中的权重直方图和分布?有什么好的资源吗?

这是使用张量板可视化的3层神经网络的第一个隐藏层的权重分布.如何解释呢?所有权重都占零值?

这是3层神经网络的第二个隐藏层的权重分布:

解决方案

如何解释Tensorflow中的权重直方图和分布?

嗯,您可能没有意识到这一点,但是您只是问了ML& 100万美元的问题. AI ...

模型的可解释性是当前研究的一个活跃和过热的领域(认为是圣杯之类的东西),最近提出来的原因不仅仅在于(通常是巨大的)深度学习模型在各种任务中的成功;这些型号目前仅是黑匣子,我们对此自然感到不舒服...

有什么好的资源吗?

可能您所想的资源可能不完全相同,我们在这里还没有一个非常适合的主题,但是由于您提出的问题……:

对于初学者来说,这些就足够了,并可以让您大致了解您所询问的主题...

更新(2018年10月):我在对问题

I have designed a 3 layer neural network whose inputs are the concatenated features from a CNN and RNN. The weights learned by network take very small values. What is the reasonable explanation for this? and how to interpret the weight histograms and distributions in Tensorflow? Any good resource for it?

This is the weight distribution of the first hidden layer of a 3 layer neural network visualized using tensorboard. How to interpret this? all the weights are taking up zero value?

This is the weight distribution of the second hidden layer of a 3 layer neural:

解决方案

how to interpret the weight histograms and distributions in Tensorflow?

Well, you probably didn't realize it, but you have just asked the 1 million dollar question in ML & AI...

Model interpretability is a hyper-active and hyper-hot area of current research (think of holy grail, or something), which has been brought forward lately not least due to the (often tremendous) success of deep learning models in various tasks; these models are currently only black boxes, and we naturally feel uncomfortable about it...

Any good resource for it?

Probably not exactly the kind of resources you were thinking of, and we are well off a SO-appropriate topic here, but since you asked...:

On a more practical level:

These should be enough for starters, and to give you a general idea of the subject about which you asked...

UPDATE (Oct 2018): I have put up a much more detailed list of practical resources in my answer to the question Predictive Analytics - "Why" factor?

这篇关于如何解释神经网络层的权重分布的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆