First of all, great work.
In your thesis, the "Dropout as a Bayesian Approximation..." paper, and the "Concrete Dropout" paper, @yaringal, you appear to apply the dropout distribution only to the weights and not to the biases, which leads to a p-dependent regularization term that involves only the weight matrices.
However, in the PyTorch implementation (I didn't check the other ones), the regularization term sums the squares of layer.parameters(), which collects the biases as well. This yields a p-dependent regularization term for the biases too, which is probably not what you want once you start optimizing p. Is this a bug, or am I missing something?
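To make the discrepancy concrete, here is a minimal sketch of the two variants (function names, the weight-regularizer scale, and the exact form of the p-dependent factor are illustrative paraphrases, not copied from the repo):

```python
import torch
import torch.nn as nn

WEIGHT_REG = 1e-6  # illustrative scale, not the repo's actual value

def reg_all_params(layer: nn.Linear, p: float) -> torch.Tensor:
    # Paraphrase of the behaviour described in the issue: the sum of
    # squares runs over layer.parameters(), so the bias also picks up
    # the p-dependent 1/(1-p) factor.
    sum_sq = sum(torch.sum(param ** 2) for param in layer.parameters())
    return WEIGHT_REG * sum_sq / (1.0 - p)

def reg_weights_only(layer: nn.Linear, p: float) -> torch.Tensor:
    # What the papers describe: only the weight matrix carries the
    # p-dependent factor; the bias would be regularized separately
    # (without the 1/(1-p) scaling) or not at all.
    sum_sq = torch.sum(layer.weight ** 2)
    return WEIGHT_REG * sum_sq / (1.0 - p)
```

The difference between the two is exactly the p-scaled bias term, which becomes a function of p (and hence affects its gradient) as soon as p is treated as a trainable parameter.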