ZeroOutDense

Bases: tf.keras.layers.Dense

Dense layer with zero-out (weight masking) mechanism.

This masking mechanism includes:

Create an non-trainable persistent mask shapes like the weight matrix.
During build, assign 0 values to weight matrix.
During forward pass call, element-wise multiply the weight matrix with the mask matrix to block the gradient from back-propagating to the weight matrix. Hence, the masked weights are technical not trainable.

Note

Why (3) blocks the gradient in the masked weight? Consider the forward-pass of $masked = w \cdot m$ , its backward pass is $d_w = m \cdot d_{masked}$ , therefore if $m$ is 0, $d_w$ is 0, hence the gradient of weight is 0 when its mask's value is 0.

Parameters:

Name	Type	Description	Default
`zero_out_rate`	`float`	Probability of a weight begin masked.	required
`units`	`int`	Number of units in the layer.	required
`activation`	`str`	Activation function to use.	`None`
`use_bias`	`bool`	Whether the layer uses a bias vector.	`True`
`kwargs`	`dict`	Any Keyword arguments in `tf.keras.layers.Dense`.	`{}`

Forward pass:

$y = x(w \cdot m) + b$

Warning

" $\cdot$ " is element-wise multiplication.

$x$ : input tensor.
$w$ : weight matrix provided by this layer, trainable.
$m$ : mask matrix, provided by this layer, NOT trainable.
$b$ : bias vector, provided by this layer if use_bias is True (Default), trainable.

`build(input_shape)`

Build the layer.

`call(inputs)`

Forward pass.

Parameters:

Name	Type	Description	Default
`inputs`	`tf.Tensor`	Input tensor.	required

Returns:

Name	Type	Description
`outputs`	`tf.Tensor`	Output tensor.

`zero_out_weights()`

Manually assign zero values to the all weights using the mask.