8.16.0.4
17 He Initialization
These functions are used in setting the initial values in a given theta based on the shapes of the tensors desired in the theta.
Returns a list of tensors with shapes defined by θ-shapes with the following intialization rules.
Tensors of rank 1 are initialized to contain only 0.0.
Tensors of rank 2 are initialized to random numbers drawn from a normal distribution with a mean of 0.0 and a variance of (/ 2 fan-in) where fan-in is the last member of the shape.
Tensors of rank 3 are initialized to random numbers drawn from a normal distribution with a mean of 0.0 and a variance of (/ 2 fan-in) where fan-in is the product of the last two members of the shape.
Returns a tensor of shape s according to the rules described in init-theta.
Returns a tensor of shape s with only 0.0s as nested scalars in it.