Functions¶

nnabla_rl.functions.sample_gaussian(mean: nnabla._variable.Variable, ln_var: nnabla._variable.Variable, noise_clip: Optional[Tuple[float, float]] = None) → nnabla._variable.Variable[source]¶

Sample value from a gaussian distribution of given mean and variance.

Parameters

mean (nn.Variable) – Mean of the gaussian distribution
ln_var (nn.Variable) – Logarithm of the variance of the gaussian distribution
noise_clip (Optional[Tuple(float, float)]) – Clipping value of the sampled noise.

Returns

Sampled value from gaussian distribution of given mean and variance

Return type

nn.Variable

nnabla_rl.functions.sample_gaussian_multiple(mean: nnabla._variable.Variable, ln_var: nnabla._variable.Variable, num_samples: int, noise_clip: Optional[Tuple[float, float]] = None) → nnabla._variable.Variable[source]¶

Sample multiple values from a gaussian distribution of given mean and variance. The returned variable will have an additional axis in the middle as follows (batch_size, num_samples, dimension)

Parameters

mean (nn.Variable) – Mean of the gaussian distribution
ln_var (nn.Variable) – Logarithm of the variance of the gaussian distribution
num_samples (int) – Number of samples to sample
noise_clip (Optional[Tuple(float, float)]) – Clipping value of the sampled noise.

Returns

Sampled values from gaussian distribution of given mean and variance

Return type

nn.Variable

nnabla_rl.functions.expand_dims(x: nnabla._variable.Variable, axis: int) → nnabla._variable.Variable[source]¶

Add dimension to target axis of given variable

Parameters

x (nn.Variable) – Variable to expand the dimension
axis (int) – The axis to expand the dimension. Non negative.

Returns

Variable with additional dimension in the target axis

Return type

nn.Variable

nnabla_rl.functions.repeat(x: nnabla._variable.Variable, repeats: int, axis: int) → nnabla._variable.Variable[source]¶

Repeats the value along given axis for repeats times.

Parameters

x (nn.Variable) – Variable to repeat the values along given axis
repeats (int) – Number of times to repeat
axis (int) – The axis to expand the dimension. Non negative.

Returns

Variable with values repeated along given axis

Return type

nn.Variable

nnabla_rl.functions.sqrt(x: nnabla._variable.Variable)[source]¶

Compute the squared root of given variable

Parameters: x (nn.Variable) – Variable to compute the squared root
Returns: Squared root of given variable
Return type: nn.Variable

nnabla_rl.functions.std(x: nnabla._variable.Variable, axis: Optional[int] = None, keepdims: bool = False) → nnabla._variable.Variable[source]¶

Compute the standard deviation of given variable along axis.

Parameters

x (nn.Variable) – Variable to compute the squared root
axis (Optional[int]) – Axis to compute the standard deviation. Defaults to None. None will reduce all dimensions.
keepdims (bool) – Flag whether the reduced axis are kept as a dimension with 1 element.

Returns

Standard deviation of given variable along axis.

Return type

nn.Variable

nnabla_rl.functions.argmax(x: nnabla._variable.Variable, axis: Optional[int] = None, keepdims: bool = False) → nnabla._variable.Variable[source]¶

Compute the index which given variable has maximum value along the axis.

Parameters

x (nn.Variable) – Variable to compute the argmax
axis (Optional[int]) – Axis to compare the values. Defaults to None. None will reduce all dimensions.
keepdims (bool) – Flag whether the reduced axis are kept as a dimension with 1 element.

Returns

Index of the variable which its value is maximum along the axis

Return type

nn.Variable

nnabla_rl.functions.quantile_huber_loss(x0: nnabla._variable.Variable, x1: nnabla._variable.Variable, kappa: float, tau: nnabla._variable.Variable) → nnabla._variable.Variable[source]¶

Compute the quantile huber loss. See following papers for details:

https://arxiv.org/pdf/1710.10044.pdf

https://arxiv.org/pdf/1806.06923.pdf

Parameters

x0 (nn.Variable) – Quantile values
x1 (nn.Variable) – Quantile values
kappa (float) – Threshold value of huber loss which switches the loss value between squared loss and linear loss
tau (nn.Variable) – Quantile targets

Returns

Quantile huber loss

Return type

nn.Variable

nnabla_rl.functions.mean_squared_error(x0: nnabla._variable.Variable, x1: nnabla._variable.Variable) → nnabla._variable.Variable[source]¶

Convenient alias for mean squared error operation

Parameters

x0 (nn.Variable) – N-D array
x1 (nn.Variable) – N-D array

Returns

Mean squared error between x0 and x1

Return type

nn.Variable

nnabla_rl.functions.minimum_n(variables: Sequence[nnabla._variable.Variable]) → nnabla._variable.Variable[source]¶

Compute the minimum among the list of variables

Parameters: variables (Sequence[nn.Variable]) – Sequence of variables. All the variables must have same shape.
Returns: Minimum value among the list of variables
Return type: nn.Variable

nnabla_rl.functions.gaussian_cross_entropy_method(objective_function: Callable[[nnabla._variable.Variable], nnabla._variable.Variable], init_mean: nnabla._variable.Variable, init_var: nnabla._variable.Variable, pop_size: int = 500, num_elites: int = 10, num_iterations: int = 5, alpha: float = 0.25) → Tuple[nnabla._variable.Variable, nnabla._variable.Variable][source]¶

Optimize objective function with respect to input using cross entropy method using gaussian distribution

Examples

>>> import numpy as np
>>> import nnabla as nn
>>> import nnabla.functions as NF
>>> import nnabla_rl.functions as RF
>>> def objective_function(x): return -((x - 3.)**2)
>>> batch_size = 1
>>> variable_size = 1
>>> init_mean = nn.Variable.from_numpy_array(np.zeros((batch_size, state_size)))
>>> init_var = nn.Variable.from_numpy_array(np.ones((batch_size, state_size)))
>>> optimal_x, _ = RF.gaussian_cross_entropy_method(objective_function, init_mean, init_var, alpha=0)
>>> optimal_x.forward()
>>> optimal_x.shape
(1, 1)  # (batch_size, variable_size)
>>> optimal_x.d
array([[3.]], dtype=float32)

Parameters

objective_function (Callable[[nn.Variable], nn.Variable]) – objective function
init_mean (nn.Variable) – initial mean
init_var (nn.Variable) – initial variance
pop_size (int) – pop size
num_elites (int) – number of elites
num_iterations (int) – number of iterations
alpha (float) – parameter of soft update

Returns

mean of elites samples and top of elites samples

Return type

Tuple[nn.Variable, nn.Variable]

nnabla_rl.functions.triangular_matrix(diagonal: nnabla._variable.Variable, non_diagonal: Optional[nnabla._variable.Variable] = None, upper=False) → nnabla._variable.Variable[source]¶

Compute triangular_matrix from given diagonal and non_diagonal elements. If non_diagonal is None, will create a diagonal matrix.

Example

>>> import numpy as np
>>> import nnabla as nn
>>> import nnabla.functions as NF
>>> import nnabla_rl.functions as RF
>>> diag_size = 3
>>> batch_size = 2
>>> non_diag_size = diag_size * (diag_size - 1) // 2
>>> diagonal = nn.Variable.from_numpy_array(np.ones(6).astype(np.float32).reshape((batch_size, diag_size)))
>>> non_diagonal = nn.Variable.from_numpy_array(np.arange(batch_size*non_diag_size).astype(np.float32).reshape((batch_size, non_diag_size)))
>>> diagonal.d
array([[1., 1., 1.],
       [1., 1., 1.]], dtype=float32)
>>> non_diagonal.d
array([[0., 1., 2.],
       [3., 4., 5.]], dtype=float32)
>>> lower_triangular_matrix = RF.triangular_matrix(diagonal, non_diagonal)
>>> lower_triangular_matrix.forward()
>>> lower_triangular_matrix.d
array([[[1., 0., 0.],
        [0., 1., 0.],
        [1., 2., 1.]],
       [[1., 0., 0.],
        [3., 1., 0.],
        [4., 5., 1.]]], dtype=float32)

Parameters

diagonal (nn.Variable) – diagonal elements of lower triangular matrix. It’s shape must be (batch_size, diagonal_size).
non_diagonal (nn.Variable or None) – non-diagonal part of lower triangular elements. It’s shape must be (batch_size, diagonal_size * (diagonal_size - 1) // 2).
upper (bool) – If true will create an upper triangular matrix. Otherwise will create a lower triangular matrix.

Returns

lower triangular matrix constructed from given variables.

Return type

nn.Variable

nnabla_rl.functions.batch_flatten(x: nnabla._variable.Variable) → nnabla._variable.Variable[source]¶

Collapse the variable shape into (batch_size, rest).

Example

>>> import numpy as np
>>> import nnabla as nn
>>> import nnabla_rl.functions as RF
>>> variable_shape = (3, 4, 5, 6)
>>> x = nn.Variable.from_numpy_array(np.random.normal(size=variable_shape))
>>> x.shape
(3, 4, 5, 6)
>>> flattened_x = RF.batch_flatten(x)
>>> flattened_x.shape
(3, 120)

Parameters: x (nn.Variable) – N-D array
Returns: Flattened variable.
Return type: nn.Variable