pytorch / 2 / generated / torch.nn.functional.gelu.html

torch.nn.functional.gelu

torch.nn.functional.gelu(input, approximate='none') → Tensor

When the approximate argument is ‘none’, it applies element-wise the function GELU ( x ) = x Φ ( x ) \text{GELU}(x) = x * \Phi(x)

where Φ ( x ) \Phi(x) is the Cumulative Distribution Function for Gaussian Distribution.

When the approximate argument is ‘tanh’, Gelu is estimated with

GELU ( x ) = 0.5 x ( 1 + Tanh ( 2 / π ( x + 0.044715 x 3 ) ) ) \text{GELU}(x) = 0.5 * x * (1 + \text{Tanh}(\sqrt{2 / \pi} * (x + 0.044715 * x^3)))

See Gaussian Error Linear Units (GELUs).

© 2024, PyTorch Contributors
PyTorch has a BSD-style license, as found in the LICENSE file.
https://pytorch.org/docs/2.1/generated/torch.nn.functional.gelu.html