Return to Article Details Theoretical Analysis of the Universal Approximation Properties of GELU in Neural Networks Download Download PDF