Activation Functions in Neural Networks

When someone decides to read more about how artificial intelligence works, the phrase "activation functions" comes up again and again.

In simple terms, activation functions are mathematical functions that determine the output of a neuron. The activation function is applied to the weighted sum of the inputs plus the bias, and the resulting output is passed on to the next layer of the neural network.
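As a rough illustration, here is a minimal NumPy sketch of a single neuron; the input values, weights, bias, and the choice of sigmoid as the activation are placeholders, not part of any specific network:

import numpy as np

# Illustrative values only: inputs, weights, and bias are made up.
x = np.array([0.5, -1.2, 3.0])   # inputs from the previous layer
w = np.array([0.8, 0.1, -0.4])   # weights of this neuron
b = 0.2                          # bias

z = np.dot(w, x) + b             # weighted sum of inputs plus bias
output = 1 / (1 + np.exp(-z))    # activation function applied to z (sigmoid here)
print(output)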


We use activation functions to introduce non-linearity into the output of a neuron, which means the neural network can learn complex functions and relationships between the input and the output.

There are different types of activation functions, and each has its own strengths and weaknesses. In this article I will compare and contrast some of the most commonly used activation functions, including binary, linear, sigmoid, tanh, ReLU, and softmax.


1-Binary Activation Function:

This is a simple function that maps every input to either 0 or 1, depending on whether it is above or below a certain threshold. The binary (step) function is rarely used in modern ANNs: it can only produce two output values, and its gradient is zero almost everywhere, so it cannot be trained with gradient-based methods.
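As a quick sketch (assuming a threshold of 0, which is just a common default), the step function can be written as:

import numpy as np

def binary_step(x, threshold=0.0):
    # 1 where the input is above the threshold, 0 otherwise
    return np.where(x > threshold, 1, 0)

print(binary_step(np.array([-2.0, 0.5, 3.0])))  # [0 1 1]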


2-Linear Activation Function:

The linear function simply returns the input value as its output, without any transformation. This function is used in regression problems, where the output is a continuous value rather than a binary or categorical one. Note that stacking layers with only linear activations still produces a linear mapping, so the linear activation is typically used in the output layer rather than in hidden layers.
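In code the linear (identity) activation is trivial; this is just a sketch:

def linear(x):
    # Identity: the output is the input, unchanged
    return x

print(linear(2.7))   # 2.7
print(linear(-1.3))  # -1.3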


3-Sigmoid Activation Function:

The sigmoid function is a common choice for binary classification problems, where the output is interpreted as the probability of belonging to one of two classes. It maps the input to a value between 0 and 1 using the following formula:

f(x) = 1 / (1 + exp(-x))

The sigmoid function gives an S-shaped curve, which allows it to introduce non-linearity into the output of the neural network. However, it suffers from the problem of vanishing gradients: the gradient becomes very small when the output saturates near 0 or 1 (that is, for large positive or negative inputs), which can slow down the learning process.
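A minimal sketch of the sigmoid and its derivative, using made-up input values to show how the gradient shrinks for large inputs:

import numpy as np

def sigmoid(x):
    return 1 / (1 + np.exp(-x))

def sigmoid_grad(x):
    # Derivative of the sigmoid: s * (1 - s)
    s = sigmoid(x)
    return s * (1 - s)

for x in [0.0, 2.0, 5.0, 10.0]:
    print(x, sigmoid(x), sigmoid_grad(x))
# The gradient falls from 0.25 at x = 0 to roughly 4.5e-05 at x = 10,
# which is the vanishing-gradient effect described above.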


4-Tanh Activation Function:

The tanh function is similar to the sigmoid function, but it maps the input to a value between -1 and 1, using the following formula:

f(x) = (exp(x) - exp(-x)) / (exp(x) + exp(-x))

Like the sigmoid function, the tanh function introduces non-linearity into the output of the neural network, but it has a stronger gradient, which can help speed up the learning process. However, it suffers from the same problem of vanishing gradients as the sigmoid function.
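For comparison, a small sketch of tanh and its derivative (1 - tanh(x)^2), again with arbitrary inputs:

import numpy as np

def tanh_grad(x):
    # Derivative of tanh: 1 - tanh(x)^2
    return 1 - np.tanh(x) ** 2

for x in [0.0, 2.0, 5.0]:
    print(x, np.tanh(x), tanh_grad(x))
# At x = 0 the gradient is 1.0 (versus 0.25 for the sigmoid),
# but it still vanishes for large |x|.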



5-ReLU Activation Function:

The ReLU (Rectified Linear Unit) activation function is currently the most popular choice for deep learning problems, and is used in most modern neural networks.

Formula:

f(x) = max(0, x)

It returns the input value if it is greater than 0, and returns 0 otherwise. The ReLU function has a simple and efficient implementation and introduces non-linearity into the output of the neural network. It is popular because it allows faster training of deep neural networks compared to other activation functions, but it can suffer from the "dying ReLU" problem, where a large number of neurons become inactive and stop contributing to the network's output.
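A minimal NumPy sketch of ReLU and its gradient; the zero gradient for negative inputs is what makes the "dying ReLU" problem possible:

import numpy as np

def relu(x):
    return np.maximum(0, x)

def relu_grad(x):
    # 1 for positive inputs, 0 otherwise; a neuron whose inputs stay
    # negative receives no gradient and stops learning ("dying ReLU")
    return np.where(x > 0, 1.0, 0.0)

x = np.array([-3.0, -0.5, 0.0, 2.0])
print(relu(x))       # [0. 0. 0. 2.]
print(relu_grad(x))  # [0. 0. 0. 1.]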


6-Softmax Activation Function:

The softmax activation function is commonly used in the output layer of neural networks for multi-class classification problems.

Formula:

f(x_i) = exp(x_i) / sum_j exp(x_j)

It transforms the output of each neuron into a probability distribution over all possible classes. The function ensures that the sum of the probabilities of all classes is equal to 1, making it useful for determining the most likely class. Softmax is often used with the cross-entropy loss function to train neural networks for classification tasks.
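A small sketch of softmax over some made-up logits; subtracting the maximum before exponentiating is a standard numerical-stability trick and does not change the result:

import numpy as np

def softmax(x):
    e = np.exp(x - np.max(x))  # shift for numerical stability
    return e / e.sum()

logits = np.array([2.0, 1.0, 0.1])  # example raw outputs of the last layer
probs = softmax(logits)
print(probs)        # roughly [0.659, 0.242, 0.099]
print(probs.sum())  # 1.0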
