DL - Part - Basic Models
Perceptron:
- The perceptron is a fundamental building block of artificial neural networks and is the simplest form of a single-layer neural network.
- It is a binary linear classifier that takes multiple inputs, applies a weight to each, and computes a weighted sum plus a bias term.
- The weighted sum is then passed through an activation function (usually a step function) to produce the output, which is either 0 or 1.
- The perceptron learning algorithm adjusts the weights based on the prediction error (w ← w + η·(y − ŷ)·x, with learning rate η, true label y, and prediction ŷ) until it finds a decision boundary that separates the input data into the two classes.
- Perceptrons are limited to linearly separable problems and cannot represent non-linear relationships; the classic counterexample is XOR, which no single perceptron can learn. A minimal training sketch follows this list.
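Below is a minimal sketch of the perceptron learning rule in NumPy, using the step activation and weight update described above. It is trained on the AND function, which is linearly separable; the data, learning rate, and epoch count are illustrative assumptions, not prescribed values.

```python
import numpy as np

# AND function: linearly separable, so the perceptron can learn it
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])  # inputs
y = np.array([0, 0, 0, 1])                      # target labels

w = np.zeros(2)    # weights
b = 0.0            # bias term
learn_rate = 0.1   # illustrative choice

for epoch in range(10):
    for xi, target in zip(X, y):
        pred = 1 if xi @ w + b > 0 else 0   # step activation -> 0 or 1
        error = target - pred               # prediction error
        w += learn_rate * error * xi        # perceptron update rule
        b += learn_rate * error

print([1 if xi @ w + b > 0 else 0 for xi in X])  # converges to [0, 0, 0, 1]
```

Because AND is linearly separable, the weights converge to a separating boundary after a few epochs; running the same loop on XOR would never converge, which motivates the hidden layers introduced next.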
Multi-Layer Perceptron (MLP):
- The Multi-Layer Perceptron (MLP) is an extension of the perceptron that introduces one or more hidden layers between the input and output layers.
- MLPs can handle more complex patterns and non-linear relationships in data due to the introduction of non-linear activation functions (such as sigmoid, ReLU, or tanh) in the hidden layers.
- Each neuron in the hidden layers computes a weighted sum of inputs, passes it through an activation function, and propagates the output to the next layer.
- MLPs are trained with the backpropagation algorithm, which uses the chain rule to compute the gradient of the error with respect to every weight; the weights are then iteratively updated by gradient descent, as sketched in the code after this list.
- MLPs are universal function approximators: with enough hidden neurons, a single hidden layer can approximate any continuous function on a compact domain to arbitrary accuracy (the universal approximation theorem), although training is not guaranteed to find such an approximation.
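To make the forward pass and backpropagation concrete, here is a minimal NumPy sketch of an MLP with one sigmoid hidden layer trained on XOR, the function a single perceptron cannot learn. The hidden-layer size, learning rate, step count, and squared-error loss are illustrative assumptions, not a definitive implementation.

```python
import numpy as np

rng = np.random.default_rng(0)

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)  # XOR labels

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

W1 = rng.normal(size=(2, 4))   # input -> hidden weights (4 hidden units)
b1 = np.zeros(4)
W2 = rng.normal(size=(4, 1))   # hidden -> output weights
b2 = np.zeros(1)
lr = 1.0                       # illustrative learning rate

for step in range(5000):
    # forward pass: weighted sums through non-linear activations
    h = sigmoid(X @ W1 + b1)    # hidden activations
    out = sigmoid(h @ W2 + b2)  # output predictions

    # backward pass: error gradients via the chain rule (squared-error loss)
    d_out = (out - y) * out * (1 - out)   # delta at output layer
    d_h = (d_out @ W2.T) * h * (1 - h)    # delta propagated to hidden layer

    # gradient descent updates
    W2 -= lr * h.T @ d_out
    b2 -= lr * d_out.sum(axis=0)
    W1 -= lr * X.T @ d_h
    b1 -= lr * d_h.sum(axis=0)

print(out.round(3))  # typically close to [[0], [1], [1], [0]]
```

The non-linear hidden layer is what lets gradient descent drive the outputs toward the XOR targets, which the single-layer perceptron above could not achieve.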