$f$ is a step function: $$ y = \begin{cases} f(a) &= 1\text{ if } \ge 0\\ f(a) &= 0\text{ if } a< 0\\ \end{cases} $$
$a$ is a weighted sum of inputs $x$
Sunny | Dry | $a$ | $y$ | $t$ |
---|---|---|---|---|
1 (yes) | 1 (yes) | 1 | ||
1 (yes) | 0 (no) | 0 | ||
0 (no) | 1 (yes) | 0 | ||
0 (no) | 0 (no) | 0 |
$w_0 =$ 0 | $w_1 =$ 0 | $b =$ 0 |
INPUT | OUTPUT | |
A | B | A NAND B |
0 | 0 | 1 |
0 | 1 | 1 |
1 | 0 | 1 |
1 | 1 | 0 |
Thus: Any Boolean function can be built out of Perceptrons!
A perceptron is equivalent to a decision boundary.
A straight line can separate blue vs. red
There is no straight line that can separate blue vs. red
Problems where a straight line can separate two classes are called Linearly Separable
Perceptron learning algorithm to learn to classify linearly separable points
A perceptron is equivalent to a decision boundary.
A straight line can separate blue vs. red
There is no straight line that can separate blue vs. red
Problems where a straight line can separate two classes are called Linearly Separable
Most complex problems are not linearly separable
How many hidden layers and how many units per layer do we need? The answer is at most two
Hertz, et al. 1991
Szegedy et al. 2014
Canziani et al. 2018