Authors Radek Osmulski

7 days 30 days All time Recent Popular
THREAD: How is it possible to train a well-performing, advanced Computer Vision model 𝗼𝗻 π˜π—΅π—² 𝗖𝗣𝗨? πŸ€”

At the heart of this lies the most important technique in modern deep learning - transfer learning.

Let's analyze how it

2/ For starters, let's look at what a neural network (NN for short) does.

An NN is like a stack of pancakes, with computation flowing up when we make predictions.

How does it all work?

3/ We show an image to our model.

An image is a collection of pixels. Each pixel is just a bunch of numbers describing its color.

Here is what it might look like for a black and white image

4/ The picture goes into the layer at the bottom.

Each layer performs computation on the image, transforming it and passing it upwards.

5/ By the time the image reaches the uppermost layer, it has been transformed to the point that it now consists of two numbers only.

The outputs of a layer are called activations, and the outputs of the last layer have a special meaning... they are the predictions!