11 key concepts of Machine Learning.

β€” Supervised Learning Edition β€”

πŸ§΅πŸ‘‡

😜

Before starting, remember that, if you follow me, one of your enemies will be immediately destroyed (and you'll get to read more of these threads, of course.)

And if you don't follow me, well, you just hurt my feelings.

😜
1. Labels

(Also referred to as "y")

The label is the piece of information that we are predicting.

For example:

- the animal that's shown in a picture
- the price of a house
- whether a message is spam or not

πŸ‘‡
2. Features

(Also referred to as "x")

These are the input variables to our problem. We use these features to predict the "label."

For example:

- pixels of a picture
- number of bedrooms of a house
- square footage of a house

πŸ‘‡
3. Samples

(This is also known as "examples.")

A sample is a particular instance of data (features or "x.") It could be "labeled" or "unlabeled."

πŸ‘‡
4. Labeled sample

Labeled samples are used to train and validate the model. These are usually represented as (x, y), where "x" is a vector containing all the features, and "y" is the corresponding label.

For example, a labeled sample could be:

([3, 2, 1500], 350000)
5. Unlabeled sample

Unlabeled samples contain features, but they don't contain the label: (x, ?)

We usually use a model to predict the labels of unlabeled samples.

πŸ‘‡
6. Model

A model defines the relationship between features and the label.

You can think of a model as a set of rules that, given certain features, determines the corresponding label.

For example, given the # of bedrooms, bathrooms, and square footage, we get the price.

πŸ‘‡
7. Training

Training is a process that builds a model.

We show the model labeled samples during training and allow the model to gradually learn the relationships between features and the label.

πŸ‘‡
8. Validation

Validation is the process that lets us know whether a model is any good.

Usually, we run a set of (unseen) labeled samples through a model to ensure that it can predict the labels.

πŸ‘‡
9. Inference

Inference is the process of applying a trained model to unlabeled samples to obtain the corresponding labels.

In other words, "inference" is the process of making predictions using a model.

πŸ‘‡
10. Regression

A regression model predicts continuous values, for example:

- the value of a house
- the price of a stock
- tomorrow's temperature

πŸ‘‡
11. Classification

A classification model predicts discrete values, for example:

- the picture is showing a dog or a cat
- the message is spam or not
- the forecast is sunny or overcast

More from Santiago

You gotta think about this one carefully!

Imagine you go to the doctor and get tested for a rare disease (only 1 in 10,000 people get it.)

The test is 99% effective in detecting both sick and healthy people.

Your test comes back positive.

Are you really sick? Explain below πŸ‘‡

The most complete answer from every reply so far is from Dr. Lena. Thanks for taking the time and going through


You can get the answer using Bayes' theorem, but let's try to come up with it in a different β€”maybe more intuitiveβ€” way.

πŸ‘‡


Here is what we know:

- Out of 10,000 people, 1 is sick
- Out of 100 sick people, 99 test positive
- Out of 100 healthy people, 99 test negative

Assuming 1 million people take the test (including you):

- 100 of them are sick
- 999,900 of them are healthy

πŸ‘‡

Let's now test both groups, starting with the 100 people sick:

▫️ 99 of them will be diagnosed (correctly) as sick (99%)

▫️ 1 of them is going to be diagnosed (incorrectly) as healthy (1%)

πŸ‘‡

More from Machine learning

Happy 2⃣0⃣2⃣1⃣ to all.πŸŽ‡

For any Learning machines out there, here are a list of my fav online investing resources. Feel free to add yours.

Let's dive in.
⬇️⬇️⬇️

Investing Services

βœ”οΈ @themotleyfool - @TMFStockAdvisor & @TMFRuleBreakers services

βœ”οΈ @7investing

βœ”οΈ @investing_city
https://t.co/9aUK1Tclw4

βœ”οΈ @MorningstarInc Premium

βœ”οΈ @SeekingAlpha Marketplaces (Check your area of interest, Free trials, Quality, track record...)

General Finance/Investing

βœ”οΈ @morganhousel
https://t.co/f1joTRaG55

βœ”οΈ @dollarsanddata
https://t.co/Mj1owkzRc8

βœ”οΈ @awealthofcs
https://t.co/y81KHfh8cn

βœ”οΈ @iancassel
https://t.co/KEMTBHa8Qk

βœ”οΈ @InvestorAmnesia
https://t.co/zFL3H2dk6s

βœ”οΈ

Tech focused

βœ”οΈ @stratechery
https://t.co/VsNwRStY9C

βœ”οΈ @bgurley
https://t.co/NKXGtaB6HQ

βœ”οΈ @CBinsights
https://t.co/H77hNp2X5R

βœ”οΈ @benedictevans
https://t.co/nyOlasCY1o

βœ”οΈ

Tech Deep dives

βœ”οΈ @StackInvesting
https://t.co/WQ1yBYzT2m

βœ”οΈ @hhhypergrowth
https://t.co/kcLKITRLz1

βœ”οΈ @Beth_Kindig
https://t.co/CjhLRdP7Rh

βœ”οΈ @SeifelCapital
https://t.co/CXXG5PY0xX

βœ”οΈ @borrowed_ideas
Really enjoyed digging into recent innovations in the football analytics industry.

>10 hours of interviews for this w/ a dozen or so of top firms in the game. Really grateful to everyone who gave up time & insights, even those that didnt make final cut πŸ™‡β€β™‚οΈ https://t.co/9YOSrl8TdN


For avoidance of doubt, leading tracking analytics firms are now well beyond voronoi diagrams, using more granular measures to assess control and value of space.

This @JaviOnData & @LukeBornn paper from 2018 referenced in the piece demonstrates one method
https://t.co/Hx8XTUMpJ5


Bit of this that I nerded out on the most is "ghosting" β€” technique used by @counterattack9 & co @stats_insights, among others.

Deep learning models predict how specific players β€” operating w/in specific setups β€” will move & execute actions. A paper here: https://t.co/9qrKvJ70EN


So many use-cases:
1/ Quickly & automatically spot situations where opponent's defence is abnormally vulnerable. Drill those to death in training.
2/ Swap target player B in for current player A, and simulate. How does target player strengthen/weaken team? In specific situations?

You May Also Like

TradingView isn't just charts

It's much more powerful than you think

9 things TradingView can do, you'll wish you knew yesterday: 🧡

Collaborated with @niki_poojary

1/ Free Multi Timeframe Analysis

Step 1. Download Vivaldi Browser

Step 2. Login to trading view

Step 3. Open bank nifty chart in 4 separate windows

Step 4. Click on the first tab and shift + click by mouse on the last tab.

Step 5. Select "Tile all 4 tabs"


What happens is you get 4 charts joint on one screen.

Refer to the attached picture.

The best part about this is this is absolutely free to do.

Also, do note:

I do not have the paid version of trading view.


2/ Free Multiple Watchlists

Go through this informative thread where @sarosijghosh teaches you how to create multiple free watchlists in the free


3/ Free Segregation into different headers/sectors

You can create multiple sections sector-wise for free.

1. Long tap on any index/stock and click on "Add section above."
2. Secgregate the stocks/indices based on where they belong.

Kinda like how I did in the picture below.