BC MACHINE LEARNING

Saved by @zmbnski

Roger Grosse
@RogerGrosse 7 years, 7 months ago 3765 views

Save to PDF Share See On Twitter

Important paper from Google on large batch optimization. They do impressively careful experiments measuring # iterations needed to achieve target validation error at various batch sizes. The main "surprise" is the lack of surprises. [thread]

https://t.co/7QIx5CFdfJ

The paper is a good example of lots of elements of good experimental design. They validate their metric by showing lots of variants give consistent results. They tune hyperparamters separately for each condition, check that optimum isn't at the endpoints, and measure sensitivity.

They have separate experiments where the hold fixed # iterations and # epochs, which (as they explain) measure very different things. They avoid confounds, such as batch norm's artificial dependence between batch size and regularization strength.

When the experiments are done carefully enough, the results are remarkably consistent between different datasets and architectures. Qualitatively, MNIST behaves just like ImageNet.

Importantly, they don't find any evidence for a "sharp/flat optima" effect whereby better optimization leads to worse final results. They have a good discussion of experimental artifacts/confounds in past papers where such effects were reported.

The time-to-target-validation is explained purely by optimization considerations. There's a regime where variance dominates, and you get linear speedups w/ batch size. Then there's a regime where curvature dominates and larger batches don't help. As theory would predict.

Incidentally, this paper must have been absurdly expensive, even by Google's standards. Doing careful empirical work on optimizers requires many, many runs of the algorithm. (I think surprising phenomena on ImageNet are often due to the difficulty of running proper experiments.)

More from Machine learning

Abhishek Thakur
@abhi1thakur

In this thread, I will tell you how to learn python for data science in 1 hour 👇 1/N

It's not possible 2/N

It's not possible 3/N

It's not possible 4/N

It's not possible 5/N

Anson Horton
@AnsonHorton

1/ It's probably not the first thing you think of, but when we started .NET (COM+) in the late 90s, C# didn't exist yet. We were working on it at the same time as the CLR and the framework. So, you might wonder, what language was being used to generate IL and write the BCL?

2/ The answer is a language that we called SMC that Peter Kukol wrote the compiler for. Peter is a flat out amazing engineer and wrote the core parts of the compiler in just a few days. This unblocked the framework team, allowed vetting the runtime and interpreter, etc.

3/ SMC was a trimmed down C++ variant and the compiler was written in itself (i.e. SMC). It didn't support things like destructors, multiple inheritance, virtual base classes, etc. But, overall it enabled progress that would have otherwise been stalled.

4/ It did introduce a problem for the C# language and compiler team though. Every day more and more code was written in SMC, a language that we did not intend to ship externally. Designing a new language is hard (well, at least, a general purpose one that becomes popular).

5/ So there was a constant balancing act between laying enough of the foundation of the language and the C# compiler to replace SMC and making sure that the design lived up to the goals and ideals the team was striving for. If we waited too long to adopt C#,

Advertisement

Pratham Prasoon
@PrasoonPratham

Can you get a job in data science and machine learning without a college degree?

🧵👇

Short Answer: Yes.

Long Answer, keep reading 👇

(Advice from industry experts who I talked to.)

Companies are looking for people who add value.
In order to add value, you'll need skills. Simple as that.

Get the skills to provide value and you'll get the job.

In the age of the internet where everything is pretty much free, why do college degrees matter?

A college degree makes it easier to get the skills to get a job in machine learning or data science.

College degrees also help you make many useful connections and provides many opportunities in the form of internships and whatnot.

College degrees have their own place

$Prashant \U0001f4da$

Prashant 📚
@capeandcode

Ever heard of Autoencoders?

The first time I saw a Neural Network with more output neurons than in the hidden layers, I couldn't figure how it would work?!

#DeepLearning #MachineLearning
Here's a little something about them: 🧵👇

Autoencoders are unsupervised neural networks whose architecture you can picture as two funnels connect from the narrow ends.

These networks are primary focus for compression tasks of data in Machine Learning.

We feed them the data so that they can learn the most important features, a smaller representation while keep the integrity of the data.

Later when someone needs, can just take that small representation and recreate the original, just like a zip file.📥

Being unsupervised, they require no labels.
Our inputs and outputs are same and a simple euclidean distance can be used as a loss function for measuring the reconstruction.

Of course, we wouldn't expect a perfect reconstruction.

We can think of an autoencoder having two components, encoder and decoder, represented by the below equations:

We are just trying to minimize the L here. All the backpropagation rules still hold.

Pratham Prasoon
@PrasoonPratham

There are several skills needed for learning machine learning that no one talks about.

Here are some of them.
(from what I've learned over the past two years)
🧵👇

This thread aims to introduce to you some of the skills often ignored for learning machine learning but are actually very important.

These skills will enable you to learn concepts quickly and more efficiently in this field.

(2 / 9)

1⃣ Reading

Probably one of the most underrated skills on this list.

In machine learning, you HAVE to read a lot of articles, papers, documentation, and whatnot.

It is mostly due to the theoretical nature of this field that reading is an important skill to have.

(3 / 9)

2⃣ Strong fundamentals in programming

Machine learning is just a lot of programming mixed with math and data.

Having clear programming fundamentals is crucial for this field.

I highly suggest you learn about these if you are using Python for machine learning.👇

(4 / 9)

- Object oriented programming in Python :Classes, Objects, Methods
- Lists & List functions
- Dunder Methods
- List comprehension
- List slicing
- String formatting
- List, Dictionaries & Tuples
- *args,**kwargs

(5 / 9)

You May Also Like

Jaya_Upadhyaya
@Jayalko1

SRI BANASHANKARI AMMAN, BANGALORE
#bharatmandir

Located in Southern Bangalore, this is one of the most popular places of worship in Bangalore City.
Devoted to Devi Banashankari, this shrine is visited specially during Rahukala which is believed to be an unfavorable time.

Somanna Shetty, a great devotee of Devi, brought this murti from Badami’s Banshankari Amma Kshetram.
It is said that many years ago, three ladies visited this shrine and requested the pujari to conduct puja for them. Since it was the inauspicious rahu kaal, the pujari refused.

However, upon their insistence, he conducted the puja. When he returned with prasadam, the ladies vanished. This made the people believe that they were none other than the Devi herself. Since then, the people perform poojas here during the Rahukaal.
@GunduHuDuGa

People use lemons as a diya. Lemons are cut from the centre and their pulps are removed, for pouring oil in them.
It is strongly believed that Devi removes all the troubles from the life of her devotees during the Rahukala although it is considered inauspicious for any pooja.

Ramblings of a Sikh
@RamblingSingh

1/ Jassa Singh Ahluwalia & his role in the history of Shri Harmandir Sahib (A Thread)

2/ Jassa Singh was born in 1720/21 at a village called Ahlu in the Lahore District of Punjab.

3/ Jassa Singh was educated in Delhi, in Persian, Arabic & more, under the care & supervision of Mata Sundari Ji.

Mata Sundari Gurdwara in Delhi.

4/ In 1733, Jassa Singh's mama, Thag Singh (ਤਾਗ ਸਿੰਘ), requested Mata Sundari Ji's permission to take Jassa Singh back to Punjab.

5/ At first Mata Ji refused but eventually gave permission for Jassa Singh to go with his mama, giving Jassa Singh a blessing and weapons, informing him, it was now the right time to take up arms.

Advertisement

$\U0001d40d\U0001d41e\U0001d421\U0001d41a \U0001f1ee\U0001f1f3$

𝐍𝐞𝐡𝐚 🇮🇳...
@Neh_hope_faith

THREAD: What is Golden Crescent and Golden Triangle and how it impacts India?

Golden Crescent region of South Asia - comprises of Afghanistan, Iran, and Pakistan - which is a principal global site for opium production and distribution.

Golden Triangle is the area where the borders of Thailand, Laos and Myanmar meet at the confluence of the Ruak and Mekong rivers. The Golden Triangle is called so because of the triangular shape formed by the locations of New Delhi, Agra and Rajasthan on a map.

India is wedged bw world’s 2 largest areas of illicit opium production, Golden Crescent & Golden Triangle, that has made Indian borders vulnerable to drug trafficking. It flows from 2 fronts, NW & NE, that possesses dual concern - violation of border & threat to national security

- Golden Crescent is world’s leading opium and cannabis producing region since 1983.

- Its proximity to Indo-Pak border has worsened the trafficking of hashish and heroin into Indian soil

- Trafficking happens through the border states of J&K, Punjab, Rajasthan and Gujarat.

- Golden Triangle is leading opium producing region of the South-east Asia, also an ancient narcotics supply route to North America & Europe.

- Myanmar-Northeast nexus is recognized by NCB as the root of heroin & chemical drugs

- Myanmar produces 80% of heroin produced in world

$Andrey \uf8ff Azimov$

Andrey  Azimov
@AndreyAzimov

The best way to know if people like your product is NOT:

- Have a lot of emails in the waiting list
- Become #1 on PH
- Become #1 of HN or Reddit
- Have people that saing "I'll pay"
- Have a lot of free users

The best way to know if they're actually pay for it.

The only thing that matters is your product providing a value, and you can't know this until people will pay money for it.

We can have a first signal of a product/market fit after the launch by check how many sales we've got in the first 24h. Some scale:

200+ This has a high potential.
100+: This has a potential.
50+: Some people need it.
10+: People almost don't need it.
0+: People don't need it.

This is for a single time payment product. For a subscription probably it should be less because people hate subscription.

And obviously, you should have a lot of traffic like 3-5k+.

Here is a stat for my products (macOS apps) for the first 24h after the launch:

Brittany Packnett
@MsPackyetti

We should stop confusing what someone was doing when we became aware of them with all they are capable of, interested in, or ever hope to be

Let folks have their many talents, interests and gifts. Life is far more fun with variety, loves.

A lot of folks have come to know me as an activist & I’m grateful that folks care to know me at all.

But I wasn’t born in 2014. I was a whole teacher, executive, policy person, speaker, arts and culture lover, reader, writer, woman of faith, fashion and more before 4 yrs ago 🤷🏾‍♀️

We rightfully complain that marginalized people are not allowed to be fully human.

But we internalize and transfer our oppression daily. It’s a smog. We all breathe it in & act it out.

And then tell WoC “girl ain’t you supposed to be a _______? Why you doing ____?”

Can I live?

And don’t go reading anything personal into this-this isn’t about me necessarily and it’s no subtweet (I try hard not to do that.)

I’ve just been observing that behavior more and more lately. Especially when it comes to marginalized folks.

Evolution should be our aspiration.

“Can’t knock the hustle” should be our anthem.

As long as someone isn’t bringing active and continual harm, why can’t they explore their many sides?