BC DS

10 machine learning questions you should be able to answer.

I started experimenting with a questionnaire before interviewing candidates. Applicants submit a video or written answers to apply for the position.

Thread: Let's look at 10 of these questions.

The questionnaire solves a couple of problems:

1. Weeds out candidates that aren't ready for the job.

2. It significantly cuts down the time we spend talking synchronously.

One problem: It requires extra time from the candidate before the phone interview.
Here are some numbers on a job posting before using the questionnaire:

• 249 applicants
• 13 good applications

That's only 5%.

This sucks.
Numbers after using the questionnaire:

• 44 applicants
• 21 good applications

That's 47%.

This is much better.
I look at it this way:

Sooner or later, I'll ask these questions, and the candidate will have to answer them.

I'd rather give them space and the opportunity to think about their answers without any pressure.

Those who can't or don't want to answer won't apply. That's fine.
I'm not worried about candidates searching for answers or asking someone else.

They will fool the questionnaire, not the interview that will come later.

Remember:

• The goal of the questionnaire and the resume is just to decide who moves to an interview.
Questions must be open-ended. No gotchas.

I want to allow candidates to think creatively and elaborate on their answers.

Preferred: A video response with the answers. Fallback is written answers if the candidate is not comfortable with video.
There are 10 questions on the questionnaire.

These cover basic principles that we want to ensure candidates understand.

Every candidate receives the same questions.

Let's go over them:
1. Provide a couple of examples where you used Supervised and Unsupervised machine learning techniques.

2. Walk us through the process you follow to select the best machine learning algorithm to solve a problem.

3. Explain the trade-off between bias and variance.
4. How do you decide how you should split your dataset when working in a supervised machine learning problem?

5. How do you handle imbalanced datasets in a classification problem?

6. Explain how you would approach reducing overfitting on a model.
7. Explain three different problems where using accuracy, precision, and recall is the best metric, respectively.

8. What would you expect to see as you vary the batch size when training a neural network?
9. What would you expect to see as you vary the learning rate when training a neural network?

10. What are the advantages of a Convolutional Neural Network over a fully connected network for image classification?
I post threads like this every week. You can find them here: @svpino.

Follow along for a good bunch of practical tips and epic stories about my experience with machine learning.

More from Santiago

10 machine learning YouTube videos.

On libraries, algorithms, and tools.

(If you want to start with machine learning, having a comprehensive set of hands-on tutorials you can always refer to is fundamental.)

🧵👇

1⃣ Notebooks are a fantastic way to code, experiment, and communicate your results.

Take a look at @CoreyMSchafer's fantastic 30-minute tutorial on Jupyter Notebooks.

https://t.co/HqE9yt8TkB


2⃣ The Pandas library is the gold-standard to manipulate structured data.

Check out @joejamesusa's "Pandas Tutorial. Intro to DataFrames."

https://t.co/aOLh0dcGF5


3⃣ Data visualization is key for anyone practicing machine learning.

Check out @blondiebytes's "Learn Matplotlib in 6 minutes" tutorial.

https://t.co/QxjsODI1HB


4⃣ Another trendy data visualization library is Seaborn.

@NewThinkTank put together "Seaborn Tutorial 2020," which I highly recommend.

https://t.co/eAU5NBucbm

More from Ds

You May Also Like

Fake chats claiming to be from the Irish African community are being disseminated by the far right in order to suggest that violence is imminent from #BLM supporters. This is straight out of the QAnon and Proud Boys playbook. Spread the word. Protest safely. #georgenkencho


There is co-ordination across the far right in Ireland now to stir both left and right in the hopes of creating a race war. Think critically! Fascists see the tragic killing of #georgenkencho, the grief of his community and pending investigation as a flashpoint for action.


Across Telegram, Twitter and Facebook disinformation is being peddled on the back of these tragic events. From false photographs to the tactics ofwhite supremacy, the far right is clumsily trying to drive hate against minority groups and figureheads.


Declan Ganley’s Burkean group and the incel wing of National Party (Gearóid Murphy, Mick O’Keeffe & Co.) as well as all the usuals are concerted in their efforts to demonstrate their white supremacist cred. The quiet parts are today being said out loud.


The best thing you can do is challenge disinformation and report posts where engagement isn’t appropriate. Many of these are blatantly racist posts designed to drive recruitment to NP and other Nationalist groups. By all means protest but stay safe.