Deep Learning Interviews book: Hundreds of fully solved job interview questions (github.com)

582 points by piccogabriele 4 years ago | 147 comments

angarg12 4 years ago |

I have been working as an ML Engineer for a few years now and I am baffled by the bar to entry for these positions in the industry.

Not only do I need to perform at the Software Engineer level expected for the position (with your standard leetcode-style interviews), but I also need to pass extra ML-specific (theory and practice) rounds. Meanwhile, the vast majority of my work consists of getting systems production-ready and hunting bugs.

If I have to jump through so many hoops when changing jobs I'll seriously consider a regular non-ML position.

dekhn 4 years ago | |

there's a bunch of gatekeeping to get into ML. Part of it is that ML people don't want non-ML people to know just how much of what they do is drudgery and how little of it is exciting math, or have competition from people with similar skills. And those roles come with a lot of prestige.

I went through all that and am a SWE again instead of an ML engineer. The one thing I learned from all that? "The very best models are distilled from postdoc tears".

Jensson 4 years ago | | |

Getting state of the art performance in ML requires a lot of intuition about equations though. I've seen some of the top ML engineers work at Google, they all have a really good understanding of math, how formulas translate into measurable results etc. An ML education or research background seems less important, if you have that from studying physics or math or anything then it still translates.

I feel the biggest problem for people without an ML background is that you'd think "I don't know what I'm doing, I can't get hired for this job!", but fact is that people with ML backgrounds mostly don't know what they are doing either. They just get standard results by applying standard libraries, any programmer with some math skills could do the same, it is no harder than learning a frontend or backend framework, people just think it would be harder so they lack confidence about it. There are some gotchas you got to learn, but there are a lot of gotchas in both backend and frontend as well.

godelski 4 years ago | | |

I agree with you, but I also wish more ML people knew more math. Though I think there's a difference between research and production (I'm in research).

HeavyStorm 4 years ago | |

I, on the other hand, am baffled by the lack of basic engineering skills of most ML workers at my company. Their code is unmaintainable, and they seem to lack the usual problem solving skills I look for in software engineers.

Then again, my company's business model leads to terrible hires anyway.

bronxbomber92 4 years ago | |

This is the same for any specialized software engineering role. Compilers, GPGPU, embedded systems, computer graphics, image processing, etc. In an interview panel for any of these roles, you will be expected to be a competent software engineer and have domain knowledge about the sub-field.

capdeck 4 years ago | |

> ... I'll seriously consider a regular non-ML position.

What about asking for more money at the end? Multi-stage complex interview process eliminates more candidates. Some, like you say, will opt for a developer gig instead, probably because ML wasn't something they were interested in to begin with. That narrows down the list of candidates even more. Either "play the game" and ask for more money or don't play the game at all. Let employers pay extra for polished candidates.

angarg12 4 years ago | | |

If what I want is money I think I'm better off getting competing offers as a regular Software Engineer and pumping the numbers.

ivanamies 4 years ago | |

As an "ML Engineer," this is both true and very funny

emotional_fool 4 years ago | |

And most likely will be paid <= software-engineers

coconut_man 4 years ago | |

It's just like Software Engineers having to pass a leetcode round and a system design round. I also doubt that ML theory and practice is much harder than system design (theory and practice). Besides that, you get paid more than a Software Engineer.

angarg12 4 years ago | | |

I can tell you we are not paid more than Software Engineers, but that might be only my company.

One startup asked me this. They gave me a very vague problem statement, and in 2 days I had to find a couple of recent articles relevant to the problem and prepare a presentation explaining my solution and justifying my decisions.

1970-01-01 4 years ago |

This book has fun problems! Example:

During the cold war, the U.S.A developed a speech to text (STT) algorithm that could theoretically detect the hidden dialects of Russian sleeper agents. These agents (Fig. 3.7), were trained to speak English in Russia and subsequently sent to the US to gather intelligence. The FBI was able to apprehend ten such hidden Russian spies and accused them of being "sleeper" agents.

The Algorithm relied on the acoustic properties of Russian pronunciation of the word (v-o-k-s-a-l) which was borrowed from English V-a-u-x-h-a-l-l. It was alleged that it is impossible for Russians to completely hide their accent and hence when a Russian would say V-a-u-x-h-a-l-l, the algorithm would yield the text "v-o-k-s-a-l". To test the algorithm at a diplomatic gathering where 20% of participants are Sleeper agents and the rest Americans, a data scientist randomly chooses a person and asks him to say V-a-u-x-h-a-l-l. A single letter is then chosen randomly from the word that was generated by the algorithm, which is observed to be an "l". What is the probability that the person is indeed a Russian sleeper agent?
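For anyone who wants to check their answer, here is a minimal sketch of one way to work the problem with Bayes' rule. It assumes (the problem statement does not spell this out) that the algorithm outputs "vauxhall" for an American and "voksal" for a sleeper agent, and that the letter is drawn uniformly at random from the output. The function and variable names are my own, not the book's.

```python
from fractions import Fraction


def letter_likelihood(word: str, letter: str) -> Fraction:
    """P(letter | word), picking one character of the word uniformly at random."""
    return Fraction(word.count(letter), len(word))


def posterior_agent(prior_agent: Fraction, agent_word: str,
                    american_word: str, letter: str) -> Fraction:
    """Bayes' rule: P(agent | observed letter)."""
    p_letter_given_agent = letter_likelihood(agent_word, letter)
    p_letter_given_american = letter_likelihood(american_word, letter)
    numerator = prior_agent * p_letter_given_agent
    evidence = numerator + (1 - prior_agent) * p_letter_given_american
    return numerator / evidence


# Assumed outputs: "voksal" for an agent (1 of 6 letters is "l"),
# "vauxhall" for an American (2 of 8 letters are "l").
posterior = posterior_agent(Fraction(1, 5), "voksal", "vauxhall", "l")
print(posterior)  # 1/7
```

Under those assumptions, seeing an "l" actually lowers the probability from the 20% prior to about 14%, because "l" is more frequent in "vauxhall" than in "voksal".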

mcemilg 4 years ago |

ML/DS positions are highly competitive these days. I don't get why ML positions require harder interview preparation than other CS positions when you do similar things. People expect you to know a lot of theory, from statistics, probability, and algorithms to linear algebra. I am OK with knowing the basics of these topics, which are the foundations of ML and DL. But I don't get asking about eigenvectors and challenging algorithm problems at the same time for an ML Engineering position, when you have already proven yourself with a Master's degree and enough professional experience. I am not defending my PhD there. We will just build some DL models, maybe we will read some DL papers and maybe try to implement some of those. The theory is only 10% of the job; the rest is engineering, data cleaning, etc. Honestly, I am looking for a soft way to get back to Software Engineering.

light_hue_1 4 years ago |

I've interviewed well over 100 people for DL/ML positions. This may be a good roadmap to what some people ask, but it's a terrible guide to what you should ask. It's like a collection of class exam questions.

Just as in programming, the world is full of people who can recite facts but don't understand them. There is no point in asking what an L1 norm is and asking for its equation. Or say, giving someone the C++ code that corresponds to computing the norm of a vector and asking them "what does this do". Or even worse, showing them some picture of some cross-validation scheme and asking them to name it. Yes, your candidates should be able to do this, but positive answers to these kinds of questions are nearly useless. These are the kinds of questions you get answers to by Googling.

It's far more critical to know what your candidate can do, practically. Create a hypothetical dataset from your domain where the answer is that they need to use an L1 norm. Do they realize this? Do they even realize that the distance metric matters? Are they proposing reasonable distance metrics? Do they understand what goes wrong with different distance metrics? etc. Or problems where they need to use a network but say, padding matters a lot. Or where the particulars of cross validation matter a lot.
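To make the "distance metric matters" point concrete, here is a toy sketch in which the nearest neighbour of a query flips depending on whether you use the L1 or the L2 norm. The points are made up purely for illustration and the helper names are hypothetical.

```python
import math


def l1_distance(a, b):
    """Manhattan (L1) distance: sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def l2_distance(a, b):
    """Euclidean (L2) distance: square root of summed squared differences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest(query, candidates, distance):
    """Return the name of the candidate closest to the query under `distance`."""
    return min(candidates, key=lambda name: distance(query, candidates[name]))


# Made-up toy data: "a" is off along the diagonal, "b" is off along one axis.
query = (0.0, 0.0)
candidates = {"a": (3.0, 3.0), "b": (0.0, 5.0)}

nearest_l1 = nearest(query, candidates, l1_distance)  # "b": 5 < 6
nearest_l2 = nearest(query, candidates, l2_distance)  # "a": ~4.24 < 5
print(nearest_l1, nearest_l2)
```

A candidate who can explain why the two answers differ, and which one suits the data at hand, is telling you more than one who can only write down the formula.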

This also gives you depth. "name this cross validation scheme" gives you a binary answer "yes, they can do it, or no they can't" And you're done. If you have a hypothetical dataset, you can keep prodding. "Ok, but how about if I unbalance the data" or "what if we now need to fine tune" or "what if the payoffs for precision and recall change in our domain", "what if my budget is limited", etc. It also lets you transition smoothly to other kinds of questions. And to discover areas of deeper expertise than you expected. For example, even for the cross validation questions, if you ask that binary question, you might never discover that a candidate knows about how to use generalized cross validation, which might actually be very useful for your problem.

The uninformative tedious mess that we see in programming interviews? This is the equivalent for ML/DL interviews!

jstx1 4 years ago |

Data science and ML interviews can be tough because it's very difficult to prepare for everything and cover all the theory. A lot of the value you add comes from knowing the theory so it's understandable to test it but it's still hard to prepare well. And you have a take-home and/or LC style problem(s) in addition to the theory interview.

minimaxir 4 years ago | |

The hard questions in DS/ML interviews I've received over the years aren't the theory questions (which I rarely get asked), but the trick SQL questions that often depend on obscure syntax and/or dialect-specific features, or "implement binary search" when I'm not in the mindset for that as that isn't what DS/ML is in the real world.

jstx1 4 years ago | | |

I think they're fine as long as you know the format and have an opportunity to prepare or just get in the right mindset for it. And some things (like binary search) should be easy to write anyway.
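For reference, this is roughly what I'd expect on the whiteboard for that kind of question: a minimal iterative sketch, assuming the input is already sorted in ascending order. The names and signature are just illustrative.

```python
from typing import Optional, Sequence


def binary_search(items: Sequence[int], target: int) -> Optional[int]:
    """Return an index of `target` in sorted `items`, or None if it is absent."""
    lo, hi = 0, len(items) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        if items[mid] == target:
            return mid
        if items[mid] < target:
            lo = mid + 1   # target can only be in the right half
        else:
            hi = mid - 1   # target can only be in the left half
    return None


print(binary_search([1, 3, 5, 7, 9], 7))  # 3
print(binary_search([1, 3, 5, 7, 9], 4))  # None
```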

The SQL questions can also be a symptom of the type of job - Facebook's first data science round focuses a lot on SQL but that's because it's a very product/analytics/decision-making focused role without that much coding or ML. With data science you have to be more careful about these things when searching for a job; you can't just use the job title as a descriptor.

nerdponx 4 years ago | | |

I had an "implement binary search" interview once. I came away feeling like I was being interviewed for the wrong role. I don't understand how anyone could think that's an appropriate interview task for a DS position.

Raphaellll 4 years ago |

I actually bought this as a physical book on Amazon. Naturally it came as a print-on-demand book. Unfortunately it has many problems in this format. E.g. the lack of margins makes it hard to read the end of sentences towards the gutter. Also some text is pushed into each other. Not sure what source file format you have to provide to Amazon, but it's certainly not the pdf provided in the repo.

Edit:

It seems the overlapping text also occurs on some pdf readers: https://github.com/BoltzmannEntropy/interviews.ai/issues/2

ruph123 4 years ago | |

The last 5 textbooks I bought new on amazon had similar problems. Totally unacceptable. I started returning them and (because most were exclusive to amazon) started buying them new on ebay with great results.

Raphaellll 4 years ago | | |

It's really a hit and miss. This [1] book also came as print-on-demand but looks perfectly fine. Good layout and clean colors.

[1] https://mml-book.github.io/book/mml-book.pdf

time_to_smile 4 years ago |

Fisher Information is under the "Kindergarten" section?

Maybe I've just been interviewing at the wrong places, I'd be very curious if anyone here has been asked to even explain Fisher information in any DS interview?

It's not that Fisher information is a particularly tricky topic, but I certainly wouldn't put it as a "must know" for even the most junior of data scientists. Not that I wouldn't mind living in a world where this was the case... just not sure I live in the same world as the authors.

sdenton4 4 years ago | |

When I was a mathematician it was pretty common to make jokes whenever we actually had to evaluate an integral, along the lines of 'think back to your elementary-school calculus...'

lp251 4 years ago | | |

“integrate by parts, like you learned in middle school”

tf middle school did you go to?!

spekcular 4 years ago |

This is amazing. I am ecstatic.

I've been looking for something exactly like this – and it's executed better than I could have imagined.

(Needs a good proofreader still, though! Also, whatever custom LaTeX template the authors are using is misbehaving a bit in various places. Still great content.)

lvl100 4 years ago |

In my 20s, I was doing data science at a very high level spanning multiple disciplines. Truly state of the art. I would like to think I was quite good at my job.

I am 99% certain I would not have passed the interview bars set today. More specifically, the breadth they expect you to master is very puzzling (and seemingly unrealistic).

master_yoda_1 4 years ago |

My problem with this line of numerous shallow books and courses is:

  1) They are written by people who have no experience in industry or who are not working in "real" machine learning jobs

  2) They think the standard in industry is pretty low and any BS works. For example, the concept of the "Lagrange multiplier" is missing from the book. One needs this concept to understand training convergence guarantees.

la_fayette 4 years ago |

Side question: using arXiv to distribute such interview questions seems inappropriate to me. Is there some SEO trick behind it?

seaman1921 4 years ago | |

Yes, I was also surprised that this is hosted on arXiv. Can someone explain why this is OK? It is definitely not a scholarly article.

pugio 4 years ago |

I'm really enjoying the discussion here, as I've been thinking a lot about what a full modern ML/DS curriculum would look like.

I currently work for a non-profit investigating making a free high quality set of courses in this space, and would love to talk to as many people either working in ML/DS or looking to get into the field. (I have ideas but would prefer to ground them in as many real-world experiences as I can collect.)

If anyone here wouldn't mind chatting about this, or even just sharing an experience or opinion, please drop me an email (in my profile).

EDIT: We already have Intro to DS and a Deep RL sequence far along in our pipeline, but are looking to see where we can help the most with available resources.

I really appreciate this Interviews book as an example of what topics might be necessary (and at what level), taking into account the qualifying discussion here, of course.

mrfusion 4 years ago |

Are there deep learning roles that focus more on software engineering and using the tools rather than having a deep understanding of statistics?

time_to_smile 4 years ago | |

> having a deep understanding of statistics?

As someone with a strong background in statistics, please tell me where I can find DS jobs that require this.

For me and all my statistics friends in DS we find much more frustration in how hard it is to pass DS interviews when you understand problems deeper than "use XGBoost". I have found that very few data scientists really even understand basic statistics, I failed an interview once because an interviewer did not believe that logistic regression could be used to solve statistical inference questions (when it and more generally the GLM is the workhorse of statistical work).

And to answer your question, whenever I'm in a hiring manager position I very strongly value strong software engineering skills. DS teams made up of people that are closer to curious engineers tend to greatly outperform teams made up of researchers that don't know you can write code outside of a notebook.

disgruntledphd2 4 years ago | | |

A good conceptual understanding of statistics is always helpful.

It's not really tested for in most places though, where they regard a DS as a service that produces models.

jstx1 4 years ago | |

There are. But

1) the titles will vary a lot (software engineer, ML engineer, research engineer, data scientist etc.) which makes it hard to locate those jobs and to move in the job market in general

2) you still need a reasonable amount of theory (not necessarily too much statistics) to use the tools well. And in all likelihood you will be tested on it in some way during the interviews.

3) the interviews/job descriptions that don't emphasise the theory often will be for jobs where you get a title like Machine Learning Engineer but you focus more on the infrastructure rather than on the ML code

fault1 4 years ago | |

I would say on average MLE roles tend to be more SWEng heavy. But some roles are as much creating infrastructure as running the tools.

throwaway6734 4 years ago | |

I think they're called research engineering roles or ML engineering

nutanc 4 years ago |

Now someone just needs to train a model on these questions and answers, and we can let the model take all future interviews.

d4rkp4ttern 4 years ago |

This would be a great resource for creating a DL/AI course. Or chapter quizzes for such a course.

However, one of the important things when interviewing someone is that the person has not seen the question before. So as an interviewer my impulse would be to first ensure that my question is NOT in this book :)

Or perhaps even if it is in the book, if the question is advanced enough, I could test how they articulate and reason through the solution, so I know they are not simply regurgitating the answer?

pradn 4 years ago |

I think I know the answer to this, but how bad should I feel for being a software engineer with little-to-no knowledge of deep learning? I suspect it's not bad at all since the software engineering field has split into a few camps, and mine - backend systems work - isn't in the same universe as the machine learning one, for the most part.

jstx1 4 years ago | |

Not bad at all. I'm a data scientist and my not knowing React doesn't affect me one bit.

agentofoblivion 4 years ago |

I commend the author’s effort, but this is not reflective of any interviews I’ve been part of, which is many across several industries and levels. Bayesian Deep Learning? Chapter 2 in Kindergarten? If anyone asked me a question on that, I would kindly ask them to eat shit.

erwincoumans 4 years ago |

Wow, nice resource! Wish it had some sections about (deep) reinforcement learning and its algorithms. Looks like it is in the plan though.

jstx1 4 years ago | |

RL is still kind of niche - the number of companies that ship anything using RL and the number of jobs that require it are both quite low.

master_yoda_1 4 years ago | | |

Just a clarification: I think you are confusing RL with robotics. RL algorithms can be used anywhere: ads, NLP, computer vision, etc.

kragen 4 years ago |

Why are all the em dashes missing from the PDF?

aesthesia 4 years ago | |

This may be a rendering issue. Some interaction of the Computer Modern font, the TeX layout algorithm, and Chrome's rendering engine sometimes ends up making em-dashes and minus signs invisible.

kragen 4 years ago | | |

I'm not using Chrome's rendering engine, is he?

pietromenna 4 years ago |

Wow! Great resource! Thank you!