Vladimir Vapnik Joins Facebook Research

Vladimir Vapnik Joins Facebook Research(facebook.com)

111 points by gie 11 years ago | 25 comments

patkai 11 years ago |

Am I the only one assuming that this many excellent scientists moving from academia is a loss for science in general? Will they really publish research in the same way they did before?

(Make no mistake, I can fully understand them, professors paid 80k per year, lacking resources, fighting bureaucrats, it is a great thing that they are recognised and at last paid what they deserve for devoting their lives to science.)

buttproblem 11 years ago | |

While I am curious too about your question, Vapnik was previously working in industry at NEC Labs in New Jersey.

patkai 11 years ago | | |

Good point. But I do wonder about Facebook's journal publishing policy.

xamdam 11 years ago | |

- Highly doubt it's purely a money decision

- Vapnik is joining a number of people he previously worked with

- Getting huge computational resources and seeing your ideas applied to real data is rewarding

iandanforth 11 years ago |

This is another great example of the unreasonable effectiveness of data. LeCunn, Hinton, Ng, Vapnik were all recruited on the basic fact that there is simply no way to do cutting edge research today without access to the data and computing resources of Google/Facebook/Yahoo/Baidu.

Edit: "No way" is inaccurate. I should have said it is much easier to do at these companies. Also it is inaccurate to imply this is the only reason these great minds have joined these companies.

azakai 11 years ago | |

> LeCunn, Hinton, Ng, Vapnik were all recruited on the basic fact that there is simply no way to do cutting edge research today without access to the data and computing resources of Google/Facebook/Yahoo/Baidu.

I don't see many details here, are you sure that's the case?

There are other reasons a giant of the field might decide to work at Facebook. They might give him more freedom than his previous employer. Perhaps friends of his already work at Facebook. The location and compensation may also play into it.

I don't want to be skeptical for no reason, but you're championing a popular narrative which I don't see direct support for in this instance.

robrenaud 11 years ago | |

There is great, big data driven research coming out of Stanford using Common Crawl. For example, see http://www-nlp.stanford.edu/projects/glove/ . They successfully train an 840 billion token corpus.

Vapnik is a big theory guy. Though I am not sure he has done anything of big practical importance recently, his immense contribution to ML (the SVM) was done at a time when machines were many orders of magnitudes weaker than they are now.

mturmon 11 years ago | | |

"In writing this book I had one more goal in mind: I wanted to stress the practical power of abstract reasoning. The point is that during the last few years at different computer science conferences, I heard reiteration of the following claim:

  Complex theories do not work, simple algorithms do.

"One of the goals of this book is to show that, at least in the problems of statistical inference, this is not true. I would like to demonstrate that in this area of science a good old principle is valid:

  Nothing is more practical than a good theory.

-- From Vapnik's preface to The Nature of Statistical Learning Theory

Vapnik is not well-described as a "theory guy". That implies that he's not interested in connections between theory and practice, and this is most profoundly not the case. He has arguably been the most successful ML researcher ever as far as connecting abstract theory to real-world outcomes.

Besides the SVM: the VC dimension started out as a lemma regarding set counting, and he pushed it to the surprising (even shocking) conclusion of universal consistency for very general classes of estimators.

nl 11 years ago | | |

There is great, big data driven research coming out of Stanford using Common Crawl. For example, see http://www-nlp.stanford.edu/projects/glove/ . They successfully train an 840 billion token corpus.

I haven't seen this paper before (thanks!!). How different is it to Word2Vec?

Clearly the pre-trained vectors at that scale (and much bigger than the ones released with Word2Vec) are new and very exciting.

tlarkworthy 11 years ago | |

Vapnik is philosophically not big data. SVMs are data efficient, at the cost of O(n^3) partitioning algorithms. His work has been more about maximizing the utility of the data you have.

j2kun 11 years ago | |

I think the real reason is because Facebook plans to recreate the Bell labs style of industry research, where researchers have license to do whatever they want.

Teodolfo 11 years ago | |

That is not accurate at all. These people can and did do lots of research in academia and had plenty of data. They obviously get paid a lot more in industry, however.

moab 11 years ago | | |

Sources? I've seen first hand the limits of the 'data' we had at school. Papers constantly citing in their experiments: "the largest real world graphs we are aware of are on the order of 1B vertices" (a twitter graph, from something like 2011. The other highly cited one is the live-journal graph).

There's a massive dearth of data in academia. This is also why you see people like Kleinberg working directly with facebook on network research.

vrnut 11 years ago |

I'm probably not the only VR nut who confused the person in the title with Vladimir Vukićević, the Director of Engineering of Mozilla who has done worked on some Oculus-centric web vr stuff for Mozilla.

http://blog.bitops.com/blog/2014/06/26/first-steps-for-vr-on...

vanderZwan 11 years ago |

A few weeks ago an article on Nautil.us about innovations in machine learning. Vladimir Vapnik was mentioned, specifically how he used poetry to teach a machine handwriting. Very fascinating article in general:

http://nautil.us/issue/6/secret-codes/teaching-me-softly

dmmcenzie 11 years ago |

Never heard of the guy. Who?

cfrs 11 years ago | |

"The original SVM algorithm was invented by Vladimir N. Vapnik" http://en.wikipedia.org/wiki/Support_vector_machine

guard-of-terra 11 years ago | |

http://en.wikipedia.org/wiki/VC_dimension