Google is acquiring Kaggle

Google is acquiring Kaggle (techcrunch.com)

810 points by Perados 9 years ago | 156 comments

kornish 9 years ago |

This is obviously a talent acquisition in more ways than one (the Kaggle team, but also their ability to source machine learning talent). I wonder to what degree it's also a Tensorflow promotion move? It seems like Google is very interested in growing a community around it.

For example: some friends who run a seed-stage biotech deep learning startup were offered a considerable discount by the Google Cloud folks. Their ask? That the company switch to Google Cloud, rewrite some proprietary software in Tensorflow, and heavily publicize both moves.

I wonder if we'll see Kaggle gain a specific bent towards that ecosystem.

nl 9 years ago | |

Not clear to me why this is a talent acquisition. The Kaggle team (Ben in particular) have some talents in ML, but I'd be surprised if they have anyone there working day to day on ML tasks.

It seems to me more like an old school product-and-media acquisition: Google like the product, and love the audience. This is a good way to get both.

Cyph0n 9 years ago | | |

I think parent's focus was on the "sourcing ML talent" part rather than the Kaggle team itself.

moomin 9 years ago | | |

Not the team themselves, the competitors...

alex_dev 9 years ago | |

Last I heard, Kaggle runs atop Azure and is heavily a C# shop. It'll be interesting to see the transition to Google Cloud if that's the case.

ofek 9 years ago | | |

I can confirm that Kaggle runs on Azure because I block all Microsoft IPs (to avoid the ninja Windows 10 upgrade) and must disable the blocker in order to go on the site.

alpb 9 years ago | | |

Makes sense. Azure LBs do not support ICMP and all ping packets are dropped. You can't ping any Azure-hosted services. Kaggle.com fits the description.

latkin 9 years ago | | |

They are also known to have used F#, and even provided a testimonial to this effect: http://fsharp.org/testimonials/. Can't say if it's still used, though. That's two recent high-profile acquisitions (with Jet.com) for F# shops.

> At Kaggle we initially chose F# for our core data analysis algorithms because of its expressiveness. We’ve been so happy with the choice that we’ve found ourselves moving more and more of our application out of C# and into F#. The F# code is consistently shorter, easier to read, easier to refactor, and, because of the strong typing, contains far fewer bugs.

> As our data analysis tools have developed, we’ve seen domain-specific constructs emerge very naturally; as our codebase gets larger, we become more productive.

> The fact that F# targets the CLR was also critical - even though we have a large existing code base in C#, getting started with F# was an easy decision because we knew we could use new modules right away.

rattray 9 years ago | | |

Google Cloud supports Windows, right? What would be the problem? (Honest question)

spullara 9 years ago | | |

The reverse DNS for kaggle.com's IP is under cloudapp.net, which is a Microsoft Azure domain, so I think this makes sense.
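For anyone who wants to repeat that kind of check, here is a minimal sketch using only Python's standard `socket` module. The helper names `looks_like_azure` and `reverse_dns` are made up for illustration, and the only Azure suffix it knows about is the `cloudapp.net` domain mentioned above.

```python
import socket

AZURE_SUFFIX = "cloudapp.net"  # the Azure domain named in the comment above


def looks_like_azure(ptr_name: str, suffix: str = AZURE_SUFFIX) -> bool:
    """Return True if a reverse-DNS name is the suffix itself or a subdomain of it."""
    name = ptr_name.rstrip(".").lower()
    return name == suffix or name.endswith("." + suffix)


def reverse_dns(hostname: str) -> str:
    """Resolve a hostname to an IPv4 address, then look up that address's PTR name.

    Needs network access and raises OSError (socket.gaierror / socket.herror)
    when either lookup fails.
    """
    ip = socket.gethostbyname(hostname)
    ptr_name, _aliases, _addresses = socket.gethostbyaddr(ip)
    return ptr_name
```

Calling `looks_like_azure(reverse_dns("kaggle.com"))` would run the full check, but the answer reflects wherever the site is hosted at the time you run it, not where it was hosted when this thread was written.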

kornish 9 years ago | | |

That's really interesting to hear. I wouldn't read too much into it, I was mostly just speculating. It's quite likely that they mostly scooped them up for the rolodex that is their user database.

In any case, congrats to the Kaggle team!

markovbling 9 years ago | | |

I think this may have something to do with Jeremy Howard's time as president there - I remember watching a few of his tutorials a couple of years ago when he was still at Kaggle and he was really into C#.

yuhong 9 years ago | | |

I wonder if Nest has support contracts for any Java 6/7 they are still using.

manojlds 9 years ago | |

Why does Google want to promote TensorFlow? To make people use more of their cloud offerings?

nostrademons 9 years ago | | |

Likely to avoid their mistake with MapReduce, where by around 2011 candidates were coming in to interviews and saying "MapReduce? That's sorta like Hadoop, right?"

There's value in controlling mindshare; keep everything proprietary too long, and people just use open-source clones that may be inferior but can actually be used by the majority of the talent pool.

kornish 9 years ago | | |

I imagine part of it is that businesses built on Tensorflow play nice with Google Cloud and their TPUs, but mostly I suspect it's just a mindshare thing. If Google becomes the place that all the top data scientists want to work – such that they don't even have to be poached – that's a Very Good Thing for them. It probably doesn't hurt if those data scientists come in already familiar with a tool Google uses internally.

Kind of reminds me of the genius move by Tesla to crowdsource collection of self-driving car information. Experts want to get where they have the data to train their models, and if Tesla propels itself ahead of the pack for number of miles of real-world training data, then that makes them very attractive to talent.

mtgx 9 years ago | | |

If all machine learning experts use TensorFlow, all the machine learning chips coming out will be highly optimized for TensorFlow. Higher competition among TensorFlow chips = better acquisition prices for Google. They also don't have to go around convincing chip makers to support TensorFlow (like they did, for instance, with the VP8/VP9 codec).

ktamiola 9 years ago | |

I am curious to see what will happen to TensorFlow. I hope the code will get cleaned up... I also hope they will eventually pay somebody to do it, as the open source option clearly generates a heterogeneous nightmare.

deepnotderp 9 years ago | |

The rewrite in TensorFlow is somewhat worrying though, since TensorFlow is open source, meaning that there's no real benefit to google if it's written in TensorFlow (except for recruitment purposes).

It's worrying since it suggests that google might be planning to make it, or at least parts of it proprietary in the future....

For the record, I don't think that google will, but I'm still worried about the possibility....

digitalzombie 9 years ago | | |

Doubt it.

They made Angular and they didn't somehow make it proprietary.

The more worrisome stuff is when they close shop on services or completely change a framework.

TensorFlow isn't a service, so we don't need to worry. And I doubt they would change TensorFlow as drastically as the Angular 1 to 2 to 3 kind of deal. If it does happen, the Keras library abstracts it, iirc.

I think their goal is to get people to use their cloud services, imo. They do the same with their Nexus phones, shipping them without an SD card to push people to the cloud.

Also I think it's almost like the idea of controlling a framework instead of being on the whim of some other company. I'm looking at Oracle and Java here.

Facebook has its own NN framework, and Google has its own, so they don't have politics to deal with.

maxander 9 years ago | | |

Recruitment is an important purpose, though. Having a steady supply of pre-Tensorflow-trained engineers available is presumably why they opened up Tensorflow to begin with. They're not going to benefit more than that anytime soon by closing it off again.

estsauver 9 years ago | | |

I think their play has been building specialized hardware that executes TensorFlow better than anyone else. "You could use a GPU to do this, but check out our custom ASIC that does it 400x faster for 1/5th the cost..."

adw 9 years ago | | |

> It's worrying since it suggests that google might be planning to make it, or at least parts of it proprietary in the future....

That would be the Google-only Tensorflow acceleration hardware they have.

jboggan 9 years ago |

I have a soft spot in my heart for Kaggle. I was motivated to get into the software industry 5 years ago when they ran their first Facebook hiring challenge. How else to break into an industry I had no degree in?

I didn't do so well in the competition but it got me coding every day and it gave me enough to talk about that I figured I could sell all my things and ride a motorcycle to California and start knocking on doors. It worked, after a fashion.

I also have a soft spot in my heart for Kaggle because I interviewed there during my first month in San Francisco and it was absolutely the worst interview of my life.

conjectures 9 years ago |

Kaggle is a great idea, but it's steadily getting more annoying to use.

1) Cruft on all landing pages and having to click through to get to the comps page which is the site.

2) Annoying focus on exploratory notebooks. Inevitably they aren't powerful enough and people link through to external sites.

3) Forcing the use of 3rd party compute platforms to enter comps. Half the fun for me is messing around with my own ideas and this just gets in the way. These should be optional rather than required.

4) Poor incentives. Many of the comps have tiny prizes for the value of work that gets done. They're also concentrated way too much at the top. Unless there's something I want to try out, the expected value of participating is way too low to do it just for the giggles.

marcelsalathe 9 years ago |

https://www.crowdAI.org is an open source alternative. Disclaimer, my research group at EPFL started the platform, because we think there should be a community-based open source version that is open to anyone. Always looking for contributors!

Edit (1): Github https://github.com/crowdAI/crowdai Edit (2): We're currently re-designing the whole site to look & feel better.

wapz 9 years ago | |

I just looked at the site and it sounds real exciting (but way too difficult for me). Can I ask how you guys are funded? I saw that there is a ~$2000 payout for the winner of the most recent challenge.

marcelsalathe 9 years ago | | |

The platform itself is funded by institutional research funding we get at EPFL. For some of the monetary prizes, these typically come from the corresponding projects.

iamseiko 9 years ago |

That's disappointing. Google will probably keep the service alive for recruiting and the consumer base, while most of its technologies will probably be shut off. Being owned by Google might also mean that some companies, like Facebook or Microsoft, might not want to post challenges on Kaggle anymore.

inlined 9 years ago | |

I really don't understand this assumption that all acquisitions are going to lead to disaster. I work in the Firebase team at Google and couldn't be happier that they've joined (it's what got me to return to Google). Google doubled down on the product and it's grown in ways that Firebase could never have achieved on its own. All while integrating into the broader ecosystem of Cloud.

Firebase then acquired DivShot and people cried doom. Yes DivShot was shut down--after completely rearchitecting Firebase's CLI and Hosting to have DivShot's open source web hosting framework with the features of both product lines. The CEO of DivShot now runs Firebase Hosting's product line and has massive resources at his disposal to push his (great) agenda of simple and speedy static web services.

halflings 9 years ago | |

What is Kaggle's technology, really? The notebooks? It's a rather experimental feature that doesn't work all that well most of the time, and it shouldn't be going anywhere, as a lot of people still appreciate them.

I agree for big companies though, even if that doesn't make a whole lot of sense (as there's no private info involved here, the competitions being in the open including the data provided by these companies).

myth_drannon 9 years ago | |

Some people spend years competing and trying to get top rankings. It was a great signal to show to recruiters/potential employers. If Google shuts it down, all this work will be gone.

codesternews 9 years ago |

This is the worst news I've read today. An independent Kaggle serves the community better than it would as the baby of some large giant. I love Kaggle, and I am very disappointed that Google acquires everything we love.

soheil 9 years ago |

I'm a little sad about this. What will Google do with it? Are they going to drain its soul? I think at a minimum the people behind Kaggle won't feel the same urge to keep building, maintaining, and growing it the same way as before, especially as the $$$ flows into their pockets. Its direction will probably be changed by the people at Google in control, and I'm not sure that's a good thing: if they were really good at doing stuff like this themselves, they would have built something like it, or a better version, on their own.

ehsankia 9 years ago | |

They just officially announced it at NEXT. It was presented by Fei-Fei Li, who is known for the ImageNet project, which was one of the first big open datasets that really helped advance this field.

The way she presented the news is that they will aim to advance that vision, but we'll have to wait and see how their vision pans out.

jph00 9 years ago |

Why does the article say that Ben Hamner was involved in the founding in 2010? He joined years later. Some basic fact checking would be nice, even in tech articles...

(Ben has been a great contributor, mind you.)

kornish 9 years ago | |

Probably because Crunchbase has Ben's title as "Co-founder & CTO", which is also what his LinkedIn says.

It's not unheard of for people who came on ex-post facto to be offered cofounder titles to sweeten the deal or for other reasons (titles are cheaper than equity, I suppose).

Edit: just rechecked and his LinkedIn notes Nov 2011 as the start date for Kaggle. Guess it was easier to condense it into one sentence than to explain the difference to TechCrunch's audience?

redcalx 9 years ago | |

Yeh as I recall Jeremy Howard was chief boffin at the start, and left some time later to start his own biomedical data analysis company (also in SF).

kornish 9 years ago | | |

Funny thing: the comment you're replying to was posted by Jeremy Howard. Pardon if I'm missing some tongue-in-cheekiness.

throw_away_777 9 years ago |

Congrats to the Kaggle team! One great thing about Kaggle was that the team listened and sought out feedback from users (even if they didn't always follow the feedback). I hope that doesn't change with the acquisition.

nojvek 9 years ago | |

It's amazing how focused Google is on AI compared to the other giants. I think it's a great investment on Google's part and congrats to Kaggle.

I hope the mission of the site doesn't change. I think Facebook did a great job with whatsapp and instagram. I expect the same with Kaggle.

chis 9 years ago |

DrivenData.org is a solid competitor without much publicity. Maybe they'll take over some of the traffic if Kaggle changes for the worse.

dthal 9 years ago |

Well, supposing this is correct...Congratulations to Anthony and the rest of the Kaggle team! Those guys do a great job. Hopefully they get rewarded for it.

macca321 9 years ago |

Congrats to Jeff and the rest of the team. I'd be interested to hear how much .NET survives the transition!

nstart 9 years ago |

This could well end up being a fantastic move for Google to also acquire customers in its platform. If Kaggle moved large pieces of its competition to be automatically hosted on GCE it might be a good win for Google. So like Kaggle's "kernels", GCE machine learning tools would become an extension that's usable with it in a really simple way. Not entirely sure what that might look like, but it feels like this kind of integration would be the best for both parties.

alantrrs 9 years ago |

Since we're sharing alternatives:

https://empiricalci.com is a dashboard to keep track of your experiments & compare them on public benchmarks.

luckystartup 9 years ago |

> Kaggle, which has about half a million data scientists on its platform, ...

Are there really that many data scientists? I thought it was a niche specialty. Is there enough work for that many people?

rcar 9 years ago | |

Think they maybe put a 0 in the wrong spot. Kaggle's leaderboard only shows ~50k: https://www.kaggle.com/rankings

codezero 9 years ago | |

I think they mean unique users.

govg 9 years ago | |

A lot of them are academics who participate out of interest, and I'm sure a significant amount are regular software engineers trying to get their hands dirty with ML.

outericky 9 years ago |

Best of luck to the Kaggle team. We attended a data science conference they presented at in 2012, which led to our YC application and the formation of SimpleLegal. Hats off...

qkhhly 9 years ago |

Google probably wants to use Kaggle as the Google Cloud entry point for the data science community. Kaggle has a lot of students and entry-level data scientists. Getting those users to start using Google Cloud could bring in a lot of potential customers.

leblancfg 9 years ago | |

I think you hit the nail straight on the head. Sure, Tensorflow will also probably get pushed in the form of tutorials, etc., but I think it's really more about finding a way to popularize GCS.

Nydhal 9 years ago |

I'm not sure if this is good or bad news. I wonder what Google's motives are and how they will influence Kaggle if this becomes reality.

sonabinu 9 years ago | |

Hate to think that some of the beauty of Kaggle being independent will be lost but that's likely

seangrogg 9 years ago | |

As with many of Google's hires, chances are they see it as less about acquiring a "product" and more about getting access to what that product produces: an extremely large number of leads in a high-demand space that they're currently trying to ramp up themselves.

leblancfg 9 years ago | | |

Interesting, though, as they never needed to own that platform to mine it for hiring leads.

deepnotderp 9 years ago |

Only Google can spend this much on what's ultimately a recruiting project.

soheil 9 years ago | |

They have 500,000 developers. Do the math: at a 30% commission on an assumed $180k salary, that's about $50k per hire. Even if they hire only 0.1% of them, that adds up to $50k * 500 = $25M. They probably paid a few times more than that, but not a few hundred times, which makes this a pretty sweet deal for Google, assuming the community keeps growing.
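The back-of-envelope estimate above can be written out so the inputs are easy to change. This is only a sketch of that arithmetic: every input is the comment's own assumption, not a known figure, and the function name `recruiting_value` is made up for illustration.

```python
def recruiting_value(users: int, hires_per_thousand: int, fee_per_hire: int) -> int:
    """Estimated recruiting value: number of hires times the fee saved per hire."""
    hires = users * hires_per_thousand // 1000
    return hires * fee_per_hire


USERS = 500_000           # user count quoted in the comment
HIRES_PER_THOUSAND = 1    # 0.1% of users hired
SALARY = 180_000          # assumed salary in USD
COMMISSION_PERCENT = 30   # assumed recruiter commission

exact_fee = SALARY * COMMISSION_PERCENT // 100  # 54,000 before rounding
rounded_fee = 50_000                            # the comment rounds down to $50k

print(recruiting_value(USERS, HIRES_PER_THOUSAND, rounded_fee))  # 25000000
print(recruiting_value(USERS, HIRES_PER_THOUSAND, exact_fee))    # 27000000
```

Integer arithmetic is used throughout so the result does not depend on floating-point rounding. Without the rounding to $50k, the same assumptions give $27M rather than $25M.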

throw_away_777 9 years ago | | |

That 500,000 developers is a vanity metric. The real number is around 50,000 who have completed a challenge, and around 5,000 who are active on the site. Also of those 5,000 users the large majority are employed somewhere else.

gpawl 9 years ago | | |

Do you have to buy Kaggle to hire Kaggle members?

trhway 9 years ago | |

Given that acquihires in AI seem to go for something like $10M/head, Google's access to that Rolodex would pay off pretty quickly.

asafira 9 years ago | |

You don't think this would drive data scientists to use Google's cloud platform? I.e., if the most well-known data science competition uses the platform, then they will use it after the fact since it's what they know best. Right?

jader201 9 years ago |

Official announcements:

Google: https://cloudplatform.googleblog.com/2017/03/welcome-Kaggle-... (HN: https://news.ycombinator.com/item?id=13822635)

Kaggle: http://blog.kaggle.com/2017/03/08/kaggle-joins-google-cloud/ (HN: https://news.ycombinator.com/item?id=13822727)

alxvio 9 years ago |

Just announced at Google Next '17. https://www.youtube.com/watch?v=j_K1YoMHpbk&feature=youtu.be

rochak 9 years ago |

Good luck to Kaggle's employees. They have done a phenomenal job.

EternalData 9 years ago |

I think having a dataset on who is really interested in machine learning and applying it in practice can only help Google. Plus, if they kind of lurk on the side, you don't get enough of the Google brand overwhelming Kaggle so that it disrupts the community, but in the back of the minds of people going into competitions and who are in the know, it might help incentivize people who think "Hey, Google is really interested in this".

sullyj3 9 years ago |

How could a company called "Google" not acquire a company called "Kaggle"? This makes me giggle.

gumboshoes 9 years ago |

The Kragle has been sold?! http://vignette1.wikia.nocookie.net/evil/images/c/c0/Kragle....

ebbv 9 years ago |

Guess that means Kaggle users can expect it to be shut down in the next five years.

sireat 9 years ago | |

Sadly I think 5 years to sunset is an optimistic estimate.

huula 9 years ago |

Don't know why. Just don't think this is going to happen.

pizza 9 years ago |

Deep Mind keeps acquiring appendages

moizsajid 9 years ago |

Really excited about this acquisition! Might open new avenues for the data science community.

nafizh 9 years ago |

This must imply Kaggle has some internal software that Google want?

deepnotderp 9 years ago | |

Nah, they just want a good recruiting station.

snissn 9 years ago | |

I am wondering if it's a marketing and recruiting play

sunilkumarc 9 years ago | |

According to the article, Google mainly wants the Kaggle community.

tzs 9 years ago |

I wonder if that's the only one they want, or if they are also going to try to get other relics such as the Knife of Exact Zero, the Fleece-Crested Scepter of Que-Teep, or the Orb of Ti-Teleest?

inopinatus 9 years ago |

Hopefully there will be no uncertainties in the acquisition. If not they can form a team to fix them. But I'm joking around: this is a Google-Kaggle niggle gaggle giggle.

maverick_iceman 9 years ago |

Does anyone know what the price was?

danaliv 9 years ago |

Whoever named this company has literally never spoken to a woman.

nojvek 9 years ago | |

Care to explain what kaggle means to a woman?

Do you mean kegel, the exercise?

ScottBurson 9 years ago | |

I think the name is a little odd too. Does anyone know how they came up with it?

Danylon 9 years ago | | |

> I didn’t have any money when I started the company to purchase a domain name so I built an algorithm that iterated phonetic domain names and printed out a list of what was available. My wife and I went through the list and “Kaggle” was the one we picked. It’s algorithmically generated.

> It’s a terrible name because most Americans pronounce it “kagel” [rhymes with “bagel”] which sounds like the pelvic floor exercises. Australians pronounce it “kaggel” [rhymes with “haggle”].

-- Anthony Goldbloom, http://www.intelfreepress.com/news/a-marketplace-for-data-sc...
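The quote doesn't say how the name generator worked, so the following is only a guess at the general idea, not Goldbloom's actual algorithm: enumerate pronounceable strings from a consonant/vowel pattern and keep the ones an availability check accepts. The letter pools, the pattern syntax, and the `is_available` callback are all assumptions made for illustration; a real version would plug a WHOIS or registrar lookup into that callback.

```python
from itertools import islice, product
from typing import Callable, Iterator, List

CONSONANTS = "bdgklmnprst"  # arbitrary illustrative subset
VOWELS = "aeiou"


def phonetic_names(pattern: str) -> Iterator[str]:
    """Yield every name for a pattern of 'C' (consonant) and 'V' (vowel) slots."""
    pools = []
    for slot in pattern:
        if slot == "C":
            pools.append(CONSONANTS)
        elif slot == "V":
            pools.append(VOWELS)
        else:
            raise ValueError(f"unknown slot {slot!r}; use only 'C' and 'V'")
    for letters in product(*pools):
        yield "".join(letters)


def available_names(
    pattern: str, is_available: Callable[[str], bool], limit: int
) -> List[str]:
    """Return up to `limit` names whose .com domain passes the availability check.

    `is_available` is a hypothetical callback that takes a domain such as
    "kaggle.com" and returns True if it can be registered.
    """
    candidates = (
        name for name in phonetic_names(pattern) if is_available(name + ".com")
    )
    return list(islice(candidates, limit))
```

With a stub check such as `lambda domain: domain.startswith("k")`, `available_names("CVC", ..., 2)` walks the candidates in order and stops after the first two that pass. A pattern like "CVCCCV" would cover names shaped like "kaggle" if the pools included the right letters.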

joelthelion 9 years ago |

Yuck.

mostafab 9 years ago |

Good news. I did not like the whole Kaggle concept anyway: thousands of people over-engineering solutions for one problem, paid peanuts, while there are more rewarding problems than talent available. It was a huge waste of scarce brainpower. I am launching my Kaggle alternative, landing page here: http://startcrowd.club/ Thanks, Google, for eliminating my competitor.