AlphaGo beats the world champion Lee Sedol in first of five matches

AlphaGo beats the world champion Lee Sedol in first of five matches(twitter.com)

1085 points by atupem 10 years ago | 573 comments

sethbannon 10 years ago |

I was at the 2003 match of Garry Kasparov vs Deep Junior -- the strongest chess player of all time vs what was at that point the strongest chess playing computer in history. Kasparov drew that match, but it was clear it was the last stand of homo sapiens in the man vs machine chess battle. Back then, people took solace in the game of Go. Many boldly and confidently predicted we wouldn't see a computer beat the Go world champion in our lifetimes.

Tonight, that happened. Google's DeepMind AlphaGo defeated the world Go champion Lee Sedol. An amazing testament to humanity's ability to continuously innovate at a continuously surprising pace. It's important to remember, this isn't really man vs machine, as we humans programmed the algorithms and built the computers they run on. It's really all just circuitous man vs man.

Excited for the next "impossible" things we'll see in our lifetimes.

maaku 10 years ago | |

> Many boldly and confidently predicted we wouldn't see a computer beat the Go world champion in our lifetimes.

Sadly as I write this my uncle and personal hero who spent 17 years of his life working towards a Ph.D. on abstraction hierarchies for use in Go artificial intelligence, has been moved into hospice care. I'm just glad that in the few days that are left he has a chance to see this happen, even if it is not the good old-fashioned approach he took.

[1] He recently started rewriting the continuation of this research in golang, available on Github: https://github.com/Ken1JF/ah

new299 10 years ago | | |

Thanks for the link. I started reading his thesis. While AlphaGo is obviously exciting, it seems like your uncles approach could help us better understand how humans play go which seems hugely valuable. I look forward to exploring his thesis further.

jakub_h 10 years ago | | |

> This project is the sixth attempt to implement the model:

Finally, the sixth attempt is written in the right language! Now it will succeed for sure.

hosh 10 years ago | |

The 2003 match was a brute force approach.

AlphaGo's architecture resembles much closer to how humans think and learn.

I initially learned Go to be able to have some chance of an AI. I then had some transformative experiences that coincided with my early kyu learning of basic Go lessons. On of the big lessons in Go is to learn how to let go of something. Taking solace in anything on the Go board is one of the blocks you work through when you develop as a Go player.

I had already known about two years ago that just the Monte Carlo approach was already scalable. If Moore's Law continues, it was a matter of time before the Monte Carlo approach would start challenging the professional ranks -- it had already gotten to the point where you just needed to throw more hardware at it.

AlphaGo's architecture adds a different layer to it. The Deep Learning isn't quite as flexible as the human mind, but it can do something that humans can't: learn non-stop, 24/7 on one subject. We're seeing a different tipping point here, possibly the same kind of tipping point when we witnessed the web browser back in the early 90s, and the introduction of the smartphone in the mid '00s. This is way bigger (to use a Go terminology) than what happened with chess.

gcr 10 years ago | | |

This isn't about Moore's Law though. From the AlphaGo paper:

    > During the match against Fan Hui, AlphaGo evaluated thousands of times
    > fewer positions than Deep Blue did in its chess match against
    > Kasparov; compensating by selecting those positions more intelli-
    > gently, using the policy network, and evaluating them more precisely,
    > using the value network—an approach that is perhaps closer to how
    > humans play. Furthermore, while Deep Blue relied on a handcrafted
    > evaluation function, the neural networks of AlphaGo are trained
    > directly from gameplay purely through general-purpose supervised and
    > reinforcement learning methods

hosh 10 years ago | | |

And just to emphasize the big point here:

The AlphaGo that beat the 2p European champion five months ago was not as strong as the AlphaGo that beat Lee Sedol (9p). I don't think this was just the AlphaGo team throwing more hardware. I think they had been constantly running the self-training during the intervening months so that AlphaGo was improving itself.

If that is so, then the big thing here isn't that AlphaGo is the first AI to win an official match with the currently world's strongest Go player. It's that within less than half a year, AlphaGo was able to learn and grow to go from challenging a 2p to challenging the world's strongest player. Think about that.

seanp2k2 10 years ago | | |

What I always think about with AI, and speaking to the "man's programming vs man" higher up, is what we'll get when we can teach computers how to solve these problems so that e.g. They could come up with the solution for + implement something like this on their own.

hitekker 10 years ago | | |

Is the Monte Carlo approach specific to Go in terms of A.I. challenges? Or is the Monte Carlo approach gaining traction in other A.I. problems as well?

I am tremendously unfamiliar with recent A.I developments.

areyousure 10 years ago | |

> Many boldly and confidently predicted we wouldn't see a computer beat the Go world champion in our lifetimes.

Can anyone provide some written references to this effect? Last time I searched (extensively), I couldn't really find anyone saying this.

baby 10 years ago | | |

What kind of source do you want? It's a saying in the go community, people believed (including me) that a bot couldn't beat a human in our lifetime, some people had more extreme view and thought that it would never be possible.

Florin_Andrei 10 years ago | | |

FWIW, before AlphaGo defeated Fan Hui 2-dan last year, everyone was saying that would not be possible before 2025 or so. That was the consensus.

bjourne 10 years ago | |

> Many boldly and confidently predicted we wouldn't see a computer beat the Go world champion in our lifetimes.

To add to that, in Godel Escher Bach, Hofstadter in 1979 predicted that no chess engine would ever beat a human grandmaster player. It just goes to show how hard it is to predict what is, and also will remain, impossible for machines!

olau 10 years ago | |

Man building tool vs. man. :)

d0mine 10 years ago | |

Francis Collins? said something like: people underestimate the change in the short term and overestimate it in the long term (there is my flying DeLorean and the hotel on the Moon).

adenadel 10 years ago | | |

I think you got it backwards. Bill Gates said

"We always overestimate the change that will occur in the next two years and underestimate the change that will occur in the next ten. Don't let yourself be lulled into inaction."

esturk 10 years ago | |

I've never felt playing against what is suppose to be an entire room of machines (wether Deep Blue or Watson) to be fair. What would be fair is to limit the total mass of the computer to say 200kg and leave it at that. What is effectively happening is AlphaGo is running on a distributed system of many, many machines. Even Watson took an entire room. Google is paying a premium to push AlphaGo to win.

taneq 10 years ago | | |

It's a proof-of-concept. What they've proved is that the same kind of intelligence required to play Go can be implemented with computer hardware. Before now, software couldn't beat a ranked human player at Go no matter how much computing power we threw at it. Now we can. Give it ten years and, between algorithmic optimizations and advances in processing, you'll have an unbeatable Go app on your phone.

knaik94 10 years ago | | |

The real achievement is in the algorithm. To make an analogy, the accomplishment of putting a man on the moon required that we understand enough to make a rocket. We could have put hundreds of car engines together but that wouldn't ever have gotten us to the moon.

21 10 years ago | | |

A top smartphone chess program can beat pretty much all but the best few players in the world. Do you think it's fair to pit a 150 gram device against a 70 kg human?

lololomg 10 years ago | | |

Today a tiny $300 desktop computer can beat any human at chess. It only took a few years after the Deep Blue vs Kasparov game.

SmellyGeekBoy 10 years ago | | |

Seems like a pretty arbitrary limitation. 70 years ago Colossus filled an entire room, now it can be emulated on a Raspberry Pi. The really groundbreaking part is the algorithm.

agildehaus 10 years ago | | |

Has Google talked about the amount of computing resources they're throwing at this match? I'd be very interested to know.

ramblerman 10 years ago | | |

What a strange sentiment. You would only delay the inevitable outcome. Sure, it wouldn't win now, but processing power will become stronger and machines get smaller. What was the point?

Confusion 10 years ago | | |

Chess engines and processing power have since then advanced to a point where my phone can now reliably beat Carlsen. There is no reason to suppose Go is different in that respect. In 10 years, DeepMind will fit into a phone.

eitally 10 years ago | | |

Besides disagreeing with you, this actually isn't true at all. In competition, AlphaGo doesn't rely on particularly expansive hardware. For training, yes, but not for playing.

nefitty 10 years ago | | |

Considering the complexity of the human brain, it seems only fair to balance out a competitor's handicap in some way. Your idea seems to anticipate the logical progression of these tests: "Nature made this mind inside this small object, the brain, why don't we do that?" Regardless, the trend of course is toward miniaturization. I see news like this recent story: "Glass Disc Can Store 360 TB" http://petapixel.com/2016/02/16/glass-disc-can-store-360-tb-... to back up imagined futures like the film Her: https://youtu.be/WzV6mXIOVl4 (and that film doesn't even address whether the OSes are connected through a wireless network).

This stuff is happening fast, and we might have found ourselves, historically, in a place of unintelligible amounts of change. And possibly undreamt of amounts of self-progression.

nkrisc 10 years ago | | |

It's the software that's impressive. Why does how many physical computers it takes to run the software matter? It's physical footprint will almost certainly shrink as computers get more powerful.

talles 10 years ago | | |

Are you suggesting to measure computing power by kilograms? That's even stupider than measuring software complexity by LOC.

nsns 10 years ago | |

...it was the last stand of homo sapiens in the man vs machine...

But who made that machine?

I'd say a more precise evaluation would be that the ability to program a machine to assist in playing chess outdid the ability to play chess without such assistance.

eru 10 years ago | | |

Your point is entirely valid, but was already made by the very comment you are replying too..

dwaltrip 10 years ago |

This is my generation's Gary Kasparov vs. Deep Blue. In many ways, it is more significant.

Several top commentators were saying how AlphaGo has improved noticeably since October. AlphaGo's victory tonight marks the moment that go is no longer a human dominated contest.

It was a very exciting game, incredible level of play. I really enjoyed watching it live with the expert commentary. I recommend the AGA youtube channel for those who know how to play. They had a 9p commenting at a higher level than the deepmind channel (which seemed geared towards those who aren't as familiar).

cgearhart 10 years ago |

I was really hoping to see a more technical discussion than what I found here in the comments. It's too bad that such a cool accomplishment gets reduced to arguments about the implications for an AI apocalypse and "moving the goalposts". This isn't strong AI, and it was at least believed to be possible (albeit incredibly difficult), but it is still a remarkable achievement.

To my mind, this is a really significant achievement not because a computer was able to beat a person at Go, but because the DeepMind team was able to show that deep learning could be used successfully on a complex task that requires more than an effective feature detector, and that it could be done without having all of the training data in advance. Learning how to search the board as part of the training is brilliant.

The next step is extending the technique to domains that are not easily searchable (fortunately for DeepMind, Google might know a thing or two about that), and to extend it to problems where the domain of optimal solutions is less continuous.

j2kun 10 years ago | |

> without having all of the training data in advance

What? They certainly trained the algorithm on a huge database of professional go games. It's even in the abstract. [1]

[1]: http://www.nature.com/nature/journal/v529/n7587/full/nature1...

cgearhart 10 years ago | | |

> What?

Exactly

They used the game database to learn the value network, then reinforcement learning of the policy network was performed on self-play games. I.e., the machine learned to play from existing data, then played against itself to learn the search heuristics (the policy network) without the need for expert data.

spot 10 years ago | | |

they were amateur expert games from the KGS server.

clickok 10 years ago |

I posted in the earlier thread because this one wasn't up yet[1].

Some quick observations

1. AlphaGo underwent a substantial amount of improvement since October, apparently. The idea that it could go from mid-level professional to world class in a matter of months is kinda shocking. Once you find an approach that works, progress is fairly rapid.

2. I don't play Go, and so it was perhaps unsurprising that I didn't really appreciate the intricacies of the match, but even being familiar with deep reinforcement learning didn't help either. You can write a program that will crush humans at chess with tree-search + position evaluation in a weekend, and maybe build some intuition for how your agent "thinks" from that, plus maybe playing a few games. Can you get that same level of insight into how AlphaGo makes its decisions? Even evaluating the forward prop of the value network for a single move is likely to require a substantial amount of time if you did it by hand.

3. These sorts of results are amazing, but expect more of the same, more often, over the coming years. More people are getting into machine learning, better algorithms are being developed, and now that "deep learning research" constitutes a market segment for GPU manufacturers, the complexity of the networks we can implement and the datasets we can tackle will expand significantly.

4. It's still early in the series, but I can imagine it's an amazing feeling for David Silver of DeepMind. I read Hamid Maei's thesis from 2009 a while back, and some of the results presented mentioned Silver's implementation of the algorithms for use in Go[2]. Seven years between trying some things and seeing how well they work and beating one of the best human Go players. Surreal stuff.

---

1. https://news.ycombinator.com/reply?id=11251526&goto=item%3Fi...

2. https://webdocs.cs.ualberta.ca/~sutton/papers/maei-thesis-20... (pages 49-51 or so)

3. Since I'm linking papers, why not peruse the one in Nature that describes AlphaGo? http://www.nature.com/nature/journal/v529/n7587/full/nature1...

Aissen 10 years ago |

Just for context, this is the first of a five-game match. Next one tomorrow at the same time! (6am CEST, 8pm PT).

moonshinefe 10 years ago | |

Thank you. The title on HN here didn't imply it was a 5 game series at all, nor did the tweet it linked to.

It's a cool win but despite the way the titles are being presented, this isn't over yet.

rybosome 10 years ago |

What an incredible moment - I'm so happy to have experienced this live. As noted in the Nature paper, the most incredible thing about this is that the AI was not built specifically to play Go as Deep Blue was. Vast quantities of labelled Go data were provided, but the architecture was very general and could be applied to other tasks. I absolutely cannot wait to see advancements in practical, applied AI that come from this research.

ktRolster 10 years ago | |

Here's the Nature article: http://www.nature.com/news/google-ai-algorithm-masters-ancie... (it has a link to the free paper, as well)

The position evaluation heuristic was developed using machine learning, but it was also combined with more 'traditional' algorithms (meaning the monte-carlo algorithm). So it was built specifically to play go (in the same way deep blue used tree searching specifically to play chess.....though tree searching is applicable in other domains).

mark_l_watson 10 years ago |

I just wrote a blogg about this. I was up to 1am this morning watching the game live. I became interested in AI in the 1970s and the game of Go was considered to be a benchmark for AI systems. I wrote a commercial Go playing program for the Apple II that did not play a very good game by human standards but did play legally and understood some common patterns. At about the same time I was fortunate enough to get to play both the woman's world Go champion and the national champion of South Korea in exhibition games.

I am a Go enthusiast!

The game played last night was a real fight in three areas of the board and in Go local fights affect the global position. AlphaGo played really well and world champion (sort of) Lee Sedol resigned near the end of the game.

I used to work with Shane Legg, a cofounder off DeepMind. Congratulations to everyone involved.

tunesmith 10 years ago |

I watched the commentary that Michael Redmond gave (9-dan-professional) and he didn't point out one obvious mistake that Lee Sedol made the entire match. Just really high quality play by AlphaGo.

Really amazing moment to see Lee Sedol resign by putting one of his opponent's stones on the board.

mathgenius 10 years ago | |

Yeah according to Redmond, it seemed that AlphaGo made a few "mistakes" whereas Sedol made none. And yet AlphaGo came out substantially ahead. So I'm not sure what that means. Perhaps we need to see more in-depth analysis of the moves, but it seems that AlphaGo just out-calculated Sedol.

nkurz 10 years ago | | |

I wonder if their move selection algorithm takes into account the "surprise" factor: given two moves that are almost equal in strength when analyzed to a depth of N, chose the one that looks worst at N-1. That is, if all else is equal, assume that you can search deeper than your human opponent, and lay traps accordingly.

bencoder 10 years ago |

I was really expecting Lee Sedol to win here. I'm very excited, and congratulations to the DeepMind team, but I'm a bit sad about the result, as a go player and as a human.

studentrob 10 years ago | |

If it's any consolation, there are still tons of things humans are far better at than machines.

atemerev 10 years ago | | |

The only remaining are language-related. Natural languages are the next focal point of AI research.

visarga 10 years ago | |

But now every amateur will have access to unlimited play against Lee Sedol-level opponents.

spacehome 10 years ago | | |

Yea, let me just go home and grab my hundreds of GPUs and CPUs.

jonbaer 10 years ago |

"AlphaGos Elo when it beat Fan Hui was 3140 using 1202 CPUs and 176 GPUs. Lee Sedol has an equivalent Elo to 3515 on the same scale (Elos on different scales aren't directly comparable). For each doubling of computer resources AlphaGo gains about 60 points of Elo."

taneq 10 years ago | |

So has AlphaGo raised its level so far just by continuing with the games against itself? Or did they just throw their entire server farm at it? (Or both, probably.)

Teodolfo 10 years ago | | |

Demis said it used roughly the same hardware resources as against Fan Hui?

jonbaer 10 years ago | | |

I would bet a mix of both. What will be more interesting is if it ends at 5-0 if we will see AlphaGo vs. Darkforest (Facebook's engine) soon after.

geebee 10 years ago |

Terrific accomplishment.

Just a question to throw out there - does anyone feel like statements like this one "But the game [go] is far more complex than chess, and playing it requires a high level of feeling and intuition about an opponent’s next moves."

… seem to show a lack of understanding of both go and chess?

I understand there may be some cross-sports trash talking, but chess, played at a high level by humans, relies on these things as well. The more structured nature of chess means that it is (or at least was) more amenable to analysis by brute force computer algorithm, but no human evaluates and scores hundreds of millions of positions while playing chess or go.

Eh, the mainstream media is going to say this regardless, and I suppose it's just unrealistic to expect them to draw a distinction between complex for humans and amenable to brute force computation but statements like this always seemed to show a remarkable lack of awareness of how people actually play these games (though I am not an especially skilled chess or go player).

narrator 10 years ago |

The funny thing about AI at this scale is we don't really know why the computer does what it does. It's more of a inductive extrapolation that we can verify that a technique works for a small problem, so we'll throw a whole bunch of GPU power and data at it and it SHOULD work for a big problem. How it actually works is fuzzy though as there's just a couple of gigabytes of floats representing weights in neural networks. No human can look at that and say: "Oh! I see why it made that move". It's so much data that it becomes kind of nebulous what the AI is doing.

cm2012 10 years ago |

After Go, the next AI challenge they're looking at is Starcraft: https://twitter.com/deeplearning4j/status/706541229543071745

sago 10 years ago | |

The obvious problem is that speed of tactical execution can make up for a lot of strategic thought. The famous example: you can rush a line of siege tanks with zerglings if you can micro them fast enough[0].

[0]:https://www.youtube.com/watch?v=IKVFZ28ybQs

LockeWatts 10 years ago | | |

I hope that in the interest of fair play they'll limit their AI to 300 APM or so. Make it win not on mechanical execution, but on decision making.

zouhair 10 years ago | |

Good luck with that.

LockeWatts 10 years ago | | |

Starcraft in many ways is a much easier game for an AI to beat top pros at than Go.

Cookingboy 10 years ago | |

If that's true...they really just hate the Koreans LOL

tarvaina 10 years ago |

The YouTube video: https://www.youtube.com/watch?v=vFr3K2DORc8

acid__ 10 years ago | |

Is this video laggy and constantly showing "The match will start in -0- seconds" for anyone else?

mijoharas 10 years ago | | |

Yes, apparently it gets better towards the end, but I have had to give up watching the start because it is too annoying.

hrnnnnnn 10 years ago |

We still have Arimaa. It's designed specifically to make it difficult for computers to play.

http://arimaa.com/arimaa/

simonbw 10 years ago | |

A computer won the Arimaa challenge last year.

https://en.wikipedia.org/wiki/Arimaa

faizshah 10 years ago | | |

I guess the only thing left is to design a game where you can change the rules of the game as a turn.

hrnnnnnn 10 years ago | | |

Well, that's that then :| Or maybe this will spur the human players to improve :)

praptak 10 years ago | |

It was designed to be hard to win with the algorithms known to researchers back then. I cannot tell if it has something substantially harder than Go has. High branching factor, deep tree and positions that are hard to evaluate with a simple heuristic are present in Go. Does Arimaa have something else up its sleeve?

hrnnnnnn 10 years ago | | |

I think one of the main things was that you can move up to four pieces (or one piece four times, or two pieces two times, etc) in each turn.

codecamper 10 years ago |

A human was beaten with some thousands of CPUS & GPUS. On a calorie level, the human is still more efficient.

On a time to learn these skills... going from zero (computer rolls off assembly line) to mastery, the computer wins.

Actually maybe the computer wins even on the caloric level, if you consider all the energy that was required to get the human to that point (and all the humans that didn't get to that point, but tried).

ragebol 10 years ago | |

But the computer certainly does not win on the amount of training samples required. The human is at the same level as the computer now for Go, but the computer has had much more training samples as Lee Sedol could process in his lifetime.

The next step is to reduce the training time/samples for the computer to get the same performance.

jules 10 years ago | | |

That's silly. Why would you want to put human limitations on the computer? We don't artificially put computer limitations on the human.

aab0 10 years ago | | |

> but the computer has had much more training samples as Lee Sedol could process in his lifetime.

That's not obvious at all. I don't think you appreciate how rigorous and demanding the training of a Go world champion is, how utterly devoted to Go they need to be: http://lesswrong.com/lw/n8b/link_alphago_mastering_the_ancie...

Radim 10 years ago |

Beating humans in Go is, in itself, not all that exciting. Go bots have been beating strong humans for quite some time now (just not the very top humans).

There are other implications that make this AlphaGo progress super exciting though. Go captures strategic elements that go well beyond the microcosm of one nerdy board game.

That's the real reason Go has been around for >2,000 years, and why this AI progress is relevant, despite its limited "game domain".

I wrote about it here, from my perspective of an avid Go player & machine learning professional [1].

[1] http://rare-technologies.com/go_games_life/

habosa 10 years ago | |

I disagree with this due to the rate of AlphaGo's progress. Consider CrazyStone which was the previous state of the art in Go computers. That program reached 5dan after many years of development and has not shown any signs of being able to reach Lee Sedol level (9dan).

In October of this year AlphaGo beat a 5dan player, bringing it into the range of CrazyStone. Only ~6 months later it beats a 9dan player which means it is now ~400 Elo higher. This means the new version would be predicted to beat the old version ~99% of the time.

Such incredible consistent progress of a problem considered somewhat intractable is notable and exciting. Imagine where this machine will be in 6 more months.

habosa 10 years ago | | |

Edit: Fan Hui was only 2dan so this is even more insane.

hendekagon 10 years ago | |

Yes. This is the point. To what extent can AlphaGo transfer what it has learned in Go to other domains. It was a very smart move to train their first AI on Go!

moonshinefe 10 years ago |

Can someone explain why this is more impressive than a computer beating top chess players over a decade ago? I'm not very familiar with Go, and while there were far more squares on a Go board, it seems less sophisticated than chess to me.

Maybe Go has way more moves possible and emergent strategies or something I'm not taking into account.

agentultra 10 years ago |

Isn't this jumping the shark a bit? It's a 5-game match. The first was really, really close.

Matetricks 10 years ago | |

It's worth mentioning that Lee Sedol mentioned in an interview that even if he loses a single game against AlphaGo, he will have lost the match. He was expecting to win all 5 games.

agentultra 10 years ago | | |

I hope that doesn't shake his determination and ability to concentrate. He could still win.

zeven7 10 years ago | |

As a ~1 dan amateur, whether the game was "really, really close" was not clear to me. For one of my games, yeah it was close, but professional play is on such another level I'm not sure how close this game should be considered.

I watched two 9d pro commentaries, Redmond's and Kim Myungwan's. Redmond was obviously being charitable in saying the game was close near the end. Myungwan said the victory was apparent several moves before the resignation, and Myungwan also said AlphaGo was clearly stronger than himself.

Either way, even if this game should be considered close, it's still not clear if AlphaGo was holding back in order to hold a secure win. It's possible it can play at a higher level, but it wasn't needed. We can't really know AlphaGo's strength until (if) it is beaten. The following matches will be very interesting.

slm_HN 10 years ago | |

Jumping the shark and jumping the gun are two different things. You want the second one.

Florin_Andrei 10 years ago | |

Yes, but it's still a historic first. It's the first time a machine wins a game against a top human player.

kowdermeister 10 years ago |

I'm truly amazed also, I'm not surprised or shocked. Once I knew that the previous master was beaten, I knew it's just a matter of time to see the #1 player topped.

What would be shocking is to find out that a famous writer, musician or scientist is in fact, just an alias for an advanced AI system :) It needs a little trick, because people should be tricked into believing that there's a real person behind the name.

Oh wait, I just remembered that there's a (mediocre) movie made on the subject: S1m0ne ( http://www.imdb.com/title/tt0258153/ )

Are you saying it won't happen? Think of the guys saying the same of go :)

pgeorgi 10 years ago | |

> What would be shocking is to find out that a famous writer, musician or scientist is in fact, just an alias for an advanced AI system :)

so, Milli VanAIlli?

(https://en.wikipedia.org/wiki/Milli_Vanilli)

kowdermeister 10 years ago | | |

Nice catch, that would be the perfect working title for the project :)

ankurdhama 10 years ago |

What this actually means is that "the approach" AlphaGo team developed to "computationally" play Go, which is an computationally intractable problem, will be very useful in other computationally intractable problems. The media is going to get crazy without understanding what actually happened. If you are going very hysteric over this and thinking that robots are going to take over then please try this:- Before the start of the game add/remove/update any rules of the game and tell both the players - the human and computer - at the start of the game about new rules and lets see who wins.

conanbatt 10 years ago |

This not only shows the insane advances in computer AI, but an incredible advancement between the Fan Hui games and this one. Im still going through the kifu to get a sense of how could it have improved so much in only 6 months.

ktRolster 10 years ago | |

I feel like it started being more aggressive, playing more fighting moves....whereas in the last match it was playing mostly a defensive game (I'm not an expert by any stretch of the imagination, though).

apetresc 10 years ago | | |

It's exactly the opposite – this last game was much more aggressive on both sides than the Fan Hui match.

imh 10 years ago |

I want to scratch my itch and play some go. I suck, and playing against other players online I get destroyed so quickly I feel like I'm ruining their fun. Where can I find a fun bot with variable difficulty?

terryf 10 years ago |

Extremely interesting news and kind of sad as a human being :)

I don't really know that much about AI, but hopefully some experts can tell me - how different are the networks that play go vs chess for example? Or recognise images vs play go?

What I mean is - if you train a network to play go and recognise images at the same time, will the current techniques of reinforcement learning/deep learning work or are the techniques not sufficient at the moment?

If that works, then it really does seem like a big step towards AGI.

GolDDranks 10 years ago | |

This is basically a combination. A "traditional" chess program would use a tree search, but trees get quickly ot of hand since they grow exponentially. The trick is to prune them, and they trained a network to do that. It selects just the moves that look good to it. (It has some level of randomness to it, too) After reaching deep enough in the search tree, they use another network to evaluate who's winning. Usually this is hard to do in Go, and that's why the second network is quite novel and helpful.

So, they use a combination of techniques. And they're doing well at it.

terryf 10 years ago | | |

right, yes, but my question was meant to be a bit more general - this and various other results have shown that it is possible to train a deep net to do a specific task very successfully - my question was if it's possible to train it to do two or more tasks as successfully or will the network then have to be exponentially larger. I suppose there is no known way to "combine" trained networks together.

devy 10 years ago |

I had a feeling that AlphaGo would beat Lee Sedol yesterday after watching Fan Hui's interview [1].

According to Hui's recall, the defeat all came down to these things: the state of the mind, confidence and human error. The gaming psychology is a big part of the game, without the feelings of fear of being defeated and almost never making mistakes like humans do, machine intelligence beating human at the highest level of competitive sports/games is inevitable. However, to truly master to game of Go, which in ancient Chinese society, it's more of an philosophy or art form than a competitive sport, there is still a long way to go.

There were a ton of details Hui cannot speak of due to the non-disclosure agreement he signed with DeepMind, but those were the gist of the interview.

In the end, AlphaGo match is 'a win for humanity', as Eric Schmidt put it. [2]

[1] http://synchuman.baijia.baidu.com/article/344562 (In Chinese)

Google Translate: https://translate.google.com/translate?hl=en&sl=zh-CN&tl=en&...

[2] http://www.zdnet.com/article/alphago-match-a-win-for-humanit...

pushrax 10 years ago |

That sequence on the right side was excellent, I am so impressed with the level of play.

ccvannorman 10 years ago |

reference: SGF file on OGS: https://online-go.com/demo/114161

To my untrained eye, AlphaGo was already way ahead by move 29 in the match tonight with black having a weak group in the upper side, while black wasted a lot of moves on the right side as white kept pushing (Q13, Q12), which white erased later because those pushes were 4th line for black and the area was too big too control. Black never had a chance to recover this bad fight. After those reductions and invasion on right side white came back to the 3-3 at C17 which feels like solidified the win.

Some people are asking what was the losing move for Lee Sedol? I wanted to joke and say "the first one.." but maybe R8 was too conservative being away from the urgent upper side where white started all the damage.

GraffitiTim 10 years ago |

A historic moment here, folks.

Incredible, and in my opinion a little terrifying.

nbaksalyar 10 years ago | |

What's more terrifying is that AlphaGo can learn even further, from these and other matches. Just hard to imagine what lies ahead.

taneq 10 years ago | | |

The Singolarity is here!

visarga 10 years ago | | |

AlphaGo playing itself would be like playing Lee Sedol or better, so it has itself as a great training companion.

singhrac 10 years ago | |

Not that terrifying? It's a very special purpose-built tool.

We'll still always have Calvinball! https://xkcd.com/1002/

cromwellian 10 years ago | | |

The way AlphaGo plays is more general purpose than other game playing systems. It doesn't have any heuristics, rules, or game rule books programmed into it, it learned to play Go like you would. The same technique could be applied to other areas, the same way DeepMind originally got super-human level Atari 2600 game performance purely by having it watch pixels.

oneeyedpigeon 10 years ago | | |

I went looking for that xkcd to post an "obligatory xkcd needs updating" comment, but you beat me to it!

What surprises me about it is that connect four was only solved in 1995; that seems relatively late for a 6x7 grid with only 7 possible moves per turn.

ausjke 10 years ago |

No surprise at all, human brain is an organ with limited neurons, and computer doubles its performance very 18 months. In fact not just the chess, I would say that AI will beat human all around at unlimited ratio in the future, when they learned how to improve themselves especially.

bwang29 10 years ago |

I was just thinking, does AlphaGo's game strategy also emulate some sort of psychological strategies used by real human, such as bullying, confusing or making fun of its opponent when it sees fit.

nefitty 10 years ago |

What do you guys think of the future progress on the game Go? Will our only chance against AI be to team up with an AI to beat the lone AI? Like in this article about centaur chess players: http://www.wired.co.uk/magazine/archive/2014/12/features/bra... (2014) It all sounds very Gundam Wing to me.

nopinsight 10 years ago |

Deep Blue:

Massive search +

Hand-coded search heuristics +

Hand-coded board position evaluation heuristics [1]

AlphaGo:

Search via simulations (Monte Carlo Tree Search) +

Learned search heuristics (policy networks) +

Learned patterns (value networks) [2]

Human strongholds seem to be our ability to learn search heuristics and complex patterns. We can perform some simulations but not nearly as extensively as what machines are capable of.

The reason Kasparov could hold himself against Deep Blue 200,000,000-per-second search performance during their first match was probably due to his much superior search heuristics to drastically focus on better paths and better evaluation of complex positions. The patterns in chess, however, may not be complex enough that better evaluation function gives very much benefits. More importantly, its branching factor after using heuristics is low enough such that massive search will yield substantial advantage.

In Go, patterns are much more complex than chess with many simultaneous battlegrounds that can potentially be connected. Go’s Branching factor is also multiple-times higher than Chess’, rendering massive search without good guidance powerless. These in turn raise the value of learned patterns. Google stated that its learned policy networks is so strong “that raw neural networks (immediately, without any tree search at all) can defeat state-of-the-art Go programs that build enormous search trees”. This is equivalent to Kasparov using learned patterns to hold himself against massive search in Deep Blue (in their first match) and a key reason Go professionals can still beat other Go programs.

AlphaGo demonstrates that combining algorithms that mimic human abilities with powerful machines can surpass expert humans in very complex tasks.

The big questions we should strive to answer before it is too late are:

1) What trump cards humans still hold against computer algorithms and massively parallel machines?

2) What to do when a few more breakthroughs have enabled machines to surpass us in all relevant tasks?

Note: It is not entirely clear from the IBM article that the search heuristics is hand-coded, but it seems likely from the prevalent AI technique at the time.

[1] https://www.research.ibm.com/deepblue/meet/html/d.3.2.html [2] http://googleresearch.blogspot.com/2016/01/alphago-mastering...

scott_hardy 10 years ago |

What an amazing game to watch. Congratulations to the AlphaGo team, and good luck to both players in the next four games!

randomgyatwork 10 years ago |

AI is good for rules based systems, but most of the worlds problems that need to be solved don't have rules in the same way a board game does. Sure it's cool that a computer beat a human at a board game, but thats like celebrating a penguin being better at fishing than a person with bare hand

mrdrozdov 10 years ago |

How much did this match cost the AlphaGo team? (From a computing resources perspective)

bane 10 years ago |

It's almost kind of bad timing in the U.S., what with one of the most insane primary seasons in our history -- this will probably not make the news at all let alone the front page like Kasparov's and Magnus's games did.

joe563323 10 years ago |

Learning from experience goes both to the program and to the champion. Does this mean if the champion keeps playing with the machine several times, he has a chance of winning?

dropdatabase 10 years ago |

I don't think a computer could ever beat me at Calvinball

panic 10 years ago |

It'll be interesting to see what new things we learn about Go itself from DeepMind. The game is very deep, and apparently we haven't found the bottom yet!

visarga 10 years ago | |

Instead of the prize money, if I were Lee Sedol I'd request unlimited play time against the latest AlphaGo.

socrates2016 10 years ago |

I think it will be very interesting if Lee Sedol can win one. Humans have different blueprints and environments. Who is to say a human can't become better?

couchand 10 years ago |

When there's a computer that can beat the world champion at both go and chess with no modifications, then I'll be scared.

picozeta 10 years ago | |

You just take your top-notch Go/Chess engines and detect the game in the initial step.

georgehaake 10 years ago |

I have read a fair amount about how it was written without much detail. Anyone know what it was written in?

chimtim 10 years ago |

AlphaGo can be beaten. It uses reinforcement learning so it will perform the set of moves that in the past led to its win. So predictable. Sedol just needs to take control and make it play in a predictable fashion. Also, perhaps play obscure moves that AlphaGo wouldn't have trained on. Perhaps next year's Go winner will have a PhD in computer science.

kul 10 years ago |

Here's one: how long until a computer can beat a human assisted by a computer?

visarga 10 years ago | |

Will humans be able to keep up with the depth of analysis these AIs will have, or will it become a problem for the AI to dumb down its thinking in order for us to grasp it?

More generally, scientists using AI for research will probably have to do research on the research, to understand what the AI discoveries mean. Maybe they mean something we can't grasp at all, in which case they go completely over our heads, like ants trying to learn about the finer points of financial markets. We will probably have to learn new concepts and even new languages designed by the AI to convey the meaning.

d0m 10 years ago | | |

I wouldn't say "dumb down" but it definitely needs to explain why it took some lines of reasoning. With deep learning, you need to rebuild the whole system with different test-cases to change a minor behavior.. but imagine if we could just say "Why did you do that? XYZ. And adjust it: "Oh, gotcha. You can't because of ABC", and then the AI has that problem solved. I guess that would be the next step in AI. I think it's called symbolic reasoning.

Here's a very good article: http://dustycloud.org/blog/sussman-on-ai/ (A conversation with Sussman on AI and asynchronous programming)

cing 10 years ago | |

Ah yes, I believe in the machine learning community that ensemble learning technique is called "meat bagging"

vancan1ty 10 years ago |

Does Lee Sedol have access to AlphaGo training games and/or matches?

pvinis 10 years ago |

i would like to see the same match, but switched placed. alphago plays itself, this time as black, to kind of see the choices it would make, and if they would align with lee's.

tvvocold 10 years ago |

Poll: https://news.ycombinator.com/item?id=11250806

EGreg 10 years ago |

Does this mean in the next few decades, computers will make better sex partners and companions than any human?

Eliezer 10 years ago | |

Worrying about the effect of strong AI on sexual relationships is like worrying about the effect on US-Chinese trade patterns if the Moon crashes into the Earth.

ue_ 10 years ago | | |

Who says? I mean, computers can be used for multiple purposes, and although some applications of AI seem more "noble" or "intellectual" than others, the pursuit of knowledge and sexual relationships are both sense pleasures that we indulge in to make ourselves feel good. On a very large scale, one is hardly more noble than the other.

gjm11 10 years ago | | |

You're right. You should get in touch with the author of this http://lesswrong.com/lw/xu/failed_utopia_42/ and tell him :-).

koder2016 10 years ago | | |

...or like worrying about being on board of a heavier than air aircraft (surely that's impossible).

jholman 10 years ago | | |

And yet, I feel that US-Chinese trade patterns could directly affect my life.

mcintyre1994 10 years ago | |

Maybe someone can hack a dating/popular fetish forum or one of the many IM services used heavily for that sort of thing and we can get a dataset to teach computers to sext!

visarga 10 years ago | |

I am sure they will. People will have to rival perfect mannered AIs for other people's feels.

supergirl 10 years ago |

after so much press about this, it would be funny if overall the human wins

21 10 years ago |

The thing that was supposed to take at least 10 years happened. Only last month people were still saying that no way AlphaGo will beat the champion and that it will be crushed. Today everybody will have seen it coming and say that it was normal.

Yet people will still tell that worrying about AI taking over is like worrying about overpopulation on Mars, and that this is a problem at least 50 years out.

simonh 10 years ago | |

Highly optimised single-function algorithms like this are impressive stuff and can lead to useful tools, but that's it. This gets us no closer to strong AI than a tic tac toe program. Until we have systems that can tackle a wide range of fundamentally different problems and independently adapt strategies for dealing with one class of problems to deal with other classes of problems, systems like Alphago will remain one trick wonders with little relevance to 'true' AI.

Edit: I do understand that the techniques used to implement Alphago can be used to implement other single-function solvers. That doesn't make it a general purpose strong AI.

Houshalter 10 years ago | | |

Welcome to the AI effect! Every time AI makes an accomplishment, it is disregarded. The goalposts are perpetually moved. "AI is whatever computers can't do yet."

People said for years that Go would never be beaten in our lifetime. They said this because Go has a massive search space. It can't be beaten by brute force search. It requires intelligence, the ability to learn and recognize patterns.

And it requires doing that at the level of a human. A brute force algorithm can beat humans by doing a stupid thing far faster than a human can. But a pattern recognition based system has to beat us by playing the same way we do. If humans can learn to recognize a specific board pattern, it also has to be able to learn that pattern. If humans can learn a certain strategy, it also has to be able to learn that strategy. All on it's own, through pattern recognition.

And this leads to a far more general algorithm. The same basic algorithm that can play Go, can also do machine vision, it can compose music, it can translate languages, or it could drive cars. Unlike the brute force method that only works one one specific task, the general method is, well, general. We are building artificial brains that are already learning to do complex tasks faster and better than humans. If that's not progress towards AGI, I don't know what is.

Eliezer 10 years ago | | |

In case you're not aware, AlphaGo's key component is based on the same type of Deepmind system that learned to play dozens of Atari games, to superhuman levels, by watching the pixels, without any programmatic adaptation to the particular Atari game. At least the version of AlphaGo that played in October was far less specialized for Go than Deep Blue was for chess. Demis Hassabis says that next up after this is getting Deepmind to play Go without any programmatic specialization for Go. Your reply would be appropriate if we were talking about Deep Blue, chess, and 1997.

empath75 10 years ago | | |

1) this isn't a single function algorithm 2) the human mind is FULL of ugly, highly optimized hacks that accomplish one thing well enough for us to survive. Don't assume that human intelligence is this magical general intelligence, rather than a collection of single function algorithms.

lololomg 10 years ago | | |

'True AI will always be defined as anything a computer can not yet do'

Confusion 10 years ago | | |

You are not a general purpose strong AI either. Your mind could easily consist of a whole bunch of single-function solvers, combined into networked components.

See this interview between Kurzweil and Minsky: https://www.youtube.com/watch?v=RZ3ahBm3dCk#action=share

danielbarla 10 years ago | | |

Regarding your edit, I'd wager that it won't stay true for long. Eventually the single-function problem of orchestrating and directing a bunch of sub-solvers in a similar manner to the human brain will become feasible. At that point true general purpose AI will exist, for all intents and purposes.

taneq 10 years ago | | |

I don't think anyone's claiming that AlphaGo is an AGI in and of itself, just that it's a significant step towards one. There's still a lot to go before we can toss a standardized piece of hardware+code into an arbitrary situation and have it 'just figure it out'.

JabavuAdams 10 years ago | | |

> This gets us no closer to strong AI than a tic tac toe program.

We don't know that, actually. Maybe GAI isn't one shining simple algo, but cobbling together a bunch of algorithms like this one.

exDM69 10 years ago | |

It's only been the first round and I'm not throwing in the towel yet. Unlike AlphaGo, Lee Sedol has an opportunity to learn from their opponents since AlphaGo takes about 30 days of wall clock time to train the networks. There will be 5 games during the next week.

Despite my optimism, the writing is on the wall. AlphaGo and algorithms like it will only improve as you throw more CPU time at them. I actually want Lee Sedol to win, not because it would uphold some kind of human supremacy but because I want to see the AI guys put some more effort (and CPU time) into it. It would be a real shame if they'd win on their first attempt.

panic 10 years ago | |

DeepMind is an algorithm which clearly improves a lot on traditional tree search. But how will a better tree search algorithm lead to AI taking over? Why does winning at Go mean AI is closer to taking over the world than when it won at Chess?

Eliezer 10 years ago | | |

It's not an improved tree search. Deepmind was almost pro level purely using the deep learning network before doing any Monte Carlo search.

One way of looking at the significance of this is that it might tell us that relatively simple machine learning algorithms can capture key aspects of the versatile human cortical capacity to learn things like Go using sheer pattern recognition. (It's amazing that human visual cortex can do that.) If the human brain were more mystical in its power, then human-level ability to recognize Go patterns wouldn't have been penetrable at all to a comparatively simple neural algorithm like deep learning.

From another standpoint, this could show that we're reaching the point where, when an AI algorithm starts to reach interesting levels of performance, a much-encouraged Google dumps 100,000 tons of GPU time and 5 months of a few dozen researchers' time to improve the algorithm right past the human performance level. In N years from now when it's a more interesting AI system doing more interesting cognition, we could see a more interesting result materialize when a big organization, encouraged by some key milestone, invests 5 months of time and massively more computing power to further improve an AI system.

21 10 years ago | | |

The issue is that we don't take the threat seriously, since we believe we have plenty of time to solve it. Some don't even believe it's a threat at all (will unplug it, right?)

This 10 years to beat go prediction shows that our time estimates are wildly ignorant.

kevinwang 10 years ago | | |

ah, but it's not just tree search. It uses neural nets, which makes it different than the chess algorithms.

chipsy 10 years ago | | |

It isn't a tree search algorithm.

typeformer 10 years ago | | |

@panic I'm sorry but you don't understand go.

onion2k 10 years ago | |

Yet people will still tell that worrying about AI taking over is like worrying about overpopulation on Mars, and that this is a problem at least 50 years out.

As hard as writing AI for a problem space like "how to win at Go" is, it's several orders of magnitude easier than creating a general AI with the self-awareness required to see us as a threat.

Houshalter 10 years ago | | |

There is a lot of progress being made in AI right now. Hard problems that were expected to take decades, are being beaten regularly. Who is to say how much AI could advance in the next 20-30 years? Do you really believe there's less than a 50% chance strong AI won't be invented in your lifetime?

CamperBob2 10 years ago | | |

The interesting thing is that this algorithm taught itself to master the game. If you're writing a chess engine, you just "write the AI for the problem space" in a few dozen lines of code, as you put it, and let brute force do the rest. In the case of DeepMind, faced with a game where a traversal approach is numerically impossible, its programmers gave it the ability to improve by playing against itself.

That's the difference between a neural net -- which until about an hour ago was a phrase that reliably set off my snake-oil detector -- and a simple tree search. DeepMind just beat a Go master... and nobody can say exactly how it did it.

That's a big deal.

pfisch 10 years ago | | |

In other words, it could be created in the next 10 to 20 years.

We should be taking steps to outlaw black box algorithms right now but I'm sure we won't.

Teodolfo 10 years ago | |

Many of the people saying not to worry ALSO predicted AlphaGo. So let's not get ahead of ourselves.

Eliezer 10 years ago | | |

[Citation needed.]

I for one did not say to not worry, and I bet $1500 vs. $2250 on Sedol winning the match before getting cold feet and arbing my outstanding bets down to $400 vs. $700.

YeGoblynQueenne 10 years ago | |

Go is literally infinitely more easy to solve than general intelligence. "Literally" in the sense that Go has a finite number of board states, while a general intelligence must be able to deal with an infinite amount of novel situations, presumably by generalising from previously experienced ones.

Infinity is a real problem. When you try to learn from examples, you first need to see "enough" examples of whatever you're trying to learn. If there are infinitely many such examples, no matter how clever you are in tackling your search space there will always be infinitely many examples of infinitely many situations you've never come across, and that you won't be able to learn.

The typical example of this is language. You could give a learner all phrases of a given language every produced and it would still be missing an infinite amount of necessary examples. Somehow (and it's freaky when you stop to think about it) humans get around this and we can produce and understand parts of infinity, without sweating it.

Machine learning is simply incapable of generalising like that and anyone who thinks AGI is just around the corner has just failed to consider what "general" really, really means.

Though to be fair, now that I had my little rant I have to admit that you don't need to go "general intelligence" before you can be really, really dangerous. Even if AI doesn't "take over" it can do a lot of damage, frex if we start using autonomous weapon systems or hand over critical infrastructure maintenance to limited and inflexible mechanical intelligence.

escoz 10 years ago | | |

Yes, go have a finite number of board states, just 2.08168199382×10^170, just a bit over the 10^80 atoms in the universe.

https://en.wikipedia.org/wiki/Go_and_mathematics

z0r 10 years ago | |

I was one of the people who rooted for the champion last month, and my position hasn't changed. Last night's game was very exciting, but if the pressure doesn't get to Sedol he has a chance. He made some mistakes in the game and AlphaGo played incredibly well. General opinion I've seen so far is that the lead switched hands in a few places, although I'm waiting for professional commentaries to illuminate what possibilities were there for both players - the played game is always just the tip of the iceberg

tim333 10 years ago | |

Some people may say AI being a worry is 50 years but not usually smart ones who know anything much about it.

bitmapbrother 10 years ago |

Some people were downplaying the victory of AlphaGo over the European champion because he was only a 2p player. I wonder what they have to say now.

z0r 10 years ago | |

The last victory was significant, but this victory was far more significant. The professional dan scale isn't exactly linear and the ranks can't simply be compared numerically even when they are granted by the same organization - and Korea, China and Japan all have at least one organization of professional go players that each maintain their own rankings. Sedol is a current top player who has won many, _many_ titles and Fan Hui is 10 years out of regular professional play and doesn't have a title to his name. What people were saying before is still true today. All of the reporting has suffered from the usual problems of describing something specialized to the general public, and all the typical inaccuracies of such journalism (compounded by Google's PR department being the source of some of it).

Congratulations to the team at Deepmind, and I'm wishing good luck for Sedol in the remaining matches - if he wins we would certainly get to see a second series rematch some months down the line, and that would be very exciting for go fans everywhere.

jorgecurio 10 years ago |

man I am fired up to watch tonights game...like I am fired up for UFC

there should be like a North American Go Nationals or something like that televised on twitch

Anyone putting money down on Sedol? He said it will be either 5-0 or 4-1 in his favor.

arao 10 years ago |

Lee is not the best player NOW.

thomasahle 10 years ago |

Giant spoiler! Does Hacker News have any policy against these things?

typeformer 10 years ago |

Lee Sedol should have played that top left 3,3 move earlier (at least before white covered it) WTF. Humanity is not longer at the top of the intelligence pyramid...

andrepd 10 years ago |

He has lost 1 game of a 5 game match, on a handicap. Hardly a defeat.

fogleman 10 years ago | |

What handicap?

andrepd 10 years ago | | |

I misread it, he had only the usual first move advantage handicap.

tunesmith 10 years ago | |

There was no handicap - 7.5 komi is traditional to counteract the benefit of moving first.

kllrnohj 10 years ago | |

> on a handicap

What handicap? There was no handicap?