Google's Intelligence Designer

Google's Intelligence Designer (technologyreview.com)

97 points by finisterre 11 years ago | 28 comments

mturmon 11 years ago |

These MIT Tech Review articles, alas, emphasize hype: "No one had ever demonstrated software that could learn to master such a complex task from scratch," and "But until DeepMind’s Atari demo, no one had built a system capable of learning anything nearly as complex as how to play a computer game, says Hassabis."

I think the article must have overlooked significant activity in training learning systems to play games well. The glaring omission for me was Neurogammon (1987), later TD-Gammon (1992), developed by Gerry Tesauro and colleagues (http://en.wikipedia.org/wiki/TD-Gammon).

Neurogammon was, at the time, a sensation at the same conference the article coyly refers to as "a leading research conference on machine learning." The paper has almost 1000 citations. A curious omission.

cryptoz 11 years ago | |

Aren't these quite different tasks, though? There's a big difference between 'learning to play a specific game well' and 'learning to play arbitrary games'; such a big difference that I think they're entirely different disciplines. Correct me if I'm wrong, but the software in the research you reference was given the ruleset of the game, right? And DeepMind's software is not given that information, I think. I doubt they intentionally omitted that work; I think it's more likely they didn't consider it relevant enough.

mturmon 11 years ago | | |

Thanks for the correction. The "arbitrary" qualifier is not in TFA, but (as, indeed, you said) that's the point of the demo, e.g.: https://www.youtube.com/watch?v=EfGD2qveGdQ Note that they're using just the video signal from the game as input.

It's really a sad comment on the state of reporting at MIT Tech Review that you learn more about the tech from a YouTube video than from an article.

(My complaint is not with the DeepMind people; it's with the article, which should put the work in context.)

fragsworth 11 years ago | | |

They are different tasks, but I'm not seeing any really clear descriptions of what DeepMind can and cannot do. It's possible that their software is only good at a very specific kind of thing that happened to be in line with what Google wants. And for all we know, they could be at a complete loss as to how to progress.

I mean, to what extent did they restrict what it means to be an "arbitrary game"? I highly doubt their software can play Pictionary, for instance, but I haven't found anything that really explains their limitations.

Because of this, I'm leaning toward the cynical view: that it's just hype, and not actually that incredible.

Houshalter 11 years ago | | |

It's basically the same algorithm, or at least very similar. The main difference is they use huge neural networks running on GPUs, and they feed it raw video data, rather than the game board state directly.

It's not any less impressive, though. To my knowledge, no one had done anything like that before: beating video games with raw video data and reinforcement learning.
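To make the shared idea concrete, here is a minimal sketch of a temporal-difference update in its Q-learning form. It assumes a made-up five-state corridor and a plain lookup table where the real systems use a neural network, so it is an illustration only and not TD-Gammon's or DeepMind's code. The environment, the function name, and all parameter values are hypothetical.

```python
import random

def q_learning(n_states=5, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a toy 1-D corridor (hypothetical environment).

    States are 0..n_states-1; action 0 moves left, action 1 moves right.
    Reaching the rightmost state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        state = 0
        while state != n_states - 1:
            # Epsilon-greedy action selection, breaking ties randomly.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state = max(0, state - 1) if action == 0 else state + 1
            terminal = next_state == n_states - 1
            reward = 1.0 if terminal else 0.0
            # TD target: reward plus the discounted best value of the next state.
            target = reward if terminal else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q
```

After training, the learned value of "right" should exceed that of "left" in every non-terminal state. Swapping the table for a network that reads pixels is where the hard part of the Atari work lies.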

amoruso 11 years ago | |

Here's the original research paper if you're interested.

http://arxiv.org/abs/1312.5602

I'll just quote their introduction instead of trying to summarize the paper:

"Our goal is to create a single neural network agent that is able to successfully learn to play as many of the games as possible. The network was not provided with any game-specific information or hand-designed visual features, and was not privy to the internal state of the emulator; it learned from nothing but the video input, the reward and terminal signals, and the set of possible actions—just as a human player would. Furthermore the network architecture and all hyperparameters used for training were kept constant across the games. So far the network has outperformed all previous RL algorithms on six of the seven games we have attempted and surpassed an expert human player on three of them."

mturmon 11 years ago | | |

My gripe is with the post, not the paper. But you're right, the best way to figure out what's new is to go to the source.

The paper does a good job going over related work (section 3), beginning with the example I gave.

falcor84 11 years ago | |

Indeed. I am also surprised that no one mentioned Tom Murphy's SIGBOVIK paper from April 1st, 2013: "The First Level of Super Mario Bros. is Easy with Lexicographic Orderings and Time Travel ... after that it gets a little tricky" http://www.cs.cmu.edu/~tom7/mario/mario.pdf

Murphy created an agent that can play arbitrary games by inspecting the RAM and attempting to maximize the score.

See also this writeup on Ars Technica - http://arstechnica.com/gaming/2013/04/this-ai-solves-super-m...

Houshalter 11 years ago | | |

While interesting, he uses a brute-force approach (try every possible combination of moves so many seconds into the future and see which one is the best).
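Roughly what "brute force" means here, as a minimal sketch: enumerate every action sequence up to a fixed horizon on a copyable emulator state and keep the first move of the best one. The toy emulator, its scoring rule, and the function names are all hypothetical. This is not Murphy's code, which derives its objective from the game's RAM.

```python
from itertools import product

def step(state, action):
    """Hypothetical toy emulator: state is (x, score); action is -1, 0, or +1."""
    x, score = state
    x += action
    # Made-up rule: ending a step on a positive multiple of 3 earns a point.
    if x > 0 and x % 3 == 0:
        score += 1
    return (x, score)

def best_first_action(state, actions=(-1, 0, 1), horizon=4):
    """Try every action sequence of length `horizon` and return the first
    action of the sequence whose final state has the highest score."""
    best_score, best_action = None, None
    for seq in product(actions, repeat=horizon):
        s = state
        for a in seq:
            s = step(s, a)
        # Strict comparison: on ties, the earliest enumerated sequence wins.
        if best_score is None or s[1] > best_score:
            best_score, best_action = s[1], seq[0]
    return best_action
```

The cost grows as len(actions) ** horizon, which is why this kind of search only looks a short distance ahead and learns nothing that carries over between runs.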

higherpurpose 11 years ago |

FYI this is also the guy that made Elon Musk fear strong AI. Elon Musk invested in DeepMind in the early days just to see where AI is going.

xnxn 11 years ago |

For a brief, horrifying moment, I thought this was the name of a product.

What a time to be alive.

drewda 11 years ago |

Machine learning folks don't know the history of CS or AI, so they've reinvented neural networks as "deep learning"?

Or, industry types are looking for the next big thing, after "big data," and have rebranded neural networks as "deep learning"?

I don't mean to be too cynical, but I still don't understand if "deep learning" represents any meaningful advance besides the ML and EE communities finding the benefits of a certain amount of structure, which is already well established in other lines of research.

piratebroadcast 11 years ago |

This guy vs Shingy, AOL's "Digital Prophet".