MUM: A new AI milestone for understanding information

MUM: A new AI milestone for understanding information (blog.google)

257 points by chris_f 5 years ago | 208 comments

floatrock 5 years ago |

A not-so-subtle reading shows Google is doubling down on ecommerce applications here:

> It could also understand that, in the context of hiking, to “prepare” could include things like fitness training as well as finding the right gear.

> fall is the rainy season on Mt. Fuji so you might need a waterproof jacket.

> MUM could also surface helpful subtopics for deeper exploration — like the top-rated gear or best training exercises

> you might see results like where to enjoy the best views of the mountain, onsen in the area and popular souvenir shops

Or, my favorite line:

> MUM would understand the image and connect it with your question to let you know your boots would work just fine. It could then point you to a blog with a list of recommended gear.

(in other words: "Thanks for showing you're interested in hiking gear. Here's a lot of hiking gear you can buy.")

rexreed 5 years ago | |

There's an even bigger picture than possibly monetizing ecommerce revenue (through... ads?). The biggest impact is that they get to use all the content generated on the Internet to create these search "results" that synthesize information from multiple sources without ever having to share traffic or ad revenue with those content sources. Clever.

judge2020 5 years ago | | |

This really is a section that needs regulation. You basically have to use and allow Google to crawl your site if you want a website findable by 95%+ of Americans, so websites really should be able to tell google how they're allowed to use the scraped data instead of just 'for anything'. Maybe a meta tag would work well.

bjterry 5 years ago | | |

In the current world, information wants to be free. In the AI-powered future, knowledge wants to be free.

mhoad 5 years ago | | |

I don't know if this is actually true or not but I suspect that a big part of their thinking is that "we are just presenting 'facts' and facts are not subject to copyright laws".

jonnycomputer 5 years ago | | |

Forgive the analogy but that sounds like a parasitic relationship, and one that might kill off, or at least impoverish, its host. Even if Google isn't doing that, the potential exists. The counter is paywall, I suppose.

colordrops 5 years ago | |

Another not-so-subtle reading shows google doubling down on being "responsible" which has a lot of collateral damage when they block or de-emphasize legitimate results that don't fit their own goals.

sangnoir 5 years ago | | |

It rings a little hollow when they fired members of/disbanded their nominally independent internal ML Ethics unit after a member published a paper raising some flags on the kind of models Google is betting its future on.

glenstein 5 years ago | |

I don't think the language you've quoted was explicitly intended that way. But I think you're onto something. I think high-context answers open up all kinds of new contextual surfaces where ads can be placed, products + product categories suggested.

derefr 5 years ago | |

I don’t know if it’s “e-commerce” specifically, or just a more general fact that Google own a search engine, and want to surface URLs from their index as answers to questions, when appropriate. And, when you think about it, why would you be linking to a page — rather than giving a straightforward answer — unless you’re linking to a product page / review / other page that offers you a direct means of solving a problem that goes beyond a conversational answer?

floatrock 5 years ago | | |

> unless you’re linking to a product page / review / other page that offers you a direct means of solving a problem

Embedded in this answer seems to be the mindset that only buying things will solve problems.

Don't get me wrong -- I'm not a consumerist luddite, I use my credit card points like any good and proper citizen -- but when your mindset is "all problems can be solved by buying more shit", well, that's a pretty lonely existence.

Google's gotta make money, and helping people buy useful shit is a fine way of doing it, but just don't fall into the mindset trap that every solution in life is just a Google Pay away.

zepto 5 years ago | |

The Google of the future is a conversation with a salesman.

tachyonbeam 5 years ago | | |

Or maybe it's a personal assistant that lives in your phone and asks you how you're doing everyday, acts like a friend, inquires about your mental health and well-being... And then subtly nudges you in the direction of buying X,Y,Z thing or service to help you fill that existential void in your life.

rexreed 5 years ago |

Search quality at Google has been decaying over the past decade. Accuracy and quality of search results are compromised to optimize advertising revenue, penalize competitors or neutralize threats, and cater to the various needs of political or regulatory authorities.

Google's search was at its peak in 2008 when advertising hadn't fully compromised search quality. Google is an advertising business that supports its otherwise money losing properties. Why will things change in the future because you can synthesize data from multiple sources only to compromise that quality with the realities of Google's business model?

aledalgrande 5 years ago |

Content of the article:

- 1000 times more powerful than BERT, but still transformer architecture

- trained on 75+ languages, can transfer knowledge between languages

- can do text and images (not audio and video yet)

- can understand context, go deeper in a topic and generate content

Not much apart from their words about how amazing it is. Paper? Demo?

cromwellian 5 years ago |

In most sci-fi, you ask the ship computer a question and it can answer using the sum total of all human information.

But judging by the comments here, when Captain Picard asks the ship how long to Starbase 17 at Warp 9, rather than answer, you want it to tell the Captain to visit WarpTravelCalculator.com.

If you publish information in this world, there’s nothing preventing people from learning it and rewriting it in a new way. Humans do it all the time and they don’t pay the people they learned it from a portion of proceeds.

Future AI will do this too. I want machine learning to read every book and paper ever written and be able to answer queries and summarize things for me.

We may need to find a better model for encouraging content contribution to society besides copyright and demanding royalties on every use.

mfer 5 years ago | |

The analogy here doesn't work well for a few reasons....

1. It mixes mapping math calculations with published information like texts.

2. The AI in star trek worked to serve the end user, in this case Picard. In our world the AI systems are designed to serve the software's owner such as Google. It's not trying to give you the best answer. Instead it's trying to provide you responses that make Google the most money or get them into positions of power and influence the leaders want.

3. Star Trek takes place in a world where the Federation doesn't use money and everyone is motivated to put in a hard day's work. On most planets they don't have poor people. This does not fit the societal and cultural dynamic we have now.

> We may need to find a better model for encouraging content contribution to society besides copyright and demanding royalties on every use.

Right now we have a problem where people are trying to step on content creators. I was reading an example of where singers were trying to get added to songs as writers when they didn't write the songs, so they could get more of the writers' royalty from sales. We live in a world where some will beg, borrow, steal, plagiarize, and generally try to hurt others to get a leg up. Including many at big businesses who would leverage AI for that.

We may hope for the best but we should plan for the worst.

selfhoster11 5 years ago | |

Most of those starship computers are autonomous. In the current "AI" model, they would be reduced to a mere glorified Amazon Echo speaker. I think that's an important distinction to have.

zepto 5 years ago | |

Very much this. People yearn for a world of a giant number of websites and software packages like in the old days, but the reality is that a humane computer may not need a lot of different interfaces.

anigbrowl 5 years ago |

> When I tell people I work on Google Search, I’m sometimes asked, "Is there any work left to be done?" The short answer is an emphatic “Yes!” There are countless challenges we're trying to solve so Google Search works better for you.

Sorry to be off-topic but it's hard to get excited about blue sky ventures when the search UI offers no capability for simple things like delivering search results in date order. You can filter results by date, but not sort them.

cblconfederate 5 years ago |

I really hope Google gets some competition in their NN endeavors because they are creating an economy that sucks in free information and eventually spews out buying recommendations. In the past they would compensate websites for providing the precious raw material for their results with advertising. With DL models, websites don't need to get anything back. This will lead to stale information or pretty much end the web.

azinman2 5 years ago | |

You’re being downvoted but it’s actually an interesting issue. Many companies (yelp) already have suffered from quick results… at a certain point Google will have a hive mind but little reason to have you go any further. This is good as a user (hypothetically), but does not contribute back at all to the producers of such information who may have additional value to unlock.

Meanwhile the Reddits and whatnots can’t afford to not have Google index them, so this is just the price of admission. I wonder if they need an extension to do-not-crawl that lets you specify how the data could be used?

shadowgovt 5 years ago | |

Are there other reasons than financial compensation that someone would put facts on a web page?

erikerikson 5 years ago | | |

They believe it increases the probability of a world outcome the publisher prefers (e.g. activism, advancement of humanity, ...).

davedx 5 years ago | | |

Do you think Wikipedia is driven by financial compensation?

ping_pong 5 years ago |

Wasn't Google supposed to have some sort of AI that could make phone calls for you? It looked amazing when they demo'ed it but I haven't heard diddly squat since then. Did they cancel that project?

refulgentis 5 years ago | |

It works just fine and has been active for a year or so now, except in one state (Indiana?)

datguacdoh 5 years ago | |

might be region specific, but I can use it from my Google home devices. less useful when pandemic hit.

roca 5 years ago |

Their hiking question is an odd example. Technology like this is probably perfectly fine for asking questions with low downside for wrong answers. But if someone asks "I've hiked Mt Pirongia and now I want to hike Mt Taranaki; how do I need to prepare differently?" and Google erroneously answers "nothing", that could get someone killed.

xapata 5 years ago | |

Are you suggesting that's a reason to not do this research?

roca 5 years ago | | |

Not at all. I'm suggesting that when writing up a PR blog post, choose examples where applying your technology is a sensible and safe thing to do.

ljm 5 years ago |

An AI named after the British diminutive for 'mother' is surely a wise choice. I would not trust this AI unless it kissed my forehead and tucked me into bed.

drdeca 5 years ago | |

I'm reminded of the parody search engine/character named "MOM" depicted in the tower-building game "World of Goo". She promises to make lots of cookies and offers to send emails with many promotional offers.

dbuder 5 years ago | |

You will do as your MUM says. Mum knows best. You will eat the bugs and you will like it.

ColinHayhurst 5 years ago | |

Mum's the word https://en.wikipedia.org/wiki/Mum%27s_the_word

moritonal 5 years ago | |

“When in trouble come to Mum, Mum will do your little sum”

Don't know if it's related, but the above is Arup’s speech for the computer he christened Mumbo-Jumbo.

mark_l_watson 5 years ago | |

My first thought was comparing to “Mother” in the book/movie Alien.

cblconfederate 5 years ago | |

it's just temporary until they perfect DADDY

bobthechef 5 years ago | | |

In this day and age? Not likely. Daddy's turned into a eunuch. Mum's in charge now. There, there... Come to Big Mum.

sjg007 5 years ago |

A lot of knowledge on the internet is just wrong. Also a lot of scientific progress is driven by folks persisting against the current dogma. So that seems like a big problem. I imagine this is true for almost any subject where there is tribal domain expertise.

atemerev 5 years ago |

...and still, Google Suggestions cannot understand that in Switzerland, part of the population does not speak German (e.g. here in Geneva; we are a trilingual country), and it only shows me search completions in German (from the browser search bar). And there is no way to change the language there. I would prefer English.

aembleton 5 years ago | |

what is your `Accept-Language` header set to?

atemerev 5 years ago | | |

English.
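
For readers unfamiliar with the header being asked about: `Accept-Language` is a comma-separated list of language tags, each with an optional `q` weight (default 1.0) that ranks preference. The following is a minimal Python sketch of how a server might rank those preferences. The function name `parse_accept_language` is hypothetical, and nothing here describes how Google's suggestion service actually picks a language.

```python
from typing import List, Tuple


def parse_accept_language(header: str) -> List[Tuple[str, float]]:
    """Return (language_tag, quality) pairs, most preferred first.

    Illustrative only: a simplified reading of the header syntax,
    not a full implementation of the HTTP specification.
    """
    preferences = []
    for item in header.split(","):
        item = item.strip()
        if not item:
            continue
        # Split "en;q=0.8" into the tag and its optional parameters.
        tag, _, params = item.partition(";")
        quality = 1.0  # a missing q parameter means full preference
        for param in params.split(";"):
            name, _, value = param.strip().partition("=")
            if name == "q":
                try:
                    quality = float(value)
                except ValueError:
                    quality = 0.0  # treat a malformed weight as "not wanted"
        preferences.append((tag.strip(), quality))
    # sorted() is stable, so equal weights keep their original order.
    return sorted(preferences, key=lambda pair: pair[1], reverse=True)
```

Under this sketch, a header such as `en-US,en;q=0.9,de;q=0.5` ranks English first and German last, which is the behaviour the commenter above says they are not getting.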

benjaminjosephw 5 years ago |

This isn't "better search" it's entrenched market domination from the only player with enough smarts, data and (crucially) users to make this work.

While Google is building a bigger and "better" Behemoth we should ask if this kind of innovation is really doing anything at all to make the world a better place in a meaningful way. Better monetization of search seems like a way to make the world worse in my opinion.

fassssst 5 years ago |

I love how the example is a problem only a rich techie would have.

phpsuks 5 years ago | |

The examples are created by non-tech people

d--b 5 years ago |

There is no doubt that given the current state of AI, these requests would produce bullshit answers. AI is just not capable of constructing the proper conceptual models for now. But it sure can give you some answers.

It's sad to see that they'll be spending so much time, effort and money on this...

raybb 5 years ago |

Edit:

"Google MUM MultiTask Unified Model Introduction" https://youtu.be/s7t4lLgINyo

I originally posted the LaMDA video: https://youtu.be/aUSSfo5nCdM

aledalgrande 5 years ago | |

Even in the video he is just citing the same content of the article.

tachyonbeam 5 years ago | | |

This video is so silicon valley, it's amazing. They've obviously spent a lot of money producing it, but it's all vague claims, there isn't even a compelling demo. I'm guessing they're aiming for an audience of mainstream journalists, but they're not actually launching a new product per se. What gives? Why are they trying to hype something that's not ready, isn't going to be released as a product, and that they're not willing to properly showcase or even explain at any level of detail?

aphextron 5 years ago |

>Take this scenario: You’ve hiked Mt. Adams. Now you want to hike Mt. Fuji next fall, and you want to know what to do differently to prepare.

Ah yes, that totally common scenario which I'm faced with all the time.

I love this. It perfectly illustrates the peril we are in with the current state of AI research. That the author would choose this as a problem to solve shows exactly the socioeconomic class they come from, and how that influences the way they solve problems. It may seem like a trivial and meaningless example, but these subtle biases will creep their way into these systems and be amplified. And you can bet that this kind of work is the foundation for what will become the technology that eventually governs every facet of our lives once AGI is a thing.

I, for one, am terrified of the implications that a bougie tech bro AI overlord entails.

1vuio0pswjnm7 5 years ago |

Some Googler or Google fan replied to me yesterday with, "Sheesh. Why the FUD."

Ask MUM.

lifeisstillgood 5 years ago |

A bit off topic but I am wondering if there are open knowledge graphs in public?

Ignoring AI etc, my kids play a couple of games where there is clearly some backend that "knows" Taylor Swift is a Singer, is Female, and has acted in this movie X

You can go a long way in a Turing test with that and I was wondering if folks knew where those graphs were built ?

blackbear_ 5 years ago | |

Wikidata [1]! They also offer a SPARQL endpoint [2], which you can use to programmatically answer those kinds of questions. As an example, the page for Taylor Swift is [3].

[1] https://www.wikidata.org/wiki/Wikidata:Main_Page

[2] https://www.wikidata.org/wiki/Wikidata:SPARQL_query_service/...

[3] https://www.wikidata.org/wiki/Q26876
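
To make the SPARQL suggestion concrete, here is a rough Python sketch that asks Wikidata for the occupation and gender of the entity linked in [3]. It assumes the public query endpoint at `https://query.wikidata.org/sparql` and the Wikidata properties P106 (occupation) and P21 (sex or gender). The helper names `build_query`, `build_url` and `fetch_facts` and the User-Agent string are made up for illustration.

```python
import json
import urllib.parse
import urllib.request

# Assumed public endpoint of the Wikidata Query Service.
SPARQL_ENDPOINT = "https://query.wikidata.org/sparql"


def build_query(entity_id: str) -> str:
    """SPARQL asking for occupation (P106) and gender (P21) labels of one entity."""
    return f"""
SELECT ?occupationLabel ?genderLabel WHERE {{
  wd:{entity_id} wdt:P106 ?occupation ;
                 wdt:P21 ?gender .
  SERVICE wikibase:label {{ bd:serviceParam wikibase:language "en". }}
}}
"""


def build_url(entity_id: str) -> str:
    """Encode the query as a GET URL that requests JSON results."""
    params = urllib.parse.urlencode(
        {"query": build_query(entity_id), "format": "json"}
    )
    return f"{SPARQL_ENDPOINT}?{params}"


def fetch_facts(entity_id: str) -> list:
    """Run the query (needs network access) and return the result bindings."""
    request = urllib.request.Request(
        build_url(entity_id),
        # Hypothetical identifier; public endpoints generally expect one.
        headers={"User-Agent": "knowledge-graph-sketch/0.1 (example)"},
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["results"]["bindings"]
```

Calling `fetch_facts("Q26876")` would return one row per occupation and gender combination. A game backend of the kind described above could cache rows like these rather than querying live.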

ArthurDevNL 5 years ago | |

I think http://conceptnet.io/ is what you're looking for!

sjg007 5 years ago |

Makes sense. I want insights and context. If Google can do that synthesis that’s great. I do wonder about the training data and data quality though. When I do these targeted searches you have to filter the spam... books are somewhat better but nothing beats talking to someone who lives it or did it.

robkop 5 years ago |

I can't see any link to an actual paper, anyone know if they released one for this?

ColinHayhurst 5 years ago | |

https://news.ycombinator.com/item?id=27207404#27218897

aledalgrande 5 years ago | |

I can't find one either and this article is just fluff.

Lyapunov_Lover 5 years ago |

I see a lot of people here expressing doubts and confusion. I want to try to clear up some of that.

The key notion here is scale relativity. This is the reason why transformer models have been so, well, transformative. Bigger models are better than smaller models in a proportional manner. That is, they display scale relativity. Where is the limit? Where does this break down? We don't know. We haven't found the ceiling yet.

Another important notion is multimodality. When you can cross-reference your text-based knowledge of an apple with your image-based knowledge of an apple, you can use this information as leverage. Archimedes said, "Give me a place to stand on, and I'll move the Earth." It might seem ridiculous to say that the same is true when it comes to information, but it is. Informational leverage is powerful. Multimodality allows you to make very accurate predictions. The McGurk effect is a nice demonstration of how we do the exact same thing. We rely on visual information from a speaker's lips to predict what they're going to say. In other words: we make use of multimodal leverage.

The twin notions of scale relativity and multimodality explain what makes MUM possible. As some of you have pointed out, there's another aspect that we can't ignore: utility. Google will be using MUM to make money. Which means that they'll have to train MUM to make you spend it. But if you're uncomfortable with this idea, you are uncomfortable with capitalism in general. Which is fair, but I think it's important to keep it in mind.

As I'm sure they've already considered at Google, MUM can be used to revolutionize education. Imagine people all over the world having access to an expert instructor who can answer all of your questions. You might think this sounds like a dream, but we're a mere stone toss away from achieving it. That's the true power of scale relativity + multimodality: we can now make advanced systems that can communicate with us.

I appreciate the skeptics and naysayers here: you keep the rest of us sane. For that, I thank you. At the same time, I want you to open your eyes to the possibility that something very important and transformative is happening right now. You don't have to go full Kurzweil, but I think you would benefit from reflecting on the opportunities this new technology might offer.

Ajedi32 5 years ago | |

Yeah, I'm a little surprised at all the negativity here considering the game-changing potential of this sort of research. The HN crowd has always been a pretty cynical bunch, but come on! A single model that can extract information from images, text, and webpages across multiple languages and generate answers in response to natural language questions written by a user? This feels like straight-up wizardry!

endisneigh 5 years ago |

I find it difficult to believe that Google wants search to be easier for the end user. For example, I believe a very long time ago you could set up sites to exclude from all of your searches. I don’t think this is possible any longer.

tinyhouse 5 years ago |

As usual, a lot of AI hype from Google.

42droids 5 years ago |

"Since MUM can surface insights based on its deep knowledge of the world" Which just means taken from the millions of websites written by humans and used without permission or any payment.

alcover 5 years ago |

  "Is there any work left to be done?"

The short answer is an emphatic “Yes! Dismantling your monster of a corporation!”

sboomer 5 years ago |

Any millennial who has been using search for some time would easily know where to find what he needs. This sounds like Google is trying hard to drive more money out of its search business.

aaron695 5 years ago |

> "Is there any work left to be done?"

Google could search captions on all the Youtube (etc) videos. Not sure why this doesn't happen. Along with a few other big resources not indexed.

I think the big thing with the article (taken as a workable technology) is it's not search, it's getting other people's information and transforming it into a Google resource.

Which does add to humanity's knowledge, but it's owned and profited on by Google.

SiempreViernes 5 years ago |

When the text starts with "Is there any work left to be done?" The short answer is an emphatic “Yes!” I was sort of hoping they would announce that pinterest will now be banned from all non-image search results...

Instead it's an announcement that Google has made a new, even bigger, pile of linear algebra that can sort of answer questions and won't end up like Watson.

I like that they put in a deadpan bit about how they are very ethical when they make and then exploit their huge collections of data found by their spiders. There sure hasn't been any AI controversy at google this quarter, no sir-e!

gerdesj 5 years ago |

"When I tell people I work on Google Search, I’m sometimes asked, "Is there any work left to be done?" The short answer is an emphatic “Yes!”

Hands up everyone who is 100% satisfied with Search ... ... OK no one.

So now we have an unsolved problem left behind in favour of ... chat about mountains ...

"MUM has the potential to transform how Google helps you with complex tasks. Like BERT, MUM is built on a Transformer architecture, but it’s 1,000 times more powerful. MUM not only understands language, but also generates it."

Piss off and while you are at it, get BERT to explain my response to MUM or vice versa.

If MUM can decipher my immediately prior sentence given this input then I might start to get interested.

rabbits77 5 years ago |

There is nothing in that press release that could not have been done in the 1980s with Prolog.

Yeah, it’d have been more code but you would not have needed to destroy a forest to train the thing.

This is the NLP trade off of the 21st century. The code is easier to write but the model is completely opaque, and you need to really burn a lot of electricity to make it work.

xkapastel 5 years ago | |

This is totally false, I dare you to write anything close to e.g. BERT with Prolog.

kajecounterhack 5 years ago | | |

> This is the NLP trade off of the 21st century. The code is easier to write but the model is completely opaque, and you need to really burn a lot of electricity to make it work.

This is basically a meme now. We actually have a pretty good understanding of how the models work. In fact that understanding is how you can do things like build chatbots that don't spew hate.

Also the electrical cost of training large language models is indeed high (e.g. GPT-3 has 175B params and is estimated at 190,000 kWh to train on GPUs). But the folks who pay the cost (basically OpenAI, Google, MSFT, Facebook, Amazon) are incentivized to make that go down (TPUs are way more efficient than GPUs), and they are incentivized to do it infrequently because it costs $$$.

FWIW Google's datacenters are also technically carbon neutral. I know that's not great because carbon credits don't have the impact that folks think they have, but there is definitely a difference in ecological impact between datacenter electricity and other kinds of energy usage (e.g. cars all burning fossil fuels).

Okay also let's compare to bitcoin, which is the real ecological disaster if we want to talk about inefficient software: ~387,096,774 kWh PER DAY. _and_ incentivizing things like cheap coal, and miners are definitely not using their crypto wealth to purchase carbon offset credits :(
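
To put the two figures quoted in this comment side by side, here is a small Python sketch of the arithmetic. Both numbers are the commenter's estimates and are not verified here. The function names are illustrative only.

```python
# Figures as quoted in the comment above (estimates, not verified here).
GPT3_TRAINING_KWH = 190_000
BITCOIN_KWH_PER_DAY = 387_096_774


def training_runs_per_day(daily_kwh: float, training_kwh: float) -> float:
    """How many training runs of the given size one day of usage would cover."""
    return daily_kwh / training_kwh


def annualized_twh(daily_kwh: float, days: int = 365) -> float:
    """Convert a daily kWh figure to TWh per year (1 TWh = 1e9 kWh)."""
    return daily_kwh * days / 1e9


if __name__ == "__main__":
    runs = training_runs_per_day(BITCOIN_KWH_PER_DAY, GPT3_TRAINING_KWH)
    print(f"GPT-3-sized training runs per day of mining: {runs:,.0f}")
    print(f"Annualized mining usage: {annualized_twh(BITCOIN_KWH_PER_DAY):.1f} TWh")
```

On these quoted numbers, one day of mining corresponds to roughly 2,037 training runs of that size, and the daily figure annualizes to about 141 TWh.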

smokefoot 5 years ago | | |

I mean yes. But it is a funny example to choose to illustrate the power of a NN approach. They're talking about mountains--an entity that has very concrete and definable attributes (e.g., height). And the rest of the examples are similarly dealing with semi-structured data that could theoretically be represented in RDF or something like that.

There's been a bit of discussion on HN lately about the effectiveness of sophisticated models vs. just good metadata.

h0l0cube 5 years ago | | |

The better wager is: I dare you to write and train up a sophisticated real-time neural network model that can interpret human language and provide reliably useful contextual search results with the compute power and memory constraints of the 80s.