Facebook M – The Anti-Turing Test

Facebook M – The Anti-Turing Test (medium.com)

438 points by arik-so 10 years ago | 156 comments

username223 10 years ago |

Can M wade through phone support menus and cancel Comcast subscriptions? I look forward to a darkly-humorous future in which we pit poorly-paid third-world citizens against each other in wars of call center attrition. Better, once we equip them with those sound-board UIs that play pre-recorded answers in native English-speaking voices (can't find the link), English can become a transmission protocol that few people deal with directly.

Hortinstein 10 years ago | |

> I look forward to a darkly-humorous future in which we pit poorly-paid third-world citizens against each other in wars of call center attrition.

This sounds like a 10-page side story in a Neal Stephenson book. I love it.

JadeNB 10 years ago | | |

It's very nearly part of the strategy of one of the players in Stross's Accelerando (http://www.amazon.com/Accelerando-Singularity-Charles-Stross...).

Scarblac 10 years ago | | |

In Greg Egan's Permutation City, they have interactive 3D video email, and interactive 3D video spam, and interactive 3D video spam filters.

The spam tries to act like a perfectly normal message as long as it is talking to the spam filter, and as soon as it thinks it is talking to a real person, it shows its spam message. The spam filter tries to impersonate the recipient as best as it can (in 3D video), meanwhile trying to figure out whether the message is spam.

The spam filters are unfortunately hamstrung by the fact that they can only become close to real conscious AI and not further, because taking them all the way there would mean you'd be exposing a conscious being to spam all its life, which would be torture and thus criminal. Spammers don't care.

IIRC, this is just a side anecdote in some paragraphs somewhere, but I love it.

CPLX 10 years ago | | |

Was thinking Douglas Adams myself.

andorov 10 years ago | |

Perhaps a future where modern English has become archaic but is still spoken, only by computers to each other, because the upgrade is not cost effective.

envy2 10 years ago | |

M can indeed cancel Comcast accounts. Use this knowledge wisely...

gohrt 10 years ago | |

HN hosted an article recently about someone's project that cancels Comcast for you.

nathancahill 10 years ago | | |

https://news.ycombinator.com/item?id=10320509

tonylucas 10 years ago | |

I would hope in the future that more and more services are actually directly available to be interacted with over one or more messaging networks, enabling easy, asynchronous communication.

Whether Comcast would want to make it that easy for people to cancel is entirely a different issue though :)

graeham 10 years ago |

"I'm AI but humans help train me"

The implication to me is that the chat is with a human, who is using an AI tool with the intention of training that tool. What better way to train a new service than to launch it, then answer all the weird, unexpected questions with humans? Gradually more of the questions get answered, the AI gets better trained, and the human-AI becomes increasingly more AI.

Further, as the AI gets better, the human working with it has to do less, so they can roll out the service to more users without requiring more staff. Perhaps eventually, no human is needed.

pjc50 10 years ago |

This is a new angle on the app-outsourcing-to-low-paid-contractors "technology": it's so dehumanising that you have to pretend to be a computer while you work!

It's also strikingly similar to the original "mechanical turk".

provemewrong 10 years ago | |

>It's also strikingly similar to the original "mechanical turk".

That's what the M stands for.

personjerry 10 years ago | | |

No, it's not. Unless you have a citation...?

argonaut 10 years ago |

Why is this even a question? According to Facebook itself, it's mostly human-driven.

http://recode.net/2015/11/03/facebooks-virtual-assistant-m-i...

BukhariH 10 years ago | |

> "The opinion is split as to whether or not it’s a real AI, and there seems to be no way of proving its nature on way or the other."

Clearly, the author didn't even do the most basic fact checking, since Facebook told everyone that M was going to be AI that was assisted by humans.

It's literally in the announcement post: https://www.facebook.com/Davemarcus/posts/10156070660595195

> "It's powered by artificial intelligence that's trained and supervised by people."

icebraining 10 years ago | | |

But that's exactly the point: is it AI trained by people, or people aided by AI tools? It's not the same.

betandr 10 years ago | |

I suppose the interesting point is more that it's being marketed as not really operated by humans as a positive, when in many instances in our modern world the reverse is true. And it's also an interesting application of the Turing Test. :)

br3w5 10 years ago | |

I think it's an interesting exercise in how to prove the level of human involvement in this AI.

fab1an 10 years ago |

Facebook's strategy here is to build an AI brand before they have the actual technology, which could make a lot of sense. At the same time the interactions between M's team and its users will provide meaningful data to train the AI on.

andreasvc 10 years ago | |

I've never heard vaporware characterized as something that "could make a lot of sense"; how could it?

Plus I'm doubtful whether the data would be very meaningful. A bunch of people adversarially trying to figure out whether the AI is real is not representative or generally useful data.

drumdance 10 years ago | | |

Microsoft Windows was vaporware for years. They famously did a "demo" that was just a manipulation of graphics. But Bill Gates correctly grasped that the future of the business rested on it and set about building the brand.

__john 10 years ago | | |

I think the data would be very useful, considering a vast majority of users won't be using it in this way. Also consider that people who query for things will learn the limitations of the system providing the information and adapt their queries accordingly. If you have a system capable of providing meaningful results to highly complex queries, then you can start bridging the gap between how people interact with machines and how they interact with humans.

reitanqild 10 years ago | | |

> I've never heard vaporware characterized as something that "could make a lot of sense"; how could it?

But it isn't vaporware. It very much exists and works; it is just possibly actively misleading about how it works.

agopaul 10 years ago | | |

"Do things that don't scale"?

grahamburger 10 years ago | | |

I see it as similar to Uber building a system for scheduling self-driving cars before they have the self-driving cars.

bgilroy26 10 years ago | | |

They're expecting people to use it like Siri or Cortana or 'OK Google'.

chrisBob 10 years ago |

I just had the perfect idea for a test, but then I went back to the recent discussion on Mimic[0] and double checked my favorite example[1]. Google has already updated their support, but there is a chance that Facebook M is still behind. Test them now before it is too late:

"When is the next Τаylοr Ѕwіft concert in my area?"

[0] https://news.ycombinator.com/item?id=10437619

[1] https://www.google.com/search?q=Τаylοr+Ѕwіft
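
For anyone wondering how a service might catch this kind of lookalike-character query, here is a minimal Python sketch. It is only an assumption about one possible approach, not how Google or Facebook actually handle it: it takes the first word of each letter's Unicode name (LATIN, GREEK, CYRILLIC, ...) as a rough stand-in for its script and flags words that mix more than one. The function names are made up for illustration.

```python
import unicodedata


def scripts_in_word(word):
    """Return the set of script labels used by the letters in `word`.

    The label is just the first word of the Unicode character name,
    e.g. 'LATIN', 'GREEK' or 'CYRILLIC'. This is a rough heuristic,
    not a real Unicode script-property lookup.
    """
    scripts = set()
    for ch in word:
        if ch.isalpha():
            name = unicodedata.name(ch, "")
            if name:
                scripts.add(name.split()[0])
    return scripts


def mixed_script_words(text):
    """Return the whitespace-separated words that mix several scripts."""
    return [w for w in text.split() if len(scripts_in_word(w)) > 1]


if __name__ == "__main__":
    # "\u0430" is CYRILLIC SMALL LETTER A, which looks like a Latin "a".
    print(mixed_script_words("T\u0430ylor Swift"))
```

A word written entirely in one script passes untouched, so ordinary Greek or Russian text would not be flagged; only words that mix scripts are.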

jwalton 10 years ago |

Bad typing is definitely not enough to measure if an AI is really a human. As a teenager, I wrote a chatbot for an online text based game. I have it knowledge of a QWERTY keyboard layout, and when "typing" it had a small random chance of pressing the key next to the key it wanted to press. It would also sometimes transpose characters. Sloppy typing can be simulated.

Might be an interesting test to do a statistical analysis of your subject's mistakes against a corpus of real human mistakes, since there are many common mistakes humans make, and a random AI might make inhuman mistakes, but this would of course not be conclusive.
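
A rough Python sketch of the kind of sloppy typist described above, for illustration only. It is not the original bot's code: the neighbour map covers letters within the same keyboard row only, and the probabilities and names (`slip_rate`, `swap_rate`, `sloppy`) are made-up assumptions.

```python
import random

# Assumed, simplified QWERTY layout: only same-row letter neighbours.
ROWS = ["qwertyuiop", "asdfghjkl", "zxcvbnm"]
NEIGHBOURS = {}
for row in ROWS:
    for i, ch in enumerate(row):
        NEIGHBOURS[ch] = [row[j] for j in (i - 1, i + 1) if 0 <= j < len(row)]


def sloppy(text, slip_rate=0.05, swap_rate=0.02, rng=None):
    """Return `text` with simulated typing mistakes.

    With probability `swap_rate` two adjacent characters are transposed;
    otherwise, with probability `slip_rate`, a lowercase letter is replaced
    by a key next to it in the same row.
    """
    rng = rng or random.Random()
    chars = list(text)
    out = []
    i = 0
    while i < len(chars):
        ch = chars[i]
        if i + 1 < len(chars) and rng.random() < swap_rate:
            # Transpose this character with the next one.
            out.extend([chars[i + 1], ch])
            i += 2
            continue
        if ch in NEIGHBOURS and rng.random() < slip_rate:
            # Finger lands on a neighbouring key.
            ch = rng.choice(NEIGHBOURS[ch])
        out.append(ch)
        i += 1
    return "".join(out)


if __name__ == "__main__":
    print(sloppy("sloppy typing can be simulated", slip_rate=0.1, swap_rate=0.05))
```

Because the mistakes are drawn uniformly from neighbouring keys, their distribution would probably differ from real human errors, which is the gap the statistical comparison above would try to exploit.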

espadrine 10 years ago | |

Simulating bad typing is only necessary to fake a human. Here, having bad typing when faking an AI is stranger.

That said, AIs trained through the chat transcripts of a large number of conversations may produce mistakes. I remember reading a paper that gave good results that way, with the side-effect that it produces typing mistakes as a result. I cannot find that paper again, unfortunately.

Edit: found it! http://arxiv.org/pdf/1506.05869v1.pdf

_lce0 10 years ago | | |

I wonder if there's code available, from that paper. Results look promising.

Thanks for sharing!

jwalton 10 years ago | |

I just realized I typed "have" instead of "gave". Now everyone knows I'm a robot. -_-

downandout 10 years ago |

Clearly there are some humans behind M that are doing things that Facebook would rather entrust to humans (like making phone calls). However, the phone call only proves that this specific aspect of M is human.

In the end, though, I suppose it doesn't matter. I'm going to guess that the ultimate end-game on M is for Facebook to collect advertising/affiliate revenue from recommending things. For example, if someone asks for a Chinese restaurant, plumber, dentist, lawyer, etc. in their city, the one they suggest could be the one that paid Facebook for it. As long as these types of fees make it profitable for Facebook, it doesn't matter if the service needs to be powered by millions of humans. In fact, that would be great - it would mean millions of new jobs.

Larry Page famously told an early investor that Google wasn't yet sure how it would make money, but that search was the only situation in which people would tell a computer what they wanted, and that there had to be a way to make money from that. M is exactly the same - a way to get people to tell Facebook what they want, and it puts them in a great position to monetize it.

kayoone 10 years ago | |

Basically what GoButler or Magic are already doing, just AI based. But this is the goal for all similar services, otherwise it will not scale.

jasperry 10 years ago |

I wonder if you can ask M to use fewer exclamation points. From the conversations I saw in the article, it's a little too chirpy (or should I say "clippy") for my taste.

eric_h 10 years ago | |

I had the same sentiment. Similarly - when I ask Siri what time it is on my new Apple TV, after midnight it always says Zzzz... as if it's judging me for staying up late.

I'm not a huge fan of AIs fake emoting all the time. Occasionally, it's amusing, but all the time it just rubs me the wrong way.

lhnz 10 years ago |

I wonder how many years it will be before a real AI can compete with Facebook's AI?

I guess by the time that's possible Facebook's pretend AI will already have cornered the market.

The public will only be able to see that you were the late entrant and that while your AI is faster it's occasionally incorrect in peculiar ways...

This seems like a fairly solid plan by Facebook to crown themselves the winners of a race that hasn't yet finished.

AJ007 10 years ago | |

Pretending you have a working technology when you don't has been a recent theme in the startup world.

drivers99 10 years ago | | |

Thomas Edison claimed to have a long lasting light bulb before he actually did. He showed it to reporters one at a time in a booth. Between observers, he would change out the light for a fresh one. Source "How We Got to Now: Light" (on Netflix currently, at least in the US). Found the clip on PBS. Skip to 2:20 for the specific part: http://www.pbs.org/how-we-got-to-now/big-ideas/light/

eli 10 years ago | | |

I hardly think that's a recent development. Plenty of technology has been sold that way for a long time.

icebraining 10 years ago | | |

Vaporware is neither new nor restricted to the startup world. There's actually a whole (sub)genre of music based on the concept!

rblatz 10 years ago | | |

It's just an extension of "do things that don't scale."

trop 10 years ago | | |

Or compare in literature, to Victor Pelevin's (somewhat dystopian) novel Omon Ra's "highly complex automated systems"...

TorKlingberg 10 years ago | |

> I guess by the time that's possible Facebook's pretend AI will already have cornered the market.

And cost Facebook a lot of money. Are they planning to pay for personal assistants for everyone?

Grue3 10 years ago |

"Can you solve this CAPTCHA for me?"

(provided the CAPTCHA is sufficiently OCR-resistant)

andreasvc 10 years ago | |

If you want to avoid giving information about whether you are an AI or human, you simply respond "No."

Besides, the CAPTCHAs that are sufficiently hard to solve for computers are already hard for humans as well.

kuschku 10 years ago | | |

> Can you tag for me all photos in my album that contain a kitten, but not a dog, with "kitten", and those that contain a kitten and a dog, with "pets<3"?

That should keep it busy.

thanatropism 10 years ago | | |

Question isn't "will you?" -- it's "can you?"

mrdrozdov 10 years ago |

I don't get the part about the reverse number lookup. Couldn't they be using a disposable phone number that is allocated to Facebook? That's what Handy, Airbnb, Uber, etc. do. Why would they have to block their caller id? And how does either method prove or disprove that M is human?

shawabawa3 10 years ago | |

That doesn't prove that M is human; what proves it's human is that a human voice called. The fact it says Facebook is just evidence that it was indeed from M, and not just him getting his friend to call him and pretend.

throwaway7767 10 years ago | | |

> The fact it says Facebook is just evidence that it was indeed from M, and not just him getting his friend to call him and pretend.

It doesn't really prove anything, since caller ID is extremely easy to spoof (I used to call my mates from the emergency number for kicks when I was younger). Not that I have any doubt as to the credibility of the story.

Sujan 10 years ago | | |

https://youtu.be/6QRvTv_tpw0?t=28

;)

adrianN 10 years ago |

This reminds me of the Focused people in Vernor Vinge's "A Deepness in the Sky", slaves that were integrated into the computer system to provide function that surpassed the computer's built in intelligence.

austinjp 10 years ago | |

I've not read that. Personally I felt we're one step closer to the "cookies" in Charlie Brooker's "White Christmas" episode of Black Mirror.

From the article:

“Our test participant was impressed with how much M could do, but was sometimes disappointed at how long it took,” UserTesting’s report reads. “He concluded that it would be very useful if he could set it to perform a non-urgent tasks for him while he worked on other things.”

That made me shudder. One person tutting at the poor performance of "it". It seems plausible that robot-powered tasks would complete rapidly, and humans power the slower processes.

So the participant didn't know it was human-powered. If anything, that makes things worse.

agopaul 10 years ago |

Has anyone tried to hack together a script that puts two M chat sessions in communication with each other?

Lewton 10 years ago |

Tihs is uslulay how I tset wehther i'm tlaknig to an ai or not

grey-area 10 years ago | |

https://duckduckgo.com/?q=Tihs+is+uslulay+how+I+tset - responds with:

Did you mean this is usually how i test?

Lewton 10 years ago | | |

Haha touché

dvh 10 years ago | |

I tested your question on my chatbot:

Q: Tihs is uslulay how I tset wehther i'm tlaknig to an ai or not

A: I am AI

avinassh 10 years ago | | |

Link to your chatbot? And a link to the code, if it's open source?

tux3 10 years ago | |

This is always impressive to me, I didn't realize a thing until "tlaknig"! Although I suppose you could specifically train an AI to recognize this.

Gankro 10 years ago | | |

To be clear, this is applying the classic observation that if you keep the first and last letter correct, humans are really good at unjumbling the center.

maxerickson 10 years ago | | |

Taking the best match from a spell checker on unrecognized words will handle most of it.

A threshold would probably work better against a mix of jumbled words and real gibberish.
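
As a small illustration of the first-and-last-letter observation, here is a hedged Python sketch. It is not a real spell checker, and the tiny `VOCAB` list and the function names are invented for the example: each known word is indexed by its first letter, its sorted middle letters, and its last letter, and jumbled tokens are looked up by the same key.

```python
def signature(word):
    """Key that ignores the order of the middle letters of a word."""
    w = word.lower()
    if len(w) <= 3:
        # Nothing to shuffle in words this short.
        return w
    return w[0] + "".join(sorted(w[1:-1])) + w[-1]


def build_index(vocabulary):
    """Map each signature to the known words that share it."""
    index = {}
    for word in vocabulary:
        index.setdefault(signature(word), []).append(word)
    return index


def unjumble(text, index):
    """Replace each token with the first known word sharing its signature.

    Tokens with no match (real gibberish, or words outside the
    vocabulary) are passed through unchanged.
    """
    fixed = []
    for token in text.split():
        candidates = index.get(signature(token))
        fixed.append(candidates[0] if candidates else token)
    return " ".join(fixed)


# Hypothetical toy vocabulary; a real system would load a full word list.
VOCAB = ["this", "is", "usually", "how", "i", "test", "whether",
         "talking", "to", "an", "ai", "or", "not"]

if __name__ == "__main__":
    index = build_index(VOCAB)
    print(unjumble("Tihs is uslulay how I tset wehther i'm tlaknig to an ai or not", index))
```

This only handles exact middle-letter permutations; dropped or substituted letters would need an edit-distance match, with some threshold for rejecting gibberish, along the lines suggested above.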

codeshaman 10 years ago |

Regardless of whether M is currently more human than AI, we could project that in the not-so-distant future (after it's trained), M will be mostly, 99% AI.

The technology itself will become more and more available and other companies will also use similar AI tech to work with customers.

The ultimate moment will be when the AIs start talking to each other in human language, each 'thinking' that the other is a human.

That will be the moment when the machines have decided something for you and while at first you'll think that you triggered that, at some point it will become unclear - is the human triggering the AI or is the AI triggering the human.

Pretty soon, everything we consume and everywhere we go will be controlled (and, a bit later, predestined and programmed for us) by the AI.

rvac 10 years ago |

It's a human using Siri to answer your questions.

VLM 10 years ago | |

Based on punctuation analysis, word choice, tone, it's not just any human but a mid-20s white female. Probably front-ending Google.

Real comedy would be going to mturk to try to find the task and crack it recursively: "M, find me the Mechanical Turk task for this request".

hellbanner 10 years ago |

So how well does this scale, if all of Facebook's users are using "M" like this?

adrianb 10 years ago | |

Launch is limited to selected users in Silicon Valley.

hellbanner 10 years ago | | |

That explains why Facebook.M has such knowledge of the local area.

egmalek 10 years ago |

Imagine if the 1.3B Facebook users were eligible to be called upon by the AI, in a Quora-like way, to answer a question the AI couldn't answer alone...

nickodell 10 years ago | |

I think the quality of the answers would be along the lines of Yahoo Answers.

swang 10 years ago |

Did the author ask it about how we can avert the heat death of the universe?

evv 10 years ago | |

There is insufficient data for a meaningful answer.

aeturnum 10 years ago |

What a silly conclusion. The fact that a human called his land line does not mean M (the thing in messenger) isn't an AI. At best, it proves that M has humans who work for M making phone calls.

I don't have any insight or opinion about the question of how human M is, but this article seems to make a bunch of assumptions that make the whole investigation somewhat silly.

azernik 10 years ago | |

I think the distinction is a philosophical question. Are they humans who run errands for the AI, or is the AI a tool the humans use?

Perhaps better to think of them as coworkers, each specializing in their strengths.

The question is, just how much of the workload is the machine capable of handling? Because I think that's the big indicator of scalability.

sidcool 10 years ago |

Humans working at Facebook scale! It would be interesting to see how many people are employed to do this... Are they Googling?

MasterScrat 10 years ago | |

The reference to Google Maps was a surprise to me... Doesn't FB typically rely on Bing Maps?

sidcool 10 years ago | | |

Indeed, they do. Facebook has traditionally been more inclined towards Microsoft than Google. Probably because they are the lesser rivals in business.

joss82 10 years ago |

But Facebook publicly admitted that the service is powered by real humans:

http://www.wired.com/2015/08/facebook-launches-m-new-kind-vi...

kriro 10 years ago |

So basically the suspicion is that M is a concierge MVP? From reading the chat excerpt I'd agree.

Edit: It would be interesting to devise a way in which you can make two Ms talk to each other (or have M talk to Siri etc.). Maybe "can you pretend to be a customer for my XYZ business"

mahdiponline 10 years ago |

As much as I appreciate the effort, I don't think proving M has humans behind it is of any help.

We write AIs. We try to make them act just like us. We test them in every way we can imagine and we expect them to act like a human would in response. Providing an algorithm for this is not always useful or maybe not even possible.

My theory on this is that Facebook is powering M with both people and some sort of AI software that not only analyzes and sometimes finds the best response, but also analyzes the conversations people on both sides made.

Now this can be useful on several levels. Facebook can improve its AI algorithm in less time, and the AI can help people on their job in the meantime (by analyzing their work and commenting on it).

free2rhyme214 10 years ago |

This guy is hilarious. If you're reading HN comments you know that AI isn't quite there yet right? We're easily 5-10 years away from anything you're looking for.

SilasX 10 years ago |

>The most noteworthy aspect of this reply is that “Google Maps” wasn’t capitalized, suggesting that maybe, just maybe, a human typed it out in a hurry.

Or they're smart enough to add random mistakes. When I started a project for setting up multiple ways to say the same form letter, I thought of adding a random-typo feature to make it look like humans were writing it. I'm sure these guys are at least as cheeky as me...

Quanttek 10 years ago | |

But they don't try to convince you M is a human. Indeed the opposite. So it would be rather stupid to add typos to an AI, when you want people to see it as an AI

mizzao 10 years ago |

I don't think there has to be a huge controversy here. It's perfectly plausible to build a system that contains a hybrid of human and machine intelligence, where the humans work on the more fuzzy questions that cannot be directly answered yet, and the interactions are used to fill in the gap as the AI is improved for later.

nhf 10 years ago | |

Yep! There's a lot of research going on in that area. For example, http://dl.acm.org/citation.cfm?id=2702416

Some recent work on fusing machine learning with Mechanical Turk workers to create "sensors".

gjm11 10 years ago |

For something with a similar flavour, see http://dangermouse.brynmawr.edu/csem/coffeehouse.html; start reading where it says "Post Scriptum".

traverseda 10 years ago | |

Didn't like that semicolon.

gjm11 10 years ago | | |

Ugh. Here's the URL without punctuation after it: http://dangermouse.brynmawr.edu/csem/coffeehouse.html

moey 10 years ago |

Do you guys think one person, working alone, can develop a Turing-test-passing AI?

ben_utzer 10 years ago |

Is it just me, or are all the photos blurred? I can't read them.

yyhhsj0521 10 years ago | |

My photos were blurred and I found in F12 developer tool that they failed to load. After a little fiddling with my network they successfully loaded and became clear.

berdario 10 years ago | |

It's not just you. It's basically unreadable on Firefox on Android (and what's worse... the page prevents arbitrary zooming, unless you "request the Desktop page").

It's depressing how a supposedly well-designed platform like Medium still falls short of providing a usable mobile interface.

7Z7 10 years ago | | |

Can confirm, it's perfectly readable on Safari on iPhone.

Also zoomable.

Sir_Cmpwn 10 years ago | |

Just you, I'm afraid.

SIOP 10 years ago |

Thanks for this. Fascinating. A really interesting article.

Houshalter 10 years ago |

Most likely it's mostly AI, that redirects to a human when it's confused. Most of those responses look pre-programmed.

wallzz 10 years ago |

Does anyone know how I can try it? I can't find a link or anything like that.

richard_mcp 10 years ago | |

Looks like a limited roll-out to users in the Bay Area.

niix 10 years ago |

Reminds me of the chat bots I used to write for AOL and AIM "progz".

pinkrooftop 10 years ago |

Human or AI the level of service provided seems pretty amazing

Maksadbek 10 years ago |

Is M currently only for the USA? It has not appeared in my contacts yet.

dyeje 10 years ago |

It ends so many sentences with exclamations!

yoavm 10 years ago |

this might be one of the worst jobs ever