IBM's Watson For Business: The $1 Billion Siri Slayer

(fastcompany.com)

109 points by shuaib 12 years ago | 68 comments

hooande 12 years ago |

Watson could be described as a natural language search engine. This is no small thing. Its linguistic abilities were showcased on Jeopardy, though its wins might have had more to do with speed of processing and "buzzing in" than it did with being really smart. Watson is quite possibly the most sophisticated specific-use natural language program to ever exist (as opposed to general-use NLP, which is a Star Trek-level problem).

That said, the approach and subsequent utility might not live up to the hype that IBM is pumping out. It's one thing to search very quickly. Being able to discover patterns that lead to new levels of understanding and predictable relationships is another thing entirely. IBM's approach is more search than prediction, in part because they only have so much data to work with. All of the medical books in the world are a drop in the bucket in terms of algorithmic understanding. Watson has mastered working with all available information. Collecting and processing massive data sets is another challenge that IBM hasn't been willing to tackle yet.

IBM is billing Watson as the all-singing, all-dancing solution to the world's data problems. They're tackling a lot of problems in diverse areas. I hope it works out; the world needs as much help as it can get. But IBM has shifted its core mission to consulting, and I wonder if Watson's purpose will be to support that more than to become a Super Siri-type software project that could do the most good.

taejo 12 years ago | |

> its wins might have had more to do with speed of processing and "buzzing in" than it did with being really smart.

It may only be better than humans at buzzing in, but being as good as humans at natural language search while being faster and more consistent (doesn't make mistakes when tired; works just as well in Kampala as in New York; can be audited when it makes mistakes) is already better than humans.

bnegreve 12 years ago | |

> That said, the approach and subsequent utility might not live up to the hype that IBM is pumping out. It's one thing to search very quickly. Being able to discover patterns that lead to new levels of understanding and predictable relationships is another thing entirely.

I don't get this. Watson might not be able to provide new levels of understanding, but it still does a much better job than current search engines, so why do you think it cannot live up to the hype? Why do you think that it is not a significant improvement? What makes you think the approach is wrong? I need some clarification :)

macspoofing 12 years ago | | |

>but it still does a much better job than current search engines

Does it?

mikeash 12 years ago | |

Regarding Watson's speed on Jeopardy, that was certainly a big advantage for it. However, consider that no matter how fast it is, a machine that "only" gets 50% of the questions right after it buzzes in (which would be an amazing accomplishment already) would lose the game horribly. That it won so solidly shows that it goes well beyond mere speed.
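
A rough sketch of that arithmetic, assuming the usual Jeopardy scoring rule that a wrong answer deducts the clue's value. The function name and the sample accuracies are illustrative only, not figures from the broadcast.

```python
def expected_gain(accuracy: float, clue_value: int) -> float:
    """Expected score change from one buzz.

    Assumes a correct answer adds clue_value and a wrong answer
    subtracts it (an assumption about the scoring, not Watson data).
    """
    return accuracy * clue_value - (1 - accuracy) * clue_value


# Hypothetical accuracies, chosen only to show the trend.
for accuracy in (0.5, 0.7, 0.9):
    print(f"accuracy={accuracy:.0%} expected gain per $1000 clue: "
          f"{expected_gain(accuracy, 1000):.0f}")
```

Under that assumption, a 50% machine nets zero on average every time it buzzes in, however fast it is, while opponents still score on whatever it misses. Speed only pays off once accuracy is well above one half.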

AJ007 12 years ago | | |

From the consumer's end, we expect computers to produce accurate calculations almost always. If a calculator produced the wrong answer to a basic mathematical function we would throw it out.

Humans are error prone, even when doing things they know and are good at.

Artificial intelligence is marketed as being a machine that is as smart as a human, but somehow we infer that because AI is a machine it will not make human mistakes. Mistakes are what produce learning.

The question becomes: do we only release AI for public use when it is assigned to a narrow range of problems and trained to 99.9% accuracy? Or does a consumer just throw AI at unknown, or even non-trainable, problems and take the result with a grain of salt? (Non-trainable being something like predicting the value of the S&P 500 in 24 months.)

Perhaps new words will be formed to describe AI, its behavior, accuracy, and experience? For now there is a lot of "one size fits all" and "holy grail" seeking. Big companies with armies of salespeople seem to prefer this.

jijiwaiwai 12 years ago | |

Watson is now only a search engine; you need to train it and define your own domain models and logic inside the app. Whether it blooms depends on whether there can be enough apps on the platform. Remember: those apps are enterprise level and will always be developed by companies. Why don't these companies just deploy their apps in other clouds and connect them to a Siri-like voice interface? I can't see any value in this platform. Vertical search and IP is not hard for developers today. Small companies can do that without IBM's help.

eggoa 12 years ago |

“allows business users to send natural language questions and raw data sets into the cloud, for Watson to crunch, uncover, and visualize insights; without the need for advanced analytics training. After analyzing the data, Watson will deliver results to its users through graphic representations that are easy to understand, interact with, and share with colleagues; all in a matter of minutes.”

This sounds a lot like my present job description.

tsunamifury 12 years ago | |

Remember, no one cares about data, they want a story and the evidence to back that up. Watson may just make it easier for you to focus on the story telling and evidence rather than the crunching.

rapht 12 years ago | |

And mine too! Better become a software engineer, I guess...

nl 12 years ago |

I'm currently working on my own open source version of Watson/Siri/Google Now [1]. (It can answer "What is the capital of Brazil" Yay!).

As part of that I've been learning as much as I can about how Watson actually works.

The most useful information can be found by Googling "Deep QA" which is what IBM has dubbed their question answering pipeline.

A slide deck like [2] is a good place to start if you are interested in this.

[1] Yeah, I know that is kind of a crazy thing to work on. It's actually even more stupid than you may think, because I want to make it self-hostable, with the ability to keep your own data separate from the rest of the application (i.e., enforcing privacy).

[2] http://www.cs.hku.hk/news/2011/WatsonHongKong_talk_ppt.pdf

xerophtye 12 years ago | |

Not crazy at all. I have been dreaming of working on something similar since even before Watson was announced (though I was just a freshman back then), and I was hellishly jealous of that team. So I can totally understand your motivations for this.

Do hit me up if you'd welcome help with this. Email is in my HN profile.

nl 12 years ago | | |

You have to fill in your email in the about section for it to be visible. Mine should be showing up though - feel free to get in touch.

fortepianissimo 12 years ago |

I really don't appreciate the media's blood thirst - the slayer of this and the killer of that. Why can't we just have something that contributes in a non-zero-sum game?

normloman 12 years ago | |

You never had journalistic training. Journalists frame stories to satisfy criteria of "newsworthiness," a combined measure of the story's importance, urgency, and entertainment value. Just like a good novel has a dramatic conflict, journalists are taught to report on stories with conflict (which are often more interesting to read than dull, peaceful hum-drum). And if the story doesn't have conflict built-in, they make conflict by framing the story to include a conflict narrative. Journalists refer to the way they frame a story as their "angle." Anything can be news with the right angle.

Real life example: in the 90s, journalists reported on the "Great Hacker War," a "virtual gang war" between two competing hacker groups, LOD & MOD. In reality, the event was a scuffle between some hackers in a chat room, which resulted in some minor hacking, name calling, and prank phone calls. But that didn't make for a great headline.

milhous 12 years ago | | |

The headline is so absurd that I'm not reading it. But an excellent analysis.

sheetjs 12 years ago | |

Relevant clip from the Daily Show: http://www.thedailyshow.com/watch/thu-february-4-2010/the-bl...

ar7hur 12 years ago |

So far Watson for developers/business is 100% PR and 0% real. In November they announced the Watson API. Where is the API? Where is the documentation? Where are the examples? Google it and you'll get a torrent of PDF press releases and invitations to call their sales team.

I'm afraid Watson is just a PR stunt. Was it oversold by IBM engineers to their executives? Or by the executives to the PR team? Or by the PR team to the press? I don't know. But they lost control of it.

Spooky23 12 years ago | |

Students at RPI have access to it; I saw a student app using it at a hackathon a while back.

http://watson.rpi.edu/

kitd 12 years ago | |

Access to the "ecosystem" (ie API) is controlled at the moment, but you can request access here: http://www-03.ibm.com/innovation/us/watson/getting_started.s...

pervycreeper 12 years ago |

I wonder if the medical profession's refusal to adopt Watson has more to do with its performance or with guild protectionism.

bhouston 12 years ago |

Seems like a lot of hype still. It needs a killer app and I haven't yet seen one.

kylemclaren 12 years ago | |

Something like this?

http://youtu.be/8lGJ0h_jAp8

elwell 12 years ago | | |

That doesn't look like a prototype, just CGI.

scotty79 12 years ago |

Watson in its original incarnation could be already quite useful. It could provide workers with valuable input on what their manager has in mind when he babbles incomprehensibly throwing his favorite buzzwords at random.

At least 20% of a full-stack programmer's job is to figure out how people want the computer to behave, and all we have to work with is the chaos of words that flow from their mouths and fingers.

wil421 12 years ago |

So Siri is for consumer devices and Watson is going to be for businesses. I don't think they are going to be killing each other, since they are targeting different markets.

taopao 12 years ago | |

Most of Siri's Watson-style smarts are just Wolfram Alpha queries, no?

RandallBrown 12 years ago | | |

I'm not sure.

I asked Siri "Did Michigan win its bowl game?" and it gave me the right answer and said "Michigan lost to Kansas State in the Buffalo Wild Wings Bowl."

Wolfram Alpha, with the same query, just gave me information about the state of Michigan.

Bing gave me search results about Michigan football, and Google's top result was an article about the actual bowl game. I don't have an Android phone to try Google Now.

robg 12 years ago | |

Until consumer-oriented businesses start building apps atop it.

mcintyre1994 12 years ago | | |

I'd like to see that happen if only to push Google to open a Google Now API. To me this sounds like Wolfram Alpha for business more than Siri or Google Now though.

jonathansizz 12 years ago | |

Yes, Google Now is a much bigger competitor.

balozi 12 years ago |

Looks like IBM might be trying to buy good press. Plus, it's just over a week until the Q4 earnings report. I bet Rometty has seen the numbers.

robg 12 years ago |

Best part? What was living room-sized is now three pizza boxes. It's less hardware, more software. That's exciting! AWS for AI.

lingben 12 years ago | |

I heard this as well, any idea how this was accomplished?

mbesto 12 years ago | | |

http://www.reddit.com/r/technology/comments/1ushn7/ibms_wats...

Comment 1 - Most likely it's not a full "Watson" but rather a network appliance that slots into a rack. Watson is powered by 2,880 8-core IBM POWER7 processors, which AFAIK haven't received a core bump or a die shrink since their introduction in 2011.

Comment 2 - POWER7 (which came out in 2009) was replaced by POWER7+ in 2012. IBM shrunk the lithography but kept the die size the same, so they used the extra space for more cache, a crypto accelerator, a compression/decompression accelerator, and some other goodies. They were able to bump up the clock speed as well. Core for core, POWER7+ is about a 20% improvement, but you're right, no more cores per socket, so there is no way they would see the kind of shrink described in the article if they kept the same amount of compute power. IBM did come out with a new blade design (Flex Systems) with denser packaging, but that combined with the faster CPU will still only get them about 2/3 of the way there (still impressive).

Already__Taken 12 years ago | | |

More likely it's because it won't be used in situations that require sub-1-second answers, which might make more modest hardware acceptable.

wmf 12 years ago | | |

AFAIK the amount of hardware needed depends on the size of your corpus, so I would expect it to vary.

robg 12 years ago | | |

Honestly, look at the new Mac Pros. The computing power compared with even 10 years ago is amazing. Watson will soon be software only for most standard servers. Looking at the SoftLayer acquisition in that light is very exciting.

hipaulshi 12 years ago |

Wow. I wish they could provide an open source library. But that was probably too much to ask for. Still, this is amazing news.

isaacpei 12 years ago | |

It's funny that just 3 days ago the Wall Street Journal talked about this: "IBM Struggles to Turn Watson Computer Into Big Business" http://online.wsj.com/news/articles/SB1000142405270230488710...

It would be more convincing if Watson accomplished some real knowledge achievement, such as finding a cure for a specific disease or publishing some papers that enhance our understanding of some research topics ...

Unix started small, and it works. Google started small, and it helps us tremendously. I haven't seen anything that started as a big business plan succeed greatly. Even Microsoft started small ...

nathan_f77 12 years ago |

This is awesome. It really makes me feel like we're taking one more step into the future.