Google turned me into a serial killer

Google turned me into a serial killer(hristo-georgiev.com)

840 points by Kaizeras 5 years ago | 294 comments

_etyf 5 years ago |

Google's Knowledge Graph info boxes are automatically-generated and littered with errors. I've been burned twice on operating hours, once for my local bank and once for a convenience store, driving over each time only to find the place closed. In both cases, the correct hours were posted on the business's website. I've also seen bad KG results for medical conditions, listing the wrong symptoms or describing easily-treatable maladies as "Incurable". Now I actively ignore the info box and intentionally click through to an authoritative non-Google website.

To a non-technical user, I'm sure the box looks like a human-curated result which they're more likely to trust. Maybe that's the goal of Google's UI choices. Couldn't be further from the truth.

slipframe 5 years ago | |

Technology Connections had a great observation about this in his video about touch lamps: the problem with Google's knowledge graph is that it takes everything it reads on the internet at face value.

https://youtu.be/TbHBHhZOglw?t=58 (0:58 through 4:20)

agumonkey 5 years ago | | |

> takes everything it reads on the internet at face value.

it's terrible but a lot of people [0] think digital information, and thus internet, is truth

[0] and if google itself falls for this.. no wonder if people do too

progre 5 years ago | | |

Worth watching for the capacitater joke alone

judge2020 5 years ago | |

> . In both cases, the correct hours were posted on the business's website

Google has started favoring info inputted into the Google My Business listing (the same one that dictates what appears in Google Maps for claimed businesses), so if the owners are updating their website but not updating that (or checking their email for emails from google that say '[business], are you open on 4th of July?') it'll show incorrect info.

da_chicken 5 years ago | | |

That's still Google's fault. If their service is providing out-of-date information, it's their fault. Making it sound like the business's fault for not using Google's other service is just reinforcing the idea that Google's information is the only information that's real.

This is the problem with the web search and scaping provider also trying to be a thousand other services at the same time. It's a constant conflict of interest.

neartheplain 5 years ago | | |

Even if 90% of businesses play along (doing Google's data entry work for them), the 10% chance of my wasting half an hour makes it worth ignoring the info box and double-checking each time.

nyhc99 5 years ago | | |

Google recently changed the hours of operation on my business' Google My Business listing to an hour later than I had it set to. I have no idea why; I certainly didn't tell them to. We found out when we got some angry calls from customers who showed up after we were closed. It's hard to explain to someone that we don't own our own listing on Google, and that we're at their mercy to approve the changes we submit and hope they don't make their own arbitrary updates.

1vuio0pswjnm7 5 years ago | |

The even uglier aspect of this, I think, is that Google is trying to keep you on their site and discourage you from visiting the authoritative site. That is classic middleman (unwanted intermediary) behaviour. Google produces no content but they sure interested in everyone else's.

gruez 5 years ago | | |

> That is classic middleman (unwanted intermediary) behaviour.

Unwanted by whom? The reader? If you don't want google as an intermediary you can just... not use them. If you want to search but not have an intermediary, you can just run your own search engine and index. The publisher? Again, if a publisher doesn't want an search engine to be an intermediary then they can always get their site delisted. Turns out that most sites don't do that, or do and then quickly revert back, so clearly they want Google as an intermediary.

cntrmmbrpsswrd 5 years ago | |

This is not true. While the content may initially be auto-generated, the Google KG is worked on extensively by real people. There are regional teams which work to verify and source the information that is out there. My sister works on the traditional Chinese team (mostly out of Taiwan or Taiwanese ABCs). These teams, as far as I know, are WFH contract workers.

neartheplain 5 years ago | | |

Can users tell when info box answers have been auto-generated vs. curated? How often is KG content updated or re-verified, as in the case of businesses changing their operating hours? Are KG verifiers subject matter experts, e.g. medical professionals, or unskilled workers?

Too 5 years ago | | |

Real people or not, I've been burned by false opening hours on google maps so many times I will never trust it again. I'd rather spend 20 seconds extra to visit the official webpage rather than a 25% chance of traveling half an hour to realize it's closed. Even if extensive work by a human team can reduce this to a 10% chance it's still not worth my trouble.

As the old adage goes. The only thing worse than no documentation is outdated/incorrect documentation.

joe_the_user 5 years ago | | |

I believe you but anyone got links? This is interesting information.

freedomben 5 years ago | |

As far as I can tell the KG is also the source for Google Home when it answers questions. That's even more rough since it sounds authoritative and there are no visual cues to imply that it may be inferred.

apozem 5 years ago | | |

Right. Voice assistants are a much harder format for Google, because you can’t just display a list of links. I ask my tube a question, I expect one (1) correct answer.

blueblimp 5 years ago | |

I once saw it source its info from a wrong answer in a multiple-choice quiz.

philwelch 5 years ago | |

If Google isn’t legally liable for this under defamation statutes, maybe they should be.

mcv 5 years ago | | |

The example in the article sounds like a pretty blatant case of libel/slander. Though not as downright dangerous as some of your examples.

matheusmoreira 5 years ago | |

Google's medical information boxes are irresponsible. They are not accurate enough to serve the general population. They aren't detailed enough to be a useful reference for professionals either. Even a Wikipedia article is a better resource.

jokethrowaway 5 years ago | |

Opening hours has been mostly surprisingly accurate - but I agree, it failed a few times.

Opening hours and Google Maps are the only things that get me on Google from DuckDuckGo these days (even if Google Maps keeps getting worse every year).

cwkoss 5 years ago |

You should seriously consider suing for defamation if this isn't fixed within a few days. This is an egregious error, especially if it was done by an automated system (and this could be a systemic issue affecting many others).

I think you have a decent chance of getting a five figure+ settlement from this. Talk to a lawyer about your options.

EDIT: When I search "Hristo Georgiev" (from US IP) there is no longer an image in the infobox. (As of 21:55:10 UTC, June 24, 2021)

I think a google engineer saw this HN post :-D

(You could still talk to a lawyer - remedying it now does not alter the fact that you were previously defamed. But Google has a stronger position having now remedied it)

tzs 5 years ago |

> It turns out that Google's knowledge graph algorithm somehow falsely associated my photo with the Wikipedia article about the serial killer. Which is also surprisingly strange because my name isn't special or unique at all; there are literally hundreds of other people with my name, and despite of all that, my personal photo ended up being associated with a serial killer. I can't really explain to myself how this happened, but it's weird. In any case, I am now in the process of reporting this Knowledge Graph bug to Google.

I believe that there is a simpler explanation.

The Wikipedia article is there in that side box because it is the top hit for "hristo georgiev" on Google's main search page. The picture is there because it is the top hit for "hristo georgiev" on Google's image search page.

inimino 5 years ago | |

Yes, but: the text snippet is clearly labeled "Wikipedia" and is in the same box with the photo. Combined with the Wikipedia article being the first result, this would certainly give the average person casually searching the impression that the data in the box comes from Wikipedia, which tends to be a reasonably accurate, conservative source on living persons.

The idea of mashing up the first image search result with the wikipedia snippet with no indication they are from totally unrelated sources seems pretty careless and irresponsible.

hermitdev 5 years ago | | |

One might be so inclined to label Google a peddler of fabricated misinformation...

I seriously can't believe that the sources of the image and text aren't labeled with their respective sources. Doing so seems basic, obvious and trivial. Not doing so seems to be a blatant attempt to hide expected inaccuracies and make meaningless combinations of information seem more authoritative than it actually is.

While I think this individual would have a hard time in any court system, let alone the US court system, could Wikipedia perhaps have a claim of damages for libel (or something to this effect) due to misattributed information and reputational damage?

cwkoss 5 years ago | |

That the explanation is simple doesn't make the result any less obscenely defamatory.

joe_the_user 5 years ago | |

"That explanation is the same explanation"

Or rather, all the knowledge graphic does is stuff not much more complicated than associate picture to name to article.

But what is pernicious is that presents itself as a knowledge graph and sometimes appears to have knowledge and so it seems to people to be a somewhat authoritative statement. And that causes people not-critically-thinking people to reach false and destructive beliefs.

jetrink 5 years ago |

The info boxes sometimes contain surprising errors. I recently searched for Picasa and was informed that it was invented by Pablo Picasso in 2002! Google 'knows' Picasso died in the 1970s and it 'knows' he released a popular software program in 2002. That's an obvious contradiction requiring the simplest of rules to detect, but the system is just a dumb text extractor. (Interestingly, Google assistant gave the correct answer for 'who created Picasa?' so that system must use a different knowledge-base.)

I, being compulsively helpful, reported the error and it was quickly fixed. Maybe I'm part of the problem.

yongjik 5 years ago |

I think it was Rachel - many years ago, for a few days, when you search "Rachel" at Google it would show a snippet from Wikipedia:

> Rachel was a Biblical figure, the favorite of Jacob's two wives, and the mother of Joseph and Benjamin, two of the twelve progenitors of the tribes of Israel ...

... along with a happy smiling face of some office worker somewhere, named Rachel, of course. It was glorious.

teraflop 5 years ago |

Sci-fi author Greg Egan has written about falling victim to a similar phenomenon, where photos of other people were showing up next to descriptions of him. No serial killers involved, though.

http://gregegan.net/ESSAYS/GOOGLE/Google.html

hpkuarg 5 years ago |

I'm glad this guy has a sense of humor about this, but I really hope that Google does right by him and that he doesn't get stuck in their Byzantine customer service process.

jeswin 5 years ago | |

Actually he should sue them. The damage to his reputation and prospects are real.

acjohnson55 5 years ago | | |

He should sue to get it fixed if they don't fix it, but I believe he'd have to show evidence of harm if he were to sue for damages.

protomyth 5 years ago | | |

Figuring out or demonstrating the "Quantifiably injurious" part might be very hard https://thelawdictionary.org/article/when-to-sue-for-defamat...

xxs 5 years ago | | |

Right to be forgotten would be much easier to invoke.

Suing in Bulgaria is likely to end up nowhere with Bulgaria having the worst courts in the EU (a primary reason not being in Schengen)

paulpauper 5 years ago | | |

this is google we're talking about. so good luck with that.

MeinBlutIstBlau 5 years ago | | |

I just looked it up and google does have a workflow that allows you to send them a court order.

Let's start there first before issuing a lawsuit. You only sue if you can prove damages for defamation. Which, as tepid as most people are on here about patent trolls, I can't imaging taking on google. It's literally like taking on god at this point.

runawaybottle 5 years ago | |

Lol what? They have customer service process? News to me, how to reach them?

martyvis 5 years ago | | |

I found a Google One subscription is helpful for that. It's only a few dollars a month, which in fact I fund for free using Google Rewards dollars. You then get to speak to a real live person that actually respond properly, even with handwritten emails.

SrslyJosh 5 years ago |

> The rampant spread of fake news and cancel culture has made literally everyone who's not anonymous vulnerable.

Google screwing up their knowledge graph is neither "fake news" nor "cancel culture". Misusing these terms makes them useless for actual discussion.

dcow 5 years ago | |

You missed the point. In a world of rampant “fake news” and “cancel culture”, Google screwing up their knowledge graph is dangerous ammunition that could be used to attack somebody’s reputation (which might lead to real/permanent damages before or regardless of whether the error can be corrected).

DoreenMichele 5 years ago |

This kind of thing is part of why I go ask people questions in areas where I don't have a lot of domain knowledge rather than just search for it. (That's not to imply I don't do a search first. That seems awkwardly worded and I can't think of a better way to say it.)

I'm a decent read of people and talented at figuring out who actually makes sense and should be listened to. So going to people with domain knowledge and talking to them is usually the most efficient and effective means for me to get meaningful information when I am out of my depth.

It's also why I try to be patient with people online and answer seemingly "dumb" questions instead of telling people to google it. In many cases, if you aren't familiar with the subject, you won't know the best search terms and you won't know that the top result is commercial garbage and not really the gold standard source on the subject.

I routinely provide links for things like SRO because not only do people often not know that stands for Single Room Occupancy, if you google it you get a variety of unrelated hits (Standing Room Only, for example).

https://en.m.wikipedia.org/wiki/Single_room_occupancy

I was involved for a time with The TAG Project. I generally try to remember to provide a link for that as well because when I search for it, the thing I was involved with is not the top hit.

The top hit is thetagproject.com. I worked for tagfam.org and it is typically the third hit when I search for it.

I fairly often see people being obnoxious about "you should Google that" and I sometimes understand why they are aggravated with certain things, but I generally think that's asshole behavior.

walrus01 5 years ago |

If you want to see a terrible example of Google automatically finding a "best" search result for the #1 entry of something, google the following:

"how many raccoons can fit"

generationP 5 years ago |

Greg Egan (the geometer and sci-fi writer) had a similar (if much less harmful) issue for years, which he blogged about at https://www.gregegan.net/ESSAYS/GOOGLE/Google.html .

"We made a profile of you, and if you think it's wrong, you'll have to register and share the right info with us" has been one of the safest giveaways of data hucksters. I used to think Google was the one exception, but by now I believe I should have trusted the rule.

At least someone is having fun with it: https://www.forgednfast.com/why-was-google-search-telling-pe....

a-and 5 years ago |

What's unfortunate is that Hristo Georgiev is a very common Bulgarian name.

This isn't a case where a highly uncommon name can lead to a high degree of certainty in association.

dhosek 5 years ago | |

I have a longstanding project where I rank graduate creative writing programs by their alumni's appearances in a selection of prize anthologies. This means I spend a lot of time googling authors. There are a number of authors whose internet presence is shadowed by criminals with the same name. Then there are those who are shadowed by more famous people of the same name such as the Australian poet Kate Middleton or the New England essayist Ravi Shankar. Then there are the authors whose names are the same as other writers. So far I haven't had to do an IMDB-style (II) after someone's name although it's come close with some authors differing only by the presence or absence of a middle initial. And one instance I had to try three times to find the correct author of one particular name because there were two others (not anthologized) who published under identical names. I have a short story that turns on the whole name confusion thing that was published last year. https://sandyriverreview.com/wp-content/uploads/2020/11/2020...

nonameiguess 5 years ago | | |

I was so happy to find out 15 years ago that I have the same name as multiple pro athletes, both about the same age as me, too. I very much want to be un-Googleable.

ab_testing 5 years ago |

I think this is a legit case of defamation and the author should be able to sue Google in local courts and get a judgement.

eitland 5 years ago |

News at 11: Google search results are now almost as bad as what they replaced, sometimes worse.

I've said some time ago already that there is a multi billion niche waiting for whoever wants to do what Google used to do:

- input field in middle of page

- user types text into field

- software shows list of pages that contain said text. Modifiers can be used to influence exactly how exactly the matching will be

- the company is nice and reliable and goes out of their way not to be evil

progre 5 years ago | |

I'm still sad that runnaroo went away as it felt like searching google felt like 10 years ago.

https://www.runnaroo.com/

tanto 5 years ago |

Is not really hard to imagine that more automation of this kind might result in some automated processes which results in someone get shot at a border by light handed policy.

This kind of thing should have very hard legal consequences for a company like Google.

Imagine being labeled as some kind of murder/rapist/pedophile whatever and moving into a neighborhood which gets angry fast.

varjag 5 years ago |

A great ice breaker for your next job interview.

soneca 5 years ago | |

And a deal breaker for dating apps.

ocdtrekkie 5 years ago | | |

I mean, the fact that the serial killer in question has been dead for forty years makes this conclusively "a funny story" and not "a red flag". But it also probably is mandatory to cover this ice breaker before the person you swiped on Googles you.

"Just FYI, funny story, Google thinks I'm a serial killer. But that guy's been dead for years, and Google is mixed up."

andai 5 years ago | | |

You'd be surprised...

html5web 5 years ago |

Google your name from time to time. I do it to protect my personal information. I don't want my personal information, including my home address, email, phone number etc. to be exposed on search.

Tenoke 5 years ago | |

I literally just got my wallet back because someone Googled me, and could find an email easily since my site ranks well for my name so there's definitely some benefit to being exposed.

kevingadd 5 years ago | | |

Having your personal email on your personal website is a little different from someone having posted your home address and phone # on some 'pay us money for people's personal info' website, though, and the latter is what you're going to spot by doing searches

ghaff 5 years ago | |

Some things are public records or, like email, are pretty much inevitably exposed if you do things in public. But fortunately a lot of the "deep web" stuff that used to be free generally isn't any longer. (And cell phone numbers aren't accessible nearly as much as landlines were.)

neil_s 5 years ago |

I filed a bug. I don't work on the KG team but hopefully it'll get redirected to the right people and fixed asap.

murphyslab 5 years ago |

I recently encountered the same underlying problem with Google's knowledge graph.

I do a lot of scientific image analysis using an ancient (but reliable!) piece of software called ImageJ [0]. There's a more recent distro of the same called FIJI [1]. So when I tried looking for how to extract EXIF data for GPS coordinates using ImageJ (not even mentioning FIJI), Google returned an info box about the Fiji-the-nation and provided the coordinates of said nation:

https://i.imgur.com/HxSh8Zv.png

[0]: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5554542/

[1]: https://imagej.net/software/fiji/

tgbugs 5 years ago | |

Glorious. False positives on the synonym match gone wild. This is the kind of thing that deep learning was supposed to solve. The fact that it is failing from specific (imagej) to general (fiji) is beyond the wildest dreams of anyone doing quality assurance. I'm guessing the neural nets have been slacking by trying to do pure substitution matches and the devs have been rewarding them at the worst possible time. Wow. This is like seeing a toddler who is screaming in public and the parents are actively giving them candy to try and get them to stop. Wow.

In case anyone doubts that this is real I'm seeing it too https://i.imgur.com/PXepl6C.png

eitland 5 years ago | | |

This started going downhill about a decade ago. I wrote about it 8 years ago and I am fairly sure I had seen it going on for a while already at that time: https://techinorg.blogspot.com/2013/03/?m=0

pulse7 5 years ago |

I would sue for damage.

EDIT: Because they put an image with unterlated information together in such a way that it misleads people.

encryptluks2 5 years ago | |

Damages require proof. Unless they can show they lost something because of the mistake, then there are no damages.

techlaw 5 years ago | | |

Although in the US damages for defamation can include compensatory damages (intended to "make the plaintiff whole" by compensating for monetary losses) they can also include general damages for non-economic impacts (for example mental anguish & damage to reputation) as well as other types of damages.

However, not all US states allow all types of damage claims and/or have special rules or higher burdens of proof related to those types of claims.

Generally speaking though, it is incorrect to say that somebody must show that they have had actual, monetary damages in order to be successful in a defamation lawsuit.

This overview from the Legal Information Institute (Cornell Law School) has some helpful info: https://www.law.cornell.edu/wex/defamation

aikah 5 years ago | | |

> Damages require proof. Unless they can show they lost something because of the mistake, then there are no damages.

You're basing your understanding of slander/libel on US laws I presume? If that person lives in Europe, generally, the bar for a successful lawsuit is __extremly low__ , it only requires the information to be blatantly false, there is no need to demonstrate the victim incurred any damages.

andai 5 years ago | | |

Isn't the search result page a form of slander? The burden of proof rests entirely on the slanderer.

jeswin 5 years ago | | |

It could certainly affect someone's mental health and confidence.

smdz 5 years ago |

Similar mismatch stuff exists with Google Scholar on two patent applications(now abandoned) where I contributed. It has my name but somebody else's photo and job title. Cannot apply to correct it just because I do not have any University email.

frumper 5 years ago |

This is a winning headline. It's got drama and levity that just drew me in. Bravo sir.

edit: The article was an interesting read too.

protomyth 5 years ago |

Google also has a problem with its news.google.com when they get their news from certain sources. Snoopes headlines are shortened and put up as if they were true. This resulted into some really vile headlines.

bryanrasmussen 5 years ago |

>I am now in the process of reporting this Knowledge Graph bug to Google.

In an ironic twist, the process of trying to get support from Google will probably drive him to become a serial killer.

shadowgovt 5 years ago | |

It's actually pretty simple: click the Feedback button, identify the flawed data, and explain the need to correct it.

That dataflow is human-curated.

DevKoala 5 years ago |

When solving problems at scale with a 0.0000001% error rate ends up destroying someone’s life.

H8crilA 5 years ago | |

The error rate is way, way higher than that. But yes, still low, and that's what almost certainly happened here.

DevKoala 5 years ago | | |

I was just throwing a number. I don’t work at Google, but I have heard from friends who work at that scale that when bugs only affect a few thousand people, it is “safe” to ship.

eitland 5 years ago | |

The error rate is so high that I see errors on a weekly or even daily basis when I fall back to Google and I only do 2 - 40 searches a day on Google these days.

I'm not searching a lot of Bulgarian names but I do search for npm packages and the like and despite my best efforts they frequently show me something completely different than what I searched for.

The saddest thing however is that DDG is just as bad, I use it just because I don't like Google and because it is easier to get from DDG to Google than the other way around.

throwdbaaway 5 years ago |

Somehow there is no mention here yet about this infamous rant: https://dgraph.io/blog/post/why-google-needed-graph-serving-...

> I started a project to unite all Google OneBoxes under this graph indexing system, which involved weather, flights, events, and so on.

Now we know who's to blame :)

romseb 5 years ago |

Suggestion: Create a new Wikipedia article with the same Name, upload your own profile photo onto it and put a Disambiguation (Programmer/Hacker) in it so that Google will associate it correctly.

Alternatively, add a drawing of the rapist to the original Wikipedia article.

Interestingly, for me, another Hristo (german principal investigator) appears on the right side when I google the name.

Tenoke 5 years ago | |

You'd be surprised to learn how hard it is to add anything to Wikipedia and have it stay accepted.

romseb 5 years ago | | |

Yes, I have seen the deletion fetish on Wikipedia first-hand many times. It made me stop contributing to it.

hadlock 5 years ago | | |

I added the wiki page for Kamala Harris' dad (a professor) to wikipedia (shortly after Biden announced her as his running mate in the election), and within literally 2 minutes, two different people had flagged it for deletion. One reason given was that academics need to be especially notable to warrant a wiki page, completely ignoring his relation to her daughter, which was mentioned in the original draft of the article stub. All proposals for deletion have since been removed, but I was surprised at the ferocity at which users want to delete new articles.

ishiz 5 years ago | |

> Suggestion: Create a new Wikipedia article with the same Name, upload your own profile photo onto it and put a Disambiguation (Programmer/Hacker) in it so that Google will associate it correctly.

That Wikipedia article would probably meet the criteria for Speedy Deletion and just causes unnecessary effort for the Wikipedia editors.

xxs 5 years ago | |

No way he'd be able to add an article about himself. He'd need to be somewhat famous/notable for the article to stick...

ant6n 5 years ago | | |

Hes famous for falsely having been accused of being a serial killer by Google!

heavyset_go 5 years ago | |

It's against Wikipedia's rules to create articles about yourself.

spoonjim 5 years ago |

If you find the right lawyer you will get at least $1 million out of this. Even if you don't feel like doing this, PLEASE do it for the greater good. Google will only start caring about these things if it costs them money. Money is the only language a corporation is fundamentally equipped to understand.

draaglom 5 years ago |