A catalog of naturally occurring images whose Apple NeuralHash is identical

A catalog of naturally occurring images whose Apple NeuralHash is identical(github.com)

337 points by hongsy 4 years ago | 294 comments

foxfluff 4 years ago |

My take on this is that the system is by and large useless.

It won't catch anything but the dumbest of dumb criminals, because those who care about CSAM can surely figure out a better way to share images, or find a way to obfuscate their images enough to bypass the system (the lower the false positive rate, the easier it must be to trick the system).

So what's left when all the criminals this is supposed to catch have figured it out?

False positives. Only false positives.

Is it really worth turning personal devices into snitches that don't even do a good job of protecting children?

Also, numbers about false positives must be taken with a grain of salt because of the non-uniform distribution of perceptual hashes. It might be that your random vacation photos and kitty pics have a 1-in-a-million chance of a fapo, but someone who happens to (say) live in an apartment that has been laid out very similarly to a scene in pictures appearing in the CSAM database may have a massively higher chance of fapos for photos taken in their home.

tzs 4 years ago | |

> It won't catch anything but the dumbest of dumb criminals

Dumb is a pretty accurate description of a large fraction of criminals. For the most part you only get smart criminals when you are talking about crimes where you have to be smart to even plan and carry out the crime.

BiteCode_dev 4 years ago | | |

Given the average user don't know what a url is and a pedophile can use the darknet, I'd say criminals are not all dumb.

nullc 4 years ago | | |

Yes, but when you admit that the target is just the dumb criminals, then why adopt a scheme that has false positives?

Decompress and downsample. Drop the least significant bit or two, maybe do it in the dct domain instead. SHA256. It'll preserve matching for at least some cases of recompression and downsampling. But finding an unrelated image that matches is as hard as attacking SHA256, the only false positives that could be found would be from erroneous database entries.

sabellito 4 years ago | | |

> Dumb is a pretty accurate description of a large fraction of criminals.

Is there any reading on that? I'd love it to be true.

throwaway0a5e 4 years ago | | |

You only hear about the criminals who get caught and crimes that go unsolved get blamed on these kinds of criminals.

prirun 4 years ago | |

> Is it really worth turning personal devices into snitches that don't even do a good job of protecting children?

Yes, because the point is not to protect children. It's to get everyone used to the idea that their content is being monitored. Once that is accomplished, other forms of monitoring can and will be added.

panta 4 years ago | | |

Exactly. It's a Trojan Horse (https://en.wikipedia.org/wiki/Trojan_Horse) to make more pervasive individual control the new normality. The current motivations are just a pretext.

Retric 4 years ago | |

Perceptual hashes are only used to reduce the search space for human review. Apple doesn’t have images in the CSAM database to do a comparison, but if it’s just a picture of a door their going to reject it. Also, because human review is an expense Apple’s incentives are to minimize the number of times it happens, thus the requirement for multiple collisions.

woofie11 4 years ago | | |

I don't really want my family photos reviewed by strangers. "Reducing the search space" of photos on my phone isn't an outcome I want to live with. At the time someone is looking at photos of my, my wife/husband/girlfriend/boyfriend, and my kids, they'd better have a darned good reason (e.g. a search warrant).

I'd also appreciate if Apple let me know if my false positives were reviewed and found to not be CASM.

jdavis703 4 years ago | | |

> Apple’s incentives are to minimize the number of times it happens, thus the requirement for multiple collisions.

How can we be sure they won’t cut costs by increasing worker load? I could see them giving each reviewer less time to review individual pictures before passing it on to law enforcement.

zionic 4 years ago | | |

Apple's human review is largely useless.

Trolls will be able to easily use tools slightly modify ambiguous adult porn to collide with a "known CP hash".

A human reviewer will see a blurry grayscale derivative of adult pornographic content and hit "report" every time.

nullc 4 years ago | | |

> Perceptual hashes are only used to reduce the search space for human review.

False. The Apple proposed system leaks the cryptographic keys needed to decode the images conditional on the match (threshold of matches) of the faulty neuralhash perceptual hash.

Matching these hashes results in otherwise encrypted highly confidential data being decodable by apple, accessable on their servers to the relevant staff along with anyone who compromises them or coerces them.

wyager 4 years ago | | |

Edit: I incorrectly claimed there wasn’t manual review - see below

wpietri 4 years ago | |

I knew a probation officer for sex offenders. They told me that most of them were quite dumb. What the repeat offenders were, though, is dedicated. They had all day to try to avoid getting caught, and the PO had a few minutes per week per offender.

It's true that in any arms race, a given advance gets adapted to. This will surely catch a bunch of people up front and then a pretty small number thereafter as the remainder learn to avoid iPhones. But that's how arms races work. You could say that about almost any advance in fighting CSAM.

woofie11 4 years ago | | |

I think it's only the dumb ones who get caught.

Source: I've met a few white collar criminals.

mayoff 4 years ago | |

> It won't catch anything but the dumbest of dumb criminals, because those who care about CSAM can surely figure out a better way to share images

Apparently that better way is by using Facebook. Facebook made 20.3 million reports to NCMEC in 2020.

https://www.missingkids.org/content/dam/missingkids/gethelp/...

foxfluff 4 years ago | | |

Yeah, Facebook's blog post makes me wonder what all the stuff they report actually is. When people say CSAM, I think "kids getting raped" but apparently there's stuff that people find humorous or outrageous and spread it like a meme (and not like pornography).

"We found that more than 90% of this content was the same as or visually similar to previously reported content. And copies of just six videos were responsible for more than half of the child exploitative content we reported in that time period."

"we evaluated 150 accounts that we reported to NCMEC for uploading child exploitative content in July and August of 2020 and January 2021, and we estimate that more than 75% of these people did not exhibit malicious intent (i.e. did not intend to harm a child). Instead, they appeared to share for other reasons, such as outrage or in poor humor (i.e. a child’s genitals being bitten by an animal)."

Based on this, I wouldn't conclude that FB is the platform where people pedos go share their stash of child porn.

Their numbers also include Instagram, which I believe is quite popular among teenagers? I wonder how likely it is for teens' own selfies and group pics get flagged and reported to NCMEC.

(https://about.fb.com/news/2021/02/preventing-child-exploitat...)

nullc 4 years ago | | |

> Facebook made 20.3 million reports to NCMEC in 2020.

Which appears to have resulted in what... 5 prosecutions?

UncleMeat 4 years ago | |

> It won't catch anything but the dumbest of dumb criminals, because those who care about CSAM can surely figure out a better way to share images, or find a way to obfuscate their images enough to bypass the system (the lower the false positive rate, the easier it must be to trick the system).

Given the reported numbers of illegal images detected by similar systems within Facebook and Google, I think it is very clear that this will catch a lot of illegal content.

zionic 4 years ago | | |

Facebook and google are not catching 20m people a year, they're mostly flagging and removing tor/proxy-based throwaway accounts.

volta83 4 years ago | |

The false positive rate reported in the blogpost for imagenet was 1 in a trillion, and the author concludes that this algorithm is better than they expected.

foxfluff 4 years ago | | |

"After running the hashes against 100 million non-CSAM images, Apple found three false positives"

So closer to 1/10M. The reporting threshold is made artificially higher by requiring more than one positive.

But anyway, that's beside the point.

A perceptual hash is not uniformly distributed; it's not a random number. Likewise for photos taken in a specific setting; they do not approach the randomness of a set of random images.

So someone snapping a photos in a setting that has features similar to a set of photos in the CSAM database may risk a massively higher false positive rate. It's no longer a million sided dice, it could be a thousand sided dice when your outputs happen to be clustered around similar values due to similar setting.

But I can't say I care about false positives. To me the system is bad either way.

numbsafari 4 years ago | |

Sometimes the best way to catch the really smart or sophisticated criminals is to exploit their less smart and less sophisticated accomplices, co-conspirators, peers, acquaintances, or even their victims.

devmor 4 years ago | |

The point of these innovations is never the stated purposes. To catch criminals is an excuse. I would bet a great deal that this system is by and large pressured by state actors for the purpose of creating a new political surveillance tool.

ak391 4 years ago | |

can try a web demo of it here on huggingface https://huggingface.co/spaces/akhaliq/AppleNeuralHash2ONNX

woofie11 4 years ago | |

> False positives. Only false positives.

I really doubt this. In the long term, a few people Apple wants to frame will surely slip into the mix. If Apple didn't want Trump to win, a CASM flag a week before the election might do it.

user-the-name 4 years ago | |

> It won't catch anything but the dumbest of dumb criminals

This includes the vast majority of pedophiles.

rowanG077 4 years ago | | |

Do you have any source that pedophilia correlates very strongly with low intelligence?

foxfluff 4 years ago | | |

Where did you find the statistics about pedophiles' intelligence?

roody15 4 years ago |

Apple has yet to make a valid reason for implementing client side CSAM scanning.

According to Apple only images that will be uploaded to iCloud will be scanned.

If this is the case there is zero reason to scan locally and you can just scan the uploaded image once it is on the server.

Apple has not implemented E2E nor has it released a statement indicating this will be implemented in the future.

toxik 4 years ago |

Sigh, for the last time, it doesn't actually matter if the NeuralHash is identical. You need multiple images matching, and then the images are compared by another system on Apple's end, which you don't know anything about.

The system is specifically designed so that colliding images does not pose a threat to the user.

NeuralHash and the CSAM scanning is grotesque, but please, criticize it for what it is, not some bullshit that is easily dismissed as technical ignorance.

scotty79 4 years ago |

Why are exact collisions interesting? They are not intended to be compared exactly.

This algorithm doesn't even give exact matches for the same image on different hardware.

https://github.com/AsuharietYgvar/AppleNeuralHash2ONNX

Note: Neural hash generated here might be a few bits off from one generated on an iOS device. This is expected since different iOS devices generate slightly different hashes anyway. The reason is that neural networks are based on floating-point calculations. The accuracy is highly dependent on the hardware. For smaller networks it won't make any difference. But NeuralHash has 200+ layers, resulting in significant cumulative errors.

AnonC 4 years ago |

A very relevant point on this entire discourse about Apple’s on-device CSAM scanning:

According to the U.S. law, key snippets of which are quoted on the Stratechery blog (by Ben Thompson), Apple isn’t obligated to scan for CSAM. It’s only obligated to act on CSAM if it finds them.

While it’s good for Apple to scan on its systems (iCloud) like Facebook, Google and other companies do on their servers, it’s inappropriate to do it on individual devices, which starts with the assumption that anyone who has iCloud photos enabled is a potential CSAM hoarder and needs to pay with their device’s battery life and time for the scanning to happen and report back. It’s a sort of micro-robbery that Apple is doing on the devices when there is no legal compulsion to do so.

Everything else on trusting Apple’s NeuralHash or the sanctity of the NCMEC hashes come later, IMO.

I sincerely hope Apple realizes that it’s got a dud solution on hand, eats humble pie (which it’s usually not capable of) and ditches this whole thing. I know a lot of egos at Apple are at stake here. But doing the right thing matters for a company that claims that “privacy is a fundamental human right” and has a CEO who’s a member of a marginalized/discriminated community and understands the risks of these efforts.

slownews45 4 years ago |

"This is a false-positive rate of 2 in 2 trillion image pairs (1,431,168^2)"

That is not bad. As a tool to filter down what apple human reviewers need to look at this is pretty good.

Ultimately these images will make it to a human reviewer who can make a call as they would in any flagging system.

Could a backend server side system do a more precise hash (96 bits is not a ton) prior to human review?

nitrogen 4 years ago |

The technology is not why the Apple system is unwanted. It's just extra fuel for the fire.

This system is unwanted because it puts a spy literally in your house and in your hands. It's bad enough that cloud everything blurs the line between what's yours and what's mine. Placing any law enforcement tech on a user's own device takes that line between "public" and "private" and completely erases it.

nullc 4 years ago | |

Absolutely. The problem is Apple introducing a spy into your home.

This alone should be bad enough, but some people are rather trusting. Showing that the spy is also tripping balls both exposes additional risks and emphasizes that Apple neither has their best interest at heart nor is putting adequate care into their actions. The latter gives people reason to question apple's claims of additional protection mechanisms that are non-falsifiable.

nobrains 4 years ago |

Please help me understand. Isn't this the reason why the process involved a final manual review? If so, isn't the point of having identical hashes moot? Or is the point that having more identical hashes means reviewing more personal pictures manually, leading to a privacy issue?

mns 4 years ago | |

I don't think I would trust a huge corporation with this. Plus, leaving the review to some internal classified process where some poor faceless guy needs to reach an unrealistically high quota of reviewed images per day to get his bonus, might be a bit of a risk.

madeofpalk 4 years ago | | |

it's not just internal policy - the safety vouchers will not decrypt (technically impossible) unless there are ~30 matches. It is a policy encoded in cryptography.

StrLght 4 years ago |

I don't really get what this repository is trying to achieve and what's the point of collecting collisions. Collisions will happen, that's just how it is with hashes.

It's already a public knowledge that Apple has 2 more systems (some server-side verification and a manual check later) to prevent false-positives. So what's the point of researching collisions in NeuralHash?

yosito 4 years ago |

> a catalog

Can two collisions really be called a catalog?

dathinab 4 years ago | |

It's a WIP catalog where everyone who stumbles over one can put it in.

It could in the future be used to e.g. improves this algorithms.

yeldarb 4 years ago | |

PRs welcome!

eesmith 4 years ago | |

A catalog of a thousand pages begins with the first entry.

nannal 4 years ago | | |

And a story may start with the first word, but if I present the word "Octopus" and say check out my story, you're going to be well within bounds to question me on it.

supperburg 4 years ago |

It would be a shame if thousands of people regularly uploaded hash collisions to their iCloud overwhelming apples human review capacity

theshadowknows 4 years ago |

I’m glad that people are trying to figure out any technical flaws in the system as best they can, but if I’m being honest I do trust Apple’s engineers to have built something that is solid from a technical stand point.

Am I correct in that the primary reason folks are so upset is that the system could (probably) be easily modified such that -any- content could invoke legal action? That the main problem is really the scanning at all, and not the chances that it could be attacked by an individual actor but instead by a government?

read_if_gay_ 4 years ago | |

Governments don’t get to search your house because some people out there have CP at home. Why should your smartphone be different?

ryeguy_24 4 years ago | | |

This sums up the frustration very eloquently.

pille 4 years ago | |

I can’t speak for everyone, but that’s certainly a technical part of it. Another big part of the problem is that it’s insulting to presume everyone guilty, and make them to use their own resources (own phone, own battery cycles) to investigate them as if they were suspects. But that’s been discussed plenty on other threads here at HN.

tucosan 4 years ago | |

It might be solid from a technical standpoint. Once you built it, governments will be coming and asking for more. Are you aware that the Chinese government already has been granted access to the infrastructure holding the keys to iCloud in China?

peteretep 4 years ago | |

Exactly that. The tech seems fine, but I live in a country with a government that has strong censorship laws, and I do not trust Apple to not bend to countries like China in extending this to political content.

teekert 4 years ago |

I still think the biggest problem is that at some point a human is going to look at a false positive, this may be picture of my naked children and this human may not have the best intentions with my picture.

That said, Nextcloud is my backend and I do not upload anything to iCloud (except for MS authenticator 2fa backups), so I'm safe right?

nextlevelwizard 4 years ago |

How are any of these "naturally occurring" when all (4) examples are things cut out of context on a white background.

Yeah two sticks (ski and nail) are visually similar on a white background. Why is this news to anyone?

EDIT: if you are going to downvote please leave a comment unless you are just downvoting for wrong think.

verygoodname 4 years ago | |

As it is explained in the "readme" part, in this specific context, "naturally occurring" means that no one has purposefully manipulated any of the images to make them collide: that the images were already published and "out there" and happen to collide. In other words, it does not necessarily imply that the images correspond to natural photographic scenes (which seems to be your interpretation of it).

Besides, you could probably "naturally" obtain such type of colliding images by photographing similar-looking objects against a white (or generally featureless) background. Furthermore, it suggests/demonstrates that similar-looking images with similar backgrounds can lead to unexpected collisions in practice (i.e. "naturally"), even if you do not assume an adversarial scenario.

Are you sure that, if you take a picture of a naked body part, it won't collide with anything that looks similar in their database?

nextlevelwizard 4 years ago | | |

It is unlikely unless you manage to capture some position and happen to have some background. This whole thing is a nothingburger. This is one of those weird things were many people have baseless gut reactions and then try to go and prove if flawed even though they don't have a complete picture.

It is unlikely that there is a collision of benign image with the database and even if that happens it is not some automatic process that just sends cops to your house to raid it.

Of course we can get bunch of collitions with essentially same images, I don't get why this is so magical just squint your eyes and I'm sure you have two objects with in your reach that could be made to collide, but that isn't a gotcha on any level

Cyberdog 4 years ago |

Isn't a hash collision from similar images the point of the whole thing?

At any rate, IANAL, but I'm pretty sure you can't be convicted based on a hash alone. If you get busted for possession of a picture of a nematode and you can show the jury it's just a picture of an axe that has the same value when run through this algorithm, you'll be fine. And there's a decent chance prosecutors won't chase down individuals who will just have a single collision in their photo library with this tech in the first place - people who have dozens or hundreds will be much more interesting.

ya3r 4 years ago |

Technically speaking, this does not prove that an adversarial attack is possible on the CSAM system of apple, Given that apple has another not released neural hash system on their servers which is potentially larger and works better than the one on device.

The more interesting technical question for me is: do collisions transfer across models? or how to find collisions that transfer across models?

erdos4d 4 years ago |

Is it possible for the courts to use this system to search a defendant's phone for leaked documents say? Like if NSA learns that one of a small group leaked document X, can they get a court to force Apple to add the hash of Document X to the database on that group of people's phones? If so, I bet this becomes the new norm for investigating leaks.

tyingq 4 years ago |

I guess don't upload pictures of peaches, poppy buds, phallic cacti, and so on to iCloud.

JoshTko 4 years ago |

The threshold of collisions Apple is using before review is 40

dathinab 4 years ago | |

I think no one is anymore afraid of 40 accidental natural image collisions.

But un-natural image collisions or bad images in the database and similar are a different matter and had been the main critique point from the get to go as far as I can tell.

dathinab 4 years ago | | |

Also given how many people use IPhones, how many pictures they have and how often they have many similar pictures, thinks are not necessary that simple.

I wouldn't be surprised if some flat, small height fully adult (e.g. 30) woman does some sexting and goes from 0 to >40 collisions in a month. Not because of arbitrary collisions but because the similarity some of here sexting pictures might have with the ones from a 14y old but older looking girl (which e.g. where forced and ended up in the database).

programmer_dude 4 years ago |

Can this affect people who do not use Apple products?

nullc 4 years ago | |

As a non-apple user you could be impacted indirectly by people you know being directly impacted or by Apple's practices being imported into the law. E.g. laws that attempt to outlaw encryption lacking apple-like backdoors.

theshrike79 4 years ago | |

No, how would it?

programmer_dude 4 years ago | | |

Then why is it such a big deal on hackernews and elsewhere?

gok 4 years ago |

The "catalog" has two entries.

floor_ 4 years ago |

Why not combine this with a second different hash?

E: Better yet, only run the second hash if you have a collision, which should be very rare.

yeldarb 4 years ago | |

Apple does, and I created a proof of concept for how it might work to guard against adversarially perturbed images here: https://blog.roboflow.com/apples-csam-neuralhash-collision/

theshrike79 4 years ago | |

How do you know they aren't doing this on the backend after the initial on-device match?

Grustaf 4 years ago |

This is of course entertaining, but since Apple has already tested for this, with 100 million images, and adjusted the rules accordingly, it has no practical implications.

hhsbz 4 years ago |

Is it really a catalogue when there only are two of them?

I find it amusing that they probably ran this tool against a set of millions or even billions of images and this is the best they could come up with. They are practically praising Apple here lmao