Police in England and Wales told to halt AI use in court statements (ft.com)

82 points by nmstoker 2 hours ago | 26 comments

> [...] he had intervened at forces that were deploying commercially available AI tools before they had been properly assessed [...] “All forces have got a good policy on the use of Copilot,” Murray said. “All forces will have a policy that says, ‘Check everything that it produces’.”

Not only are they using AI tools before they've properly assessed them, they also end up using Copilot, which must be one of the worst AIs currently available, probably because of existing Microsoft relations. And on top of all that, they hope to be able to rely on "Please review the outputs", which obviously isn't an actual solution here. Of course people will get complacent and throw stuff over the wall whenever they can.

Aurornis 1 hour ago

> “All forces will have a policy that says, ‘Check everything that it produces’.”

Everyone I talk to (including outside of tech) is going through this phase at their companies. It’s not working.

Checking the output seems like a simple request, but the question becomes: Check against what? If the police are making a document that draws on another report, which another officer used AI to produce from their notes, which were also run through AI, and so on, then an inconsistency that leaks in at a previous step will check out when someone reviews the output against the inputs.

We’re all also discovering that many people’s idea of reviewing the output is to skim it and verify that it looks convincing enough. Checking facts is hard and takes time. These people are using AI because they want to work less, not to give themselves extra work.

prymitive 1 hour ago

One can ask, what is the practical difference between “Check everything that it produces” and “Do all the work yourself”?

It’s not typing that’s the bottleneck, at least not often, so this is essentially assuming that you can do all the needed work without actually doing it, which is obviously wishful thinking.

kerabatsos 1 hour ago

The mindset must be that if you use AI (which I happen to advocate for) you are also responsible for the output, if you use the output publicly. AI is obviously very powerful if used responsibly - the human is responsible for it once it is used - however it’s used.

techblueberry 1 hour ago

I think the problem is that this is, practically speaking, close to impossible. I think, generally speaking, writing is way easier than editing, especially at scale. This isn’t binary or all or nothing; it’s not like “you can never use AI”. But I think we need to go back to augmentation over generation.

A person produces the content, and AI removes barriers and contextually accelerates the process, keeping you in a flow state, rather than AI generating and a human editing.

analog31 31 minutes ago

Something happening in the US right now is that the "presumption of regularity" is being openly challenged by judges. To the best of my understanding, it's the presumption that the testimony of the police is truthful until proven otherwise.

I think "check everything that it produces" will ultimately have to happen in cross examination on the witness stand. "Did you use AI" will be the first question.

bluefirebrand 1 hour ago

> on top of all that, they hope to be able to rely on "Please review the outputs" which obviously isn't an actual solution here, of course people will get complacent and throw stuff over the wall whenever they can.

This is honestly the fundamental problem of AI as I see it

When we offload our work to a different person we can calibrate our expectations to our past experiences with that person. With AI the experience is not very consistent. To use AI effectively you basically should treat it as a low trust, brand new coworker every single time you use it

That doesn't really scale, so people have two choices: be constantly hyper vigilant for mistakes the AI makes, or become complacent and trust it more than they should

People rightly point out that humans make mistakes too, not just AI. But humans have a pretty manageable cap on the amount of output they can produce. One human can pretty thoroughly review the outputs of a small team of other humans

One human can't possibly thoroughly review the volume of output that an LLM they are prompting can produce

gdulli 1 hour ago

Yeah, it's like declaring self driving safe because people are told to remain alert with their hands on the wheel, ready to take over in an instant. It's a charade.

skydhash 5 minutes ago

That’s why most people say LLMs don’t work. It’s not that it can’t produce a good output once in a while, it’s that you can’t guarantee it. Or reduce the risks of a bad output. It’s a chaotic element and the cost of being alert enough to ensure consistency (if it’s feasible at all) is higher than just doing without.

But AI proponents are happier to brandish carefully curated anecdotes than to do a systematic study of risks and impacts.

delichon 1 hour ago

We can get ambitious and try to head toward a form of statement more probative than even an officer personally typing a report: Have them narrate the facts of the event and the reasons for their decisions as soon as possible after the incident, as a video. Additions and corrections made later would be separate annotations. Where text is needed, auto-transcribe.

Courts prefer to have live witness testimony for a good reason. Detectives prefer to have statements made with the events as fresh as possible for a good reason. At the same time an oral report can save time and labor. Where we can take police or witness testimony verbally, more promptly, and with less work, and including body language, we should.

And video is more AI-tamper-evident than text.

techblueberry 1 hour ago

I feel like this is where AI like -

Are we thinking about how we’re using it, or???

It seems like there are two kinds of data that might go into this: boilerplate and subjective information. Subjective information should be input by the police, because I would assert the specific wording matters. It matters that the words used to describe what the policeman saw come out of the policeman’s brain. If it’s boilerplate, is AI really more reliable than copy-paste?

echelon_musk 1 hour ago

At nearly £500 a year is an FT subscription worth it? Am I going to get invaluable stock tips that will cover the sub?!

tgv 1 hour ago

I never thought AI would be the fork in the road to Idiocracy. Can you believe that the people whose evidence and testimony in court means so much, value The Great Hallucinator over hand work? They give a few nice sounding options for using AI ("checking child porn"), but it of course won't end there. They already started. People are so fucking lazy.

dylan604 55 minutes ago

It's funny to me how shocked people seem to be at the realization of just how fucking lazy people are. Sure, there are definitely differences between actual improvement through technology and sheer laziness. Movie tropes like Idiocracy, or even animated ones like Wall-E, weren't far off either. It's just so much easier to be lazy. The number of people that do not go down these sci-fi trope timelines will be pretty small, to the point of just being the weird oddballs that everyone else wishes would just shut up already.

tgv 1 minute ago

What's kept us going in the past then? Was it really just poverty?