Linux security mailing list 'almost unmanageable'

Linux security mailing list 'almost unmanageable'(theregister.com)

190 points by jonbaer 10 hours ago | 90 comments

l1k 9 hours ago |

Fun fact (or not so fun if you're a subscriber):

Somebody is spamming kernel mailing lists under the name Marian Corcodel with a 26 MByte message multiple times per day containing a collection of nonsensical patches. Looks AI-generated, perhaps with the intention to poison LLMs. This has been going on for a few days now.

https://lore.kernel.org/all/CAGg4U=GNtCObd_Nbm_1Rr5FEvPb69Yz...

probably_wrong 9 hours ago | |

I'd warn HN users not to click on that link simply because it will load a 26Mb message that will likely cause quite a strain on kernel.org's servers if everyone here does it.

sillysaurusx 7 hours ago | | |

I was curious how much of an impact HN could have. Napkin math:

HN gets 24M views a day. Assume those views are evenly distributed across the front page (they aren’t), and that’s about 1M views for each front page post, assuming each user clicks on one post.

By the rule of 10s (also not exact), there are 10x less views on comment threads. So assume around 100k views on a comment thread as a theoretical average.

If everyone in this thread clicked on the link, that would be 2.6 TB of transfer across the day. But by the rule of 10’s we have to assume 10x fewer people will interact (upvote, click, anything) than view. So we’re down to 260GB transfer over the course of a day.

I wonder how close that is. It seems plausible that a link in the top comment of a thread could garner 10,000 clicks.

That’s still about one click every 8 seconds, which at 10Mbit/s would indeed overwhelm the server by a factor of about 2.5x. But I clicked through and it loaded in just a few seconds, so presumably the pipe is faster than 10Mbit/s.

Another caveat is that many websites are already several megabytes, so it seems strange that 26Mb would be the breaking point for a reasonable web host.

jedberg 2 hours ago | | |

It's mirrored by Akamai, which is designed to repeatedly serve the same thing over and over. It won't really hurt anyone.

jmalicki 8 hours ago | | |

Does a 26MB message actually cause noticeable strain on the server much beyond loading the page? I would think serving a contiguous 26MB chunk would be relatively similar to say 20 normal sized messages.

leonidasrup 9 hours ago | | |

https://web.archive.org/web/20260518134447/https://lore.kern...

neksn 5 hours ago | | |

The page is gzipped in transit - only 5 MB of traffic are generated.

shevy-java 8 hours ago | | |

Thank you for the warning. I rarely click on links these days though; only exception I make for HN links for main articles.

Phelinofist 6 hours ago | |

> perhaps with the intention to poison LLMs

How does that work?

stefan_ 6 hours ago | | |

This is just nonsensical changes and slurs, but particularly degenerate input data can cause big issues in training:

https://x.com/gabriberton/status/2051873677998956851

st_goliath 9 hours ago |

Here's the actual mailing list post: https://lore.kernel.org/lkml/CAHk-=wi+JvcuKF2NaD_rGiYrwkR6rx...

Actual context: Linux 7.1-rc4 release, Linus remarked on a specific documentation change.

The Register somehow turned this into an "article" that says a lot less with roughly the same number of words, and provides "context" by linking to a number of unrelated articles.

throawayonthe 8 hours ago | |

here is what seems to be the relevant documentation: https://docs.kernel.org/process/security-bugs.html

see "If you resorted to AI assistance to identify a bug, you must treat it as public." and https://docs.kernel.org/process/security-bugs.html#responsib...

Sebguer 4 hours ago | |

The Register has always been a... weird 'news' source, but they've gotten significantly worse over the last year or two.

Sweepi 10 hours ago |

"Torvalds' remarks contrast with recent comments from fellow kernel maintainer Greg Kroah-Hartman, who recently told The Register that AI has become an increasingly useful tool for the FOSS community."

Does it? Both points can be true at the same time.

ses1984 10 hours ago | |

Linus also said

“AI tools are great, but only if they actually help, rather than cause unnecessary pain and pointless make-believe work,” he wrote. “Feel free to use them, but use them in a way that is productive and makes for a better experience.”

So I think the closing remark from the register isn’t really appropriate given the context from the quotes they pulled.

moezd 8 hours ago |

I think it's time the report-only intake should stop. If a reporter can't reproduce at least one use case or can't summarise it in two sentences, it should be classified as spam. LLMs write beautiful reports, it's just that sometimes it doesn't bear anything resembling the truth.

nashadelic 8 hours ago | |

couldn't an llm be used for verification like we're seeing some OSS projects do? Some projects are moving so fast, its almost certain there's little human involvement.

Tempest1981 7 hours ago | | |

At my job, multiple people have vibe-coded bug-triage utilities. They're great for grouping duplicates.

But now we need an AI tool to consolidate the triage utilities.

trelbutate 8 hours ago |

Will never understand why some people prefer mailing lists to do development, it always feels like the most convoluted way to hold a discussion, especially if there are multiple topics at the same time.

It probably doesn't really change that much in this scenario but with a forum or any other topics-based platform you can at least just close and ignore these things without it affecting everyone else.

rnxrx 8 hours ago |

It seems like LLMs are actually pretty good at the sorts of things needed to manage a high-volume mailing list (summarizing, looking for dupes, sentiment, flagging things, etc), even if only as augmentation for human eyes.

That said, I get why this would rankle a lot of the folks involved.

rolandog 7 hours ago | |

That's just a security/protection racket with extra steps: "Someone is paying us to hurt your business/site; pay us money to defend your site against our attacks".

olive-n 9 hours ago |

I like to imagine that LLM's ability to optimize code is like an extension of the training-loop in deep learning. The loss function is some kind of metric representing security and/or performance (or the lack of it) of the code and we use the LLM as the gradient/diff generator to iterate in batches over the code and fine tune it.

Imagine the current state being for the most part a collection of local maxima in security. To push the system in a more optimal state, you either need skilled people and time to overcome the barrier to a new local maximum or you throw AI at it and evaluate whether you land in a more optimal state.

I think after some time of turbulent exploit/patch cycles we will reach a stable state again, where the code converges against a new local minimum that even with AI requires significant effort (time and tokens) to overcome. Or ideally a global maximum.

With time, the LLMs improve, so the diffs/gradients get better and we will be able to reach optimal points for any software faster.

My problem with the idea is that apparently it is assumed that OSS contributors and especially maintainers will generously donate their time to get this machinery into a state that makes the optimization loop work well - just for the AI labs to turn around and sell access to the optimized models for increasingly larger amounts of money.

AI generated code can be great. Hand rolled code can be bad. The rules are the same in both cases. Make sure your code changes are focused (no random changes just because you happen to be in the file/dir or notice something) and make sure you don't break anything else along the way.

oncallthrow 7 hours ago |

I think this will sort itself out over time, as people realise that it’s no longer impressive whatsoever to land an AI-assisted PR to the Linux kernel.

VLM 6 hours ago |

Make it anonymous and the problem will go away.

The problem is people trying to get individual credit for merely running a script that spams a mailing list. Many of those people are likely not even C programmers or programmers at all.

Without the immense personal reward and recognition and job offers as a motivation, the problem will disappear.

The problem will also disappear with time as the people lauding and celebrating and hiring security researchers of the past will quickly abandon LLM generated spam as a positive signal; running a prompt that sends spam is, if anything, a strong negative indicator of infosec ability and skill.

LLMs are a tool. Like all tools, most people can't or won't use them responsibly or profitably although they are useful in the correct hands.

thewebguyd 5 hours ago | |

I really like this idea. Removes the fame, blog & resume/job hunting incentive from it.

The kernel isn't the only OSS project with this issue either. Requiring submissions & issues to be anonymous could help a ton of other open source projects currently drowning in AI slop issues.

NoSalt 8 hours ago |

So ... who, exactly, is AI supposed to be "helping"???

pavon 5 hours ago | |

The bug reports are helpful! Many Linux developers including Linus, Greg Kroah-Hartman, Andrew Morton, Chris Mason, and Willy Tarreau have all commented positively on all the legitimate problems that are being found with LLM. Here is just one example article[1].

This is just a workflow issue. In the past it was very rare for multiple people to find and report a security vulnerability at once, so it made sense to keep the discussion private until they were ready to release a fix. With AI that is happening all the time, so it makes more sense for the discussion to be in public to avoid duplication. So they changed the policy accordingly. That is it.

[1]https://lwn.net/Articles/1066581/

nottorp 7 hours ago | |

The "security researches" who post those bugs. Their goal being self promotion.

newswasboring 8 hours ago |

> Torvalds' remarks contrast with recent comments from fellow kernel maintainer Greg Kroah-Hartman, who recently told The Register that AI has become an increasingly useful tool for the FOSS community

Thats kinda a misrepresentation. They are talking about two different things. Linus is trying to point out incorrect use of a tool while GKH is praising a correct use. This sentence felt weird at the end of the article, kind like rage bait. And I took it :P.

stabbles 9 hours ago |

Isn't it mostly the medium that's problematic? With an issue tracker it's easier to close as duplicate

mixxit 3 hours ago |

I feel like the ability to speed up finding bugs will exceed your ability to fix them and/or review ai PRs

It will almost likely find issues that require fundamental design changes to even fix some of these

Dangerous waters ahead for data security and vital infrastructure

827a 2 hours ago |

I hate to be "that guy" but there's a reason why most of the industry stopped using mailing lists for things like this. Extremely impressive that Linux lasted this long.

perching_aix 6 hours ago |

Nonsense advice, he's just asking for duplicate slop patches too this way.

It's a catch 22. Why not make a separate list for AI generated reports that can be subscribed to instead? If the claim is that these are not private anyhow, no reason not to, and then a reasonable expectation could be held against submitters to check against existing reports.

That is unless it is still absolutely sensitive, in which case the only way forward that I see is to start using AI for triaging and duplicate detection as well.

quotemstr 8 hours ago |

Maybe it's time to require public zero-knowledge proofs of a working exploits before privately-delivered exploit details can be considered.

shevy-java 8 hours ago |

So ... first, AI slop is killing mankind slowly. Skynet is winning here.

On the other hand ... IF the bug report is real, and let's assume that AI slop reports at the least a few bugs that are indeed real, then I really think it should not make a difference WHO or WHAT reports these bugs. I would not disagree on fake bugs or bogus bug reports wasting time of humans, but this is a quality difference then. Surely people can tweak AI models to be better at finding bugs too. Besides, they should auto-fix that. Is AI still too stupid to fully replace humans? Other than killing them with spam, as it does right now.

new_account_100 9 hours ago |

AI (read: LLM technology) is the most powerful spam weapon ever invented.

kirtivr 7 hours ago |

I'd really like maintainers to get their hands dirty with AI agents as well to help speed up the reviews.

Over the last year there have been way too many stories and Twitter posts like these.

Yes, maintainers are overloaded, but that's only because we haven't yet built the tools to support them.

Other than such statements, I would, as a builder like to hear the sorts of tools and requirements maintainers are looking for which would make their work easier!

We need to move fast without breaking things.

sockaddr 7 hours ago | |

I'm a huge AI advocate but even I can't get on board with this.

Feel free to fork the kernel and maintain your own vibe-coded disaster.

dathinab 7 hours ago | | |

I'm confused by your answer, the previous post doesn't seem to be about vibe-coding at all.

It seems to be more about:

1. auto grouping duplicate security reports

2. auto validating if they are likely viable or likely nonsense

3. auto checking if they have recently been patched

4. auto assessing if they likely "invalide" for other reasons (e.g. they are for a very old long time no longer maintained Linux version, out of tree drivers, etc.)

I mean practically all of that isn't trivial to get working in a way appropriate for the Linux security mailing list and comes with many not so obvious complications. But also non of that is vibe coding and in most cases this is is more about AI doing a per-assemsment of send security issues to speed up the review of them, then it is about the AI doing the final decision.

wang_li 4 hours ago | |

The world doesn't need to support the projects and research areas that interest you. How about we do something better: No one is allowed to say or write anything about AI or AI generated slop until AI is 100% perfect and produces zero errors and does everything with perfect efficiency.

AI trash like this is like showing up to a baseball game with a pitching machine and demanding that they let you join in and be the pitcher using your machine. Just because your slop cannon is fun and exciting to you doesn't make anyone else obligated to join your club just because you fired your slop on them.