The funny rules of SpamAssassin in 2023

The funny rules of SpamAssassin in 2023(updown.io)

230 points by alexis2b 2 years ago | 90 comments

srmarm 2 years ago |

I've been using SpamAssassin for at least 15 years and it's sadly gotten less useful as the spam arms race has moved on. We regularly see people on here post about deliverability issues with Gmail/Outlook but the truth is that sender reputation is by far the biggest indicator of whether a message will be spam - these type of rules are just counting deckchairs on the titanic in comparison.

And this plays into the strengths of the big mail networks in detection. It's a bonus to them that every time they block a smaller host there is a good chance that sender will consider a move to office365 or Google Workspace for their mail.

As an aside, not sure if OP is related to them but updown.io is a nice service and I appreciate the simple PAYG pricing! For what it's worth their mails seem to get through successfully to me too.

Also for those facing mail delivery issues (or just practicing good email hygiene) - I recommend www.mail-tester.com - they give you an email address to send a mail to and carry out a heap of tests - including checking against SpamAssassin + blacklists, SPF/DNS/etc testing.

rlpb 2 years ago | |

> It's a bonus to them that every time they block a smaller host there is a good chance that sender will consider a move to office365 or Google Workspace for their mail.

The irony is that a substantial amount of the spam I receive comes from those platforms.

nradov 2 years ago | | |

Are you certain the spam is actually coming from IP addresses controlled by those platforms? It's common for spammers to fake the SMTP headers.

cbsmith 2 years ago | | |

Because of course, its an arms race.

BSDobelix 2 years ago | |

I like rspamd much more (performance and redis) than SpamAssassin, and as you mentioned:

-https://www.mail-tester.com

-https://www.learndmarc.com

-https://mecsa.jrc.ec.europa.eu/en/

Are exellent tool's to check your "deliverability".

CableNinja 2 years ago | | |

Suprised to not see https://mxtoolbox.com in this list too

op00to 2 years ago | |

I switched from GMail to a personal Microsoft 365 domain when Google decided they didn't want to give me free email/domain services anymore. 365 was cheaper. I got about 10x the amount of spam to my 365 Junk folder than I did to the Junk folder in GMail. I would spend 10 minutes a day going through the junk folder to pick out false positives. I woud have inexplicable issues with missing email with 365, where the root cause was always SPF issues from a third party sender. The big issue was event tickets mailed from a third party ticket service provider using the venue's domain name rather than the ticket provider's domain.

I switched back to GMail a few months ago, and not only do I see less stuff in my Junk folder (indicating Google is blocking stuff rather than identifying it) but also I have not seen a single false positive. Hopefully that means Google is more effective, but there's no way to tell if I'm missing legitimate email. So far, no complaints.

yabones 2 years ago | | |

Microsoft's spam filter is fundamentally broken. It's been that way for decades. There's an entire cottage industry of snakeoil salesmen that want to sell in-line antispam gateways to bolt onto 365, and the worst part is that they have a very good reason to exist...

fer 2 years ago | | |

Strange, while I keep my GMail address I don't use it for anything new anymore since roughly 50% of the positives are false (no false negatives, though).

alexis2b 2 years ago | |

> As an aside, not sure if OP is related to them but updown.io is a nice service and I appreciate the simple PAYG pricing! For what it's worth their mails seem to get through successfully to me too.

Not related in any way except as an happy customer. They added a blog recently and this article caught my eye because of the nightmare that is mail delivery issue for everyone.

I found it particularly ironic that you now have to think like a spammer (i.e. look at spam detection engine source code to find a way to circumvent their heuristics) in order to get your totally valid email delivered (^_^).

edit: typo

adrienjarthon 2 years ago | | |

Thank you

andrewfromx 2 years ago | |

there needs to be like a mozillia vs chrome thing here no? What's the best try so far for something like letsencrypt or mozilla foundation for not owned by big tech email so "will consider a move to office365 or Google Workspace for their mail" the sender has this other awesome option?

jeffbee 2 years ago | | |

If you wanted to operate a haven for independent email hosting, where you want to assure deliverability in the face of Gmail's sender reputation system, you would need to classify your outbound traffic, and have a death penalty for spammers. If you tolerate any activity that peers classify as spam, that would tank your reputation.

jmyeet 2 years ago | |

> ... the truth is that sender reputation is by far the biggest indicator of whether a message will be spam

I couldn't agree with this more. I want people to remember this whenever the topic of decentralization or federation comes up. People see this as a technical problem. it's not. It's a political and organizational problem. Even with email, which is fully decentralized (other than the ICANN TLDs) running your own node still incredibly difficult. And those reasons aren't technical at all.

JohnFen 2 years ago | |

I've kinda given up on reputation scores to indicate spam/ham, personally, and rely more heavily on textual analysis rules. Going by "reputation" caused me far too many false positives.

Retric 2 years ago | |

Reputation works well because of those other rules. If every office365/gmail email got through and everything lose was blocked spammers would just move to those platforms. Thus email inspection is a critical component enabling reputation based filtering.

uean 2 years ago |

I love the analysis. But I hate that the 'fixed' email ends up being wordier for no reason at all.

Brevity has value. Having to bloat content (an email to get past anti-spam; a cooking blog to rank better within Google SEO; ...) brings back memories of high-school english papers, or the modern equivalent ChatGPT.

adrienjarthon 2 years ago | |

100% agree, I also hate that I had to do this.

chrismorgan 2 years ago | | |

Another piece of feedback: the link doesn’t look like a link any more. It wasn’t great before, but the verbiage made it adequately clear. But now it’s terrible, because the wording doesn’t suggest an action, and it doesn’t look like a link or a button. You should either restore its underline and lean into “link”, or give a background colour or (generally better) gradient and lean into “button”. But when it’s just a border, it doesn’t look like a button, especially when there’s a tick after it. And change the wording again.

layer8 2 years ago | | |

Couldn’t you add some “hidden” text instead, e.g. white on white or display:none?

matthews2 2 years ago |

Putting your outbound emails through SpamAssassin as part of a regression test sounds like a really good idea - would have never thought of doing that myself!

londons_explore 2 years ago |

Having the rules public seems to take away most of the benefits...

Any smart spammer will just tweak his spam to not hit these rules... And if he hasn't, it's because the vast majority of people don't use SpamAssassin

creeble 2 years ago |

I tried using SpamAssassin (via Proxmox Mail Gateway, which makes it much easier to set up) to replace a Barracuda email appliance (it was destined to get a *6x* service price increase in 2024!), and after several months of trying to get the number of FPs down, I gave up.

The problem wasn't just the number of FPs (which were much higher than the 'Cuda) -- it was that they came from real people, who were often common senders. This is not corporate email, or anything that was even remotely spam (except as SA's crazy ruleset determined). These all required whitelisting, and it became a real chore for all my users to keep up with all the whitelisting.

So back to the Barracuda for another year. It lets a little more spam through, but virtually no FPs. I just couldn't make SA get the same performance, even with many tweaks to the weights and rulesets.

linsomniac 2 years ago |

It's been a very long time since I ran a mail server, but for a decade or more I pumped all our outgoing mail through Hashcash because it gave a good boost to the Spam Assassin score. We'd crank it through the largest one, and it would add ~60sec to the mail delivery, unless we had a bunch of outgoing mail, but it was worth it I felt.

jwr 2 years ago |

I've been using SpamAssassin since, well, forever, in internet terms. My recent facepalm moment was when I noticed that E-mails from the Playdate developer forum (Playdate is a really cool tiny gaming console) land in my spam folder, because anything in the .date domain (and the forum uses play.date as the domain) is assumed to be "dating spam".

layer8 2 years ago | |

Given the double meaning of “play date”, it’s not surprising that it would cause a higher score, even if it used a different TLD.

loloquwowndueo 2 years ago | |

I have deny listed most of those new funny tld as it’s indeed a good indication of spam. Here the face palm should be playdate’s because they have realized their domain looks like a spam domain.

JohnFen 2 years ago | | |

Yes, I do the same. It's very useful to me because there is no non-generic TLD that I would be getting legitimate email from, but it may not work well for people who do want to get emails from such TLDs.

bschne 2 years ago |

I couldn't help but think about mechanistic interpretability research on large neural models reading this — I guess this is what happens when humans do something similar, adding and removing tweaks here and there to better fit this or that case, over a long period of time.

jaimex2 2 years ago |

I won the war a while ago now.

I basically trash all emails not in my contact lists. Easy.

TwoNineFive 2 years ago |

Spamassassin is doing it's job here, and doing a good job!

Most spammers and marketing/sales sleezoids never think they are doing anything wrong. They are totally empathy incapable. Or they know they are scum and don't care. Either way.

OP talks about adding "invisible text" and other such common spammer tactics to get around some of the rules. Zero self-awareness.

At no point did this person ever think "did I do something wrong?". No, it's that shitty Spamassassin!

lloydatkinson 2 years ago |

Some of those highlighted rules, such as using CC or having the string “can help” being used to decide if something is spam or not is so absurd I’ll make sure to never use SpamAssasin.