ADMIN: The plague of ill-defined content filters...

Matti Aarnio (matti.aarnio@zmailer.org)
Thu, 7 Jun 2001 13:45:12 +0300


[ If you want to comment about this topic, respect the Reply-To ! ]

The last week or so has seen a profileration of email bounces during
the mail delivery, or "DATA" phase.

I can somewhat sympathise the goals of the filters, but some are quite
unfathomable of how they decide what exact content is undesirable, and
what isn't...

The most common ill-defined filters seem to essentially grep for a sequence
of characters anywhere in the message body without paying attention to such
small details as if that character sequence forms a word, or is part of some
other word.

Example (obfuscated against stupid grep):

the l i n k to ...

It was matched against:

i n k

Right, definitely the same thing, isn't it ?

Better filters look for longer character sequences, possibly even detecting
word separators.

At the end of the page:
http://vger.kernel.org/majordomo-info.html
you can see how VGER filters through-going traffic -- "Taboo things"...
That thing works so well, that rarely anything truly spammish goes thru.
Senders of such messages don't get any notification at all.
(In case the message is e.g. too large, they get a word that it has been
diverted to Approval.)

Best filters give parametrizable amounts of points to found matches, some
may be harmless enough to exist elsewere, while others are dead give-aways
of spam. When the message accumulates high enough treshold value of such
points, it can fairly certainly be classified as spam.

Best sites let at least system postmaster receive anything without it
being filtered -- I have seen cases where big corporate input filter says
in its reject message that "do contact 'filtops@...' if you t h i n k
something is wrong...", and when I send the exact message there, it bounces...

So, if you all the sudden loose your subscription, it could be just
because your ISP decided that a 3 character sequence anywhere in the
message body is a sign of spam.

/Matti Aarnio co-postmaster of vger.kernel.org
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/