[Mimedefang] Missed Spam

David F. Skoll dfs at roaringpenguin.com
Tue Aug 12 09:31:01 EDT 2003


On Tue, 12 Aug 2003, Will McCorkle wrote:

>  SPAM: Content analysis details:   (0.80 hits, 20 required)

Spammers often use things like <!fooer> to break up words because the
Perl HTML::Parser module does not parse them very well!  Such constructs
are handled completely differently from normal HTML tags.

You may want to try a body test matching:

	/<!(?!(--|doctype))/i

and add some points for that.  It penalizes "<!anything", unless
anything starts with "--" (an HTML comment) or "doctype" (often found
in HTML).
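
As a rough sketch only, assuming the rule goes into a SpamAssassin
local.cf: the rule name LOCAL_BANG_TAG and the score of 1.0 below are
made-up placeholders, not tested values.  I use "rawbody" here on the
assumption that plain "body" rules are matched after HTML tags have
been stripped, so they would never see the "<!" at all.

	# Hypothetical rule name and score; tune against your own mail.
	rawbody  LOCAL_BANG_TAG  /<!(?!(--|doctype))/i
	describe LOCAL_BANG_TAG  Has "<!..." that is not a comment or doctype
	score    LOCAL_BANG_TAG  1.0

With something like this, "V<!fooer>iagra" would hit the rule, while
"<!-- comment -->" and "<!DOCTYPE html>" would not.  Start with a low
score and watch for false positives before raising it.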

Regards,

David.


