[Mimedefang] Image validator/OCR SA plugin
Martin Blapp
mb at imp.ch
Fri Apr 14 15:27:22 EDT 2006
> Interesting... What's the performance like with this? How many messages
> do you scan per day with it?
It is rather fast. On a 3 GHz Pentium 4 I can scan an average jpg/gif picture in
0.2 to 0.3 seconds.
I've limited the scan time to 5 seconds per image, and I allow only three images
to be scanned per mail. Of course this is user configurable.
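The two limits described above (a time budget per image and a cap on images
per mail) can be sketched as follows. This is only an illustration in Python,
not the plugin's code: the function name scan_images, the constants and the
stand-in scanner command ECHO_SCANNER are all hypothetical, and a real setup
would call whatever OCR or image-check program is actually installed.

```python
import subprocess
import sys

MAX_IMAGES = 3      # assumed default: at most three images scanned per mail
MAX_SECONDS = 5.0   # assumed default: give up on an image after five seconds

# Hypothetical stand-in for a real scanner: it just prints the file name.
ECHO_SCANNER = [sys.executable, "-c", "import sys; print(sys.argv[1])"]


def scan_images(paths, scan_cmd, max_images=MAX_IMAGES, max_seconds=MAX_SECONDS):
    """Run scan_cmd on at most max_images paths, each with its own time limit.

    Returns a dict mapping each scanned path to the scanner's stdout,
    or to None when the scanner exceeded max_seconds and was killed.
    """
    results = {}
    for path in paths[:max_images]:          # images beyond the cap are skipped
        try:
            proc = subprocess.run(
                [*scan_cmd, path],
                capture_output=True,
                text=True,
                timeout=max_seconds,         # per-image limit, not per mail
            )
            results[path] = proc.stdout
        except subprocess.TimeoutExpired:
            results[path] = None             # treat a timeout as "no result"
    return results
```

With these assumed defaults the worst case per mail is bounded at roughly
3 x 5 = 15 seconds of scanning, however many images the message carries.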
The grep counts below cover only today's log so far, not a full day.
grep hits= /var/log/maillog | wc -l
78050
grep "X-Spam-Status: Yes" /var/log/maillog | wc -l
48400
grep hits=.*SPAMPIC /var/log/maillog | wc -l
9572
grep "X-Spam-Status: Yes.*hits=.*SPAMPIC" /var/log/maillog | wc -l
9558
grep "X-Spam-Status: Yes.*hits=.*SPAMPIC" /var/log/maillog | grep HTML_IMAGE_ONLY | wc -l
9528
grep HTML_IMAGE_ONLY /var/log/maillog | wc -l
35834
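As a quick check, the ratios implied by these counts can be worked out with a
few lines of Python. The numbers are copied from the grep output above; the
dictionary keys and the helper pct are just illustrative names.

```python
# Line counts copied from the grep output above.
counts = {
    "scanned": 78050,               # lines with hits=
    "spam": 48400,                  # X-Spam-Status: Yes
    "spampic": 9572,                # hits=...SPAMPIC
    "spam_and_spampic": 9558,       # spam that also hit SPAMPIC
    "image_only": 35834,            # lines mentioning HTML_IMAGE_ONLY
}


def pct(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


spam_share = pct(counts["spam"], counts["scanned"])
spampic_share_of_spam = pct(counts["spam_and_spampic"], counts["spam"])
spampic_share_of_all = pct(counts["spampic"], counts["scanned"])
spampic_precision = pct(counts["spam_and_spampic"], counts["spampic"])
image_only_share = pct(counts["image_only"], counts["scanned"])

print(spam_share, spampic_share_of_spam, spampic_share_of_all,
      spampic_precision, image_only_share)
```

This gives about 62% spam overall, SPAMPIC on about 12% of all mail and about
20% of the spam, about 99.9% of SPAMPIC hits ending up tagged as spam, and
HTML_IMAGE_ONLY on about 46% of the scanned lines.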
This means about 62% of all the mail we get is spam (48400 of 78050). More
than 10% of the spam, in fact close to 20% (9558 of 48400), consists of gif
and jpg pictures advertising stocks and meds.
But about 46% of all mail (35834 of 78050) matches HTML_IMAGE_ONLY, so that
rule by itself is practically unusable. I even use lower scores for those
rules now, which gives me fewer FPs (false positives):
score HTML_IMAGE_ONLY_04 1.400
score HTML_IMAGE_ONLY_08 1.300
score HTML_IMAGE_ONLY_12 1.200
score HTML_IMAGE_ONLY_16 1.100
score HTML_IMAGE_ONLY_20 0.950
score HTML_IMAGE_ONLY_24 0.900
score HTML_IMAGE_ONLY_28 0.700
score HTML_IMAGE_ONLY_32 0.400
Martin