[Mimedefang] Training SA when mail is not stored locally?

Michael Faurot mfaurot at atww.org
Wed Feb 4 14:21:56 EST 2004


In article <00c701c3eb50$1762d550$ceaa2799 at mcilink.com> you wrote:

> I'm running an MD/SA gateway for a customer where mail is scanned,
> tagged, and forwarded directly to their servers (nothing is stored
> locally), but I need to train SpamAssassin and beef up its bayes db.
> How do people typically gather ham and spam to train the box under these
> conditions?  Is it possible to do it without too much intervention on
> the customer's part? 

Yes, just use the bayes_auto_learn option in SA.  If there's a good amount
of traffic going through the box, it should build up a corpus fairly
quickly.  You may also want to tweak bayes_auto_learn_threshold_nonspam
and bayes_auto_learn_threshold_spam if you don't like the defaults.
I wound up leaving bayes_auto_learn_threshold_nonspam at its default
but adjusted bayes_auto_learn_threshold_spam to 8.0.



More information about the MIMEDefang mailing list