[Mimedefang] Training SA when mail is not stored locally?

Kelsey Cummings kgc at sonic.net
Wed Feb 4 15:54:55 EST 2004


On Wed, Feb 04, 2004 at 02:21:56PM -0500, Michael Faurot wrote:
> In article <00c701c3eb50$1762d550$ceaa2799 at mcilink.com> you wrote:
> 
> > I'm running an MD/SA gateway for a customer where mail is scanned,
> > tagged, and forwarded directly to their servers (nothing is stored
> > locally), but I need to train SpamAssassin and beef up its bayes db.
> > How do people typically gather ham and spam to train the box under these
> > conditions?  Is it possible to do it without too much intervention on
> > the customer's part? 
> 
> Yes, just use the bayes_auto_learn option in SA.  If there's a good amount
> of traffic going through the box, it should build up a corpus fairly
> quickly.  You may also want to tweak bayes_auto_learn_threshold_nonspam
> and bayes_auto_learn_threshold_spam if you don't like the defaults.
> I wound up leaving bayes_auto_learn_threshold_nonspam at its default
> but adjusted bayes_auto_learn_threshold_spam to 8.0.
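
As a minimal sketch of the settings quoted above, assuming they go in the
site-wide local.cf (the file location and the explicit use_bayes and
bayes_auto_learn lines are assumptions; only the 8.0 value comes from the
quoted message):

    # Enable the Bayes classifier and let SA train it from scanned mail
    use_bayes 1
    bayes_auto_learn 1

    # Spam auto-learn threshold set to 8.0, as in the quoted message
    bayes_auto_learn_threshold_spam 8.0

    # bayes_auto_learn_threshold_nonspam is left unset so that the
    # default for the installed SA version applies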

You may also want to look at the following bug in SA's bugzilla.  We
maintain patches against 2.61 right now and expect to have patches for
2.63 and later.  We're working with the devs to get the system merged
into the dist tree.

http://bugzilla.spamassassin.org/show_bug.cgi?id=2167

If you have any questions, I'll be happy to answer them off-list.

-- 
Kelsey Cummings - kgc at sonic.net           sonic.net, inc.
System Administrator                      2260 Apollo Way
707.522.1000 (Voice)                      Santa Rosa, CA 95407
707.547.2199 (Fax)                        http://www.sonic.net/
Fingerprint = D5F9 667F 5D32 7347 0B79  8DB7 2B42 86B6 4E2C 3896


