Hi Phil,

Thanks for this tip, but what about the efficiency of the Bayes database
after this operation?

I was thinking that the more this file can "remember", the more efficient
the Bayes filtering is... within the limits of a reasonable file size, of
course!

As Richard said, "with the sa-learn --force-expire" ... "it deletes a lot of
entries", but the file size still remains the same.

Is there a way to export the real records of the file before deleting it and
then re-import them? Should we use something similar to the
check_whitelist and trim_whitelist tools?

-----Original Message-----
From: Randal, Phil [mailtorandal@herefordshire.gov.uk]
Sent: Tuesday, 12 June 2007 09:37
To: Richard Smits; users@spamassassin.apache.org
Subject: RE: How to decrease the bayes database size

bayes_seen just grows like Topsy. All you need to do is delete it and let SA
recreate it.

Stop spamd / MailScanner / whatever.

check permissions on bayes_seen

rm bayes_seen

restart

do an sa-learn to make sure it still works (if it doesn't, reset permissions
on the newly created bayes_seen).
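
The file-handling part of the steps above can be sketched in Python. This is
only a sketch under stated assumptions: the function name reset_bayes_seen and
the bayes_seen.bak backup name are made up for illustration, the Bayes
directory has to be supplied by you, and stopping and restarting spamd /
MailScanner is left out because it depends on the system. By default it
renames the file instead of doing a plain rm, so the old file can be put back;
that is an extra precaution, not part of the steps above.

```python
import os
import stat
from pathlib import Path


def reset_bayes_seen(bayes_dir, keep_backup=True):
    """Move (or delete) bayes_seen so SpamAssassin can recreate it.

    Returns the permission bits of the old file, or None if there was
    no bayes_seen to remove. Run this only while spamd / MailScanner
    (or whatever calls SpamAssassin) is stopped.
    """
    seen = Path(bayes_dir) / "bayes_seen"
    if not seen.exists():
        return None

    # "check permissions on bayes_seen": remember the current mode so it
    # can be re-applied to the recreated file if needed.
    mode = stat.S_IMODE(seen.stat().st_mode)

    if keep_backup:
        # Hypothetical safety net: keep the old file under another name.
        os.replace(seen, seen.with_name("bayes_seen.bak"))
    else:
        # Equivalent of "rm bayes_seen".
        seen.unlink()
    return mode
```

After restarting and running sa-learn, the returned mode can be passed to
os.chmod() on the newly created bayes_seen if its permissions turn out wrong.
Note that keeping the backup does not free any disk space until the .bak file
is deleted as well.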

Cheers,

Phil
--
Phil Randal
Network Engineer
Herefordshire Council
Hereford, UK

> -----Original Message-----
> From: Richard Smits [mailto:R.Smits@tudelft.nl]
> Sent: 12 June 2007 08:30
> To: users@spamassassin.apache.org
> Subject: How to decrease the bayes database size
>
> Hello,
>
> We really need some help here. It has come to our attention that our
> bayes database is 2.4 GB big. It is really slowing down our servers
> and they have a big CPU load.
>
> Now we have tried the trick with sa-learn --force-expire, and it
> deletes a lot of entries, but the file is not getting any smaller.
>
> 79K Jun 12 09:26 bayes_journal
> 20M Jun 12 09:26 bayes_toks
> 2.5G Jun 12 09:26 bayes_seen*
>
> Does anyone have any tricks to help us out?
>
> Greetings... Richard Smits
>
> ----
> 0.000          0          3          0  non-token data: bayes db version
> 0.000          0   14201082          0  non-token data: nspam
> 0.000          0    7760360          0  non-token data: nham
> 0.000          0     916962          0  non-token data: ntokens
> 0.000          0 1181559955          0  non-token data: oldest atime
> 0.000          0 1181633069          0  non-token data: newest atime
> 0.000          0 1181633115          0  non-token data: last journal sync atime
> 0.000          0 1181604237          0  non-token data: last expiry atime
> 0.000          0      43200          0  non-token data: last expire atime delta
> 0.000          0     360013          0  non-token data: last expire reduction count
>
> ----------------------
>
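
The atime figures in the dump above are easier to read as dates. A minimal
sketch, assuming (as is usual for this dump) that the atime values are seconds
since the Unix epoch and that the atime delta is a number of seconds; the
helper name to_utc is made up for illustration.

```python
from datetime import datetime, timezone

# Values copied from the dump above.
oldest_atime = 1181559955
newest_atime = 1181633069
last_expire_delta = 43200


def to_utc(ts):
    """Convert epoch seconds to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(ts, tz=timezone.utc)


# Time span between the oldest and newest token atime, in hours.
span_hours = (newest_atime - oldest_atime) / 3600

print("oldest atime:", to_utc(oldest_atime).isoformat())
print("newest atime:", to_utc(newest_atime).isoformat())
print("span covered: %.1f hours" % span_hours)
print("last expire atime delta: %.0f hours" % (last_expire_delta / 3600))
```

Under those assumptions the newest atime falls on 12 June 2007 at 07:24:29
UTC, which matches the 09:26 local timestamps in the file listing, and the
tokens span about 20.3 hours.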