[ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Before you can begin to filter spam based on statistics, you must create these statistics based on two mail collections, one with spam, one with non-spam. These statistics are then stored in a dictionary for later use. In order for these statistics to be meaningful, you need several hundred emails in both collections.
Gnus currently supports only the nnml back end for automated dictionary creation. The nnml back end stores all mails in a directory, one file per mail. Use the following:
Usually you would call spam-stat-process-spam-directory
on a
directory such as `~/Mail/mail/spam' (this usually corresponds
the the group `nnml:mail.spam'), and you would call
spam-stat-process-non-spam-directory
on a directory such as
`~/Mail/mail/misc' (this usually corresponds the the group
`nnml:mail.misc').
When you are using IMAP, you won't have the mails available
locally, so that will not work. One solution is to use the Gnus Agent
to cache the articles. Then you can use directories such as
`"~/News/agent/nnimap/mail.yourisp.com/personal_spam"' for
spam-stat-process-spam-directory
. See section 6.8.5 Agent as Cache.
If you want to regenerate the statistics from scratch, you need to reset the dictionary.
spam-stat
hash-table, deleting all the statistics.
When you are done, you must save the dictionary. The dictionary may be rather large. If you will not update the dictionary incrementally (instead, you will recreate it once a month, for example), then you can reduce the size of the dictionary by deleting all words that did not appear often enough or that do not clearly belong to only spam or only non-spam mails.
[ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |