Short insight into the spam filtering with the new B8-filter (General)

by Auge ⌂, Sunday, May 15, 2022, 11:15 (136 days ago) @ Micha


… flagging ham …

I ask, because I am a bit confused by the two options to "report and flag as ham" or to only "flag as ham"? What's the difference?

If you click to "report and flag as ham", the word list of the postings is stored to the database, and the related HAM counter of each word is increased, cf.

mlf2_b8_wordlist --> `token`, `count_ham`, `count_spam`

Using this table one can evaluate whether a single word (the token) occurs more often in SPAM or HAM entries. Since a single word does not allow for a statistically firm decision, the probability of the whole word list is evaluated.

The option "flag as ham" means, that the posting is flagged as HAM but the word list is not used to train/improve the filter (not stored to mlf2_b8_wordlist).

Thank you for this insight.

Tschö, Auge

Trenne niemals Müll, denn er hat nur eine Silbe!

Complete thread:

 RSS Feed of thread