
Short insight into the spam filtering with the new B8-filter (General)
Hello,
With flagging ham you mean all entries, that are actually not flagged as spam?
Yes.
I ask, because I am a bit confused by the two options to "report and flag as ham" or to only "flag as ham"? What's the difference?
Take a look to the basic equation. The filter needs both, HAM training data and SPAM training data. If you only flag SPAM postings, it is nothing else then a black list. However, the filter evaluates the words in an entry and estimates the probability that these words are often used in SPAM (or HAM) postings.
So, in my opinion, it is not a good choice to use Akismet in parallel with B8, if you like to train the filter.
If you click to "report and flag as ham", the word list of the postings is stored to the database, and the related HAM counter of each word is increased, cf.
mlf2_b8_wordlist --> `token`, `count_ham`, `count_spam`
Using this table one can evaluate whether a single word (the token) occurs more often in SPAM or HAM entries. Since a single word does not allow for a statistically firm decision, the probability of the whole word list is evaluated.
The option "flag as ham" means, that the posting is flagged as HAM but the word list is not used to train/improve the filter (not stored to mlf2_b8_wordlist).
/Micha
--
applied-geodesy.org - OpenSource Least-Squares Adjustment Software for Geodetic Sciences
Complete thread:
- Release thread for version branch 2.5 -
Auge,
2019-06-01, 21:10
- Release of version 2.4.99.1, testing release -
Auge,
2019-06-01, 21:57
- Version 2.4.99.1, updatable but not installable - Auge, 2019-06-04, 11:15
- Release of version 2.4.99.3, testing release, with EDIT -
Auge,
2019-09-24, 20:29
- Release of version 2.4.99.3, testing release -
Micha,
2019-09-25, 08:50
- Release notes for 2.4.99.2 amended - Auge, 2019-09-25, 10:46
- Release of version 2.4.99.3, testing release -
Micha,
2019-09-25, 08:50
- First release in the version branch 2.5, 20220508.1 🚀 -
Auge,
2022-05-08, 20:16
- A few hints about the upgrade of this forum to 20220508.1 - Auge, 2022-05-09, 10:01
- There is a little but breaking bug in version 20220508.1 😒 - Auge, 2022-05-09, 13:00
- Release of version 20220509.1 - Auge, 2022-05-09, 19:37
- Next day, next bug with 20220508.1 and 20220509.1 - Auge, 2022-05-10, 17:04
- Short insight into the spam filtering with the new B8-filter -
Auge,
2022-05-13, 11:42
- Short insight into the spam filtering with the new B8-filter -
Micha,
2022-05-13, 11:47
- Short insight into the spam filtering with the new B8-filter -
Auge,
2022-05-13, 13:42
- Short insight into the spam filtering with the new B8-filter -
Micha,
2022-05-14, 07:40
- Short insight into the spam filtering with the new B8-filter - Auge, 2022-05-15, 11:15
- Short insight into the spam filtering with the new B8-filter -
Micha,
2022-05-14, 07:40
- Short insight into the spam filtering with the new B8-filter -
Auge,
2022-05-13, 13:42
- Short insight into the spam filtering with the new B8-filter -
Micha,
2022-05-13, 11:47
- Release of version 20220517.1 - Auge, 2022-05-17, 20:22
- Release of version 2.4.99.1, testing release -
Auge,
2019-06-01, 21:57