Short insight into the spam filtering with the new B8-filter (General)

by Auge ⌂, Friday, May 13, 2022, 11:42 (702 days ago)


Since 2022-05-08 this forum run with a version with an additional local spam recognition system, named B8. It's a Bayes filter that should classify content as spam or ham (no spam). The filter is based on the work of the 18th century mathematician Thomas Bayes.

Here, in this forum, we use not only this mehtod but also Bad Behavior and the black lists that work both locally as well as the external services, the forum script integrates optionally.

From sunday evening (2022-05-08, circa 20:00 UTC) til now (with the transient outage of the forum) eight entries got classified as spam. None of them was a false positive (a ham entry, that was erroneously classfied as spam). All affected entries was hidden from the forum visitors and the only task for a team member (in this case it was me) was, to manually check every single entry and to confirm (or deny) that it was really spam.

That way the filter learns, that "he" was right what will improve the detection in prospective. It's a bit of regulary work to start with but over the time the filter will get better and better and need less regulary attention.

Nice one!

Tschö, Auge

Trenne niemals Müll, denn er hat nur eine Silbe!

Complete thread:

 RSS Feed of thread