Efficiency of new anti spam feature

Hello

some information can be found at the developer website.

I'm curious to see the amount of false positives and negatives and what time and count of words it needs to get stable.

Yes, me too. It depends on the HAM AND SPAM frequency of a forum. If you never train SPAM, all entries will classified as HAM. Training for both, detecting spam and ham, is the most important task. For that reason, do NOT classified spam posted by a Sockenpuppe. Restrict the training to SPAM written by bots.

I think, especially the different languages are a challenge for the script and the forum operators.

If the forum is operated in e.g. German language and spam entries are only in English, it will be quite easy to detect the spam (my opinion). So, I don't think, that one can give a more general answer to this topic.

often overlapping with the languages of the valid entries?

THAT is the challenge which is (hopefully) solved by Bayes statistics ;-)

How can we provide a dataset of training data for the forum operators (in the light of different languages), so they have not to start at the point 0?

This point is discussed in the B8 documentation. In the end, it make not sense to provide such a database because of the different languages. A Russia forum does not benefit from a German or English database.

/Micha

--
applied-geodesy.org - OpenSource Least-Squares Adjustment Software for Geodetic Sciences

Complete thread:

RSS Feed of thread

New anti spam features and their effectivity Auge 2019-01-22, 03:31
- New anti spam features and their effectivity Micha 2019-01-22, 08:04
  - New anti spam features and their effectivity Auge 2019-01-22, 08:21
    - New anti spam features and their effectivity Micha 2019-01-22, 08:37
      - New anti spam features and their effectivity Auge 2019-01-23, 07:32
        
        New anti spam features and their effectivity Micha 2019-01-23, 09:18
        
        New anti spam features and their effectivity Auge 2019-01-23, 10:05
        
        New anti spam features and their effectivity Micha 2019-01-23, 10:20
    - New anti spam features and their effectivity Auge 2019-02-03, 01:40
      - New anti spam features and their effectivity Micha 2019-02-03, 07:17
        
        New anti spam features and their effectivity Auge 2019-02-03, 08:12
        
        New anti spam features and their effectivity Micha 2019-02-03, 08:16
        
        New anti spam features and their effectivity Auge 2019-02-03, 08:49
        
        New anti spam features and their effectivity Auge 2019-02-04, 12:37
        
        New anti spam features and their effectivity Micha 2019-02-04, 02:26
- New anti spam features and their effectivity Magma 2019-01-23, 01:10
  - New anti spam features and their effectivity Auge 2019-01-23, 07:27
    - New anti spam features and their effectivity Magma 2019-02-02, 09:10
      - New anti spam features and their effectivity Auge 2019-02-03, 12:34
- New anti spam features and their efficiency, @Micha Auge 2019-02-11, 12:36
  - New anti spam features and their efficiency, @Micha Micha 2019-02-11, 12:50
    - New anti spam features and their efficiency, @Micha Auge 2019-02-11, 01:03
      - New anti spam features and their efficiency, @Micha Micha 2019-02-11, 01:10
        
        New anti spam features and their efficiency, @Micha Auge 2019-02-11, 01:20
        
        Efficiency of new anti spam feature Auge 2019-02-11, 02:31
        
        Efficiency of new anti spam feature Micha 2019-02-11, 03:05
        
        Efficiency of new anti spam feature Auge 2019-02-11, 03:24
        
        Efficiency of new anti spam feature Micha 2019-02-11, 03:46
        
        Efficiency of new anti spam feature Auge 2019-02-11, 05:35
        
        Efficiency of new anti spam feature Micha 2019-02-11, 06:07
        
        Efficiency of new anti spam feature Auge 2019-02-11, 06:26
        
        Efficiency of new anti spam feature Micha 2019-02-11, 06:36

my little forum

Efficiency of new anti spam feature (Technics)