language-icon Old Web
English
Sign In

Web Spam Hunting @ Budapest

2008 
We use a combination, in the expected order of their strength, of the following classificators: SVM over tf.idf, an augmented set of the public statistical spam features, graph stacking and text classification by latent Dirichlet allocation and compression, the latter two only used in our second submission.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    8
    References
    3
    Citations
    NaN
    KQI
    []