Bayesian Spam Filtering Article Index for
Bayesian
Shopping
Spam
Website Links For
Bayesian
 

Information About

Bayesian Spam Filtering




Bayesian filtering was proposed by Sahami et al. (1998)1 and gained attention in 2002 when it was described in a paper by Paul Graham .2 Since then it has become a popular mechanism to distinguish illegitimate Spam Email from legitimate Email (sometimes called ''ham''). Many modern mail programs implement Bayesian spam filtering. Users can also install separate Email Filtering Programs . Server-side email filters, such as SpamAssassin , SpamBayes , Bogofilter and ASSP , make use of Bayesian spam filtering techniques, and the functionality is sometimes embedded within Mail Server software itself.

Bayesian Poisoning is a technique used by spammers in an attempt to degrade the effectiveness of spam filters that rely on Bayesian filtering. A spammer practicing Bayesian poisoning will send out emails with large amounts of legitimate text (gathered from legitimate news or literary sources).


MATHEMATICAL FOUNDATION


Bayesian Email Filter s take advantage of Bayes' Theorem . Bayes' theorem, in the context of spam, says that the probability that an email is spam, given that it has certain words in it, is equal to the probability of finding those certain words in spam email, times the probability that any email is spam, divided by the probability of finding those words in any email: