Title Of Paper:
Applying feature transformation using Relative Frequency with Power Transformation and Lemmatization in automatic Spam Filtering
Author's Name :  Augustine Malero
KeyWords:  Spam filtering, machine learning, TFIDF, RFPT, lemmatization.
Pages:  21 -27
Volume: 2
Issue: 10
Year: 2014

Advances in Information and communication technology have paved a way for electronic mail commonly referred as email to become the medium of communication. Over the recent years this medium has become the target of abuse through spamming. One of the approaches of combating spamming is the use of automatic spam filtering through machine learning. The conventional features in automatic spam filtering are Term Frequency with Inverse Document Frequency (TFIDF). In this paper, an alternative approach is presented with the use of Relative Frequency with Power Transformation (RFPT) coupled with lemmatization technique. The techniques used considerably show improvements over the conventional one that is TFIDF.

Full Text: