This section introduces the dataset used in this project, which comes from Apache SpamAssassin.
Apache SpamAssassin describes itself as the #1 open-source anti-spam platform, giving system administrators a filter to classify email and block spam (unsolicited bulk email).
It uses a robust scoring framework and plug-ins to integrate a wide range of advanced heuristic and statistical analysis tests on email headers and body text, including text analysis, Bayesian filtering, DNS blocklists, and collaborative filtering databases.
Apache SpamAssassin is a project of the Apache Software Foundation (ASF). You can find out more about it at the link below:
https://spamassassin.apache.org/
The dataset we will be using is hosted at the link below:
http://spamassassin.apache.org/old/publiccorpus/
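For readers who want to fetch the data programmatically, the following is a minimal sketch in Python, not the project's own loading code. It assumes the corpus is distributed as tar archives under the directory above. The helper names (corpus_url, fetch_archive, extract_archive), the "datasets" folder, and the archive name in the usage note are all hypothetical; pick real archive names from the directory listing.

```python
import tarfile
import urllib.request
from pathlib import Path
from urllib.parse import urljoin

# Directory given above; everything else in this sketch is an assumption.
CORPUS_ROOT = "http://spamassassin.apache.org/old/publiccorpus/"


def corpus_url(archive_name: str) -> str:
    """Build the full URL of one archive inside the corpus directory."""
    return urljoin(CORPUS_ROOT, archive_name)


def fetch_archive(archive_name: str, dest_dir: str = "datasets") -> Path:
    """Download one archive into dest_dir, skipping files already present."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    target = dest / archive_name
    if not target.exists():
        urllib.request.urlretrieve(corpus_url(archive_name), target)
    return target


def extract_archive(archive_path, dest_dir: str = "datasets") -> None:
    """Unpack a tar archive; tarfile detects the compression automatically."""
    with tarfile.open(archive_path) as tar:
        tar.extractall(path=dest_dir)
```

With a real archive name from the listing in place of the placeholder, a call such as extract_archive(fetch_archive("example.tar.bz2")) would download the file once and unpack it into the datasets folder.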
Let us begin!