Skip to content

deBurglar/SMS-Spam-Predictor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“© SMS Spam Predictor

A machine learning-based web application that classifies SMS messages as Spam or Ham (Not Spam) using NLP and classification algorithms.

The dataset is a highly imbalanced one with the percentage of HAM(non-spam)messages being 85%.

Spam Prediction Banner


πŸš€ Features

  • βœ… Predicts whether a given message is spam or not
  • βœ… Clean and intuitive UI
  • βœ… Trained on the popular SMS Spam Collection Dataset
  • βœ… Visualization of word frequencies and message length distributions

🧠 Model Used

  • Algorithm: Naive Bayes / Support Vector Machine (choose whichever you used)
  • Libraries: scikit-learn, pandas, numpy, matplotlib, seaborn
  • Vectorization: CountVectorizer / TF-IDF

πŸ“Š Exploratory Data Analysis

Message Length Distribution

  • Spam messages tend to be longer and include promotional words.
  • Word clouds and bar plots were used to identify top keywords in spam vs ham.

πŸ§ͺ How to Run Locally

  1. Clone the repo
    git clone https://github.com/deBurglar/SMS-Spam-Predictor.git
    cd SMS-Spam-Predictor

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors