A content-based movie recommendation system using TF-IDF Vectorization and Cosine Similarity to suggest similar movies based on user input.
Content-Based Filtering – Recommends movies based on similar genres, keywords, and cast.
TF-IDF Vectorization – Converts movie descriptions into numerical feature vectors.
Cosine Similarity – Computes similarity scores between movies.
User Input Matching – Uses fuzzy matching to handle spelling errors in movie searches.
git clone https://github.com/yourusername/movie-recommendation.git
cd movie-recommendationpython -m venv venv
source venv/bin/activate # For macOS/Linux
venv\Scripts\activate # For Windowspip install -r requirements.txt- The dataset is stored in movies.csv.
- Ensure that the
movies.csvfile is placed in thedata/folder. - If the dataset is missing, download it from Kaggle or use a sample dataset.
python recommendation.pyEnter your favorite movie: Inception
Movies suggested for you:
1. Interstellar
2. The Prestige
3. Memento
...
After generating recommendations, the following metrics are computed:
- Precision@k
- Recall@k
- F1 Score@k
- Mean Reciprocal Rank (MRR)
The metrics are calculated by comparing the recommended movies against a set of relevant movies or a ground truth for a group of users.
🔹 Deploy as a Flask/Streamlit Web App
🔹 Add Collaborative Filtering for hybrid recommendations
🔹 Improve search accuracy using Word Embeddings (BERT, Word2Vec)
🔹 Implement GPU Acceleration for large-scale similarity computations
This project is licensed under the MIT License - feel free to use, modify, and share!
Your Name: Sudiksha Rajavaram