In Natural Language Processing (NLP), text preprocessing is typically one of the first steps in building a model.
Common text preprocessing steps include:
- Tokenization
- Lowercasing
- Stop word removal
- Stemming
- Lemmatization
I used the NLTK Python library to implement these text preprocessing tasks. It can be installed with pip:
pip install nltk
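To give a feel for how these steps chain together, here is a minimal sketch of a pipeline covering tokenization, lowercasing, stop word removal, and stemming. It is only an illustration under a few assumptions: the `preprocess` function and the tiny `STOP_WORDS` set are hypothetical names I made up for this example, and I use `wordpunct_tokenize` and `PorterStemmer` because they work without downloading any extra NLTK data. Lemmatization is left out here because `WordNetLemmatizer` needs the WordNet corpus (`nltk.download("wordnet")`).

```python
from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize

# Hand-picked stop words for illustration only (an assumption);
# NLTK's full list is available after nltk.download("stopwords").
STOP_WORDS = {"the", "are", "over", "a", "an", "is", "of"}


def preprocess(text):
    """Hypothetical helper: tokenize, lowercase, filter, and stem a string."""
    tokens = wordpunct_tokenize(text)                    # tokenization
    tokens = [t.lower() for t in tokens]                 # lowercasing
    tokens = [t for t in tokens if t.isalpha()]          # drop punctuation tokens
    tokens = [t for t in tokens if t not in STOP_WORDS]  # stop word removal
    stemmer = PorterStemmer()
    return [stemmer.stem(t) for t in tokens]             # stemming


result = preprocess("The cats are running over the fences.")
print(result)
```

In a real project the stop word list would more likely come from `nltk.corpus.stopwords`, and `word_tokenize` (which needs the Punkt tokenizer data) is a common alternative for tokenization.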