Skip to content

Data Cleaning and Preprocessing Tool for Data Science, Data Analytics and Machine Learning Professionals

License

Notifications You must be signed in to change notification settings

ahmedkansulum/DataPrep-Pro

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

DataPrep Pro - Data Cleaning and Preprocessing Tool

Python License Version

A comprehensive application for data cleaning, preprocessing, feature engineering, and encoding - designed for data science students, data analysts, and machine learning professionals.

πŸš€ Features

  • Data Cleaning: Remove special characters, handle missing values, standardize formats
  • Feature Engineering: Create interaction features, polynomial features, datetime extraction
  • Encoding: Label encoding and one-hot encoding for categorical variables
  • File Support: CSV, Excel, JSON, Text files
  • Batch Processing: Process multiple files simultaneously
  • Smart Merging: Automatic merging based on common columns
  • Multiple Export Formats: CSV, Excel, JSON
  • Machine Learning Ready: Outputs cleaned data ready for ML models
  • Data Analytics: Prepares data for exploratory analysis and visualization

πŸ“¦ Installation

Option 1: Run from Source

# Clone the repository
git clone https://github.com/ahmedkansulum/DataPrep-Pro.git
cd DataPrep-Pro

# Install dependencies
pip install -r requirements.txt

# Run the application
python src/main.py

About

Data Cleaning and Preprocessing Tool for Data Science, Data Analytics and Machine Learning Professionals

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages