This repository contains a Python script for building and training a neural network model to predict hydrophobicity values based on amino acid sequences. The model is implemented using TensorFlow and Keras.
Hydrophobicity is an essential property of biomolecules, and predicting it can provide insights into the structure and function of proteins. This program utilizes neural networks to predict hydrophobicity values from amino acid sequences.
- Python 3.x
- TensorFlow
- pandas
- numpy
- matplotlib
Install the required packages using:
pip install -r requirements.txtThe dataset used for training the model is stored in the file HumanPlasma2023-04.csv. It contains amino acid sequences and corresponding hydrophobicity values.
- Clone the repository:
git clone https://github.com/sohamazing/peptide.git
cd peptide- Install dependencies:
pip install -r requirements.txt- Run the script:
python3 hydrophobicity.pyThe neural network model consists of multiple dense layers with dropout and regularization for hydrophobicity prediction. The architecture details can be found in the script hydrophobicity.py.
The model is trained and evaluated on a dataset split into training, validation, and test sets. The evaluation metrics include Mean Squared Error (MSE) and Mean Absolute Error (MAE), providing insights into the model's performance.
The script generates a plot illustrating the training and validation loss curves over epochs. This visualization helps in assessing the training progress and potential overfitting.