Live Application: https://protein-hmm-app-production.up.railway.app π
A sophisticated web application for analyzing protein secondary structure and understanding structure-function relationships using advanced computational biology algorithms.
Transform any protein sequence into detailed structural and functional insights in seconds! This tool bridges the gap between protein sequence and biological function by:
- π¬ Predicting 8-state secondary structure using the enhanced Chou-Fasman method
- π― Identifying functional regions and their biological roles
- π§ͺ Analyzing binding potential and active sites
- π Providing flexibility analysis for protein dynamics
- ποΈ Detecting structural motifs like helix-turn-helix and beta hairpins
- π Educational content about structure-function relationships
- 8-state DSSP classification: H (Ξ±-helix), G (3ββ-helix), I (Ο-helix), E (Ξ²-strand), B (Ξ²-bridge), T (turn), S (bend), C (coil)
- Enhanced Chou-Fasman algorithm with contextual analysis
- 60-65% prediction accuracy with confidence scoring
- Color-coded visualization for easy interpretation
- Functional region identification with biological explanations
- Binding site prediction based on flexibility and structural context
- Motif detection for common functional patterns
- Flexibility profiling showing rigid vs. dynamic regions
- Interactive sequence visualization with structure mapping
- Statistical composition analysis (helical, extended, coil content)
- Detailed residue-by-residue breakdown
- Professional scientific presentation
- Single sequence analysis with instant results
- Batch processing for multiple sequences (up to 50)
- Example proteins (insulin, lysozyme, myoglobin)
- Educational modules explaining the science
- Mobile-responsive design
- Visit: https://protein-hmm-app-production.up.railway.app
- Paste your protein sequence (amino acids only: ACDEFGHIKLMNPQRSTVWY)
- Click "Analyze Structure & Function"
- Explore the results!
- Raw sequence:
MKWVTFISLLLLFSSAYSRGVFR... - FASTA format (for batch analysis):
>Protein1 MKWVTFISLLLLFSSAYSRGVFR >Protein2 RDTHKSEIAHRFKDLGEEHFKG
Try these example proteins to see the tool in action:
π― Insulin (Hormone)
GIVEQCCTSICSLYQLENYCN
Small hormone with predominantly helical structure
π¦ Lysozyme (Enzyme)
KVFGRCELAAAMKRHGLDNYRGYSLGNWVCAAKFESNFNTQATNRNTDGSTDYGILQINSRWWCNDGRTPGSRNLCNIPCSALLSSDITASVNCAKKIVSDGNGMNAWVAWRNRCKGTDVQAWIRGCRL
Antimicrobial enzyme with mixed Ξ±/Ξ² structure
π« Myoglobin (Oxygen Storage)
VLSEGEWQLVLHVWAKVEADVAGHGQDILIRLFKSHPETLEKFDRFKHLKTEAEMKASEDLKKHGVTVLTALGAILKKKGHHEAELKPLAQSHATKHKIPIKYLEFISEAIIHVLHSRHPGNFGADAQGAMNKALELFRKDIAAKYKELGYQG
All-alpha oxygen storage protein
Our enhanced implementation of the classic Chou-Fasman algorithm uses:
- Amino acid propensities for different secondary structures
- Local context analysis with sliding windows
- Post-processing rules for realistic structure assignment
- 8-state DSSP mapping for detailed classification
The tool identifies how structure drives function through:
- Helical regions: Often involved in stability and binding
- Ξ²-strands: Critical for enzyme active sites and interfaces
- Flexible loops: Frequently contain regulatory and binding sites
- Turns and bends: Important for protein dynamics and allosteric changes
Understanding protein structure-function relationships is crucial for:
- Drug design: Identifying binding sites and conformational changes
- Protein engineering: Modifying function through structural changes
- Disease research: Understanding how mutations affect protein function
- Evolutionary biology: Analyzing conservation of functional elements
- π΄ Red: Ξ±-helix (H), 3ββ-helix (G), Ο-helix (I)
- π΅ Blue: Ξ²-strand (E), Ξ²-bridge (B)
- π’ Green: Turn (T), Bend (S)
- β« Gray: Coil/Loop (C)
- Helical regions: Structural stability, protein-protein interactions
- Extended regions: Enzyme active sites, Ξ²-sheet formation
- Flexible regions: Binding sites, regulatory elements, allosteric sites
- Turn regions: Conformational flexibility, surface loops
- >80%: High confidence prediction
- 60-80%: Good confidence prediction
- <60%: Lower confidence, consider experimental validation
- Backend: Python 3.11+ with Flask
- Frontend: Modern HTML5, CSS3, JavaScript
- Algorithms: Enhanced Chou-Fasman method
- Deployment: Railway platform
- Architecture: RESTful API design
POST /api/predict_structure_function
POST /api/analyze_batch
GET /api/get_educational_content
flask>=2.3.0
flask-cors>=4.0.0
numpy>=1.24.0This application is deployed on Railway with automatic deployments from GitHub.
- Fork this repository
- Connect to Railway: https://railway.app
- Deploy from GitHub
- Your app goes live automatically!
git clone https://github.com/LDolanLDolan/protein-hmm-app.git
cd protein-hmm-app
pip install -r requirements.txt
python app.py- π§ Machine Learning Models: Deep learning for higher accuracy
- 𧬠3D Structure Visualization: Interactive molecular graphics
- π Advanced Analytics: Phylogenetic analysis, domain architecture
- π Real-time Collaboration: Share and discuss results
- π± Mobile App: Native iOS/Android applications
- Training on PDB data: Real experimental structures
- Ensemble methods: Combining multiple prediction algorithms
- Homology modeling: 3D structure prediction
- Functional annotation: GO terms and pathway analysis
Perfect for:
- Bioinformatics courses: Hands-on protein analysis
- Structural biology classes: Understanding secondary structure
- Research training: Learning computational biology tools
- Self-study: Interactive learning about proteins
We welcome contributions! Areas for improvement:
- Algorithm enhancements: Better prediction methods
- UI/UX improvements: Enhanced user experience
- Educational content: More learning modules
- Performance optimization: Faster analysis
- Testing: Comprehensive test coverage
This project is open source and available under the MIT License.
LDolanLDolan - Initial work and development
- Chou & Fasman: Original secondary structure prediction method
- DSSP: Defining secondary structure classification
- Railway: Excellent deployment platform
- Open source community: Tools and libraries that made this possible
- π Live App: https://protein-hmm-app-production.up.railway.app
- π Issues: GitHub Issues
- π¬ Discussions: GitHub Discussions
Try it now: https://protein-hmm-app-production.up.railway.app π
Transforming protein sequences into biological insights, one analysis at a time. π§¬β¨