Real-time cryptocurrency price prediction platform powered by machine learning
Features • How It Works • Tech Stack • User Guide • Technical Docs
CryptoPred is an end-to-end machine learning platform for cryptocurrency price prediction. It processes real-time market data, computes technical indicators, analyzes social sentiment, and generates price forecasts using state-of-the-art ML models.
Built for traders, researchers, and fintech teams who need reliable, explainable predictions at scale.
Stream trades from major exchanges with sub-second latency. Automatic aggregation into OHLCV candles at multiple timeframes.
40+ technical indicators computed in real-time: moving averages, RSI, MACD, Bollinger Bands, volume profiles, and more.
LunarCrush integration for social media sentiment scores, galaxy scores, and market correlation metrics.
Production-ready LightGBM and ensemble models with automatic hyperparameter optimization and model versioning.
Automatic detection of data distribution shifts and model degradation with alerting.
Grafana dashboards for monitoring data pipelines, model performance, and prediction quality.
┌─────────────┐ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐
│ Exchanges │───▶│ Kafka │───▶│ RisingWave │───▶│ MLflow │
│ (Binance) │ │ (Streams) │ │ (Features) │ │ (Models) │
└─────────────┘ └─────────────┘ └─────────────┘ └─────────────┘
│
┌─────────────┐ ┌─────────────┐ ┌─────────────┐ │
│ LunarCrush │───▶│ Sentiment │───▶│ Combined │◀─────────┘
│ (API) │ │ Features │ │ Predictions │
└─────────────┘ └─────────────┘ └─────────────┘
- Data Ingestion — Real-time trade streams from Binance WebSocket API
- Feature Engineering — Technical indicators and sentiment features computed via streaming SQL
- Model Training — Periodic retraining with Optuna hyperparameter search
- Prediction — Continuous inference with confidence intervals
- Monitoring — Drift detection and performance tracking
| Component | Technology |
|---|---|
| Language | Python 3.13 |
| ML Framework | LightGBM, scikit-learn, Optuna |
| Streaming | Apache Kafka (Strimzi) |
| Feature Store | RisingWave (streaming SQL) |
| MLOps | MLflow |
| Orchestration | Kubernetes |
| Monitoring | Grafana, Prometheus |
| Data Source | Binance API, LunarCrush API |
| Exchange | Pairs | Timeframes |
|---|---|---|
| Binance | BTC/USDT, ETH/USDT, SOL/USDT | 1m, 5m, 15m, 1h |
Additional pairs can be configured via environment variables.
| Metric | Value |
|---|---|
| RMSE (5-min horizon) | < 0.5% |
| Direction Accuracy | > 55% |
| Retraining Frequency | Daily |
| Inference Latency | < 100ms |
Metrics based on backtesting. Past performance does not guarantee future results.
- Streaming-first — All data processing uses Apache Kafka and RisingWave streaming SQL
- Cloud-native — Kubernetes-native with Helm charts for production deployment
- MLOps-ready — MLflow integration for experiment tracking and model registry
- Observable — Prometheus metrics, Grafana dashboards, structured logging
| Document | Description |
|---|---|
| User Guide | Setup, configuration, and daily operations |
| Technical Documentation | Architecture, API reference, development guide |
See ROADMAP.md for the detailed development plan including:
- On-chain analytics & derivatives data
- Deep learning models (LSTM, Transformer, TFT)
- Uncertainty quantification & conformal prediction
- Backtesting framework with strategy simulation
- Trading signal generation & risk management
- REST/WebSocket API for predictions
- Web dashboard & alerting integrations
This project is proprietary software. All rights reserved.
Built with Python, Kafka, RisingWave, LightGBM, and MLflow