| license | datasets | language | library_name | metrics | base_model | tags |
|---|---|---|---|---|---|---|
| mit | | | cloud-agents | | | |
A lightweight and horizontally scalable distributed computing system for training large language models, specifically designed for OpenPeerLLM.
- Distributed tensor operations for model training
- CouchDB-based coordination layer
- Automatic agent discovery and load balancing
- Horizontal scaling capabilities
- Fault tolerance and recovery
- Integration with OpenPeerAI's OpenPeerLLM
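The discovery and load-balancing features listed above could work roughly as sketched below. This is a minimal in-memory illustration, not the project's implementation: `AgentRecord`, `Registry`, the heartbeat timeout, and the "fewest active jobs" policy are all assumptions made for the example, and a real deployment would presumably keep these records in CouchDB rather than in a Python dict.

```python
import time
from dataclasses import dataclass


@dataclass
class AgentRecord:
    """Hypothetical registration record an agent might publish."""
    agent_id: str
    last_heartbeat: float  # Unix timestamp of the most recent heartbeat
    active_jobs: int = 0


class Registry:
    """In-memory stand-in for the coordination layer (assumption, not the real API)."""

    def __init__(self, heartbeat_timeout=30.0):
        self.heartbeat_timeout = heartbeat_timeout
        self.agents = {}

    def heartbeat(self, agent_id, now=None):
        """Register a new agent or refresh an existing one (discovery)."""
        now = time.time() if now is None else now
        record = self.agents.get(agent_id)
        if record is None:
            self.agents[agent_id] = AgentRecord(agent_id, now)
        else:
            record.last_heartbeat = now

    def live_agents(self, now=None):
        """Agents whose last heartbeat is within the timeout (fault detection)."""
        now = time.time() if now is None else now
        return [
            agent for agent in self.agents.values()
            if now - agent.last_heartbeat <= self.heartbeat_timeout
        ]

    def assign(self, now=None):
        """Pick the live agent with the fewest active jobs (load balancing)."""
        live = self.live_agents(now)
        if not live:
            return None
        # Ties are broken by agent_id so the choice is deterministic.
        chosen = min(live, key=lambda agent: (agent.active_jobs, agent.agent_id))
        chosen.active_jobs += 1
        return chosen.agent_id
```

In this sketch an agent that stops sending heartbeats simply drops out of `live_agents`, so new jobs are routed to the remaining agents. Recovering work that was already assigned to a failed agent would need extra bookkeeping that is not shown here.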
pip install -r requirements.txt

- Set up CouchDB instance
- Copy `.env.example` to `.env` and configure your settings
- Start the coordinator node
- Launch agent nodes
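As a rough illustration of the configuration step, the sketch below reads `KEY=VALUE` pairs from a `.env` file using only the standard library. The setting names `COUCHDB_URL` and `COUCHDB_USER` are placeholders chosen for the example; the actual keys are whatever `.env.example` defines, and the project may load them differently.

```python
import os


def parse_env(text):
    """Parse KEY=VALUE lines, skipping blank lines, comments, and malformed lines."""
    settings = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first "=" only, so values such as URLs stay intact.
        key, _, value = line.partition("=")
        settings[key.strip()] = value.strip().strip('"').strip("'")
    return settings


def load_settings(path=".env", environ=None):
    """Load settings from a file; real environment variables override file values."""
    environ = os.environ if environ is None else environ
    settings = {}
    if os.path.exists(path):
        with open(path, encoding="utf-8") as handle:
            settings = parse_env(handle.read())
    for key in list(settings):
        if key in environ:
            settings[key] = environ[key]
    return settings


# Hypothetical file contents; 5984 is CouchDB's default port.
example = 'COUCHDB_URL=http://localhost:5984\nCOUCHDB_USER="admin"\n'
print(parse_env(example))
```

Letting environment variables override the file is a common convention that makes it easy to change one setting per machine without editing `.env`.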
# Start coordinator
python -m cloud_agents.coordinator
# Start agent (on each machine)
python -m cloud_agents.agent

- coordinator: Manages job distribution and agent coordination
- agent: Handles tensor operations and model training
- couchdb_client: Interface for CouchDB communication
- tensor_ops: Distributed tensor operations
- utils: Helper functions and utilities
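To make the division of labor between the coordinator and the agents concrete, here is a small sketch of one way a tensor operation can be distributed: the coordinator splits the rows of a matrix into chunks, each agent multiplies its chunk by a shared matrix, and the partial results are concatenated. The function names are hypothetical and do not reflect the actual `tensor_ops` API; plain Python lists stand in for tensors, and the "agents" run sequentially in one process.

```python
def split_rows(matrix, n_parts):
    """Split rows into n_parts contiguous chunks whose sizes differ by at most one."""
    base, extra = divmod(len(matrix), n_parts)
    chunks, start = [], 0
    for index in range(n_parts):
        size = base + (1 if index < extra else 0)
        chunks.append(matrix[start:start + size])
        start += size
    return chunks


def matmul(a, b):
    """Plain matrix product of two list-of-lists matrices."""
    columns = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in columns] for row in a]


def distributed_matmul(a, b, n_agents):
    """Simulate row-partitioned matmul: one chunk per agent, then concatenate."""
    # In a real deployment each chunk would be sent to a separate agent process.
    partials = [matmul(chunk, b) for chunk in split_rows(a, n_agents)]
    return [row for part in partials for row in part]


a = [[1, 2], [3, 4], [5, 6]]
b = [[1, 2], [3, 4]]
print(distributed_matmul(a, b, n_agents=2))  # [[7, 10], [15, 22], [23, 34]]
```

Row partitioning is used here because each output row depends on only one input row of `a`, so agents need no communication with each other until the results are gathered.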
MIT