MorphousAI is a simple and intuitive AI application that lets you upload images, generate images from text, and convert text to audio — all powered by OpenAI APIs.
This project showcases how you can build a multi-modal AI tool using OpenAI capabilities to handle image generation, text-to-image, and text-to-speech workflows. GitHub
✔️ Upload an image and generate outputs
✔️ Convert user-entered text into another image
✔️ Convert text to natural-sounding audio
✔️ Uses OpenAI APIs for generative models
✔️ Easy to extend & customize
This project combines several AI modalities:
-
Image Upload Module
- Allows users to upload an image from their device.
-
Text-to-Image Conversion
- Uses OpenAI’s image generation APIs to create images from prompts.
-
Text-to-Audio Conversion
- Uses OpenAI’s text-to-speech APIs to synthesize speech from text.
Developers can extend or modify this to include additional AI workflows like speech-to-text, image captioning, or more advanced multi-modal interactions.
📌 Java (primary codebase)
📌 Spring AI to integrate with OpenAI (for image & audio generation)
📌 Vaadin - Web frontend for uploads & interaction
Before running the project locally, make sure you have:
- Docker Running
- An OpenAI API Key (sign up at https://platform.openai.com/)
- Clone the repository
git clone https://github.com/ankitagrahari/MorphousAI.git
cd MorphousAI- Install dependencies
mvn clean install- Configure your environment
Create a .env file in the root:
OPENAI_API_KEY=<your_openai_api_key_here>4. Start the app
mvn spring-boot:run -Dspring-boot.run.arguments="--server.port=9090"Your app should now be running locally.
⸻
📌 Usage
✏️ Text to Image
1. Enter a text prompt.
2. The app sends the prompt to the OpenAI image API.
3. View the generated image in the UI.
🔉 Text to Audio
1. Enter text you want to hear.
2. The app uses OpenAI’s text-to-speech API.
3. Download/play the generated audio.
📷 Image Upload
1. Upload an image from your device.
2. Extend this module to perform image captioning, text extraction, or further generative steps.
⸻
🧪 Example Screenshots
⸻
🧠 How to Contribute
Contributions are welcome! You can help by: • Adding new AI features (speech-to-text, image captioning) • Improving UI/UX • Writing tests or bug fixes • Updating documentation
To contribute:
git fork https://github.com/ankitagrahari/MorphousAI git clone <your_forked_repo> git checkout -b feature/my-awesome-feature
Submit a pull request and describe your changes.
⸻
📄 License
Distributed under the MIT License. See LICENSE for more information.
⸻
❤️ Acknowledgements
Thanks to: • OpenAI for powerful generative models 🌟 • ✨ The developer community for inspiration
⸻
📞 Contact
If you have questions, feel free to reach out:
Ankit Agrahari – 💬 GitHub: https://github.com/ankitagrahari 📎 Email: (ankitagrahari.rkgit@gmail.com)
⸻
🎉 Happy building with MorphousAI!