Helps upload image, convert text to image and text to audio using OpenAi

ankitagrahari/MorphousAI

MorphousAI

MorphousAI is a simple and intuitive AI application that lets you upload images, generate images from text, and convert text to audio — all powered by OpenAI APIs.

This project showcases how you can build a multi-modal AI tool using OpenAI capabilities to handle image upload, text-to-image generation, and text-to-speech workflows.


🚀 Features

✔️ Upload an image from your device
✔️ Generate an image from user-entered text
✔️ Convert text to natural-sounding audio
✔️ Uses OpenAI APIs for generative models
✔️ Easy to extend & customize


🧠 How It Works

This project combines several AI modalities:

  1. Image Upload Module
    • Allows users to upload an image from their device.
  2. Text-to-Image Conversion
    • Uses OpenAI’s image generation APIs to create images from prompts.
  3. Text-to-Audio Conversion
    • Uses OpenAI’s text-to-speech APIs to synthesize speech from text.

Developers can extend or modify this to include additional AI workflows like speech-to-text, image captioning, or more advanced multi-modal interactions.


🛠️ Tech Stack

📌 Java (primary codebase)
📌 Spring AI to integrate with OpenAI (for image & audio generation)
📌 Vaadin - Web frontend for uploads & interaction

🔧 Prerequisites

Before running the project locally, make sure you have:

• A Java JDK and Maven installed (the setup steps use mvn)
• An OpenAI API key


📝 Setup & Installation

  1. Clone the repository
git clone https://github.com/ankitagrahari/MorphousAI.git
cd MorphousAI
  2. Install dependencies
mvn clean install
  3. Configure your environment
Create a .env file in the project root:

OPENAI_API_KEY=<your_openai_api_key_here>

⚠️ Make sure not to commit your .env file or API key to GitHub.

  4. Start the app
mvn spring-boot:run -Dspring-boot.run.arguments="--server.port=9090"

The app should now be running locally on port 9090 (http://localhost:9090).
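The configuration step stores the key in a .env file, but this README does not show how the application reads it. As a minimal sketch of one way to do it, the hypothetical helper below (`DotEnvSketch` is not part of the project) parses simple `KEY=VALUE` lines and prefers a real environment variable over the file. It assumes a plain .env format without quoting or variable expansion.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not taken from the MorphousAI codebase.
final class DotEnvSketch {

    // Parses simple KEY=VALUE lines; blank lines, comments, and malformed lines are skipped.
    static Map<String, String> parse(List<String> lines) {
        Map<String, String> values = new LinkedHashMap<>();
        for (String raw : lines) {
            String line = raw.strip();
            if (line.isEmpty() || line.startsWith("#")) {
                continue;
            }
            int eq = line.indexOf('=');
            if (eq <= 0) {
                continue; // no '=' or empty key
            }
            values.put(line.substring(0, eq).strip(), line.substring(eq + 1).strip());
        }
        return values;
    }

    // Prefers the process environment, then falls back to the parsed .env values.
    static String resolveApiKey(Map<String, String> processEnv, Map<String, String> dotEnv) {
        String key = processEnv.get("OPENAI_API_KEY");
        if (key == null || key.isBlank()) {
            key = dotEnv.get("OPENAI_API_KEY");
        }
        if (key == null || key.isBlank()) {
            throw new IllegalStateException("OPENAI_API_KEY is not set");
        }
        return key;
    }
}
```

A caller could combine the two, for example `DotEnvSketch.resolveApiKey(System.getenv(), DotEnvSketch.parse(Files.readAllLines(Path.of(".env"))))`.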

📌 Usage

✏️ Text to Image

1.	Enter a text prompt.
2.	The app sends the prompt to the OpenAI image API.
3.	View the generated image in the UI.
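To make step 2 concrete, here is a rough sketch of the HTTP request that ends up going to OpenAI's image generation endpoint. The project itself uses Spring AI, so this is not its code: `ImageRequestSketch`, the model name `dall-e-3`, and the `1024x1024` size are illustrative assumptions, and the request is only built, not sent.

```java
import java.net.URI;
import java.net.http.HttpRequest;

// Illustrative only: MorphousAI goes through Spring AI rather than raw HTTP.
final class ImageRequestSketch {

    static final URI ENDPOINT = URI.create("https://api.openai.com/v1/images/generations");

    // Escapes the characters that would otherwise break a JSON string literal.
    static String escapeJson(String text) {
        StringBuilder out = new StringBuilder();
        for (char c : text.toCharArray()) {
            switch (c) {
                case '"':  out.append("\\\""); break;
                case '\\': out.append("\\\\"); break;
                case '\n': out.append("\\n");  break;
                case '\r': out.append("\\r");  break;
                case '\t': out.append("\\t");  break;
                default:
                    if (c < 0x20) {
                        out.append(String.format("\\u%04x", (int) c));
                    } else {
                        out.append(c);
                    }
            }
        }
        return out.toString();
    }

    // Builds the JSON body; model and size are assumed defaults, not project settings.
    static String body(String prompt) {
        return "{\"model\":\"dall-e-3\",\"prompt\":\"" + escapeJson(prompt)
                + "\",\"n\":1,\"size\":\"1024x1024\"}";
    }

    // Builds (but does not send) the POST request for the given prompt.
    static HttpRequest build(String apiKey, String prompt) {
        if (prompt == null || prompt.isBlank()) {
            throw new IllegalArgumentException("prompt must not be blank");
        }
        return HttpRequest.newBuilder(ENDPOINT)
                .header("Authorization", "Bearer " + apiKey)
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(body(prompt)))
                .build();
    }
}
```

Sending the result with `java.net.http.HttpClient` would return a JSON response describing the generated image, which the UI can then display.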

🔉 Text to Audio

1.	Enter text you want to hear.
2.	The app uses OpenAI’s text-to-speech API.
3.	Download/play the generated audio.
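One practical detail in step 2 is input length: OpenAI's speech endpoint documents a per-request character limit (4096 at the time of writing, treated here as an assumption). The sketch below shows one way longer text could be split on whitespace before synthesis. `SpeechChunkSketch` is a hypothetical helper, and whether MorphousAI does anything like this is not stated in this README.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not taken from the MorphousAI codebase.
final class SpeechChunkSketch {

    // Assumed per-request input limit for the text-to-speech endpoint.
    static final int MAX_INPUT_CHARS = 4096;

    // Splits text into chunks of at most maxChars, breaking on whitespace where possible.
    static List<String> chunk(String text, int maxChars) {
        if (maxChars <= 0) {
            throw new IllegalArgumentException("maxChars must be positive");
        }
        List<String> chunks = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (String word : text.strip().split("\\s+")) {
            if (word.isEmpty()) {
                continue; // happens only for empty input
            }
            // A single word longer than the limit is hard-split.
            while (word.length() > maxChars) {
                if (current.length() > 0) {
                    chunks.add(current.toString());
                    current.setLength(0);
                }
                chunks.add(word.substring(0, maxChars));
                word = word.substring(maxChars);
            }
            // Start a new chunk when adding this word would exceed the limit.
            if (current.length() > 0 && current.length() + 1 + word.length() > maxChars) {
                chunks.add(current.toString());
                current.setLength(0);
            }
            if (current.length() > 0) {
                current.append(' ');
            }
            current.append(word);
        }
        if (current.length() > 0) {
            chunks.add(current.toString());
        }
        return chunks;
    }
}
```

Each chunk from `SpeechChunkSketch.chunk(text, SpeechChunkSketch.MAX_INPUT_CHARS)` could then be sent as a separate request and the resulting audio segments played in order.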

📷 Image Upload

1.	Upload an image from your device.
2.	Extend this module to perform image captioning, text extraction, or further generative steps.
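Before passing an uploaded file to any further step, it can help to confirm that the bytes really are an image. The sketch below checks the well-known leading signature bytes of PNG, JPEG, and GIF files. `ImageTypeSketch` is a hypothetical helper, not part of the project, and it deliberately avoids any Vaadin upload API, since that wiring is not shown in this README.

```java
import java.util.Optional;

// Hypothetical helper, not taken from the MorphousAI codebase.
final class ImageTypeSketch {

    private static final byte[] PNG = {(byte) 0x89, 'P', 'N', 'G', 0x0D, 0x0A, 0x1A, 0x0A};
    private static final byte[] JPEG = {(byte) 0xFF, (byte) 0xD8, (byte) 0xFF};
    private static final byte[] GIF = {'G', 'I', 'F', '8'};

    // Returns the MIME type implied by the file signature, or empty if unrecognized.
    static Optional<String> detectMimeType(byte[] data) {
        if (startsWith(data, PNG)) {
            return Optional.of("image/png");
        }
        if (startsWith(data, JPEG)) {
            return Optional.of("image/jpeg");
        }
        if (startsWith(data, GIF)) {
            return Optional.of("image/gif");
        }
        return Optional.empty();
    }

    // True when data begins with every byte of prefix.
    private static boolean startsWith(byte[] data, byte[] prefix) {
        if (data == null || data.length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if (data[i] != prefix[i]) {
                return false;
            }
        }
        return true;
    }
}
```

An upload handler could reject the file when `detectMimeType` returns an empty `Optional`, rather than trusting the file extension alone.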

🧪 Example Screenshots

(Four screenshots of the application UI, captured 2025-12-16.)

🧠 How to Contribute

Contributions are welcome! You can help by:

• Adding new AI features (speech-to-text, image captioning)
• Improving UI/UX
• Writing tests or bug fixes
• Updating documentation

To contribute:

1.	Fork the repository on GitHub (https://github.com/ankitagrahari/MorphousAI).
2.	Clone your fork and create a feature branch:
git clone <your_forked_repo>
cd MorphousAI
git checkout -b feature/my-awesome-feature

Submit a pull request and describe your changes.

📄 License

Distributed under the MIT License. See LICENSE for more information.

❤️ Acknowledgements

Thanks to:

• OpenAI for powerful generative models 🌟
• The developer community for inspiration ✨

📞 Contact

If you have questions, feel free to reach out:

Ankit Agrahari
💬 GitHub: https://github.com/ankitagrahari
📎 Email: ankitagrahari.rkgit@gmail.com

🎉 Happy building with MorphousAI!
