An AI-powered application that converts food menu images to Braille text for visually impaired users.
- Upload menu images
- Extract text using AI-powered document understanding (LayoutLMv2)
- Process and structure menu text using LLMs
- Convert text to Braille
- Display Braille in multiple formats (text, visual, side-by-side)
- Download as PDF in different formats
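The Braille conversion step can be illustrated with a minimal sketch. It assumes uncontracted (Grade 1) English Braille rendered with Unicode Braille pattern characters; the function names `cell` and `to_braille` are hypothetical, and the application's actual conversion logic may differ.

```python
# Minimal sketch: uncontracted (Grade 1) Braille via Unicode Braille patterns.
# Names here are illustrative, not taken from the project's code.

BRAILLE_BASE = 0x2800  # start of the Unicode Braille Patterns block

# Dot numbers for the first ten letters; later letters are derived from these.
FIRST_DECADE = {
    "a": (1,), "b": (1, 2), "c": (1, 4), "d": (1, 4, 5), "e": (1, 5),
    "f": (1, 2, 4), "g": (1, 2, 4, 5), "h": (1, 2, 5), "i": (2, 4), "j": (2, 4, 5),
}


def cell(dots):
    """Return the Braille character for a set of dot numbers (dot n sets bit n-1)."""
    return chr(BRAILLE_BASE + sum(1 << (d - 1) for d in dots))


def _build_letters():
    table = {}
    for first, second in zip("abcdefghij", "klmnopqrst"):
        table[first] = cell(FIRST_DECADE[first])
        table[second] = cell(FIRST_DECADE[first] + (3,))  # k-t add dot 3
    for letter, source in zip("uvxyz", "abcde"):
        table[letter] = cell(FIRST_DECADE[source] + (3, 6))  # u, v, x, y, z add dots 3 and 6
    table["w"] = cell((2, 4, 5, 6))  # w does not follow the decade pattern
    return table


LETTERS = _build_letters()
DIGITS = {d: LETTERS[l] for d, l in zip("1234567890", "abcdefghij")}
CAPITAL_SIGN = cell((6,))
NUMBER_SIGN = cell((3, 4, 5, 6))


def to_braille(text):
    """Convert text cell by cell; characters without a mapping pass through unchanged."""
    out = []
    in_number = False
    for ch in text:
        if ch.isdigit() and ch in DIGITS:
            if not in_number:
                out.append(NUMBER_SIGN)  # one number sign per run of digits
                in_number = True
            out.append(DIGITS[ch])
            continue
        in_number = False
        lower = ch.lower()
        if lower in LETTERS:
            if ch != lower:
                out.append(CAPITAL_SIGN)  # capital sign precedes each uppercase letter
            out.append(LETTERS[lower])
        else:
            out.append(ch)
    return "".join(out)
```

Calling `to_braille("Tea 5")` returns a string of Braille cells with the space preserved. The sketch omits contractions, punctuation, and the letter sign needed when a letter from a to j directly follows a number, all of which a production converter would have to handle.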
To deploy directly from GitHub:
- Fork this repository to your GitHub account
- Go to Hugging Face Spaces
- Click "Create new Space"
- Choose "Streamlit" as the SDK
- Connect your GitHub account and select this repository
- Choose the hardware (a GPU is recommended for better performance)
- Click "Create Space"
To deploy by uploading the files manually:
- Go to Hugging Face Spaces
- Click "Create new Space"
- Choose "Streamlit" as the SDK
- Give your Space a name
- Choose the hardware (a GPU is recommended for better performance)
- Click "Create Space"
- Clone the Space repository locally
- Copy all files from this project to the cloned repository
- Push the changes to the Space repository
- Minimum: CPU (2 vCPUs, 16 GB RAM)
- Recommended: GPU (T4 or better)
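As a rough illustration of how an application might adapt to these two hardware tiers, the sketch below picks a device at startup. It assumes PyTorch is the backend (plausible given the models used, but not confirmed here), and `pick_device` is a hypothetical helper name.

```python
def pick_device():
    """Return "cuda" when a GPU is usable through PyTorch, otherwise "cpu"."""
    try:
        import torch  # assumption: PyTorch is the inference backend
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"
```

The returned string could then be passed wherever models are loaded, so the same code runs on a CPU-only Space and on a T4.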
- Document AI: microsoft/layoutlmv2-base-uncased
- Text Processing: meta-llama/Meta-Llama-3-8B-Instruct (with fallback to mistralai/Mistral-7B-Instruct-v0.2)
- Context Enhancement: facebook/bart-large-cnn
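The fallback from Llama 3 to Mistral for text processing could be implemented along the following lines. This is a sketch, not the project's code: `load_first_available` is a hypothetical helper, and the `loader` argument stands in for whatever call actually loads a model (for example, a function wrapping the Transformers library).

```python
def load_first_available(model_ids, loader):
    """Try each model ID in order and return (model_id, model) for the first that loads.

    `loader` is any callable that takes a model ID and returns a loaded model,
    raising an exception on failure (gated repository, missing weights, out of memory).
    """
    failures = []
    for model_id in model_ids:
        try:
            return model_id, loader(model_id)
        except Exception as exc:  # broad on purpose: any load failure triggers the fallback
            failures.append(f"{model_id}: {exc}")
    raise RuntimeError("no model could be loaded: " + "; ".join(failures))


# Preference order taken from the list above.
TEXT_MODELS = [
    "meta-llama/Meta-Llama-3-8B-Instruct",
    "mistralai/Mistral-7B-Instruct-v0.2",
]
```

Passing the loader in as a parameter keeps the fallback logic independent of any particular library and makes it easy to exercise with a stub.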
- Clone this repository
- Install dependencies:
  pip install -r requirements.txt
- Run the application:
  streamlit run app.py
- Improved menu section recognition
- Support for multiple languages
- Physical Braille printer integration
- Mobile app version