(Course-based project)
This project addresses the challenges that hearing- and speech-impaired individuals face when communicating with the people around them by designing and implementing a real-time assistive system that uses deep learning to translate static sign language gestures into both textual and auditory output. Specifically, the system employs a Convolutional Neural Network (CNN) trained on the Sign Language MNIST dataset to classify American Sign Language (ASL) hand signs from live video input. The classified gestures are then converted into spoken words by a text-to-speech engine, providing real-time audio feedback.
The system was developed using Python and integrates several key libraries and frameworks. OpenCV facilitates video capture from a webcam, MediaPipe is used for hand detection and landmark tracking, and pyttsx3 provides offline speech synthesis capabilities. Together, these components form a pipeline capable of capturing, classifying, and vocalizing hand gestures in real time using only a standard computing device and webcam.
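To give a rough idea of how these pieces fit together, here is a minimal sketch of the capture, classify, and speak loop. It is illustrative only: the model file name `sign_model.h5`, the hand-crop preprocessing, the label mapping, and the "speak only when the letter changes" rule are assumptions for this sketch, not necessarily what `main.py` does.

```python
# Minimal sketch of the capture -> classify -> speak pipeline (illustrative only).
# Assumes a trained Keras model saved as "sign_model.h5" that takes 28x28 grayscale
# input in the Sign Language MNIST format; main.py may differ in the details.
import cv2
import mediapipe as mp
import numpy as np
import pyttsx3
import tensorflow as tf

# Labels 0-24 map to A-Y; J (label 9) never occurs and Z is excluded (both need motion).
LABELS = [chr(ord("A") + i) for i in range(25)]

model = tf.keras.models.load_model("sign_model.h5")  # assumed file name
engine = pyttsx3.init()                              # offline text-to-speech engine
last_spoken = None

cap = cv2.VideoCapture(0)
with mp.solutions.hands.Hands(max_num_hands=1, min_detection_confidence=0.7) as hands:
    while cap.isOpened():
        ok, frame = cap.read()
        if not ok:
            break
        results = hands.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        if results.multi_hand_landmarks:
            # Bounding box around the detected hand, from normalized landmarks.
            h, w, _ = frame.shape
            lm = results.multi_hand_landmarks[0].landmark
            xs = [int(p.x * w) for p in lm]
            ys = [int(p.y * h) for p in lm]
            x1, y1 = max(min(xs) - 20, 0), max(min(ys) - 20, 0)
            x2, y2 = min(max(xs) + 20, w), min(max(ys) + 20, h)
            roi = frame[y1:y2, x1:x2]
            if roi.size:
                # Preprocess the crop to match the 28x28 grayscale training format.
                gray = cv2.cvtColor(roi, cv2.COLOR_BGR2GRAY)
                gray = cv2.resize(gray, (28, 28)).astype("float32") / 255.0
                probs = model.predict(gray.reshape(1, 28, 28, 1), verbose=0)[0]
                letter = LABELS[int(np.argmax(probs))]
                cv2.putText(frame, letter, (x1, max(y1 - 10, 20)),
                            cv2.FONT_HERSHEY_SIMPLEX, 1.0, (0, 255, 0), 2)
                if letter != last_spoken:
                    engine.say(letter)      # speak the predicted letter aloud
                    engine.runAndWait()
                    last_spoken = letter
        cv2.imshow("Sign Language Translator", frame)
        if cv2.waitKey(1) & 0xFF == ord("q"):
            break
cap.release()
cv2.destroyAllWindows()
```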
Download the Sign Language MNIST dataset from Kaggle, create a folder named "data" inside the project folder, and place the "sign_mnist_train.csv" and "sign_mnist_test.csv" files inside it.
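If you want to verify the download, the snippet below shows one way to load the CSVs with pandas and reshape the pixel columns into 28x28 grayscale images. It assumes the standard Kaggle release layout (a "label" column followed by 784 pixel columns) and the folder structure described above.

```python
# Quick sanity check on the downloaded CSVs (assumes the standard Kaggle layout:
# a "label" column followed by 784 pixel columns per 28x28 grayscale image).
import pandas as pd

train_df = pd.read_csv("data/sign_mnist_train.csv")
test_df = pd.read_csv("data/sign_mnist_test.csv")

X_train = train_df.drop("label", axis=1).to_numpy().reshape(-1, 28, 28, 1) / 255.0
y_train = train_df["label"].to_numpy()
X_test = test_df.drop("label", axis=1).to_numpy().reshape(-1, 28, 28, 1) / 255.0
y_test = test_df["label"].to_numpy()

print(X_train.shape, y_train.shape)   # e.g. (27455, 28, 28, 1) (27455,) for the standard release
print(sorted(set(y_train)))           # labels 0-24 with 9 (J) absent; Z (25) is excluded
```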
Run the cells in model_notebook.ipynb to build and train the model, then run main.py.
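For orientation, here is a minimal Keras sketch of the kind of CNN such a notebook might train on this dataset. It is not the exact architecture in model_notebook.ipynb; the layer sizes, epoch count, and the saved file name `sign_model.h5` are assumptions for illustration.

```python
# Minimal CNN training sketch for 28x28 grayscale Sign Language MNIST images.
# Illustrative only; the actual architecture lives in model_notebook.ipynb.
import pandas as pd
import tensorflow as tf

def load_split(path):
    """Load one CSV split and return (images, labels) in Keras-ready shape."""
    df = pd.read_csv(path)
    X = df.drop("label", axis=1).to_numpy().reshape(-1, 28, 28, 1) / 255.0
    return X, df["label"].to_numpy()

X_train, y_train = load_split("data/sign_mnist_train.csv")
X_test, y_test = load_split("data/sign_mnist_test.csv")

num_classes = 25  # labels 0-24; J (9) never occurs and Z (25) is excluded (motion-based)

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(28, 28, 1)),
    tf.keras.layers.Conv2D(32, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dropout(0.3),
    tf.keras.layers.Dense(num_classes, activation="softmax"),
])

model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

model.fit(X_train, y_train, validation_data=(X_test, y_test),
          epochs=10, batch_size=128)
model.save("sign_model.h5")  # assumed file name; loaded later by main.py
```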
Read the project paper here: Paper Link
Submitting any part or the entirety of this work to fulfill a course or project requirement is considered cheating.