OCR & Navigation algorithm

Optical character recognition (OCR) converts images and videos of text into letters, words, and sentences. It is widely used to extract information from images and video, for example in signature recognition, automated data evaluation, and security systems. Commercial uses include validating data records, passport documents, place signs, business cards, and printouts of static data. OCR is also a field of research in pattern recognition, deep learning, artificial intelligence, and computer vision.

• Project goals:

We start from an existing OCR system that works on still images and extend it to work on recorded videos and to extract text from real-time video. The extracted text then feeds navigation algorithms that find the location in real time. The project relies on existing Python libraries: OpenCV, which does most of the work, and Tesseract for OCR.

Project scope: The project includes the existing libraries (OpenCV and Tesseract for OCR) and the movies and images stored in Google Drive. It also integrates deep learning. From the neural network we use two output layers: the first is a sigmoid output that gives the probability that an area contains text, and the second represents the geometry of the image, which we use to derive the bounding box coordinates of the text.
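As an illustration of how the two output layers could be combined, here is a minimal sketch that turns a score map and a geometry map into bounding boxes. It assumes the layout of the pretrained EAST model commonly used with OpenCV: a score map of shape (1, 1, rows, cols), a geometry map of shape (1, 5, rows, cols) holding four edge distances plus a rotation angle, and an output map at one quarter of the input resolution. The function name `decode_east` and its parameters are hypothetical and not taken from this repository.

```python
import math


def decode_east(scores, geometry, min_confidence=0.5, stride=4.0):
    """Turn EAST-style score and geometry maps into boxes and confidences.

    scores:   shape (1, 1, rows, cols), sigmoid text probabilities
    geometry: shape (1, 5, rows, cols), distances to the top, right,
              bottom, and left box edges plus a rotation angle (assumed layout)
    Works on NumPy arrays or nested lists, since it only uses indexing.
    """
    rows = len(scores[0][0])
    cols = len(scores[0][0][0])
    boxes, confidences = [], []
    for y in range(rows):
        for x in range(cols):
            score = float(scores[0][0][y][x])
            if score < min_confidence:
                continue  # this cell is unlikely to contain text
            top, right, bottom, left = (
                float(geometry[0][c][y][x]) for c in range(4)
            )
            angle = float(geometry[0][4][y][x])
            cos_a, sin_a = math.cos(angle), math.sin(angle)
            # Each cell of the output map covers `stride` input pixels.
            offset_x, offset_y = x * stride, y * stride
            end_x = int(offset_x + cos_a * right + sin_a * bottom)
            end_y = int(offset_y - sin_a * right + cos_a * bottom)
            start_x = int(end_x - (left + right))
            start_y = int(end_y - (top + bottom))
            boxes.append((start_x, start_y, end_x, end_y))
            confidences.append(score)
    return boxes, confidences
```

The boxes returned here are axis-aligned and usually overlap heavily, so in practice they would typically be filtered with non-maximum suppression before recognition.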

Understanding OpenCV OCR and Tesseract text recognition:

Work process: We perform both (1) text detection and (2) text recognition using OpenCV, Python, and Tesseract. The text detection model detects text in an image and localizes the bounding box coordinates of each text region. The next step is to take each of these regions and recognize (OCR) the text in it using OpenCV and Tesseract.
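A minimal sketch of these two stages follows. It assumes the `opencv-python` and `pytesseract` packages and the pretrained EAST graph commonly distributed for OpenCV; the layer names, the 320x320 input size, and the helper names `run_east`, `pad_box`, and `recognize_regions` are assumptions for illustration, not this repository's code. Turning the raw maps into boxes is a separate decoding step that is not part of this block.

```python
# Output layer names of the pretrained EAST graph (assumed, not from this repo)
EAST_LAYERS = ["feature_fusion/Conv_7/Sigmoid", "feature_fusion/concat_3"]


def run_east(frame, net, width=320, height=320):
    """Stage 1: run the detector and return the score and geometry maps.

    EAST expects input dimensions that are multiples of 32. Boxes decoded
    from these maps are in resized coordinates and must be scaled back to
    the original frame size before cropping.
    """
    import cv2  # imported lazily so pad_box stays usable without OpenCV

    blob = cv2.dnn.blobFromImage(
        frame, 1.0, (width, height),
        (123.68, 116.78, 103.94), swapRB=True, crop=False,
    )
    net.setInput(blob)
    scores, geometry = net.forward(EAST_LAYERS)
    return scores, geometry


def pad_box(box, frame_w, frame_h, padding=0.05):
    """Grow a (start_x, start_y, end_x, end_y) box and clamp it to the frame."""
    start_x, start_y, end_x, end_y = box
    dx = int((end_x - start_x) * padding)
    dy = int((end_y - start_y) * padding)
    return (
        max(0, start_x - dx), max(0, start_y - dy),
        min(frame_w, end_x + dx), min(frame_h, end_y + dy),
    )


def recognize_regions(frame, boxes, padding=0.05):
    """Stage 2: crop each text region and pass it to Tesseract."""
    import cv2
    import pytesseract

    frame_h, frame_w = frame.shape[:2]
    results = []
    for box in boxes:
        x0, y0, x1, y1 = pad_box(box, frame_w, frame_h, padding)
        roi = frame[y0:y1, x0:x1]
        if roi.size == 0:
            continue  # box fell outside the frame
        rgb = cv2.cvtColor(roi, cv2.COLOR_BGR2RGB)  # OpenCV frames are BGR
        # --oem 1 selects the LSTM engine, --psm 7 treats the ROI as one line
        text = pytesseract.image_to_string(rgb, config="--oem 1 --psm 7")
        results.append(((x0, y0, x1, y1), text.strip()))
    return results
```

The network itself could be loaded once with `cv2.dnn.readNet(...)` on a local copy of the EAST model file, and for video the same functions could be called on each frame read from `cv2.VideoCapture`.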

1. Perform text detection using OpenCV’s EAST text detector, a highly accurate deep learning text detector used to detect text in natural scene images.
2. Once the text regions have been detected with OpenCV, extract each text ROI (region of interest) and pass it into Tesseract, which gives us an entire OpenCV OCR pipeline. The underlying OCR engine itself utilizes a Long Short-Term Memory (LSTM) network, a kind of Recurrent Neural Network (RNN).

With this first step almost complete, the next step is the navigation algorithms that tell us the location. We consider vectors in all directions, and by comparing the size of the text in the image, measured in pixels, with its real-world size, we estimate the user's distance from it.
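One common way to relate pixel size to distance is the pinhole-camera (similar triangles) model. The source does not say which model the project uses, so the sketch below is an assumption for illustration; the function names and the calibration procedure are hypothetical.

```python
def focal_length_from_reference(known_distance_m, real_height_m, pixel_height):
    """Calibrate the focal length (in pixels) from one reference photo.

    The reference object has a known real height and was photographed
    from a known distance, where it appeared `pixel_height` pixels tall.
    """
    if real_height_m <= 0:
        raise ValueError("real_height_m must be positive")
    return pixel_height * known_distance_m / real_height_m


def estimate_distance(real_height_m, pixel_height, focal_length_px):
    """Estimate the distance to an object of known real height.

    Pinhole model: pixel_height / focal_length = real_height / distance.
    """
    if pixel_height <= 0:
        raise ValueError("pixel_height must be positive")
    return focal_length_px * real_height_m / pixel_height


# Example: a 0.5 m tall sign appears 200 px tall from 2 m away.
focal = focal_length_from_reference(2.0, 0.5, 200)
# Later the same sign appears 100 px tall, so it is twice as far away.
distance = estimate_distance(0.5, 100, focal)
```

The pixel height could come from the bounding box of a detected text region, provided the real-world size of that text (for example, a standard road sign) is known in advance.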

Example: This project is a small part of a larger project. The system is to be mounted on a robot and tested so that, once it is running on a vehicle, it works in real time and helps the vehicle identify text on the road or any other text.


JbareenM/RealTime_OCR
