ocr_pdf

Simple python3 script to make PDF searchable. Made to make my life easier when scanning schoolwork.

Requirements:

Tesseract. Tested with 4.1.1
PIL and pdf2image.

Usage:

usage: makepdf.py [-h] [-o OUTPUT] input_file

Simple script to make PDFs searchable

positional arguments:
  input_file            filepath for PDF to be targeted

optional arguments:
  -h, --help            show this help message and exit
  -o OUTPUT, --output OUTPUT
                        output filepath

Upcoming (?) Features:

Use /tmp/ to store files
Ability to output to plaintext
Take in other image formats as input, and output a PDF
Ability to apply an adaptive contrast/sharpening filter to PDF

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
makepdf.py		makepdf.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ocr_pdf

Requirements:

Usage:

Upcoming (?) Features:

About

Uh oh!

Releases

Packages

Languages

Shah06/ocr_pdf

Folders and files

Latest commit

History

Repository files navigation

ocr_pdf

Requirements:

Usage:

Upcoming (?) Features:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages