ocr-table

Open source MIT Python

GitHub

277

Stars

Forks

Issues

Watchers

6 years

Last Commit

Terminal & CLI Tools E-book Management

About ocr-table

Extract tables from scanned image PDFs using Optical Character Recognition.

Web Self-hosted

Python

Source Code

Published by

Visit View Profile

This project aims to extract tables from scanned image PDFs using Optical Character Recognition.

Clear the pdf/ folder and copy all your pdf files to be scanned in it.
Run the OCR:
```
 python3 shellocr.py
```
The scanned text files shall be available in the txt/ folder once the process completes.