Metadata-Version: 2.1
Name: pytextractor
Version: 0.0.2
Summary: text extractor from images
Home-page: https://github.com/danwald/pytextractor/
Author: danny crasto
Author-email: danwald79@gmail.com
License: MIT
Platform: UNKNOWN
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Description-Content-Type: text/markdown
Requires-Dist: imutils (==0.5.2)
Requires-Dist: opencv-python (==4.0.0.21)
Requires-Dist: Pillow (==5.4.1)
Requires-Dist: pytesseract (==0.2.6)

# pytextractor
Python OCR using Tesseract with the EAST OpenCV text detector.

Uses the EAST OpenCV text detector described [here](https://www.pyimagesearch.com/2018/08/20/opencv-text-detection-east-text-detector/) together with [pytesseract](https://github.com/madmaze/pytesseract) to extract text (the default) or numbers from images.
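
The OCR half of that pipeline can be sketched roughly as follows. This is a minimal illustration, not the code shipped in this package: the helper names `tesseract_config` and `ocr_region` are hypothetical, the `--psm 7` page segmentation mode and the digit whitelist are assumptions about how text versus numbers could be selected, and whether Tesseract honours the whitelist depends on its version and engine.

```python
def tesseract_config(numbers_only=False):
    """Build a Tesseract config string for one cropped text region.

    Hypothetical helper; the options chosen here are assumptions.
    """
    # --psm 7 asks Tesseract to treat the crop as a single line of text.
    config = "--psm 7"
    if numbers_only:
        # Assumption: digits are selected with a character whitelist.
        config += " -c tessedit_char_whitelist=0123456789"
    return config


def ocr_region(image, box, numbers_only=False):
    """Run OCR on one (left, top, right, bottom) box of a PIL image."""
    # Imported lazily so tesseract_config() can be used on its own,
    # without pytesseract or the Tesseract binary installed.
    import pytesseract

    crop = image.crop(box)  # PIL expects (left, upper, right, lower)
    text = pytesseract.image_to_string(
        crop, config=tesseract_config(numbers_only)
    )
    return text.strip()
```

With Pillow, pytesseract and Tesseract installed, a call such as `ocr_region(Image.open("sign.png"), (10, 20, 200, 60), numbers_only=True)` would return the recognised string for that box; the file name and coordinates are placeholders. In the full pipeline the boxes would come from the EAST detector rather than being written by hand.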

```
usage: text_detection.py [-h] [-east EAST] [-c CONFIDENCE] [-w WIDTH]
                         [-e HEIGHT] [-d] [-n] [-p PERCENTAGE]
                         images [images ...]

Text/Number extractor from image

positional arguments:
  images                path(s) to input image(s)

optional arguments:
  -h, --help            show this help message and exit
  -east EAST, --east EAST
                        path to input EAST text detector 
  -c CONFIDENCE, --confidence CONFIDENCE
                        minimum probability required to inspect a region[0.5]
  -w WIDTH, --width WIDTH
                        resized image width (should be multiple of 32)[320]
  -e HEIGHT, --height HEIGHT
                        resized image height (should be multiple of 32)[320]
  -d, --display         Display bounding boxes
  -n, --numbers         Detect only numbers
  -p PERCENTAGE, --percentage PERCENTAGE
                        Expand/shrink detected bound box[2.0]
```
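
For readers who want to see how the options above fit together, here is a minimal `argparse` sketch that mirrors the usage text. It is not the source of `text_detection.py`: the argument types, the `None` default for `--east`, and the two helpers `round_to_multiple_of_32` and `expand_box` are assumptions added for illustration. In particular, `expand_box` assumes that `--percentage` pads each side of a box by that percentage of the box's width and height, which the help text does not spell out.

```python
import argparse


def build_parser():
    """Sketch of a parser matching the usage text (types are assumed)."""
    parser = argparse.ArgumentParser(
        prog="text_detection.py",
        description="Text/Number extractor from image",
    )
    parser.add_argument("images", nargs="+", help="path(s) to input image(s)")
    parser.add_argument("-east", "--east",
                        help="path to input EAST text detector")
    parser.add_argument("-c", "--confidence", type=float, default=0.5,
                        help="minimum probability required to inspect a region")
    parser.add_argument("-w", "--width", type=int, default=320,
                        help="resized image width (should be multiple of 32)")
    parser.add_argument("-e", "--height", type=int, default=320,
                        help="resized image height (should be multiple of 32)")
    parser.add_argument("-d", "--display", action="store_true",
                        help="Display bounding boxes")
    parser.add_argument("-n", "--numbers", action="store_true",
                        help="Detect only numbers")
    parser.add_argument("-p", "--percentage", type=float, default=2.0,
                        help="Expand/shrink detected bound box")
    return parser


def round_to_multiple_of_32(size):
    """Round a pixel size to the nearest positive multiple of 32."""
    return max(32, int(round(size / 32.0)) * 32)


def expand_box(box, percentage, image_size):
    """Pad (or shrink, if negative) a box and clamp it to the image.

    Assumed semantics: each side moves by `percentage` percent of the
    box's own width or height. `box` is (left, top, right, bottom) and
    `image_size` is (width, height).
    """
    left, top, right, bottom = box
    dx = int((right - left) * percentage / 100.0)
    dy = int((bottom - top) * percentage / 100.0)
    width, height = image_size
    return (
        max(0, left - dx),
        max(0, top - dy),
        min(width, right + dx),
        min(height, bottom + dy),
    )
```

For example, `build_parser().parse_args(["-n", "-w", "640", "a.png"])` yields `numbers=True`, `width=640`, and the documented defaults for everything else. Rounding the requested size with `round_to_multiple_of_32` before resizing is one way to respect the "multiple of 32" note on `--width` and `--height`.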


