This is an external link to https://datascience.blog.wzb.eu/2017/02/16/data-mining-ocr-pdfs-using-pdftabextract-to-liberate-tabular-data-from-scanned-documents/.
Data Mining OCR PDFs — Using pdftabextract to liberate tabular data from scanned documents
If you spotted a mistake or want to comment on this post, please contact me:
post -at- mkonrad -dot- net
.