This is a re-implementation of Thompson et al.'s algorithm for OCR post-correction. The implementation is built on the spellchecker "pyspellchecker". "Customised" refers to that suggested corrections ...
This is the source code for the paper Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models by Ramirez-Orta et al., (2021). In this paper, we propose a novel ...
Furthermore, there are other fascinating technical bits under the hood for both local and Cloud OCR: Prizmo Go offers image stabilization through sharpness tracking, and it pre-processes an image as ...