How can information be removed automatically from a scanned document?

The task is to scan a large volume of documents. These documents contain information that should be removed automatically, identified by keywords: for example, a specific item in a specification. The scanned document (PDF, JPG, etc.) should come out with those words already removed.
Is there software that does this?
April 7th 20 at 11:00
2 answers
April 7th 20 at 11:02
FineReader
CineForms

Both have SDKs for corporate clients.
FineReader: I have not seen any functionality for automatically editing a raster document. Can you share a link to a description?
CineForms: that seems to be all about video processing.

As for the SDK... I would much rather get something ready to use. - Natalia23 commented on April 7th 20 at 11:05
@Natalia23,

https://help.abbyy.com/en-us/finereader/14/user_gu...

https://habr.com/ru/post/153617/

I have not dug into this for a long time, so I just dropped in some relevant links; take a look. - Otho_Wunsc commented on April 7th 20 at 11:08
@Natalia23, on top of that, words can be deleted after recognition. Handwritten notes are a separate quest. - Otho_Wunsc commented on April 7th 20 at 11:11
@Otho_Wunsc, I read it. Maybe I stated the task incorrectly. I need the software itself, in automatic mode, to find words from a dictionary in the scan and edit them out right away, so that the saved image already has those words painted over.
With your link, it has to be done manually... - Natalia23 commented on April 7th 20 at 11:14
April 7th 20 at 11:04
Option 1:
Recognize the scan with tesseract into hOCR, then find the right words and their coordinates in the output. Paint over the words on the scan at those coordinates with ImageMagick.
Option 2:
Recognize with FineReader, export to DjVu, extract the text layer with coordinates from the DjVu file, and parse it. Then do the same thing with ImageMagick.
All of this can be automated with scripts.
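
A rough sketch of what Option 1 could look like as a script, in Python. This is just one way to wire it up, not a tested pipeline: the names `HocrWords` and `redaction_command` and the file names are made up for illustration, and it assumes tesseract writes each word as an `ocrx_word` span with a `bbox x0 y0 x1 y1` entry in its `title` attribute. It only builds the ImageMagick command line; `convert` is the ImageMagick 6 binary name, on version 7 it is `magick`.

```python
import re
from html.parser import HTMLParser

BBOX_RE = re.compile(r"bbox (\d+) (\d+) (\d+) (\d+)")


class HocrWords(HTMLParser):
    """Collect (text, (x0, y0, x1, y1)) for every ocrx_word span in hOCR."""

    def __init__(self):
        super().__init__()
        self.words = []
        self._bbox = None   # bbox of the word span currently open, if any
        self._text = []
        self._depth = 0     # nesting depth of tags inside that word span

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if self._bbox is not None:
            self._depth += 1  # e.g. <strong> or <em> inside a word
            return
        if tag == "span" and "ocrx_word" in (attrs.get("class") or "").split():
            match = BBOX_RE.search(attrs.get("title") or "")
            if match:
                self._bbox = tuple(int(v) for v in match.groups())
                self._text = []
                self._depth = 0

    def handle_endtag(self, tag):
        if self._bbox is None:
            return
        if self._depth > 0:
            self._depth -= 1
            return
        self.words.append(("".join(self._text).strip(), self._bbox))
        self._bbox = None

    def handle_data(self, data):
        if self._bbox is not None:
            self._text.append(data)


def redaction_command(hocr_text, keywords, src, dst):
    """Build an ImageMagick command that paints black boxes over keywords."""
    parser = HocrWords()
    parser.feed(hocr_text)
    wanted = {k.lower() for k in keywords}
    cmd = ["convert", src, "-fill", "black"]
    for text, (x0, y0, x1, y1) in parser.words:
        # Ignore surrounding punctuation and case when matching.
        if text.strip(".,;:!?()\"'").lower() in wanted:
            cmd += ["-draw", f"rectangle {x0},{y0} {x1},{y1}"]
    cmd.append(dst)
    return cmd


if __name__ == "__main__":
    import subprocess

    # Hypothetical file names; tesseract writes scan.hocr next to the image.
    subprocess.run(["tesseract", "scan.png", "scan", "hocr"], check=True)
    with open("scan.hocr", encoding="utf-8") as fh:
        command = redaction_command(fh.read(), ["price"], "scan.png", "redacted.png")
    subprocess.run(command, check=True)
```

This matches single words only; a multi-word phrase from the dictionary would need neighbouring boxes merged, and OCR misreads will slip through unless the matching is made fuzzy. For PDFs, the pages would have to be rasterized first.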
