How to pick out PDF documents that contain the right words?
There is the arbitration court case file database (kad.arbitr.ru), which contains various publicly available documents in PDF.
I need to collect links to the documents that contain certain keywords.
I'm still learning Python, so please give me a tip on which direction to take: what to read or look at, and which features to consider. Maybe there are some similar existing solutions?
After a few search queries the site shows a captcha. Will that be a problem for parsing?
asked June 3rd 19 at 19:31
First check that the PDF actually contains extractable text, then use PyPDF2.pdf.PageObject.extractText to get it.
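A minimal sketch of that approach, under some assumptions: the PDF has already been downloaded as bytes, and PyPDF2 is installed. The helper names (find_keywords, has_text_layer, extract_pdf_text) are made up for illustration. PyPDF2 1.x exposes PdfFileReader and extractText, while later releases renamed them to PdfReader and extract_text, so the sketch tries to handle both.

```python
import io


def find_keywords(text, keywords):
    """Return the keywords that occur in the text (case-insensitive)."""
    lowered = text.lower()
    return [kw for kw in keywords if kw.lower() in lowered]


def has_text_layer(text):
    """A scanned PDF usually yields no text; such files would need OCR."""
    return bool(text.strip())


def extract_pdf_text(data):
    """Extract text from PDF bytes with PyPDF2 (hypothetical helper)."""
    import PyPDF2  # imported lazily so the helpers above work without it

    stream = io.BytesIO(data)
    if hasattr(PyPDF2, "PdfReader"):
        # Newer PyPDF2 releases: PdfReader / extract_text
        reader = PyPDF2.PdfReader(stream)
        pages = [page.extract_text() or "" for page in reader.pages]
    else:
        # PyPDF2 1.x: PdfFileReader / extractText, as named in the answer
        reader = PyPDF2.PdfFileReader(stream)
        pages = [reader.getPage(i).extractText()
                 for i in range(reader.getNumPages())]
    return "\n".join(pages)


def matching_keywords(data, keywords):
    """Return matched keywords, or None if the PDF has no extractable text."""
    text = extract_pdf_text(data)
    if not has_text_layer(text):
        return None
    return find_keywords(text, keywords)
```

For each downloaded document you would call matching_keywords(pdf_bytes, ["keyword one", "keyword two"]) and keep the link when the result is a non-empty list. A result of None means the file is probably a scan without a text layer.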