You could spin up paperless-ngx. Or use pdf24 creator. Beware paperless consume will delete the file.
I used paperless-ngx before and it works pretty good.
A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.
For Example
We welcome posts that include suggestions for good self-hosted alternatives to popular online services, how they are better, or how they give back control of your data. Also include hints and tips for less technical readers.
Useful Lists
You could spin up paperless-ngx. Or use pdf24 creator. Beware paperless consume will delete the file.
I used paperless-ngx before and it works pretty good.
I will check it up, i have Stirlingpdf and I see it also has ocr support
tesseract-ocr
? You can download it via apt or something similar.
paperless-ngx has built in ocr but I don't think it would fit your needs
I will check it up
Windows 11 has this built in if you take a screenshot
Didn't know that,i use flameshot for screenshots,i will take a look thnx
I'm not sure I understand you correctly. Do you want to apply OCR to PDFs or to Screenshots?
For PDFs there's the excellent ocrmypdf which paperless-ngx uses under the hood.
Nextcloud AIO (all-in-one) comes with full text search installed, which brings tesseract to nextcloud. so you can let tesseract-ocr run over all documents and then they will be searchable with Elasticsearch.