Heyo! Can anyone recommend a free (as in beer) option for transforming image PDFs to OCR'd PDFs [1] ? French support + macOS required, FLOSS preferred.

[1]: I'm not sure if I'm very clear 😕. Here's my use case: I have an app on my phone that scans documents to PDFs but it doesn't do any OCR. I also have a bunch of digital documents for which I don't have a paper version anymore. I'd like to OCR these documents to make them searchable and allow copy/paste.

@Crocmagnon I have used PDFSandwich

http://www.tobias-elze.de/pdfsandwich/

OCR is done by tesseract, which isn't top grade but works for me.
Follow

@ben @themactep Thanks you both for your suggestions! pdfsandwich seems like a cool wrapper around tesseract and other tools.
It seems to work OK for french documents.

I'd be happy to have GUI suggestions as well!

At least I have a nice CLI tool in my toolbelt now 👍

Sign in to participate in the conversation
Fosstodon

Fosstodon is an English speaking Mastodon instance that is open to anyone who is interested in technology; particularly free & open source software.