What’s the state of the art for free OCR (Japanese specifically)? Is Tesseract still the best option?

Sounds like the consensus is that there are some promising developments but currently Tesseract is still the strongest option.

@misty I cannot speak about Japanese, but we use on a daily basis Tesseract for European languages (also Cyrillic using ones) and we are happy with. It also allows to catch intertitles of silent films written in Fraktur (the old German alphabet) even when it’s a handwritten letter. Therefore I presume that it works quite fine for Japanese as well.

Sign in to participate in the conversation is a space for folks interested in productive conversations about, well, digital preservation! If you enjoy talking about how to do memory work with computers, or even with cardboard boxes of old photos, you belong with us on Many of us are/were Twitter users looking for an inclusive and community supported approach to social media. If any of these things sound good to you, consider joining us now.