What’s the state of the art for free OCR (Japanese specifically)? Is Tesseract still the best option?
@misty I cannot speak for Japanese, but we use Tesseract on a daily basis for European languages (including those written in Cyrillic) and we are happy with it. It also lets us capture the intertitles of silent films set in Fraktur (the old German blackletter script), even when the intertitle shows a handwritten letter. So I presume it works quite well for Japanese too.
Tesseract recently added an LSTM-based recognition engine (as of version 4), and LSTMs are among the current best methods for OCR.
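For anyone who wants to try the LSTM engine on Japanese, here is a minimal sketch that calls the `tesseract` command line tool from Python. It assumes Tesseract 4 or later is installed with the `jpn` language data, and that the page is a single block of horizontal text (hence `--psm 6`); the function names `build_tesseract_cmd` and `run_ocr` are made up for this example, and vertical text would likely need the `jpn_vert` data instead.

```python
import shutil
import subprocess
import sys


def build_tesseract_cmd(image_path, lang="jpn", oem=1, psm=6):
    """Build a Tesseract CLI command that prints recognized text to stdout.

    oem=1 selects the LSTM engine only; psm=6 assumes one uniform block of text.
    """
    return [
        "tesseract", image_path, "stdout",
        "-l", lang,
        "--oem", str(oem),
        "--psm", str(psm),
    ]


def run_ocr(image_path, lang="jpn"):
    """Run Tesseract on an image and return the recognized text."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract binary not found on PATH")
    result = subprocess.run(
        build_tesseract_cmd(image_path, lang=lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Tesseract fails
    )
    return result.stdout


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python ocr_sketch.py scan.png
    print(run_ocr(sys.argv[1]))
```

Swapping `lang` for another installed language pack is the only change needed to test other scripts; accuracy on real scans will depend a lot on image quality and layout.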