Discussion about this post

User's avatar
Piotr Jaskulski's avatar

When it comes to HTR/OCR tools,

The problem is that ABBYY FineReader does not cope well with 19th-century printed books from Eastern Europe, even though it is Russian software. And Transkribus is a commercial system and by no means cheap if you need to process large volumes of material. It is probably wiser to invest in eScriptorium/Kraken. But not in every case; if we have several thousand pages of manuscripts from dozens of people, this may lead to the need to retrain many models, which is time-consuming. Gemini, however, reads 19th- and 20th-century handwriting well enough to significantly speed up the work. Of course, LLMs make different kinds of errors than traditional HTR models; one must pay particular attention to proper names and areas with less legible handwriting, where the likelihood of hallucinations is greater.

MASAKO | NewForever's avatar

As a traveler of vibecoding, this was truly fascinating to me (thank you!).

10 more comments...

No posts

Ready for more?