- 著者
-
宮川 創
- 出版者
- 関西大学アジア・オープン・リサーチセンター
- 雑誌
- KU-ORCASが開くデジタル化時代の東アジア文化研究 : オープン・プラットフォームで浮かび上がる、新たな東アジアの姿
- 巻号頁・発行日
- pp.323-336, 2022-03-31
This paper discusses the differences and suitable uses of three handwritten text recognition (HTR) programs developed in Europe: Transkribus, eScriptorium/Kraken, and OCR4all. It commences with an overview of deep learning, HTR, and OCR (optical character recognition) before progressing to review the three programs of interest from the perspectives of history, developer, accuracy rate, layout recognition (including writing orientation), user experience, and cost. All three programs use deep-learning machine-learning technologies. They have also all been proven to reach accuracy rates of close to one hundred percent when appropriately trained depending on the quality of the images of handwritten text, training data, and validation data. Second, the user experience is very important; Transkribus has the simplest installation procedure and graphical user interface, while OCR4all and eScriptorium require users to have expert computer skills. Third, in terms of cost, users of Transkribus are required to purchase credits to access the system and use HTR models to recognize a new text, while eScriptorium and OCR4all do not rely on credit purchase. Finally, we conclude this paper with an overview of suitable cases for each program.