The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Download
Share
Publications Copernicus
Download
Citation
Share
Articles | Volume XLVIII-M-9-2025
https://doi.org/10.5194/isprs-archives-XLVIII-M-9-2025-629-2025
https://doi.org/10.5194/isprs-archives-XLVIII-M-9-2025-629-2025
01 Oct 2025
 | 01 Oct 2025

InscriberNet: A Novel Framework for Chinese Character Recognition in Stone Inscriptions using CNNs and Swin Transformer

Akanksha Jain and Jaehong Ahn

Keywords: Optical Character Recognition (OCR), Chinese characters, CNN, Multi-Scale Attention, Swin Transformer, Hwaeom Stone Sutra

Abstract. Stone inscriptions pose distinct challenges for optical character recognition due to erosion, disruption, and fragmentation. We present InscriberNet, a deep learning network designed for the recognition of Chinese characters from degraded stone surfaces. It incorporates a CNN-based denoising module, a ResNet-based feature extractor, a Multi-Scale Attention Mechanism, and a Swin Transformer to capture both local and global data. InscriberNet, assessed using the Hwaeom Stone Sutra dataset, attains an accuracy of 86.5%, surpassing conventional models such as CRNN and CNN-based denoising OCR. The findings underscore its resilience, efficacy, and relevance for the digitization and preservation of culturally important historical documents.

Share