<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">ISPRS-Archives</journal-id>
<journal-title-group>
<journal-title>The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences</journal-title>
<abbrev-journal-title abbrev-type="publisher">ISPRS-Archives</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">2194-9034</issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/isprs-archives-XLVIII-M-2-2023-557-2023</article-id>
<title-group>
<article-title>AI-ASSISTED DIGITALISATION OF HISTORICAL DOCUMENTS</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Ferro</surname>
<given-names>S.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Pelillo</surname>
<given-names>M.</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Traviglia</surname>
<given-names>A.</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>Ca’ Foscari University of Venice, DAIS, via Torino 155, 30172 Venice, Italy</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>Istituto Italiano di Tecnologia (IIT), Centre for Cultural Heritage Technology, via Torino 155, 30172 Venice, Italy</addr-line>
</aff>
<pub-date pub-type="epub">
<day>24</day>
<month>06</month>
<year>2023</year>
</pub-date>
<volume>XLVIII-M-2-2023</volume>
<fpage>557</fpage>
<lpage>562</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2023 S. Ferro et al.</copyright-statement>
<copyright-year>2023</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://isprs-archives.copernicus.org/articles/XLVIII-M-2-2023/557/2023/isprs-archives-XLVIII-M-2-2023-557-2023.html">This article is available from https://isprs-archives.copernicus.org/articles/XLVIII-M-2-2023/557/2023/isprs-archives-XLVIII-M-2-2023-557-2023.html</self-uri>
<self-uri xlink:href="https://isprs-archives.copernicus.org/articles/XLVIII-M-2-2023/557/2023/isprs-archives-XLVIII-M-2-2023-557-2023.pdf">The full text article is available as a PDF file from https://isprs-archives.copernicus.org/articles/XLVIII-M-2-2023/557/2023/isprs-archives-XLVIII-M-2-2023-557-2023.pdf</self-uri>
<abstract>
<p>Preserving historical archival heritage involves not only physical measures to safeguard these valuable texts but also providing for their digital preservation. However, merely &lt;i&gt;digitising&lt;/i&gt; manuscripts and codexes is not enough. A further step is needed: the &lt;i&gt;digitalisation&lt;/i&gt; of their content, i.e. the verbatim transcription of scanned texts. This process enables the accurate preservation of their textual content, making it easier to search for information and conduct further analyses. With the help of artificial intelligence, particularly Deep Neural Networks (DNNs), automatic handwriting recognition can be performed. In this study, we employed a Convolutional Recurrent Neural Network (CRNN), an established type of DNN, to determine the minimum amount of labelled data required to automatically transcribe five different historical datasets that vary in language and time period. The results show that a Character Error Rate (CER) lower than 10% can be achieved with just a few hundred labelled text lines in almost all cases.</p>
</abstract>
<counts><page-count count="6"/></counts>
</article-meta>
</front>
<body/>
<back>
</back>
</article>