<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpublishing3.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="3.0" xml:lang="en">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">ISPRS-Archives</journal-id>
<journal-title-group>
<journal-title>The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences</journal-title>
<abbrev-journal-title abbrev-type="publisher">ISPRS-Archives</abbrev-journal-title>
<abbrev-journal-title abbrev-type="nlm-ta">Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci.</abbrev-journal-title>
</journal-title-group>
<issn pub-type="epub">2194-9034</issn>
<publisher><publisher-name>Copernicus Publications</publisher-name>
<publisher-loc>Göttingen, Germany</publisher-loc>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.5194/isprs-archives-XLVIII-1-W5-2025-1-2025</article-id>
<title-group>
<article-title>A novel CAD-aided coarse-to-fine framework of RGBD-to-point clouds registration</article-title>
</title-group>
<contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Ai</surname>
<given-names>Mengchi</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Elhabiby</surname>
<given-names>Mohamed</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>El-Sheimy</surname>
<given-names>Naser</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
</contrib-group><aff id="aff1">
<label>1</label>
<addr-line>Micro Engineering Tech. Inc., Calgary, AB, Canada</addr-line>
</aff>
<aff id="aff2">
<label>2</label>
<addr-line>Dept. of Geomatics Engineering, University of Calgary, 2500 University Dr NW, Calgary, AB T2N 1N4, Canada</addr-line>
</aff>
<pub-date pub-type="epub">
<day>04</day>
<month>11</month>
<year>2025</year>
</pub-date>
<volume>XLVIII-1/W5-2025</volume>
<fpage>1</fpage>
<lpage>6</lpage>
<permissions>
<copyright-statement>Copyright: &#x000a9; 2025 Mengchi Ai et al.</copyright-statement>
<copyright-year>2025</copyright-year>
<license license-type="open-access">
<license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri"  xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p>
</license>
</permissions>
<self-uri xlink:href="https://isprs-archives.copernicus.org/articles/XLVIII-1-W5-2025/1/2025/isprs-archives-XLVIII-1-W5-2025-1-2025.html">This article is available from https://isprs-archives.copernicus.org/articles/XLVIII-1-W5-2025/1/2025/isprs-archives-XLVIII-1-W5-2025-1-2025.html</self-uri>
<self-uri xlink:href="https://isprs-archives.copernicus.org/articles/XLVIII-1-W5-2025/1/2025/isprs-archives-XLVIII-1-W5-2025-1-2025.pdf">The full text article is available as a PDF file from https://isprs-archives.copernicus.org/articles/XLVIII-1-W5-2025/1/2025/isprs-archives-XLVIII-1-W5-2025-1-2025.pdf</self-uri>
<abstract>
<p>Accurate registration between RGB-D images and point clouds is a critical task for various indoor applications. Estimating the relative pose by aligning the sensor frame with indoor 3D point clouds significantly enhances environmental perception and scene understanding. Existing research primarily focuses on cross-modal feature association through traditional unsupervised methods or supervised learning-based approaches. However, these methods often rely on strong assumptions, such as the availability of an initial pose or substantial overlap between the RGB-D images and the target point clouds. Moreover, the quality of registration is highly sensitive to the density and completeness of the point clouds. To address these limitations, this paper presents a novel coarse-to-fine registration framework with the aid of CAD models. First, a data enhancement process is introduced using the Scan2CAD method to replace functional objects (e.g., chairs and tables) with CAD models, improving semantic and quality consistency. Second, a geometry-aware graph matching is computed to identify regions of interest (ROI) within the point cloud map and estimate the initial pose of the RGBD sensor. Finally, an iterative fine matching using cross-modal is introduced to refine the initial estimated pose. Experimental validation on the ScanNet dataset demonstrates that the proposed framework achieves robust and accurate registration between RGB-D images and 3D point clouds.</p>
</abstract>
<counts><page-count count="6"/></counts>
</article-meta>
</front>
<body/>
<back>
</back>
</article>