<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title type="main" level="a">Optimal Number of Cue Objects for Photo-Based Indoor Localization</title>
        <author>
          <persName n="1">
            <forename>Youngsun</forename>
            <surname>Chung</surname>
            <placeName type="affiliation">Yonsei University, Korea, Republic of</placeName>
          </persName>
          <persName n="2" ref="https://orcid.org/0000-0001-9845-6687" type="ORCID">
            <forename>Daeyoung</forename>
            <surname>Gil</surname>
            <placeName type="affiliation">Yonsei University, Korea, Republic of</placeName>
          </persName>
          <persName n="3" ref="https://orcid.org/0000-0002-3522-2733" type="ORCID">
            <forename>Ghang</forename>
            <surname>Lee</surname>
            <placeName type="affiliation">Yonsei University, Korea, Republic of</placeName>
          </persName>
        </author>
        <respStmt>
          <resp>This is a section of <title>CONVR 2023 - Proceedings of the 23rd International Conference on  Construction Applications of Virtual Reality </title>(DOI: <idno type="DOI">10.36253/979-12-215-0289-3</idno>) by </resp>
          <name>Pietro Capone, Vito Getuli, Farzad Pour Rahimian, Nashwan Dawood, Alessandro Bruttini, Tommaso Sorbi</name>
        </respStmt>
      </titleStmt>
      <publicationStmt>
        <publisher>Firenze University Press</publisher>
        <pubPlace>Florence</pubPlace>
        <date when="2023">2023</date>
        <idno type="DOI">https://doi.org/10.36253/10.36253/979-12-215-0289-3.98</idno>
        <availability>
          <p>Available for academic research purposes</p>
          <p>Open Access</p>
          <p>Copyright Author(s)</p>
          <licence source="text" target="https://creativecommons.org/licenses/by-nc/4.0/legalcode">
            <p>Content licence CC BY-NC 4.0</p>
          </licence>
          <licence source="metadata" target="https://creativecommons.org/publicdomain/zero/1.0/legalcode">
            <p>Metadata licence CC0 1.0</p>
          </licence>
        </availability>
      </publicationStmt>
      <sourceDesc>
        <p>This is original content, published for academic research purposes</p>
      </sourceDesc>
    </fileDesc>
    <encodingDesc>
      <appInfo>
        <application version="2.2" ident="Booksflow">
          <desc>Digital edition XML powered by Booksflow</desc>
        </application>
      </appInfo>
    </encodingDesc>
    <profileDesc>
      <abstract xml:lang="en">
        <p>Building information modeling (BIM) is widely used to generate indoor images for indoor localization. However, changes in camera angles and indoor conditions mean that photos are much more changeable than BIM images. This makes any attempt at localization based on the similarity between real photos and BIM images challenging. To overcome this limitation, we propose a reasoning-based approach for determining the location of a photo by detecting the cue objects in the photo and the relationships between them. The aim of this preliminary study was to determine the optimal number of cue objects required for an indoor image. If there are too few cue objects in an indoor image, it results in an excessive number of location candidates. Conversely, if there are too many cue objects, the accuracy of object detection in an image decreases. Theoretically, a larger number of cue objects would improve the reasoning process; however, too many cue objects could lead to declining object detection performance. The experimental results demonstrated that of two to five cue objects, three cue objects is most likely to yield optimal performance</p>
      </abstract>
      <textClass>
        <keywords>
          <list>
            <item>indoor location determination</item>
            <item>BIM</item>
            <item>reasoning</item>
          </list>
        </keywords>
      </textClass>
    </profileDesc>
  </teiHeader>
  <text>
    <body>
      <p>It is available online at https://doi.org/10.36253/10.36253/979-12-215-0289-3.98<ref target="https://doi.org/10.36253/10.36253/979-12-215-0289-3.98" /></p>
      <div>
        <listBibl>
          <head>References</head>
          <bibl n="137609">
            <bibl>Acharya, D., Khoshelham, K., &amp;amp; Winter, S. (2019). BIM-PoseNet: Indoor camera localisation using a 3D indoor model and deep learning from synthetic images. ISPRS Journal of Photogrammetry and Remote Sensing, 150, 245–258.</bibl>
            <idno type="DOI">10.1016/j.isprsjprs.2019.02.020</idno>
          </bibl>
          <bibl n="138422">
            <bibl>Acharya, D., Roy, S., Khoshelham, K., &amp;amp; Winter, S. (2020). A recurrent deep network for estimating the pose of real indoor images from synthetic image sequences. Sensors, 20(19), 1–20.</bibl>
            <idno type="DOI">10.3390/s20195492</idno>
          </bibl>
          <bibl n="137434">
            <bibl>Alam, M., Hossain, A. K. M., &amp;amp; Mohamed, F. (2022). Performance evaluation of recurrent neural networks applied to indoor camera localization. International Journal of Emerging Technology and Advanced Engineering, 12(8), 116–124.</bibl>
            <idno type="DOI">10.46338/ijetae0822_15</idno>
          </bibl>
          <bibl n="138498">
            <bibl>Bay, H., Tuytelaars, T., &amp;amp; Van Gool, L. (2006). SURF: Speeded up robust features. In A. Leonardis, H. Bischof, &amp;amp; A. Pinz (Eds.), Computer Vision – ECCV 2006 (pp. 404–417). Springer.</bibl>
            <idno type="DOI">10.1007/11744023_32</idno>
          </bibl>
          <bibl n="137759">
            <bibl>Guan, K., Ma, L., Tan, X., &amp;amp; Guo, S. (2016). Vision-based indoor localization approach based on SURF and landmark. 2016 International Wireless Communications and Mobile Computing Conference (IWCMC), (pp. 655–659).</bibl>
            <idno type="DOI">10.1109/IWCMC.2016.7577134</idno>
          </bibl>
          <bibl n="138627">
            <bibl>Ha, I., Kim, H., Park, S., &amp;amp; Kim, H. (2018). Image retrieval using BIM and features from a pretrained VGG network for indoor localization. Building and Environment, 140, 23–31.</bibl>
            <idno type="DOI">10.1016/j.buildenv.2018.05.026</idno>
          </bibl>
          <bibl n="138225">
            <bibl>Kang, H., Park, Y., &amp;amp; Kim, Y. (2019). Improvement model of defect information management system for apartment buildings. Korean Journal of Construction Engineering and Management, 20(4), 13–21.</bibl>
            <idno type="DOI">10.6106/KJCEM.2019.20.4.013</idno>
          </bibl>
          <bibl n="139389">
            <bibl>Kim, D., &amp;amp; Kim, J. (2023). CT-Loc: Cross-domain visual localization with a channel-wise transformer. Neural Networks, 158, 369–383.</bibl>
            <idno type="DOI">10.1016/j.neunet.2022.11.014</idno>
          </bibl>
          <bibl n="138499">Kim, J. (2022). Identifying indoor locations of close-up photos using deep learning and building information modeling objects. Yonsei University. http://www.riss.kr/link?id=T16372630</bibl>
          <bibl n="137412">
            <bibl>Kim, K.-T., Lim, M.-G., &amp;amp; Kim, G.-T. (2014). History management technology of building construction and maintenance using vector photo information and BIM. Journal of the Korea Institute of Building Construction, 14(6), 605–613.</bibl>
            <idno type="DOI">10.5345/JKIBC.2014.14.6.605</idno>
          </bibl>
          <bibl n="138138">
            <bibl>Li, Y., Kambhamettu, R. H., Hu, Y., &amp;amp; Zhang, R. (2022). ImPos: An image-based indoor positioning system. 2022 IEEE 19th Annual Consumer Communications &amp;amp; Networking Conference (CCNC), (pp. 144–150).</bibl>
            <idno type="DOI">10.1109/CCNC49033.2022.9700699</idno>
          </bibl>
          <bibl n="136783">Redmon, J., Divvala, S., Girshick, R., &amp;amp; Farhadi, A. (2016). You only look once: Unified, real-time object detection. Computer Vision and Pattern Recognition Conference (CVPR), (pp. 779-788). https://openaccess.thecvf.com/content_cvpr_2016/html/Redmon_You_Only_Look_CVPR_2016_paper.html</bibl>
          <bibl n="139538">
            <bibl>Simonyan, K., &amp;amp; Zisserman, A. (2015). Very deep convolutional networks for large-scale image recognition. arXiv.</bibl>
            <idno type="DOI">10.48550/arXiv.1409.1556</idno>
          </bibl>
        </listBibl>
      </div>
    </body>
  </text>
</TEI>