<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title type="main" level="a">Identifying Hazards in Construction Sites Using Deep Learning-Based Multimodal with CCTV Data</title>
        <author>
          <persName n="1" ref="https://orcid.org/0000-0003-2652-821X" type="ORCID">
            <forename>Dai</forename>
            <surname>Quoc Tran</surname>
            <placeName type="affiliation">Sungkyunkwan University, Korea, Republic of</placeName>
          </persName>
          <persName n="2" ref="https://orcid.org/0000-0002-1777-5297" type="ORCID">
            <forename>Yuntae</forename>
            <surname>Jeon</surname>
            <placeName type="affiliation">Sungkyunkwan University, Korea, Republic of</placeName>
          </persName>
          <persName n="3">
            <forename>Seongwoo</forename>
            <surname>Son</surname>
            <placeName type="affiliation">Sungkyunkwan University, Korea, Republic of</placeName>
          </persName>
          <persName n="4" ref="https://orcid.org/0000-0002-5096-3310" type="ORCID">
            <forename>Minsoo</forename>
            <surname>Park</surname>
            <placeName type="affiliation">Sungkyunkwan University, Korea, Republic of</placeName>
          </persName>
          <persName n="5" ref="https://orcid.org/0000-0001-8970-0668" type="ORCID">
            <forename>Seunghee</forename>
            <surname>Park</surname>
            <placeName type="affiliation">Sungkyunkwan University, Korea, Republic of</placeName>
          </persName>
        </author>
        <respStmt>
          <resp>This is a section of <title>CONVR 2023 - Proceedings of the 23rd International Conference on  Construction Applications of Virtual Reality </title>(DOI: <idno type="DOI">10.36253/979-12-215-0289-3</idno>) by </resp>
          <name>Pietro Capone, Vito Getuli, Farzad Pour Rahimian, Nashwan Dawood, Alessandro Bruttini, Tommaso Sorbi</name>
        </respStmt>
      </titleStmt>
      <publicationStmt>
        <publisher>Firenze University Press</publisher>
        <pubPlace>Florence</pubPlace>
        <date when="2023">2023</date>
        <idno type="DOI">https://doi.org/10.36253/10.36253/979-12-215-0289-3.61</idno>
        <availability>
          <p>Available for academic research purposes</p>
          <p>Open Access</p>
          <p>Copyright Author(s)</p>
          <licence source="text" target="https://creativecommons.org/licenses/by-nc/4.0/legalcode">
            <p>Content licence CC BY-NC 4.0</p>
          </licence>
          <licence source="metadata" target="https://creativecommons.org/publicdomain/zero/1.0/legalcode">
            <p>Metadata licence CC0 1.0</p>
          </licence>
        </availability>
      </publicationStmt>
      <sourceDesc>
        <p>This is original content, published for academic research purposes</p>
      </sourceDesc>
    </fileDesc>
    <encodingDesc>
      <appInfo>
        <application version="2.2" ident="Booksflow">
          <desc>Digital edition XML powered by Booksflow</desc>
        </application>
      </appInfo>
    </encodingDesc>
    <profileDesc>
      <abstract xml:lang="en">
        <p>The use of closed-circuit television (CCTV) for safety monitoring is crucial for reducing accidents in construction sites. However, the majority of currently proposed approaches utilize single detection models without considering the context of CCTV video inputs. In this study, a multimodal detection, and depth map estimation algorithm utilizing deep learning is proposed. In addition, the point cloud of the test site is acquired using a terrestrial laser scanning scanner, and the detected object's coordinates are projected into global coordinates using a homography matrix. Consequently, the effectiveness of the proposed monitoring system is enhanced by the visualization of the entire monitored scene. In addition, to validate our proposed method, a synthetic dataset of construction site accidents is simulated with Twinmotion. These scenarios are then evaluated with the proposed method to determine its precision and speed of inference. Lastly, the actual construction site, equipped with multiple CCTV cameras, is utilized for system deployment and visualization. As a result, the proposed method demonstrated its robustness in detecting potential hazards on a construction site, as well as its real-time detection speed</p>
      </abstract>
      <textClass>
        <keywords>
          <list>
            <item>deep learning</item>
            <item>multimodal</item>
            <item>multiCCTV</item>
            <item>synthetic data</item>
            <item>pointcloud</item>
          </list>
        </keywords>
      </textClass>
    </profileDesc>
  </teiHeader>
  <text>
    <body>
      <p>It is available online at https://doi.org/10.36253/10.36253/979-12-215-0289-3.61<ref target="https://doi.org/10.36253/10.36253/979-12-215-0289-3.61" /></p>
      <div>
        <listBibl>
          <head>References</head>
          <bibl n="138922">
            <bibl>Abdelhamid, T. S., &amp;amp; Everett, J. G. (2000). Identifying Root Causes of Construction Accidents. Journal of Construction Engineering and Management, 126(1), 52–60.</bibl>
            <idno type="DOI">10.1061/(ASCE)0733-9364(2000)126:1(52)</idno>
          </bibl>
          <bibl n="137626">Jeon, Y., Tran, D. Q., Park, M., &amp;amp; Park, S. (2023). Leveraging Future Trajectory Prediction for Multi-Camera People Tracking. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 5398–5407.</bibl>
          <bibl n="138179">Li, Z., Liu, F., Yang, W., Peng, S., &amp;amp; Zhou, J. (2021). A survey of convolutional neural networks: Analysis, applications, and prospects. IEEE Transactions on Neural Networks and Learning Systems.</bibl>
          <bibl n="138308">Lyu, C., Zhang, W., Huang, H., Zhou, Y., Wang, Y., Liu, Y., Zhang, S., &amp;amp; Chen, K. (2022). Rtmdet: An empirical study of designing real-time object detectors. arXiv Preprint arXiv:2212.07784.</bibl>
          <bibl n="136974">Park, M., Tran, D. Q., Bak, J., &amp;amp; Park, S. (2022). Advanced wildfire detection using generative adversarial network-based augmented datasets and weakly supervised object localization. International Journal of Applied Earth Observation and Geoinformation, 114.</bibl>
          <bibl n="139066">
            <bibl>Park, M., Tran, D. Q., Bak, J., &amp;amp; Park, S. (2023). Small and overlapping worker detection at construction sites. Automation in Construction, 151, 104856.</bibl>
            <idno type="DOI">10.1016/j.autcon.2023.104856</idno>
          </bibl>
          <bibl n="137048">Ranftl, R., Lasinger, K., Hafner, D., Schindler, K., &amp;amp; Koltun, V. (2020). Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(3), 1623–1637.</bibl>
          <bibl n="137946">Redmon, J., Divvala, S., Girshick, R., &amp;amp; Farhadi, A. (2016). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 779–788.</bibl>
          <bibl n="138291">
            <bibl>Tran, D. Q., Park, M., Jeon, Y., Bak, J., &amp;amp; Park, S. (2022). Forest-Fire Response System Using Deep-Learning-Based Approaches With CCTV Images and Weather Data. IEEE Access, 10, 66061–66071.</bibl>
            <idno type="DOI">10.1109/ACCESS.2022.3184707</idno>
          </bibl>
          <bibl n="138517">Tran, D. Q., Park, M., Jung, D., &amp;amp; Park, S. (2020). Damage-Map Estimation Using UAV Images and Deep Learning Algorithms for Disaster Management System. Remote Sensing, 12(24), 4169.</bibl>
          <bibl n="138620">
            <bibl>Zhao, Z.-Q., Zheng, P., Xu, S.-T., &amp;amp; Wu, X. (2019). Object Detection With Deep Learning: A Review. IEEE Transactions on Neural Networks and Learning Systems, 30(11), 3212–3232.</bibl>
            <idno type="DOI">10.1109/TNNLS.2018.2876865</idno>
          </bibl>
        </listBibl>
      </div>
    </body>
  </text>
</TEI>