<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title type="main" level="a">Extracting Information from Construction Safety Requirements Using Large Language Model</title>
        <author>
          <persName n="1" ref="https://orcid.org/0000-0003-0080-751X" type="ORCID">
            <forename>Si</forename>
            <surname>Tran</surname>
            <placeName type="affiliation">Chung Ang University, Korea, Republic of</placeName>
          </persName>
          <persName n="2">
            <forename>Nasrullah</forename>
            <surname>Khan</surname>
            <placeName type="affiliation">Chung Ang University, Korea, Republic of</placeName>
          </persName>
          <persName n="3">
            <forename>Emmanuel Charles</forename>
            <surname>Kimito</surname>
            <placeName type="affiliation">Chung Ang University, Korea, Republic of</placeName>
          </persName>
          <persName n="4" ref="https://orcid.org/0000-0002-7884-5316" type="ORCID">
            <forename>Akeem</forename>
            <surname>Pedro</surname>
            <placeName type="affiliation">Chung Ang University, Korea, Republic of</placeName>
          </persName>
          <persName n="5" ref="https://orcid.org/0000-0002-5217-2010" type="ORCID">
            <forename>Mehrtash</forename>
            <surname>Soltani</surname>
            <placeName type="affiliation">Chung Ang University, Korea, Republic of</placeName>
          </persName>
          <persName n="6" ref="https://orcid.org/0000-0002-6909-5189" type="ORCID">
            <forename>Rahat</forename>
            <surname>Hussain</surname>
            <placeName type="affiliation">Chung Ang University, Korea, Republic of</placeName>
          </persName>
          <persName n="7" ref="https://orcid.org/0009-0008-9739-1867" type="ORCID">
            <forename>Taehan</forename>
            <surname>Yoo</surname>
            <placeName type="affiliation">Chung Ang University, Korea, Republic of</placeName>
          </persName>
          <persName n="8" ref="https://orcid.org/0000-0003-2256-300X" type="ORCID">
            <forename>Chansik</forename>
            <surname>Park</surname>
            <placeName type="affiliation">Chung Ang University, Korea, Republic of</placeName>
          </persName>
        </author>
        <respStmt>
          <resp>This is a section of <title>CONVR 2023 - Proceedings of the 23rd International Conference on  Construction Applications of Virtual Reality </title>(DOI: <idno type="DOI">10.36253/979-12-215-0289-3</idno>) by </resp>
          <name>Pietro Capone, Vito Getuli, Farzad Pour Rahimian, Nashwan Dawood, Alessandro Bruttini, Tommaso Sorbi</name>
        </respStmt>
      </titleStmt>
      <publicationStmt>
        <publisher>Firenze University Press</publisher>
        <pubPlace>Florence</pubPlace>
        <date when="2023">2023</date>
        <idno type="DOI">https://doi.org/10.36253/10.36253/979-12-215-0289-3.76</idno>
        <availability>
          <p>Available for academic research purposes</p>
          <p>Open Access</p>
          <p>Copyright Author(s)</p>
          <licence source="text" target="https://creativecommons.org/licenses/by-nc/4.0/legalcode">
            <p>Content licence CC BY-NC 4.0</p>
          </licence>
          <licence source="metadata" target="https://creativecommons.org/publicdomain/zero/1.0/legalcode">
            <p>Metadata licence CC0 1.0</p>
          </licence>
        </availability>
      </publicationStmt>
      <sourceDesc>
        <p>This is original content, published for academic research purposes</p>
      </sourceDesc>
    </fileDesc>
    <encodingDesc>
      <appInfo>
        <application version="2.2" ident="Booksflow">
          <desc>Digital edition XML powered by Booksflow</desc>
        </application>
      </appInfo>
    </encodingDesc>
    <profileDesc>
      <abstract xml:lang="en">
        <p>The construction industry has long been recognized for its complex safety regulations, which are essential to ensure the well-being of on-site employees. However, navigating these regulations and ensuring compliance can be challenging due to the volume and complexity of the documents involved. This study proposes a novel approach to extracting information from construction safety documents utilizing Large Language Models (LLM), called CSQA, to provide real-time, precise answers to queries related to safety regulations. The approach comprises three modules: (1) the construction safety investigation module (CSI) collects safety regulations for building the information needed. By leveraging a collection of safety regulation PDFs, the system follows a process of text extraction, preprocessing, and global indexing for efficient search. (2) The safety condition identification module (SCI) retrieves the CSI database; after that, the LLM, with its extensive training, processes user queries, searches the indexed regulations, and retrieves pertinent information. (3) the safety information delivery (SID) would provide the answer to the user and incorporate a feedback mechanism to further refine system accuracy based on user responses. Preliminary evaluations reveal the system's superior performance over traditional search engines, owing to its ability to grasp query context and nuances. The CSQA presents a promising method for accessing safety regulations, with potential benefits including reduced non-compliance incidents, enhanced worker safety, and streamlined regulatory consultations in construction</p>
      </abstract>
      <textClass>
        <keywords>
          <list>
            <item>Construction safety document</item>
            <item>extraction</item>
            <item>LLM</item>
          </list>
        </keywords>
      </textClass>
    </profileDesc>
  </teiHeader>
  <text>
    <body>
      <p>It is available online at https://doi.org/10.36253/10.36253/979-12-215-0289-3.76<ref target="https://doi.org/10.36253/10.36253/979-12-215-0289-3.76" /></p>
      <div>
        <listBibl>
          <head>References</head>
          <bibl n="138927">
            <bibl>Baker, H., Hallowell, M. R., &amp;amp; Tixier, A. J. P. (2020). Automatically learning construction injury precursors from text. Automation in Construction, 118, 103145.</bibl>
            <idno type="DOI">10.1016/J.AUTCON.2020.103145</idno>
          </bibl>
          <bibl n="137094">
            <bibl>Bao, L., Tran, S. V. T., Nguyen, T. L., Pham, H. C., Lee, D., &amp;amp; Park, C. (2022). Cross-platform virtual reality for real-time construction safety training using immersive web and industry foundation classes. Automation in Construction, 143, 104565.</bibl>
            <idno type="DOI">10.1016/J.AUTCON.2022.104565</idno>
          </bibl>
          <bibl n="139308">Construction Work | Statistics Korea. (n.d.). Retrieved October 6, 2022, from http://kostat.go.kr/portal/eng/pressReleases/4/5/index.board</bibl>
          <bibl n="137456">
            <bibl>Feng, D., &amp;amp; Chen, H. (2021). A small samples training framework for deep Learning-based automatic information extraction: Case study of construction accident news reports analysis. Advanced Engineering Informatics, 47, 101256.</bibl>
            <idno type="DOI">10.1016/J.AEI.2021.101256</idno>
          </bibl>
          <bibl n="137516">
            <bibl>Jeong, H., Shin, &amp;amp; Wonsang. (2023). An Analysis on the Safety Management Level of Domestic Medium Construction Companies and Its Improvement Measures. Korean Journal of Construction Engineering and Management, 24(3), 20–30.</bibl>
            <idno type="DOI">10.6106/KJCEM.2023.24.3.020</idno>
          </bibl>
          <bibl n="136895">
            <bibl>Kang, Jeong, H., Chae, J., &amp;amp; Kang, Y. (2023). Distribution of Occupational Safety and Health Management Costs (OSHMC) by Project Size and Activity Type with the Consideration of Accident Rates. Korean Journal of Construction Engineering and Management, 24(4), 44–51.</bibl>
            <idno type="DOI">10.6106/KJCEM.2023.24.4.044</idno>
          </bibl>
          <bibl n="139627">OSHA fatality report. (n.d.). Retrieved October 6, 2022, from https://www.osha.gov/stop-falls</bibl>
          <bibl n="138443">
            <bibl>Rupasinghe, N. K. A. H., &amp;amp; Panuwatwanich, K. (2021). UNDERSTANDING CONSTRUCTION SITE SAFETY HAZARDS THROUGH OPEN DATA: TEXT MINING APPROACH. ASEAN Engineering Journal, 11(4), 160–178.</bibl>
            <idno type="DOI">10.11113/AEJ.V11.17871</idno>
          </bibl>
          <bibl n="136985">
            <bibl>Tran, S. V.-T., Lee, D., Bao, Q. L., Yoo, T., Khan, M., Jo, J., &amp;amp; Park, C. (2023). A Human Detection Approach for Intrusion in Hazardous Areas Using 4D-BIM-Based Spatial-Temporal Analysis and Computer Vision. Buildings 2023, Vol. 13, Page 2313, 13(9), 2313.</bibl>
            <idno type="DOI">10.3390/BUILDINGS13092313</idno>
          </bibl>
          <bibl n="137011">Tran, S. V., Bao, L. Q., Nguyen, L. T., &amp;amp; Pedro, A. (2022). Development of Computer Vision and BIM-cloud based Automated Status Updating for Construction Safety Monitoring. Nov 2022. cloud_based_Automated_Status_Updating_for_Construction_Safety_Monitoring</bibl>
          <bibl n="137607">
            <bibl>Tran, S. V. T., Khan, N., Lee, D., &amp;amp; Park, C. (2021). A Hazard Identification Approach of Integrating 4D BIM and Accident Case Analysis of Spatial–Temporal Exposure. Sustainability 2021, Vol. 13, Page 2211, 13(4), 2211.</bibl>
            <idno type="DOI">10.3390/SU13042211</idno>
          </bibl>
          <bibl n="137808">
            <bibl>Tran, S. V. T., Nguyen, T. L., Chi, H. L., Lee, D., &amp;amp; Park, C. (2022). Generative planning for construction safety surveillance camera installation in 4D BIM environment. Automation in Construction, 134, 104103.</bibl>
            <idno type="DOI">10.1016/J.AUTCON.2021.104103</idno>
          </bibl>
          <bibl n="137979">
            <bibl>Wu, C., Li, X., Guo, Y., Wang, J., Ren, Z., Wang, M., &amp;amp; Yang, Z. (2022). Natural language processing for smart construction: Current status and future directions. Automation in Construction, 134, 104059.</bibl>
            <idno type="DOI">10.1016/J.AUTCON.2021.104059</idno>
          </bibl>
          <bibl n="138135">
            <bibl>Zhong, B., He, W., Huang, Z., Love, P. E. D., Tang, J., &amp;amp; Luo, H. (2020). A building regulation question answering system: A deep learning methodology. Advanced Engineering Informatics, 46, 101195.</bibl>
            <idno type="DOI">10.1016/J.AEI.2020.101195</idno>
          </bibl>
        </listBibl>
      </div>
    </body>
  </text>
</TEI>