<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title type="main" level="a">Efficient Data Curation Using Active Learning for a Video-Based Fire Detection</title>
        <author>
          <persName n="1">
            <forename>Keyur</forename>
            <surname>Joshi</surname>
            <placeName type="affiliation">Ruhr-University Bochum, Germany</placeName>
          </persName>
          <persName n="2" ref="https://orcid.org/0000-0001-7853-1395" type="ORCID">
            <forename>Angelina</forename>
            <surname>Aziz</surname>
            <placeName type="affiliation">Ruhr-University Bochum, Germany</placeName>
          </persName>
          <persName n="3">
            <forename>Philip</forename>
            <surname>Dietrich</surname>
            <placeName type="affiliation">Bosch Sicherheitssysteme GmbH, Germany</placeName>
          </persName>
          <persName n="4" ref="https://orcid.org/0000-0002-2729-7743" type="ORCID">
            <forename>Markus</forename>
            <surname>König</surname>
            <placeName type="affiliation">Ruhr-University Bochum, Germany</placeName>
          </persName>
        </author>
        <respStmt>
          <resp>This is a section of <title>CONVR 2023 - Proceedings of the 23rd International Conference on  Construction Applications of Virtual Reality </title>(DOI: <idno type="DOI">10.36253/979-12-215-0289-3</idno>) by </resp>
          <name>Pietro Capone, Vito Getuli, Farzad Pour Rahimian, Nashwan Dawood, Alessandro Bruttini, Tommaso Sorbi</name>
        </respStmt>
      </titleStmt>
      <publicationStmt>
        <publisher>Firenze University Press</publisher>
        <pubPlace>Florence</pubPlace>
        <date when="2023">2023</date>
        <idno type="DOI">https://doi.org/10.36253/10.36253/979-12-215-0289-3.60</idno>
        <availability>
          <p>Available for academic research purposes</p>
          <p>Open Access</p>
          <p>Copyright Author(s)</p>
          <licence source="text" target="https://creativecommons.org/licenses/by-nc/4.0/legalcode">
            <p>Content licence CC BY-NC 4.0</p>
          </licence>
          <licence source="metadata" target="https://creativecommons.org/publicdomain/zero/1.0/legalcode">
            <p>Metadata licence CC0 1.0</p>
          </licence>
        </availability>
      </publicationStmt>
      <sourceDesc>
        <p>This is original content, published for academic research purposes</p>
      </sourceDesc>
    </fileDesc>
    <encodingDesc>
      <appInfo>
        <application version="2.2" ident="Booksflow">
          <desc>Digital edition XML powered by Booksflow</desc>
        </application>
      </appInfo>
    </encodingDesc>
    <profileDesc>
      <abstract xml:lang="en">
        <p>Video-based fire detection is a crucial object detection problem that relies on accurate and reliable data to detect fires. However, collecting and labeling fire-related data can be time-consuming and expensive, making it difficult to obtain sufficient data for training machine learning models. To address this challenge, uncertainty-based active learning techniques can be used to iteratively select the most informative samples for labeling. This can reduce the amount of labeled data needed to achieve high model performance and has the potential to even prune the training data with fewer informative samples. The traditional sampling-based uncertainty estimation methods are computationally expensive. Hence, an efficient prior network-based ensemble distillation State-of-the-Art approach is evaluated on an internal dataset which still requires relatively higher overhead computation making it difficult for production deployment. A biased softmax differencing-based uncertainty approach and a feature-based hard data mining approach are proposed and compared with the distillation approach. The novel approaches are found to have a very low overhead uncertainty estimation time compared to the ensemble distillation approach and traditional sampling techniques. The methods are evaluated in the context of curating the unlabeled pool data and improving the training data. For completeness, the experiments are performed on three different data sizes, and overall, the frame-wise selection strategy is proved to be better than the sequence-wise querying strategy. The Principal Component Analysis (PCA)-based hard data mining outperformed other methods and improved the model performance by 16.33% with AUC2% metric when compared with the random selection of data. The approach even outperformed the main network trained on full data by 7.33%, henceforth improving the training data by using informative 26.39% data. The results indicate that novel data mining provides efficient training and pool data curation</p>
      </abstract>
      <textClass>
        <keywords>
          <list>
            <item>Uncertainty Estimation</item>
            <item>Active Learning</item>
            <item>Object Detection</item>
            <item>Outlier Detection</item>
            <item>Feature-based cluster analysis</item>
            <item>Video-based Fire Detection</item>
          </list>
        </keywords>
      </textClass>
    </profileDesc>
  </teiHeader>
  <text>
    <body>
      <p>It is available online at https://doi.org/10.36253/10.36253/979-12-215-0289-3.60<ref target="https://doi.org/10.36253/10.36253/979-12-215-0289-3.60" /></p>
      <div>
        <listBibl>
          <head>References</head>
          <bibl n="137727">
            <bibl>Breunig, M. M., Kriegel, H.-P., Ng, R. T., &amp;amp; Sander, J. (2000). LOF: Identifying density-based local outliers. In Proceedings of the 2000 ACM SIGMOD international conference on Management of data (pp. 93-104).</bibl>
            <idno type="DOI">10.1145/342009.335388</idno>
          </bibl>
          <bibl n="136662">Campbell, R. (2018). Fire in Industrial or Manufacturing Properties. Technical report, National Fire Protection Association (NFPA). Retrieved July 24, 2023, from NFPA report website: https://www.nfpa.org/News-and-Research/Data-research-and-tools/Building-and-Life-Safety/Fires-in-US-Industrial-and-Manufacturing-Facilities</bibl>
          <bibl n="137305">
            <bibl>Choi, J., Elezi, I., Lee, H.-J., Farabet, C., &amp;amp; Alvarez, J. M. (2021). Active Learning for Deep Object Detection via Probabilistic Modeling. In&amp;#160;Proceedings of the IEEE/CVF International Conference on Computer Vision&amp;#160;(pp. 10264-10273).</bibl>
            <idno type="DOI">10.48550/arXiv.2103.16130</idno>
          </bibl>
          <bibl n="138516">
            <bibl>Deepshikha, K., Yelleni, S. H., Srijith, P. K., &amp;amp; Mohan, C. K. (2021). Monte Carlo DropBlock for Modelling Uncertainty in Object Detection. In Computing Research Repository (CoRR).</bibl>
            <idno type="DOI">10.48550/arXiv.2108.03614</idno>
          </bibl>
          <bibl n="136828">Desai, S. V., Chandra, A. L., Guo, W., Ninomiya, S., &amp;amp; Balasubramanian, V. N. (2019, October). An Adaptive Supervision Framework for Active Learning in Object Detection. In Proceedings of the British Machine Vision Conference. https://api.semanticscholar.org/CorpusID:199472921</bibl>
          <bibl n="137306">Gal, Y., &amp;amp; Ghahramani, Z. (2016). Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning. In International conference on machine learning, PMLR (pp. 1050-1059). https://proceedings.mlr.press/v48/gal16.html</bibl>
          <bibl n="138896">
            <bibl>Gal, Y., Islam, R., &amp;amp; Ghahramani, Z. (2017). Deep Bayesian Active Learning with Image Data. In International conference on machine learning, PMLR (pp.&amp;#160;1183-1192).</bibl>
            <idno type="DOI">10.48550/arXiv.1703.02910</idno>
          </bibl>
          <bibl n="137493">
            <bibl>Jolliffe, I. T., &amp;amp; Cadima, J. (2016). Principal component analysis: A review and recent developments. Philosophical transactions of the royal society A: Mathematical, Physical and Engineering Sciences,&amp;#160;374(2065), 20150202.</bibl>
            <idno type="DOI">10.1098/rsta.2015.0202</idno>
          </bibl>
          <bibl n="138897">
            <bibl>Kendall, A., &amp;amp; Gal, Y. (2017). What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? Advances in neural information processing systems,&amp;#160;30.</bibl>
            <idno type="DOI">10.5555/3295222.3295309</idno>
          </bibl>
          <bibl n="137278">Lakshminarayanan, B., Pritzel, A., &amp;amp; Blundell, C. (2017). Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles. Advances in neural information processing systems,&amp;#160;30. https://api.semanticscholar.org/CorpusID:6294674</bibl>
          <bibl n="139255">
            <bibl>Malinin, A., &amp;amp; Gales, M. (2018). Predictive Uncertainty Estimation via Prior Networks. Advances in neural information processing systems,&amp;#160;31.</bibl>
            <idno type="DOI">10.48550/arXiv.1802.10501</idno>
          </bibl>
          <bibl n="139201">
            <bibl>Malinin, A., Mlodozeniec, B., &amp;amp; Gales, M. (2019). Ensemble Distribution Distillation. In 8th International Conference on Learning Representations.</bibl>
            <idno type="DOI">10.48550/arXiv.1905.00076</idno>
          </bibl>
          <bibl n="137357">
            <bibl>Manivannan, I. (2020). A Comparative Study of Uncertainty Estimation Methods in Deep Learning Based Classification Models. Technical report, Hochschule Bonn-Rhein-Sieg University of Applied Sciences, Department of Computer Science.</bibl>
            <idno type="DOI">10.18418/978-3-96043-085-8</idno>
          </bibl>
          <bibl n="138944">
            <bibl>Nguyen, V.-L., Shaker, M. H., &amp;amp; H&amp;#252;llermeier, E. (2022). How to measure uncertainty in uncertainty sampling for active learning. Machine Learning,&amp;#160;111(1), 89-122,</bibl>
            <idno type="DOI">10.1007/s10994-021-06003-9</idno>
          </bibl>
          <bibl n="137728">
            <bibl>Schorn, C., &amp;amp; Gauerhof, L. (2020). FACER: A Universal Framework for Detecting Anomalous Operation of Deep Neural Networks. In 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (pp. 1-6).</bibl>
            <idno type="DOI">10.1109/ITSC45102.2020.9294226</idno>
          </bibl>
          <bibl n="138328">
            <bibl>Sousa, M. J., Moutinho, A., &amp;amp; Almeida, M. (2020). Thermal Infrared Sensing for Near Real-Time Data-Driven Fire Detection and Monitoring Systems. Sensors (Basel, Switzerland), 20(23), 6803.</bibl>
            <idno type="DOI">10.3390/s20236803</idno>
          </bibl>
          <bibl n="138662">
            <bibl>Streiner, D. L., &amp;amp; Cairney, J. (2007). What’s under the ROC? An introduction to receiver operating characteristics curves. The Canadian Journal of Psychiatry,&amp;#160;52(2), 121-128.</bibl>
            <idno type="DOI">10.1177/070674370705200210</idno>
          </bibl>
          <bibl n="137625">
            <bibl>Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Rabinovich, A. (2015). Going Deeper with Convolutions. In&amp;#160;Proceedings of the IEEE conference on computer vision and pattern recognition&amp;#160;(pp. 1-9).</bibl>
            <idno type="DOI">10.1109/CVPR.2015.7298594</idno>
          </bibl>
        </listBibl>
      </div>
    </body>
  </text>
</TEI>