<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title type="main" level="a">Gen AI and Interior Design Representation: Applying Design Styles Using Fine-Tuned Models</title>
        <author>
          <persName n="1">
            <forename>Hyun</forename>
            <surname>Jeong</surname>
            <placeName type="affiliation">Yonsei University, Korea, Republic of</placeName>
          </persName>
          <persName n="2" ref="https://orcid.org/0000-0003-2009-0376" type="ORCID">
            <forename>Youngchae</forename>
            <surname>Kim</surname>
            <placeName type="affiliation">Yonsei University, Korea, Republic of</placeName>
          </persName>
          <persName n="3" ref="https://orcid.org/0009-0002-5362-328X" type="ORCID">
            <forename>Youngjin</forename>
            <surname>Yoo</surname>
            <placeName type="affiliation">Yonsei University, Korea, Republic of</placeName>
          </persName>
          <persName n="4" ref="https://orcid.org/0009-0004-7001-2346" type="ORCID">
            <forename>SeungHyun</forename>
            <surname>Cha</surname>
            <placeName type="affiliation">Korea Advanced Institute of Science Technology, Korea, Republic of</placeName>
          </persName>
          <persName n="5" ref="https://orcid.org/0000-0002-5179-6550" type="ORCID">
            <forename>Jin-Kook</forename>
            <surname>Lee</surname>
            <placeName type="affiliation">Yonsei University, Korea, Republic of</placeName>
          </persName>
        </author>
        <respStmt>
          <resp>This is a section of <title>CONVR 2023 - Proceedings of the 23rd International Conference on  Construction Applications of Virtual Reality </title>(DOI: <idno type="DOI">10.36253/979-12-215-0289-3</idno>) by </resp>
          <name>Pietro Capone, Vito Getuli, Farzad Pour Rahimian, Nashwan Dawood, Alessandro Bruttini, Tommaso Sorbi</name>
        </respStmt>
      </titleStmt>
      <publicationStmt>
        <publisher>Firenze University Press</publisher>
        <pubPlace>Florence</pubPlace>
        <date when="2023">2023</date>
        <idno type="DOI">https://doi.org/10.36253/10.36253/979-12-215-0289-3.95</idno>
        <availability>
          <p>Available for academic research purposes</p>
          <p>Open Access</p>
          <p>Copyright Author(s)</p>
          <licence source="text" target="https://creativecommons.org/licenses/by-nc/4.0/legalcode">
            <p>Content licence CC BY-NC 4.0</p>
          </licence>
          <licence source="metadata" target="https://creativecommons.org/publicdomain/zero/1.0/legalcode">
            <p>Metadata licence CC0 1.0</p>
          </licence>
        </availability>
      </publicationStmt>
      <sourceDesc>
        <p>This is original content, published for academic research purposes</p>
      </sourceDesc>
    </fileDesc>
    <encodingDesc>
      <appInfo>
        <application version="2.2" ident="Booksflow">
          <desc>Digital edition XML powered by Booksflow</desc>
        </application>
      </appInfo>
    </encodingDesc>
    <profileDesc>
      <abstract xml:lang="en">
        <p>This paper explores the applicability of Image-generation AI in the field of interior architectural design, with a particular focus on automating interior design representation based on design styles. Interior design representation involves a complex process that integrates visual elements with functionality and user experience. Effectively visualizing this process is essential for facilitating communication among the various stakeholders involved in the design process. However, traditional visualization methods are constrained by expert resources, costs, and time limitations. In contrast, image-generation AI has the potential to automate various design elements, including design styles, components, and spatial arrangements, to enhance representation. In this study, we evaluated the performance of a base model using various design styles and, based on the evaluation results, selected styles for fine-tuning. The methodology for fine-tuning these design styles involved the following steps: 1) data preparation and preprocessing, 2) hyperparameter optimization, and 3) model training and construction. Utilizing the fine-tuned model thus constructed, we conducted image generation demonstrations. The research results revealed that design styles not well represented by the base model were effectively captured, and high-quality images were generated by the fine-tuned model. Notably, this fine-tuned model demonstrated the ability to represent images of specific design styles with a high degree of accuracy in capturing the characteristics and keywords associated with each style, compared to the base model. This implies that through fine-tuning image-generation AI, a wide range of applications can be inferred when aiming to create customized designs by considering these aspects. In conclusion, this study explores an efficient approach to interior design representation in the field of interior architecture by employing image-generation AI and proposes a method to effectively generate visualized images by training on design style keywords. Through this approach, our study can contribute to improving the interior design process by facilitating the generation of visualized images that reflect design styles. Furthermore, the study aims to suggest the potential for applying this approach not only to the field of interior architecture but also across various domains to achieve effective visualization</p>
      </abstract>
      <textClass>
        <keywords>
          <list>
            <item>Interior Architecture Design</item>
            <item>Interior Design Representation</item>
            <item>Generative AI</item>
            <item>Model Fine-tuning</item>
          </list>
        </keywords>
      </textClass>
    </profileDesc>
  </teiHeader>
  <text>
    <body>
      <p>It is available online at https://doi.org/10.36253/10.36253/979-12-215-0289-3.95<ref target="https://doi.org/10.36253/10.36253/979-12-215-0289-3.95" /></p>
      <div>
        <listBibl>
          <head>References</head>
          <bibl n="137889">Ah-Soon, C., &amp;amp; Tombre, K. (1997). Variations on the analysis of architectural drawings. In Proceedings of the fourth international conference on document analysis and recognition (Vol. 1, pp. 347-351). IEEE.</bibl>
          <bibl n="139181">Brock, A., Donahue, J., &amp;amp; Simonyan, K. (2018). Large scale GAN training for high fidelity natural image synthesis. arXiv preprint arXiv:1809.11096.</bibl>
          <bibl n="139669">Ching, Francis DK. (2011). A visual dictionary of architecture: John Wiley &amp;amp; Sons.</bibl>
          <bibl n="138398">Chiu, M. L. (1995). Collaborative design in CAAD studios: shared ideas, resources, and representations. In Proceedings of International Conference on CAAD Future Vol. 95, (pp. 749-759).</bibl>
          <bibl n="139610">Eckert, C.; Stacey, M. Sources of inspiration: A language of design. Des. Stud. 2000, 21, 523–538.</bibl>
          <bibl n="139537">Goldschmidt, G. Creative architectural design: Reference versus precedence. J. Archit. Plan. Res. 1998, 258–270.</bibl>
          <bibl n="139205">Ho, J., Jain, A., &amp;amp; Abbeel, P. (2020). Denoising diffusion probabilistic models. Advances in neural information processing systems, 33, 6840-6851.</bibl>
          <bibl n="137632">Isola, P., Zhu, J. Y., Zhou, T., &amp;amp; Efros, A. A. (2017). Image-to-image translation with conditional adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1125-1134).</bibl>
          <bibl n="137136">Jeong, H., &amp;amp; Lee, J. K. (2023). Study on the Applicability of Image Generation AI in Interior Architecture: Generating Images Based on Interior Design Styles. Proceeding of Spring Annual Conference of KHA, Vol.35, No.1 (pp. 247-250), Jeju, Korea.</bibl>
          <bibl n="137256">Karras, T., Laine, S., Aittala, M., Hellsten, J., Lehtinen, J., &amp;amp; Aila, T. (2020). Analyzing and improving the image quality of stylegan. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 8110-8119).</bibl>
          <bibl n="139221">
            <bibl>Kim, J., &amp;amp; Lee, J.K. (2020) Stochastic Detection of Interior Design Styles Using a Deep-Learning Model for Reference Images. Appl. Sci. 10, 7299.</bibl>
            <idno type="DOI">10.3390/app10207299</idno>
          </bibl>
          <bibl n="136700">Kim, J., Song, J., &amp;amp; Lee, J. K. (2019, January). Approach to auto-recognition of design elements for the intelligent management of interior pictures. In Proceedings of the 24th International Conference on Computer-Aided Architectural Design Research in Asia: Intelligent and Informed, CAADRIA (pp. 785-794).</bibl>
          <bibl n="136822">Lee, S.H, Kim, J.S, Song, J.Y &amp;amp; Lee, J.K. (2020). Augmented Reality-Based Approach for Design On-Site Visualization - Focusing on the Example of Space Improvement Design at a University Academic Information Center. Journal of the Korean Society of Interior Design, 29(1), 97-104.</bibl>
          <bibl n="137633">
            <bibl>Lee, J.K., Lee, S., Kim, Y., &amp;amp; Kim, S. (2023). Augmented virtual reality and 360 spatial visualization for supporting user-engaged design, Journal of Computational Design and Engineering, Volume 10(3), Pages 1047–1059,</bibl>
            <idno type="DOI">10.1093/jcde/qwad035</idno>
          </bibl>
          <bibl n="137257">Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., &amp;amp; Guo, B. (2021). Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF international conference on computer vision (pp. 10012-10022).</bibl>
          <bibl n="138399">Nichol, A., Dhariwal, P., Ramesh, A., Shyam, P., Mishkin, P., McGrew, B., &amp;amp; Chen, M. (2021). Glide: Towards photorealistic image generation and editing with text-guided diffusion models.</bibl>
          <bibl n="139138">Oppenlaender, J. 2022. The creativity of text-to-image generation. In Proceedings of the 25th International Academic Mindtrek Conference. pp. 192-202.</bibl>
          <bibl n="139628">Oxman, R. (2006). Theory and design in the first digital age. Design studies, 27(3), 229-265.</bibl>
          <bibl n="138929">Ramesh, A., Dhariwal, P., Nichol, A., Chu, C., &amp;amp; Chen, M. 2022. Hierarchical text-conditional image generation with clip latents. arXiv preprint arXiv:2204.06125.</bibl>
          <bibl n="137343">Rombach, R., Blattmann, A., Lorenz, D., Esser, P., &amp;amp; Ommer, B. 2022. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. pp. 10684-10695.</bibl>
          <bibl n="137299">Saharia, C., Chan, W., Saxena, S., Li, L., Whang, J., Denton, E. L., &amp;amp; Norouzi, M. 2022. Photorealistic text-to-image diffusion models with deep language understanding. Advances in Neural Information Processing Systems, 35, 36479-36494.</bibl>
          <bibl n="139555">Song, J., Meng, C., &amp;amp; Ermon, S. (2020). Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502.</bibl>
          <bibl n="138497">Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., &amp;amp; Polosukhin, I. 2017. Attention is all you need. Advances in neural information processing systems, 30.</bibl>
        </listBibl>
      </div>
    </body>
  </text>
</TEI>