<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader>
    <fileDesc>
      <titleStmt>
        <title type="main" level="a">Generative Design Intuition from the Fine-Tuned Models of Named Architects’ Style</title>
        <author>
          <persName n="1" ref="https://orcid.org/0009-0002-5362-328X" type="ORCID">
            <forename>Youngjin</forename>
            <surname>Yoo</surname>
            <placeName type="affiliation">Yonsei University, Korea, Republic of</placeName>
          </persName>
          <persName n="2">
            <forename>Hyun</forename>
            <surname>Jeong</surname>
            <placeName type="affiliation">Yonsei University, Korea, Republic of</placeName>
          </persName>
          <persName n="3" ref="https://orcid.org/0000-0003-2009-0376" type="ORCID">
            <forename>Youngchae</forename>
            <surname>Kim</surname>
            <placeName type="affiliation">Yonsei University, Korea, Republic of</placeName>
          </persName>
          <persName n="4" ref="https://orcid.org/0009-0004-7001-2346" type="ORCID">
            <forename>SeungHyun</forename>
            <surname>Cha</surname>
            <placeName type="affiliation">Korea Advanced Institute of Science Technology, Korea, Republic of</placeName>
          </persName>
          <persName n="5" ref="https://orcid.org/0000-0002-5179-6550" type="ORCID">
            <forename>Jin-Kook</forename>
            <surname>Lee</surname>
            <placeName type="affiliation">Yonsei University, Korea, Republic of</placeName>
          </persName>
        </author>
        <respStmt>
          <resp>This is a section of <title>CONVR 2023 - Proceedings of the 23rd International Conference on  Construction Applications of Virtual Reality </title>(DOI: <idno type="DOI">10.36253/979-12-215-0289-3</idno>) by </resp>
          <name>Pietro Capone, Vito Getuli, Farzad Pour Rahimian, Nashwan Dawood, Alessandro Bruttini, Tommaso Sorbi</name>
        </respStmt>
      </titleStmt>
      <publicationStmt>
        <publisher>Firenze University Press</publisher>
        <pubPlace>Florence</pubPlace>
        <date when="2023">2023</date>
        <idno type="DOI">https://doi.org/10.36253/10.36253/979-12-215-0289-3.91</idno>
        <availability>
          <p>Available for academic research purposes</p>
          <p>Open Access</p>
          <p>Copyright Author(s)</p>
          <licence source="text" target="https://creativecommons.org/licenses/by-nc/4.0/legalcode">
            <p>Content licence CC BY-NC 4.0</p>
          </licence>
          <licence source="metadata" target="https://creativecommons.org/publicdomain/zero/1.0/legalcode">
            <p>Metadata licence CC0 1.0</p>
          </licence>
        </availability>
      </publicationStmt>
      <sourceDesc>
        <p>This is original content, published for academic research purposes</p>
      </sourceDesc>
    </fileDesc>
    <encodingDesc>
      <appInfo>
        <application version="2.2" ident="Booksflow">
          <desc>Digital edition XML powered by Booksflow</desc>
        </application>
      </appInfo>
    </encodingDesc>
    <profileDesc>
      <abstract xml:lang="en">
        <p>This paper suggests the potential application of generative artificial intelligence-based image generation technology in the field of architecture, for early phase shape planning, using the styles of renowned architects. The study employed the following approaches: 1) Intensive image generation based on the styles of 20 architects to test the AI's recognition ability and image quality. 2) Additional training was conducted for architects with low recognition rates to construct an enhanced learning model in the quality of image generation. 3) In addition to generating architectural visualization images using existing architects' design styles, alternative styles were proposed through design combinations, aiming to concretize ambiguous idea communication in the early stages of design and enhance its efficiency. The study sheds light on the future prospects of applying this generative AI model in the field of architecture</p>
      </abstract>
      <textClass>
        <keywords>
          <list>
            <item>Design Style of Architects</item>
            <item>Generative AI</item>
            <item>Image Generation</item>
            <item>Fine-tuning</item>
          </list>
        </keywords>
      </textClass>
    </profileDesc>
  </teiHeader>
  <text>
    <body>
      <p>It is available online at https://doi.org/10.36253/10.36253/979-12-215-0289-3.91<ref target="https://doi.org/10.36253/10.36253/979-12-215-0289-3.91" /></p>
      <div>
        <listBibl>
          <head>References</head>
          <bibl n="139105">
            <bibl>Abdallah, Z., Du, L., &amp;amp; Webb, G. (2017). Data Preparation. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning and Data Mining. Springer.</bibl>
            <idno type="DOI">10.1007/978-1-4899-7687-1_62</idno>
          </bibl>
          <bibl n="139447">Akin, O. (1978). How do architects design. Artificial Intelligence and Pattern Recognition in Computer Aided Design, 65-104.</bibl>
          <bibl n="138223">
            <bibl>Atilola, O., Tomko, M., &amp;amp; Linsey, J. S. (2016). The effects of representation on idea generation and design fixation: A study comparing sketches and function trees. Design studies, 42, 110-136.</bibl>
            <idno type="DOI">10.1016/j.destud.2015.10.005</idno>
          </bibl>
          <bibl n="139498">
            <bibl>Borji, A. (2023). Generated Faces in the Wild: Quantitative Comparison of Stable Diffusion, Midjourney and DALL-E 2.</bibl>
            <idno type="DOI">10.48550/arXiv.2210.00586</idno>
          </bibl>
          <bibl n="138420">Chiu, M. L. (1995). Collaborative design in CAAD studios: shared ideas, resources, and representations. In Proceedings of International Conference on CAAD Future (Vol. 95, pp. 749-759).</bibl>
          <bibl n="136655">
            <bibl>David, A., Joy, E., Kumar, S., &amp;amp; Bezaleel, S.J. (2022). Integrating Virtual Reality with 3D Modeling for Interactive Architectural Visualization and Photorealistic Simulation: A Direction for Future Smart Construction Design Using a Game Engine. Second International Conference on Image Processing and Capsule Networks, 300.</bibl>
            <idno type="DOI">10.1007/978-3-030-84760-9_17</idno>
          </bibl>
          <bibl n="139154">Eastman, C.M. (1999). Building Product Models: Computer Environments, Supporting Design and Construction. (1st ed.). CRC Press: Florida, (Chapter 1).</bibl>
          <bibl n="137378">
            <bibl>Fonseca, D., Redondo, E., Valls, F., &amp;amp; Villagrasa, S. (2017). Technological adaptation of the student to the educational density of the course. A case study: 3D architectural visualization, Computers in Human Behavior, 72, 599-611.</bibl>
            <idno type="DOI">10.1016/j.chb.2016.05.048</idno>
          </bibl>
          <bibl n="138297">Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., ... &amp;amp; Bengio, Y. (2014). Generative adversarial nets. Advances in neural information processing systems, 27.</bibl>
          <bibl n="139344">Greenberg, D. P. (1974). Computer graphics in architecture. Scientific American, 230(5), 98-107. https://www.jstor.org/stable/24950079.</bibl>
          <bibl n="138397">Hu, E. J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., ... &amp;amp; Chen, W. (2021). Lora: Low-rank adaptation of large language models. https://ar5iv.labs.arxiv.org/html/2106.09685.</bibl>
          <bibl n="137239">
            <bibl>Karras, T., Laine, S., Aittala, M., Hellsten, J., Lehtinen, J., &amp;amp; Aila, T. (2020). Analyzing and improving the image quality of stylegan. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition (pp. 8110-8119).</bibl>
            <idno type="DOI">10.1109/CVPR42600.2020.00813</idno>
          </bibl>
          <bibl n="139323">
            <bibl>Kheir Al-Kodmany. (2001). Visualization Tools and Methods for Participatory Planning and Design. Journal of Urban Technology, 8(2), 1-37.</bibl>
            <idno type="DOI">10.1080/106307301316904772</idno>
          </bibl>
          <bibl n="138983">
            <bibl>Kim, J., &amp;amp; Lee, J. K. (2020). Stochastic detection of interior design styles using a deep-learning model for reference images. Applied Sciences, 10(20), 7299.</bibl>
            <idno type="DOI">10.3390/app10207299</idno>
          </bibl>
          <bibl n="136696">
            <bibl>Kim, J., Song, J., &amp;amp; Lee, J. K. (2019, January). Approach to auto-recognition of design elements for the intelligent management of interior pictures. In Proceedings of the 24th International Conference on Computer-Aided Architectural Design Research in Asia: Intelligent and Informed, CAADRIA (pp. 785-794).</bibl>
            <idno type="DOI">10.52842/conf.caadria.2019.2.785</idno>
          </bibl>
          <bibl n="139590">
            <bibl>Koutamanis, A. (2000) Digital architectural visualization. Automation in Construction, 9(4), 347-360.</bibl>
            <idno type="DOI">10.1016/S0926-5805(99)00018-7</idno>
          </bibl>
          <bibl n="138875">
            <bibl>Oppenlaender, J. (2022, November). The Creativity of Text-to-Image Generation. In Proceedings of the 25th International Academic Mindtrek Conference (pp. 192-202).</bibl>
            <idno type="DOI">10.1145/3569219.3569352</idno>
          </bibl>
          <bibl n="139345">
            <bibl>Oppenlaender, J., Visuri, A., Paananen, V., Linder, R., &amp;amp; Silvennoinen, J. (2023). Text-to-Image Generation: Perceptions and Realities.</bibl>
            <idno type="DOI">10.48550/arXiv.2303.13530</idno>
          </bibl>
          <bibl n="139378">
            <bibl>Ramesh, A., Dhariwal, P., Nichol, A., Chu, C., &amp;amp; Chen, M. (2022). Hierarchical text-conditional image generation with clip latents.</bibl>
            <idno type="DOI">10.48550/arXiv.2204.06125</idno>
          </bibl>
          <bibl n="137283">
            <bibl>Rombach, R., Blattmann, A., Lorenz, D., Esser, P., &amp;amp; Ommer, B. (2022). High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 10684-10695).</bibl>
            <idno type="DOI">10.48550/arXiv.2112.10752</idno>
          </bibl>
          <bibl n="137178">
            <bibl>Saharia, C., Chan, W., Saxena, S., Li, L., Whang, J., Denton, E. L., ... &amp;amp; Norouzi, M. (2022). Photorealistic text-to-image diffusion models with deep language understanding. Advances in Neural Information Processing Systems, 35, 36479-36494.</bibl>
            <idno type="DOI">10.48550/arXiv.2205.11487</idno>
          </bibl>
        </listBibl>
      </div>
    </body>
  </text>
</TEI>