Domain-Specific Small Language Models for Creative Text Generation: A Case Study on the Backrooms

Dissertação de Mestrado
por Caroline Félix de Oliveira
Publicado: 09/06/2026 - 10:16
Última modificação: 09/06/2026 - 10:17

Linha de pesquisa: Inteligência Artificial

Resumo: Large Language Models (LLMs) occupy a prominent place among Machine Learning models that have been revolutionizing modern life as an indispensable support tool in the most diverse areas of human activity. Nevertheless, the popularization of large LLMs faces a significant limiting challenge: the high cost associated with their use, whether through expensive hardware, cloud processing or paid APIs. In this light, the main objective of this work is to propose an approach to generate smaller language models, which can be trained in more modest hardware and that are effective in generating texts related to specific domains. The case study chosen for that are The Backrooms, an online legend which describes an alternative reality with different levels that are described with very unique, surreal and rich descriptions. For this, a Backrooms dataset, was created and used to produce two small language models (SLMs) to generate Backrooms level descriptions: Backrooms-Llama, conceived from the fine-tuning of the open source pretrained model Llama 1B; and Tiny Backrooms, a decoder-only architecture pretrained from scratch and then fine-tuned. The performance of these two models was evaluated against the famous and much larger DeepSeek-V3. The generated level descriptions of the three models were evaluated by using chatGPT 4o-mini as judge and human judges. Additionally, a brand new evaluation method, which consists on generating pictures with the output of each competitor model and, then, evaluating these pictures alongside their descriptions with chatGPT 4o, was proposed. The results found are very promising, showing that, even though DeepSeek performed better in general, Backrooms-Llama and Tiny Backrooms, operating in a much more modest architecture, also got mainly relevant evaluations and were able to learn how to generate Backrooms level descriptions. Such results indicate that, in creative generative tasks, a lower cost domain-specific SLM, when properly optimized, can achieve performance comparable to that of consecrated LLMs.

Link para a defesa: https://teams.microsoft.com/dl/launcher/launcher.html?url=%2F_%23%2Fmeet...

Coorientador: Marcelo Zanchetta do Nascimento - Universidade Federal de Uberlândia, Centro de Ciências Exatas e Tecnologia, Faculdade de Ciências da Computação.
Banca Examinadora: 
Fabíola Souza Fernandes Pereira - Universidade Federal de Uberlândia, Centro de Ciências Exatas e Tecnologia, Faculdade de Computação.
Ronaldo Cristiano Prati - Universidade Federal do ABC, Centro de Matemática, Computação e Cognição.
Data e Horário: 
18/06/2026 - 14:30
Virtual, 2121 1B
Uberlândia, Minas Gerais, Brasil
38400-902
Campus Santa Mônica - Bloco 1B - Sala 230
Complemento: 
1B