Logo do repositório

Illustrating Classic Brazilian Books using a Text-To-Image Diffusion Model

dc.contributor.authorMahlow, Felipe Rodrigues Perche [UNESP]
dc.contributor.authorZanella, Andre Felipe
dc.contributor.authorCastaneda, William Alberto Cruz
dc.contributor.authorSarzi-Ribeiro, Regilene Aparecida [UNESP]
dc.contributor.institutionUniversidade Estadual Paulista (UNESP)
dc.contributor.institutionMaringá State University
dc.contributor.institutionTechnologycal Federal University of Paraná
dc.date.accessioned2025-04-29T20:08:57Z
dc.date.issued2024-01-01
dc.description.abstractIn recent years, Generative Artificial Intelligence (GenAI) has undergone a profound transformation in addressing intricate tasks involving diverse modalities such as textual, auditory, visual, and pictorial generation. Within this spectrum, text-to-image (TTI) models have emerged as a formidable approach to generating varied and aesthetically appealing compositions, spanning applications from artistic creation to realistic facial synthesis, and demonstrating significant advancements in computer vision, image processing, and multimodal tasks. The advent of Latent Diffusion Models (LDMs) signifies a paradigm shift in the domain of AI capabilities. This article delves into the feasibility of employing the Stable Diffusion LDM to illustrate literary works. For this exploration, seven classic Brazilian books have been selected as case studies. The objective is to ascertain the practicality of this endeavor and to evaluate the potential of Stable Diffusion in producing illustrations that augment and enrich the reader's experience. We will outline the beneficial aspects, such as the capacity to generate distinctive and contextually pertinent images, as well as the drawbacks, including any shortcomings in faithfully capturing the essence of intricate literary depictions. Through this study, we aim to provide a comprehensive assessment of the viability and efficacy of utilizing AI-generated illustrations in literary contexts, elucidating both the prospects and challenges encountered in this pioneering application of technology.en
dc.description.affiliationSão Paulo State University, Bauru Campus
dc.description.affiliationMaringá State University, Maringá Campus
dc.description.affiliationTechnologycal Federal University of Paraná, Guarapuava Campus
dc.description.affiliationUnespSão Paulo State University, Bauru Campus
dc.format.extent1000-1008
dc.identifierhttp://dx.doi.org/10.1109/TLA.2024.10789626
dc.identifier.citationIEEE Latin America Transactions, v. 22, n. 12, p. 1000-1008, 2024.
dc.identifier.doi10.1109/TLA.2024.10789626
dc.identifier.issn1548-0992
dc.identifier.scopus2-s2.0-85212574298
dc.identifier.urihttps://hdl.handle.net/11449/307314
dc.language.isoeng
dc.relation.ispartofIEEE Latin America Transactions
dc.sourceScopus
dc.subjectdiffusion models
dc.subjectillustration
dc.subjectimage generation
dc.subjecttext-to-image
dc.titleIllustrating Classic Brazilian Books using a Text-To-Image Diffusion Modelen
dc.typeArtigopt
dspace.entity.typePublication
unesp.author.orcid0000-0001-9816-1440[1]
unesp.author.orcid0009-0003-5797-106X[2]
unesp.author.orcid0000-0002-9803-1387[3]
unesp.author.orcid0000-0001-6267-6549[4]

Arquivos

Coleções