Illustrating Classic Brazilian Books using a Text-To-Image Diffusion Model

Mahlow, Felipe Rodrigues Perche [UNESP]; Zanella, Andre Felipe; Castaneda, William Alberto Cruz; Sarzi-Ribeiro, Regilene Aparecida [UNESP]

doi:10.1109/TLA.2024.10789626

Illustrating Classic Brazilian Books using a Text-To-Image Diffusion Model

dc.contributor.author	Mahlow, Felipe Rodrigues Perche [UNESP]
dc.contributor.author	Zanella, Andre Felipe
dc.contributor.author	Castaneda, William Alberto Cruz
dc.contributor.author	Sarzi-Ribeiro, Regilene Aparecida [UNESP]
dc.contributor.institution	Universidade Estadual Paulista (UNESP)
dc.contributor.institution	Maringá State University
dc.contributor.institution	Technologycal Federal University of Paraná
dc.date.accessioned	2025-04-29T20:08:57Z
dc.date.issued	2024-01-01
dc.description.abstract	In recent years, Generative Artificial Intelligence (GenAI) has undergone a profound transformation in addressing intricate tasks involving diverse modalities such as textual, auditory, visual, and pictorial generation. Within this spectrum, text-to-image (TTI) models have emerged as a formidable approach to generating varied and aesthetically appealing compositions, spanning applications from artistic creation to realistic facial synthesis, and demonstrating significant advancements in computer vision, image processing, and multimodal tasks. The advent of Latent Diffusion Models (LDMs) signifies a paradigm shift in the domain of AI capabilities. This article delves into the feasibility of employing the Stable Diffusion LDM to illustrate literary works. For this exploration, seven classic Brazilian books have been selected as case studies. The objective is to ascertain the practicality of this endeavor and to evaluate the potential of Stable Diffusion in producing illustrations that augment and enrich the reader's experience. We will outline the beneficial aspects, such as the capacity to generate distinctive and contextually pertinent images, as well as the drawbacks, including any shortcomings in faithfully capturing the essence of intricate literary depictions. Through this study, we aim to provide a comprehensive assessment of the viability and efficacy of utilizing AI-generated illustrations in literary contexts, elucidating both the prospects and challenges encountered in this pioneering application of technology.	en
dc.description.affiliation	São Paulo State University, Bauru Campus
dc.description.affiliation	Maringá State University, Maringá Campus
dc.description.affiliation	Technologycal Federal University of Paraná, Guarapuava Campus
dc.description.affiliationUnesp	São Paulo State University, Bauru Campus
dc.format.extent	1000-1008
dc.identifier	http://dx.doi.org/10.1109/TLA.2024.10789626
dc.identifier.citation	IEEE Latin America Transactions, v. 22, n. 12, p. 1000-1008, 2024.
dc.identifier.doi	10.1109/TLA.2024.10789626
dc.identifier.issn	1548-0992
dc.identifier.scopus	2-s2.0-85212574298
dc.identifier.uri	https://hdl.handle.net/11449/307314
dc.language.iso	eng
dc.relation.ispartof	IEEE Latin America Transactions
dc.source	Scopus
dc.subject	diffusion models
dc.subject	illustration
dc.subject	image generation
dc.subject	text-to-image
dc.title	Illustrating Classic Brazilian Books using a Text-To-Image Diffusion Model	en
dc.type	Artigo	pt
dspace.entity.type	Publication
unesp.author.orcid	0000-0001-9816-1440[1]
unesp.author.orcid	0009-0003-5797-106X[2]
unesp.author.orcid	0000-0002-9803-1387[3]
unesp.author.orcid	0000-0001-6267-6549[4]

Coleções

Artigos

Illustrating Classic Brazilian Books using a Text-To-Image Diffusion Model

Arquivos

Coleções