The Potential of Visual ChatGPT for Remote Sensing

Osco, Lucas Prado; Lemos, Eduardo Lopes de; Gonçalves, Wesley Nunes; Ramos, Ana Paula Marques [UNESP]; Marcato Junior, José

doi:10.3390/rs15133232

The Potential of Visual ChatGPT for Remote Sensing

dc.contributor.author	Osco, Lucas Prado
dc.contributor.author	Lemos, Eduardo Lopes de
dc.contributor.author	Gonçalves, Wesley Nunes
dc.contributor.author	Ramos, Ana Paula Marques [UNESP]
dc.contributor.author	Marcato Junior, José
dc.contributor.institution	University of Western São Paulo (UNOESTE)
dc.contributor.institution	Universidade Federal de Mato Grosso do Sul (UFMS)
dc.contributor.institution	Universidade Estadual Paulista (UNESP)
dc.date.accessioned	2025-04-29T18:06:05Z
dc.date.issued	2023-07-01
dc.description.abstract	Recent advancements in Natural Language Processing (NLP), particularly in Large Language Models (LLMs), associated with deep learning-based computer vision techniques, have shown substantial potential for automating a variety of tasks. These are known as Visual LLMs and one notable model is Visual ChatGPT, which combines ChatGPT’s LLM capabilities with visual computation to enable effective image analysis. These models’ abilities to process images based on textual inputs can revolutionize diverse fields, and while their application in the remote sensing domain remains unexplored, it is important to acknowledge that novel implementations are to be expected. Thus, this is the first paper to examine the potential of Visual ChatGPT, a cutting-edge LLM founded on the GPT architecture, to tackle the aspects of image processing related to the remote sensing domain. Among its current capabilities, Visual ChatGPT can generate textual descriptions of images, perform canny edge and straight line detection, and conduct image segmentation. These offer valuable insights into image content and facilitate the interpretation and extraction of information. By exploring the applicability of these techniques within publicly available datasets of satellite images, we demonstrate the current model’s limitations in dealing with remote sensing images, highlighting its challenges and future prospects. Although still in early development, we believe that the combination of LLMs and visual models holds a significant potential to transform remote sensing image processing, creating accessible and practical application opportunities in the field.	en
dc.description.affiliation	Faculty of Engineering and Architecture and Urbanism University of Western São Paulo (UNOESTE), Rod. Raposo Tavares, km 572, Limoeiro
dc.description.affiliation	Faculty of Computing Federal University of Mato Grosso do Sul (UFMS), Av. Costa e Silva-Pioneiros, Cidade Universitária
dc.description.affiliation	Departament of Cartography São Paulo State University (UNESP) Centro Educacional, R. Roberto Simonsen, 305
dc.description.affiliation	Faculty of Engineering Architecture and Urbanism and Geography Federal University of Mato Grosso do Sul (UFMS), Av. Costa e Silva-Pioneiros, Cidade Universitária
dc.description.affiliationUnesp	Departament of Cartography São Paulo State University (UNESP) Centro Educacional, R. Roberto Simonsen, 305
dc.identifier	http://dx.doi.org/10.3390/rs15133232
dc.identifier.citation	Remote Sensing, v. 15, n. 13, 2023.
dc.identifier.doi	10.3390/rs15133232
dc.identifier.issn	2072-4292
dc.identifier.scopus	2-s2.0-85164886236
dc.identifier.uri	https://hdl.handle.net/11449/297261
dc.language.iso	eng
dc.relation.ispartof	Remote Sensing
dc.source	Scopus
dc.subject	artificial intelligence
dc.subject	image analysis
dc.subject	visual language model
dc.title	The Potential of Visual ChatGPT for Remote Sensing	en
dc.type	Artigo	pt
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	bbcf06b3-c5f9-4a27-ac03-b690202a3b4e
relation.isOrgUnitOfPublication.latestForDiscovery	bbcf06b3-c5f9-4a27-ac03-b690202a3b4e
unesp.author.orcid	0000-0002-0258-536X[1]
unesp.author.orcid	0009-0000-0898-4372[2]
unesp.author.orcid	0000-0002-8815-6653[3]
unesp.author.orcid	0000-0001-6633-2903[4]
unesp.author.orcid	0000-0002-9096-6866[5]
unesp.campus	Universidade Estadual Paulista (UNESP), Faculdade de Ciências e Tecnologia, Presidente Prudente	pt

Coleções

Presidente Prudente - FCT - Faculdade de Ciências e Tecnologia

The Potential of Visual ChatGPT for Remote Sensing

Arquivos

Coleções