Logo do repositório

Classificação de genótipos de soja quanto os seus atributos fisiológicos usando aprendizagem de máquina e diferentes sensores espectrais

Carregando...
Imagem de Miniatura

Orientador

Teodoro, Paulo Eduardo

Coorientador

Teodoro, Larissa Pereira Ribeiro

Pós-graduação

Agronomia - FEIS

Curso de graduação

Título da Revista

ISSN da Revista

Título de Volume

Editor

Universidade Estadual Paulista (Unesp)

Tipo

Tese de doutorado

Direito de acesso

Acesso abertoAcesso Aberto

Resumo

Resumo (inglês)

High‐precision phenotyping combined with machine learning algorithms enables a more efficient exploration of soybean genetic variability. By reducing the time and subjectivity of evaluations, this approach accelerates the selection of superior genotypes and enhances breeding programs. Chapter 1 presents a critical review of the literature, situating the research topic within the current state of the art and establishing the theoretical foundations that underpin this dissertation. In chapter 2, 32 soybean genotypes were classified based on physiological variables using a VIS–NIR sensor (400~825 nm). The reflectance data were grouped into 20 representative bands, and measurements included net photosynthesis, internal CO₂ concentration, stomatal conductance, and transpiration. Genotypes were clustered by k‑means into two groups, which were then used as output variables in the machine‑learning models. The algorithms were tested with continuous wavelength inputs and band averages and evaluated using the percentage of correct classifications and F‑score. Results indicated that the set of 32 genotypes split into one cluster with 20 genotypes and another with 12; cluster 2 exhibited higher mean stomatal conductance, CO₂ concentration and transpiration, whereas cluster 1 showed slightly higher photosynthesis. In classification, support vector machine (SVM) and logistic regression achieved higher accuracy when the full spectrum was used; the J48 algorithm performed best with the band averages. The superior performance of J48 with aggregated data indicates that the choice of input type influences algorithm efficiency. Chapter 3 evaluated 32 F₃ soybean populations with a hyperspectral spectroradiometer (350~2500 nm). Physiological traits were measured 60 days after emergence, and the same leaves were evaluated spectrally. The data were analyzed both as continuous spectra and as aggregated bands. Cluster 1 showed higher photosynthesis and water‑use efficiency, whereas cluster 2 displayed higher stomatal conductance and transpiration. In classification, the continuous spectrum outperformed the aggregated bands. J48 and REPTree achieved the highest accuracies and F‑scores, followed by SVM and neural networks; Random Forest and logistic regression exhibited lower performance.

Resumo (inglês)

High‐precision phenotyping combined with machine learning algorithms enables a more efficient exploration of soybean genetic variability. By reducing the time and subjectivity of evaluations, this approach accelerates the selection of superior genotypes and enhances breeding programs. Chapter 1 presents a critical review of the literature, situating the research topic within the current state of the art and establishing the theoretical foundations that underpin this dissertation. In chapter 2, 32 soybean genotypes were classified based on physiological variables using a VIS–NIR sensor (400~825 nm). The reflectance data were grouped into 20 representative bands, and measurements included net photosynthesis, internal CO₂ concentration, stomatal conductance, and transpiration. Genotypes were clustered by k‑means into two groups, which were then used as output variables in the machine‑learning models. The algorithms were tested with continuous wavelength inputs and band averages and evaluated using the percentage of correct classifications and F‑score. Results indicated that the set of 32 genotypes split into one cluster with 20 genotypes and another with 12; cluster 2 exhibited higher mean stomatal conductance, CO₂ concentration and transpiration, whereas cluster 1 showed slightly higher photosynthesis. In classification, support vector machine (SVM) and logistic regression achieved higher accuracy when the full spectrum was used; the J48 algorithm performed best with the band averages. The superior performance of J48 with aggregated data indicates that the choice of input type influences algorithm efficiency. Chapter 3 evaluated 32 F₃ soybean populations with a hyperspectral spectroradiometer (350~2500 nm). Physiological traits were measured 60 days after emergence, and the same leaves were evaluated spectrally. The data were analyzed both as continuous spectra and as aggregated bands. Cluster 1 showed higher photosynthesis and water‑use efficiency, whereas cluster 2 displayed higher stomatal conductance and transpiration. In classification, the continuous spectrum outperformed the aggregated bands. J48 and REPTree achieved the highest accuracies and F‑scores, followed by SVM and neural networks; Random Forest and logistic regression exhibited lower performance.

Descrição

Palavras-chave

Fenotipagem fisiológica, Aprendizagem de máquina, Sensores hiperespectrais, Classificação de genótipos

Idioma

Português

Citação

SANTOS, Regimar Garcia dos. Classificação de genótipos de soja quanto os seus atributos fisiológicos usando aprendizagem de máquina e diferentes sensores espectrais. 2025. Tese (Doutorado) - Universidade Estadual Paulista (UNESP), Faculdade de Engenharia, Ilha Solteira, 2025.

Itens relacionados

Unidades

Item type:Unidade,
Faculdade de Engenharia
FEIS
Campus: Ilha Solteira


Departamentos

Cursos de graduação

Programas de pós-graduação

Item type:Programa de pós-graduação,
Agronomia
Código CAPES: 33004099079P1