Full Terms & Conditions of access and use can be found at https://www.tandfonline.com/action/journalInformation?journalCode=tres20 International Journal of Remote Sensing ISSN: 0143-1161 (Print) 1366-5901 (Online) Journal homepage: https://www.tandfonline.com/loi/tres20 Examining region-based methods for land cover classification using stochastic distances R. G. Negri, L. V. Dutra, S. J. S. Sant'Anna & D. Lu To cite this article: R. G. Negri, L. V. Dutra, S. J. S. Sant'Anna & D. Lu (2016) Examining region- based methods for land cover classification using stochastic distances, International Journal of Remote Sensing, 37:8, 1902-1921, DOI: 10.1080/01431161.2016.1165883 To link to this article: https://doi.org/10.1080/01431161.2016.1165883 Published online: 11 Apr 2016. Submit your article to this journal Article views: 134 View Crossmark data Citing articles: 4 View citing articles https://www.tandfonline.com/action/journalInformation?journalCode=tres20 https://www.tandfonline.com/loi/tres20 https://www.tandfonline.com/action/showCitFormats?doi=10.1080/01431161.2016.1165883 https://doi.org/10.1080/01431161.2016.1165883 https://www.tandfonline.com/action/authorSubmission?journalCode=tres20&show=instructions https://www.tandfonline.com/action/authorSubmission?journalCode=tres20&show=instructions http://crossmark.crossref.org/dialog/?doi=10.1080/01431161.2016.1165883&domain=pdf&date_stamp=2016-04-11 http://crossmark.crossref.org/dialog/?doi=10.1080/01431161.2016.1165883&domain=pdf&date_stamp=2016-04-11 https://www.tandfonline.com/doi/citedby/10.1080/01431161.2016.1165883#tabModule https://www.tandfonline.com/doi/citedby/10.1080/01431161.2016.1165883#tabModule Examining region-based methods for land cover classification using stochastic distances R. G. Negri a, L. V. Dutra b, S. J. S. Sant'Annab and D. Lu c aInstituto de Ciência e Tecnologia, UNESP – Univ. Estadual Paulista, São Paulo, Brazil; bDivisão de Processamento de Imagens, INPE – Inst. Nacional de Pesquisas Espaciais, São Paulo, Brazil; cCenter for Global Change and Earth Observations, MSU – Michigan State University, East Lansing, MI, USA ABSTRACT A recent alternative to standard pixel-based classification of remote-sensing data is region-based classification, which has proved to be particularly useful when analysing high-resolution imagery of complex environments, such as urban areas, or when addressing noisy data, such as synthetic aperture radar (SAR) images. First, following certain criteria, the imagery is decomposed into homogeneous regions, and then each region is classified into a class of interest. The usual method for region-based classification involves using stochastic distances, which measure the distances between the pixel distributions inside an unknown region and the representative distributions of each class. The class, which is at the minimum distance from the unknown region distribution, is assigned to the region and this procedure is termed stochastic minimum distance classification (SMDC). This study reports the use of methods derived from the original SMDC, Support Vector Machine (SVM), and graph theory, with the objective of identifying the most robust and accurate classification methods. The equiva- lent pixel-based versions of region-based analysed methods were included for comparison. A case study near the Tapajós National Forest, in Pará state, Brazil, was investigated using ALOS PALSAR data. This study showed that methods based on the nearest neighbour, derived from SMDC, and SVM, with a specific kernel function, are more accurate and robust than the other analysed methods for region-based classification. Furthermore, pixel-based methods are not indicated to perform the classification of images with a strong presence of noise, such as SAR images. ARTICLE HISTORY Received 17 March 2015 Accepted 7 March 2016 1. Introduction The use of region-based classification methods has been increasing, particularly with high-resolution imagery over urban areas, where pixel-based classification normally fails because of the high heterogeneity and complexity of such environments (Liu and Xia 2010; Gigandet et al. 2005; Maillard and Alencar-Silva 2013). Herholz et al. (2014) applied region-based classification to analyse medical imagery. Liu, Wang, and Gong (2014) used this classification approach with light detection and ranging (lidar) data. Region-based CONTACT R. G. Negri rogerio.negri@ict.unesp.br Instituto de Ciência e Tecnologia, UNESP – Univ. Estadual Paulista, São José dos Campos, São Paulo, Brazil INTERNATIONAL JOURNAL OF REMOTE SENSING, 2016 VOL. 37, NO. 8, 1902–1921 http://dx.doi.org/10.1080/01431161.2016.1165883 © 2016 Informa UK Limited, trading as Taylor & Francis Group http://orcid.org/0000-0002-4808-2362 http://orcid.org/0000-0002-7757-039X http://orcid.org/0000-0003-4767-5710 http://www.tandfonline.com classification is also especially useful for radar data, which are normally analysed using pixel-based methods (Freitas et al. 2008; Li et al. 2012a; Li et al. 2012b; Zhang et al. 2013). Region-based classifiers first aggregate pixels into homogeneous objects using seg- mentation techniques and then classify the objects individually (Liu and Xia 2010). Typically, classification is performed using a statistical distance between the representa- tive distribution of each class of interest and the pixel distribution inside an unknown region. As presented in Silva et al. (2011), the class at the minimum distance to the unknown region distribution is assigned to the region and this process is known as stochastic minimum distance classification (SMDC). A Gaussian assumption is used for the standard statistical distance definition (Richards and Jia 2005). Negri, Dutra, and Sant’Anna (2012a) theoretically introduced distinctive ways of using stochastic distances for region-based classification, which have been tested with simu- lated data. The first study to use the Bhattacharyya kernel function (Kondor and Jebara 2003) and apply Support Vector Machine (SVM) to region-based classification problems was presented in Negri, Dutra, and Sant’Anna (2012b). Another method that yielded good results for multispectral image classification was proposed by Camps-Valls, Tatyana, and Zhou (2007). This method is based on graph classification and its formalization allows for the use of kernel functions. For these characteristics, it is possible to use the Bhattacharyya kernel function and apply the method proposed in Camps-Valls, Tatyana, and Zhou (2007) for region-based classifica- tion, similar to that of Negri, Dutra, and Sant’Anna (2012b). The present work analyses the methods presented in Silva et al. (2011), Negri, Dutra, and Sant’Anna (2012a), Negri, Dutra, and Sant’Anna (2012b), and Camps-Valls, Tatyana, and Zhou (2007) (with the latter two methods using the Bhattacharyya kernel function) for region-based classification. The equivalent pixel-based versions of the region-based analysed methods were included for comparison. In this investigation, a practical evaluation of the methods is presented for land use and land cover (LULC) classification using ALOS PALSAR imagery in a study area near the Tapajós National Forest in the western part of Pará state, Brazil. Two classification scenarios with different classes were considered in this study to evaluate the analysed methods. 2. Theoretical background 2.1. Stochastic distances Stochastic distances were used as discrimination measures. These distances quantify the separability of two sets of information. Probability density functions are used to model the information distribution in each set. The separability of the sets is equivalent to the distance between their probability functions. The Jeffries–Matusita distance (JM) is a stochastic distance that is usually adopted in remote-sensing applications (Richards and Jia 2005): JMðC;DÞ ¼ ð x2X ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi fCðx;ΘCÞ p � ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi fDðx;ΘDÞ ph i2 dx; (1) INTERNATIONAL JOURNAL OF REMOTE SENSING 1903 where fC and fD are probability density functions, with parameters ΘC and ΘD; which model the information distribution of the sets C and D; such elements belong to X. Assuming fC and fD are Gaussian multivariate distributions, Equation (1) can be reformu- lated as (Richards and Jia 2005) JMðC; DÞ ¼ 2 1� e�BðC;DÞ � � ; (2) where Bð�; �Þ is the Bhattacharyya distance assuming the Gaussian multivariate distribu- tion, defined by BðC;DÞ ¼ 1 8 ðμC � μDÞT P C þ P D 2 � ��1 ðμC � μDÞ þ 1 2 ln jPC þ P DjffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffijPCjj P Dj p ! ; (3) where μZ and P Z are the mean vector and covariance matrix, respectively, estimated for a set Z and T; j � j, and ð�Þ�1 represent the transpose, determinant, and inverse matrix operations, respectively. 2.2. Stochastic distances and region-based classification If I is an image defined on a support S � N 2 and X is an attribute space, IðsÞ ¼ x denotes that a pixel s 2 S of I has an attribute vector x 2 X: The region-based classification process consists of associating the class ωj; j ¼ 1; . . . ; c; with a region Ri � S; i ¼ 1; . . . ; r: Ri is a set of pixels sa; a ¼ 1; . . . ;#Ri; where the attributes of sa are obtained from I(sa) and # is the cardinality operator. In this context, the support of I is partitioned into r disjoint regions by a segmentation process. Regions represent sets of spatially connected pixels whose attribute vectors meet a particular uniformity criterion. In the classification process, all pixels of the same region are assigned to a single class. For a supervised region-based method, it is necessary to construct a set of labelled regions D ¼ fðRi; ωjÞ 2 S� Ω : i ¼ 1; . . . ;m; j ¼ 1; . . . ; cg; where m is the number of training regions. The notation ðRi;ωjÞ indicates that Ri is assigned to ωj: As mentioned above, SMDC is adopted for region-based classification in Silva et al. (2011). In this case, the pixel distribution in an unlabelled region is used to estimate a probability distribution. This region is then associated with the class with the closest distribution, according to an adopted stochastic distance. The class distributions are modelled based on information from D. Formally, if we let Ri be an unlabelled region and let MðfRi ; fωjÞ be a stochastic distance between the distributions of the attribute vectors of the pixels in Ri and the class ωj; an assignment ðRi;ωjÞ is made when the following rule is satisfied: ðRi;ωjÞ , j ¼ argminMðfRi ; fωjÞ; j ¼ 1; . . . ; c: (4) In Equation (4), fωj is estimated by considering the attribute vectors of the pixels of all of the labelled regions assigned to ωj in D. Alternatives to SMDC based on simple changes in Mð�; �Þ were derived in Negri, Dutra, and Sant’Anna (2012a). The first proposed alternative, called the stochastic minimum averaged distance classifier (SMADC), uses Mmeanð�; �Þ instead of Mð�; �Þ; which is defined as 1904 R. G. NEGRI ET AL. MmeanðfRi ; fωjÞ ¼ 1 tj Xtj l¼1 MðfRi ; fω jRlÞ; (5) where fωjRl is the probability distribution that models the lth training region assigned to ωj; which contains tj training regions in D. Another alternative is the stochastic nearest neighbour classifier (SNNC), obtained by replacing Mð�; �Þ with Mminð�; �Þ; which returns the shortest distance between Ri and one of the training regions assigned to ωj: Mminð�; �Þ is defined as MminðfRi ; fωjÞ ¼ minfMðfRi ; fωjRlÞ : l ¼ 1; . . . ; tjg: (6) A third alternative is a generalization of Equation (6), which transforms Equation (4) into a stochastic version of the k-nearest neighbour (SkNN) when Mð�; �Þ is substituted by Mknnð�; �Þ; and is defined as MknnðfRi ; fωjÞ ¼ e�hjðfRi Þ; (7) where hjðfRiÞ ¼ #fðR;ωjÞ 2 VkðRiÞg; such that Vk(Ri) is the set of k training regions close to Ri given a distance Mð�; �Þ: Formally, VkðRiÞ ¼ fð�Rp;ωqÞ 2 D : 0