+ All Categories
Home > Documents > Assessing uncertainty in fuzzy land cover classification ... · Assessing uncertainty in fuzzy land...

Assessing uncertainty in fuzzy land cover classification ... · Assessing uncertainty in fuzzy land...

Date post: 02-Aug-2018
Category:
Upload: vukhuong
View: 216 times
Download: 0 times
Share this document with a friend
13
INTERNATIONAL JOURNAL OF GEOMATICS AND GEOSCIENCES Volume 5, No 2, 2014 © Copyright by the authors - Licensee IPA- Under Creative Commons license 3.0 Research article ISSN 0976 – 4380 Submitted on August 2014 published on November 2014 332 Assessing uncertainty in fuzzy land cover classification by confusion index Ganesh Prasad M.S 1 , Manoj K. Arora 2 1- Professor, Department of Civil Engineering, The National Institute of Engineering, Mysore, India 2- Director, PEC University of Technology, Chandigarh, India [email protected], [email protected] ABSTRACT In recent years, uncertainty has become an important subject in assessing the quality of remote sensing image classification. Classification uncertainty is due to poor class definition, transition zones and the presence of mixed pixels in remote sensing data. Fuzzy classification approaches aim to estimate the proportions of specific classes that occur within each pixel. Partial class membership values derived from fuzzy classification serve as baseline information to assess classification uncertainties and allow the depiction of spatial variation of uncertainty. Providing uncertainty information at pixel level may assist in increasing the confidence in using thematic maps produced from remote sensing image classification. Many metrics have been developed to quantify pixel-wise classification uncertainty. In the present study, two formulations of confusion index are used. Literature state that, the two forms of confusion index provide similar information. The present study aims at examining whether these two formulations provide similar information or not. Multispectral image from Landsat- 7 ETM+ sensor was subjected to fuzzy c-means classification. The derived class membership values for each pixel were used in quantifying classification uncertainty. A comparative analysis of classification uncertainty provided by two forms of confusion index was carried out. The results from the study show that the two forms of confusion index provide dissimilar information on classification uncertainty. Keywords: Remote sensing, Fuzzy classification, Quality, Uncertainty, Confusion index 1. Introduction Remote sensing from satellite based sensors provides synoptic views of the earth surface at regular time intervals, and has been considered as an attractive source of data acquisition. Transformation of observed remote sensing data in the form of spectral responses into thematic classes representing earth surface features is achieved by a number of image classification procedures. Digital image classification in supervised mode is generally used to produce land use and land cover maps from remotely sensed data. A conventional statistical classifier like maximum likelihood classifier (MLC) allocates each pixel in the image to one of the classes in which it has the highest probability of membership. The uses of such ‘hard’ or ‘crisp’ classification methods are only appropriate when the classes are discrete and mutually exclusive. However, most of the land use/land cover features are continuous rather than discrete. Classifying continuous features as discrete may result in loss of information about the spatial distribution of a thematic class. Fuzzy logic introduced by Zadeh (1965) can be used to classify and assess continuous thematic classes. The use of fuzzy classification to produce accurate and reliable land cover maps is gaining importance due to its continuous nature of class representation. Fuzzy
Transcript

INTERNATIONAL JOURNAL OF GEOMATICS AND GEOSCIENCES

Volume 5, No 2, 2014

© Copyright by the authors - Licensee IPA- Under Creative Commons license 3.0

Research article ISSN 0976 – 4380

Submitted on August 2014 published on November 2014 332

Assessing uncertainty in fuzzy land cover classification by confusion index Ganesh Prasad M.S1, Manoj K. Arora2

1- Professor, Department of Civil Engineering, The National Institute of Engineering,

Mysore, India

2- Director, PEC University of Technology, Chandigarh, India

[email protected], [email protected]

ABSTRACT

In recent years, uncertainty has become an important subject in assessing the quality of

remote sensing image classification. Classification uncertainty is due to poor class definition,

transition zones and the presence of mixed pixels in remote sensing data. Fuzzy classification

approaches aim to estimate the proportions of specific classes that occur within each pixel.

Partial class membership values derived from fuzzy classification serve as baseline

information to assess classification uncertainties and allow the depiction of spatial variation

of uncertainty. Providing uncertainty information at pixel level may assist in increasing the

confidence in using thematic maps produced from remote sensing image classification. Many

metrics have been developed to quantify pixel-wise classification uncertainty. In the present

study, two formulations of confusion index are used. Literature state that, the two forms of

confusion index provide similar information. The present study aims at examining whether

these two formulations provide similar information or not. Multispectral image from Landsat-

7 ETM+ sensor was subjected to fuzzy c-means classification. The derived class membership

values for each pixel were used in quantifying classification uncertainty. A comparative

analysis of classification uncertainty provided by two forms of confusion index was carried

out. The results from the study show that the two forms of confusion index provide dissimilar

information on classification uncertainty.

Keywords: Remote sensing, Fuzzy classification, Quality, Uncertainty, Confusion index

1. Introduction

Remote sensing from satellite based sensors provides synoptic views of the earth surface at

regular time intervals, and has been considered as an attractive source of data acquisition.

Transformation of observed remote sensing data in the form of spectral responses into

thematic classes representing earth surface features is achieved by a number of image

classification procedures. Digital image classification in supervised mode is generally used to

produce land use and land cover maps from remotely sensed data. A conventional statistical

classifier like maximum likelihood classifier (MLC) allocates each pixel in the image to one

of the classes in which it has the highest probability of membership. The uses of such ‘hard’

or ‘crisp’ classification methods are only appropriate when the classes are discrete and

mutually exclusive. However, most of the land use/land cover features are continuous rather

than discrete. Classifying continuous features as discrete may result in loss of information

about the spatial distribution of a thematic class.

Fuzzy logic introduced by Zadeh (1965) can be used to classify and assess continuous

thematic classes. The use of fuzzy classification to produce accurate and reliable land cover

maps is gaining importance due to its continuous nature of class representation. Fuzzy

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 333

classification is a quantitative, iterative method for classifying thematic classes as continuous

over geographic space. Fuzzy classification approaches aim to estimate the proportions of

specific classes that occur within each pixel.

It has been reported that fuzzy classification approaches provide a general though not optimal

solution to mixed pixel problem. The strong relationship between fuzzy membership values

and true proportions of land cover classes has been supported by Maselli et al. (1994),

Atkinson et al. (1997), Deer and Eklund (2003). Further, extraction of class proportions in a

pixel through fuzzy classification facilitates to derive information on classification

uncertainties that are beyond the possibilities of conventional error matrix approach. This has

been advantageous to users who are in need of uncertainty information for their specific

applications and to indicate the quality of classification.

In recent years, uncertainty has become an important subject in remote sensing and has

attracted attention of many researchers (Canters, 1997; Kiiveri, 1997; Heuvelink, 1998;

Foody, 2002) and is a key issue in data quality assessment. Providing uncertainty

information at pixel level may assist in increasing the confidence in using thematic maps

produced from digital classification of remote sensing data.

This paper attempts to quantify classification uncertainty associated with an image pixel

using two measures suggested by Burrough et al. (1997). A comparison of results obtained

from these two measures is also made. For the purpose, class membership values derived

from fuzzy –c means classification of Landsat ETM+ data is used. This paper is organized

into four sections. The next section briefly discusses the experimental data used. Section 3

describes fuzzy classification of experimental data and subsequent estimation of classification

uncertainty using two forms of confusion index. In Section 4, a comparison of two forms of

confusion index is made based on experimental results and hypothetical examples. Finally,

summary and conclusions are provided in Section 5.

2. Experimental data description

The remote sensing data considered for this study is a multi-spectral image from Landsat-7

ETM+ sensor, which acquires data in eight spectral bands . The size of the image considered

for this study is 327 x 221 pixels, with spatial resolution of 30 meter, covering a part of

Syracuse city area in U.S.A. Although, the sensor acquires the data in different spectral bands,

only six are considered here, since the spatial resolutions of panchromatic (band 8) and

thermal infrared band (band 6) are different from other bands. Of the 6 bands available , three

bands viz. band 2 (0.52-0.6 µm), band 3 (0.63-0.69 µm) and band 5 (1.55-1.75 µm ) were

selected for classification purpose.

Transformed divergence was used in this study to select the best set of bands from

multispectral images. Transformed divergence (TD) was computed based on the training

data statistics. The larger the TD, the greater is the separation between the training signatures

of those classes. A TD value of 2000 suggests an excellent separability; a value of 1900

provides good separation and values less than 1700 represent poor separability between

classes (Jensen, 1986). From the transformed divergence analysis conducted on all the six

bands in synthetic and remote sensing dataset, a TD value of 2000 was obtained for the

combination of bands 2, 3 and 5. Therefore, these three bands were used as input into the

classifier selected for deriving class membership values.

Figure 1 shows the false colour composite (FCC) of the image of the study area. The central

portion of the image consists of urban area, whereas, remaining portion of the image is

mostly covered by vegetation. The urban area also covers a diverse environment including

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 334

lakes, parks, residential area, vegetation and roads, thereby introducing spatial complexity

due to class mixture and overlapping, and thus may be regarded as area with high uncertainty

in class allocation. Keeping in view the diverse environmental characteristics of the study

area, five major land cover classes were selected. These classes include, trees, grass, bare

soil, water and impervious. The characteristics of these classes are given in Table 1.

In the present data set, for example, the classes, trees and grass generally tend to overlap due

to spectral similarities. Further, the class impervious which consists of buildings, roads etc.,

may sometimes include the other class viz., bare soil. This overlapping of class definition

introduces uncertainty in class allocation. Moreover, the data itself consists of large quantity

of mixed pixels (about 45%), which contribute to the aspects of classification uncertainty to a

large extent.

Table 1: Characteristics of major thematic classes in the study area

Thematic classes considered for

classification

Description

Trees Forest, variety of native plants

Grass Open space, shrubs, golf course,

play grounds, parks, lawns

Bare soil Exposed soil surfaces, patches

Water Lakes, ponds, channels

Impervious Residential area, major and minor

roads, open roof

Figure 1: False colour composite of the Landsat ETM+ sensor data

(Red: band 5, Green: band 3, Blue: band 2)

3. Fuzzy classification

Fuzzy c-means classifier (FCM) described by Bezdek et al. (1984) was adopted to classify

the study image. This algorithm has proven especially popular (Legleiter and Goodchild,

2005: Bastin, 1997: Wu and Yang, 2002: Yang et al., 2003) and has been used to produce

land cover maps from remotely sensed data (Zhang and Stuart, 2001). In most of the

situations the FCM classifier may be advantageous (Shalan et al., 2003) as it is not dependent

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 335

on the data distributional assumptions. The fuzzy membership values derived from FCM

range between 0 (no membership) and 1 (full membership) and specify the degree of

belongingness of a pixel to a specific class. Although, FCM is an unsupervised classifier

(Bezdek et al., 1984) it may be used in supervised mode (Key et al., 1989: Foody, 2000).

Fuzzy c-means algorithm is an iterative clustering algorithm which subdivides data into c-

clusters or classes. The algorithm is based on iterative optimization of an objective function

Jm, which is a generalised least squared errors function and is defined as (Bezdek, 1981),

( ) ∑ −∑==

=c

k A

mN

jm vxJ kjjkXVU

1

2

1

)(:, µ (1)

where U is a fuzzy c partition of dataset X containing N pixels.

xj = vector denoting spectral response of a pixel j

c = fixed and known number of clusters in X ;

m = weighting exponent or fuzzy exponent; 1 ≤ m ≤ ∞ which defines the degree of fuzziness

(µ jk)m = the mth power of xj s membership in cluster k.

V is the collection of cluster centres vk, ║xj–vk║2A is the squared distance djk between xj and vk

with )()(2

2vxvxvxd kj

T

Ajk Akjkj −== −−

where A = positive definite weight matrix

Given the values of c and m, the algorithm iteratively assigns pixels to clusters and

recalculates cluster centres until Jm achieves a local minimum. The fuzzy partition U =

[ µ jk ] is referred to as the grade of membership of xj to the cluster k with µ jk ∈ [0, 1] .

The formulation of FCM contains a weighting factor m, which describes the degree of

fuzziness to be provided in the classification. The value of m varies from 1 (no fuzziness or

hard classification) to ∞ (complete fuzziness), however a value in the range of 1.5 to 3 may

be generally adopted (Shalan et al., 2003). Through several experiments, a value of 2.0 was

found to be suitable for the classification of this dataset. The number of training samples were

201, 172, 31, 204 and 296 for the classes trees, grass, bare soil, water and impervious

respectively. In this study, FCM classification in supervised mode using Euclidean distance

metric has been performed on PARBAT (www.lucieer.net, 2014), a public domain software

for advanced remote sensing image processing and visual fuzzy classification.

The results of the classification were five fraction images (Figure 2 (a) to (e)) corresponding

to five land cover classes considered in the experiment. In Figure 2(a) to (e), bright pixels

indicate higher class membership values. And, darker pixels correspond to lower values of

class membership. When each pixel is assigned to the class to which it has the highest

membership, the result is a ‘defuzzified’ or ‘hardened’ image equivalent to a classified image.

The defuzzified image is shown in Figure 2 (f). As expected the two classes trees and grass

overlapped in the outer portion of the image (See Figure 2), where as the class water is

correctly classified. However, the class impervious tends to get mixed up with other classes

viz. trees and grass. A small amount of overlap is also observed between the classes bare soil

and impervious.

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 336

(a) Trees (b) Grass

(b) Bare soil (d) Water

(e) Impervious (f) Defuzzified image

Figure 2: Class membership images (fraction images) derived from FCM classification

(fuzzy exponent m =2) and defuzzified image showing thematic classes.

3.1 Classification Uncertainty

The main idea of fuzzy classification approach is to associate a pixel with every class

considered in the classification scheme, with variable degree of class memberships. Partial

class membership values derived from fuzzy classification can serve as baseline information

to assess classification uncertainties and allow the depiction of spatial variation of

uncertainty. To elaborate this, if we consider the output of a fuzzy classifier for a given pixel,

the output is a finite set of class membership values. Larger is the set of membership values

for a pixel, lesser is the precision of information available and higher is the ambiguity.

Further, for some regions class membership values may be equal or nearly equal in a pixel.

The more similar the membership values for two or more classes, the greater the confusion

about to which class a pixel actually belongs. In such cases too, it is not clear that which class

dominates and may lead to a state of ambiguity. In such a situation, the ambiguity in class

assignment can be represented by measures such as entropy (Maselli et al., 1994), Quadratic

score (Glasziou and Hilden, 1989: Fatemi et al., 2004 ), non- specificity, U-uncertainty

(Dubois and Prade, 1987: Ricotta, 2005) and confusion index (Burrough et al., 1997). In the

present study, a measure of ambiguity viz. confusion index in difference and ratio forms

suggested by Burrough et al., (1997) is considered for the quantification of classification

uncertainty.

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 337

Let µjk

be the class membership value of jth pixel in kth class, with k = 1,2,…. n classes,

and 10 ≤≤ µjk

with the normalization constraint of 11

=∑=

n

kjk

µ ,which is generally imposed in

fuzzy classification. Further, let { }n

jjjjC µµµµµ .,.........,,)(321

= be an ordered (in descending

order) fuzzy set having class membership values of a pixel j in which the numbers 1,2…n

indicate first highest membership, second highest membership ( µn

j will obviously be

minimum).

Consider the first and second highest class membership values µ1

j, µ

2

j for a pixel j. The

difference between these two is an indirect representation of conflict in assigning the pixel to

one of the classes. For example, if the difference is large, it is an indication of lesser conflict.

On the other hand, a difference of zero or near zero indicates high degree of conflict. Let the

difference between the first and second highest membership values be D, given by

D = ( µ1

j- µ

2

j ) (2)

Normalizing the difference with the maximum membership value and deducting it from the

total membership of one, yields a measure of confusion as,

Measure of confusion = µ

µµ

max

21)(

1jj

−− (3)

where µmax is the maximum class membership value for the pixel j. The denominator in the

second term, i.e., µmax can take two values. A value of 1, which is the limiting value of

membership or the maximum class membership value as derived from classification process.

When the second term in Eq. (3) is normalized with respect to µmax

=1, the measure of

confusion will be equal to 1 - ( µ1

j- µ

2

j ). When it is normalized with respect to maximum

membership value for the pixel, i.e., µ1

j, the measure of confusion will be equal to

µ

µ1

2

j

j.

Burrough et al., (1997) provided these two formulations (Eq. (4) and (5)) and termed both as

confusion index (CI).

µ

µ1

2

j

jCI = (4)

)(121

µµ jjCI −−= (5)

CI can take values between 0 and 1. If CI → 0, then one class dominates and there is little

confusion in allocating pixel to that class and the degree of uncertainty is low. If CI → 1, then

both µ1

j and µ

2

j are equal or near equal and there is maximum confusion about the class to

which the pixel belongs and the degree of uncertainty is high. Ideally, in a land use/land

cover map derived from FCM classification, the zones represented by those pixels with CI

values equal to 1 or nearly 1 would indicate the geographic boundaries between fuzzy classes.

Burrough et al., (1997), while providing the two formulations of CI, stated that these two

produce similar results. Legleiter and Goodchild (2005) termed these two formulations (Eq. 4

and Eq. 5) as ratio (CIratio) and difference (CIdifference) forms of confusion index. and utilized

them to derive maps of classification uncertainty to identify heterogeneous and complex areas.

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 338

However, it is not known whether these two formulations produced similar maps of

classification uncertainty.

Anomalies can be observed in depicting the quality of classification based on uncertainty

estimated using two forms of confusion index. For instance, consider one of the cases where

the highest membership value µ1

j = 0.4 and the second highest membership value µ

2

j = 0.15

for a pixel. The value of uncertainty estimated for this case by the ratio form of confusion

index is 0.375, while it is equal to 0.75 by the difference form of confusion index. If the ratio

form of confusion index is used to represent classification uncertainty, the value of 0.375 for

this pixel can generally be considered as low. Correspondingly, if one wants to express the

quality of classification based on uncertainty, a value of 0.375 does not draw much attention.

On the other hand, if the difference form of confusion index is used for the same pixel,

uncertainty value of 0.75, which is twice that of the value provided by CIratio, is considered as

a high value and hence the thematic quality of the pixel may be treated as significantly

ambiguous. Due to these anomalies between the two forms of confusion index, quality

assessment process itself may be dubious. Therefore, this study aims at comparing the

results obtained from the two formulations of confusion index. Such a comparative

evaluation of two forms of confusion indices may help in understanding their behaviour and

relative efficacies.

To quantify classification uncertainty, the derived fuzzy membership values have been

arranged in descending order (i.e., images showing the highest membership values, second

highest membership values and third highest membership values etc.) and have been used as

input to computational models. Based on the formulations for confusion index CI (Eq. 4 and

Eq. 5), classification uncertainty maps were generated for the study area. Figures 3 and 4

show spatial pattern of classification uncertainty for the experimental image using two

formulations of CI. In these uncertainty images, dark pixels indicate the regions classified

with no or minimum uncertainty, whereas brighter pixels indicate uncertain classification or

the zones of confusion. A comparison of Figures 3 and 4 indicate, ‘uncertain’ pixels

appearing more brightly in Figure 4 than those in Figure 3. This is apparent in the image

statistics given in Table 2, with average CIdifference equal to 0.587 which is greater than

average CIratio value of 0.396.

Figure 3: Spatial distribution of classification uncertainty represented by ratio form of

confusion index (CIratio). Dark pixels: low classification uncertainty (low values for CIratio),

Bright pixels: High classification uncertainty (high values for CIratio)

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 339

Figure 4: Spatial distribution of classification uncertainty represented by difference form of

confusion index (CIdifference). Dark pixels: low classification uncertainty (low values for

CIdifference), Bright pixels: High classification uncertainty (high values for CIdifference)

Figure 5: (a) Histogram of CIratio image (b) Histogram of CIdifference image

Table 2: Assessing classification uncertainty: comparison between statistics of uncertainty

maps

Min Max Mean(µ) Std.

Deviation(σ)

CIratio 0.0 1.0 0.396 0.291

CIdifference 0.0 1.0 0.587 0.300

4. Comparison of two forms of confusion index- Methods adopted

The global statistics of CIratio and CIdifference indicated in Table 2 do not provide an insight

into their characteristics, alternative ways such as visual comparison of histograms, spatial

autocorrelation measure and correlation coefficients from regression were used for the

comparative evaluation of uncertainty measures. The visual comparison of histograms

(Figure 5 a and b) indicates some differences between two forms of confusion index. In the

histogram of CIratio image, the distribution of values of confusion index across the image is

almost uniform except that higher values of uncertainty can be observed for less number of

pixels. Whereas, in the histogram of CIdifference image, higher values of uncertainty can be

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 340

observed for sufficiently large quantity of pixels. This is augmented by the global statistics of

CIratio and CIdifference images (Table 2). Although, the range of these two metrics is the same

i.e., 0 to 1, the mean value of CIratio is 0.396 and it is equal to 0.587 for CIdifference image.

This disparity is pointing towards lower values of CIratio for pixels in ambiguous zones and

higher values of CIdifference for the same pixels. Higher values for CIdifference may be due to the

fact that CIdifference is the result of aggregation of CIratio and another measure of class

membership saturation viz. Exaggeration uncertainty (Zhu, 1997) using a fuzzy operator (

Ganesh Prasad and Arora, 2008).

A linear regression was carried out to understand the relation between two forms of confusion

index. The correlation coefficient between CIratio and CIdifference was found to be 0.81

indicating the absence of perfect correlation between them. Since, this regression was carried

out on pixel by pixel basis for two values given by CIratio and CIdifference, a comparison based

on the spatial dependency of uncertainty could not be achieved. Therefore, spatial

autocorrelation analysis was carried out to explore the spatial dependency between pixels

based on the values of uncertainty.

4.1 Comparison based on spatial autocorrelation

Spatial autocorrelation refers to the effect that a certain characteristic at a given location has

on that same characteristic at neighbouring locations. In other words, given a group of

mutually exclusive units in a two dimensional plane, if the presence, absence or degree of a

certain characteristic affects the presence, absence or degree of the same characteristic in

neighbouring units, then the phenomenon is said to exhibit spatial autocorrelation (Cliff and

Ord, 1973). Spatial autocorrelation analysis examines whether the observed value of a

variable at a location is independent of the values of the variable at neighbouring locations. If

dependence exists, the variable is said to exhibit spatial autocorrelation. This spatial

dependency can be measured using an indicator called Moran’s I statistic, which is based on

cross products to measure value association and is calculated for N observations on a variable

x at locations i, j as (Cliff and Ord, 1973),

∑∑

=N

i

N

i

N

jjiij

z

zzw

iW

N

dI2

)( (6)

where, wij (elements of spatial weight matrix) is the weight at a distance d such that wij =1 if

location j is within distance d from location i ; otherwise wij =0; z’s are deviations (i.e., zi =

xi – xmean , zj = xj – xmean for variable x ) and W is the sum of all weights where i ≠ j. Moran’s

I varies from +1.0 for a perfect positive correlation (e.g., for a clumped pattern) to -1.0 for a

perfect negative correlation (e.g., for a checkerboard pattern). Positive value for Moran’s I

indicates that similar values tend to cluster and negative value indicates that dissimilar values

have a high tendency to cluster in a given space. Moran’s I value close to zero indicates that

the values are arranged randomly and independently in space, without any autocorrelation.

Considering each pixel as a mutually exclusive unit having its location in the image space

and the magnitude of classification uncertainty as the variable, spatial autocorrelation

analysis has been carried out for CIratio and CIdifference images. The values of Moran’s I for

CIratio and CIdifference images were 0.604 and 0.73 respectively. These positive values indicate

that the values of classification uncertainty are spatially correlated, though the correlation is

not perfect. However, dissimilar values of 0.604 and 0.73 suggest that the spatial variation of

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 341

CIratio is different than CIdifference . The results from visual comparison of uncertainty maps,

image histograms and correlation analysis have pointed towards a clear distinction between

the information on uncertainty conveyed by the two forms of confusion index. Although,

Burrough et al., (1997) stated that two forms of confusion index produce similar results, it

does not appear so as indicated here.

4.2 Comparison based on zonation

From the hypothetical example discussed in 3.1, it is evident that under certain situations

there occurs a considerable difference between two forms of confusion index. Therefore, in

order to verify whether the anomaly between two forms of confusion index is significant for a

dataset, a simple approach has been adopted here. In this approach, the pixels have been

grouped into four qualitative zones. This has been done using two criteria.

Criterion I: Four qualitative zones corresponding to low, medium, high and very high values

of confusion index have been considered. These four zones correspond to the values in the

range (0.0 - 0.25), (0.25 to 0.50), (0.50 to 0.75) and (0.75 to 1.0). This zonation was done

arbitrarily. The percentage of pixels in each zone is represented in Figure 6. In medium and

high zones of uncertainty the variation between CIratio and CIdifference is 8.67% and 6.52%

respectively. In the zone of low uncertainty CIratio has 20.17% pixels more than CIdifference ,

where as CIdifference has 22.31% more pixels than CIratio in the zone of very high uncertainty.

Criterion II: In this approach, zonation is made based on image statistics of CIratio and

CIdifference i.e., mean (µ) and standard deviation (σ). Percentage of pixels with low

uncertainty (≤(µ - σ)), medium ((µ - σ) to µ), high (µ to (µ + σ)) and very high uncertainty (>

(µ + σ)) are represented in Figure 7.

In the zone of low uncertainty the variation between CIratio and CIdifference is less and is equal

to 0.13% , whereas in the zone of high uncertainty the variation is 1.78% only. In the zone of

medium uncertainty CIratio has 10.99% pixels more than CIdifference , where as CIdifference has

12.9% more pixels than CIratio in the zone of very high uncertainty.

Figure 6: Comparison of CIratio and CIdifference based on number pixels in each of the

four bins of equal size

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 342

Figure 7: Comparison of CIratio and CIdifference based on number pixels in each of the

four bins of unequal size

Therefore, from the results obtained by the comparison of percentage of pixels in different

zones (low, medium, high and very high) based on the values of CIratio and CIdifference (criteria

I and II), it is evident that the two forms of confusion index behave differently for a given

classification.

From this analysis, it is observed that the difference exists in the depiction of quality of a

pixel using ratio and difference forms of confusion index both qualitatively and spatially.

Therefore, the information on classification uncertainty in the form of confusion as expressed

by the two forms of confusion index is considered as different and the two forms of confusion

index does not provide similar results as stated by Burrough et al.,(1997).

5. Summary and conclusions

Uncertainty plays a vital role in land use land cover classification of remotely sensed data.

When the remotely sensed data to be classified consists of abundant mixed pixels and /or

overlapping classes for example, different types of vegetation, classification of such an image

results in high degree of uncertainty. Fuzzy classification of land use/land cover provides a

framework that allows partial memberships in multiple classes and gradual transition between

fuzzy classes, which can be considered as an advantage over conventional crisp classification.

Class membership values derived from FCM classification are used to quantify pixel wise

classification uncertainty, which further be used to represent spatial pattern of classification

uncertainty. Uncertainty maps thus generated are useful in identifying and quantifying areas

of high/low classification reliability. Burrough’s confusion index (CI) used in this study can

be appropriate in knowing to what extent the spatial distribution of thematic classes derived

from fuzzy classification is ambiguous. The objective of this paper is to compare

classification uncertainty maps derived from two formulations of confusion index. Although,

the two forms of confusion index are intended to provide similar information, certain

situations may exist when these two exhibit considerable difference. A comparative analysis

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 343

of classification uncertainty provided by two forms of confusion index is carried out. The

result from the study shows that the two forms of confusion index provide dissimilar

information on classification uncertainty both quantitatively and spatially. It is difficult to

suggest any one form of confusion index measure as appropriate or ideal measure of

classification uncertainty. However, it is concluded here that the selection of an appropriate

measure of classification uncertainty may be made by considering the factors viz., quantity of

mixed pixels present in the image to be classified, the type of uncertainty to be represented

and the theoretical framework of uncertainty measure.

6. References

1. Atkinson, P.M., Cutler, M.E.J., and Lewis, H., 1997, Mapping sub-pixel proportional

land cover with AVHRR imagery, International Journal of Remote Sensing, 18, pp

917-935.

2. A-Xing Zhu, 1997, Measuring uncertainty in class assignment for natural resource

maps under fuzzy logic, Photogrammetric Engineering and Remote Sensing, 63(10),

pp 1195-1202.

3. Bastin, L., 1997, Comparison of fuzzy c-means classification, linear mixture

modelling and MLC probabilities as tools for unmixing coarse pixels, International

Journal of Remote Sensing, 18(17), pp 3629-3648.

4. Bezdek, J.C., Ehrlich, R. and Full, W., 1984, FCM: The fuzzy c-means clustering

algorithm, Computers and Geosciences, 10, pp 191-203.

5. Burrough, P.A., Van Gaans, P.F.M., and Hootsmans, R., 1997, Continuous

classification in soil survey: spatial correlation, confusion and boundaries, Geoderma,

77, pp 115-135.

6. Canters F., 1997, Evaluating the uncertainty of area estimates derived from fuzzy land

cover classification, Photogrammetric Engineering and Remote Sensing, 63(4), pp

403-414.

7. Cliff, A.D., Ord, J.K., (1973), Spatial Autocorrelation (London: Pion Limited)

8. Deer, P.J., Eklund,P., 2003, A study of parameter values for a Mahalanobis distance

fuzzy classifier, Fuzzy Sets and Systems, 137, pp 191-213.

9. Dubois, D. and Prade, H., 1987, Properties of measures of information in evidence

and possibility theories, Fuzzy Sets and Systems, 24, pp 161-182.

10. Fatemi, S.B., Mojaradi B. and Varshosaz., 2004, "Introducing an accuracy indicator

based on uncertainty related measure", XXth ISPRS Congress, Istanbul, Turkey,

Commission 4, pp 974-979.

11. Foody, G.M., 2000, Estimation of sub-pixel land cover composition in the presence of

untrained classes, Computers and Geosciences, 26, pp 469-478.

12. Foody, G.M., 2002, Status of land cover classification accuracy assessment, Remote

Sensing of Environment, 80, pp 185-201.

Assessing uncertainty in fuzzy land cover classification by confusion index

Ganesh Prasad M.S and Manoj K. Arora

International Journal of Geomatics and Geosciences

Volume 5 Issue 2, 2014 344

13. Glasziou, P. and Hilden, J., 1989, Test selection measures, Medical Decision

Making, 9, pp 133-141.

14. Heuvelink, G.B.M., (1998), Error Propagation in Environmental Modelling with GIS.

Taylor and Francis, London.

15. Jensen, J.R., (1986). Introductory Digital Image processing, Prentice-Hall, NJ.

16. Key, J.R., Maslanik, J.A., Barry, R.G., 1989, Cloud classification from satellite data

using a fuzzy sets algorithm: a polar example, International Journal of Remote

Sensing, 10(12), pp 1823-1842.

17. Kiiveri H.T., 1997, Assessing, representing and transmitting positional uncertainty in

maps, International Journal of Geographic Information Science, 11(1), pp 33-52.

18. Legleiter, C.J., Goodchild, M.F., 2005, Alternative representations of in-stream

habitat: classification using remote sensing, hydraulic modeling, and fuzzy logic,

International Journal of Geographical Information Science, 19(1), pp 29-50.

19. Lucieer, A., (2004), Parbat-a software tool for segmentation and explorative

visualization of remotely sensed imagery. URL: http://parbat.lucieer.net

20. Mark Gahegan, Manfred Ehlers, 2000, A frame work for the modelling of uncertainty

between remote sensing and geographic information systems, ISPRS Journal of

Photogrammetry and Remote Sensing, 55, pp 176-188.

21. Maselli, F., Conese, C., And Petkov, L., 1994. Use of probability entropy for the

estimation and graphical representation of the accuracy of maximum likelihood

classification, ISPRS Journal of Photogrammetry and Remote Sensing, 49, pp 13-20.

22. Ricotta, C., 2005, On possible measures for evaluating the degree of uncertainty of

fuzzy thematic maps, International Journal of Remote Sensing, 26, pp 5573-5583.

23. Shalan, M. A., Arora, M. K., and Ghosh, S. K., 2003, An evaluation of fuzzy

classifications from IRS 1C LISS III imagery : a case study, International Journal of

Remote Sensing, 24, pp 3179-3186.

24. Wu, K.L., Yang, M.S., 2002, Alternative c-means clustering algorithms, Pattern

Recognition, 35, pp 2267-2278.

25. Yang, M.S., Hwang, P.Y., Chen, D.H., 2003, Fuzzy clustering algorithms for mixed

feature variables, Fuzzy Sets and Systems, 141, pp 301-317.

26. Zadeh, L.A., 1965, Fuzzy sets, Information Control, 8, pp 338-353.

27. Zhang, J., Stuart, N., 2001, Fuzzy methods for categorical mapping with image based

land cover data, International Journal of Geographic Information Science, 15(2), pp

175-195.


Recommended