+ All Categories
Home > Documents > Elitist Chemical Reaction Optimization for Contour-Based...

Elitist Chemical Reaction Optimization for Contour-Based...

Date post: 06-Aug-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
15
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 53, NO. 5, MAY 2015 2845 Elitist Chemical Reaction Optimization for Contour-Based Target Recognition in Aerial Images Haibin Duan, Senior Member, IEEE, and Lu Gan Abstract—Target recognition for aerial images is an important research issue in remote sensing applications. Many feature-based recognition methods have been introduced for target recognition. Nevertheless, these methods have their limitations when consid- ering the large amount of data provided by satellite imagery. In this paper, we explore several techniques for target recognition in aerial images with a contour matching approach. Contours in our approach are detected by a contour grouping strategy and described by edge potential function, which provides an attraction field for edges with similar curves. In this sense, target recognition can be formulated as an optimization problem. An improved chemical reaction optimization (CRO) algorithm is proposed in this paper to deal with the target matching problem. Experi- mental results demonstrate the robustness and high efficiency of our approach over the state-of-the-art evolutionary algorithms, which include the original CRO, predator–prey biogeography- based optimization, an improved version of brain storm opti- mization, artificial bee colony, quantum-behaved particle swarm optimization, a self-adaptive differential evolution algorithm, and stud genetic algorithm. In addition, several case studies regarding remote sensing are also presented. The results show that the proposed method is capable of improving the application ability of recognizing target in aerial images. Index Terms—Aerial image, edge potential function (EPF), elitist chemical reaction optimization (ECRO), target recognition. I. I NTRODUCTION A UTOMATIC target recognition (ATR) systems play an important role in multiple applications, including intelli- gence, surveillance, and reconnaissance [1]–[3]. A typical ATR process is composed of three stages, i.e., detection, discrimi- nation, and recognition, and applies various computer vision techniques such as image segmentation, feature extraction, and template matching [4], [5]. Target recognition is a vital post- processing for the region of interest extracted in the detection and discrimination steps, to distinguish a target from clutter and nontargets [6]. Since target recognition is fundamental to the Manuscript received June 2, 2014; revised July 31, 2014, September 12, 2014, and September 26, 2014; accepted October 26, 2014. This work was supported in part by the National Natural Science Foundation of China under Grants 61425008, 61333004, and 61273054, by the Aeronautical Foundation of China under Grant 20135851042, by the Top-Notch Young Talents Program of China, and by the Graduate Innovation Foundation for Beihang University under Grant YCSJ-01-2014-01. The authors are with the State Key Laboratory of Virtual Reality Technol- ogy and Systems, School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, China (e-mail: [email protected]). Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TGRS.2014.2365749 whole ATR process, it has been gaining increasing importance in recent years. In the remote sensing literature, many schemes have been established to tackle the target recognition problem and can be classified into two main categories: feature-based strategy [7]–[14] and template-based strategy [15]–[26]. Feature-based schemes often address a target recognition problem by object or scene classification. The crucial part of these schemes lies in the extraction of features. In the last few years, a wide variety of vi- sual features have been proposed, from low-level features (e.g., raw pixel values, filter-bank responses, and local feature de- scriptors [7]–[11]) to high-level cues (e.g., context and semantic information [12]–[14]). Shackelford and Davis [7] combined both pixel- and object-based features for object classification in their fuzzy model. Porway et al. [12] came up with a hierarchical and contextual model that is learned in a statistical framework for parsing aerial images. More recently, Cheriyadat [8] incorporated sparse coding into bag-of-visual-words model to encode low-level features for aerial scene classification. This unsupervised learning method achieves the highest accuracies with the UCMERCED data set [9]. In spite of the fact that these methods described above have demonstrated their effectiveness in target recognition, they are not without limitations: 1) Their architectures are generally complex and ambiguous; 2) a great amount of data are needed in the training stage; and 3) they are computationally expensive and time consuming. Compared with feature-based schemes, template-based schemes are much more simple and efficient. The key insight of these schemes is similarity measurement. A matching pro- cedure of some image features (e.g., scale-invariant feature transform [15]–[17], edges [18], [19], contours, or shapes [20]– [24]) is generally adopted to seek out the occurrence of a tar- get template or a reference image [18]. The invariance of the contour representation to brightness change, color, texture, scale, and rotation can significantly reduce the number of train- ing samples. Therefore, contour matching has become a very popular method for target recognition. In a typical contour matching problem, the contour of an object is generally detected, segmented, and then involved in a procedure to be matched to a template contour [22]. Thus, a good contour detection algorithm is essential for con- tour matching. In [23], Kennedy et al. proposed a contour grouping approach called contour cut for object detection, in which salient contours in an image are identified by sol- ving a Hermitian eigenvalue problem. This approach makes an 0196-2892 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
Transcript
Page 1: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 53, NO. 5, MAY 2015 2845

Elitist Chemical Reaction Optimization forContour-Based Target Recognition

in Aerial ImagesHaibin Duan, Senior Member, IEEE, and Lu Gan

Abstract—Target recognition for aerial images is an importantresearch issue in remote sensing applications. Many feature-basedrecognition methods have been introduced for target recognition.Nevertheless, these methods have their limitations when consid-ering the large amount of data provided by satellite imagery. Inthis paper, we explore several techniques for target recognitionin aerial images with a contour matching approach. Contours inour approach are detected by a contour grouping strategy anddescribed by edge potential function, which provides an attractionfield for edges with similar curves. In this sense, target recognitioncan be formulated as an optimization problem. An improvedchemical reaction optimization (CRO) algorithm is proposed inthis paper to deal with the target matching problem. Experi-mental results demonstrate the robustness and high efficiency ofour approach over the state-of-the-art evolutionary algorithms,which include the original CRO, predator–prey biogeography-based optimization, an improved version of brain storm opti-mization, artificial bee colony, quantum-behaved particle swarmoptimization, a self-adaptive differential evolution algorithm, andstud genetic algorithm. In addition, several case studies regardingremote sensing are also presented. The results show that theproposed method is capable of improving the application abilityof recognizing target in aerial images.

Index Terms—Aerial image, edge potential function (EPF), elitistchemical reaction optimization (ECRO), target recognition.

I. INTRODUCTION

AUTOMATIC target recognition (ATR) systems play animportant role in multiple applications, including intelli-

gence, surveillance, and reconnaissance [1]–[3]. A typical ATRprocess is composed of three stages, i.e., detection, discrimi-nation, and recognition, and applies various computer visiontechniques such as image segmentation, feature extraction, andtemplate matching [4], [5]. Target recognition is a vital post-processing for the region of interest extracted in the detectionand discrimination steps, to distinguish a target from clutter andnontargets [6]. Since target recognition is fundamental to the

Manuscript received June 2, 2014; revised July 31, 2014, September 12,2014, and September 26, 2014; accepted October 26, 2014. This work wassupported in part by the National Natural Science Foundation of China underGrants 61425008, 61333004, and 61273054, by the Aeronautical Foundationof China under Grant 20135851042, by the Top-Notch Young Talents Programof China, and by the Graduate Innovation Foundation for Beihang Universityunder Grant YCSJ-01-2014-01.

The authors are with the State Key Laboratory of Virtual Reality Technol-ogy and Systems, School of Automation Science and Electrical Engineering,Beihang University, Beijing 100191, China (e-mail: [email protected]).

Color versions of one or more of the figures in this paper are available onlineat http://ieeexplore.ieee.org.

Digital Object Identifier 10.1109/TGRS.2014.2365749

whole ATR process, it has been gaining increasing importancein recent years.

In the remote sensing literature, many schemes have beenestablished to tackle the target recognition problem and canbe classified into two main categories: feature-based strategy[7]–[14] and template-based strategy [15]–[26]. Feature-basedschemes often address a target recognition problem by object orscene classification. The crucial part of these schemes lies in theextraction of features. In the last few years, a wide variety of vi-sual features have been proposed, from low-level features (e.g.,raw pixel values, filter-bank responses, and local feature de-scriptors [7]–[11]) to high-level cues (e.g., context and semanticinformation [12]–[14]). Shackelford and Davis [7] combinedboth pixel- and object-based features for object classificationin their fuzzy model. Porway et al. [12] came up with ahierarchical and contextual model that is learned in a statisticalframework for parsing aerial images. More recently, Cheriyadat[8] incorporated sparse coding into bag-of-visual-words modelto encode low-level features for aerial scene classification. Thisunsupervised learning method achieves the highest accuracieswith the UCMERCED data set [9]. In spite of the fact that thesemethods described above have demonstrated their effectivenessin target recognition, they are not without limitations: 1) Theirarchitectures are generally complex and ambiguous; 2) a greatamount of data are needed in the training stage; and 3) they arecomputationally expensive and time consuming.

Compared with feature-based schemes, template-basedschemes are much more simple and efficient. The key insight ofthese schemes is similarity measurement. A matching pro-cedure of some image features (e.g., scale-invariant featuretransform [15]–[17], edges [18], [19], contours, or shapes [20]–[24]) is generally adopted to seek out the occurrence of a tar-get template or a reference image [18]. The invariance ofthe contour representation to brightness change, color, texture,scale, and rotation can significantly reduce the number of train-ing samples. Therefore, contour matching has become a verypopular method for target recognition.

In a typical contour matching problem, the contour of anobject is generally detected, segmented, and then involvedin a procedure to be matched to a template contour [22].Thus, a good contour detection algorithm is essential for con-tour matching. In [23], Kennedy et al. proposed a contourgrouping approach called contour cut for object detection,in which salient contours in an image are identified by sol-ving a Hermitian eigenvalue problem. This approach makes an

0196-2892 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

Page 2: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

2846 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 53, NO. 5, MAY 2015

excellent performance in avoiding foreground and backgroundclutter and then detecting reliable contours for natural images.In this paper, the contours of aerial images are extracted byimplementing this approach and described by edge potentialfunction (EPF) for target recognition afterward.

EPF was first proposed by Dao et al. [25] as an innovativesimilarity evaluation of contours. Traditional contour matchingmethods generally require sophisticated descriptions of con-tours and large computation of point-to-point distances [16],[20]. Loosely speaking, EPF is a kind of contour descriptorswhich is used to attract the template or sketch of the tar-get without needing to compute point-to-point distances. Thisnovel approach has been demonstrated to be more promisingcompared with traditional contour matching methods in termsof performance and computational complexity [25]–[28].

A Canny–Rothwell edge detector is used to build the edgemap in [25]. However, the edge map of a high-resolution aerialimage is enriched with noise and textures, which makes therepresentation of target ambiguous. Therefore, salient contoursof objects instead of naive edges are considered in this paper.The salient contours are proved to be more powerful to inhibitnoise, textures, and other interferences. Since the target shapeis expressed in a robust way by contours, this target recog-nition method becomes more efficient and practical for aerialimages.

Since EPF builds an attraction field for image contours toattract the target template, the matching problem becomes anoptimization problem that can be solved by various strategies.Dao et al. found the optimal matching with a template usinga genetic algorithm (GA)-based optimization [25]. An artificialbee colony (ABC) optimized EPF approach is used for targetrecognition of low-altitude aircraft in [26]. More recently, thedifferential search algorithm is exploited in template matchingand shows good performance [29]. Evolutionary algorithmshave been widely applied to image matching in recent years dueto their efficiency of exploring the entire search space (i.e., thewhole image). This paper investigated several state-of-the-artevolutionary algorithms for contour-based aerial image targetrecognition.

The chemical reaction optimization (CRO) algorithm is anewly developed evolutionary algorithm inspired by the natureof chemical reactions [30]. A novel optimization framework isestablished in CRO by mimicking the behaviors of moleculesinvolved in several kinds of chemical reactions. CRO hasdemonstrated excellent performances in various engineeringapplications, including task scheduling problem [31], brushlessmotor design [32], and different economic dispatch problems[33]. In CRO, the objective function value is defined as potentialenergy (PE) of a molecule, just like the concept of potential inEPF. In this sense, CRO algorithm is particularly appropriatefor matching contours described by EPF.

However, CRO is essentially stochastic when selecting whichmolecule will be involved in the next reaction and in someprocesses of reactions. This does not contribute to convergencein effect. To overcome the drawback, this paper developed anew elitist CRO (ECRO) algorithm by introducing a seriesof elitist strategies into the processes of molecule selectionand renewal. Our ECRO algorithm is then applied to contour

matching in aerial images. Experimental results demonstratethat the proposed strategies can improve the convergence abilityand searching performance compared with the original CRO.Furthermore, several state-of-the-art evolutionary algorithms,including predator–prey biogeography-based optimization(PPBBO) [34], an improved version of brain storm optimization(BSO-II) [35], ABC [36], quantum-behaved particle swarmoptimization (QPSO) [37], a self-adaptive differential evolu-tion (jDE) algorithm [38], and stud GA (SGA) [39], are alsoemployed to implement target recognition for comparativepurposes.

The remainder of this paper is organized as follows.Section II introduces the contour cut approach and the principleof EPF. Section III describes the basic principle of the originalCRO. In Section IV, our ECRO algorithm is proposed, and theoverall recognition framework is provided. Some theoreticalanalyses of ECRO algorithm are also given in this section. De-tails of our comparative experiments and results are presentedin Section V. Conclusions are finally drawn in Section VI.

II. CONTOUR CUT AND EPF

The calculation of EPF originally started from the edge mapextracted from simple digital images by the Canny–Rothwelledge extractor in [25]. For high-resolution aerial images withcluttered backgrounds, although edge information is preservedwell by the edge extractor, other undesired signals such as noiseand textures are also maintained. This makes the representationof target less prominent and unstable, thus influencing therecognition performance. Kennedy’s contour cut approach [23]formulates contour detection as a problem of searching forclosed topological cycles in graphs, producing much cleanercontours that are easy to recognize in later process. This ap-proach is adopted to construct contour maps for aerial imagesin this paper.

A. Contour Map Extraction

The contour cut approach uses a graph formulation and anuntangling-cycle cost function provided in [40] for contourgrouping. Let us construct a graph G = (V,E,W ) from animage, where graph nodes V and graph edges E correspondto vertices and edges within the image, respectively. Graphweights W are assigned according to the relative angles ofimage edges. Since persistent cycles in this weighted graphcorrespond to salient contours in the image, a contour canbe defined as (C,O), where C ⊆ V is a set of vertices andO : C → {1, . . . , |C|} (|C| is the number of the vertices in C)is a function specifying an ordering of C. The distinguishedadvantages of this method stem from a graph circulation matrixF introduced to measure the separation of a contour from theothers (the external cut) and the entanglement caused by graphedges within the contour which infringe the ordering O (theinternal cut). The calculation of F is as follows:

F = diag(π) · P (1)

where P = (diag(∑

j Wij))−1W , i and j denote different

nodes, and diag(π) is a diagonal matrix of which all diagonalelements are π. Nodes with a distant ordering are considered

Page 3: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

DUAN AND GAN: ECRO FOR CONTOUR-BASED TARGET RECOGNITION IN AERIAL IMAGES 2847

to be part of the internal cut. Let a positive integer k be thewidth of the contour. Then, the external and internal cuts can bedefined as follows:⎧⎨

⎩Ecut(C) =

∑i∈C,j �∈C

Fij

Icut(C,O) =∑

(i,j)∈C,|O(i)−O(j)|>k

Fij .(2)

After that, we can evaluate a given contour by the costfunction in which the two cuts are both considered

Ccut(C,O) =Icut(C,O) + Ecut(C)

Vol(C)(3)

where the volume Vol is the sum of weights of all edges incidentwith the contour, and it is defined as

Vol(C) =∑

i∈C,j∈EFij . (4)

Optimizing the cost function is a hard combinatorial task. Toaccelerate the computation without loss of accuracy, a circularembedding strategy is employed to encode the two kinds of cutsby mapping graph nodes to points in the complex plane. Using acircular embedding, nodes of the contour are mapped to pointsalong a circle, while other nodes are mapped to the origin bythe following formula:

xj = rj exp(iθj) (5)

where i =√−1 is the imaginary unit and j is an index of nodes.

The radius rj is set to one if j ∈ C and zero if otherwise, andthe angle θj = O(j)(2π/ |C|). The ordering of each node isencoded by θj , and whether it is a part of the contour is encodedby rj . Therefore, for the circular embedding of a contourx ∈ C |C| (x is a vector composed of xj), (2) and (3) can beupdated as follows:

⎧⎪⎪⎪⎪⎨⎪⎪⎪⎪⎩

Ecut(x) =∑

(i,j)∈EFijri (1− rj)

Icut(x) =∑

(i,j)∈CFijrirj

(1− cos

(θj − θi − 2π

|C|

))

Vol(x) =∑

(i,j)∈EFijri.

(6)

Substituting (5) and (6) into (3), we can finally obtain

Ccut(x)=Icut(x)+Ecut(x)

Vol(x)=

x∗[diag(π)−H

(2π|C|

)]x

x∗diag(π)x(7)

where H(2π/|C|)=(F exp(−i2π/|C|)+FTexp(i2π/|C|))/2, x∗

is the conjugate transpose of the complex vector x, and FT

is the transpose of the matrix F . So far, contour grouping isconverted into the problem of minimizing (7). Kennedy furthertransformed it into a Hermitian eigenvalue problem, and the fullderivation of this process is given in [23].

In this paper, the contour cut approach is adopted to extractthe contour map from aerial images. This method starts withedge detection, followed by optimizing a contour cut criterionestablished above. Then, EPF can be used to build powerfulrepresentations of the contour map for matching.

B. EPF

The conception of EPF is derived from the physics of elec-tricity, modeling the image edges as charged elements in orderto generate an attraction field over objects with similar shapes[25]. In EPF, a template of target is thus expected to be attractedby a set of equivalent charged edge points which maximizethe potential. This provides an efficient method for contourmatching instead of the consumptive calculation of point-to-point distances in traditional algorithms.

EPF simulates the phenomenon in electricity in which a setof point charges Qi in a homogeneous background generates apotential. The intensity of the potential is determined by

v �(r) =1

4πε

∑i

Qi

|⇀r − ⇀

ri|(8)

where �r and �ri are the observation point and the charge location,respectively, and ε is the electrical permittivity of the medium.Similarly, the ith point in an image at coordinates (ui, vi) canbe regarded as a point charge Qeq(ui, vi); then, the potential ofeach pixel in the image can be calculated as follows:

EPF(u, v) =1

4πεeq

∑i

Qeq(ui, vi)√(u− ui)2 + (v − vi)2

(9)

where εeq is the equivalent permittivity of image background.Dao et al. also proposed several alternative approaches in

[25], among which windowed EPF (WEPF) is a significantimprovement of the EPF similarity measure and computationspeed. In this model, a window W (εeq) is defined to limitthe influence radius of a charged element to the potential.The potential of each pixel in WEPF model is calculatedaccording to

WEPF(u, v)=Q

4πεeq

∑(ui,vi)∈W (εeq)

1√(u− ui)2+(v − vi)2

.

(10)

Here, all edge points are modeled as equal charges Q tosimplify the assumption. The window is defined to center oneach pixel in the image, and only edge points within it areconsidered. This mechanism mitigates the impact of clutter andhighly dense edge map, thus improving both robustness andspeed of contour matching in the next stage.

In this paper, WEPF is adopted to translate contour map intoedge potential. In the matching process, a contour templateis expected to be attracted by the part of edge potential fieldwhere a similar shape is present. The higher the similarityof the contour template and the shape is, the higher the totalattraction generated by the potential field is. Once a set ofpoints with the maximal potential has been found, the searchedtarget is recognized. Since the potential function is complexand multimodal, good optimization strategies are essential forthe matching problem. This paper proposed a new ECROfor contour-based target recognition and further investigatedseveral advanced evolutionary algorithms for this visual task.

Page 4: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

2848 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 53, NO. 5, MAY 2015

TABLE IELEMENTARY ATTRIBUTES OF A MOLECULE

TABLE IIELEMENTARY REACTIONS IN CRO

III. CRO

CRO was initially designed to work in the discrete domain.In 2012, Lam et al. developed a real-coded version of CRO[41], which is suitable for both continuous and discrete op-timization problems. In CRO, some elementary reactions arein analogy with the crossover operator of GA, and the energyconservation law produces similar effects of the Metropolisalgorithm used in simulated annealing (SA). Therefore, CROenjoys the advantages of both SA and GA and demonstratedits strong competitive advantage in solving many real-worldproblems [31]–[33]. This section describes the basic principleof the original CRO.

The basic operating agents of CRO are molecules, and thestructure of a molecule represents a potential solution. Eachmolecule has some elementary attributes of which the mathe-matical meanings are listed in Table I. The first two attributesare essential for an optimization problem, while the last fourattributes are designed to help the algorithm escape from localoptima (more discussion can be found later in this section).Other attributes can be optionally introduced, depending onpractical problems, e.g., the upper limit, the lower limit, andthe current optimal solution. Since molecules change theirstructures and, consequently, their energies through collisions,four kinds of elementary reactions are designed according tothe type of collisions in CRO. The operators of these reactionsare given in Table II.

Details of the specific operators in four kinds of reactionsare problem dependent and designed in Section IV. Since everychemical reacting system tends to release energy, the initialreactants with excess energy undergo a series of collisions, gothrough some transition states, and become the final products in

Fig. 1. Illustration of a chemical reaction on the PES.

low-energy stable states on the PE surface (PES). This processcan be illustrated in Fig. 1. Chemical reactions attempt to seekthe structure of a molecule which has the lowest PE on thePES. When the molecule with the lowest PE is generated after asequence of reactions, the structure of this molecule is the bestsolution of the problem.

Furthermore, there are several laws and restrictions distin-guishing CRO from other evolutionary algorithms. The mostimportant law in CRO is the conservation of energy, which canbe described as follows:

E ′total =E ′

buffer + PEω′1+ KEω′

1+ PEω′

2+ KEω′

2

+ · · ·+ PEω′n+ KEω′

n

=Ebuffer + PEω1+ KEω1

+ PEω2+ KEω2

+ · · ·+ PEωm+ KEωm

=Etotal (11)

where E ′buffer and Ebuffer denote the energies in an energy

buffer before and after a reaction, and KE represents the kineticenergy of a molecule. The terms n and m are the numbers ofproducts and reactants, respectively. In order to avoid moleculesfrom getting stuck to a local minimum, CRO has also designeda restriction to govern the acceptance of a new solution in eachreaction. This restriction can be described as follows:

Eexcess > 0 (12)

where Eexcess is the excessive energy in a reaction, which iscalculated by subtracting the PE of products from the totalenergy of reactants. Overall, if (11) and (12) are both satisfied,a reaction can be successfully implemented; otherwise, it willbe canceled.

Another restriction also plays an important role in escapingfrom local optima. Violent reactions such as decomposition andsynthesis should happen if

Indexcurrent − Indexlast > decThres (13)

where decThres is a user-defined threshold for inactive degree.When the number of consequent reactions in which a bettersolution has not been generated is sufficiently large, CRO is

Page 5: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

DUAN AND GAN: ECRO FOR CONTOUR-BASED TARGET RECOGNITION IN AERIAL IMAGES 2849

considered to get attentive in local optima. Then, more severereactions happen to make a big change of molecular structures.As a result, the energy conservation law cooperates with theserestrictions to maintain the diversity of molecular structures andkeep CRO away from local optima.

IV. ECRO-OPTIMIZED CONTOUR-BASED

TARGET RECOGNITION

In previous sections, the contour extraction method and theconcept of EPF have been discussed, as well as the originalCRO algorithm. Although CRO is superior in many applica-tions, some stochastic processes in CRO have a negative impacton its convergence property. In this section, we designed anew ECRO algorithm aiming at the issue of target recognitionin aerial images. Then, the procedure of ECRO-optimizedcontour-based target recognition is given in detail.

A. Problem Formulation

In this method, we define a binary image containing a con-tour of the target as a template, and we have a test image whichthe template attempts to match. Before the execution of ECRO,WEPF of the contour map extracted from a test image by thecontour cut approach is calculated. The ultimate goal of thismethod is to find the optimal translation parameters (tu, tv),rotation angle (θ), and scaling factor (s) of the template withrespect to a test image. So, the structure of a molecule isdefined as

ω = (tu, tv, θ, s). (14)

If the target is contained in a test image, the correctly roto-translated and scaled template will be fitted within the potentialfield of this image. To evaluate whether the presence of thetemplate is in a test image by similarity measure, the objectivefunction of ECRO is defined as a matching score, based on thepotential field

f(ω) =1

N (ω)

N(ω)∑n(ω)=1

{WEPF

(u(ω)n , v(ω)

n

)}(15)

where n(ω) is the nth pixel of the template contour after ageometric transformation according to the four elements ofω, (u

(ω)n , v

(ω)n ) are its corresponding vertical and horizontal

coordinates, and N (ω) is the total number of pixels in thetransformed template.

The average attraction generated by a test image upon theroto-translated and scaled template is calculated using (15).Once f(ω) is maximized by a combination of geometric param-eters in ω, the test image is considered to contain the particulartarget, and the target recognition task is thus efficiently accom-plished. Since ECRO seeks a minimal nonnegative PE value,the PE of a molecule with structure ω is set as follows:

PEω = M − f(ω) (16)

where M is a given positive constant large enough to guaranteea legal PE. So far, we have formulated target recognition as a

problem of optimizing the geometric parameters, which can beconveniently tackled by the ECRO algorithm.

B. ECRO

In the original CRO algorithm, molecule(s) involved in thenext reaction is (are) randomly chosen. Although this operatorcontributes to the diversity of population, the converging rate ofCRO is low since molecules with worse solution can be chosenin every reaction. This makes the target recognition methodtime consuming and cannot be applied to a real-time practicalapplication. To overcome this shortcoming, we introduced eli-tist selection procedure and operators with elitist strategies intothe original CRO algorithm and demonstrated the efficiency.

The elitist selection procedure speeds up the convergence ofCRO while still retains its performance. It is executed by defin-ing two attributes of the molecule: affinity and concentration.The affinity represents the quality of a solution to the problem,and the concentration reflects the proportion of molecules withsimilar structures in the current population [42].

We rearrange all molecules in terms of PE in ascendingsequence, and the affinity of a molecule moli with structure ωi

is defined as

Aff(moli) = rand · (1-rand)i−1 (17)

where i is the location index of the molecule after rearrange-ment and rand is a random number in (0, 1). Thus, the affinityvalue is only related with the index of the molecule rather thanits PE. Moreover, the concentration value of a molecule can beobtained by⎧⎪⎪⎪⎨

⎪⎪⎪⎩Con(moli) =

m∑j=1

Ks(moli,molj)

m

Ks(moli,molj) =

{1, . . . ‖ωi, ωj‖ ≤ ConThres0, . . . otherwise

(18)

where ConThres is the concentration threshold and m is thetotal number of molecules.

In an elementary reaction, a roulette selection can be adoptedto choose molecule(s) to collide, and the selection probabilityof a molecule is calculated as follows:

Pro(moli) =

Aff(moli)Con(moli)

m∑i=1

{Aff(moli)Con(moli)

} . (19)

Therefore, molecules with high concentration value would berejected while those with high affinity value would be selected,which helps ECRO select the elitist of population. The elitistselection procedure could avoid the elitist from staying un-changed during iterations, thus contributing to convergence.Note that, although the molecule is selected with priors, thelaws and restrictions still enable the algorithm getting rid oflocal minima.

In ECRO, the molecule is selected through the elitist se-lection procedure, and some mechanisms stated previously areused to make a choice of reaction types. Each type of reactions

Page 6: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

2850 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 53, NO. 5, MAY 2015

corresponds to a search operator that changes the molecularstructure and, then, the PE. However, the operators in the ori-ginal CRO are totally stochastic processes in which the globalinformation is not used at all. In order to improve the per-formance of CRO on target recognition, elitist strategies arecombined into these operators. The pseudocodes of on-wall in-effective collision and decomposition are shown in Operators 1and 2, respectively.

Operator 1 OnwallIneffectiveCollision (ω)Input: a solution ωDuplicate ω to produce ω′

Get i randomly in the set of {1, 2, . . . , n}λ ← 1− (FEcurrent/FElimit)ω′(i) ← ω′(i) + λ · sz · (ωrep(i)− ω′(i))Check (ω′)Output: ω′

Operator 2 Decomposition (ω)Input: a solution ωDuplicate ω to produce ω′

1 and ω′2

λ ← 1− (FEcurrent/FElimit)for i ← 1 to n/2 do

Get rand randomly in (0, 1)if (rand > 0.5) then

ω′1(i) ← ω′

1(i) + λ · sz · (ωrep(i)− ω′1(i))

elseω′2(i) ← ω′

2(i) + λ · sz · (ωrep(i)− ω′2(i))

end ifend forCheck (ω′

1); Check (ω′2)

Output: ω′1 and ω′

2

These two operators are designed to generate a neighbor ofthe solution ω selected by elitist selection toward the globalbest solution ωrep. Here, n is the dimension of the solution,and it is set as four in the target recognition. FEcurrent andFElimit are the current and maximum function evaluations(FEs), respectively, and the sz term denotes the step size towardthe global best solution. The important variable λ decreasesas FEcurrent increases and becomes zero when the maximumFE criterion is matched. Since λ can control the intensity ofperturbations, global search gradually turns to local search asiteration proceeds. In this way, ECRO is supposed to convergefaster and find a more precise result. The check function is usedto ensure that both upper and lower limits are met, which is de-fined according to practical problems. In this paper, the functionexamines the validity of the geometric parameters in this way:If the size of the roto-translated and scaled template accordingto ω exceeds the size of the test image, then one dimension ofω is regenerated randomly until the size limits are met.

Intermolecular ineffective collision and synthesis operatorsare similar to the crossover operator in GA, which produces off-spring by recombining information from two parents. In ECRO,

two superior reactants exchange a part of their structures tocreate two products by intermolecular ineffective collision.In synthesis, they are combined into a single product. Theseoperators are described as follows:

Operator 3 IntermolecularIneffectiveCollision (ω1, ω2)Input: solutions ω1 and ω2

for i ← 1 to n doGet rand randomly in (0, 1)if (rand > 0.5) then

ω′1(i) ← ω1(i)

ω′2(i) ← ω2(i)

elseω′1(i) ← ω2(i)

ω′2(i) ← ω1(i)

end ifend forCheck (ω′

1); Check (ω′2)

Output: ω′1 and ω′

2

Operator 4 Synthesis (ω1, ω2)Input: solutions ω1 and ω2

for i ← 1 to n doGet rand randomly in (0, 1)if (rand > 0.5) then

ω′(i) ← ω1(i)elseω′(i) ← ω2(i)

end ifend forCheck (ω′)Output: ω′

By implementing these operators, the elitist of populationcan be maintained and recombined to produce more promisingsolutions. ECRO thus combines the framework of CRO andthe advantages of GA. Experimental comparisons demonstratethe superiorities of ECRO in Section V. After the design ofoptimizing variables, objective function, size limits, and oper-ators with elitist strategies, we can directly implement ECROalgorithm to accomplish the visual task.

C. Implementation Procedure

The implementation procedure of the proposed ECRO-optimized contour-based target recognition can be described asfollows.

Step 1) Contour extraction: Obtain the test image, and con-duct median filtering operation to it to alleviate theeffect of noise. Adopt contour cut approach to detectsalient contours in the image, in order to improve therobustness of matching in cluttered environments,and then extract them to build a contour map.

Page 7: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

DUAN AND GAN: ECRO FOR CONTOUR-BASED TARGET RECOGNITION IN AERIAL IMAGES 2851

Fig. 2. Schematic diagram of ECRO-optimized contour-based target recognition.

Step 2) WEPF calculation: According to the contour mapobtained by Step 1) and the windowed edge potentialfield function model given in (10), calculate WEPFof the test image.

Step 3) ECRO search: Initialize the parameters of ECRO,such as the population of molecules iniPopSize,the kinetic energy iniKE, and the energy in thecenter buffer iniBuffer. Initialize the maximum FEsFElimit, the step size sz used for search, and somecontrol parameters decThres, synThres, lossRate,and collRate. Assign random solutions to the molec-ular structures as (14) within the upper and lowerlimits.

Step 4) Calculate the fitness value PE of each moleculeaccording to (15) and (16) and then the affinityand concentration value according to (17) and (18)based on its PE. Calculate the selection probabilityof each molecule according to (19). Generate arandom number rand in the range of (0, 1). If rand >collRate, select a molecule by roulette selection, andgo to Step 5). Otherwise, select two molecules, andgo to Step 6).

Step 5) If Indexcurrent − Indexlast > decThres, the decom-position is successfully implemented when restric-tions are satisfied. Otherwise, conduct the on-wallineffective collision operator. Go to Step 7).

Page 8: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

2852 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 53, NO. 5, MAY 2015

TABLE IIICOMPLEXITY ANALYSIS OF ADDED OPERATIONS IN ECRO

Step 6) If the synthesis criterion is matched, the synthesisis successfully implemented when restrictions aresatisfied. Otherwise, conduct the intermolecular in-effective collision operator.

Step 7) Check for any new minimum solution. Update theglobal best solution ωrep, current FEs FEcurrent, andsome other variables in ECRO.

Step 8) Check whether the stopping criterion is matched.If so, output the optimal solution and its objectivefunction value f (ω). Otherwise, return to Step 4).

Step 9) Visualize the matching result by marking the roto-translated and scaled template according to the opti-mal solution on the original test image.

The schematic diagram of the contour-based target recogni-tion using the ECRO algorithm is given in Fig. 2.

D. Theoretical Analysis

The theoretical analysis of evolutionary algorithms is veryimportant. Here, we have concisely analyzed our ECRO algo-rithm in respect of computational complexity and convergenceproperty. Reference [43] gives a survey on the time complexityof evolutionary algorithms developed in the recent decade.To simplify the problem, we compare the difference betweenthe time complexity of ECRO and that of the original CRO.In ECRO, the molecule is selected through elitist selectionprocedure; thus, there are some operations only involved inECRO. The complexity analysis of these added operations inan iteration is given in Table III.

Therefore, the total complexity of the added operations inECRO in all iterations is Θ(FElimit ·m2). Even though ECROimplements more calculations compared with the original CRO,the advantage of our method is that the increase in the amountof computation is not great with the significant improvement ofperformance.

In [44], Lam et al. have already demonstrated that CROcan be modeled as a finite absorbing Markov chain. Since thesolution space of the target recognition problem in this paper isdiscrete, large, and finite, it can be regarded as a combinatorialoptimization problem A. We denote the solution space of A byχ and the additional information of a molecule, including thekinetic energy, the local minimum, etc., by ζ. Then, a moleculeω can be represented by an element of χ× ζ. Accordingto [44], the evolving process of CRO on solving A can bemodeled by an absorbing Markov chain {SA

t }+∞t=0 which sat-

isfies P{St+1 �∈ Ω|St ∈ Ω} = 0, t = 0, 1, 2, . . .. Here, SAt

Δ=

ωA,rept × ωA

1 × ωA2 × · · · × ωA

n , where ωA,rept is the current

TABLE IVCONTROL PARAMETERS IN THE EIGHT ALGORITHMS

best solution up to time t, ωAi ∈ χ× ζ for 1 ≤ i ≤ nmax, and

nmax is the maximum population size of CRO.Under the framework of Markov process, Lam et al. have

proved the convergence of CRO by associating CRO with aMarkov chain that constitutes a nonrecessionary sequence. De-tails of the proof are given in [44]. Since the whole frameworkof our ECRO is the same as that of CRO, the conclusions ofCRO algorithm also apply to ECRO. To investigate the effectof elitist strategies on the convergence property, we examinedsolution graphs based on the operators designed in our ECROaccording to [44]. Since there is not a solution graph that isnot optimum reachable, ECRO will converge to an optimalsolution.

V. EXPERIMENTS ON AERIAL IMAGES

To test the feasibility and effectiveness of the proposedECRO algorithm in contour-based target recognition, a series

Page 9: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

DUAN AND GAN: ECRO FOR CONTOUR-BASED TARGET RECOGNITION IN AERIAL IMAGES 2853

Fig. 3. Target recognition results obtained by the eight evolutionary algorithms for Case I: (a) Test image, (b) relevant contour map obtained by the contourcut approach, (c) relevant edge map extracted by the Canny–Rothwell algorithm, (d) edge potential distribution of (b), (e) template contour, (f) optimal matchingresults of these algorithms, and (g) evolution curves of these algorithms.

of experiments has been conducted on two real remote sensingdata sets: UCMERCED data set [9] and aerial images extractedfrom Google Earth [45]. In this paper, the performance ofECRO algorithm is compared with those of the original CROand other state-of-the-art evolutionary algorithms, includingPPBBO [34], BSO-II [35], ABC [36], QPSO [37], jDE [38],and SGA [39]. In the last few years, these algorithms havebeen widely used for optimization in the fields of science andengineering. Most of them are the improved version of the

original algorithms, showing more excellent performance. Weemploy these algorithms to optimize the matching score of atemplate and a test image. When the maximal score is obtainedby algorithms, the particular target such as an airplane and aboat is recognized in aerial images.

As a preprocessing step, we implement scaling to preventnumerical difficulties in the calculation and ensure that theupper and lower limits defined in algorithms are suitable fordifferent cases. All test images are normalized in size as

Page 10: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

2854 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 53, NO. 5, MAY 2015

TABLE VPERFORMANCE COMPARISON OF THE EIGHT ALGORITHMS ON THE THREE TEST CASES

256 × 256, and all templates are normalized as 120 × 120.The experimental platform is a PC with Intel Core i5, 2.6-GHzCPU, 4-GB memory, and 32-b Windows 7. All algorithms areimplemented by Matlab 8.0.0.783(R2012b) and use the sametemplate, contour map, and WEPF description of the test imagefor fair comparisons. In all test cases, the control parameters inthese algorithms are set as follows: 1) for CRO and ECRO, thecommon parameters are taken as in [41], and other parametersare selected empirically through trial and error, and 2) for otheralgorithms, the parameters are those widely adopted by otherresearch works, and good performances are reported. Theseparameters are given in Table IV.

We first test our method on the UCMERCED data set [9],images in which are manually extracted aerial orthoimageryfrom the U.S. Geological Survey National Map. This data setcontaining 21 challenging scene categories with 100 samplesper class is quite representative in the research of aerial sceneclassification and target recognition. The objective of this ex-

periment (Case I) is to find an airplane at the airport with atemplate. In this experiment, the template is manually extractedfrom the edge map of the test image. The population size andmaximum iteration in each algorithm are set as 200 and 50,respectively.

Fig. 3 shows the best matching results of Case I obtainedby the aforementioned eight algorithms during 30 independentruns and the convergence curves of the best fitness values. Thestatistical results of these algorithms are also listed in Table V,where the better values are highlighted in boldface. Note thatthe matching score of each algorithm reported in Fig. 3 is notthe prime fitness value but similarity measure of the matchingobtained by (15). Since two molecules at most are involvedin a reaction, the best value for every 200 FEs in CRO-basedalgorithms is considered as the fitness value for a generation inother algorithms for comparative purposes.

Fig. 3(b) is the contour map of the test image obtained by thecontour cut approach, in which different groups of contours are

Page 11: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

DUAN AND GAN: ECRO FOR CONTOUR-BASED TARGET RECOGNITION IN AERIAL IMAGES 2855

Fig. 4. Many-to-one matching results obtained by ECRO using the template in Fig. 3(e): (a)–(c) are the recognition results for different targets with similar shapeto the model. The matching scores of (a)–(c) are 11.0134, 12.3478, and 8.9939, respectively.

Fig. 5. Discrimination test results obtained by ECRO: (a)–(c) are the recognition results for a target from its corresponding alternatives. The template in eachtest is shown in each image. The matching scores of (a)–(c) are 16.2215, 22.4547, and 15.7331, respectively.

represented using different colors. Compared with the edge mapextracted by the Canny–Rothwell algorithm in [25], the contourcut approach inhibits noise and textures in an effective wayand extracts salient and integral contours of the target, which isquite beneficial to matching in the next stage. For example, thelawn abundant in textures is fully extracted in Fig. 3(c), whileabandoned by the contour cut approach. The reason for this isthat edge detection methods such as the Canny–Rothwell onlymake use of the gradient information in an image and thereforecannot distinguish between textures and the target. The firststep of contour cut approach is edge detection, and then, thearchitecture information of the detected edges is fully utilizedto obtain salient and integral contours of the target. Since lawnsare very common backgrounds in aerial images, the contour cutapproach shows outstanding performance in the preprocessingstep of aerial target recognition.

Many matching methods rely on one-to-one matching ofdescriptors to a target. In these methods, the template mustbe extracted from the same object as the target. Contour-basedtarget recognition can address this problem by using many-to-one matching of image contours to a target. Namely, we canuse a template extracted from a typical object to recognize acategory of targets. In this approximate matching, the match-ing score can be treated as the degree of similarity between

an object and the target. For example, we use the templatecontour in Fig. 3(e) to recognize airplanes in Fig. 4. Eachresult is obtained by implementing the ECRO algorithm foronce. Although the targets are not the extract same as thetemplate, they can be successfully recognized. Experimentalresults demonstrated that contour representation can deal withthe slight shape deformation of the target, and the proposedrecognition method is effective in a category of targets whichhave similar but not identical shapes.

At the same time, our method has acute discernment ofobjects with different shapes. In the precise matching, a highmatching score can distinguish the identical shape from thesimilar shape. In Fig. 5, some aerial scenes, in which severalairplanes with different shapes are presented, have been usedto test the discriminating power of our method. The templatein each test is shown in each image. Each result is obtainedby implementing the ECRO algorithm for once. Results showthat the proposed method can effectively discriminate the visualtarget from its corresponding alternatives, thus being moreresistant to interferences.

The following experiments (Case II and Case III) are per-formed on the aerial images captured from Google Earth[45]. In these experiments, the visual task is to recognize aboat alongside the wharf. The population size and maximum

Page 12: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

2856 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 53, NO. 5, MAY 2015

Fig. 6. Target recognition results obtained by the eight evolutionary algorithms for Case II (map data are copyright of 2014 Google and 2014 Digital Globe):(a) Test image, (b) relevant contour map obtained by the contour cut approach, (c) relevant edge map extracted by the Canny–Rothwell algorithm, (d) edgepotential distribution of (b), (e) template contour, (f) optimal matching results of these algorithms, and (g) evolution curves of these algorithms.

iteration are 200 and 100, respectively. Figs. 6 and 7 showthe best results of Case II and Case III during 30 independentruns and the convergence curves of the best fitness values. Thecorresponding statistical results are illustrated in Table V, andthe results of many-to-one matching experiment using ECROin Case II are shown in Fig. 8.

It can be observed that almost all tested algorithms can findthe correct target in a relatively clean background in Case I, butin the latter two cases, only part of algorithms can recognize thetarget precisely in complex backgrounds within 100 iterations.For example, although ABC found high matching scores inCase II, Fig. 6(f) shows that it has actually gotten stuck to alocal extremum instead of a global optimum. A similar situationhas been observed in Case III in which PPBBO found higherfitness values than ABC. This is because of the character of

the objective function and the behavior of the algorithm. Sincethe function is complex and multimodal with more than onepeak, there are some local optima which are not the correctresults but have relatively high fitness values. This also reflectsthat ABC and PPBBO have worse ability to maintain thediversity of population and to get rid of local optima comparedwith other algorithms. It is noticed that the character of theobjective function does not influence the performance of ourECRO algorithm. In all cases, ECRO has successfully found theglobal optimum that is the true position of the target, demon-strating the effectiveness of both objective function and ECROalgorithm.

In addition, there is a severe translation of the visualizedresults obtained by QPSO in Case II and ABC and jDE inCase III. The reason is that these algorithms have very low

Page 13: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

DUAN AND GAN: ECRO FOR CONTOUR-BASED TARGET RECOGNITION IN AERIAL IMAGES 2857

Fig. 7. Target recognition results obtained by the eight evolutionary algorithms for Case II (map data are copyright of 2014 Google and 2014 Digital Globe):(a) Test image, (b) relevant contour map obtained by the contour cut approach, (c) relevant edge map extracted by the Canny–Rothwell algorithm, (d) edgepotential distribution of (b), (e) template contour, (f) optimal matching results of these algorithms, and (g) evolution curves of these algorithms.

Fig. 8. Many-to-one matching results obtained by ECRO using the template in Fig. 6(e) (map data are copyright of 2014 Google and 2014 Digital Globe): (a)–(c) arethe recognition results for different targets with similar shapes to the model. The matching scores of (a)–(c) are 14.5745, 17.2863, and 14.9043, respectively.

Page 14: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

2858 IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, VOL. 53, NO. 5, MAY 2015

converging rate and have not found an optimum within 100 iter-ations. In this regard, the fast convergence speed of an algorithmis crucial for target recognition, particularly in aerial imageswith complex backgrounds. In the three groups of experiments,the best matching results obtained by our ECRO algorithm areobviously better than those obtained by others. An intuitivereason is that the underlying principle of CRO-based algorithmsis the conservation of energy, which is more appropriate for theoptimization of EPF. In addition, CRO-based algorithms have avariable population size in the implementation process. By ad-justing the population size, they can prefer either diversificationor intensification according to the problem to be solved.

Although the original CRO has an acceptable performanceamong these algorithms, the efficiency of CRO is low comparedwith ECRO. Since CRO behaves like a random search to traversethe whole PES, the practical time for convergence may be verylong. By utilizing the elitist of the whole population, our ECROcan be considered to focus on some promising regions in PESwhere the global optimum solution is more likely to be, proved toenhance the efficiency of CRO for the target recognition problem.

Furthermore, ECRO has an excellent searching performance.From Table V, it can be seen that the maximal matching scoresin the three test cases are 24.5365, 18.3489, and 16.0261,respectively, which are all obtained by ECRO. Moreover,ECRO obtained the best average matching scores in Case I andCase III, which are much higher than those obtained by others.Although the variances in ECRO are still higher than those inother algorithms, it can be observed that ECRO has enhancedthe robustness of the original CRO. Based on an overall con-sideration of computational complexity, convergence property,efficiency, and searching ability, the proposed ECRO algorithmis superior to the seven advanced evolutionary algorithms insolving the contour-based aerial target recognition problem.

VI. CONCLUSION

A novel ECRO-optimized contour-based target recognitionmethod has been proposed in this paper, aiming at designingan automatic, fast, and robust process of target recognitionfor aerial images. At the systematic level, a salient contouridentification approach is adopted to extract discriminatingcontour groups in aerial images. This method also integratesthe conception of EPF to build an attraction pattern for imagecontours to attract the template. At the algorithmic level, animproved CRO algorithm has been proposed to seek for theoptimal matching results in this paper.

This hybrid method takes advantage of the stability of con-tour cut approach and the accuracy of EPF, having demon-strated very good performances even in the presence of noiseand clutters. Experimental results in case studies show that ourmethod succeeded in target recognition in aerial images throughtranslation, rotation, and scaling. Moreover, it is effective formany-to-one matching when the template is not directly cap-tured from the test image, thus making the target recognitionmethod more robust and applicable.

The proposed ECRO algorithm incorporates elitist strategiesinto the framework of CRO, which include elitist selection,evolution, and crossover. The elitist strategies can improve the

searching ability of CRO by utilizing the global information. Inaddition, a time-variant parameter is adopted here to control theintensity of elitist evolution.

Several advanced evolutionary algorithms, includingPPBBO, BSO-II, ABC, QPSO, jDE, SGA, and the originalCRO, are also employed in our work to accomplish targetrecognition in aerial images. The statistical performances ofthe algorithms are presented in Section V. It is concluded thatECRO outperforms the state-of-the-art evolutionary algorithmsin terms of exploring ability and convergence rate. As a result,the proposed method can enhance the accuracy and robustnessof target recognition for aerial images.

ACKNOWLEDGMENT

The authors would like to thank Y. Yang and S. Newsam forproviding the UC Merced Land Use Data Set and Google Earthfor the map data. Map data are copyright of 2014 Google and2014 Digital Globe. All rights reserved.

REFERENCES

[1] D. E. Dudgeon and R. T. Lacoss, “An overview of automatic target recog-nition,” Lincoln Lab. J., vol. 6, no. 1, pp. 3–10, 1993.

[2] B. Bhanu, “Automatic target recognition: State of the art survey,” IEEETrans. Aerosp. Electron. Syst., vol. AES-22, no. 4, pp. 364–379, Jul. 1986.

[3] B. Bhanu et al., “Guest editorial introduction to the special issue onautomatic target detection and recognition,” IEEE Trans. Image Process.,vol. 6, no. 1, pp. 1–6, Jan. 1997.

[4] J. I. Park, S. H. Park, and K. T. Kim, “New discrimination features forSAR automatic target recognition,” IEEE Geosci. Remote Sens. Lett.,vol. 10, no. 3, pp. 476–480, May 2013.

[5] G. Gao, “An improved scheme for target discrimination in high-resolutionSAR images,” IEEE Trans. Geosci. Remote Sens., vol. 49, no. 1, pp. 277–294, Jan. 2011.

[6] C. J. F. Gallant, “Automatic target recognition for synthetic apertureradar,” R. Can. Air Force J., vol. 2, no. 3, pp. 8–18, 2013.

[7] A. K. Shackelford and C. H. Davis, “A combined fuzzy pixel-based andobject-based approach for classification of high-resolution multispectraldata over urban areas,” IEEE Trans. Geosci. Remote Sens., vol. 41, no. 10,pp. 2354–2363, Oct. 2003.

[8] A. M. Cheriyadat, “Unsupervised feature learning for aerial scene classi-fication,” IEEE Trans. Geosci. Remote Sens., vol. 52, no. 1, pp. 439–451,Jan. 2014.

[9] Y. Yang and S. Newsam, “Bag-of-visual-words and spatial extensions forland-use classification,” in Proc. ACM Int. Conf. Adv. Geograph. Inf. Syst.,2010, pp. 270–279.

[10] S. Das, T. T. Mirnalinee, and K. Varghese, “Use of salient featuresfor the design of a multistage framework to extract roads from high-resolution multispectral satellite images,” IEEE Trans. Geosci. RemoteSens., vol. 49, no. 10, pp. 3906–3931, Oct. 2011.

[11] D. X. Dai and W. Yang, “Satellite image classification via two-layer sparsecoding with biased image representation,” IEEE Geosci. Remote Sens.Lett., vol. 8, no. 1, pp. 173–176, Jan. 2011.

[12] L. Bruzzone and L. Carlin, “A multilevel context-based system for clas-sification of very high spatial resolution images,” IEEE Trans. Geosci.Remote Sens., vol. 44, no. 9, pp. 2587–2600, Sep. 2006.

[13] M. Lienou, H. Maitre, and M. Datcu, “Semantic annotation of satelliteimages using latent Dirichlet allocation,” IEEE Geosci. Remote Sens.Lett., vol. 7, no. 1, pp. 28–32, Jan. 2010.

[14] J. Porway, K. Wang, and S. C. Zhu, “A hierarchical and contextual modelfor aerial image understanding,” in Proc. IEEE Conf. Comput. Vis. PatternRecog., 2008, pp. 1–4.

[15] L. P. Dorado-Munoz, M. Velez-Reyes, A. Mukherjee, and B. Roysam,“A vector SIFT detector for interest point detection in hyperspectral im-agery,” IEEE Trans. Geosci. Remote Sens., vol. 50, no. 11, pp. 4521–4533,Nov. 2012.

[16] Art. ID. 1Y. Han, J. Choi, Y. Byun, and Y. Kim, “Parameter optimizationfor the extraction of matching points between high-resolution multisensorimages in urban areas,” IEEE Trans. Geosci. Remote Sens., vol. 52, no. 9,Sep. 2013.

Page 15: Elitist Chemical Reaction Optimization for Contour-Based ...hbduan.buaa.edu.cn/papers/2015_IEEE_TGRS_Duan.pdf · compared with traditional contour matching methods in terms of performance

DUAN AND GAN: ECRO FOR CONTOUR-BASED TARGET RECOGNITION IN AERIAL IMAGES 2859

[17] B. Li et al., “Image matching based on two-column histogram hashingand improved RANSAC,” IEEE Geosci. Remote Sens. Lett., vol. 11, no. 8,pp. 1433–1437, Aug. 2014.

[18] A. Hajdu and I. Pitas, “Optimal approach for fast object-template match-ing,” IEEE Trans. Image Process., vol. 16, no. 8, pp. 2048–2057,Aug. 2007.

[19] C. F. Olson and D. P. Huttenlocher, “Automatic target recognition bymatching oriented edge pixels,” IEEE Trans. Image Process., vol. 6, no. 1,pp. 103–113, Jan. 1997.

[20] C. D. Ruberto and A. Morgera, “ACO contour matching: A dominantpoint approach,” in Proc. 4th Int. Congr. Image Signal Process., 2011,pp. 1391–1395.

[21] F. Eugenio and F. Marques, “Automatic satellite image georeferencingusing a contour-matching approach,” IEEE Trans. Geosci. Remote Sens.,vol. 41, no. 12, pp. 2869–2880, Dec. 2003.

[22] P. Srinivasan, Q. Zhu, and J. Shi, “Many-to-one contour matching fordescribing and discriminating object shape,” in Proc. IEEE Conf. Comput.Vis. Pattern Recog., San Francisco, CA, USA, 2010, pp. 1673–1680.

[23] R. Kennedy, J. Gallier, and J. Shi, “Contour cut: Identifying salient con-tours in images by solving a Hermitian eigenvalue problem,” in Proc.IEEE Conf. Comput. Vis. Pattern Recog., Providence, RI, USA, 2011,pp. 2065–2072.

[24] S. Belongie, J. Malik, and J. Puzicha, “Shape matching and object recog-nition using shape contexts,” IEEE Trans. Pattern Anal. Mach. Intell.,vol. 24, no. 4, pp. 509–522, Apr. 2002.

[25] M. S. Dao, F. G. B. De Natale, and A. Massa, “Edge potential functions(EPF) and genetic algorithms (GA) for edge-based matching of visualobjects,” IEEE Trans Multimedia, vol. 9, no. 1, pp. 120–135, Jan. 2007.

[26] C. F. Xu and H. B. Duan, “Artificial bee colony (ABC) optimizededge potential function (EPF) approach to target recognition for low-altitude aircraft,” Pattern Recognit. Lett., vol. 31, no. 13, pp. 1759–1772,Oct. 2010.

[27] F. Battisti, M. Carli, F. G. B. De Natale, and A. Neri, “Ear recognitionbased on edge potential function,” in Proc. SPIE Int. Soc. Opt. Eng.,Burlingame, CA, USA, 2012, Art. ID. 829508.

[28] Y. Wang, Y. Ma, and Q. Chen, “A method of line matching based onfeature points,” J. Softw., vol. 7, no. 7, pp. 1539–1545, Jul. 2012.

[29] L. Gan and H. B. Duan, “Biological image processing via chaotic dif-ferential search and lateral inhibition,” Optik-Int. J. Light Electron. Opt.,vol. 125, no. 9, pp. 2070–2075, May 2014.

[30] A. Y. S. Lam and V. O. K. Li, “Chemical-reaction-inspired metaheuristicfor optimization,” IEEE Trans. Evol. Comput., vol. 14, no. 3, pp. 381–399,Jun. 2010.

[31] J. Xu, A. Y. S. Lam, and V. O. K. Li, “Chemical reaction optimization fortask scheduling in grid computing,” IEEE Trans. Parallel Distrib. Syst.,vol. 22, no. 10, pp. 1624–1631, Oct. 2011.

[32] H. B. Duan and L. Gan, “Orthogonal multi-objective chemical reactionoptimization approach for the brushless DC motor design,” IEEE Trans.Magn., to be published.

[33] K. Bhattacharjee, A. Bhattacharya, and S. H. N. Dey, “Oppositional realcoded chemical reaction optimization for different economic dispatchproblems,” Int. J. Electr. Power Energy Syst., vol. 55, pp. 378–391,Feb. 2014.

[34] W. R. Zhu and H. B. Duan, “Chaotic predator–prey biogeography-basedoptimization approach for UCAV path planning,” Aerosp. Sci. Technol.,vol. 32, no. 1, pp. 153–161, Jan. 2014.

[35] Y. H. Shi, “An optimization algorithm based on brainstorming process,”Int. J. Swarm Intell. Res., vol. 2, no. 4, pp. 35–62, Oct. 2011.

[36] H. B. Duan, Y. M. Deng, X. H. Wang, and C. F. Xu, “Small and dimtarget detection via lateral inhibition filtering and artificial bee colonybased selective visual attention,” PLoS ONE, vol. 8, no. 8, p. e72035,Aug. 2013.

[37] Y. G. Fu, M. Y. Ding, and C. P. Zhou, “Phase angle-encoded and quantum-behaved particle swarm optimization applied to 3-D route planning forUAV,” IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 42, no. 2,pp. 511–526, Mar. 2012.

[38] J. Brest, V. Zumer, and M. S. Maucec, “Self-adaptive differential evolu-tion algorithm in constrained real-parameter optimization,” in Proc. IEEECongr. Evol. Comput., Vancouver, BC, Canada, 2006, pp. 215–222.

[39] W. Khatib and P. Fleming, “The stud GA: A mini revolution?” in ParallelProblem Solving From Nature, A. Eiben, T. Back, M. Schoenauer, andH. Schwefel, Eds. New York, NY, USA: Springer-Verlag, 1998.

[40] Q. Zhu, G. Song, and J. Shi, “Untangling cycles for contour grouping,”in Proc. IEEE 11th Int. Conf. Comput. Vis., Rio de Janeiro, Brazil, 2007,pp. 1–8.

[41] A. Y. S. Lam, V. O. K. Li, and J. Q. Yu, “Real-coded chemical reactionoptimization,” IEEE Trans. Evol. Comput., vol. 16, no. 3, pp. 339–353,Jun. 2012.

[42] K. Meng, H. G. Wang, Z. Y. Dong, and K. P. Wong, “Quantum-inspiredparticle swarm optimization for valve-point economic load dispatch,”IEEE Trans. Power Syst., vol. 25, no. 1, pp. 215–222, Feb. 2010.

[43] P. S. Oliveto, J. He, and X. Yao, “Time complexity of evolutionary algo-rithms for combinatorial optimization: A decade of results,” Int. J. Autom.Comput., vol. 4, no. 3, pp. 281–293, Jul. 2007.

[44] A. Y. S. Lam, V. O. K. Li, and J. Xu, “On the convergence of chemicalreaction optimization for combinatorial optimization,” IEEE Trans. Evol.Comput., vol. 17, no. 5, pp. 605–620, Oct. 2013.

[45] Google Inc., Google Earth 7.1.2.2041, Mountain View, CA, USA,Oct. 2013. [Online]. Available: http://www.google.com/earth/

Haibin Duan (M’07–SM’08) was born in Shandong,China, in 1976. He received the Ph.D. degree fromNanjing University of Aeronautics and Astronautics,Nanjing, China, in 2005.

He was an Academic Visitor of the NationalUniversity of Singapore, Singapore, in 2007, a Se-nior Visiting Scholar at The University of Suwon,Hwaseong, South Korea, in 2011, an Engineer withShenyang Aircraft Design Research Institute, Avia-tion Industry Corporation of China (AVIC), in 2006,and a Technician with AVIC Aviation Motor Control

System Institute from 1996 to 2000. He is currently a Full Professor with theSchool of Automation Science and Electrical Engineering, Beihang University(formerly Beijing University of Aeronautics and Astronautics), Beijing, China,where he is the Head of the Bio-inspired Autonomous Flight Systems ResearchGroup. He has authored or coauthored more than 70 publications. His researchinterests are multiple unmanned-aerial-vehicle autonomous formation controland biological computer vision.

Lu Gan was born in Sichuan, China, in 1992. Shereceived the B.S. degree in automation from theUniversity of Electronic Science and Technology ofChina, Chengdu, China, in 2013. She is currentlyworking toward the M.S. degree in the State KeyLaboratory of Virtual Reality Technology and Sys-tems, School of Automation Science and ElectricalEngineering, Beihang University (formerly BeijingUniversity of Aeronautics and Astronautics), Bei-jing, China. Her research interests are bio-inspiredcomputation and computer vision.


Recommended