56 IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE MARCH/APRIL 2007
MA
CH
INE
LEA
RNIN
G IN
TH
E LI
FE S
CIE
NC
ES
0739-5175/07/$25.00©2007IEEE
Cardiovascular disease (CVD) is a global epidemicthat is the leading cause of death worldwide (17million deaths per year) [8]. It is the single largestcontributor to “disability adjusted life years”
(DALYs): 10% of DALYs in low- and middle-income nationsand 18% of DALYs in high-income nations. Hence, the WorldHealth Organization and the Centers for Disease Control agreethat CVD is no longer an epidemic but a pandemic. In theUnited States, CVD accounted for 38% of all deaths in 2002[7] and was the primary or contributing cause in 60% of alldeaths. Coronary heart disease (CHD) accounts for more thanhalf of CVD deaths (roughly 7.2 million deaths worldwideevery year, and one of every five deaths in the United States),and it is the single largest killer in the world.
It is well known that early detection (along with prevention)is an excellent way of controlling CHD. CHD can be detectedby measuring and scoring the regional and global motion ofthe left ventricle (LV) of the heart. It typically results in wall-motion abnormalities [i.e., local segments of the LV wallmove abnormally (move weakly, not at all, or out of sync withthe rest of the heart)], and sometimes motion in multipleregions, or indeed the entire heart, is compromised. The LVcan be imaged in a number of ways. The most commonmethod is the echocardiogram, which is an ultrasound video ofdifferent two-dimensional cross sections of the LV.
Unfortunately, echocardiograms are notoriously difficult tointerpret, even for the best of physicians. Inter-observer stud-ies have shown that even world-class experts agree on theirdiagnosis only 80% of the time [12], and intra-observer stud-ies have shown a similar variation when the expert reads thesame case twice at widely different points in time. There is atremendous need for an automated “second-reader” systemthat can provide objective diagnostic assistance, particularly tothe less-experienced cardiologist.
In this article, we address the task of building a computer-aided diagnosis system that can automatically detect wall-motion abnormalities from echocardiograms. We providesome medical background on cardiac ultrasound and the stan-dard methodology used by cardiologists to score wall-motionabnormalities. We also describe our real-life dataset, whichconsists of echocardiograms used by cardiologists at St.Francis Heart Hospital to diagnose wall-motion abnormali-
ties. We then provide an overview of our proposed system,which was built on top of an algorithm that detects and tracksthe inner and outer walls of the heart [3]–[6]. It consists of aclassifier that classifies the local region of the heart wall (andthe entire heart) as normal or abnormal based on the wallmotion. We also describe our methodology for feature selec-tion and classification, followed by our experimental results.
Medical Background Knowledge
What Is Coronary Artery Disease?The human heart is divided into four chambers: the left andright atrium and the left and right ventricle. The LV is the cham-ber responsible for pumping oxygenated blood to the entirebody. As a result, it is the largest and strongest of the four cham-bers. Figure 1 shows the layout of the heart chambers in relationto one another; the LV is in the lower right part of the figure.
The heart is fed by three major coronary arteries: the leftanterior descending (LAD), right coronary artery (RCA),and the left circumflex coronary artery (LCX). All three ofthese vessels feed the muscle surrounding the LV. Coronaryartery disease results from the development of plaque withinthe artery, which usually deposits along the walls. When theplaque restricts normal blood flow to an extreme extent thepatient will experience chest pain, known as angina. Whenthe blood flow to the heart muscle is reduced, the functionof that piece of muscle fed by the blocked artery will beginto become impaired. This is known as ischemia. This func-tional impairment can be seen from ultrasound images ofthe heart, also called echocardiograms (echos).
One of the first effects of coronary artery disease is that themotion of the heart wall during contraction will becomeimpaired. Accurate regional wall-motion analysis of the LV is anessential component of interpreting echos to detect this effect.
Divisions of the HeartThere are many imaging modalities that have been used tomeasure myocardial perfusion, left ventricular function, andcoronary anatomy for clinical management and research; forthis project we chose to use echocardiography. The CardiacImaging Committee of the Council on Clinical Cardiology ofthe American Heart Association has created a standardized
Automated HeartAbnormality Detection UsingSparse Linear ClassifiersA Computer-Aided Diagnosis System for DetectingHeart Wall-Motion Abnormalities from Echocardiograms
BY MALEEHA QAZI, GLENN FUNG,SRIRAM KRISHNAN, JINBO BI, R. BHARAT RAO, AND ALAN S. KATZ
©BRAND X, PHOTODISC
IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE MARCH/APRIL 2007 57
recommendation for the orientationof the heart, angle selection, andnames for cardiac planes and num-ber of myocardial segments [1].This is the standardization used inthis project. Echo images are col-lected from four standard views:apical 4 chamber (A4C), apical 2chamber (A2C), parasternal longaxis (PLAX) or apical 3 chamber(A3C), and parasternal short axis(PSAX) (shown in Figure 2). Theplanes used to cut the heart to dis-play these standard views are dis-played in Figure 3 from reference[2]. The long-axis view extendsfrom the LV apex through the aorticvalve plane. The short-axis view isperpendicular to the long-axis viewresulting in a circular view of theLV. The four-chamber view is per-pendicular to both the long- andshort-axis views and includes theleft and right ventricle and left andright atrium. If one rotates the 4-chamber view plane counterclock-wise about 60◦ , the two-chamberview is obtained, which shows theLV and the left atrium.
The LV is divided into 17myocardial segments. The short-axisview that results in a circular viewof the LV can be taken at three loca-tions: near the apex (apical), at themiddle (mid-cavity), or near thebase (basal). The most desirableview is the mid-cavity cut. If onelays these three resultant ringsagainst one another, all segments ofthe heart are visible in relationshipto one another, as shown in Figure 4(modified from reference [1]). TheLAD feeds segments 1, 2, 7, 8, 13,14 and 17; the RCA feeds segments3, 4, 9, 10 and 15; and the LCXfeeds segments 5, 6, 11, 12, and 16.
Understanding the DataThe data are based on standard adulttransthoracic B-mode ultrasoundimages collected from the four stan-dard views described previously.Currently, we only utilize two of thefour possible views: A4C and A2C,which show 12 of the 16 total seg-ments [we ignore the apex (segment17) since it is near impossible tomeasure]. These 12 views areenough to achieve our goal of classi-fying hearts. Even though we haveimages at different levels of stress(resting, low-dose stress, peak-dose
Fig. 1. Major parts of heart labeled, including the four chambers of the human heart:the left and right atrium, and the left and right ventricle.
1) Superior Vena Cava
To the Lungs
6) Pulmonary Valve
From the Lungs(To Left Atrium)
4) Tricuspid Valve
2) Inferior Vena Cava
13) Aorta
7) Pulmonary Artery(To the Lungs)
8) Pulmonary Veins(From the Lungs)
10) Mitral Valve
12) Aortic Valve
Oxygenated Blood
Unoxygenated Blood
3) RightAtrium
5) RightVentricle
11) LeftVentricle
9) LeftArium
Fig. 2. Echocardiographic views for wall-motion evaluation. In the short-axis view, at thebase and midventribular levels, the left ventricle is divided into anterior septum (2,8) andanterior free wall (1,7), lateral (6,12), posterior (5,11), inferior free wall (4,10), and posteri-or septal (3,9) segments. These same wall segments are seen in apical views as indicat-ed, plus the anterior (13), septal (14), inferior (15), and lateral (16) apical segments areseen. Modified from reference [2] (segment numbers have been corrected to reflectstandard naming convention being used).
Echo Views for Wall Motion
Short-Axis (Base) Short-Axis (Mid-LV)
LVRV2 1
6
54
3 LV
RV 8 7
12
1110
9
16 14
8
2
AoLA
5
11 LV
16 13
7
1
LA
4
10 LV
16
12
6
LARA
14
9
3
RVLV
Apical 4-Chamber Apical 2-Chamber Apical Long-Axis
58
stress, recovery), this work is based on images taken when thepatient was resting. The goal is to automatically provide aninitial score, or classification, to determine whether a heart isnormal or abnormal given the ultrasound.
The echo data was collected from St. Francis HeartHospital in Roslyn, New York. The data consist of 141
cases that will be used for training and 59 cases that are ear-marked as the final test set; all of which were generatedusing exercise stress. All the cases have been labeled at thesegment level by a group of trained cardiologists. The heart-level classification labels can be obtained from the segment-level labels by applying the following definition given to usby the doctors: a heart is considered abnormal if two ormore segments are abnormal.
Preparation of the DataOur application consists of two main parts: image processingand classification. The echos are run through an algorithmthat automatically detects and tracks both the interior (endo-cardial) and exterior (epicardial) borders of the LV [4], [6].Motion interferences (e.g., probe motion, patient movement,respiration, etc.) are compensated for by using global motionestimation based on robust statistics outside the LV. This isdone so that only the heart’s motion is analyzed. Then numer-ical feature vectors, which are extracted from the dual con-tours tracked through time, form the basis for the regionalwall-motion classification.
Image ProcessingThe first step toward classification of the heart involvesautomatic contour generation of the LV [5]. Ultrasound isknown to be noisier than other common medical imagingmodalities such as MRI or CT, and echos are even worsedue to the fast motion of the heart muscle and respiratoryinterferences. The framework used by our algorithm is idealfor tracking echo sequences since it exploits heteroscedastic(i.e., location-dependent and anisotropic) measurementuncertainties. The process can be divided into two steps:
border detection and border tracking.Border detection involves localizing theLV on multiple frames of the imageclip (shown in Figure 5 as a box drawnaround the LV), and then detecting theLV’s shape within that box. Seventeencontrol points are placed along the inte-rior border of the LV to show where theborder was detected. These points arethen extended outward to find the exter-nal (epicardial) border of the LV.
Border tracking involves trackingboth these contours together from oneframe to the next through the entiremovie clip. Motion interferences (e.g.,probe motion, patient movement, respi-ration, etc.) are compensated for byusing global motion estimation basedon robust statistics outside the LV. Thisglobal motion estimation can be seen inFigure 6 as a vertical line near the cen-ter of the image.
After detection and tracking, numeri-cal features are computed from the dualcontours tracked through time. The fea-tures extracted are both global (involv-ing the whole LV) and local (involvingindividual segments visible in theimage) and are based on velocity, thick-ening, timing, volume changes, etc.
IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE MARCH/APRIL 2007
Fig. 4. Display, on a circumferential polar plot, of the 17 myocardial segments andthe recommended nomenclature for tomographic imaging of the heart. Modifiedfrom reference [1].
1. Basal Anterior2. Basal Anteroseptal3. Basal Inferoseptal4. Basal Inferior5. Basal Inferolateral6. Basal Anterolateral
13. Aptical Anterior14. Aptical Septal15. Aptical Inferior16. Aptical Lateral17. Apex
7. Mid Anterior 8. Mid Anteroseptal 9. Mid Inferoseptal10. Mid Inferior11. Mid Inferolateral12. Mid Anterolateral
1
7
13
17
15
10
4
39
14
82
16
126
11
5
A4C
A3C
A2C
Left Ventricular Segmentation
Fig. 3. The three basic image planes used in transthoracicechocardiography. The ventricles have been cut away toshow how these image planes intersect the left and rightventricles. Dashed lines indicated the image planes at thegreat vessel and the atrial levels. From reference [2].
Short Axis View
4-Chambers View
Long Axis View
S KAPLAN
IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE MARCH/APRIL 2007 59
Extracted FeaturesA number of features have been developed to characterizecardiac motion in order to detect wall-motion abnormalities,among them: global and local ejection fraction (EF) ratio,radial displacement, circumferential strain, velocity, thick-ness, thickening, timing, eigenmotion, curvature, and bend-ing energy. Some of these features, including timing,eigenmotion, curvature, local EF ratio, and bending energy,are based only on the inner (endocardial) contour. Due to thepatient examination protocol, only the systole (i.e., contrac-tion phase of the heart) is recorded for some patients. Inorder for the features to be consistent, the systole is extractedfrom each patient based on the cavity area change. For eachframe, the LV cavity area can be estimated accurately basedon the inner (endocardial) contour of that frame. The framecorresponding to the maximal cavity area that is achieved atthe end of diastolic phase (expansion phase of the heart) isthe frame considered to be the beginning of systole. Theframe corresponding to the minimal cavity area (achieved atthe end of systolic phase) is the frame assumed to be the endof systole. For the time being, all features are computedbased only on the systolic phase. However, the methods usedto calculate the features are generally applicable for the dias-tolic phase as well.
The following list describes some of the many features.➤ Timing-based features examine the synchronousness of the
cardiac motion; i.e., whether all the points along the LVmove consistently or not.
➤ Eigenmotion-based features determine the most significantmoving direction of a point and the amount of its motionin that direction.
➤ Curvature-based features are mainly aimed at detectingabnormalities at the apex. This is also useful in identifyingmore general abnormalities associated with cardiac shapes.If a segment is dead, it may still move because it is con-nected to other segments, but we can observe that its shapewill largely remain unchanged during the cardiac cycles.Curvature can capture this type of information.
➤ Local EF ratio features are aimed atcapturing local cardiac contractionabnormalities.
➤ Bending energy features of thecontour, assuming that the provid-ed contour is made of elasticmaterial and moving under ten-sion, may be used to capture thecardiac contraction strength of asegment or the whole LV.
➤ Circumferential strain features, alsocalled fractional shortening, mea-sure how much the contourbetween any two control pointsshrinks in the systolic phase.
We had a total of 192 local (i.e., cal-culated per segment) and global (i.e.,involving the whole LV, as shown inany one view) features, all of whichwere continuous. They included thefeatures mentioned above as well asothers not described here. As a generalrule, the global version of certain fea-tures (e.g., radial displacement, radial
velocity, etc) can be calculated by taking the mean, or stan-dard deviation, of the six segments’ respective feature valuesfrom any one view.
Data MiningThe classification algorithm used in the system is based on anovel feature selection technique, which is in turn based onmathematical programming. As a result, we obtain a hyper-plane-based classifier that only depends on a subset of numeri-cal features extracted from the dual contours tracked throughtime, and these are then used to provide classification for eachsegment and the entire heart.
Classification and Feature SelectionOne of the difficulties in constructing a classifier for this taskis the problem of feature selection. It is a well-known fact thatreducing the feature dependence of a classifier improves theclassifier’s generalization capability. However, the problem
of selecting an “optimal” minimumsubset of features from a large pool(which is in the order of hundreds) ofpotential original features is known tobe non-deterministic polynomial time(NP)-hard. Recently, Mika et al. pro-posed a novel mathematical program-ming formulation for linear Fisher’sdiscriminant (LFD) using kernels [16],[15]. This new formulation included aregularization term similar to that usedin the standard support vector machine(SVM) formulation [17]. We willmake use of Mika’s formulation butuse a 1-norm instead of the 2-norm toobtain solutions that are more sparseand hence dependent on a smallernumber of features. The next sectiondescribes the details of the approach.
Linear Fisher’s DiscriminantThe general idea behind LFD is to findthe best subspace mapping such that itcaptures the best separation between
Fig. 5. One frame from an A4C image clip with the box show-ing the localized left ventricle, and the dots representing thecontrol points along the detected inner contour.
Fig. 6. One frame from an A4C image clipwith the outer and inner contour controlpoints shown. The vertical line near themiddle shows use of global motion com-pensation, and the two squares denotethe centers of the individual contours.
60
the classes. Our problem involves binary classification; i.e.,there are only two classes: positive (abnormal heart), and neg-ative (normal heart) {±}.
Let Ai ∈ Rd×l be a matrix containing the l training datapoints on d-dimensional space and li the number of labeledsamples for class i, i ∈ {±}. LFD [11] is the projection vectorα, which maximizes,
J(α) = αTSBα
αTSWα(1)
where
SB = (m+ − m−) (m+ − m−)T
SW =∑
i∈{±}
1
li
(Ai − mie
Tli
) (Ai − mie
Tli
)
are the between and within class scatter matrices, respectively,and mi = (1/li)Aieli is the mean of class wi and eli is an lidimensional vector of ones. For (1) to be maximized, thenumerator should be large, which represents the inter-classdivision (we want to push the classes as far apart as possible),and the denominator should be small, which represents theintra-class division (we want the points of any one class to beas near to one another as possible).
Transforming the above problem into a convex quadraticprogramming problem provides us some algorithmic advan-tages. First, notice that if α is a solution to (1), then so is anyscalar multiple of it. Therefore, to avoid multiplicity of solu-tions, we impose the constraint αTSB α = b2, which is equiva-lent to αT(m+ − m−) = b where b is some arbitrary positivescalar. Then, the optimization problem (1) becomes
minα∈Rd αTSWα
s.t.αT(m+ − m−) = b
(2)
For binary classification problems the solution of thisproblem is
α∗ = bS−1W (m+ − m−)
(m+ − m−)T S−1W (m+ − m−).
(3)
According to this expansion, since S−1W is positive definite,
unless the difference of the class means along a given fea-ture is zero, all features contribute to the final discriminant.If a given feature in the training set is redundant, its contri-bution to the final discriminant would be artificial and not
desirable. As a linear classifier, LFD is well suited to han-dle features of this sort provided that they do not dominatethe feature set; i.e., the ratio of redundant to relevant fea-tures is not significant. Although the contribution of a sin-gle redundant feature to the final discriminant would benegligible when several of these features are available atthe same time, the overall impact could be quite significantleading to poor prediction accuracy. Apart from thisimpact, in the context of LFD these undesirable featuresalso pose numerical constraints on the computation of S−1
Wespecially when the number of training samples is limited.Indeed, when the number of features, d is higher than thenumber of training samples, l, SW becomes ill-conditionedand its inverse does not exist. Hence, eliminating the irrele-vant and redundant features may provide a two-fold booston the performance.
In what follows we propose a sparse formulation of LFD.The proposed approach incorporates a regularization con-straint on the conventional algorithm and seeks to eliminatethose features with limited impact on the objective function.
Sparse Linear Fisher’s DiscriminantVia Linear ProgrammingWe propose a formulation similar to the one used for 1-normSVM classifiers [9], where the 1-norm is introduced for bothmeasuring the classification error and regulation. The use ofthe 1-norm instead of the 2-norm leads to linear programmingformulations where very sparse solutions can be obtained. Asparse projection vector α implies that many input space fea-tures do not play a role in determining the linear classifier. Inother words,
αi = 0 ⇒ the classifier does not depend on feature i.
Our objective is to formulate an algorithm that can be seen asan approximation to (1) and that provides a sparse projectionvector α. In order to achieve this, we add a regularization termto the objective function of (2):
minα∈Rd ναTSWα + ‖α‖1
s.t.αT(m+ − m−) = b
(4)
where ν is the trade-off between J(α) maximization and regu-larization or sparsity of the projection vector α. The price topay for sparsity of the solution is that, unlike (2), there is no aclosed-form solution for the constrained quadratic in (4); fur-thermore, the parameter ν introduced in (4) has to be chosenby means of a tuning set that requires the problem to besolved several times and that can be computationallydemanding. In order to address this issue we propose next a
IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE MARCH/APRIL 2007
Echocardiograms are notoriously
difficult to interpret, even for the best of physicians.
Inter-observer studies have shown that even world-class
experts agree on their diagnosis only 80% of the time.
IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE MARCH/APRIL 2007 61
linear programming formulation that can be interpreted as anapproximation to (4) and that results in sparser solutions than(4). Let’s consider the following matrix:
HT =[
1√l+
(A+ − m+eT
l+
)T 1√l−
(A− − m−eT
l−
)].
From (1) we have that Sw = HTH, then
αTSWα = αTHTHα
= (Hα)T(Hα)
= ‖Hα‖22. (5)
Hence the quadratic programming problem (4) can be rewrit-ten as
minα∈Rd ν‖Hα‖22 + ‖α‖1
s.t.αT(m+ − m−) = b.
(6)
We can now use the 1-norm instead of the 2-norm in theobjective function of (6) to obtain the following linear pro-gramming formulation that can be solved more efficiently andgives sparser solutions:
minα∈Rd ν‖Hα‖1 + ‖α‖1
s.t.αT(m+ − m−) = b.
(7)
That this problem is indeed a linear program can be easilyseen from the equivalent formulation:
minα∈Rd νe ′s + e ′ ts.t.
αT(m+ − m−) = b−s ≤ Hα ≤ s−t ≤ α ≤ t.
(8)
Next, we propose an algorithm based on (8) and (3) that pro-vides accurate LFD classifiers depending on a minimal set offeatures.
Algorithm 1 Sparse Linear Fisher Discriminant
Given the training dataset {A−, A+} and a set of valuesN = {10−5, 10−4, . . . , 105} for the parameter ν do:
1) For each ν ∈ N calculate cross-validation performanceusing the linear programming (8).
2) Let ν∗ the value for which (8) gives the best cross-validation performance. Let’s call α̂ the obtained sparseprojection.
3) Select the subset F̂ of the feature set F defined byf i ∈ F̂ ⇔ α̂i �= 0 or fi ∈ F̂ ⇔ |α̂i| ≥ tol; that is, selectthe features corresponding to nonzero components of theprojection α̂.
4) Solve original quadratic programming problem (1) withclosed-form solution (3) considering only the featuresubset F̂ to obtain a final projection α∗ that depends ononly the “small” feature subset F̂.
Evaluation of Discovered Knowledge
Comparison to Other AlgorithmsIn order to empirically demonstrate the effectiveness of theproposed approach, we compared our feature-selection algo-rithm, sparse LFD (SLFD), to four other well-known classifica-tion algorithms. The first algorithm is a very popular publiclyavailable implementation of SVM called SVMlight [13]. Thisformulation does not incorporate feature selection and pro-duces classifiers that often depend on all the input features. Thepurpose of the comparison is to show that a feature-selectionmethod improves generalization performance on this dataset.The second method included in our numerical comparisons isthe relevance vector machine (RVM) algorithm [14], which isone of the most successful Bayesian methods for feature selec-tion and sparse learning. It finds the relevance of features byoptimizing the model marginal likelihood, also known as theevidence. The third approach consists of applying the standardLFD algorithm [11] without feature selection. The last classifi-cation approach used in our comparisons is the standard 1-norm SVM (SVM1) [9], which, similar to our approach, relieson the 1-norm regularization to obtain sparse classifiers. All theclassifiers were trained using 141 cases and were tested on 59cases. For the methods that needed parameters to be tuned (i.e.,our algorithm and SVMlight), the model parameters weretuned by the means of leave-one-patient-out (LOPO) [10] crossvalidation on the training set. Ten-fold cross validation was notperformed on this task because we wished to simulate a real-world situation where one does not have access to the test casesuntil the actual testing of the final classifier.
We have obtained many different answers from doctors asto what they feel the cost of a false positive (FP) (wronglylabeling the heart abnormal) or false negative (FN) (wronglylabeling the heart normal) happens to be. If this system is usedas an initial reader, then too many FPs or FNs will cause thedoctors to shut down the system because it is too unreliable.But as a validation system the main focus is to keep the FNrate low. In general, a high FP rate means you are sending toomany patients for additional, more expensive tests, whichwould lead to higher costs for health insurance. A high FNrate could mean that a patient might go undiagnosed if thedoctor using the system is not well trained and also missespotential abnormalities. For us, the “cost” of an FN is thushigher than an FP. By focusing on keeping the FN rate low,we lower the risk of missing abnormalities and leave the finaldiagnosis to the expertise of the doctor. Taking this intoaccount, we decided that the best way to evaluate the classifier
Table 1. Areas under curve for the testing set and numberof features selected for the five methods: SLFD, SVMlight,RVM, LFD, and SVM1. (Best results shown in bold.)
Algorithm AUC # of features
SLFD 89.6% 3
SVMlight 87.4% 79*
RVM 85.8% 13
LFD 87.4% 79*
SVM1 89.1% 8
*classifier uses all the features.
62 IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE MARCH/APRIL 2007
performance is to measure the area under the curve (AUC) forreceiver operating characteristic (ROC).
For each algorithm, Table 1 shows the AUC for the testing setand the number of features that the corresponding classifierdepends on. As shown in the table, our method obtained theROC with the largest area and depended on the fewest numberof features (only three) of any of the algorithms tested. This lowfeature dependence is very important in our application since thefeatures used for classification have to be calculated in real time.
Classification ResultsThe three features selected by SLFD were as follows:➤ a feature that measures motion along the significant direc-
tions of movement of the walls of the heart➤ a feature that measures correlation between the estimated
area of the heart cavity and the distance between the wallsof the heart to the center of mass of the heart
➤ the estimated EF of the heart.It is important to note that two of the features (EF feature
and the motion feature) were selected by all the classificationmethods tested. The performance obtained with SVM1 wasthe second best and was almost identical as the one obtainedby SLFD but using eight features compared to only three usedby SLFD. The ROC curve on the testing set for the final clas-sifier is shown in Figure 7. The LOPO cross-validation perfor-mance for the final model was seven FPs and 17 FNs out of 81positives (abnormals) and 60 negatives (normals); i.e., 88.3%of the normal hearts and 79.0% of the abnormal hearts werecorrectly classified.
On the testing set we obtained three FPs and 6 FNs out of39 positives (abnormals) and 20 negatives (normals); i.e.,85.0% of the normal hearts and 84.6% of the abnormal heartswere correctly classified. A three-dimensional (3-D) plotdepicting the final classifier and the test set is shown inFigure 8. These clinical results were presented and publishedat the American College of Cardiology meeting in March2005 under the title “Clinical Evaluation of a Novel AutomaticReal-Time Myocardial Tracking and Wall Motion ScoringAlgorithm for Echocardiography Introduction.”
Conclusion and Future WorkIn this article we addressed the task of building an objectiveclassification application for heart wall-motion analysis, basedon features calculated off of echocardiograms. Our novel fea-ture selection technique results in a robust hyperplane-basedclassifier that achieves the best performance in terms of AUCand number of features selected when compared to three otherwell-known classification algorithms.
The three features selected by our classifier (SLFD) areall global features, and their limited number makes it easierto explain the final classifier to physicians in order to gettheir feedback. In the future, we plan on expanding ourclassification to do segment-level classification for whichwe would identify different levels of CHD severity (nor-mal, hypokinetic, akinetic, dyskinetic and aneurysm),incorporating the use of other standard echocardiographyviews (e.g., A3C, PSAX, PLAX) and including imagesfrom other levels of stress. We would also like to apply aranking algorithm to take advantage of multiclass scoresfor classification. Comparisons of our proposed SLFDalgorithm to other publicly available datasets and medicalapplications are also planned to further explore the poten-tial of the algorithm.
Maleeha Qazi received a B.A. in mathe-matics from the University of Virginia in1999 and an M.S. in computer sciencefrom the University of Wisconsin–Madisonin 2002. She is currently working atSiemens Medical Solutions in Malvern,Pennsylvania. Her research interestsinclude statistical relational learning,
Bayesian networks, bioinformatics, and the application ofmachine learning techniques to the medical domain.
Glenn Fung received a B.S. in pure mathematics fromUniversidad Lisandro Alvarado in Barquisimeto, Venezuela,then earned an M.S. in applied mathematics from UniversidadSimon Bolivar, Caracas, Venezuela, where later he worked as
Fig. 7. ROC curve for the training set.
Fig. 8. Final hyperplane classifier in three dimensions: circlesrepresent normal hearts and stars represent abnormal heartsin the test set.
Result100
80
60
40
20
00 20 40 60 80 100
100-Specificity
Sen
sitiv
ity
Normal HeartsAbnormal Hearts
1.5
1
0.5
0
–0.5
–11
0.5 0 –0.4 0–0.2 0.2 0.4
EF FeatureDistance Feature
Tim
ing
Fea
ture
IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE MARCH/APRIL 2007 63
an assistant professor for 2 years. Later, heearned an M.S. and a Ph.D. in computersciences from the University of Wisconsin–Madison. His main interests are optimiza-tion approaches to machine learning anddata mining, with emphasis on support vec-tor machines. In the summer of 2003 hejoined the Computer-Aided Diagnosis
Group at Siemens Medical Solutions in Malvern,Pennsylvania, where he has been applying machine learningtechniques to solve challenging problems that arise in themedical domain. His recent papers are available atwww.cs.wisc.edu/~gfung.
Sriram Krishnan received the Ph.D. inelectrical engineering (systems) from theUniversity of Michigan in 1998. From1998–2002, he worked as a staff researcherat Acuson Corporation, focusing onadvanced ultrasound beamforming. In2002, he joined the Computer-AidedDiagnosis Group at Siemens Medical
Solutions. His research interests include application of deci-sion support systems in cardiology and application of comput-er vision and machine learning for cardiac imaging.
Jinbo Bi received the B.S. in mathematicsand M.S. in automatic control from BeijingInstitute of Technology, Beijing, China, in1995 and 1998, and the Ph.D. in mathemat-ics from Rensselaer Polytechnic Institute,Troy, New York, in 2003. She is currently astaff scientist working with SiemensMedical Solutions. Her research interests
include mathematical programming, machine learning, kernelmethods, support vector machines, feature selection, drug dis-covery, and medical imaging.
R. Bharat Rao is the senior director ofengineering R&D for the Computer-AidedDiagnosis and Therapy Solutions Groupat Siemens Medical Solutions, Malvern,Pennsylvania. He received the B.Tech inelectronics engineering from the IndianInstitute of Technology, Madras, in 1985,and the Ph.D. in electrical engineering
from the University of Illinois, Urbana-Champaign, in 1993.His research interests include machine learning, classifica-tion, graphical models, probabilistic inference, and opti-mization, with a focus on developing decision-supportsystems that can help physicians improve the quality ofpatient care. He is particularly interested in the developmentof novel data mining methods to collectively mine and inte-grate the various parts of a patient record (lab tests, pharma-cy, free text, images, proteomics, etc.) and the integration ofmedical domain knowledge into the mining process. Hereceived the “Siemens Inventor of the Year” award in 2005.
Alan S. Katz received the B.S. in biology from BrandeisUniversity in 1978 and the M.S. in engineering science/bioinformation systems from the California Institute ofTechnology in 1980. In 1984, he received his M.D. from the
University of Vermont. Dr. Katz did hisresidency in internal medicine at NorthShore/ Memorial-Sloan Kettering and hisfellowship in cardiology at New YorkHospital-Cornell Medical Center. Heserved as director of echocardiography atMiriam Hospital, Rhode Island, and wason the faculty of Brown University from
1987 to 2000. He is currently director of cardiac imagingand informatics at St. Francis Hospital—The Heart Centerin Roslyn, New York. He holds appointments as associateprofessor of medicine at SUNY Stony Brook and adjunctassociate professor of medicine at Brown. Dr. Katz’sresearch interests include echocardiography and digitalimaging in medicine.
Address for Correspondence: Glenn Fung, 4909 WoodburnDrive, Madison, WI 53711. E-mail: [email protected],[email protected].
References[1] M.D. Cerqueira, N.J. Weissman, V. Dilsizian, A.K. Jacobs, S. Kaul, W.K.Laskey, D.J. Pennell, J.A. Rumberger, T. Ryan, and M.S. Verani, “Standardizedmyocardial segmentation and nomenclature for tomographic imaging of the heart:A statement for healthcare professionals from the Cardiac Imaging Committee ofthe Council on Clinical Cardiology of the American Heart Association,”Circulation, vol. 105, pp. 539–542, Jan. 2002.[2] C.M. Otto, Textbook of Clinical Echocardiography. 2nd ed. Philadelphia, PA:W.B. Saunders, 2000.[3] X.S. Zhou, D. Comaniciu, and A. Gupta, “An information fusion frameworkfor robust shape tracking,” IEEE Trans. Pattern Anal. Machine Intell., vol. 27, pp.115–129, Jan. 2005.[4] D. Comaniciu, “Nonparametric information fusion for motion estimation,”in Proc. IEEE Conf. Computer Vision and Pattern Recognition, Madison, WI,June 2003, vol. 1, pp. 59–66.[5] B. Georgescu, X.S. Zhou, D. Comaniciu, and A. Gupta, “Database-guided seg-mentation of anatomical structures with complex appearance,” in Proc. IEEE Conf.Computer Vision and Pattern Recognition, San Diego, CA, 2005, pp. 429-436. [6] D. Comaniciu, X.S. Zhou, and S. Krishnan, “Robust real-time tracking ofmyocardial border: An information fusion approach,” IEEE Trans. Med. Imag.,vol. 23, pp. 849–860, July 2004.[7] American Heart Association (2005), Heart disease and stroke statistics 2005update [Online]. Available: http://www.americanheart.org/downloadable/heart.[8] World Health Organization (2004), The atlas of global heart disease and stroke[Online]. Available: http://www.who.int/cardiovasculardiseases/resources/atlas/.[9] P.S. Bradley and O.L. Mangasarian, “Feature selection via concave minimiza-tion and support vector machines,” in Proc. 15th Int. Conf. Machine Learning,Madison, WI, July 1998, pp. 82–90.[10] M. Dundar, G. Fung, L. Bogoni, M. Macari, A. Megibow, and B. Rao, “Amethodology for training and validating a CAD system and potential pitfalls,” inProc. Computer Assisted Radiology and Surgery Conf., Chicago, IL, June 2004,pp. 837–844.[11] R. Duda, P. Hart, and D. Stork, Pattern Classification. New York: Wiley,2001.[12] R. Hoffmann, T.H. Marwick, D. Poldermans, H. Lethen, R. Ciani, P. van derMeer, H.-P. Tries, P. Gianfagna, P. Fioretti, J.J. Bax, M.A. Katz, R. Erbel, and P.Hanrath, “Refinements in stress echocardiographic techniques improve inter-insti-tutional agreement in interpretation of Dobutamine stress echocardiograms,” Eur.Heart J., vol. 23, no. 10, pp. 821–829, May 2002.[13] T. Joachims (2002), SVMlight [Online]. Available: http://svmlight.joachims.org.[14] Y. Qi, T.P. Minka, R.W. Picard, and Z. Ghahramani, “Predictive automatic rel-evance determination by expectation propagation,” in Proc. 21st Int. Conf. MachineLearning, Alberta, Canada, 2004, pp. 85-92.[15] S. Mika, G. Ratsch, and K.-R. Muller, “A mathematical programmingapproach to the kernel Fisher algorithm,” in Proc. Neural Information ProcessingSystems 2000, Denver, CO, pp. 591–597, 2000.[16] S. Mika, G. Ratsch, J. Weston, B. Scholkopf, and K.-R. Muller, “Fisherdiscriminant analysis with kernels,” in Neural Networks for Signal ProcessingIX, Proc. 1999 IEEE Signal Processing Workshop, Madison, WI, pp. 41–48,Aug. 1999.[17] O.L. Mangasarian, “Generalized support vector machines,” in Advances inLarge Margin Classifiers, A. Smola, P. Bartlett, B. Scholkopf, and D.Schuurmans, Eds. Cambridge, MA: MIT Press, 2000, pp. 135–146.