1
Overview and Introduction to Clinical Proteomics
Young-Ki Paik, Hoguen Kim, Eun-Young Lee, Min-Seok Kwon,and Sang Yun Cho
Summary
As the field of clinical proteomics progresses, discovery of disease biomarkers becomesparamount. However, the immediate challenges are to establish standard operating proce-dures for both clinical specimen handling and reduction of sample complexity and toincrease the ability to detect proteins and peptides present in low amounts. The tradi-tional concept of a disease biomarker is shifting toward a new paradigm, namely, that anensemble of proteins or peptides would be more efficient than a single protein/peptidein the diagnosis of disease. Because clinical proteomics usually requires easy access towell-defined fresh clinical specimens (including morphologically consistent tissue andproperly pretreated body fluids of sufficient quantity), biorepository systems need to beestablished. Here, we address these questions and emphasize the necessity of developingvarious microdissection techniques for tissue specimens, multidimensional fractionationfor body fluids, and other related techniques (including bioinformatics), tools which couldbecome integral parts of clinical proteomics for disease biomarker discovery.
Key Words: biomarker; body fluids; clinical proteomics; translational proteomics;depletion; biorepository; multidimensional fractionation; specimen bank; biomarker panel.
Abbreviations: CSF: Cerebrospinal Fluid, SILAC: Stable Isotope Labeling withAmino acids in Cell culture, FFE: Free Flow Electrophoresis, IMAC: Immobilized MetalAffinity Chromatography, 2DE: 2-dimensional Gel electrophoresis, CBB: CoomassieBrilliant Blue, SELDI: Surface-Enhanced Laser Desorption/Ionization, MALDI: Matrix-Assisted laser desorption/ionization, MDLC: Multi-dimensional Liquid Chromatography,LC: Liquid Chromatography, TOF: Time-of-Flight, CID: Collision-induced dissociation,ETD: Electron Transfer Dissociation, LIT: Linear Ion-Trap, FT: Fourier-Transform, Q:Quadrupole, ELISA; Enzyme-Linked Immunosorbent Assay, SISCAPA: Stable IsotopeStandards with Capture by Anti-Peptide Antibody, AQUA: Absolute Quantitative
From: Methods in Molecular Biology, vol. 428: Clinical Proteomics: Methods and ProtocolsEdited by: A. Vlahou © Humana Press, Totowa, NJ
1
2 Paik et al.
Analysis. Commercial brands are also shown: MARS; Multiple Affinity Removal System,(Agilent, Palo Alto, CA, USA), EnchantTM: EnchantTM Multi-protein Affinity SeparationKit (Pall Life Sciences, Ann Arbor, MI, USA), GradiflowTM: GradiflowTM Separation (LifeBioprocess, Frenchs Forest, Australia), FFETM: BD Free Flow Electrophoresis System(BD Diagnostics, Martinsried/Planegg, Germany), Zoom®: Zoom® Benchtop ProteomicsSystem (Invitrogen Corporation, Carlsbad, CA, USA), Rotofor: Bio-Rad Rotofor® PrepIEF Ccll (Bio-Rad, Hercules, CA, USA), PF2D: ProteomeLabTM PF2D Protein Fraction-ation System (Beckman Coulter, Inc., Fullerton, CA, USA), DIGE: EttanTM DIGE System(GE Healthcare Bio-Sciences AB, Uppsala, Sweden), Deep PurpleTM: Deep PurpleTM TotalPprotein Stain (GE Healthcare Bio-Sciences AB, Uppsala, Sweden), ICATTM: Isotope-coded affinity tags (Applied Biosystems, Foster City, CA, USA), iTRAQTM: iTRAQTM
Reagents (Applied Biosystems, Foster City, CA, USA), Q-TRAPTM: (Applied Biosystems,Foster City, CA, USA).
1. Overview and Scope of Clinical ProteomicsClinical proteomics is defined as comprehensive studies of qualitative and
quantitative profiling of proteins (and peptides) present in clinical specimenssuch as body fluids and tissues. The comparison of specimens from healthy anddiseased individuals may lead to the discovery of a disease biomarker (1). Thebiomarker serves as a molecular signature reflecting stages of disease before orafter treatment and can also be used for prognostic purposes in monitoring theresponse to treatment (2). Clinical proteomics consists of a variety of exper-imental processes, which include the collection of well-phenotyped clinicalspecimens, analysis of proteins or peptides of interest, data interpretation, andvalidation of proteomics data in a clinical context (Fig. 1). After successfulidentification of a few disease biomarker candidates through extensive profiling,
Fig. 1. Clinical and translational proteomics. The key components of experimentalmethods are included in each box.
Overview and Introduction to Clinical Proteomics 3
translational proteomics involving validation with a cohort study follows. Evenafter proper identification and verification of a disease biomarker, it takes quitea long time to prove that this biomarker is applicable to clinical diagnosis orprognosis (3,4).
There has been a remarkable increase in publication of clinical proteomicspapers within a short period of time [more than 800 papers in 2006 (Fig. 2)],coinciding with the rapid growth of proteomics. Reflecting this trend in clinicalproteomics, this chapter aims to present a review of core technologies thatare used in the field of clinical proteomics with respect to sample specimenprocessing, protein separation platforms (e.g., gel-based system or liquid-basedmethods), quantitative labeling, mass spectrometry (MS), and proteome infor-matics tools. It is noteworthy that despite the advent of new technologies,there remain several bottlenecks in the proteomics field such as lack of datasetstandardization, quantification of the proteins of interest, verification of proteinor peptides identified, and an overall strategy for tackling biomarker post-identification. Thus, the pace of biomarker discovery, one of the key agendas ofclinical proteomics, will depend on how well these obstacles or bottlenecks areresolved by technical advancement (4). The following sections address theseissues in the context of clinical proteomics.
Fig. 2. Recent trends in clinical proteomics publications. The distribution of thearticles related to clinical proteomics listed in PubMed is shown here. The key wordsused for searching articles are as follows: query (clinical[All Fields] OR ((“biologicalmarkers”[TIAB] NOT Medline[SB]) OR “biological markers”[MeSH Terms] ORbiomarker[Text Word])) AND (“proteomics”[MeSH Terms] OR proteomics[TextWord] OR proteomic[All Fields] OR “proteome”[MeSH Terms] OR proteome[TextWord]).
4 Paik et al.
2. Sample Specimens and Processing Techniques Used for ClinicalProteomics2.1. General Considerations
Because clinical proteomics rely heavily on the patient specimens, threeimportant factors need to be considered before the selection and preparation ofclinical specimens: (1) selection of the correct clinical samples according to thetype of research, (2) isolation of the appropriate component from the clinicalsamples, and (3) establishment of optimal experimental conditions for eachsample (5,6,7,8). For the selection of correct clinical samples, the relationshipbetween clinical samples and the specific disease should also be considered.For example, although cancer tissue represents a specific cancer, several typesof body fluids from patients may also have a relationship to the cancer. Ifthe selected clinical samples specifically represent the disease, the next stepis to evaluate what components are related to the specific disease. That is,tumor cells in cancerous tissues are surrounded by many types of stromal cells,inflammatory cells, and connective tissues that are directly related to changesin protein expression in the cancer. If the purpose of proteomic analysis isto identify characteristic changes of specific proteins in tumor cells, then theprecise identification of tumor cell percentage that can be increased by tissuemicrodissection would appear to be necessary (5,6,7). As sample specimenconditions directly impact the results of biomarker discovery, well-definedclinical specimens should be used since the discovery of disease biomarkers ismuch easier when the samples have clear anatomical and pathophysiologicaldefinitions. Because clinical specimens are heterogeneous, sophisticated patho-logical discrimination is required for the isolation of specific diseased tissue orbody fluids. Without the expertise of a pathologist at the earliest stage, it maybe difficult to isolate a specifically defined specimen for clinical proteomics.Generally, clinical samples contain variable factors and components originatingfrom the microenvironment of specific tissues. For instance, liver tissues usuallycontain a large amount of blood in the sinusoid and this amount is increasedin tissues with dilated sinusoids (9). Lung tissues usually contain depositedexogenous materials and this amount is increased in heavy smokers (10). Notethat the amount of blood present in isolated tissues may directly influence therelative proportion of proteins found in clinical specimens. Deposited materialsand the other chemicals such as stain dye and fixatives used in the microdis-section may also influence the experimental conditions (11). In the analysis ofclinical samples, suitable buffer conditions, minimal lysis time, and high-yieldprotein precipitation are highly recommended. To avoid substantial variationsbetween experiments using clinical specimens, a large set of specimens arealso necessary because, unlike cultured cell lines, clinical specimens have high
Overview and Introduction to Clinical Proteomics 5
component variability (12). More details on specific disease types are alsodescribed throughout this volume.
2.2. Body Fluids
Surveying the literature, there appears to be five to six different types ofclinical specimens. Body fluids [e.g., plasma, urine, tear, cerebrospinal fluid,lymph, and ascites], tissues (e.g., liver, heart, muscle, brain, and lung), cells,bone, and hair have all been used for clinical proteomics (Table 1) (13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33). Each has its own meritsand limitations for biomarker discovery via proteomic analysis. Among thosesample specimens, the number of publications using body fluids has increasedrecently, perhaps because of their convenience and ease of use for noninvasivediagnosis. Since those proteins secreted in the body fluids during or after diseasemay reflect a broad range of pathophysiological conditions, much emphasis hasbeen given to identification of prominent protein/peptide biomarkers that exhibitdifferential expression at different stages. In the literature, the terms “bodyfluids” and “biofluids” are being used interchangeably, although the formerindicates a greater likelihood of being obtained directly from the patients, whilethe latter is applied more broadly, referring to liquid or liquid-like samplesobtained from living organisms including model animals and plants. Throughoutthis chapter we will use “body fluids” for clarity.
Given the large dynamic range of protein and peptide sources, plasma (acomplex liquid interface between tissues) and extra cellular fluids may be thebest body fluid to use for clinical proteomics and biomarker discovery (34,35,36,37,38). In addition to plasma, more than a dozen additional body fluids arecurrently used for biomarker discovery, ranging from urine to peritoneal fluids(Table 1). However, the biggest challenge in body fluids proteomics may be themultiple pretreatment processes including depletion of high-abundance proteins(in the case of plasma) (34,35,36) and/or their enrichment (in the case of urine)(15,39) prior to analysis (Table 1). Thus, the outcome of clinical proteomicsmay depend on proper sample processing since the quality of selection andhandling of the most specific type of specimen will affect the overall pattern ofprofiling. Because the details of body fluid proteomics have been well describedby Shen Hu et al. (38), we would like to focus on only a few essential points.
First, standard measures need to be introduced to protect specimens fromnonspecific proteolysis, lysis, and modification during collection and prepa-ration (11). For the standardization of blood sample collection, Tammenemphasizes many useful considerations of preanalytical variables in plasmaproteomics, which can be applied to processes involved with blood specimens[(40) and see Chapter 2]. The more specific problems involved in sample
Tabl
e1
Type
sof
Bio
logi
calS
peci
men
sU
sed
inC
linic
alPr
oteo
mic
s
Typ
eD
isea
seR
efer
ence
Cha
ract
eris
tics
ofth
esa
mpl
esPr
etre
atm
ent
requ
ired
for
prot
eom
ics
Flui
dSe
cret
ions
Plas
ma/
seru
m(1
3,14
)•
Rou
tinel
yac
cess
ible
body
flui
ds•
Ver
yim
port
ant
inth
edi
scov
ery
ofbi
omar
kers
ofdi
seas
es(s
yste
mic
vs.o
rgan
spec
ific
/loca
l)•
Impo
rtan
tfo
rea
rly
dete
ctio
n,di
seas
ese
veri
ty,
prog
nosi
s,m
onito
ring
ofre
spon
seto
ther
apy
•C
onsi
dera
tions
for
sam
ple
adeq
uacy
–St
orag
e–
Hem
olys
is–
Infl
uenc
eof
antic
oagu
lant
s–C
onsi
sten
tre
sults
•C
onsi
der
whe
ther
topo
olsa
mpl
esor
anal
yze
indi
vidu
alsa
mpl
es•
Dep
letio
nof
high
-abu
ndan
cepr
otei
ns(A
lbum
inco
nsis
tof
50%
ofpl
asm
apr
otei
ns)
Uri
neN
asal
disc
harg
ePr
osta
teca
ncer
Seas
onal
alle
rgic
rhin
itis
(15)
(16)
Tea
rsSa
liva
Ble
phar
itis
and
dry
eye
Ora
lan
dbr
east
canc
er(1
7,18
)(1
9)A
mni
otic
-/ce
rvic
alfl
uid
Feta
lan
eupl
oidy
and
intr
a-am
niot
icin
flam
mat
ion
(20,
21)
Folli
cula
rfl
uid
Rec
urre
ntsp
onta
neou
sab
ortio
n(2
2)
Sem
inal
flui
dN
ippl
eas
pira
tefl
uid
Cer
ebro
spin
alfl
uid
Mal
ein
fert
ility
Bre
ast
canc
erB
rain
tum
or
(23)
(24)
(25)
Prox
imal
flui
dSy
novi
alfl
uid
Asc
ites
Bro
nchi
alla
vage
flui
d
Rhe
umat
oid
arth
ritis
Ova
rian
canc
erC
hron
icob
stru
ctiv
epu
lmon
ary
dise
ase,
asth
mat
ics
and
lung
dise
ase
(26)
(13)
(27,
28)
•C
anre
flec
tdi
seas
epe
rtur
batio
nsin
the
orga
nsor
tissu
esfr
omw
hich
they
are
secr
eted
•Pr
oced
ure
ofsy
novi
albi
opsy
isno
tve
rydi
ffic
ult
•M
ucos
aan
dsa
ltha
veto
bere
mov
edne
cess
arily
Bod
yca
vity
flui
d
Pleu
ral
flui
dPe
rito
neal
flui
dL
ung
canc
erO
vari
anca
ncer
(29)
(14)
6
Tis
sue
LC
Mor
LM
PCis
olat
edFo
rmal
infi
xed
Para
ffin
embe
dded
Any
type
ofdi
seas
e(3
0)•
Ver
yim
port
ant
for
the
deve
lopm
ent
ofno
vel
insi
tubi
omar
kers
•Im
mun
oflu
ores
cenc
e,im
mun
ocyt
oche
mis
try,
imag
ing
mas
ssp
ectr
omet
ry
•C
onsi
dera
tions
for
sam
ple
adeq
uacy
•In
tegr
ity,
degr
adat
ion
ofpr
otei
n•
Con
tam
inat
ion
(mic
roor
gani
sms,
extr
aneo
usm
ater
ial)
Cel
lC
ell
lines
or prim
ary
tissu
ecu
lture
Any
type
ofdi
seas
e(3
1)•
Ver
yim
port
ant
inth
edi
scov
ery
ofbi
omar
ker
cand
idat
es•
Val
idat
ion
shou
ldbe
perf
orm
edus
ing
prim
ary
tum
orsa
mpl
es(e
.g.,
imm
unoh
isto
logi
cm
etho
ds,i
mag
ing
MS)
•D
esal
ting
and
rem
oval
ofm
edia
com
pone
nt
Bon
eC
artil
age
Rhe
umat
oid
arth
ritis
(32)
•C
artil
age
cons
ists
mai
nly
ofex
trac
ellu
lar
mat
rix,
mos
tlym
ade
ofco
llage
nsan
dpr
oteo
glyc
ans
•C
etyl
pyri
dini
umch
lori
deef
fect
ivel
yag
greg
ate
with
prot
eogl
ycan
Hai
r(3
3)•
Ove
r30
0pr
otei
nsw
ere
foun
dto
cons
titut
eth
ein
solu
ble
com
plex
form
edby
tran
sglu
tam
inas
ecr
ossl
inki
ng
•N
eed
tosu
ffic
ient
extr
actio
nof
prot
ein
from
inso
lubl
eco
mpl
ex
7
8 Paik et al.
handling are also addressed by Rai et al. (41). Second, to increase the dynamicrange of detection and reduce sample heterogeneity, pretreatments such asdepletion of high-abundance proteins appear to be required (34,35,36). Inaddition, many pretreatment steps to remove high-abundance proteins may berequired during initial sample processing. Multiple fractionations of clinicalsamples prior to major separation work would reduce the sample complexity.Note that coremoval of low-abundance proteins during this type of multipledepletion (36,42) and modification of proteins of interest during or afterisolation (43) should be considered as well. For several problems encounteredwith specimen collection, Xiao et al. (Chapter 13) in this volume also describedifferent methods to isolate extra cellular matrix (ECM) and analyze theproteome of secreted vesicles. These methods will be useful for studying ECMand secreted vesicles in various samples ranging from the primary culturedcells to tissue specimens. Therefore, one must consider the best options for thisprocess before doing the main experiment.
2.3. Tissues and Other Samples
Usually tissues are used as primary screening samples to find direct causesof disease from the lesion present in tissues of the corresponding organ, forexample, liver tissue in hepatocellular carcinoma (HCC) (44,45). Tissues arewidely used for clinical proteomics, although there are no standing operationprocedures in specimen fractionation and the detection limit of current instru-mentation remains borderline. As listed in Table 1, many cancer tissues can beprepared in different ways such as laser capture microdissection (LCM) (5,6),pressures catapulting techniques [laser microdissection and pressure catapulting(LMPC)] (30,46), and formalin-fixed paraffin-embedded sample preparation(11). Theses techniques are well described in Chapters 3, 5, 9, and 11 in thisvolume. It is desirable, however, that proteomics studies of disease tissuesshould also be coupled with parallel analysis of the corresponding body fluids.For example, for the study of cancer biomarkers, paired cancer tissue sets (tumorvs. nontumor) and the same patient’s plasma were used, which led to a morecomprehensive analysis (47,48). Experiments on tissue samples may mostly besuitable for pathophysiological studies rather than biomarker discovery due tothe complexity of the sample.
In specimen processing for proteomics studies, there are usually severalunwanted problems such as artifacts created during sample collection, processing,and storage. Other matters arise in the handling of patient information regardingsex, age, and race (49). To minimize those problems associated with systematicsample handling, it is plausible to establish a specimen bank (50,51,52). In fact,the collection of many clinical samples in a biorepository would have enormous
Overview and Introduction to Clinical Proteomics 9
benefits for proteomic research. This enables the selection of homogeneousclinical samples according to the research purposes and isolation of specificcomponents from clinical samples. Additionally, large scale collection of clinicalspecimens in a biorepository is essential for the validation of specific markersafter biomarker candidate discovery. Ideally, the clinical samples stored in thebiorepository should be (1) collected and stored immediately because dead cellsand altered proteins affect proteomic analysis, (2) subjected to accurate qualitycontrol, and (3) catalogued by reliable and secure clinical data. The quality controlof clinical samples includes trimming of specimens and confirmation of diagnosisby pathologists; information gained (such as the confirmation of tumor cell andstromal cell ratio, percentage of necrosis, percentage of fibrosis, proportion ofinfiltrated inflammatory cells, etc.) should be stored in a database of clinicalsamples. It is also essential to store clinical and follow-up data for each sampleand each patient’s written informed consent form in the biorepository network.This clinical specimen banking network provides convenience, reduced budget,and reliability for researchers involved in clinical proteomic research (50,51,52).
For representative tissue sample collection for proteomics studies, Diaz et al.(Chapter 3) address a practical experimental strategy for storage and handling ofsample specimens that are used in surface-enhanced laser desorption/ionization(SELDI), 2D gel, and liquid chromatography (LC)-based proteomics. Emphasisshould be given to the primary responsibility of pathologists in the wholeprocess of tissue proteomics in addition to morphological analysis at themolecular level.
3. Biomarker Discovery and Clinical ProteomicsGiven that one of the central issues of clinical proteomics is biomarker
discovery and its application, a brief account of this subject is appropriatehere. An excellent review of the whole arena of biomarker development can befound elsewhere (53,54,55). Until now, it has been generally accepted that aconventional concept of a disease biomarker would be a single protein/peptidewith high specificity, which is usually present in low abundance, expressed ina disease in a stage-specific manner, and serve as a major fingerprint of thebody’s response to drugs or other treatments. Although many examples of broadbiomarkers for various diseases are known (56,57,58,59,60), identification ofmore specific and selective biomarkers is urgently needed. Accordingly, wemay also need to change the current biomarker concept and eliminate theinherent bias toward individual disease biomarkers. Recently, a new idea hasbeen introduced that an ensemble of different proteins would be more efficientthan a single protein/peptide in the diagnosis of disease (61,62,63). To solve
10 Paik et al.
this problem we propose a general strategy of clinical proteomics leading todisease biomarker discovery as outlined in Fig. 3.
Since biomarker candidate proteins could come from many different cellularprocesses, they could be either in low abundance or high abundance, whichwould directly or indirectly reflect the physiological condition of the body.Perhaps they are present in different concentrations depending on the diseasestage or tissue type. For example, common proteins such as Hsp 27 (64,65), 14-3-3 proteins (66,67), apoA-I (68,69), and serum amyloid precursorA (70) appear in most of disease samples from lung cancer, gastric cancer,pancreatic cancer, prostate cancer, neuroblastoma and, inflammation. A numberof questions then arise: should they be treated as disease-specific or diseasenonspecific proteins? What would be the criterion to make this decision? Is thisdue to the fact that the number and type of proteins secreted from a specific
Fig. 3. The concept of the creation of a protein biomarker panel for a specificdisease. Each white, gray, dark-gray, and black circle represents a putative proteinbiomarker of a specific disease at that clinical stage. A group of slash-lined circlessymbolizes the biomarker panel of liver disease as an example.
Overview and Introduction to Clinical Proteomics 11
physiological condition of many different types of diseases might be similar?How one can distinguish one type of disease from another simply by lookingat their protein profiles?
As outlined in Fig. 3, at the beginning of certain disease, signals at earlierstages may be limited to only a few easily counted molecules. As the diseaseprogresses, more signal molecules might have been produced, resulting in mixedtypes of biomarkers representing multiple disease phenomena. Although thisassumption seems to be oversimplified, more noise is created at a certain stagewhere it becomes more difficult to identify those molecules at the molecularlevel because of two reasons: (1) they are in amounts too small to be detectedusing the current technology and (2) it may be too premature for the moleculesto be specific for a particular disease. Presumably, proteins appearing in stage 3or 4 may have higher specificity of a particular disease but the sensitivity mightbe low. It may be likely that this noise interferes with the signaling pathway ofa certain disease, and we may end up having no decisive marker. To circumventthis problem, it may be desirable to identify a set of biomarker candidateproteins, termed a “biomarker panel,” which ideally contains potential candidateproteins or peptides that represent specific stages of the disease as a group.Given this panel, extensive validation processes may be sought using largegroup cohort. Analogous to this strategy, many biomarker candidates at stage 1can be included in the panel, which can have more specificity and sensitivity ascompared to a single molecule biomarker. Using this kind of biomarker panel,one can use not only this molecule as diagnostic marker but also as a prognosticindicator in monitoring treatment effectiveness. For example, Linkov et al. (61)reported that both the sensitivity and specificity were improved up to 84.5 and98%, respectively, when they used a panel containing 25 multimarkers in earlydiagnosis of head and neck cancer (squamous cell cancer of the head and neck)(61). In the diagnosis of prostate cancer, specificity was increased from 5–15to 84–95% when they used a biomarker panel containing six marker proteinsas compared to a single marker. In HCC, studies have been carried out on abiomarker panel consisting of a protein array that can be used as a diagnostickit (62,63).
A general strategy for biomarker discovery is outlined in Fig. 4. In typicalclinical proteomics, work sample collection is the first step, followed bypretreatment of the sample in order to reduce sample complexity to enablesearching for low-abundance proteins (e.g., disease biomarkers) using variousfractionation tools. This multidimensional fractionation is well-describedelsewhere (34,35,36), and depends on the properties and concentration of thesample. Typically the prefractionated samples go either to a two-dimensionalelectrophoresis (2DE) or LC-based proteomics separation system, followed bysingle or multiple steps of mass spectrometric analysis depending on the sample
Fig.
4.
12
Overview and Introduction to Clinical Proteomics 13
quantity and experimental goal. The data obtained from this series of analyseswill be integrated into the proteome informatics system where protein/peptideidentification, quantification, modification, and verification of peak list arecarried out [(71) and also Chapter 19]. Usually this step becomes rate limitingsince major profiling data are constructed and analyzed at this point. Theclinical relevance of those proteins (and changes in their expression level) ina specific disease state is mostly determined, which eventually leads to identi-fication of biomarker candidates. In addition, SELDI, molecular imaging andprotein microarrays can also be applied before or after this step. Once majorbiomarker candidates are identified, those proteins are subjected to furtherverification via sophisticated analytical arrays and translational proteomics,which involves cohort studies, pre-evaluation, and a robust analytical system(4,72). Throughout the process of translational proteomics, one may be able tojudge whether the identified panel or single proteins are suitable for biomarkersof a specific disease. A recent comprehensive review by Zolg (73) addressedseveral considerations in the biomarker development pipeline from discoveryto validation. Three critical challenges within the pipeline are reduction ofclinical sample complexity, the proof of principle of biomarker function, andthe detection limit of unique proteins present in the samples.
In the search for biomarker panels, reliable statistical tools and bioinfor-matics resources are needed, which are now available on the web (Table 2;see also Chapters 16 and 17). As the number of biomarker panel candidatesincreases, more cases are being examined, which require statistical learningmethods. These methods include neural networks, genetic algorithms, k-means
�Fig. 4. A typical experimental strategy for clinical proteomics and transla-
tional proteomics. In clinical proteomics research, various experimental techniquesare included: specimen collection, prefractionation, 2DE, Non2DE (liquid-basedseparation), mass spectrometry, informatics, and others. The course of each section asmarked (square, circle in different color) is determined by the investigators, dependingon the experimental goal. At the bottom, experimental procedures for the verificationand validation of biomarker candidates are schematically outlined leading to clinicalscreening and applications. The squares indicate the separation system based on thespecific characteristics of proteins and general prefractionation system. The open circlesand open triangle represent analytical modules at the protein and peptide level, respec-tively. The arrow and junction points indicate an option of each selection. Bottom partsindicate verification procedure employing multiple reaction monitoring and quantitativemass analysis. Those biomarker candidates identified from typical clinical proteomicswould be subject to translational proteomics for validation where a large scale cohortstudy and evaluation would then proceed.
14 Paik et al.
nearest-neighbor analysis, euclidean distance-based nonlinear methods, fuzzypattern matching, selforganizing mapping, and support vector machines(74,75,76,77,78). They are very useful for classification of proteins accordingto the specific disease state (see also Chapters 16 and 20). Once biomarkercandidates are identified, it is necessary to predict in silico the function ofthese proteins and validate them in the context of clinical application. Table 3provides web resources, which can be used for clinical data management, insilico functional annotation (see Chapter 18), prediction, and identification ofmodified forms of proteins. Thus, by combining experimental methods (Fig. 4)and informatics tools (Tables 2 and 3), one is able to obtain a set of biomarkercandidate proteins (panel) that would be further used for validation throughtranslational proteomics (Fig. 1).
4. Introduction of the Experimental Strategy Describedin This Volume
For protein profiling and identification, proteomics platform technologiesare moving forward in many areas not only in clinical proteomics but also inthe general biological field. In this section, the leading scientists in the fieldof proteomics outline core techniques and their application to the studies ofclinical proteomics. For example, in plasma proteome analysis, it is necessaryto deplete high-abundance proteins using various techniques such as multidi-mensional fractionation by immunoaffinity column, gel permeation, and beads(Fig. 4). Cho et al. (Chapter 4) addresses this in relation to 2D gel analysis ofplasma wherein the technical details of sample preparation, gel electrophoresis,and quantification of proteins on the gel are described. Zhang and Koay(Chapter 5) describe the methods of 2D gel analysis for cells prepared byLCM. They describe the application of LCM in dissecting tumor cells inbreast cancer for macromolecular extraction and 2D gels. This can be usedfor preparation of samples from paraffin-embedded tissue blocks in microdis-secting the cells of interest. Further to this procedure, Mustafa et al. (Chapter 9)review the application of LCM for proteomics analysis and demonstrate thatcombining LCM and MS would facilitate identification of specific proteinsfor each sample type. For urine sample analysis, Zerefos et al. (Chapter 8)provide simple protocols for protein analysis by 2D gel or direct matrix-assistedlaser desorption/ionization-time-of-flight mass spectrometry. These techniquesinclude protein enrichment through protein precipitation and ultrafiltrationmeans. Combining these methods with the above profiling technologies allowsreproducible and sensitive analysis of one of the most significant and complexbiological samples (77).
Overview and Introduction to Clinical Proteomics 15
Table 2Clinical Proteomics Initiatives and Resources
Details Websites
InstituteCPTI National Cancer Institute’s Clinical
Proteomics Technologies, initiative forcancer
http://proteomics.cancer.gov
ABRF The Association of BiomolecularResource Facilities, an internationalsociety dedicated to advancing core andresearch biotechnology laboratoriesthrough research, communication, andeducation
http://www.abrf.org/
PPI Plasma Proteome Institute, the PPI isworking to facilitate clinical adoption ofadvanced diagnostic tests using proteinsin plasma and serum
http://www.plasmaproteome.org/plasmaframes.htm
EDRN The Early Detection Research Network,the EDRN provide up-to-dateinformation on biomarker researchthrough this website and scientificpublications
http://edrn.nci.nih.gov
Web resourcesExPASy Expert Protein Analysis System,
proteomics related information anddatabase
http://www.expasy.org/
NCBI National Center for BiotechnologyInformation, the protein entries in theEntrez search and retrieval system havebeen compiled from a variety of sources,including SwissProt, PIR, PRF, PDB,and translations from annotated codingregions in GenBank and RefSeq
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db = Protein&itool = toolbar
CPRMap Clinical Proteomics Research Map,updated research article for disease andclinical proteomics
http://www.cprmap.com/
DatabaseMedGene MedGene can make a list of human
genes associated with a particular humandisease in ranking order
http://hipseq.med.harvard.edu/MEDGENE
16 Paik et al.
Table 3Available Bioinformatic Resources for the Analysis of Proteomics Data
Name Description Website URL PMID
Clinical proteome data management systemProteus LIMS for proteomics
pipelinehttp://www.genologics.com
CPAS LIMS for identificationand quantification usingby LC-MS/MS data
16396501
Systems biologyexperiment analysismanagementsystem
A management system forcollecting, storing,and accessing dataproduced by microarray,proteomics, andimmunohistochemistry
http://www.sbeams.org/
16756676
GPM database Open source system foranalyzing, validating,and storing proteinidentification data
http://www.thegpm.org/
15595733
SpectrumMill MS/MS data analysis andmanagement system
http://www.chem.agilent.com/
PhosphorylationGroup-basedphosphorylationscoring method
Prediction ofkinase-specificphosphorylation sites
http://973-proteinweb.ustc.edu.cn/gps/gps_web/
15980451
KinasePhos A web tool for identifyingprotein kinase-specificphosphorylation sitesusing by hidden Markovmodel
http://kinasePhos.mbc.nctu.edu.tw
15980458
NetPhos Sequence andstructure-based predictionof eukaryotic proteinphosphorylation sites
http://www.cbs.dtu.dk/services/NetPhos/
10600390
NetPhosK Prediction ofpost-translationalglycosylation andphosphorylation ofproteins from the aminoacid sequence
http://www.cbs.dtu.dk/services/NetPhosK/
15174133
Overview and Introduction to Clinical Proteomics 17
PredPhospho Prediction of phosphorylationsites using support vectormachine
http://pred.ngri.re.kr/PredPhospho.htm
15231530
PREDIKIN A prediction of substrates forserine/threonine proteinkinases based on the primarysequence of a protein kinasecatalytic domain
http://florey.biosci.uq.edu.au/kinsub/home.htm
16445868
Prosite A prediction of substratesfor protein kinases-basedconserved motif search
http://kr.expasy.org/prosite
17237102
Scansite Prediction of PK-specificphosphorylation site withBayesian decision theory
http://scansite.mit.edu
16549034
Phospho.ELM A database of experimentallyverified phosphorylation sitesin eukaryotic proteins
http://phospho.elm.eu.org/
15212693
Human proteinreference database(HPRD)
A database of knownkinase/phosphatase substrate aswell as binding motifs that arecurated from the publishedliterature
http://www.hprd.org/PhosphoMotif_finder
PhosphoSite A bioinformatics resourcededicated to physiologicalprotein phosphorylation
http://www.phosphosite.org/Login.jsp
15174125
GlycosylationNetOGlyc 2.0 Predicts O-glycosylation sites
in mucin-type proteinshttp://www.cbs.dtu.dk/services/NetOGlyc/
9557871
DictyOGlyc 1.1 Predicts O-GlcNAc sites ineukaryotic proteins
http://www.cbs.dtu.dk/services/DictyOGlyc/
10521537
YinOYang 1.2 Predicts O-GlcNAc sites ineukaryotic proteins
http://www.cbs.dtu.dk/services/YinOYang/
NetNGlyc 1.0 Predicting N-glycosylationsites
http://www.cbs.dtu.dk/services/NetNGlyc/
16316981
GlycoMod Web software for prediction ofthe possible oligosaccharidestructures in glycoproteinsfrom their experimentallydetermined masses
http://www.expasy.ch/tools/glycomod/
11680880
(Continued)
18 Paik et al.
Table 3(Continued)
Name Description Website URL PMID
Glyco-fragment A web tool to supportthe interpretation ofmass spectra of complexcarbohydrates
http://www.dkfz.de/spec/projekte/fragments/
14625865
GlycoSearchMS Compares each peakof a measured massspectrum with the calculatedfragments of all structurescontained in the SweetDB
http://www.dkfz.de/spec/glycosciences.de/sweetdb/ms/
15215392
GlycosidIQ Based on the matching ofexperimental MS2 data withthe theoretical fragmentationof glycan structures inGlycoSuiteDB
https://tmat.proteomesystems.com/glyco/glycosuite/glycodb
15174134
Saccharidetopologyanalysis tool
A web-based computationalprogram that can quicklyextract sequence informationfrom a set of MSn spectrafor an oligosaccharide of upto 10 residues
10857602
GlycoX To determine simultaneouslythe glycosylation sitesand oligosaccharideheterogeneity ofglycoproteins usingMATLAB
17022651
MODi A web server for identifyingmultiple post-translationalpeptide modifications fromtandem mass spectra
http://www.unimod.org
16845006
SWEET-DB An attempt to createannotated data collectionsfor carbohydrates
http://www.dkfz.de/spec2/sweetdb/
11752350
Protein–protein interactionMunichinformationcenter for proteinsequence’s MPPI
The database of mammalianprotein–protein interactions
http://mips.gsf.de 16381839
Overview and Introduction to Clinical Proteomics 19
Database ofinteracting proteins
A database that documentsexperimentally determinedprotein–protein interactions
http://dip.doe-mbi.ecla.edu/
11752321
Molecularinteraction networkdatabase
A database of storing, ina structured format,information aboutmolecular interactions byextracting experimentaldetails from workpublished in peer-reviewedjournals
http://mint.bio.uniroma2.it/mint
17135203
Protein–proteininteractions ofcancer proteins
Predicts interactions, whichare derived from homologywith experimentally knownprotein–protein interactionsfrom various species
http://bmm.cancerresearchuk.org/˜pip
16398927
IntAct IntAct provides a freelyavailable, open sourcedatabase system andanalysis tools for proteininteraction data
http://www.ebi.ac.uk/intact/
17145710
Biomolecularinteraction networkdatabase
A database designed tostore full descriptions ofinteractions, molecularcomplexes and pathways
http://www.bind.ca 12519993
Metabolic andsignal pathwayBioCarta A pathway database http://www.
biocarta.comKEGG A pathway database with
genomical, chemical, andbiological networkinformation
http://www.genome.jp/kegg
16381885
Cancer cell map The cancer cell map is aselected set of humancancer focused pathways
http://cancer.cellmap.org/cellmap/
HPRD A database withdata pertainingto post-translationalmodifications,protein–proteininteractions, tissueexpression,
http://www.hprd.org/
(Continued)
20 Paik et al.
Table 3(Continued)
Name Description Website URL PMID
subcellular localization,and enzyme–substraterelationships
Proteomic data resourceThe cancer cellmap
A database of clinical datafrom SELDI-TOF
http://home.ccr.cancer.gov/ncifdaproteomics/ppatterns.asp
Proteomicsidentificationsdatabase
A database of protein andpeptide identifications thathave been described in thescientific literature
http://www.ebi.ac.uk/pride/
16381953
PeptideAtlas A multiorganism, publiclyaccessible compendium ofpeptides identified in alarge set of tandem massspectrometry proteomicsexperiments
http://www.peptideatlas.org
16381952
Disease resourceOnlinemendelianinheritance inman
A database of human genesand genetic disorders
http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db = OMIM
17170002
GeneCards An integrated database ofhuman genes that includesautomatically minedgenomic, proteomic, andtranscriptomic information
http://www.genecards.org/index.shtml
15608261
Cancer genecensus
A catalogue those genes forwhich mutations have beencausally implicated in cancer
http://www.sanger.ac.uk/genetics/CGP/Census/
14993899
Two-dimensional electrophoresis is perhaps the most popular start-up toolfor proteome analysis. For clinical proteomics, 2DE has been the traditionalworkhorse of proteomics used for the analysis of different clinical specimensranging from plasma to urine (Table 1). Quantification problems in 2DE are nowsolved by employing fluorescent dyes (cy3 and cy5), which allow normalization
Overview and Introduction to Clinical Proteomics 21
of data obtained from two different clinical specimens (79). Freedman andLilley (Chapter 6) present general optimization conditions for differential in gelelectrophoresis (DIGE) in the quantitative analysis of clinical samples. Theyaddress the usefulness of differentially labeling dyes (Cy2, Cy3, and Cy5).The essence of any DIGE system is to minimize any potential human errorsin the process of identification and quantification of proteins spotted in a 2Dgel (79). The difficulties in 2D map analysis are introduced by Marengo et al.(Chapter 16). They describe methods for comparing protein spots using imageanalysis technology and related informatics tools to minimize variations betweenmeasurements of spot volume, a key to successful 2D map construction.
There are many variations of LC in protein profiling, including mass detectionmethods, column types, data mining through search engines, mass accuracy,and running conditions (80,81,82). These are all related to quantification ofproteins or peptides in the sample, one of the major bottlenecks in proteomics(83,84,85,86,87). Among the several techniques are isotope-coded affinity tags(ICAT), mass-coded affinity tagging, and nonisotope labeled methods. Xiao andVeenstra (Chapter 10) present the application of ICAT in the course of COX-2inhibitor regulated proteins in a colon cancer cell line. With emphasis on samplepreparation, they provide details on ICAT procedures for quantitative proteomics(88). In addition to this approach, Li et al. (Chapter 11) employ a strategy,which combines LCM techniques for sample preparation of HCC and cleavableisotope-coded affinity tags in order to identify those markers quantitatively.However, it should be mentioned here that some other measures are needed toincrease the efficiency of ICAT since it has drawbacks in the efficiency of samplerecovery during or after labeling steps (87). A label-free serum quantificationmethod has been recently introduced (48) (See Chapter 12 by Higgs et al.).
The use of antibody arrays in clinical proteomics has increased recently in thecontext of high-throughput detection of cancer specimens where the identitiesof the proteins of interest are known (89,90). The evaluation of antibody cross-reactivity and specificity is very crucial in these assays. This matter is addressedby Sanchez-Carbayo (Chapter 15), where technical aspects and application ofplanar antibody arrays in the quantification of serum proteins is described aswell as by Hsu et al. (Chapter 14) where the development and use of bead-based miniaturized multiplexed sandwich immunoassays for focused proteinprofiling in various body fluids is provided. The latter method using bead-based protein arrays or suspension microarray allows the simultaneous analysisof a variety of parameters within a single experiment. With the versatility ofsuspension microarray in the analysis of proteins of interest present in differenttypes of body fluids ranging from serum to synovial fluids, this multiplexedprotein profiling technology described by Hsu et al. (Chapter 14) seems tohold a great promise in clinical proteomics. Similarly, in combination with
22 Paik et al.
tissue microarrays technology (91) it would also be possible to perform parallelmolecular profiling of clinical samples together with immunohistochemistry,fluorescence in situ hybridization, or RNA in situ hybridization. SELDI isanother arena of high-throughput profiling of clinical samples in the courseof disease marker discovery [(92,93), Chapter 7]. It is expected that profilingapproaches in proteomics, such as SELDI-MS, will be frequently used in diseasemarker discovery, but only if the proper identification technologies coupledwith SELDI are improved.
During the course of biomarker discovery, large data sets are usuallygenerated and deposited in a coordinated fashion (Tables 2 and 3) (94,95).Indeed, statistical analysis of 2DE proteomics, which produce several hundredprotein spots, is complex. To circumvent some inconsistency in 2D gelproteomics data, Friedman and Lilley (Chapter 6) and Carpentier et al. (Chapter17) point out available statistical tools and suggest case-specific guidelines for2D gel spot analysis. Fitzgibbon et al. (Chapter 19) describe an open sourceplatform for LC-MS spectra where the msInspector program is used to lowerfalse positives and guide normalization of the dataset. It is also demonstratedthat msInspect can analyze data from quantitative studies with and withoutisotopic labels. Paliakasis et al. (Chapter 18) introduce web-based tools forprotein classification, which lead to prediction of potential protein functionand family clustering of related proteins. They provide some guidelines toclassification of protein data into more meaningful families. Finally, Somorjai(Chapter 20) addresses important filtering criteria for the application of proteinpattern recognition to biomarker discovery using statistical tools.
5. Concluding RemarksAlthough there are several bottlenecks in clinical proteomics (such as lack
of standardization of sample specimen process, quantification, and overallstrategy for tackling post-identification of biomarkers), we believe that thefield holds great promise in biomarker discovery. The success of clinicalproteomics depends on the availability and selection of well-phenotypedspecimens, reduction of sample complexity, development of good informaticstools, and efficient data management. Therefore, sample handling techniquesincluding microdissection for tissue sample, multidimensional fractionation forbody fluids, and pretreatment of other clinical specimens (e.g., urine, tears, andcells) should be developed in this context. Since there is no gold standard forsample collection and handling, one needs to find the best options available forsample processing without damage. In addition, establishment of a biorepositorysystem would systematically minimize some artifacts and variation betweensamples during or after identification of biomarkers.
Overview and Introduction to Clinical Proteomics 23
It is now generally accepted that an ensemble (or panel) of different proteinswould be more efficient than a single protein/peptide in the diagnosis of disease,an idea which is poised to replace the conventional concept of a biomarker.As a high-throughput way of protein profiling, the use of antibody arraysin clinical proteomics has recently increased in regard to detection of cancerspecimens. However, in the use of antibody arrays to profile serum autoanti-bodies, issues of cross-reactivity and specificity have to be resolved. Althoughnot covered here due to space limitations, with the advent of proteomicstechniques one can further analyze a network of protein–protein interactionas well as post-translational modifications of those proteins involved in aspecific disease (Table 3). It is now highly recommended that common reagentssuch as antibodies and standard proteins, which are very useful for spikingpurposes, quantification work, and sensitivity normalization of one machine toanother be used in worldwide efforts like human proteome organization plasmaproteome project (96,97). Finally, clinical proteomics needs the integration ofbiochemistry, pathology, analytical technology, bioinformatics, and proteomeinformatics to develop highly sensitive diagnostic tools for routine clinical carein the future (71,98).
AcknowledgmentsThis study was supported by a grant from the Korea Health 21 R&D project,
Ministry of Health & Welfare, Republic of Korea (A030003 to YKP).
References1. Etzioni, R., Urban, N., Ramsey, S., McIntosh, M., Schwartz, S., Reid, B., Radich, J.,
Anderson, G., and Hartwell, L. (2003) The case for early detection. Nat. Rev.Cancer 3, 1–10.
2. Ludwig, J. A. and Weinstein, J. N. (2005) Biomarkers in cancer staging, prognosisand treatment selection. Nat. Rev. Cancer 5, 845–856.
3. Xiao, Z., Prieto, D., Conrads, T. P., Veenstra, T. D., and Issaq, H. J. (2005)Proteomic patterns: their potential for disease diagnosis. Mol. Cell Endocrinol.230, 95–106.
4. Rifai, N., Gillette, M. A., and Carr, S. A. (2006) Protein biomarker discoveryand validation: the long and uncertain path to clinical utility. Nat. Biotechnol. 24,97–983.
5. Emmert-Buck, M. R., Bonner, R. F., Smith, P. D., Chuaqui, R. F., Zhuang, Z.,Goldstein, S. R., Weiss, R. A., and Liotta, L. A. (1996) Laser capture microdis-section. Science 274, 998–1001.
6. Gillespie, J. W., Ahram, M., Best, C. J., Swalwell, J. I., Krizman, D. B.,Petricoin, E. F., Liotta, L. A., and Emmert-Buck, M. R. (2001) The role of tissuemicrodissection in cancer research. Cancer J. 7, 32–39.
24 Paik et al.
7. Craven, R. A. and Banks, R. E. (2002) Use of laser capture microdissection toselectively obtain distinct populations of cells for proteomic analysis. MethodsEnzymol. 356, 33–49.
8. Vincourt, J. B., Lionneton, F., Kratassiouk, G., Guillemin, F., Netter, P.,Mainard, D., and Magdalou, J. (2006) Establishment of a reliable method for directproteome characterization of human articular cartilage. Mol. Cell Proteomics 5,1984–1995.
9. Platt, M. S., Agamanolis, D. P., Krill, C. E. Jr., Boeckman, C., Potter, J. L.,Robinson, H., and Lloyd, J. (1983) Occult hepatic sinusoid tumor of infancysimulating neuroblastoma. Cancer 52, 1183–1189.
10. Mahadevia, P. J., Fleisher, L. A., Frick, K. D., Eng, J., Goodman, S. N., andPowe, N. R. (2003) Lung cancer screening with helical computed tomographyin older adult smokers: a decision and cost-effectiveness analysis. JAMA 289,313–322.
11. Hood, B. L., Darfler, M. M., Guiel, T. G., Furusato, B., Lucas, D. A.,Ringeisen, B. R., Sesterhenn, I. A., Conrads, T. P., Veenstra, T. D., and Krizman,D. B. (2005) Proteomic analysis of formalin-fixed prostate cancer tissue. Mol. CellProteomics 4, 1741–1753.
12. Alaiya, A., Al-Mohanna, M., and Linder, S. (2005) Clinical cancer proteomics:promises and pitfalls. J. Proteome Res. 4, 1213–1222.
13. Gericke, B., Raila, J., Sehouli, J., Haebel, S., Konsgen, D., Mustea, A., andSchweigert, F. J. (2005) Microheterogeneity of transthyretin in serum and asciticfluid of ovarian cancer patients. BMC Cancer 17, 133–141.
14. Swisher, E. M., Wollan, M., Mahtani, S. M., Willner, J. B., Garcia, R., Goff, B. A.,and King, M. C. (2005) Tumor-specific p53 sequences in blood and peritoneal fluidof women with epithelial ovarian cancer. Am. J. Obstet. Gynecol. 193, 662–667.
15. Pisitkun, T., Johnstone, R., and Knepper, M. A. (2006) Discovery of urinarybiomarkers. Mol. Cell Proteomics 5, 1760–1771.
16. Ghafouri, B., Irander, K., Lindbom, J., Tagesson, C., and Lindahl, M. (2006)Comparative proteomics of nasal fluid in seasonal allergic rhinitis. J. ProteomeRes. 5, 330–338.
17. Koo, B. S., Lee, D. Y., Ha, H. S., Kim, J. C., and Kim, C. W. (2005) Comparativeanalysis of the tear protein expression in blepharitis patients using two-dimensionalelectrophoresis. J. Proteome Res. 4, 719–724.
18. Grus, F. H., Podust, V. N., Bruns, K., Lackner, K., Fu, S., Dalmasso, E. A.,Wirthlin, A., and Pfeiffer, N. (2005) SELDI-TOF-MS ProteinChip array profilingof tears from patients with dry eye. Invest. Ophthalmol. Vis. Sci. 46, 863–876.
19. Amado, F. M., Vitorino, R. M., Domingues, P. M., Lobo, M. J., and Duarte, J. A.(2005) Analysis of the human saliva proteome. Expert Rev. Proteomics 2, 521–539.
20. Wang, T. H., Chang, Y. L., Peng, H. H., Wang, S. T., Lu, H. W., Teng, S. H.,Chang, S. D., and Wang, H. S. (2005) Rapid detection of fetal aneuploidy usingproteomics approaches on amniotic fluid supernatant. Prenat. Diagn. 25, 559–566.
21. Ruetschi, U., Rosen, A., Karlsson, G., Zetterberg, H., Rymo, L., Hagberg,H., and Jacobsson, B. (2005) Proteomic analysis using protein chips to detect
Overview and Introduction to Clinical Proteomics 25
biomarkers in cervical and amniotic fluid in women with intra-amniotic inflam-mation. J. Proteome Res. 4, 2236–2242.
22. Kim, Y. S., Kim, M. S., Lee, S. H., Choi, B. C., Lim, J. M., Cha, K. Y., andBaek, K. H. (2006) Proteomic analysis of recurrent spontaneous abortion: identi-fication of an inadequately expressed set of proteins in human follicular fluid.Proteomics 6, 3445–3454.
23. Pilch, B. and Mann, M. (2006) Large-scale and high-confidence proteomic analysisof human seminal plasma. Genome Biol. 7, R40
24. Varnum, S. M., Covington, C. C., Woodbury, R. L., Petritis, K., Kangas, L. J.,Abdullah, M. S., Pounds, J. G., Smith, R. D., and Zangar, R. C. (2003) Proteomiccharacterization of nipple aspirate fluid: identification of potential biomarkers ofbreast cancer. Breast Cancer Res. Treat. 80, 87–97.
25. Zheng, P. P., Luider, T. M., Pieters, R., Avezaat, C. J., van den Bent, M. J., SillevisSmitt, P. A., and Kros, J. M. (2003) Identification of tumor-related proteins byproteomic analysis of cerebrospinal fluid from patients with primary brain tumors.J. Neuropathol. Exp. Neurol. 62, 855–862.
26. Gibson, D. S., Blelock, S., Brockbank, S., Curry, J., Healy, A., McAllister, C.,and Rooney, M. E. (2006) Proteomic analysis of recurrent joint inflammation injuvenile idiopathic arthritis. J. Proteome Res. 5, 1988–1995.
27. Merkel, D., Rist, W., Seither, P., Weith, A., and Lenter, M. C. (2005)Proteomic study of human bronchoalveolar lavage fluids from smokers withchronic obstructive pulmonary disease by combining surface-enhanced laserdesorption/ionization-mass spectrometry profiling with mass spectrometric proteinidentification. Proteomics 5, 2972–2980.
28. Wu, J., Kobayashi, M., Sousa, E. A., Liu, W., Cai, J., Goldman, S. J., Dorner, A. J.,Projan, S. J., Kavuru, M. S., Qiu, Y., and Thomassen, M. J. (2005) Differ-ential proteomic analysis of bronchoalveolar lavage fluid in asthmatics followingsegmental antigen challenge. Mol. Cell Proteomics 4, 1251–1264.
29. Tyan, Y. C., Wu, H. Y., Lai, W. W., Su, W. C., and Liao, P. C. (2005) Proteomicprofiling of human pleural effusion using two-dimensional nano liquid chromatog-raphy tandem mass spectrometry. J. Proteome Res. 4, 1274–1286.
30. Khalil, A. A. and James, P. (2007) Biomarker discovery: a proteomic approach forbrain cancer profiling. Cancer Sci. 98, 201–213.
31. Khodavirdi, A. C., Song, Z., Yang, S., Zhong, C., Wang, S., Wu, H., Pritchard, C.,Nelson, P. S., and Roy-Burman, P. (2006) Increased expression of osteopontincontributes to the progression of prostate cancer. Cancer Res. 66, 883–888.
32. Vincourt, J. B., Lionneton, F., Kratassiouk, G., Guillemin, F., Netter, P., Mainard, D.,and Magdalou, J. (2006) Establishment of a reliable method for direct proteomecharacterization of human articular cartilage. Mol. Cell Proteomics 5, 1984–1995.
33. Lee, Y. J., Rice, R. H., and Lee, Y. M. (2006) Proteome analysis of humanhair shaft: from protein identification to post-translational modification. Mol. CellProteomics 5, 789–800.
34. Cho, S. Y., Lee, E. Y., Lee, J. S., Kim, H. Y., Park, J. M., Kwon, M. S., Park, Y. K.,Lee, H. J., Kang, M. J., Kim, J. Y., Yoo, J. S., Park, S. J., Cho, J. W., Kim, H. S., and
26 Paik et al.
Paik, Y. K. (2005) Efficient prefractionation of low-abundance proteins in humanplasma and construction of a two-dimensional map. Proteomics 5, 3386–3396.
35. Lathrop, J. T., Hayes, T. K., Carrick, K., and Hammond, D. J. (2005) Rarity givesa charm: evaluation of trace proteins in plasma and serum. Expert Rev. Proteomics2, 393–406.
36. Lee, H. J., Lee, E. Y., Kwon, M. S., and Paik, Y. K. (2006) Biomarker discoveryfrom the plasma proteome using multidimensional fractionation proteomics. Curr.Opin. Chem. Biol. 10, 42–49.
37. Anderson, N. L. and Anderson, N. G. (2002) The human plasma proteome: history,character, and diagnostic prospects. Mol. Cell Proteomics 1, 845–867.
38. Hu, S., Loo, J. A., and Wong, D. T. (2006) Human body fluid proteome analysis.Proteomics 6, 6326–6353.
39. Park, M. R., Wang, E. H., Jin, D. C., Cha, J. H., Lee, K. H., Yang, C. W.,Kang, C. S., and Choi, Y. J. (2006) Establishment of a 2-D human urinary proteomicmap in IgA nephropathy. Proteomics 6, 1066–1076.
40. Tammen, H., Schutle, I., Hess, R., Menzel, C., Kellmann, M., and Schulz-Knappe, P. (2005) Prerequisites for peptidomic analysis of blood samples: I.Evaluation of blood specimen qualities and determination of technical performancecharacteristics. Comb. Chem. High Trhoughput Screen 8, 725–733.
41. Rai, A. J., Gelfand, C. A., Haywood, B. C., Warunek, D. J., Yi, J., Schuchard, M. D.,Mehigh, R. J., Cockrill, S. L., Scott, G. B., Tammen, H., Schulz-Knappe, P.,Speicher, D. W., Vitzthum, F., Haab, B. B., Siest, G., and Chan, D. W.(2005) HUPO plasma proteome project specimen collection and handling: towardsthe standardization of parameters for plasma proteome samples. Proteomics 5,3262–3277.
42. Zhou, M., Lucas, D. A., Chan, K. C., Issaq, H. J., Petricoin, E. F. 3rd, Liotta, L. A.,Veenstra, T. D., and Conrads, T. P. (2004) An investigation into the human serum“interactome”. Electrophoresis 25, 1289–1298.
43. Findeisen, P., Sismanidis, D., Riedl, M., Costina, V., and Neumaier, M. (2005)Preanalytical impact of sample handling on proteome profiling experiments withmatrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Clin.Chem. 51, 2409–2411.
44. Park, K. S., Kim, H., Kim, N. G., Cho, S. Y., Choi, K. H., Seong, J. K., and Paik,Y. K. (2002) Proteomic analysis and molecular characterization of tissue ferritinlight chain in hepatocellular carcinoma. Hepatology 35, 1459–1466.
45. Park, K. S., Cho, S. Y., Kim, H., and Paik, Y. K. (2002) Proteomic alterations of thevariants of human aldehyde dehydrogenase isozymes correlate with hepatocellularcarcinoma. Int. J. Cancer 97, 261–265.
46. Marko-Varga, G., Berglund, M., Malmstrom, J., Lindberg, H., and Fehniger, T. E.(2003) Targeting hepatocytes from liver tissue by laser capture microdissectionand proteomics expression profiling. Electrophoresis 24, 3800–3805.
47. Paradis, V., Degos, F., Dargere, D., Pham, N., Belghiti, J., Degott, C., Janeau,J. L., Bezeaud, A., Delforge, D., Cubizolles, M., Laurendeau, I., and Bedossa, P.(2005) Identification of a new biomarker of hepatocellular carcinoma by serumprotein profiling of patients with chronic liver diseases. Hepatology 41, 40–47.
Overview and Introduction to Clinical Proteomics 27
48. Ru, Q. C., Zhu, L. A., Silberman, J., and Shriver, C. D. (2006) Label-free semiquan-titative peptide feature profiling of human breast cancer and breast disease sera viatwo-dimensional liquid chromatography–mass spectrometry. Mol. Cell Proteomics5, 1095–1104.
49. Azad, N. S., Rasool, N., Annuziata, C. M., Minasian, L., Whiteley, G., andKohn, E. C. (2006) Proteomics in clinical trials and practice: present uses andfuture promise. Mol. Cell Proteomics 5, 1819–1829.
50. Gunter, E. W. (1997) Biological and environmental specimen banking at theCenters for Disease Control and Prevention. Chemosphere 34, 1945–1953.
51. Strauss, G. H. and Kelly, S. J. (1990) The development of the U.S. EPA healtheffects research laboratory frozen blood cell repository program. Mutat. Res. 234,349–354.
52. Romeo, M. J., Espina, V., Lowenthal, M., Espina, B. H., Petricoin, E. F. 3rd, andLiotta, L. A. (2005) CSF proteome: a protein repository for potential biomarkeridentification. Expert Rev. Proteomics 2, 57–70.
53. Conrads, T. P., Hood, B. L., Petricoin, E. F. 3rd, Liotta, L. A., and Veenstra, T. D.(2005) Cancer proteomics: many technologies, one goal. Expert Rev. Proteomics2, 693–703.
54. Schrader, M. and Selle, H. (2006) The process chain for peptidomic biomarkerdiscovery. Dis. Markers 22, 27–37.
55. Danna, E. A. and Nolan, G. P. (2006) Transcending the biomarker mindset:deciphering disease mechanisms at the single cell level. Curr. Opin. Chem. Biol.10, 20–27.
56. De Masi, S., Tosti, M. E., and Mele, A. (2005) Screening for hepatocellularcarcinoma. Dig. Liver Dis. 37, 260–268.
57. Yamaguchi, K., Nagano, M., Torada, N. Hamasaki, N., Kawakita, M., andTanaka, M. (2004) Urine diacetylspermine as a novel tumor marker for pancreato-biliary carcinomas. Rinsho. Byori. 52, 336–339
58. Dabrowska, M., Grubek-Jaworska, H., Domagala-Kulawik, J., Bartoszewicz, Z.,Kondracka, A., Krenke, R., Nejman, P., and Chazan, R. (2004) Diagnostic usefulnessof selected tumor markers (CA125, CEA, CYFRA 21–1) in bronchoalveolar lavagefluid in patients with non-small cell lung cancer. Pol. Arch. Med. Wewn 111, 659–665.
59. Gann, P. H., Hennekens, C. H., and Stampfer, M. J. (1995) A prospective evaluationof plasma prostate-specific antigen for detection of prostatic cancer. JAMA 273,289–294
60. Ciambellotti, E., Coda, C., and Lanza, E. (1993) Determination of CA 15–3 in thecontrol of primary and metastatic breast carcinoma. Minerva Med. 84, 107–112.
61. Linkov, F., Lisovich, A., Yurkovetsky, Z., Marrangoni, A., Velikokhatnaya, L.,Nolen, B., Winans, M., Bigbee, W., Siegfried, J., Lokshin, A., and Ferris, R. L.(2007) Early detection of head and neck cancer: development of a novel screeningtool using multiplexed immunobead-based biomarker profiling. Cancer Epidemiol.Biomarkers Prev. 16, 102–107.
62. Casiano, C. A., Mediavilla-Varela, M., and Tan, E. M. (2006) Tumor-associatedantigen arrays for the serological diagnosis of cancer. Mol. Cell Proteomics 5,1745–1759.
28 Paik et al.
63. Nissom, P. M., Lo, S. L., Lo, J. C., Ong, P. F., Lim, J. W., Ou, K., Liang, R. C.,Seow, T. K., and Chung, M. C. (2006) Hcc-2, a novel mammalian ER thioredoxinthat is differentially expressed in hepatocellular carcinoma. FEBS Lett. 580, 2216–2226.
64. Feng, J. T., Liu, Y. K., Song, H. Y., Dai, Z., Qin, L. X., Almofti, M. R., Fang, C. Y.,Lu, H. J., Yang, P. Y., and Tang, Z. Y. (2005) Heat-shock protein 27: a potentialbiomarker for hepatocellular carcinoma identified by serum proteome analysis.Proteomics 5, 4581–1588.
65. Li, D. Q., Wang, L., Fei, F., Hou, Y. F., Luo, J. M., Wei-Chen, Zeng, R.,Wu, J., Lu, J. S., Di, G. H., Ou, Z. L., Xia, Q. C., Shen, Z. Z., andShao, Z. M. (2006) Identification of breast cancer metastasis-associated proteinsin an isogenic tumor metastasis model using two-dimensional gel electrophoresisand liquid chromatography-ion trap-mass spectrometry. Proteomics 6,3352–3368.
66. Lee, I. N., Chen, C. H., Sheu, J. C., Lee, H. S., Huang, G. T., Yu, C. Y.,Lu, F. J., and Chow, L. P. (2005) Identification of human hepatocellular carcinoma-related biomarkers by two-dimensional difference gel electrophoresis and massspectrometry. J. Proteome Res. 4, 2062–2069.
67. Righetti, P. G., Castagna, A., Antonucci, F., Piubelli, C., Cecconi, D.,Campostrini, N., Rustichelli, C., Antonioli, P., Zanusso, G., Monaco, S., Lomas, L.,and Boschetti, E. (2005) Proteome analysis in the clinical chemistry laboratory:myth or reality? Clin. Chim. Acta 357, 123–139.
68. Jang, J. S., Cho, H. Y., Lee, Y. J., Ha, W. S., and Kim, H. W. (2004) Thedifferential proteome profile of stomach cancer: identification of the biomarkercandidates. Oncol. Res. 14, 491–499.
69. Steel, L. F., Shumpert, D., Trotter, M., Seeholzer, S. H., Evans, A. A., London,W. T., Dwek, R., and Block, T. M. (2003) A strategy for the comparative analysisof serum proteomes for the discovery of biomarkers for hepatocellular carcinoma.Proteomics 3, 601–609.
70. Yip, T. T., Chan, J. W., Cho, W. C., Yip, T. T., Wang, Z., Kwan, T. L., Law, S. C.,Tsang, D. N., Chan, J. K., Lee, K. C., Cheng, W. W., Ma, V. W., Yip, C.,Lim, C. K., Ngan, R. K., Au, J. S., Chan, A., Lim, W. W., and Ciphergen SARSProteomics Study Group (2005) Protein chip array profiling analysis in patientswith severe acute respiratory syndrome identified serum amyloid a protein as abiomarker potentially useful in monitoring the extent of pneumonia. Clin. Chem. 51,47–55.
71. Anderson, L. and Hunter, C. L. (2005) Quantitative mass spectrometric multiplereaction monitoring assays for major plasma proteins. Mol. Cell Proteomics 5,573–588.
72. Lee, J. W., Figeys, D., and Vasilescu, J. (2007) Biomarker assay translation fromdiscovery to clinical studies in cancer drug development: quantification of emergingprotein biomarkers. Adv. Cancer Res. 96, 269–298.
73. Zolg, W. (2006) The proteomic search for diagnostic biomarkers: lost in trans-lation? Mol. Cell Proteomics 5, 1720–1726.
Overview and Introduction to Clinical Proteomics 29
74. Bensmail, H., Golek, J., Moody, M. M., Semmes, J. O., and Haoudi, A. (2005)A novel approach for clustering proteomics data using Bayesian fast Fouriertransform. Bioinformatics 21, 2210–2224.
75. Ward, D. G., Cheng, Y., N’Kontchou, G., Thar, T. T., Barget, N., Wei, W.,Billingham, L. J., Martin, A., Beaugrand, M., and Johnson, P. J. (2006) Changes inthe serum proteome associated with the development of hepatocellular carcinomain hepatitis C-related cirrhosis. Br. J. Cancer 94, 287–292.
76. Lin, N. and Zhao, H. (2005) Are scale-free networks robust to measurement errors?BMC Bioinformatics 6, 119.
77. Castagna, A., Cecconi, D., Sennels, L., Rappsilber, J., Guerrier, L., Fortis, F.,Boschetti, E., Lomas, L., and Righetti, P. G. (2005) Exploring the hidden humanurinary proteome via ligand library beads. J. Proteome Res. 4, 1917–1930.
78. Rauch, A., Bellew, M., Eng, J., Fitzgibbon, M., Holzman, T., Hussey, P., Igra, M.,Maclean, B., Lin, C. W., Detter, A., Fang, R., Faca, V., Gafken, P., Zhang, H.,Whiteaker, J., States, D., Hanash, S., Paulovich, A., and McIntosh, M. W. (2006)Computational proteomics analysis system (CPAS): an extensible open sourceanalytic system for evaluating and publishing proteomic data and high throughputbiological experiments. J. Proteome Res. 5, 112–121.
79. Lilley, K. S. and Friedman, D. B. (2004) All about DIGE: quantification technologyfor differential-display 2D-gel proteomics. Expert Rev. Proteomics 1, 401–409.
80. Qian, W. J., Jacobs, J. M., Liu, T., Camp, D. G. 2nd, and Smith, R. D.(2006) Advances and challenges in liquid chromatography-mass spectrometry-based proteomics profiling for clinical applications. Mol. Cell Proteomics 5,1727–1744.
81. Powell, D. W., Merchant, M. L., and Link, A. J. (2006) Discovery of regulatorymolecular events and biomarkers using 2D capillary chromatography and massspectrometry. Expert Rev. Proteomics 3, 63–74.
82. Andre, M., Le Caer, J. P., Greco, C., Planchon, S., El Nemer, W., Boucheix, C.,Rubinstein, E., Chamot-Rooke, J., and Le Naour, F. (2006) Proteomic analysis ofthe tetraspanin web using LC-ESI-MS/MS and MALDI-FTICR-MS. Proteomics6, 1437–1449.
83. Greengauz-Roberts, O., Stoppler, H., Nomura, S., Yamaguchi, H.,Goldenring, J. R., Podolsky, R. H., Lee, J. R., and Dynan, W. S. (2005) Saturationlabeling with cysteine-reactive cyanine fluorescent dyes provides increased sensi-tivity for protein expression profiling of laser-microdissected clinical specimens.Proteomics 5, 1746–1757.
84. Heck, A. J. and Krijgsveld, J. (2004) Mass spectrometry-based quantitativeproteomics. Expert Rev. Proteomics 1, 317–326.
85. Schneider, L. V. and Hall, M. P. (2005) Stable isotope methods for high-precisionproteomics. Drug Discov. Today 10, 353–363.
86. Zhang, J., Goodlett, D. R., Peskind, E. R., Quinn, J. F., Zhou, Y., Wang, Q.,Pan, C., Yi, E., Eng, J., Aebersold, R. H., and Montine, T. J. (2005) Quantitativeproteomic analysis of age-related changes in human cerebrospinal fluid. NeurobiolAging 26, 207–227.
30 Paik et al.
87. Liu, T., Qian, W. J., Strittmatter, E. F., Camp, D. G. 2nd, Anderson, G. A.,Thrall. B. D., and Smith, R. D. (2004) High-throughput comparative proteomeanalysis using a quantitative cysteinyl-peptide enrichment technology. Anal. Chem.76, 5345–5353.
88. Li, C., Hong, Y., Tan, Y. X., Zhou, H., Ai, J. H., Li, S. J., Zhang, L., Xia, Q. C.,Wu, J. R., Wang, H. Y., and Zeng, R. (2004) Accurate qualitative and quanti-tative proteomic analysis of clinical hepatocellular carcinoma using laser capturemicrodissection coupled with isotope-coded affinity tag and two-dimensional liquidchromatography mass spectrometry. Mol. Cell Proteomics 3, 399–409.
89. Sheehan, K. M., Calvert, V. S., Kay, E. W., Lu, Y., Fishman, D., Espina, V.,Aquino. J., Speer, R., Araujo, R., Mills, G. B., Liotta, L. A., Petricoin, E. F.3rd, and Wulfkuhle, J. D. (2005) Use of reverse phase protein microarrays andreference standard development for molecular network analysis of metastaticovarian carcinoma. Mol. Cell Proteomics 4, 346–355.
90. Knezevic, V., Leethanakul, C., Bichsel, V. E., Worth, J. M., Prabhu, V. V., Gutkind,J. S., Liotta, L. A., Munson, P. J., Petricoin, E. F. 3rd, and Krizman, D. B. (2001)Proteomic profiling of the cancer microenvironment by antibody arrays. Proteomics1, 1271–1278.
91. Sharma-Oates, A., Quirke, P., Westhead, D. R. (2005) TmaDB: a repository fortissue microarray data. BMC Bioinformatics 6, 218.
92. Rai, A. J., Stemmer, P. M., Zhang, Z., Adam, B. L., Morgan, W. T., Caffrey,R. E., Podust, V. N., Patel, M., Lim, L. Y., Shipulina, N. V., Chan, D. W.,Semmes, O. J., and Leung, H. C. (2005) Analysis of human proteome organizationplasma proteome project (HUPO PPP) reference specimens using surface enhancedlaser desorption/ionization-time of flight (SELDI-TOF) mass spectrometry: multi-institution correlation of spectra and identification of biomarkers. Proteomics 5,3467–3474.
93. Engwegen, J. Y., Gast, M. C., Schellens, J. H., and Beijnen, J. H. (2006)Clinical proteomics: searching for better tumour markers with SELDI-TOF massspectrometry. Trends Pharmacol. Sci. 27, 251–259.
94. Domon, B. and Aebersold, R. (2006) Mass spectrometry and protein analysis.Science 312, 212–217.
95. Domon, B. and Aebersold, R. (2006) Challenges and opportunities in proteomicsdata analysis. Mol. Cell Proteomics 5, 1921–1926.
96. Uhlen, M. and Ponten, F. (2005) Antibody-based proteomics for human tissueprofiling. Mol. Cell Proteomics 4, 384–393.
97. Taussig, M. J., Stoevesandt, O., Borrebaeck, C. A., Bradbury, A. R., Cahill, D.,Cambillau, C., de Daruvar, A., Dubel, S., Eichler, J., Frank, R., Gibson, T. J.,Gloriam, D., Gold, L., Herberg, F. W., Hermjakob, H., Hoheisel, J. D., Joos, T. O.,Kallioniemi, O., Koegll, M., Konthur, Z., Korn, B., Kremmer, E., Krobitsch, S.,Landegren, U., van der Maarel, S., McCafferty, J., Muyldermans, S., Nygren, P. A.,Palcy, S., Pluckthun, A., Polic, B., Przybylski, M., Saviranta, P., Sawyer, A.,Sherman, D. J., Skerra, A., Templin, M., Ueffing, M., and Uhlen, M. (2007)
Overview and Introduction to Clinical Proteomics 31
ProteomeBinders: planning a European resource of affinity reagents for analysisof the human proteome. Nat. Methods 4, 13–17.
98. Ilyin, S. E., Belkowski, S. M., and Plata-Salaman, C. R. (2004) Biomarkerdiscovery and validation: technologies and integrative approaches. TrendsBiotechnol. 22, 411–416.