+ All Categories
Home > Documents > Chemometric evaluation of the heavy metals distribution in waters from the Dilovası region in...

Chemometric evaluation of the heavy metals distribution in waters from the Dilovası region in...

Date post: 01-Jan-2017
Category:
Upload: nevin
View: 214 times
Download: 0 times
Share this document with a friend
6
Baseline Chemometric evaluation of the heavy metals distribution in waters from the Dilovası region in Kocaeli, Turkey Deniz Bingöl a,, Ümit Ay a , Seda Karayünlü Bozbas ß a , Nevin Uzgören b a Department of Chemistry, Kocaeli University, Kocaeli, Turkey b Faculty of Economics and Administrative Sciences, Dumlupınar University, Kütahya, Turkey article info Keywords: Heavy metals Principal component analysis Correlation analysis Cluster analysis Chemometrics abstract The main objective of this study was to test water samples collected from 10 locations in the Dilovası area (a town in the Kocaeli region of Turkey) for heavy metal contamination and to classify the heavy metal (Cr, Mn, Co, Ni, Cu, Zn, As, Cd, Pb and Hg) contents in water samples using chemometric methods. The heavy metals in the water samples were identified using inductively coupled plasma-mass spectrometry (ICP-MS). To ascertain the relationship among the water samples and their possible sources, the correla- tion analysis, principal component analysis (PCA), and cluster analysis (CA) were used as classification techniques. About 10 water samples were classified into five groups using PCA. A very similar grouping was obtained using CA. Ó 2012 Elsevier Ltd. All rights reserved. In Turkey, a developing country, environmental pollution prob- lems have increased since 1960 due to the rapid growth of industry and population increase in the Marmara region, specifically in Iz- mit Bay. Since the 1960s, more than 250 large industrial plants have been built in the area surrounding the bay. Industrial activi- ties in the region are mostly located along on the northern coast of Izmit Bay and surrounded by residential neighborhoods (Demi- ray et al., 2012). With its 50,000 inhabitants, the Dilovası district of Kocaeli is a symbol of Turkey’s uncontrolled industrialization. There are 185 companies serving in 45 sectors mainly metal (iron–steel, aluminum), chemistry (e.g., paint), and energy (coal- fired electric power plant), in the Dilovası organized industrial zone, which is located at the center of a bowl-like topographic structure. This caused serious environmental problems in the re- gion, for example, soil, air, and water pollution. Although serious measures were taken to reduce and control pollution since the beginning of 1990, pollution levels are still high in the region (Karademir, 2006). In Dilovası, deaths caused by cancer have sur- passed those caused by cardiovascular diseases, becoming the leading cause of death (Tuncer, 2009). Dilovası’s sewer system is directly connected to Dil Creek. This waste is not subject to any purification. Dil Creek is located in the eastern Marmara region and discharges into Izmit Bay. This water source is used for irriga- tion and as drinking water for animals. One of the most important sources of pollution in Izmit Bay is Dil Creek, which flows into the western part of the bay. An estimated 60% of the total waste water directly enters Izmit Bay (Telli-Karakoç et al., 2002). Large indus- trial plants (mainly paint and metal industries) around Dil Creek to discharge their solid and liquid waste into Dil Creek after limited treatment. In this area, Izocam, DYO, Lever, Wishes, Olmuksa, Yazıcıog ˘lu, Polisan and Çolakog ˘lu factories generate the most stream pollu- tion. Factories such as the Porland porcelain factory are able to pol- lute the river and creek from a distance thanks to hundreds of meters of pipes. Fifteen years ago, Dilovası was home to cherry, ap- ple, and peach orchards and vineyards; however, unfortunately, the Dil Creek pollution destroyed the orchards and vineyards. Waste is discharged directly into the stream without a water treat- ment system causing the extinction of many species. In the past, people swam and fished in Dil Creek’s clean water. Heavy metals are deemed serious pollutants because of toxicity, persistence, and non-degradability in the environment (Fang and Hong, 1999; Klavins et al., 2000; Tam and Wong, 2000; Yuan et al., 2004). Over the past century, heavy metals have been discharged into the world’s rivers and estuaries as a result of rapid industrialization (Chen et al., 2004; Cobelo-García and Prego, 2003; Pekey, 2006; Tam and Wong, 2000). Multivariate methods are being increasingly used because a large amount of information can be compared in graphical form, which is very difficult to do using number tables or univariate sta- tistics. There are many well established multivariate methods for classification, of which the most commonly used are correlation analysis, principal component analysis (PCA), and cluster analysis (CA) (Brereton, 2007). Some researchers have determined similar- ities between samples and groups of samples using PCA and CA. For example, researchers can use multivariate methods to evaluate 0025-326X/$ - see front matter Ó 2012 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.marpolbul.2012.12.006 Corresponding author. Tel.: +90 2623032030; fax: +90 2623032003. E-mail addresses: [email protected], [email protected] (D. Bingöl). Marine Pollution Bulletin 68 (2013) 134–139 Contents lists available at SciVerse ScienceDirect Marine Pollution Bulletin journal homepage: www.elsevier.com/locate/marpolbul
Transcript
Page 1: Chemometric evaluation of the heavy metals distribution in waters from the Dilovası region in Kocaeli, Turkey

Marine Pollution Bulletin 68 (2013) 134–139

Contents lists available at SciVerse ScienceDirect

Marine Pollution Bulletin

journal homepage: www.elsevier .com/locate /marpolbul

Baseline

Chemometric evaluation of the heavy metals distribution in waters from theDilovası region in Kocaeli, Turkey

Deniz Bingöl a,⇑, Ümit Ay a, Seda Karayünlü Bozbas� a, Nevin Uzgören b

a Department of Chemistry, Kocaeli University, Kocaeli, Turkeyb Faculty of Economics and Administrative Sciences, Dumlupınar University, Kütahya, Turkey

a r t i c l e i n f o a b s t r a c t

Keywords:Heavy metalsPrincipal component analysisCorrelation analysisCluster analysisChemometrics

0025-326X/$ - see front matter � 2012 Elsevier Ltd.http://dx.doi.org/10.1016/j.marpolbul.2012.12.006

⇑ Corresponding author. Tel.: +90 2623032030; faxE-mail addresses: [email protected],

(D. Bingöl).

The main objective of this study was to test water samples collected from 10 locations in the Dilovası area(a town in the Kocaeli region of Turkey) for heavy metal contamination and to classify the heavy metal(Cr, Mn, Co, Ni, Cu, Zn, As, Cd, Pb and Hg) contents in water samples using chemometric methods. Theheavy metals in the water samples were identified using inductively coupled plasma-mass spectrometry(ICP-MS). To ascertain the relationship among the water samples and their possible sources, the correla-tion analysis, principal component analysis (PCA), and cluster analysis (CA) were used as classificationtechniques. About 10 water samples were classified into five groups using PCA. A very similar groupingwas obtained using CA.

� 2012 Elsevier Ltd. All rights reserved.

In Turkey, a developing country, environmental pollution prob-lems have increased since 1960 due to the rapid growth of industryand population increase in the Marmara region, specifically in Iz-mit Bay. Since the 1960s, more than 250 large industrial plantshave been built in the area surrounding the bay. Industrial activi-ties in the region are mostly located along on the northern coastof Izmit Bay and surrounded by residential neighborhoods (Demi-ray et al., 2012). With its 50,000 inhabitants, the Dilovası district ofKocaeli is a symbol of Turkey’s uncontrolled industrialization.There are 185 companies serving in 45 sectors mainly metal(iron–steel, aluminum), chemistry (e.g., paint), and energy (coal-fired electric power plant), in the Dilovası organized industrialzone, which is located at the center of a bowl-like topographicstructure. This caused serious environmental problems in the re-gion, for example, soil, air, and water pollution. Although seriousmeasures were taken to reduce and control pollution since thebeginning of 1990, pollution levels are still high in the region(Karademir, 2006). In Dilovası, deaths caused by cancer have sur-passed those caused by cardiovascular diseases, becoming theleading cause of death (Tuncer, 2009). Dilovası’s sewer system isdirectly connected to Dil Creek. This waste is not subject to anypurification. Dil Creek is located in the eastern Marmara regionand discharges into Izmit Bay. This water source is used for irriga-tion and as drinking water for animals. One of the most importantsources of pollution in Izmit Bay is Dil Creek, which flows into thewestern part of the bay. An estimated 60% of the total waste water

All rights reserved.

: +90 [email protected]

directly enters Izmit Bay (Telli-Karakoç et al., 2002). Large indus-trial plants (mainly paint and metal industries) around Dil Creekto discharge their solid and liquid waste into Dil Creek after limitedtreatment.

In this area, Izocam, DYO, Lever, Wishes, Olmuksa, Yazıcıoglu,Polisan and Çolakoglu factories generate the most stream pollu-tion. Factories such as the Porland porcelain factory are able to pol-lute the river and creek from a distance thanks to hundreds ofmeters of pipes. Fifteen years ago, Dilovası was home to cherry, ap-ple, and peach orchards and vineyards; however, unfortunately,the Dil Creek pollution destroyed the orchards and vineyards.Waste is discharged directly into the stream without a water treat-ment system causing the extinction of many species. In the past,people swam and fished in Dil Creek’s clean water. Heavy metalsare deemed serious pollutants because of toxicity, persistence,and non-degradability in the environment (Fang and Hong, 1999;Klavins et al., 2000; Tam and Wong, 2000; Yuan et al., 2004). Overthe past century, heavy metals have been discharged into theworld’s rivers and estuaries as a result of rapid industrialization(Chen et al., 2004; Cobelo-García and Prego, 2003; Pekey, 2006;Tam and Wong, 2000).

Multivariate methods are being increasingly used because alarge amount of information can be compared in graphical form,which is very difficult to do using number tables or univariate sta-tistics. There are many well established multivariate methods forclassification, of which the most commonly used are correlationanalysis, principal component analysis (PCA), and cluster analysis(CA) (Brereton, 2007). Some researchers have determined similar-ities between samples and groups of samples using PCA and CA.For example, researchers can use multivariate methods to evaluate

Page 2: Chemometric evaluation of the heavy metals distribution in waters from the Dilovası region in Kocaeli, Turkey

D. Bingöl et al. / Marine Pollution Bulletin 68 (2013) 134–139 135

trace metal concentrations in some spices and herbs (Karadas andKara, 2012); to the assessment of the level of some heavy metals insediments (Idris, 2008), to determine the levels of essential, traceand toxic elements in citrus honeys from different regions (Yüceland Sultanoglu, 2012); to determine trace elements in commonlyconsumed medicinal herbs (Tokalıoglu, 2012); to evaluate the min-eral content of medicinal herbs (Kolasani et al., 2011); to evaluatetrace metal concentration in some herbs and herbal teas (Kara,2009); to evaluate heavy metals in street dust samples (Tokalıogluand Kartal, 2006); to determine concentrations of key heavy metalsin street dust and analyze their potential sources (Lu et al., 2010);to identify heavy metals in pastureland (Franco-Uría et al., 2009);to identify source of eight hazardous heavy metals in agriculturalsoils (Cai et al., 2012); to classify sea cucumber according to regionof origin (Liu et al., 2012a); and to evaluate the heavy metal con-tamination of surface soil (Yaylalı-Abanuz, 2011).

The aim of this study was to apply the chemometric techniquesof correlation analysis, principal component analysis (PCA), andcluster analysis (CA) to results obtained from inductively coupledplasma-mass spectrometry (ICP-MS) of water samples, and toidentify similarities in heavy metal content.

An ICP-MS inductively coupled plasma-mass spectrometryinstrument (Perkin Elmer DRC-e/Cetax ADX-500) was used todetermine Cr, Mn, Co, Ni, Cu, Zn, As, Cd, Pb, and Hg content in eachregion. A Hanna pH 211 Microprocessor pH-meter was used tomeasure the pH values of the solutions. The pH-meter was stan-dardized with NBS buffers prior to each measurement.

A total of 10 sites were selected on Dil creek and Hereke Portnear the Dilovası area, and water samples were collected aftermonsoon (October 2011) season (Fig. 1).

Water samples, namely d1-steel and aluminun (direct unload-ing point), d2-steel and aluminun (inside Dil Creek), d3-steel andmetal (inside Dil creek), d4-steel (sea water, Hereke), d5-sea water(between Hereke Port and Dil creek), d6-chemistry and cosmetic,d7, d8-sea waters, d9, d10-Dil Creek (direct unloading point), andsea water, were filtered using Whatman filter paper (No. 40) andrefrigerated at approximately 4 �C until laboratory analysis. Watersamples used for total metals analyses were acidified to pH 2 withultra purified 6 M HNO3, and stored at 4 �C. Contents of 10 heavy

Fig. 1. A map of sampl

metals (Cr, Mn, Co, Ni, Cu, Zn, As, Cd, Pb and Hg) were measuredusing ICP-MS. All results given are the average values of three rep-licate analyses.

The analytical data obtained from the water samples were clas-sified using correlation analysis, PCA, and CA to evaluate whetherthere are any relationships between the heavy metals in the watersamples. All statistical calculations were made using PASW Statis-tics 18 and Minitab 16 software.

The aim of a correlation analysis is to measure the relationshipbetween variables. Pearson’s correlation coefficient (r for sample)is the most common correlation coefficient. The correlation coeffi-cients can range from �1 to +1 and are independent of the units ofmeasurement. Usually, |r| > 0.75 indicates that there is a significantrelationship between the variables. In this study, first the relation-ships between variables are examined using Pearson’s correlationcoefficient. In the statistical analysis, a significant correlationamong the variables is not required. In such cases, the correlationsshould be removed from the data set. The p number of relevantvariables can be expressed as the k number (k 6 p) of new artificialvariables, which are linear components of these variables do notcorrelate within them. This function performs the PCA (Özdamar,2002).

PCA is probably the most widespread multivariate statisticaltechnique used in chemometrics, and because of the importanceof multivariate measurements in chemistry, it is regarded by manyas the technique that most significantly changes chemist’s view ofdata analysis. Exploratory data analysis such as PCA is primarilyused to determine general relationships between data. Sometimes,more complex questions need to be answered; for example, do thesamples fall into groups? The aims of PCA are to determine under-lying information from multivariate raw data (Brereton, 2007).

Additional interpretations between heavy metals and watersamples may be obtained using more powerful chemometric tech-niques such as PCA. PCA is a projection method that allows easyvisualization of all the information contained in a data set. PCAhelps determine differences between samples and identifies whichvariables contribute most to this difference (Liu et al., 2012b). PCAis a process that transforms components of data matrix A (An�p),including n samples and p, as shown in the following equation:

ing locations/sites.

Page 3: Chemometric evaluation of the heavy metals distribution in waters from the Dilovası region in Kocaeli, Turkey

136 D. Bingöl et al. / Marine Pollution Bulletin 68 (2013) 134–139

A ¼ T � Bþ EA ð1Þ

where T is n � q score matrix, and B is q � p PCA loading matrices. qgives the minimum number of principal components needed for aPCA analysis of matrix A. Each column vector of matrix T and eachrow vector of matrix B is considered one principal component ofmatrix A (Dıraman et al., 2009). In PCA, the information carried bythe original variables is projected onto a smaller number of under-lying (‘‘latent’’) variables called principal components. The firstprincipal component covers as much of the variation in the dataas possible, the second principal component is orthogonal to thefirst and covers as much of the remaining variation as possible,and so on (Kara, 2009). Because the first and second principal com-ponent usually covers a large portion of the total, a clustering ofsamples, according to the effect of all variables within the two-dimensional plane, is possible by plotting against each of the firsttwo column vectors (the first two principle components: PC1 andPC2) of matrix T. For a grouping that depends on the distributionof variables in the system, the first two rows of matrix B is plottedagainst each other (Dıraman et al., 2009). If the first m principalcomponent describes a large portion of the total variance, the restof the p–m principal component can be neglected. In this case, thereis a small variance (information) loss, and the work-space size is re-duced to m from p (m < p) (reduction of dimension) (Tatlıdil, 1992).Principal components (Yi) are independent, and their variances areequal to the eigenvalue (ki) of correlation matrixes. The total vari-ance of the original system is equal to the total variance of principalcomponents:

Total variance ¼ k1 þ k2 þ � � � þ kp ¼Xp

i¼1

VarðYiÞ: ð2Þ

The total variability of the data matrix is equal to the total var-iability of principal components;

The variability ratio explained by the kth principal component

¼ kk

k1 þ k2 þ � � � þ kpðk ¼ 1;2; . . . ;pÞ:

ð3Þ

In applications, a few principal components describes a propor-tion larger than 80% of the total variance, without causing a loss ofinformation that can substitute for the original variable (Ersunguret al., 2007). The number of eigenvalues greater than one valuewhen using standardized data matrix gives the value of m (Tatlıdil,1992).

PCA allows users to see the results by themselves rather than asa result of a property provider because the principal componentsare capable of an intermediate step for more extensive investiga-tions. In particular, CA uses principal component scores, whichare fairly common conditions (Özdamar, 2002).

Table 1Analysis data concerning water samples.

Watersamples

Average (lg/L ± StDeva)

Cr Mn Co Ni Cu

d1 0.027 ± 0.07 770.66 ± 4.06 30.21 ± 2.20 2.49 ± 1.25 13.25d2 0.023 ± 0.08 364.75 ± 2.27 554.15 ± 4.20 10.42 ± 2.29 12.92d3 0.020 ± 0.09 331.67 ± 2.37 575.61 ± 4.07 7.41 ± 2.07 13.41d4 0.263 ± 0.27 160.13 ± 1.24 211.67 ± 3.21 8.17 ± 1.97 1.66d5 0.071 ± 0.04 139.58 ± 2.26 123.35 ± 1.23 6.77 ± 1.20 1.14d6 0.081 ± 0.05 311.76 ± 3.17 113.67 ± 1.29 5.27 ± 1.45 2.77d7 0.058 ± 0.04 207.16 ± 2.47 89.53 ± 1.07 7.20 ± 1.86 2.17d8 0.027 ± 0.05 159.02 ± 3.19 21.13 ± 0.97 5.08 ± 1.23 4.74d9 0.010 ± 0.04 303.53 ± 2.17 38.49 ± 1.11 16.76 ± 3.27 3.16d10 0.015 ± 0.03 303.00 ± 3.37 17.84 ± 1.37 16.25 ± 3.17 2.99

a StDev: Standard deviation.

The purpose of CA is to organize observations of a number ofgroups/variables and determine if they share observed properties.PCA is used primarily to determine general relationships betweendata. CA is often coupled with PCA to check results and to groupindividual parameters and variables. A dendrogram is the mostcommonly used method of summarizing hierarchical clustering(Lu et al., 2010). This technique is an unsupervised classificationprocedure that involves a measurement of the similarity betweenobjects to be clustered.

The average of three results and standard deviations of analysesobtained using ICP-MS are shown in Table 1. Using the data in Ta-ble 1, the heavy metals and water samples were classified usingcorrelation analysis, principal component analysis, and clusteranalysis.

In the first stage of this study, the findings obtained by calcu-lating descriptive statistics of selected variables were interpreted(Table 2). In the second stage, correlation analysis was applied todetermine whether the principal component analysis is appropri-ate to standardize data sets (Table 3). The results of the analysisshowed that there are significant correlations between the vari-ables (heavy metals). Therefore, the principal component analy-sis is appropriate. In the third stage of study, the principalcomponent analysis was applied and the eigenvalues, which be-long to the first four components, were found to be greater thanone and explained 91% of the total variance (Table 4). The watersamples were classified in accordance with the findings, graphicswere drawn, and variables (heavy metals) that are effective inthis classification were determined. In the final stage of thestudy, water samples were classified using the Ward algorithmwith a hierarchical clustering analysis (HCA). HCA was used toassess the spatial similarity or dissimilarity in water samplesaccording to heavy metals (Malik and Nadeem, 2011). HCA wascarried out using the first four principal components, explaining91% of the total variance.

The findings obtained by calculating the descriptive statistics ofthe selected variables were given in Table 2. Descriptive statisticsfor the data set of 10 variables were analyzed. For all the variables,positive and strong asymmetry (skewness > 0.5) was found (i.e.,the observation values collected had relatively smaller values thanthe average). The kurtosis values show that the series, except forCu and As, is sharper than normal, which means that the collectionof smaller than average values is higher than normal. These resultsshow that the series is not normally distributed. In addition, thestandard deviation values were very close to the average valuesfor many variables (Mn, Ni, Cu, Zn, and As), and some were evenfound to be higher (Co, Cd, Pb, and Hg). This situation indicatedthat the observation values were significantly different than themean (i.e., the variability was very high). This means that the watersamples taken from different locations were differently character-ized by their heavy metal content.

Zn As Cd Pb Hg

± 2.43 5.34 ± 1.19 1.43 ± 0.37 0.148 ± 0.24 8.77 ± 2.01 2.780 ± 1.27± 2.21 6.26 ± 1.28 10.51 ± 1.17 0.094 ± 0.07 2.84 ± 1.06 0.072 ± 0.04± 2.57 6.35 ± 1.47 13.41 ± 1.26 0.130 ± 0.20 3.12 ± 1.20 0.065 ± 0.05± 0.26 5.34 ± 1.31 8.13 ± 1.29 0.353 ± 0.37 18.15 ± 3.27 0.023 ± 0.08± 0.17 2.36 ± 1.22 4.05 ± 0.31 0.028 ± 0.06 1.32 ± 0.27 0.021 ± 0.07± 0.22 3.96 ± 1.26 4.63 ± 0.28 0.864 ± 0.44 2.08 ± 1.03 0.013 ± 0.06± 0.19 3.62 ± 1.19 8.46 ± 0.27 0.231 ± 0.27 6.36 ± 1.19 0.016 ± 0.03± 1.57 2.87 ± 1.07 4.34 ± 0.21 0.045 ± 0.05 0.852 ± 0.47 0.020 ± 0.06± 1.20 23.91 ± 2.29 4.06 ± 0.19 0.027 ± 0.03 0.019 ± 0.04 0.019 ± 0.04± 1.24 21.86 ± 2.24 1.04 ± 0.16 0.020 ± 0.03 0.595 ± 0.37 0.008 ± 0.02

Page 4: Chemometric evaluation of the heavy metals distribution in waters from the Dilovası region in Kocaeli, Turkey

Table 2Descriptive Statistics for Cr, Mn, Co, Ni, Cu, Zn, As, Cd, Pb, Hg.

Variable Mean StDev Minimum Median Maximum Range Skewness Kurtosis

Cr 0.0595 0.0756 0.0100 0.0270 0.2630 0.2530 2.59 7.23Mn 305.1 182.7 139.6 303.3 770.7 631.1 2.04 5.21Co 177.6 212.7 17.8 101.6 575.6 557.8 1.47 0.71Ni 8.58 4.67 2.49 7.31 16.76 14.27 0.94 0.08Cu 5.82 5.18 1.14 3.08 13.41 12.27 0.91 �1.28Zn 8.19 7.88 2.36 5.34 23.91 21.55 1.67 1.27As 6.01 3.99 1.04 4.49 13.41 12.37 0.61 �0.43Cd 0.1940 0.2581 0.0200 0.1120 0.8640 0.8440 2.31 5.75Pb 4.41 5.55 0.0190 2.46 18.15 18.13 1.99 4.15Hg 0.304 0.870 0.00800 0.0205 2.780 2.772 3.16 9.98

Table 3Pearson’s correlations matrix for heavy metal contents in water samples.

Cr Mn Co Ni Cu Zn As Cd Pb

Mn �0.358Co �0.004 0.000Ni �0.198 �0.235 �0.031Cu �0.417 0.701b 0.627 �0.255Zn �0.319 0.080 �0.244 0.907a 0.149As 0.162 �0.250 0.875a �0.100 0.380 �0.349Cd 0.417 �0.003 �0.028 �0.373 �0.208 �0.342 0.060Pb 0.841a 0.082 0.060 �0.325 �0.019 �0.329 0.201 0.246Hg �0.156 0.901a �0.220 �0.461 0.524 �0.133 �0.384 �0.068 0.276

Cell contents: Pearson correlation.a Correlation is significant at the 0.01 level (2-tailed).b Correlation is significant at the 0.05 level (2-tailed).

Table 4PCA results.

Heavy metals Component

1 2 3 4

Cr �0.227355 �0.441023 �0.220162 0.398191Mn �0.198878 0.520239 �0.126453 0.188798Co �0.253415 �0.028459 0.577028 0.123304Ni 0.477632 �0.024688 0.187752 0.423878Cu �0.291689 0.417222 0.325137 0.097179Zn 0.465248 0.147935 0.029839 0.439124As �0.262640 �0.197216 0.529704 0.080615Cd �0.228365 �0.231913 �0.206578 �0.210712Pb �0.357822 �0.200684 �0.214403 0.578360Hg �0.256323 0.452239 �0.302476 0.152479Eigenvalue 2.9707 2.7472 2.2385 1.139Explained variance (%) 29.7 25.5 22.4 11.4Cumulative (%) 29.7 57.2 79.6 91.0

Table 5PCA scores sorted according to the first and second main component.

Water samples PC1 Water samples PC2

d10 2.89167 d1 3.50864d9 2.87429 d2 0.66222d5 0.50237 d10 0.64893d8 0.38607 d9 0.59746d7 �0.22454 d3 0.45042d6 �0.67878 d8 �0.17656d2 �0.76770 d5 �0.79765d3 �1.31706 d7 �1.00550d4 �1.55420 d6 �1.00645d1 �2.11212 d4 �2.88151

D. Bingöl et al. / Marine Pollution Bulletin 68 (2013) 134–139 137

In multivariate statistical analyzes, the statistical analysis mustbe used to standardize data instead of the original data when themeasurement units and variability of variables investigated weredifferent. However, the standard deviation values for the studywere quite different. The original data matrix was standardizedusing the following equation: z = (x � l)/r, and subsequent ana-lyzes were based on a standardized data matrix.

The Person’s correlation coefficients for 10 heavy metals arepresented in Table 3. The positive and negative correlation coeffi-cients indicate positive and negative correlations respectively, be-tween the two metals. A significantly positive correlation atp < 0.01 was found between the heavy metal pairs Ni–Zn (0.907),Co–As (0.875), Cr–Pb (0.841), and Mn–Hg (0.91). In addition, Mnis positively correlated with Cu at p < 0.05, and Cd is not correlatedto the other elements. There are statistically significant and high

correlations between variables. Therefore, the application of PCAto the data set is significant in eliminating the dependence struc-ture and/or reducing size.

Two pieces of information are connected, namely, geographyand concentration. So, in many areas of multivariate analysis, oneaim may be to connect the samples (e.g., geographical location/sampling site), which are represented by the scores to the variables(e.g., chemical measurements), which are represented by loadings(Brereton, 2007).

PCA was applied to the entire data set (Table 4). Due to the stan-dardized data, the correlation matrix was used for the analysis. ThePCA results are summarized in Table 4. The principal componentsthat have eigenvalues higher than one were extracted. The resultsindicate that there were four eigenvalues higher than one. The firstcomponent explains 29.7% of the total variance and loads heavilyon Ni and Zn. The second component, dominated Mn, Cu and Hg,accounts for 25.5% of the total variance. The third component isloaded by Co, Cu, and As, accounting for 22.4% of the total variance.The fourth component is dominated by Cr, Ni, Zn, and Pb, account-ing for 11.4% of the total variance.

Page 5: Chemometric evaluation of the heavy metals distribution in waters from the Dilovası region in Kocaeli, Turkey

First Component

Seco

nd C

ompo

nent

3210-1-2

4

3

2

1

0

-1

-2

-3

d10d9

d8

d7d6 d5

d4

d3 d2

d1

Score Plot of Cr; ...; Hg

Fig. 2. PCA score plot.

First Component

Seco

nd C

ompo

nent

0,50,40,30,20,10,0-0,1-0,2-0,3-0,4

0,50

0,25

0,00

-0,25

-0,50

Hg

Pb CdAs

Zn

Cu

NiCo

Mn

Cr

Loading Plot of Cr; ...; Hg

Fig. 3. PCA loading plot.

Fig. 4. Three way PCA score plot.

Observations

Sim

ilarit

y

d10d9d6d7d8d5d4d3d2d1

-34,40

10,40

55,20

100,00

Dendrogram with Ward Linkage and Euclidean Distance

Fig. 5. Dendrogram results obtained from Ward linkage method.

138 D. Bingöl et al. / Marine Pollution Bulletin 68 (2013) 134–139

PCA scores according to the first principal component as deter-mined by Ni and Zn content, and the second principal component,as determined by Mn, Cu, and Hg, are listed in Table 5.d10 to d9and d5 to d8 water samples listed in the top formed a cluster,and the d1 water sample was listed in the bottom when listingby the first main component. The d1 water sample was listed atthe top with a large score difference over the second componentin Mn, Cu and Hg content. According to the analysis of the d1 watersample, these three heavy metals are highly effective in this loca-tion. By plotting the principal component, the inter-relationshipsbetween different variables can be viewed, and interpreted forsample patterns, groupings, similarities, or differences (Kara,2009). The first and second principal component usually includesa large portion of the total variance; therefore, the first two princi-ple components (PC1 and PC2) are plotted against each other, andclustering of samples is possible in the effects of all variables with-in the two-dimensional plane. The PC1 and PC2 score vectors andthe PC1 and PC2 loading vectors from PCA are plotted against eachother in Figs. 2 and 3, respectively. The first two components ex-plain 57.2% of the variation in the data set.

When Figs. 1 and 2 are evaluated, d9 and d10 samples areshown to be characterized by Ni and Zn metals; the d1 sample isonly characterized by Cu, Hg, and Mn metals; and the d4 sampleis only characterized a Cr metal a cluster. The d2 and d3 watersamples were mainly characterized by Co metal and Cu, Co, Mn,and Hg metals in low levels. The d6 sample was situated nearthe center and was slightly affected by Cd, As and Pb metals. Sim-ilarly, the d5, d7, and d8 samples were situated close to the centerand were not characterized by any metal.

A biplot involves a superimposition of scores and a loadingsplot, with the variables and samples represented on the same dia-gram. Fig. 4 shows a graph of the PC1, PC2, and PC3 score vectorsfrom PCA against each other. The first three components explain79.6% of the total variation in the data set.

Fig. 4 was plotted according to the first three main components,which showed that water samples taken from different locationsgave similar results to the results of a two-dimensional graphand usually consisted of five groups.

Group 1: d1.Group 2: d2 d3.Group 3: d9 d10.Group 4: d5 d6 d7 d8.Group 5: d4.

At this stage of the study, the aim is to make HCA and the clas-sification of the water samples from different locations, and tocompare the findings obtained with the results obtained from

the PCA. HCA was applied to the score vectors obtained fromPCA. The score vectors of the first four principal components,explaining 91% of the total change, were used in the analysis.The measurement is based on squared Euclidean distance. In thisstudy, the clustering method used was the Ward linkage method.Dendrogram obtained from the Ward linkage method is shown in

Page 6: Chemometric evaluation of the heavy metals distribution in waters from the Dilovası region in Kocaeli, Turkey

D. Bingöl et al. / Marine Pollution Bulletin 68 (2013) 134–139 139

Fig. 5. The Dendrogram results show that the appropriate numberof clusters was five, which was similar to the PCA results.

Chemometric methods were applied to classify water accordingto heavy metal contents. The chemometric evaluation showed thata relationship exists between their heavy metal contents and thewater samples from the Dilovası area. The water samples and hea-vy metals were classified into five groups by PCA and a clusteranalysis. From the chemometric evaluation of heavy metal content,the first group contains only the d1 sample, the second group con-tains the d2 and d3 samples, the third group consists of the d9 andd10 samples, the fourth group is composed of the d5, d6, d7 and d8samples, and the fifth group only contains the d4 water sample.

References

Brereton, R.G., 2007. Applied Chemometrics for Scientists. John Wiley & Sons Ltd.,England.

Cai, L., Xu, Z., Ren, M., Guo, Q., Hu, X., Hu, G., Wan, H., Peng, P., 2012. Sourceidentification of eight hazardous heavy metals in agricultural soils of Huizhou,Guangdong Province. China Ecotoxicol. Environ. Saf. 78, 2–8.

Chen, Z., Saito, Y., Kanai, Y., Wei, T., Li, L., Yao, H., Wang, Z., 2004. Low concentrationof heavy metals in the Yangtze estuarine sediments, China: a diluting setting.Estuar. Coast. Shelf Sci. 60, 91–100.

Cobelo-García, A., Prego, R., 2003. Heavy metal sedimentary record in a Galician Ria(NW Spain): background values and recent contamination. Mar. Pollut. Bull. 46,1253–1262.

Demiray, A.D., Yolcubal, I., Akyol, N.H., Cobanoglu, G., 2012. Biomonitoring ofairborne metals using the Lichen Xanthoria parietina in Kocaeli Province.Turkey Ecol. Indic. 18, 632–643.

Dıraman, H., Özdemir, D., Hıs�ıl, Y., 2009. Ayvalık zeytin çes�idinden üretilen erkenhasat natürel zeytinyaglarının yag asitleri biles�enlerine göre kemometrikkarakterizasyonu. Gıda Teknolojileri Elektronik Dergisi 4 (3), 1–11.

Ersungur, S�.M., Kızıltan, A. & Polat, Ö., 2007. Türkiye’de bölgelerin sosyo-ekonomikgelis�mis�lik sıralaması:Temel biles�enler analizi. Atatürk Üniversitesi _Iktisadi ve_Idari Bilimler Fakültesi Dergisi. Erzurum.

Fang, T.H., Hong, E., 1999. Mechanisms influencing the spatial distribution of tracemetals in surficial sediments of the south–western Taiwan. Mar. Pollut. Bull. 38(11), 1026–1037.

Franco-Uría, A., López-Mateo, C., Roca, E., Fernández-Marcos, M.L., 2009. Sourceidentification of heavy metals in pastureland by multivariate analysis in NWSpain. J. Hazard. Mater. 165, 1008–1015.

Idris, A.M., 2008. Combining multivariate analysis and geochemical approaches forassessing heavy metal level in sediments from Sudanese harbors along the RedSea coast. Microchem. J. 90, 159–163.

Kara, D., 2009. Evaluation o trace metal concentrations in some herbs and herbalteas by principal component analysis. Food Chem. 114, 347–354.

Karademir, A., 2006. Evaluation of the potential air pollution from fuel combustionin industrial boilers in Kocaeli. Turkey Fuel 85 (12–13), 1894–1903.

Karadas, C., Kara, D., 2012. Chemometric approach to evaluate trace metalconcentrations in some spices and herbs. Food Chem. 130, 196–202.

Klavins, M., Briede, A., Rodinov, V., Kokorite, I., Parele, E., Klavina, I., 2000. Heavymetals in rivers of Latvia. Sci. Total Environ. 262, 175–183.

Kolasani, A., Xu, H., Millikan, M., 2011. Evaluation of mineral content of Chinesemedicinal herbs used to improve kidney function with chemometrics. FoodChem. 127, 1465–1471.

Liu, X., Xue, C., Wang, Y., Li, Z., Xue, Y., Xu, J., 2012a. The classification of seacucumber (Apostichopus japonicus) according to region of origin using multi-element analysis and pattern recognition techniques. Food Control 23, 522–527.

Liu, X., Xue, C., Wang, Y., Li, Z., Xu, J., 2012b. The classification of sea cucumber(Apostichopus japonicus) according to region of origin using multi-elementnalysis ad pattern recognition techniques. Food Control 23, 522–527.

Lu, X., Wang, L., Li, L.Y., Lei, K., Huang, L., Kang, D., 2010. Multivarite staisticalanalysis of heavy metals in street dust of Baoji. NW China J. Hazard. Mater. 173,744–749.

Malik, R.N., Nadeem, M., 2011. Spatial and temporal characterization of traceelements and nutrients in the Rawal Lake Reservoir, Pakistan using multivariateanalysis techniques. Environ. Geochem. Health 33, 525–541.

Özdamar, K., 2002. Paket Programlar ile _Istatistiksel Veri Analizi 2. Kaan Kitabevi.Eskis�ehir.

Pekey, H., 2006. The distribution and sources of heavy metals in Izmit Bay surfacesediments affected by a polluted stream. Mar. Pollut. Bull. 52, 1197–1208.

Tam, N.F.Y., Wong, Y.S., 2000. Spatial variation of heavy metals in surface sedimentsof Hong Kong Mangrove Swamps. Environ. Pollut. 110, 195–205.

Tatlıdil, H., 1992. Uygulamalı Çok Degis�kenli Istatistiksel Analiz. Ankara.Telli-Karakoç, F., Tolun, L., Henkelmann, B., Klimm, C., Okay, O., Schramm, K.W.,

2002. Polycyclic aromatic hydrocarbons (PAHs) and polychlorinated biphenyls(PCBs) distributions in the Bay of Marmara Sea: Izmit Bay. Environ. Pollut. 119,383–397.

Tokalıoglu, S�., Kartal, S�., 2006. Multivariate analysis of the data and speciation ofheavy metals in street dust samples from the Organized Industrial District inKayseri (Turkey). Atmos. Environ. 40, 2797–2805.

Tokalıoglu, S�., 2012. Determination of trace elements in commonly consumedmedicinal herbs by ICP-MS and multivariate analysis. Food Chem. http://dx.doi.org/10.1016/j.foodchem.2012.04.093.

Tuncer, M., 2009. National Cancer Program 2009–2015. Republic of Turkey Ministryof Health, ISBN 978-975-590-285-2.

Yaylalı-Abanuz, G., 2011. Heavy metal contamination of surface soil around Gebzeindustrial area. Turkey Microchem. J. 99, 82–92.

Yuan, C., Shi, J., Lui, J., Liang, L., Jiang, G., 2004. Speciation of heavy metals in marinesediments from the East China Sea by ICP-MS with sequential extraction.Environ. Int. 30, 769–783.

Yücel, Y., Sultanoglu, P., 2012. Determination of industrial pollution effects on citrushoneys with chemometric approach. Food Chem. http://dx.doi.org/10.1016/j.foodchem.2012.04.061.


Recommended