+ All Categories
Home > Documents > Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

Date post: 25-Jan-2017
Category:
Upload: kim-h
View: 215 times
Download: 2 times
Share this document with a friend
14
ORIGINAL PAPER Iodine concentrations in Danish groundwater: historical data assessment 1933–2011 Denitza Dimitrova Voutchkova Søren Munch Kristiansen Birgitte Hansen Vibeke Ernstsen Brian Lyngby Sørensen Kim H. Esbensen Received: 20 August 2013 / Accepted: 13 May 2014 Ó Springer Science+Business Media Dordrecht 2014 Abstract In areas where water is a major source of dietary iodine (I), the I concentration in drinking water is an important factor for public health and epidemi- ological understandings. In Denmark, almost all of the drinking water is originating from groundwater. Therefore, understanding the I variation in groundwa- ter and governing factors and processes are crucial. In this study, we perform uni- and multivariate analyses of all available historical Danish I groundwater data from 1933 to 2011 (n = 2,562) to give an overview on the I variability for first time and to discover possible geochemical associations between I and twenty other elements and parameters. Special attention is paid on the description and the quality assurance of this complex compilation of historical data. The high variability of I in Danish groundwater ( \ d.l. to 1,220 lg/l) is characterised by both small-scale het- erogeneity and large-scale spatial trends, e.g. higher concentrations observed in the eastern part of Den- mark. Significant trends are observed also with respect to the depth of extraction and geology, indicating the importance of older marine limestone and chalk deposits. A principal component analysis on centred log-ratio-transformed data (clr) revealed associations between I, Li, B, Ba, Br implying saline water influence. High I is also associated with reduced and alkaline groundwaters for this data set, dominated by Ca–HCO 3 water type. Keywords Iodine Groundwater Denmark Multivariate data analysis Compositional data Introduction Iodine is an essential component of human thyroid hormones regulating the metabolic processes in cells and playing role in the early development of most organs. Too low I intake, i.e. \ 150 lg/day for adults, can result in a variety of iodine disorder diseases (IDD) as mental retardation, goitre, and hypothyroidism, while chronically high intake can also cause health problems, such as I induced hyperthyroidism, elevated goitre, and subclinical hypothyroidism (WHO 2007). Electronic supplementary material The online version of this article (doi:10.1007/s10653-014-9625-4) contains supple- mentary material, which is available to authorized users. D. D. Voutchkova (&) S. M. Kristiansen Department of Geoscience, Aarhus University, Høegh-Guldbergs Gade 2, 8000 Aarhus C, Denmark e-mail: [email protected] D. D. Voutchkova B. Hansen B. L. Sørensen Geological Survey of Denmark and Greenland (GEUS), Lyseng. Alle ´ 1, 8270 Højbjerg, Denmark V. Ernstsen K. H. Esbensen Geological Survey of Denmark and Greenland (GEUS), Øster Voldgade 10, 1350 Copenhagen K, Denmark K. H. Esbensen ACABS Research Group, Aalborg University, Campus Esbjerg, 6700 Esbjerg, Denmark 123 Environ Geochem Health DOI 10.1007/s10653-014-9625-4
Transcript
Page 1: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

ORIGINAL PAPER

Iodine concentrations in Danish groundwater: historicaldata assessment 1933–2011

Denitza Dimitrova Voutchkova • Søren Munch Kristiansen •

Birgitte Hansen • Vibeke Ernstsen • Brian Lyngby Sørensen •

Kim H. Esbensen

Received: 20 August 2013 / Accepted: 13 May 2014

� Springer Science+Business Media Dordrecht 2014

Abstract In areas where water is a major source of

dietary iodine (I), the I concentration in drinking water

is an important factor for public health and epidemi-

ological understandings. In Denmark, almost all of the

drinking water is originating from groundwater.

Therefore, understanding the I variation in groundwa-

ter and governing factors and processes are crucial. In

this study, we perform uni- and multivariate analyses

of all available historical Danish I groundwater data

from 1933 to 2011 (n = 2,562) to give an overview on

the I variability for first time and to discover possible

geochemical associations between I and twenty other

elements and parameters. Special attention is paid on

the description and the quality assurance of this

complex compilation of historical data. The high

variability of I in Danish groundwater (\d.l. to

1,220 lg/l) is characterised by both small-scale het-

erogeneity and large-scale spatial trends, e.g. higher

concentrations observed in the eastern part of Den-

mark. Significant trends are observed also with respect

to the depth of extraction and geology, indicating the

importance of older marine limestone and chalk

deposits. A principal component analysis on centred

log-ratio-transformed data (clr) revealed associations

between I, Li, B, Ba, Br implying saline water

influence. High I is also associated with reduced and

alkaline groundwaters for this data set, dominated by

Ca–HCO3 water type.

Keywords Iodine � Groundwater � Denmark �Multivariate data analysis � Compositional data

Introduction

Iodine is an essential component of human thyroid

hormones regulating the metabolic processes in cells

and playing role in the early development of most

organs. Too low I intake, i.e.\150 lg/day for adults,

can result in a variety of iodine disorder diseases (IDD)

as mental retardation, goitre, and hypothyroidism, while

chronically high intake can also cause health problems,

such as I induced hyperthyroidism, elevated goitre, and

subclinical hypothyroidism (WHO 2007).

Electronic supplementary material The online version ofthis article (doi:10.1007/s10653-014-9625-4) contains supple-mentary material, which is available to authorized users.

D. D. Voutchkova (&) � S. M. Kristiansen

Department of Geoscience, Aarhus University,

Høegh-Guldbergs Gade 2, 8000 Aarhus C, Denmark

e-mail: [email protected]

D. D. Voutchkova � B. Hansen � B. L. Sørensen

Geological Survey of Denmark and Greenland (GEUS),

Lyseng. Alle 1, 8270 Højbjerg, Denmark

V. Ernstsen � K. H. Esbensen

Geological Survey of Denmark and Greenland (GEUS),

Øster Voldgade 10, 1350 Copenhagen K, Denmark

K. H. Esbensen

ACABS Research Group, Aalborg University, Campus

Esbjerg, 6700 Esbjerg, Denmark

123

Environ Geochem Health

DOI 10.1007/s10653-014-9625-4

Page 2: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

Iodine in the human body originates mainly through

dietary intake including water or through inhalation of

atmospheric I (minor contribution). The air I concen-

tration is low: 10–20 ng/m3 (Hou 2009). Drinking

water is generally not considered a significant major

contributor to the dietary I intake as it usually provides

close to 10 % only (Fuge 2005). However, in

Denmark, where drinking water originates from

groundwaters, 25 % of the I intake in the average diet

was derived from drinking water and other beverages

(some containing high proportions of processed

groundwater) prior to recent mandatory I fortification

of table salt (Rasmussen et al. 2000). In 1998, a

voluntary programme for using iodised salt started in

Denmark (Rasmussen et al. 2002; Laurberg et al.

2006) aiming to increase the average dietary I intake

with 50 lg/day (Laurberg et al. 2006). Two years

later, the voluntary programme was found insufficient

and changed to mandatory in 2001. Since then, I is

added to the salt used in households and in production

of cakes and bread. A more recent study on the dietary

habits in Denmark (n = 4,431) shows that after the

fortification, I from drinking water and other bever-

ages represents 14 % of the dietary intake (Pedersen

et al. 2010). In such areas (as Denmark) where

groundwater is a major dietary source, a thorough

understanding of I variation in groundwater is of

importance for understanding epidemiology.

The I geochemical cycle is a spatially and tempo-

rally dynamic system, where different biotic and

abiotic processes are taking a part. In the hydrogeo-

chemical cycle, I is predominantly in oxidation state

-1 (iodide, I-) and ?5 (iodate, IO3-); and, next to

these inorganic forms, I is also found in different

organic species, which can also contribute a significant

fraction of the total I (Hu et al. 2009).

Iodine is one of the most abundant micronutrients in

sea water, with total concentration of 50–60 lg/l (Ito

and Hirokawa 2009) while I for coastal waters around

Denmark was found to vary seasonally with averages

from 32 to 89 lg/l total I (Truesdale et al. 2003).

Fuge (2005) found 2 lg/l total I in the precipitation

from the interior of the UK with up to 5 lg/l for coastal

precipitation. Similar values are found in rain water

collected in Germany, averaging 2.2 ± 0.8 lg/l

(Gilfedder et al. 2009) and in Denmark, range

0.78–2.70 lg/l (Hou et al. 2009).

Contents of I in Danish aquifer sediments have not

been studied but marine deposits are generally found

to be enriched in I relative to terrestrial sediments,

i.e. I content decreases in the order: deep-sea clays

(3.9 mg/kg) [ organic C-rich shales (near shore,

0.2–6.2 mg/kg) [ limestone (near shore, 2.5 mg/kg,

which is 1/10th of the I found in deep-sea carbonates,

(Muramatsu and Wedepohl 1998)) [ sandstone

(123 lg/kg) [ magmatic rocks (4–8 lg/kg), based

on Muramatsu and Wedepohl (1998) measurements.

From the study on 884 European bottled spring and

mineral waters, I concentrations were found to cover

three to four orders of magnitude, from \0.2 to

4,030 lg/l with median 4.78 lg/l (Reimann and Birke

2010). Many of the high values were traced to deep

formation waters, while one I rich bottled water from

Norway was influenced by water from a recent marine

clay deposit (Reimann and Birke 2010).

Existing data on I in Danish drinking water (close to

100 % originating from groundwater) are limited to

nationwide studies based on very few sampling

locations: tap water from 55 sites (K. M. Pedersen

et al. 1999), 40 sites (Rasmussen et al. 2000), and 47

sites (Saxholt et al. 2008), and drinking water from 22

waterworks (Andersen et al. 2002). This contemporary

data set shows that I concentration in the drinking

water in Denmark varies from 0.7 to 140 lg/l.

However, there is one order of magnitude difference

between the highest values reported by K. M. Pedersen

et al. (1999), Andersen et al. (2002), Saxholt et al.

(2008) and the one found by Rasmussen et al. (2000).

According to Andersen and Lauberg (2009), there is a

distinguishable difference between East and West

Denmark with respect to I content in drinking water.

Even though such regional variations of I content in

the Danish drinking water are known, there have been

no detailed geochemical studies related to I.

In the review of Whitehead (1984), the I concen-

tration in groundwaters from limestone and chalk

aquifers in UK was below 5 lg/l; however, some

iodine-enriched groundwaters ([50 lg/l) were also

present and attributed to possible recent sea water

intrusion, old saline water, or thermal water influence.

Some of the highest I concentrations in groundwaters

reported globally with 129 ± 3 mg/l, 2000 times

higher than the sea water concentration level, are

found in Japanese wells influenced by brine (Mura-

matsu et al. 2001). As far as we know, there are no

published studies on I in the Danish groundwater.

However, historical data on I in the groundwater can

be found in the Danish nationwide geological and

Environ Geochem Health

123

Page 3: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

hydrological database (Jupiter). Therefore, the aim of

the present study of I in Danish groundwater was 1) to

give for the first time an overview on the existing

groundwater I data with focus on the spatial variation,

geological setting, and depth of extraction and 2) by

using a multivariate analysis to identify geochemical

associations between I and other variables in order to

elucidate the governing factor(s) for the spatial

variability.

Methods and materials

Data

The groundwater data used in this study are taken

from the Danish public nationwide geological and

hydrological database, Jupiter (GEUS 2011). All

groundwater samples analysed for I together with

other supplementary information were downloaded

from Jupiter on 24th of November 2011. The supple-

mentary information consisted of: location (ground-

water sampling screen, i.e. well ID number and

geographical coordinates, screen number, screen

depth—top and bottom); when the samples were

collected (date); dominating geological setting at the

screen (given with % from the length of the screen and

type of geological setting, see Supplementary materi-

als 5); used analytical method (for each geochemical

element); samples with concentrations below detec-

tion limits (assigned an attribute ‘\x’, where x is a

specific detection limit); and objective of analytical

determination.

For the purpose of a multivariate study, all available

data were used not only for I, but also for possible co-

explanatory geochemically and geologically relevant

major and trace elements and parameters. Figure 1

presents the number of available samples analysed for I,

and the following relevant variables: free CO2 (called

further Agg.CO2), B, Ba, Br, Ca, CH4, Cl, Fe, F, H2S,

HCO3, I, K, Li, Mg, Mn, Na, NO3, O2, PO4, SO4, Sr,

conductivity, NVOC, pH, and redox values. The

charges are not included as not for all listed analytes

the exact speciation is available in the Jupiter database.

Two variables used in the multivariate analysis are

not provided by Jupiter: ‘distance to coast line’ and

‘distance to major faults’. The ‘Near 3D’ tool of

ArcMap10.0 (ESRI 1999–2010) was used to calculate

the horizontal distance from each sample location

(well X, Y coordinates provided from Jupiter data set)

to both the coastline and/or to the nearest major

geological fault lines. The distances to the present

coast line are calculated as distances to the closest sea

or fjord, after a specific data control was made, and all

derived distance values were manually checked for

calculation mistakes due to complex coastline shapes.

The distance to faults was measured as distance to the

closest of the major fault lines described by Hakansson

and Pedersen (1992).

Data sets preparation and pre-treatment

Master data set and quality assessment

After the raw data were extracted from the database, a

master data set (MDS) was prepared which consisted

Fig. 1 Number of samples analysed for iodine and 25 co-joint

variables in the master data set based on data extracted from the

Jupiter database (variables excluded in the preparation of the

master data set for multivariate analysis are also indicated;

r-MDS stands for reduced master data set)

Environ Geochem Health

123

Page 4: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

of 28 variables (I, 25 geochemical elements and

parameters, distance to the coast, and distance to major

faults) for a grand total of 2,562 objects. The data set is

characterised by many missing values because far

from all groundwater samples have been analysed for

all of the analytes (Fig. 1). The supplementary infor-

mation (Id well number, sample number, screen

number, X and Y coordinates, laboratory methods,

sampling date, geology, detection limits, and purpose,

etc.) was used for a thorough initial data quality

evaluation. The highly significant diversity of I data

quality due to many different analytical methods, as

well the possible errors in the database entries with

respect to I speciation, is discussed in full detail in

Supplementary materials 1. The MDS was first

checked for gross outliers by manual exploration for

unusually high values. As a result, a few additional

unit errors were also eliminated (Supplementary

materials 2). Values below detection limit were

substituted with 0.75 of the detection limit, following

standard geochemical traditions. Details concerning

different detection limits for I and all other analytes, as

well as the proportions of censored values and missing

data are given in Supplementary materials 3. The

Fig. 2 Spatial distribution

of the master data set (MDS)

samples/objects and the

reduced master data set (r-

MDS) samples; in the upper

right corner, the Danish

administrative regions are

given (to facilitate the

understanding of the Results

part); in the lower part of the

figure are given: (left)—

probability plot of the I

concentrations from the

r-MDS; (right)—probability

plot of all I data in the MDS,

with samples grouped by

geographical region (East

vs. West)

Environ Geochem Health

123

Page 5: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

spatial distribution of the MDS objects (samples) can

be seen on Fig. 2.

Preparation and pre-treatment of the data

for multivariate analysis

Two different operations were needed in order to

prepare the data for multivariate analysis. First, the

size of the MDS was reduced due to the many missing

values in the matrix, and second, the closed array

problem was addressed by applying the centred log-

ratio (clr) transformation of the compositional data

(see further below for details).

As seen from Fig. 1, the number of analyses for the

potentially co-varying other 25 elements varies from

65 (O2) to 2,131 (B). The requirement for proper

multivariate data analysis (MVDA) is that the pro-

portion of samples that does not contain missing

information for all (or nearly all) variables should be

as high as possible. Thus, the original MDS had to be

sensibly reduced so as to present a suitable low

proportion of missing values throughout, which

should be as randomly distributed over the data matrix

as possible. While this configuration is desirable, in

practice, it is very much the existing variable coverage

in the database that brackets such endeavours. Thus,

reduction of the MDS was carried out in two steps: (1)

reduction in the number of variables (analytes), i.e.

deletion of variables with a too high proportion of

adversely distributed missing values (major, trace

elements, and field measurements) followed by (2)

reduction of remaining objects (samples) still with a

too high proportion of missing values. In the first step,

eight variables were excluded (Agg.CO2, CH4, F, H2S,

NVOC, O2, PO4, redox value), based on the insuffi-

cient number of co-existing analyses, Fig. 1. The

choice of variables (and the exclusion of variables)

can limit interpretation of the MVDA results by failing

to recognise potentially important processes or by

lacking sufficient data evidence due to omitted

variables, necessitating a balance between the con-

trasting requirements. After removing eight variables,

there were still too many missing values in the data

matrix, so further reduction of the objects was also

necessary, with the primary objective of keeping as

many as possible of high(er) I values. All MDS

reductions were made according to well-established

general chemometric rules and procedures (Esbensen

2010).

This clean-up resulted in a reduced MDS (r-MDS),

which consists of 20 variables (B, Ba, Br, Ca, Cl, Fe,

HCO3, I, K, Li, Mg, Mn, Na, NO3, SO4, Sr,

conductivity, distance to coastline, distance to faults,

and pH) and 506 objects, representing 20 % of the

original number of objects. Although an apparently

dramatic reduction in the number of analyses

(2,562 ? 506) available for MVDA, it is a necessary

reflection of the very disparate analytical coverage

displayed in the database and the resulting covariance

matrix coverage, Fig. 1. For the purpose of a first

overview of the fullest multivariate data structure,

however, r-MDS is the optimal co-joint data set that

can be obtained from the Jupiter database. The spatial

distribution of the r-MDS objects (samples) can be

seen on Fig. 2. Most of them are located in Jutland and

part of Zealand. An overview on the number of

censored data (\d.l.) as well as missing values per

variable in the r-MDS is given in the Supplementary

materials 3. The missing values were substituted with

the arithmetic mean of the respective variable columns

in the matrix (neutral with respect to PCA) in order to

be able to address the closure problem of the

compositional data in a rational fashion.

Compositional data are multivariate data in which

components (variables, analytes) are part of a whole,

i.e. variables sum to a constant (often 100 %) (Paw-

lowsky-Glahn and Egozcue 2006). The main charac-

teristic of compositional data is that the variables are

not free to range from ‘-’ infinity to ‘?’ infinity

within the data analytical Euclidian space. They

instead occupy a restricted space, called a ‘simplex’,

having its own metric, which follows other rules than

Euclidian geometry (Buccianti and Pawlowsky-Glahn

2005). Two characteristics of compositional data are

especially important from the multivariate data ana-

lytical and interpretational (practical) point of view:

Closed variables are not free to vary independently

because of the constant sum constraint, and only their

relative magnitudes and variations (not absolute

values/concentrations) are to be used for data analysis

and interpretations. In order to overcome this ‘closure’

problem, three different log-ratio transformations

(additive, centred, and isometric) have been intro-

duced in geochemistry and data analysis over the last

two decades (see review by Pawlowsky-Glahn and

Egozcue (2006)). Buccianti and Pawlowsky-Glahn

(2005) demonstrated that the centred log-ratio (clr) is

an appropriate method for water chemistry data, and

Environ Geochem Health

123

Page 6: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

consequently the clr transformation was chosen for our

study. In order to transform the data, for the n-part

composition x (x is one sample/object, analysed for n

compounds), the clr(x) coefficient/value, written as

vector with n parts, is given as:

clr xð Þ ¼ clr x1; x2; . . .; xn½ �¼ ln x1=gð Þ; ln x2=gð Þ; . . .; ln xn=gð Þ½ �;

where g is the geometric mean of the sample

composition g = (x1, x2, …,xn)1/x.

There are few challenges associated with applying

this type of transformation to non-complete data sets

representing compilation of historical data (see

Results and discussion). The clr transformation was

performed only on the compositional variables (B,

Ba, Br, Ca, Cl, Fe, HCO3, I, K, Li, Mg, Mn, Na, NO3,

SO4, and Sr). Prior to the clr transformation, all lg/l

were recalculated to mg/l (the actual sample densities

are unknown, so density of 1 kg/l is assumed). The

four additional non-compositional variables (con-

ductivity, distance to coastline, distance to faults, and

pH) were centred with their arithmetic means and

weighted by the factor 1/STD before the PCA, known

as auto-scaling in multivariate data analysis (Esben-

sen 2010).

Data analysis

The univariate data analysis (based on means, per-

centiles, probability plots, and box plots) was done on

all iodine samples (n = 2,562) from the MDS after the

substitution of the values below detection limit (see

previous part).

The compositional variables were clr transformed

with the open source software CoDaPack (Comas-

Cufı and Thio-Henestrosa 2011); the multivariate data

analysis (PCA) was performed on the pre-treated

r-MDS (506 9 20) (see previous part) using the

Unscrambler 10.1 software package (CAMO 2011).

Principal component analysis (PCA) is used for its

suitability for simultaneous multivariate data descrip-

tion and structure exploration, notably for the possi-

bility of discovering grouping of samples and/or

variables which may otherwise be swamped in indi-

vidual sampling and analytical errors if only studied

univariately, the so-called full-spectrum advantage

(Esbensen 2010). The objective of the present PCA

was to explore the overall data structure and to

identify geochemical associations between iodine and

other major and minor ions. The nonlinear iterative

projections by alternating least-squares (NIPALS)

algorithm is used. For detailed explanation on the

methods NIPALS and PCA, see (Esbensen 2010) and

further background references herein.

Results and discussion

Iodine concentration and spatial variation

in Danish groundwater

The groundwater data included in this master data set

contain samples analysed for I (n = 2,562) obtained

from year 1933 to year 2011. Iodine has been analysed

irregularly: Less than 2 % of the samples are taken

before 1990 and the year with the most analyses is

2005 (n = 922), representing more than 35 % of all I

analyses for the studied period. About half of the

samples (50.11 %) are taken as part of the Danish

Groundwater Monitoring Programme (GRUMO). The

purpose of GRUMO is to document quantitatively and

qualitatively the groundwater status and trends so the

effect of the national action plans on the aquatic

environment can be evaluated (Hansen et al. 2011).

Until 2011, iodine was not part of the analytical

programme of GRUMO. A total of 74 clustered

catchments with area 5–50 km2 each containing 25

wells are included in the GRUMO monitoring

programme (Hansen et al. 2011), and this is reflected

by the spatial distribution of the samples part of our

data set, which are from 975 wells distributed

throughout Denmark (see Fig. 2). At some of these

wells, more than one well screen is available, so the

samples taken at different depths might be represent-

ing different aquifers too. Based on the screen lengths

which vary substantially—from less than 1 m length

to 110 m—can be concluded that the well design and

the screen types differ too. Close to 2/3 of the samples

are extracted from screens shorter than 2 m.

The I concentration varies from below detection

limit (the lowest detection limit is 0.4 lg/l) to

1,220 lg/l. For 90 % of the samples, I concentration

is below 20 lg/l and only 11 samples have I above

200 lg/l. The median value is 5.4 lg/l, and the

average I concentration in the groundwater is

13.83 lg/l.

It is interesting to check if the East–West trend for

the drinking water I (high–low I concentration,

Environ Geochem Health

123

Page 7: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

respectively) found by Andersen and Lauberg (2009)

can be observed also in the groundwater data.

Even though 100 % of the Danish drinking water is

with groundwater origin, only around 16 % of the

samples in this master data set are extracted from

wells belonging to waterworks (very few of them are

actual production wells). Indeed, even though both

high and low concentration are clustered together, the

average mean I concentration for the administrative

Capital Region is the highest (26.81 lg/l) followed

by Region Zealand (14.06 lg/l), whereas the lowest

average mean is calculated for the region of Central

Denmark (7.6 lg/l). This trend is also present for the

25th, 50th, 75th, 90th, and 95th percentiles (see

Supplementary materials 4). The probability plot for

I concentrations in West Denmark (Jutland peninsula

and the island of Funen) and East Denmark (Zealand

and Bornholm) clearly visualises the East–West

trend (Fig. 2, see also Supplementary materials 4).

However, the I concentrations are far from homoge-

neous throughout the administrative and geographi-

cal regions—both high and low concentrations

cluster together. From a geochemical point of view,

it is interesting to find an explanation for both small-

scale variations as well as for the large-scale trends.

The general trend of I concentration distribution

with depth is that the relatively higher iodine

concentrations start to be observed very sharply at

extraction depths around 40 m (to 80 m) below terrain

(Fig. 3). This difference could be explained either by a

shift in the geological settings (e.g. the limestone/

chalk vs. glacial deposits) or in the groundwater

chemistry (e.g. reflecting different water types) around

that depth.

Most Danish aquifers consist of Quaternary or

Miocene sand and/or gravel, Palaeocene to Late

Cretaceous chalk/limestone (Kelstrup et al. 1982).

Only the dominating geological setting (proportional

to the screen length) is known for about 70 % of the

well screens. There are more than 40 different

geological settings, representing either the aquifer or

an aquitard dominating at the screen length (see

Supplementary materials 5). The highest 75th, 90th,

and 95th percentiles are found for the samples

representing groundwater from Palaeocene to Late

Cretaceous chalk/limestone (Fig. 4 and Supplemen-

tary materials 5). The lowest median (2.8 lg/l) is for

the Palaeogene settings (w/o limestone); the median

value for samples representing glacial meltwater

settings (7 lg/l) is as high as for the samples

representing the Palaeocene/Cretaceous limestone/

chalk (Fig. 4). In Quaternary sediments above marine

calcareous deposits, I concentrations are generally

elevated according to Fuge (2005), which might

Fig. 3 Iodine concentrations in 10/20 m depth intervals of: (a) the top screen and (b) the bottom screen depths below terrain level

Environ Geochem Health

123

Page 8: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

explain the lack of more pronounced difference in

Fig. 4.

If we look at the types of dominating setting at

extraction depths of 30–40 m and compare to the ones

at 40–50 m, it can be seen that there are almost equal

amounts of samples representing glacial meltwater

settings (37 vs. 35 %, respectively). However, the

samples representing a limestone/chalk aquifer are

twice as many at 40–50 m depth (26 %) than at

30–40 m (13 %), which could explain the sharp

difference in the I concentrations below and above

40 m below terrain, Fig. 3 (see also Supplementary

materials 5).

The univariate data analysis allows only a descrip-

tion of the variability of the concentrations with

respect to geographical location, depth, or geology.

However, for pointing at a specific source or process,

responsible for the I variation in Danish groundwater,

a multivariate approach is needed.

Identifying governing sources and processes

by using multivariate data analysis

The PCA model revealed a strong data structure, as

only two principal components (PCs) were needed to

explain 93 % of the total variance (PC1: 87 %, PC2:

6 %). No clear multivariate outliers were identified;

thus, the PCA model is representing all 506 samples of

the r-MDS. Descriptive statistics of the r-MDS (incl.

clr-transformed data) is given in Supplementary mate-

rials 6; summary of the PCA model characteristics are

presented in Supplementary materials 7 (incl. PC1 and

PC2 loadings and descriptive statistics of PC1 and PC2

scores). Information about the relative variability in the

data set can be obtained by analysing the position of the

variables in the loading plot, as explained by Daunis-I-

Estadella et al. (2006) (interpretation of bi-plots).

Buccianti and Pawlowsky-Glahn (2005) also demon-

strated a hydrogeochemical interpretation based on clr-

transformed data. PC1–PC2 loading and score plot are

here used (Fig. 5e) to identify geochemical associations

between I and other major and minor ions and to further

elucidate the potential governing sources or processes.

It is interesting to note that the minor and trace

elements (Li, I, B, Ba, Br, Sr) are all located on the

positive side of the PC1, whereas the major elements

(HCO3, Ca, SO4, Cl, Na, Mg, K, NO3) are on the

opposite side (except Fe and Mn, which are also on the

positive side of PC1). The grouping of the variables on

the loading plot resembles very closely the differ-

ences/similarities in the variability shown on the box

plot of the clr-transformed data (Fig. 6j) and the raw

data (Fig. 7j).

The clr(I) and clr(Li) variables are clearly

positively associated with PC1 (87 % explained

variance) (Fig. 5e). The nearly coincident clr(I) and

clr(Li) are showing that the variance of ln(I/Li) is

close to 0, so I/Li ratio is constant or almost

constant for the data set. This is also supported by

the similarity in the clr variation: for clr(Li) from

-9.24 to -3.07, and for clr(I) from -7.04 to -1.5,

mapped on the PC1–PC2 score plot (Fig. 5a, c).

The raw I and Li concentrations are also mapped on

the PC1–PC2 score plot (note that the scores are

Fig. 4 Box plot of iodine

concentrations grouped

according to the dominating

geological setting at the

screen length (see also

Supplementary materials 5)

Environ Geochem Health

123

Page 9: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

based on clr-transformed data) for comparison and

orientation (Fig. 5b, d); the Li concentrations in

r-MDS vary from 0.0375 to 51.4 lg/l, with a mean:

7.89 lg/l and median 6.1 lg/l, whereas I concen-

trations vary from 0.75 to 700 lg/l (see Supple-

mentary materials 6 for descriptive statistics of

r-MDS). The Li median of our data set is lower

than the median of the European bottled water,

10 lg/l (Reimann and Birke 2010). The clr(I) is

also close to other clustered variables—clr(Ba),

clr(Br), and cl(B). Ba content in groundwater is

mainly caused by dissolution of Ba-holding miner-

als, e.g. barite (BaSO4) or witherite (BaCO3)

(Mokrik et al. 2009); however, for aquifers where

groundwater recharge is mixing with sea water,

Santos et al. (2011) discusses that no single process

can explain Ba distribution. Four processes are

pointed to be of major interest: mixing of sea- and

Fig. 5 Principal

component analysis plots,

based on the reduced master

data set: A, B, C, D ? PC1–

PC2 score plot with mapped:

clr(I) (a); raw I

concentrations (b); clr(Li)

(c); and raw Li

concentrations (d); E ?PC1–PC2 loading plot

Environ Geochem Health

123

Page 10: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

Environ Geochem Health

123

Page 11: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

freshwater, ion exchange-driven sorption/desorption

to/from sediments, marine organic matter minerali-

sation, and Fe and Mn oxide cycling (Santos et al.

2011). The B, Br, and Li have been used for

studying the origin of groundwater salinity in order

to distinguish between marine, geothermal, and

evaporite influences (Sanchez-Martos et al. 2002).

High Li content can be found in thermal waters; it

is used also for tracing the mixing of brines

produced by sea water evaporation (see overview

by Sanchez-Martos et al. (2002)). The grouping of

the clr variables I, Li, B, Ba, and Br and their

similar variability in the data set suggest for

influence of similar processes or sources.

At the negative side of PC1, clr(HCO3), and

clr(Ca) are the most influential variables, followed by

the cluster—clr(Cl), clr(SO4), clr(Na), clr (Mg). The

clr-transformed values mapped on the PC1–PC2

score plot (Fig. 6a–f) are visualising the similarity

between the variables from these two groups (for

orientation, the raw concentrations are mapped and

also shown on Fig. 7a–f). The smallest distance

between two variables on the loading plot is the one

between clr(Cl) and clr(SO4), similarly to I/Li ratio,

this means that the Cl/SO4 ratio is also constant, or

almost constant, for this data set. The clustering of

the four variables (Cl, SO4, Na, and Mg) can be

explained by their primarily marine origin. The Ca–

HCO3 pair is pointing at the carbonate system and its

importance. This is also supported by the Ca–HCO3

water type prevalence in our data set (see Piper

Diagram in Supplementary Materials 8). Not sur-

prisingly, the samples characterised by low pH

(Fig. 6i) are also plotting in the upper part of PC1

where the low Ca–HCO3 are found.

PC2 (explaining only 6 % of the variance) is

influenced strongly by only two of the clr vari-

ables—NO3 and Fe. The clr(NO3) is on the positive

side of PC2, whereas clr(Fe) is on the negative side,

followed by clr(Mn). The position of the clr(NO3)

and clr(Fe) on the loading plot results in almost

clear separation of the samples in two groups on the

PC1–PC2 score plot which is clearly visualised by

the trends shown on Fig. 6g, h). The Fe-NO3

grouping of the samples can be interpreted as

representing the different redox state of the ground-

water—oxidised versus reduced (Hansen et al.

2009). On Fig. 5b, it can be seen that the higher I

concentrations (and clr(I)) mainly lie in the lower

part of the PC1–PC2 score plot, where the influence

of clr(Fe) is higher, implying that the higher I

concentrations are found in reduced groundwaters

then in oxidised.

The link between clr(NO3) and clr(Fe) is almost

perpendicular to the link between clr(I) and

clr(HCO3), which means that there is near zero

correlation between the two pairs and they can

consequently be considered to be independent.

Based on historical data from the Jupiter database,

it is possible to derive a first overview of Danish

groundwater with respect to I and its affiliation to other

chemical compounds and parameters by multivariate

analysis. It is emphasised that the quality of the input

data (discussed in Supplementary materials 1, 2, and

3) is a crucial success factor for the data analysis and

for the reliability of the possible geochemical inter-

pretations. While gathering historical hydro-geochem-

ical data from the public Danish database, Jupiter, is

relatively straightforward, the individual co-joint data

possibilities are of highly variable individual validity,

and far from all of a sufficient quality for strong

multivariate analysis. Comprehensive scrutiny of the

Jupiter database resulted in substantial reductions of

the data set when questioned from multivariate data

quality point of view.

Additionally to the above issues, there were the

challenges associated with ‘proper’ clr data transfor-

mation: (1) proper treatment of missing and censored

values; (2) weight % recalculation; (3) data modelling

of mixed compositional ? not compositional data. All

these issues were handled in appropriate fashions (see

Materials and Methods) in our analysis; however,

some uncertainties might be introduced because of the

made assumptions. As the scope of our study is to

present a first overview on existing historical data on I

rather than developing and testing alternative data

analytical approaches, we have restricted ourselves to

as simple analysis as possible, in which the limitations

and the assumptions we have made along the process

should be kept in mind.

Fig. 6 Principal component analysis plots, based on the

reduced master data set: A, B, C, D, E, F, G, H, I ? PC1–

PC2 score plots with mapped: clr(HCO3) (a); clr (Ca) (b);

clr(SO4) (c); clr(Na) (d); clr(Mg) (e); clr(Cl) (f); clr(NO3) (g);

clr(Fe) (h); pH (i); box plot of the clr-transformed variables with

whiskers ±1.5xIQR (j)

b

Environ Geochem Health

123

Page 12: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

Environ Geochem Health

123

Page 13: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

Conclusions

The main two study objectives have been fulfilled as

much as the data and the chosen methodology allowed

for. The overview on the existing I groundwater with

focus on the spatial variability, geology, and depth of

extraction was made. The spatial variation of I

concentrations in Danish groundwaters is character-

ised both by small-scale variation (both high and low I

samples clustered together) and by a large-scale East

versus West trend with higher I concentrations

observed in the east part of Denmark. The highest I

concentrations are found in samples from aquifers

dominated by Palaeocene to Cretaceous limestone/

chalk deposits and clear shift in the I concentration is

observed between 30–40 m and 40–50 m extraction

depth reflecting the geology in the setting dominated

by limestone/chalk deposits. From the multivariate

analysis, associations between I and other constituents

were identified, and an attempt to identify governing

sources and processes was made. Iodine, Li, B, Ba, Br

are exhibiting similar variability (clustered on the

PC1–PC2 loading plot) which suggests common

source or governing process. Saline water influence

is implied; however, further studies are needed in

order to point the specific process/source, which most

probably varies in different areas. The redox state of

the water was found to be an important characteristic

of the r-MDS, and most of the high I concentrations (as

well as Li) are found at the reduced waters’ side of the

PC1–PC2 score plot. The groundwater samples in this

data set were found to be dominated by Ca–HCO3

water type, and high I concentrations are associated

also with the high pH (alkaline) side of the PC1–PC2

score plot.

Based on the historical data from the Jupiter

database, it was possible to get for first time an

overview on the I concentrations in Danish ground-

water. This work should be seen as a pilot study—an

investigation to clear the way for future geochemical

studies on I variability (both in concentration and

speciation) resulting in delineation of geographical

(and geological) areas influenced by different factors

responsible for the observed variations.

Acknowledgments This paper is part of the Ph. D. study of the

first author; the Ph. D. project was funded by GEOCENTER

Denmark. We gratefully acknowledge the financial support by

the Geological Survey of Denmark and Greenland (GEUS) and

Aarhus University.

References

Andersen, S., & Lauberg, P. (2009). The Nature of Iodine in

Drinking Water. In V. R. Preedy, G. N. Burrow, & R.

Watson (Eds.), Comprehensive handbook of iodine nutri-

tional, biochemical, pathological and therapeutic aspects

(pp. 125–134). London: Academic Press.

Andersen, S., Petersen, S. B., & Laurberg, P. (2002). Iodine in

drinking water in Denmark is bound in humic substances.

European Journal of Endocrinology, 147(5), 663–670.

doi:10.1530/eje.0.1470663.

Buccianti, A., & Pawlowsky-Glahn, V. (2005). New perspec-

tives on water chemistry and compositional data analysis.

Mathematical Geology, 37(7), 703–727. doi:10.1007/

s11004-005-7376-6.

CAMO. (2011). The unscrambler (10.1 (64-bit) ed), CAMO

Software AC.

Comas-Cufı, M., & Thio-Henestrosa, S. (2011). CoDaPack 2.0:

A stand-alone, multi-platform compositional software. In

J. J. Egozcue, R. Tolosana-Delgado, & M. I. Ortego (Eds.),

CoDaWork’11: 4th international workshop on composi-

tional data analysis (2.01.13 ed). Sant Feliu de Guıxols.

Daunis-I-Estadella, J., Barcelo-Vidal, C., & Buccianti, A.

(2006). Exploratory compositional data analysis (vol. 264,

pp. 161–174).

Esbensen, K. H. (2010). Multivariate data analysis—In practice

(5th ed), CAMO Softwares AS.

ESRI. (1999–2010). ArcMap (10.0 ed).

Fuge, R. (2005). Soils and Iodine Deficiency. In O. Selinus, B.

J. Alloway, J. A. Centeno, R. B. Finkelman, R. Fuge, U.

Lindh, et al. (Eds.), Essentials of medical geology: Impacts

of the natural environment on public health (pp. 417–433).

Amsterdam: Elsevier.

Gilfedder, B. S., Petri, M., & Biester, H. (2009). Iodine speci-

ation and cycling in fresh waters: A case study from a

humic rich headwater lake (Mummelsee). Journal of

Limnology, 68(2), 396–408.

Hakansson, E., & Pedersen, S. S. (1992). Geologisk Kort over

den Danske Undergrund VARV.

Hansen, B., Mossin, L., Ramsay, L., Thorling, L., Ernstsen, V.,

Jørgensen, J., et al. (2009). Kemisk grundvandskortlæg-

ning. In GEUS (Ed.), (pp. 1–112). Øster Voldgade 10, DK-

1350 København K: GEUS.

Hansen, B., Thorling, L., Dalgaard, T., & Erlandsen, M. (2011).

Trend reversal of nitrate in Danish groundwater—a

reflection of agricultural practices and nitrogen surpluses

since 1950. Environmental Science and Technology, 45(1),

228–234.

Fig. 7 Principal component analysis plots, based on the

reduced master data set: A, B, C, D, E, F, G, H, I ? PC1–

PC2 score plots with mapped raw concentrations: HCO3 (a);

Ca(b); SO4 (c); Na(d); Mg (e); Cl(f); NO3 (g); Fe(h);

conductivity (i); box plot of the raw concentrations (j) (to be

looked at together with Fig. 6)

b

Environ Geochem Health

123

Page 14: Iodine concentrations in Danish groundwater: historical data assessment 1933–2011

Hou, X. (2009). Iodine Speciation in Foodstuffs, Tissues, and

Environmental Samples: Iodine species and Analytical

Method. In V. R. Preedy, G. N. Burrow, & R. Watson

(Eds.), Comprehensive handbook of iodine nutritional,

biochemical, pathological and therapeutic aspects (pp.

139–150). London: Academic Press.

Hou, X., Aldahan, A., Nielsen, S. P., & Possnert, G. (2009).

Time series of 129I and 127I speciation in precipitation

from Denmark. Environmental Science and Technology,

43(17), 6522–6528.

Hu, Q., Moran, J. E., & Blackwood, V. (2009). Geochemical

Cycling of Iodine Species in Soils. In V. R. Preedy, G.

N. Burrow, & R. Watson (Eds.), Comprehensive handbook

of iodine nutritional, biochemical, pathological and ther-

apeutic aspects (pp. 93–105). London: Academic Press.

Ito, K., & Hirokawa, T. (2009). Iodine and Iodine species in

Seawater: Speciation, Distribution, and Dynamics. In V.

R. Preedy, G. N. Burrow, & R. Watson (Eds.), Compre-

hensive handbook of iodine nutritional, biochemical,

pathological and therapeutic aspects (pp. 83–91). London:

Academic Press.

JUPITER—Danmarks geologiske & hydrologiske database

(2011). http://www.geus.dk/jupiter/index-dk.htm.

Kelstrup, N., Bækgaard, A., & Andersen, L. J. (1982).

Grundvandsressourcer i Danmark (f. o. n. s. General-

direktoratet for miljø, Trans.). Hannover: Kommissionen

for de Europæiske Fællesskaber.

Laurberg, P., Jørgensen, T., Perrild, H., Ovesen, L., Knudsen,

N., Pedersen, I. B., et al. (2006). The Danish investigation

on iodine intake and thyroid disease, DanThyr: Status and

perspectives. European Journal of Endocrinology, 155(2),

219–228.

Mokrik, R., Karro, E., Savitskaja, L., & Drevaliene, G. (2009).

The origin of barium in the Cambrian-Vendian aquifer

system, North Estonia. Baariumi paritolu Kambriumi-

Vendi veekompleksis Pohja-Eestis, 58(3), 193–208.

Muramatsu, Y., Fehn, U., & Yoshida, S. (2001). Recycling of

iodine in fore-arc areas: Evidence from the iodine brines in

Chiba, Japan. Earth and Planetary Science Letters, 192(4),

583–593.

Muramatsu, Y., & Wedepohl, H. K. (1998). The distribution of

iodine in the earth’s crust. Chemical Geology, 147(3–4),

201–216. doi:10.1016/s0009-2541(98)00013-8.

Pawlowsky-Glahn, V., & Egozcue, J. J. (2006). Compositional

data and their analysis: an introduction. Geological Soci-

ety, London, Special Publications, 264(1), 1–10. doi:10.

1144/gsl.sp.2006.264.01.01.

Pedersen, A. N., Fagt, S., Groth, M. V., Christensen, T., Biltoft-

Jensen, A., Matthiessen, J., et al. (2010). Danskernes kos-

tvaner 2003–2008 (D. Fødevareinstituttet, Trans.). (pp.

1–200): DTU Fødevareinstituttet.

Pedersen, K. M., Laurberg, P., Nøhr, S., Jørgensen, A., &

Andersen, S. (1999). Iodine in drinking water varies by

more than 100-fold in Denmark. Importance for iodine

content of infant formulas. European Journal of Endocri-

nology, 140(5), 400–403.

Rasmussen, L. B., Larsen, E. H., & Ovesen, L. (2000). Iodine

content in drinking water and other beverages in Denmark.

European Journal of Clinical Nutrition, 54(1), 57–60.

Rasmussen, L. B., Ovesen, L., Bulow, I., Jørgensen, T., Knud-

sen, N., Laurberg, P., et al. (2002). Dietary iodine intake

and urinary iodine excretion in a Danish population: effect

of geography, supplements and food choice. The British

Journal of Nutrition, 87(1), 61–69. doi:10.1079/

bjn2001474.

Reimann, C., & Birke, M. (2010). Geochemistry of european

bottled water. Stuttgart, Germany: Borntraeger Science

Publishers.

Sanchez-Martos, F., Pulido-Bosch, A., Molina-Sanchez, L., &

Vallejos-Izquierdo, A. (2002). Identification of the origin

of salinization in groundwater using minor ions (Lower

Andarax, Southeast Spain). Science of the Total Environ-

ment, 297(1–3), 43–58.

Santos, I. R., Burnett, W. C., Misra, S., Suryaputra, I. G. N. A.,

Chanton, J. P., Dittmar, T., et al. (2011). Uranium and

barium cycling in a salt wedge subterranean estuary: The

influence of tidal pumping. Chemical Geology, 287(1–2),

114–123.

Saxholt, E., Christensen, A. T., Møller, A., Hartkopp, H. B.,

Hess Ygil, K., Hels, O. H. (2008). Danish Food Compo-

sition Databank. In ed. 7.01, Department of Nutrition,

National Food Institute, Technical University of Denmark.

Truesdale, V. W., Danielssen, D. S., & Waite, T. J. (2003).

Summer and winter distributions of dissolved iodine in the

Skagerrak. Estuarine, Coastal and Shelf Science, 57(4),

701–713.

Whitehead, D. C. (1984). The distribution and transformations

of iodine in the environment. Environment International,

10(4), 321–339. doi:10.1016/0160-4120(84)90139-9.

WHO. (2007). Iodine deficiency in Europe: A continuing public

health problem. In M. Andersson, B. de Benoist, I. Darn-

ton-Hill, & F. Delange (Eds.), (pp. 1–86). France: World

Health Organization, UNICEF.

Environ Geochem Health

123


Recommended