+ All Categories
Home > Documents > 1.1 Introduction - Shodhganga : a reservoir of Indian...

1.1 Introduction - Shodhganga : a reservoir of Indian...

Date post: 07-Sep-2018
Category:
Upload: ngodan
View: 217 times
Download: 0 times
Share this document with a friend
46
Chapter 1: Introduction 2 1.1 Introduction Environment has lost its pristine characteristic since the beginning of industrialization on land as well as water bodies as a consequence of selfish unending appetite of Homo sapiens for so called comfort and luxury. In Gujarat, many industrial estates are situated within the „Golden Corridor‟ (a highly industrialized zone from Vapi to Mehsana). Industrial units manufacturing dyes, paints and pigments, pharmaceuticals, textiles, etc release liquid wastes containing dyes, xenobiotic compounds and many other unnatural products in the environment and thus are the major cause of ground and surface water pollution in these areas (Fig. 1.1). This has resulted in serious health problems in workers and slums surrounding these industrial estates. This 20 th century industrialization has now compelled us to think about developing environmental remediation strategies on the priority basis to save the basic essential components of life (Shah et al., 2011). Fig. 1.1: Photograph depicting the industrial discharges, containing dyes, xenobiotic compounds and many other unnatural products, released in the Khari-cut canal flowing through the GIDC, Vatva, Ahmedabad, Gujarat, India. Canal bank samples used in the present study have been collected from this site.
Transcript

Chapter 1: Introduction

2

1.1 Introduction

Environment has lost its pristine characteristic since the beginning of industrialization

on land as well as water bodies as a consequence of selfish unending appetite of

Homo sapiens for so called comfort and luxury. In Gujarat, many industrial estates are

situated within the „Golden Corridor‟ (a highly industrialized zone from Vapi to

Mehsana). Industrial units manufacturing dyes, paints and pigments, pharmaceuticals,

textiles, etc release liquid wastes containing dyes, xenobiotic compounds and many

other unnatural products in the environment and thus are the major cause of ground

and surface water pollution in these areas (Fig. 1.1). This has resulted in serious

health problems in workers and slums surrounding these industrial estates. This 20th

century industrialization has now compelled us to think about developing

environmental remediation strategies on the priority basis to save the basic essential

components of life (Shah et al., 2011).

Fig. 1.1: Photograph depicting the industrial discharges, containing dyes, xenobiotic

compounds and many other unnatural products, released in the Khari-cut canal

flowing through the GIDC, Vatva, Ahmedabad, Gujarat, India. Canal bank samples

used in the present study have been collected from this site.

Chapter 1: Introduction

3

The initially developed physical and chemical environmental clean-up technologies

are expensive, energy consuming and most importantly are not completely

environment-friendly. Consequently, the focus shifted on developing self-sustainable

and eco-friendly microbial bioremediation technologies. Implementation of

efficacious bioremediation strategies relies heavily on indigenous microbial

community dynamics - structure and function. Microorganisms, the only living beings

to be omnipresent in environment, are the fastest evolving and adapting (Houghton

and Shanley, 1994), and thus most suitable for coping with all the changes in

environment. Consequently, microbial communities, fundamental components of

ecosystems, can play a critical role in the metabolism and detoxification of

anthropogenic/xenobiotic compounds. Microorganisms have sets of catabolic genes,

capable of processing various metabolic pathways, which are integrated in such a

manner that xenobiotic compounds are converted to intermediates which can enter

central metabolism such as kreb‟s cycle, glycolysis and others.

Bioremediation has been successfully applied for oil spill removal, hydrocarbon

degradation, wastewater treatments, metal removal, xenobiotic/recalcitrant compound

degradation and many other such contaminant removal applications (Ham and

Bonner, 1997). Prior to designing bioremediation strategies, it is necessary to inspect

and assess the contaminated site. However, in reality much of the information on

bioremediation strategies is not available and whatever is available is not assembled

and linked with other information. Conversely, it is indeed difficult to compile all

(successful and failed) strategies and compare them, as many factors like climate, soil

characteristics, geological aspects, water levels, aerobic/anaerobic, waste and disposal

facilities and such other parameters play an important role in success or failure of

bioremediation treatments. The variation in environmental factors from location to

location and even from one niche to another niche within a location has lead to a huge

diversity in microorganisms and their capabilities. Yang et al. (2011) studied effects

of soil organic matter on the microbial polycyclic aromatic hydrocarbons (PAHs)

degradation potentials. The microbial activities in humic acid were much higher than

those in humin, which sequesters organic pollutants stronger. The results suggested

that the nutrition support and sequestration were the two major mechanisms that

Chapter 1: Introduction

4

influenced the development of microbial PAHs degradation potentials (Yang et al.,

2011). Nutrient requirement, add another dimension in bioremediation, in a sense that

many microorganisms are able to use xenobiotic compounds as sole carbon and/or

nitrogen source. However, many times they cannot survive solely on xenobiotic

compound and require additional carbon and/or nitrogen source for growth and then

by co-metabolism they can transform or degrade the pollutants. Moreover, an

industrially contaminated area is polluted with not just one particular but variety of

contaminants. Consequently, microorganisms should be able to degrade in presence of

all co-contaminants. Ben Said et al. (2008) characterized PAH degrading bacteria

from sediments of the Bizerte lagoon in presence of multiple contaminants such as

organic pollutants, heavy metals and antibiotics. Moreover, a success or a failure of a

strategy in one case/application does not imply that it will be the same way in other

cases/applications. Hence, the utmost priority is to analyse the capabilities of

indigenous microbial populations and based on that information necessary

modifications may be made to fasten the bioremediation process.

Microbial bioremediation strategies can be applied either ex situ or in situ in order to

restore a contaminated environment. Ex situ treatment involves an off-site separate

treatment facility usually a bioreactor or an effluent treatment plant for degradation of

contaminants. In situ treatment involves on-site bioremediation of the contaminants

by monitoring the efficiencies of natural attenuation (progressive removal of

contaminant) and then facilitating the degradation by intentional biostimulation of

indigenous microbial communities (by providing nutrients and electron

acceptors/donors) or bioaugmentation (by adding microbes and nutrients) (Desai et

al., 2010).

Any one particular microorganism is incapable of processing all the metabolic

reactions to degrade environmental pollutants, however a group of diverse organisms

form a community and collectively process all the essential metabolic reactions for

bioremediation. Moreover, more than 99% of the microbes that exist in the

environment cannot be cultivated easily (Schloss and Handelsman, 2005). Thus, most

of the microbes in the environment have not been described and accessed for

biotechnology or basic research. Abundance and diversity of unculturable bacteria in

Chapter 1: Introduction

5

almost all environmental niches have led to the understanding that the so-called

„unculturable‟ bacteria actually multiply in their natural environment and if suitable

culture conditions were provided it should be possible to cultivate them in the

laboratory. In the earliest cultivation attempts, media with very low nutrient are used

considering the high nutrient contents of common laboratory media as compared to

those present in the natural environments (Sharma et al., 2005).

One way around this problem is metagenomics, which aims to access the genomic

potential of an environmental sample either directly or after enrichment for specific

purpose. Soil/Marine metagenome can be enriched in various ways keeping in mind

specific gene targets to be screened such as encoding promising biocatalysts, novel

antibiotics and other such targets that have potential applications in bioremediation,

industry, medicine or agriculture. Bacteria capable of xenobiotic degradation are

widely distributed in the environment. These bacteria have evolved to utilize a variety

of compounds that are present in the environment. Hence environmental samples are

considered to be a reservoir of useful enzymes for bioremediation as well as industrial

catalysis.

1.2 Azo dyes and azoreductases

Dyestuff effluents are one of the major pollutants that are released into the

environment. Even very low concentrations of dyes (less than 1 mg/l) can be highly

visible in solutions. Synthetic dyes are very soluble in water and are recalcitrant to

microbial degradation because they contain substituents such as azo, nitro or sulfo

groups. They are frequently found in a chemically unchanged form even after waste-

water treatment and hence they are regarded as pollutants (Pagga and Brown, 1986;

Shaul et al., 1991). Azo dyes are the largest class of dyes and are widely used in the

textile, leather, food, cosmetic and dyestuff manufacturing industries (Anliker, 1979;

Reisch, 1996; Blumel et al., 2002) because of their chemical stability, ease of

synthesis and versatility (Nakanishi et al., 2001). In year 2000, more than 7×105 tons

of these dyes were produced worldwide (Suzuki et al., 2001). Azo dyes are

characterized by the presence of one or more azo groups (-N=N-) and they have

become a great concern in effluent treatment due to their colour, bio recalcitrance and

potential toxicity to animals and human.

Chapter 1: Introduction

6

Azoreductases of microorganisms are favorable for the development of

biodegradation systems for such azo dyes, because these enzymes catalyze reductive

cleavage of azo groups (-N=N-) under mild conditions. In addition, bacterial enzymes

can be readily overproduced (Nakanishi et al., 2001). Identification and

overproduction of azoreductase constitute a straightforward approach for the

development of biodegradation systems. The most generally accepted hypothesis for

this phenomenon is that many bacterial strains possess cytoplasmic enzymes, which

act as „azoreductases‟ and transfer electrons via soluble flavins to azo dyes. The

individual strains may attack the dye molecule at different positions or may use

decomposition products produced by another strain for further decomposition. Hence,

it becomes inevitable to identify and characterize the complete dye decolorizing

genetic machinery.

Azo bonds in azo dyes are reduced, under anaerobic conditions, leading to formation

of corresponding amines. Intermediate aromatic amines are further mineralized under

aerobic conditions (Nakayama et al., 1983; Blumel at al., 2002; Chen et al., 2005).

Till now mainly combined anaerobic-aerobic microbial treatments of dye wastes have

been used, due to limited knowledge about microorganisms having oxygen tolerant

enzymes that are involved in decolourization of azo dyes. Aerobic treatment of

dyestuff containing wastewaters possess significant potential, however, the aerobic

metabolism of azo dyes requires specific enzymes (aerobic azoreductases), which

catalyze the NAD(P)H-dependent reduction of azo compounds to the corresponding

amines (Blumel et al., 2002; Chen et al., 2005). Very few aerobic bacteria which can

grow on/with azo compounds have been reported such as azoreductase from Bacillus

sp. OY1-2 (Suzuki et al., 2001) and Xenophilus azovorans (Blumel et al., 2002).

Moreover, Flavobacterium can aerobically degrade 4,4‟-dicarboxyazobenzene ring

(Overney, 1979; Blumel et al., 2002) and Sphingomonas sp. azoreductases can cleave

several sulphonated naphthol and benzol rings (Stolz, 1999).

Chapter 1: Introduction

7

1.3 Omic studies in bioremediation

A multilevel molecular biology approach, combining molecular- (including DNA,

mRNA, protein and metabolites), cellular-, individual-, and community- level data

represents a powerful new multi disciplinary approach to decipher complex biological

systems. „Omics‟ platform that have been used in microbial systems biology include

genomics - which determines the sequence of either whole genome or group of

genes; transcriptomics - which measures mRNA transcript levels; proteomics - which

quantifies protein abundance; metabolomics - which determines abundance of small

cellular metabolites; interactomics - which resolves the whole set of molecular

interactions in cells; and fluxomics - which establishes dynamic changes of molecules

within a cell over a time. However no single „omics‟ analysis can fully unravel the

complexities of fundamental microbial ecology. Therefore, integration of multiple

layers of information, the multi„omics‟ approach, is required to acquire a precise

picture of living microorganisms and their metabolisms (Zhang et al., 2010).

1.3.1 Genomics and Metagenomics

Genomics is the analysis of genome of an organism and metagenomics is to access the

total genomic potential of an environmental sample either directly or after enrichment

of indigenous microbial communities. The term „metagenomics‟ (also known as

community genomics, ecogenomics, or environmental genomics) has had the greatest

impact within the last few years as it has enabled to explore the uncultivable

microorganisms and understand their role in various metabolic pathways. Fig. 1.2

shows the flow-chart describing the basic metagenomic approach. Of all the „omic‟

studies, genomics and metagenomics have had the greatest share of focus in the last

two decades, mainly because of rapid advances in several molecular technologies.

These technologies have completely revolutionized the approaches of studies and

have provided enormous information about the microbial genome(s).

Chapter 1: Introduction

8

Fig. 1.2: The flow chart depicting basic metagenomic approach (Adapted from Simon

and Daniel, 2009; Meiring et al., 2011).

Over the last few years, the scientific literature has revealed the progressive

emergence of genomic high-throughput technologies in environmental microbiology

and biotechnology and described their possible or already demonstrated applications

to assess biotreatment of contaminated environments (Stenuit et al., 2008).

Cultivation-independent analyses of the microbial community structures at

contaminated sites have augmented to our understanding of the community dynamics,

relative abundance and distribution of microorganisms actively involved in

bioremediation (Desai et al., 2010). The genes responsible for biodegradation

pathways are usually arranged in clusters that comprise: (i) catabolic genes encoding

the enzymatic steps of the catabolic pathway; (ii) transport genes responsible for

active uptake of the compound; and (iii) regulatory genes that adjust expression of the

catabolic and transport genes to the presence of the compound to be degraded (Diaz

and Prieto, 2000; Diaz, 2004).

However, two technologies - amplification (PCR) and sequencing need a special

mention as they have conferred the major thrust to genomics and metagenomics. As a

corollary of revolutionary developments in high-throughput DNA sequencing

technologies, more than thousand microbial genomes from almost all known major

Chapter 1: Introduction

9

phylogenetic lineages have been fully sequenced, many are nearing completion and

many more are regularly initiated all over the world. The computational-based

annotation and comparative genomic analyses of DNA sequences have provided

biologists with information regarding gene function, genome structures, biological

pathways, metabolic and regulatory networks, and evolution of microbial genomes,

which has greatly enhanced our understanding of microbial metabolism (Schoolnik,

2001; Ward and Fraser, 2005; Sharan and Ideker, 2006; Cardenas and Tiedje, 2008;

Rocha, 2008; Zhang et al., 2010). Maphosa et al. (2010a) in their review have

discussed „Ecogenomics‟ - the application of genomics to ecological and

environmental sciences for defining phylogenetic and functional biodiversity at the

DNA, RNA and protein levels. They have described the potential of ecogenomics

approaches in developing high-throughput methods for detecting and monitoring

organohalide respirers, and for providing improvements to selection, specificity and

sensitivity of target biomarkers and their application to evaluate bioremediation

strategies. This knowledge helps to elucidate functions and interactions of organisms

at the ecosystem level in relation to ecological and evolutionary processes.

A significant achievement in the field of microbial ecology was the finding of highly

conserved as well as variable gene sequences that are present in all microorganisms,

most notable of them is the 16S rRNA gene, which has been considered as a „gold

standard‟ for characterizing phylogenetic affiliations of microorganisms that comprise

microbial communities (Lane et al., 1985; Amann et al., 1995; Lovley, 2003). The

pioneering work of Carl Woese using 16S rRNA gene as „evolutionary chronometers‟

in deducing the bacterial phylogeny changed the way we visualize the microbial

world (Larsen et al., 1993). 16S rDNA is the most commonly used phylogenetic

anchor because of its unique base pair composition, with a mosaic of highly variable

and conserved regions, convenient size (smaller than 23S rRNA and larger than 5S

rRNA), optimum size to cover polymorphisms, no lateral gene transfer and large

sequence databases. Moreover, the amplification of a taxonomically diverse collection

of 16S rRNA genes is possible with a small number of primers (Weisburg et al.,

1991). The 16S rRNA gene sequence displays an alternating pattern of conserved and

hypervariable regions reflecting the functional importance of the conserved regions in

Chapter 1: Introduction

10

the gene product‟s secondary and tertiary structure. All 16S rRNA genes share nine

hypervariable regions (Neefs et al., 1990), the locations of which are summarized in

Fig. 1.3. Therefore, by analysing 16S rRNA sequences in contaminated environments

it is possible to determine definitively the phylogenetic placement of the

microorganisms that are associated with bioremediation processes (Watanabe and

Baker, 2000; Lovley, 2003; Rogers and McClure, 2003).

Fig. 1.3: Representation of hypervariable regions within the 16S rRNA gene. The

plotted blue line reflects fluctuations in variability amongst aligned 16S rRNA gene

sequences; peaks reflect greater conservation, while troughs correspond to the known

hypervariable regions V1 to V9, also indicated by the red bars.

Nucleic acid based molecular techniques such as denaturing gradient gel

electrophoresis (DGGE), temperature gradient gel electrophoresis (TGGE), terminal

restriction fragment length polymorphism (T-RFLP), amplified ribosomal DNA

restriction analysis (ARDRA), random amplified polymorphic DNA (RAPD),

amplified fragment length polymorphism (AFLP), automated rRNA intergenic spacer

analysis (ARISA) and single strand conformation polymorphism (SSCP) are used in

conjunction with fluorescence in situ hybridization, PCR technologies, sequencing

techniques and microarrays to monitor the changes in bacterial community and

provide insights on up/down regulations of genes, microenvironment of the sites and

impact of nutrients on bioremediation. These facts in turn assist in process

optimization, validation and understanding the impact of designed bioremediation

strategies on the ecosystem. Fluorescence in situ hybridization in conjunction with

Chapter 1: Introduction

11

microautoradiography establishes both phylogenetic as well as functional links of the

species involved in the process of bioremediation (Rogers et al., 2007). DNA stable

isotope probing (DNA-SIP) is another technique used in conjunction with

metagenomics to establish links between microbial identity and particular metabolic

functions. The combination of DNA-SIP and metagenomics not only permits the

detection of rare low-abundance species but also facilitates the detection of novel

enzymes and bioactive compounds (Chen and Murrell, 2010). Ni Chadhain et al.

(2006) used a combination of techniques to study degradation of polycyclic aromatic

hydrocarbons (PAHs) by bacteria. They tracked dioxygenase gene population shifts in

soil enrichment cultures following exposure to naphthalene, phenanthrene or pyrene.

Molecular monitoring of the enrichment cultures before and after PAH degradation

using denaturing gradient gel electrophoresis and 16S rRNA gene libraries provided

information about specific phylotypes of bacteria that were associated with the

degradation of each PAH (Ni Chadhain et al., 2006). Mateos et al. (2006) have strains

of Corynebacterium glutamicum that are resistant to arsenite (up to 60 mM) and using

genetic manipulation tools they are attempting to obtain C. glutamicum mutant strains

which are able to remove arsenic from contaminated water.

Recent advances in molecular techniques, including high-throughput approaches such

as microarrays and metagenomics, have opened up new perspectives and pointed

towards new opportunities in pollution abatement and environmental management.

The current potential of microarrays and metagenomics is capable to investigate the

genetic diversity of environmentally relevant microorganisms and identify new

functional genes involved in the catabolism of xenobiotics (Eyers et al., 2004). The

development of microbial ecological DNA microarrays has enabled researchers to

simultaneously analyze thousands of phylogenetic links and functional genes in order

to characterize microbial communities involved in bioremediation (He et al., 2007).

Microarrays, developed on the basis of sequence analysis, have been used to monitor

genes involved in 2,4-dichlorophenoxyacetic acid (2,4-D) degradation in

environmental samples (Dennis et al., 2003). Loy et al. (2005) used a 16S rRNA

gene-targeted oligonucleotide microarray (RHC-PhyloChip) for cultivation-

independent diversity analysis of betaproteobacterial order „Rhodocyclales‟ in

Chapter 1: Introduction

12

activated sludge from an industrial wastewater treatment plant. Functional gene arrays

(FGA) are constructed using PCR amplified products or oligonucleotides derived

from functional genes and are used to identify genes encoding key enzymes involved

in various ecological and environmental processes such as carbon fixation,

nitrification, denitrification, sulphate reduction and contaminant degradation (Rhee et

al., 2004). He et al. (2007) developed GeoChip, a comprehensive microarray

containing 24,243 oligonucleotide (50 mer) probes and covering more than 10,000

genes in more than 150 functional groups and in 2010 they (He et al., 2010) have

developed a new generation gene array, GeoChip 3.0 with around 28,000 probes

covering approximately 57,000 gene variants from 292 functional gene families

involved in nitrogen, carbon, sulphur and phosphorous cycling, metal reduction and

resistance, and organic contaminant degradation. GeoChip 3.0 is a high-throughput

powerful tool for tracking the dynamics of microbial community, functional structure

and linking microbial communities to ecosystem processes and functioning in in situ

bioremediation study. Consequently these techniques can be used in bioremediation

studies to identify microbial populations in ecosystems, assess shifts in microbial

populations due to xenobiotic compounds, measure changes in expression levels

(gene expression profiling), co-relate and compare genomes, identify genes encoding

enzymes, detect mutations and single nucleotide polymorphisms (SNPs).

In detailed description of two metagenomic approaches: Metagenomic libraries and

Sequencing technologies, has been given in the following sub sections. However, the

first and foremost step in metagenomic analyses consists of isolating high molecular

weight DNA from environmental samples in an unbiased manner. Many reports have

highlighted the challenges in obtaining high quality (free from interference of co-

contaminants like humic acids in case of soil) DNA from polluted sites (Fortin et al.,

2004, Desai and Madamwar, 2007). Methods for nucleic acid extraction from soil,

however, suffer from compounded inefficiencies including incomplete cell lysis,

DNA sorption to soil surfaces, co-extraction of enzymatic inhibitors from soil, and

loss, degradation, or damage to DNA (Miller et al., 1999). In the initial efforts to

extract DNA from sediments and soils, scientists used either cell extraction (recovery

of cells from the soil matrix prior to cell lysis) or direct lysis within the soil matrix

Chapter 1: Introduction

13

(Holben et al., 1988; Ogram et al., 1987; Steffan et al., 1988). Direct lysis techniques

have been used more because they yield more DNA and presumably a less biased

sample of the microbial community diversity than cell extraction techniques (Holben

et al., 1988; Leff et al., 1995; Steffan et al., 1988). A large number of methods have

been published for the extraction of total microbial community DNA from soils and

sediments (Picard et al., 1992; Holben, 1994; Zhou et al., 1996; Burgmann et al, 2001;

Kauffmann et al., 2004; Bertrand et al., 2005; Desai and Madamwar, 2007).

1.3.1.1 Metagenomic libraries

Metagenomic libraries are prepared by cloning of DNA fragments isolated from an

environmental sample in a suitable vector [e.g. plasmid, phage, fosmid, cosmid, or

bacterial artificial chromosome (BAC)], which is then transformed into a suitable host

strain. Two approaches, the function-driven analysis and the sequence-driven

analysis, have emerged to extract biological information from metagenomic libraries.

The function driven analysis is based on identification of clones that express the

desired trait, followed by characterization of the active clones by sequence and/or

biochemical analysis. The limitations of this approach are that it requires expression

of the function of interest in the host cell and clustering of all of the genes required for

the function. It also depends on the availability of a high-throughput assay for the

function of interest that can be performed efficiently on vast libraries, because the

frequency of active clone is quite low. Many approaches such as improved systems

for heterologous gene expression with shuttle vectors that facilitate screening of the

metagenomic DNA in diverse host species and modifications of Escherichia coli to

expand the range of gene expression are being developed to alleviate these limitations

(Schloss and Handelsman, 2003). Conversely, the sequence-driven analysis relies on

the use of conserved DNA sequences to design hybridization probes or PCR primers

to screen metagenomic libraries for clones that contain sequence of interest.

Significant discoveries have also resulted from random sequencing of metagenomic

clones. However, there has been disagreement about the utility of random sequencing

of metagenomic clones as according to some scientists this approach is too undirected

to yield biological understanding, while others point out that there is very little

information about some divisions of bacteria that any genomic sequence is helpful in

Chapter 1: Introduction

14

designing of experiments to discover their characteristics (Schloss and Handelsman,

2003).

Brennerova et al. (2009) revealed the diversity and abundance of meta-cleavage

pathways in microbial communities from soil highly contaminated with jet fuel

(aliphatic and aromatic hydrocarbons) under air-sparging bioremediation using

metagenomics. Moreover, the extradiol dioxygenase diversity was assessed by

functional screening of a fosmid library in Escherichia coli with catechol as substrate.

Kim et al. (2010) prepared a metagenomic library in fosmid vector from a completely

fermented compost and functionally screened and characterized a novel family VIII

alkaline esterase. The functional screening of metagenomic libraries have led to the

discovery of novel genes encoding polyphenol oxidase (Beloqui et al., 2006), ester

and glycosyl hydrolase (Ferrer et al., 2005) and also given indications on the diversity

of extradiol dioxygenases in coke plant wastewater (Suenaga et al., 2007).

1.3.1.2 Sequencing technologies

Sequencing of DNA and/or RNA of humans and many other animals, plants and

microbes have revolutionized the biological research. Sanger developed the chain

termination method in 1975 and two years after that Maxam and Gilbert developed a

method based on chemical modifications. The major revolution started only after the

improvements in chain-termination method. Conversely, automated analysis played

the biggest role as it made the sequencing easier and faster. Capillary electrophoresis

replaced slab gel electrophoresis and became an integral part of automated sequencing

analysis.

Metagenomic studies using first-generation methods included open oceans (Venter et

al., 2004), coastal waters (Culley et al., 2006), coastal stromatolites (Goh et al., 2009),

insect guts (Warnecke et al., 2007) and acid mine biofilms (Tyson et al., 2004). There

are numerous reports on sequencing of complete genome or functionally essential

genes involved in metabolism of biodegradation (Nelson et al., 2002; Kim et al.,

2008; Mattes et al., 2008; Desai et al., 2010). The genome sequence of Pseudomonas

sp. has revealed details about oxygenases, oxidoreductases, ferredoxins and

cytochromes, dehydrogenases, sulfur metabolism proteins and many others. Moreover

Chapter 1: Introduction

15

many operons coding for the metabolism of a large number of aromatic compounds

and gene clusters encoding for enzymes which are predicted to be involved in the

metabolism of non-natural substrates were found (Nelson et al., 2002). Desai et al.

(2009) tracked the influence of long-term chromium pollution on soil microbial

communities by analysing 16S rRNA gene clone libraries and observed community

shifts from Proteobacteria to Firmicutes at chromium polluted sites.

However, the era‟s demand of high throughput sequencing at low costs led to constant

upgradation of technologies. Next generation sequencing has propelled biological

research (Schuster, 2008). Today a number of commercial next-generation DNA

sequencing systems are available, namely: Roche‟s 454 GS FLX Genome Analyzer

(Pyrosequencing) (Margulies et al., 2005), Illumina‟s Solexa sequencer (Bentley at

al., 2008), Applied Biosystem‟s SOLiD system (McKernan et al., 2009), Helicos

HeliScope (Shendure and Ji, 2008), Complete Genomics (Drmanac et al., 2010) and

many more are coming in near future such as Ion-Torrent Semiconductor sequencing

(www.iontorrent.com), Pacific Biosciences SMRT (www.pacificbiosciences.com) and

Oxford Nanopore sequencing (www.nanoporetech.com). Walshaw et al., (2011) have

described next-generation sequencing approaches to metagenomics in detail.

1.3.1.2.1 Pyrosequencing

The most mature technology available is Roche‟s 454 system which first became

available in 2005 and even then was capable of producing fifty times the data at one

sixth of the operating cost (Schuster, 2008; Walshaw et al., 2011). Pyrosequencing,

based on „sequencing by synthesis‟ principle, involves sequencing the single stranded

DNA by synthesizing the complementary strand and detection of pyrophosphate

released on addition of each nucleotide. The templates for pyrosequencing are made

either by solid phase template preparation (streptavidin-coated magnetic beads) or

enzymatic template preparation (apyrase and exonuclease). The templates are

hybridized to a sequencing primer and incubated with the enzymes DNA polymerase,

ATP sulfurlyase, luciferase and apyrase, and with the substrates adenosine 5´

phosphosulfate (APS) and luciferin. Detection of pyrophosphate, liberated during

DNA synthesis, is possible by cascade of enzymatic reactions followed by detection

with photodiode, photomultiplier tubes, or a charge-coupled device (CCD) camera.

Chapter 1: Introduction

16

Currently, a limitation of the method is that the lengths of individual reads of DNA

sequence are in the neighborhood of 300-500 nucleotides, shorter than the 800-1000

obtained with chain termination methods.

Metagenomes sequenced by next-generation technologies include bulk soil (Leininger

et al., 2006), mycorrhizae (Lumini et al., 2009), coral reefs (Wegley et al., 2007), the

human gut and stools (Qin et al., 2010), mammoth bone (Huson et al., 2007). Lee et

al. (2010) characterized reductive dechlorinating activities and population dynamics

in tidal flat sediments (where tetrachloroethene (PCE) and trichloroethene (TCE) are

common groundwater contaminants) using latest titanium pyrosequencing along with

16S rRNA gene clone libraries and dechlorinator-targeted quantitative real-time PCR

(qPCR). Nowadays, metagenome sequencing is generating a wealth of data and as a

result there is a continuous augmentation of information in databases. More and more

sequences are phylogenetically affiliated and the sequences are being annotated for

more and more number of genes. Besides functional analysis, many research groups

are also focussing on phylogenetically linking the functional genes. However, 16S

rRNA remains the most valuable phylogenetic marker, but this kind of approaches -

phylogenetic affiliation of functional genes will be of immense value in

characterization of bacterial communities.

Metagenomics is a burgeoning area that is generating enormous amounts of biological

information. In addition, the development of new bioinformatics approaches and tools

is allowing innovative mining of both existing and new data (Ward, 2006). The

metagenomic approach has also demonstrated that novel metabolic genes play an

important role in the biodegradation of the compounds. However metagenomics is

still a developing technology with limitations to be overcome, such as gene

enrichment, DNA extraction, host-vector design, library preparation, screening and

the phylogenetic affiliation of isolated genes.

Metagenomic analyses in bioremediation entail: (A) Identification of genes involved

in bioremediation; (B) Characterization of genes and complete genomes; (C)

Phylogenetic affiliation of key catabolic genes.

Chapter 1: Introduction

17

Conversely, sheer detection of a gene or a cluster of genes does not bequeath with a

complete picture as the informations about the mRNA expression level, the amount of

protein produced and its location, biological activity or functional relationship with

metabolomes are required for understanding any metabolic reaction. Moreover in

cells many levels of regulation occur after genes have been transcribed, such as post

transcriptional, translational and post translational regulation and all forms of

biochemical control such as allosteric or feedback regulation. Hence, to fully

elucidate microbial metabolism and its responses to environmental factors, it is

necessary to include functional characterization and accurate quantification of all

levels of gene products, mRNA, proteins and metabolites, as well as their interaction

(Zhang et al., 2010).

1.3.2 Transcriptomics and Metatranscriptomics

The genes for bioremediation may be present but not expressed. Therefore there has

been an increased emphasis on quantifying the levels of mRNA for key

bioremediation genes. Transcriptomics is analysis of transcripts of a single organism

and metatranscriptomics is analyses of transcripts of the entire interacting community.

Transcriptomics, also called as gene expression profiling, provides the insights into

the up- or down- regulation of genes under stress conditions in environmental

microbial communities. The first regulatory point for successful synthesis of a protein

from a gene is regulation of gene expression, one of the key processes for adapting to

sustain in affected environmental conditions. Often increased mRNA concentration

can be at least qualitatively associated with higher rates of contaminant degradation

(Schneegurt and Kulpa, 1998; Lovley, 2003). Although gene expression is affected by

many environmental factors, a subset of genes with altered expression can inform on

stress responses. The potential utility here is to improve biomarker identification and

to identify patterns of gene expression associated with different types of pollutants.

Analysis of the mRNA concentration for genes other than those directly involved in

bioremediation might yield additional insights into the factors that control the rate and

extent of bioremediation (Lovley, 2003). Transcriptome analyses have mainly been

dependent on technologies such as PCR, microarray and sequencing. Highly sensitive

methods that can detect mRNA for key bioremediation genes in single cells are now

Chapter 1: Introduction

18

available (Bakermans and Madsen, 2002). Conversely, the only major precaution

required is that RNA have a very short life (some transcripts have life of less than a

minute). Consequently, RNA storage after sample collection and preparation is very

important step and strategies for transcriptome analysis need to be designed

accordingly.

Bordenave et al. (2009) applied RT-PCR for analysis of differentially expressed

cDNA involved in microbial mat response after heavy fuel contamination. He et al.

(2010) carried out metatranscriptomic analysis of gene expression and regulation of

Candidatus Accumulibacter enriched lab-scale sludge during enhanced biological

phosphorus removal (EBPR) using medium density oligonucleotide microarrays.

They analysed both aerobic and anaerobic phases and detected the expression of a

number of genes involved in the carbon and phosphate metabolisms, as proposed by

EBPR as well as novel genes discovered through metagenomic analysis. Ozsolak et

al. (2009) reported direct single molecule RNA sequencing without prior conversion

of RNA to cDNA. They applied this technology to sequence femtomole quantities of

poly(A) Saccharomyces cerevisae RNA using a surface coated with poly(dT)

oligonucleotides to capture the RNAs at their natural poly(A) tails and initiate

sequencing by synthesis. They observed transcript 3‟ end heterogeneity and

polyadenylated small nucleolar RNAs. This study provides a path to high-throughput

and low-cost direct RNA sequencing and achieving the ultimate goal of a

comprehensive and bias-free understanding of transcriptomes.

Metatranscriptome analyses in bioremediation entail: (A) Identification of

differentially expressed mRNA and their respective genes; (B) Assessing changes in

the expression of previously characterized key genes (biomarkers).

1.3.3 Proteomics and Metaproteomics

Proteomics is analyses of proteins of a single organism and metaproteomics is to

analyse the protein expression profile of the whole community. Proteins, the major

components of cell, serve as catalytic enzymes in metabolic pathways and in signal

transduction of regulatory pathways of cells (Graham et al., 2007; Zhang et al., 2010).

The cellular expression of proteins in an organism varies with environmental

Chapter 1: Introduction

19

conditions. The changes in physiological response may occur due to the organism‟s

adaptive responses to different external stimuli such as presence of toxic chemicals in

the environment. Developments in functional proteomics approach for environmental

remediation will further pave the way towards cell free bioremediation (Singh and

Nagaraj, 2006). At present, amongst all the omic technologies, metaproteomic

analyses still faces lots of practical challenges. There is no common extraction

procedure for all different (hydrophilic/hydrophobic/etc.) kinds of proteins and

consequently today‟s technologies are capable of resolving only a minute fraction of

metaproteome of complex environments. Substantial improvements in the

technologies for protein extraction, separation and identification are necessary to

encompass a complete picture (Wilmes and Bond, 2006).

Proteome analyses have revolved around two techniques - Two dimensional gel

electrophoresis (2DE) and mass spectrometry (MS). MS has been integrated not just

with 2 DE but also with different chromatographs and various other techniques. Kim

et al. (2009) in their review have given a summary on proteomic approaches used to

study bacterial degradation of aromatic hydrocarbons. Wilmes and Bond (2004)

successfully carried out extraction and purification of the entire proteome from a

laboratory-scale activated sludge system for enhanced biological phosphorus removal,

separation by two-dimensional polyacrylamide gel electrophoresis and mapping of the

metaproteome. Wilmes et al. (2008) applied 2-DE along with MALDI-TOF MS and

Q-TOF MS/MS to identify highly expressed proteins in a mixed culture activated

sludge system and this study provided direct evidence linking the metabolic activities

of „Accumulibacter‟ to the chemical transformations observed in EBPR. D‟Souza-

Ticlo et al. (2009) purified and characterized a thermostable metal-tolerant laccase

having bioremediation potential. The protein Lac IId was purified by 2DPAGE and

analysed by N-terminal sequencing and internal peptide sequencing. Trautwein et al.

(2008) applied 2D-DiGE to quantify the proteomic response of the denitrifying

bacterium Aromatoleum aromaticum strain EbN1 to solvent stress. Wu et al. (2010)

performed systematic analyses at the transcriptomic and proteomic levels to

investigate the expression changes due to high Mn in environment. These studies

revealed that under conditions of increased Mn concentration, there was a change in

Chapter 1: Introduction

20

regulation of proteins involved in virulence, oxidative stress defence, cellular

metabolism, protein synthesis, RNA processing and cell division. Mn regulation of

inorganic pyrophosphatase (Ppa) indicated the potential involvement of phosphate

metabolism in the Mn-dependent oxidative stress defence. Kim et al. (2006) carried

out quantitative proteomic analysis of aromatic pathways in Pseudomonas putida KT

2440 using 2-DE/MS and cleavable isotope-coded affinity tag (ICAT) to determine

whether proteins involved in aromatic compound degradation pathways were altered

as predicted by genomic analysis carried out by Jiménez et al. (2002). The lack of

complete knowledge restricts progress in the site-specific mineralization process. In

the postgenomic era the emphasis is on 2D PAGE and MS to obtain the incomplete

biological information regarding the regulation of growth and metabolism in

microbial communities (Singh, 2006).

Pseudomonas putida is a model organism for bioremediation because of its

remarkable metabolic versatility, extensive biodegradative functions, and ubiquity in

contaminated soil environments. Thompson et al. (2010) characterized proteome

profile of aerobically grown Cr(VI)-stressed Pseudomonas putida F1 in two dissimilar

nutritional environments: rich (LB) media and minimal (M9L) media containing

lactate as the sole carbon source. Comparative analysis indicated that the core

molecular response to chromate, irrespective of the nutritional conditions tested,

comprised seven up-regulated proteins belonging to six different functional categories

including transcription, inorganic ion transport/metabolism, and amino acid

transport/metabolism (Thompson et al., 2010).

Metaproteome analyses in bioremediation entail: (A) Identification of differentially

expressed proteins and their respective genes; (B) Assessing changes in the

abundance of previously characterized key proteins (biomarkers); (C) Protein

structure and function characterization.

1.3.4 Metabolomics, Metametabolomics and Fluxomics

Metabolomics is analysis of entire repertoire of cellular metabolites and

metametabolomics is analyses of metabolites of entire interacting community. Real

time flux analysis of cellular molecules/metabolites within a cell/community over a

Chapter 1: Introduction

21

time period is known as fluxomics (Wiechert et al., 2007). Information on factors

regulating growth and metabolism of microbial communities can be accessed by

metabolomics and fluxomics can provide the missing links in the regulatory

pathways, involved in metabolism of environmental pollutants.

A microbial cell releases a number of low molecular weight primary and secondary

metabolites in response to an environment challenge or stress. The influence of the

local environment on the metabolome of an organism/community can be exploited in

bioremediation to characterize the effects of xenobiotic compounds. Metabolites can

be characterized by mass spectrometry and various spectroscopic techniques. Villas-

Boas and Bruheim (2007) have discussed the scenario of metabolome analysis in

bioremediation. They have described potential of various experimental and conceptual

approaches developed for metabolomics to be applied in bioremediation research,

such as strategies for elucidation of biodegradation pathways using isotope

distribution analysis and molecular connectivity analysis, the assessment

of mineralization process using metabolic footprinting analysis, and the improvement

of the biodegradation process via metabolic engineering. The use of metabolomics

tools can significantly extend and enhance the power of existing bioremediation

approaches by providing a better overview of the biodegradation process (Villas-Boas

and Bruheim., 2007).

Keum et al. (2008) evaluated metabolic profiles of Sinorhizobium sp. C4 using gas

chromatography and mass spectrometry during degradation of phenanthrene in

comparison to natural carbon sources. Tang et al. (2009) performed a fluxomics

analysis on Shewanella sp. known to have co-metabolic pathways for bioremediation

of toxic metals, radionuclides and halogenated organic compounds. From the

metabolic flux analysis of Shewanella sp. using GC-MS and statistical, biochemical

and genetic algorithms they deduced that Shewanella sp. displays a relatively flexible

metabolism fluxes when adapting to different carbon sources. Durand et al. (2010)

applied ex situ Nuclear Magnetic Resonance (NMR) and Liquid Chromatography-

NMR (LC-NMR) as complementary tools to LC-Mass Spectrometry (MS) to define

the metabolic pathway of mesotrione, an herbicide, by the bacterial strain Bacillus sp.

3B6. The complementarities of ex situ and LC-NMR identified six metabolites

Chapter 1: Introduction

22

whereas the structures of only four metabolites were suggested by LC-MS. The

presence of a new metabolic pathway was also evidenced by NMR. The results

demonstrate that NMR and LC-NMR spectroscopy provide unambiguous structural

information for xenobiotic metabolic profiling. Wharfe et al. (2010) used FT-IR as a

metabolite fingerprinting tool for monitoring the phenotypic and biochemical changes

in complex bacterial communities capable of degrading phenol in the activated sludge

from an industrial bioreactor. Wang et al. (2010) applied fourier transform infrared

spectroscopy (FT-IR) and gas chromatography-mass spectrometry (GC-MS) for

analysis of degradation of lube oil and TDOC (total dissolved organic carbon) by a

mixed bacterial consortium. In addition, they analysed that mixed bacterial

consortium can degrade benzene and its derivatives and other aromatic ring organic

matters more than 97%. Maphosa et al. (2010b) used a combination of molecular

diagnostics with mass-balancing and kinetic modeling to improve insight into

organohalide respiring bacteria and metabolite dynamics in an in situ dechlorinating

bioreactor and showed its utility in monitoring bioremediation.

„Meta‟metabolome analyses in bioremediation entail: (A) Identification of differential

metabolic profile; (B) Quantifying functional roles of metabolites.

1.3.5 Bioinformatics - databases, softwares, tools and approaches

Development of bioinformatics techniques has propelled the biological research by a

large magnitude as it has led to automated, precise and faster analyses of the

generated data. Though analysis of the generated data may be the last step, yet it is the

most important, as the wet-lab-data is meaningless, if not completely evaluated. Many

bioinformatic infrastructures are available for analysis of genomics, transcriptomics,

proteomics or metabolomics. The major revolutions have been automated genome

sequencing, genome comparisons to identify the genome function and microbial

evolution, derivation of metabolic and regulatory pathways, gene expression analysis,

statistical tools, data mining techniques to derive protein-protein and protein-DNA

interactions, modeling of 3D structure of proteins and 3D docking between proteins

and biochemicals for rational drug design. The computational-based annotation and

comparative genomic analyses of DNA sequences have provided information

regarding gene function, genome structures, biological pathways, metabolic and

Chapter 1: Introduction

23

regulatory networks, and evolution of microbial genomes, which has greatly enhanced

our understanding of microbial metabolism (Schoolnik, 2001; Ward and Fraser, 2005;

Sharan and Ideker, 2006; Cardenas and Tiedje, 2008; Rocha, 2008; Zhang et al.,

2010).

Despite the fast paced global effort, the current analysis is limited by the lack of

available gene-functionality from the wet-lab data, the lack of computer algorithms to

explore vast amount of data with unknown functionality, limited availability of

protein-protein and protein-DNA interactions, and the lack of knowledge of temporal

and transient behaviour of genes and pathways (Bansal, 2005). However, now the

focus in „in silico‟ developments is to modify the existing tools or develop new tools

to facilitate integrated analyses of „omic‟ studies. The integrated analyses of data sets

will assist in understanding the changes taking place at various levels, beginning from

DNA which contains the basic genetic information, followed by RNA and finally the

products protein or metabolites. Today scientific community is on the verge of using

all this knowledge to understand cellular mechanisms at the systemic level.

Bioinformatics requires the study of microbial genomics, transcriptomics, proteomics,

systems biology, computational biology, phylogenetic trees, data mining and

application of major bioinformatic tools for determining the structures and

biodegradative pathways of xenobiotic compounds (Fulekar and Sharma, 2008).

Bioinformatic resources exclusively for bioremediation studies and analyses have

been and are being developed. The University of Minnesota

Biocatalysis/Biodegradation Database (UM-BBD, http://umbbd.msi.umn.edu/) was

developed in 1995 and is regularly updated. It contains information on 1240

compounds, 864 enzymes, 1337 reactions, 274 biotransformation rules and 510

microorganism entries. Besides these data, it includes a Biochemical Periodic Table

(UM-BPT) and a rule-based Pathway Prediction System (UM-PPS)

(http://umbbd.msi.umn.edu/predict/) that predicts plausible pathways for microbial

degradation of organic compounds. Public access to UM-BBD data is increasing and

UM-BBD compound data are now contributed to PubChem and ChemSpider, the

public chemical databases. A new mirror website of the UM-BBD, UM-BPT and

UM-PPS is being developed at ETH Zürich to improve speed and reliability of online

Chapter 1: Introduction

24

access from anywhere in the world (Gao et al., 2010). Urbance et al. (2003) have

developed a freely accessible Biodegradative Strain Database (BSD)

(http://bsd.cme.msu.edu) within the phylogenetic framework of the Ribosomal

Database Project II (RDPII: http://rdp.cme.msu.edu/html) to provide detailed

information on degradative bacteria and the hazardous substances they degrade,

including corresponding literature citations, relevant patents and links to additional

web-based biological and chemical data. Pazos et al. (2003) studied the characteristics

of the global biodegradation network by analysing a large set of chemical reactions

that are implicated in biodegradation to obtain quantitative insights in its organization

and possible mechanism of evolution. Pazos et al. (2005) developed a system,

MetaRouter for maintaining heterogeneous information related to bioremediation in a

framework that allows its query, administration and mining. The system can be

accessed and administrated through a web interface and is available at

http://pdg.cnb.uam.es/MetaRouter and additional material is available at

http://www.pdg.cnb.uam.es/biodeg_net/MetaRouter. Arora et al. (2009) compiled a

database (http://www.imtech.res.in/raghava/oxdbase/) of biodegradative oxygenases

(OxDBase), which provides a compilation of the oxygenase data as sourced from

primary literature in the form of web accessible database. There are two separate

search engines for searching into the database - mono and di oxygenases database,

respectively. Each enzyme entry contains its common name and synonym, reaction in

which enzyme is involved, family and subfamily, structure and gene link and

literature citation. The entries are also linked to several external database including

BRENDA, KEGG, ENZYME and UM-BBD providing wide background information.

At present (September 2011) the database contains information of over 240

oxygenases including both dioxygenases and monooxygenases. OxDBase is the first

database that is dedicated to oxygenases and provides comprehensive information

about them. Due to the importance of the oxygenases in chemical synthesis of drug

intermediates and oxidation of xenobiotic compounds, OxDBase database would be

very useful tool in the field of synthetic chemistry as well as bioremediation. Moriya

et al. (2010) described PathPred (http://www.genome.jp/tools/pathpred/), a web-based

server to predict plausible pathways of multi-step reactions starting from a query

Chapter 1: Introduction

25

compound, based on the local RDM pattern match and the global chemical structure

alignment against the reactant pair library. This server focuses on predicting pathways

for microbial biodegradation of environmental compounds and biosynthesis of plant

secondary metabolites, which correspond to characteristic RDM patterns in 947 and

1397 reactant pairs, respectively. The server provides transformed compounds and

reference transformation patterns in each predicted reaction, and displays all predicted

multi-step reaction pathways in a tree-shaped graph. The transformation of xenobiotic

compounds by microorganisms is essential for the bioremediation of contaminated

environments. There is no single resource that provides information about

environmental contaminants as well as microorganisms with biodegradative

capabilities. A website that consolidates the detailed information about chemical

compound and reference data related to biocatalysis, biotransformation,

biodegradation and bioremediation would be an invaluable tool for academic and

industrial researchers and environmental engineers (Urbance et al., 2003).

Advances in molecular technologies over last few decades have removed the

boundaries between genomics, transcriptomics, proteomics and metabolomics. The

detailed genomic/metagenomic, transcriptomic/metatranscriptomic,

proteomic/metaproteomic, metabolomic/metametabolomic, bioinformatic and other

high-throughput analyses of environmentally relevant microorganisms/environmental

sites provide exceptional and novel insights into key biodegradative pathways at

molecular level and about the ability of organisms to adapt to changes in

environmental conditions, as outlined in Fig. 1.4. Fig. 1.4a presents an overall

framework for integration of molecular and „omic‟ approaches to gain insight into

microbial bioremediation. Fig. 1.4b schematically represents how integrated

molecular analysis can facilitate in understanding bioremediation strategies.

Chapter 1: Introduction

26

a

Chapter 1: Introduction

27

b

Fig. 1.4: (a) Integrating molecular and „omic‟ approaches to gain insight into

microbial bioremediation; (b) A schematic representation describing how integrated

molecular analysis facilitates in understanding bioremediation strategies.

1.4 Milestones in ‘omic’ studies and technologies

There have been numerous discoveries and innovations in technologies which have

changed the course of scientific research. Besides these, even some novel ideas and/or

approaches have given new insights and dimensions to the scientific thoughts. These

technologies and approaches revolutionize not just one specific scientific field of area

or study but all the scientific fields gain benefit. Table 1.1 describes some of the

important milestones that have revolutionized the approach of molecular biology to

study scientific mysteries.

Chapter 1: Introduction

28

Table 1.1: Milestones in „omic‟ studies and technologies.

(A)

Molecular

and ‘omic’

studies

Genomics Fred Sanger and his group

established techniques of

sequencing, genome mapping,

data storage and bioinformatics

analyses. They sequenced

complete genome of virus and

mitochondrion. The first bacterial

genome to be completed was that

of Haemophilis influenza in 1995.

1970s-

1980s

Metabolomics The concept that individuals might

have a metabolic profile was

introduced by Roger Williams in

late 1940s. However, the term

„metabolic profile‟ was introduced

by Horning in 1971.

1971

Bioinformatics The term bioinformatics was

coined by Paulien Hogeweg.

1979

16S rRNA gene The pioneering work of Carl

Woese, using 16SrRNA as

„evolutionary chronometers‟ in

deducing the bacterial phylogeny

changed the way we visualize the

microbial world.

1977

Ribosomal

Database Project

The Ribosomal Database Project

(RDP) provides ribosome related

data and services to the scientific

community, including online data

analysis and aligned and annotated

Bacterial and Archaeal small-

subunit 16S rRNA sequences.

Moreover, the database is

regularly augmented and upgraded

for new sequences and better

analysis.

1992

Proteomics The term proteomics was coined

to make analogy with genomics.

1997

Metagenomics Norman Pace and co-workers were

the first to clone genes directly

from environmental samples in

1991. The term was coined by Jo

Handelsman in 1998.

2000s

Metaproteomics The term metaproteomics was

proposed by Francisco Rodríguez-

Valera to describe the genes

2004

Chapter 1: Introduction

29

and/or proteins most abundantly

expressed in environmental

samples.

Multi‘omics’ Researchers are developing

platforms to integrate all the

studies and technologies.

2005 onwards

(B)

Technologies

Mass

Spectrometry

The first application of mass

spectrometry for analysis of amino

acids and peptides was reported in

1958. Mass spectrometry is a

combination of many

technologies. Many Nobel prizes

in physics and chemistry have

been awarded to studies linked to

mass spectrometry and this in

itself highlights the importance of

this technology.

1958

Two dimensional

gel

electrophoresis

O‟ Farrell developed this

technique and revolutionized

protein purification and

characterization.

1975

Sequencing Maxam-Gilbert and Chain

termination methods developed in

between 1975-1980 were the first

true and feasible sequencing

technologies.

1975-

1980

PCR Kary Mullis developed PCR

technology and this invention

revolutionized the genomic era

and provided a large thrust to

many of the scientific discoveries.

1983

Microarray Microarray evolved from southern

blotting and the use of collection

of distinct DNAs was first time

described in 1987.

1987

Pyrosequencing The technique was developed by

Mostafa Ronaghi in 1996.

However, the major applications

started only after 2005. Robert

Edwards and his colleagues

published the first sequences of

environmental samples generated

with so-called next generation

sequencing, chip-based

pyrosequencing developed by 454

Life Sequences.

1996

Chapter 1: Introduction

30

1.5 Microbial enzymes involved in biodegradation

Enzymes are the catalysts playing a key and direct role in biodegradation of

xenobiotic compounds. Consequently, in the last decade, the focus on enzymes

involved in environmental bioremediation gained considerable importance (Shah et

al., In Press). In this section, the enzyme classes (based on general classification)

involved in bioremediation are described.

1.5.1 Oxidoreductases

Oxidoreductases catalyse the transfer of electrons from one molecule (reductant) to

another molecule (oxidant). In enzyme classification oxidoreductases are classified as

EC 1 and are further subdivided into 22 sub classes. From the UM-BBD we can see

that enzymes belonging to sub classes other than EC 1.9, 1.15, 1.19 and 1.21 play a

role in biodegradation pathways (Table 1.2 – Enzyme sub classes playing role in

bioremediation are in bold).

Table 1.2: Description of oxidoreductases.

Enzyme class Description

EC 1.1 Act on the alcohol (CH-OH) group of donors

EC 1.2 Act on the aldehyde or oxo group of donors

EC 1.3 Act on the CH-CH group of donors

EC 1.4 Act on the CH-NH2 group of donors

EC 1.5 Act on the CH-NH group of donors

EC 1.6 Act on NADH or NADPH

EC 1.7 Act on other nitrogenous compounds as donors

EC 1.8 Act on sulphur as donors

EC 1.9 Act on heme as donors

EC 1.10 Act on diphenols and related substances as donors

EC 1.11 Act on peroxide as an acceptor

EC 1.12 Act on hydrogen as donors

EC 1.13 Act on single donors with incorporation of molecular oxygen

EC 1.14 Act on paired donors with incorporation of molecular oxygen

EC 1.15 Act on superoxide radicals as acceptors

EC 1.16 Oxidize metal ions

EC 1.17 Act on CH or CH2 groups

EC 1.18 Act on iron-sulphur proteins as donors

EC 1.19 Act on reduced flavodoxin as a donor

EC 1.20 Act on phosphorous or arsenic in donors

EC 1.21 Act on X-H and Y-H to form an X-Y bond

EC 1.97 Others

Chapter 1: Introduction

31

Oxidoreductases play a major role in biodegradation pathways. They carry out key

essential steps needed in transformation and mineralization of xenobiotic compounds.

Usually the oxidoreductases are the first enzymes to act on the complex compounds

and hence initiate biodegradation. Mainly oxidoreductases are generally involved in

cellular metabolism, but when under the pressure of xenobiotic compounds they start

utilising them for energy sources and thus in turn are useful for bioremediation

studies. In UM-BBD 878 enzymes are listed that play a role in bioremediation and out

of them 543 belong to oxidoreductases, clearly showing the importance of these

enzymes in biodegradation. Moreover the database also has separate section of 76 and

109 reactions for naphthalene 1,2 dioxygenase and toluene dioxygenase highlighting

the importance of these enzymes. Oxidoreductases can be oxygenases

[monooxygenases and dioxygenases - incorporate molecular oxygen into organic

substrates (classified mainly to 1.13 and 1.14)], reductases [catalyze reductions, in

most cases reductases act similar to oxidases (classified mainly to 1.1, 1.3, 1.4, 1.6,

1.7, 1.8, 1.11, 1.12, 1.20 and 1.97), oxidases [enzymes involved when molecular

oxygen acts as an acceptor of hydrogen or electrons (classified mainly to 1.2, 1.3, 1.4,

1.5, 1.7, 1.8, 1.10 and 1.20)], dehydrogenases [enzymes that oxidize a substrate by

transferring hydrogen to an acceptor that is either NAD+/NADP

+ or a flavin enzyme

(classified mainly to 1.1, 1.2, 1.3, 1.4, 1.5 and 1.17)], peroxidases [catalyzes the

reduction of hydrogen peroxide (1.11)], hydroxylases [add hydroxyl groups to its

substrates(1.14 and 1.97)], laccasses [act on phenols and similar molecules (1.10)]

and others such as hydrogenases (1.12), demethylases (1.13), nitrogenases (1.18),

denitritases (1.7), dehalogenases (1.97), dechlorinases (1.97) and others.

1.5.2 Transferases

Transferases catalyse the transfer of a functional group from one molecule (donor) to

another molecule (acceptor). Generally the donor molecules are coenzymes. In

enzyme classification transferases are classified as EC 2 and are further subdivided

into 9 sub classes. From the UM-BBD we can see that enzymes belonging to sub

classes other than EC 2.2 and 2.9 play a role in biodegradation pathways (Table 1.3 –

Enzyme sub classes playing role in bioremediation are in bold).

Chapter 1: Introduction

32

Table 1.3: Description of transferases.

Enzyme class Description

EC 2.1 Transfer one carbon groups

EC 2.2 Transfer aldehyde or ketone groups

EC 2.3 Acyltransferases

EC 2.4 Glycosyltransferases

EC 2.5 Transfer alkyl or aryl groups other than methyl groups

EC 2.6 Transfer nitrogenous groups

EC 2.7 Transfer phosphorous containing groups

EC 2.8 Transfer sulphur containing groups

EC 2.9 Transfer selenium containing groups

1.5.3 Hydrolases

Hydrolases catalyse the hydrolysis of a chemical bond. In enzyme classification

hydrolases are classified as EC 3 and are further subdivided into 13 sub classes.

Hydrolases are the second most abundant classes of enzymes involved in

bioremediation processes. From the UM-BBD we can see that enzymes belonging to

sub classes EC 3.1, 3.3, 3.5, 3.7, 3.8, 3.10, 3.13 play a role in biodegradation

pathways (Table 1.4 – Enzyme sub classes playing role in bioremediation are in

bold).

Table 1.4: Description of hydrolases.

Enzyme class Description

EC 3.1 Act on ester bonds

EC 3.2 Act on sugars

EC 3.3 Act on ether bonds

EC 3.4 Act on peptide bonds

EC 3.5 Act on carbon-nitrogen bonds

EC 3.6 Act on acid anhydrides

EC 3.7 Act on carbon-carbon bonds

EC 3.8 Act on halide bonds

EC 3.9 Act on phosphorous-nitrogen bonds

EC 3.10 Act on sulphur-nitrogen bonds

EC 3.11 Act on carbon-phosphorous bonds

EC 3.12 Act on sulphur-sulphur bonds

EC 3.13 Act on carbon-sulphur bonds

Chapter 1: Introduction

33

1.5.4 Lyases

Lyases catalyse the breaking of various chemical bonds by means other than

hydrolysis and oxidation and often resulting in formation of a multiple bonds or a new

ring structure. In enzyme classification lyases are classified as EC 4 and are further

subdivided into 7 sub classes. From the UM-BBD we can see that enzymes belonging

to sub classes other than EC 4.6 play a role in biodegradation pathways (Table 1.5 –

Enzyme sub classes playing role in bioremediation are in bold).

Table 1.5: Description of lyases.

Enzyme class Description

EC 4.1 Cleave carbon-carbon bonds

EC 4.2 Cleave carbon-oxygen bonds

EC 4.3 Cleave carbon-nitrogen bonds

EC 4.4 Cleave carbon-sulphur bonds

EC 4.5 Cleave carbon-halide bonds

EC 4.6 Cleave phosphorous-oxygen bonds

EC 4.99 Others

1.5.5 Isomerases

Isomerases catalyze the structural rearrangement of isomers. In enzyme classification

isomerases are classified as EC 5 and are further subdivided into 6 sub classes. From

the UM-BBD we can see that enzymes belonging to sub classes other than EC 5.99

play a role in biodegradation pathways (Table 1.6 – Enzyme sub classes playing role

in bioremediation are in bold).

Table 1.6: Description of isomerases.

Enzyme class Description

EC 5.1 Catalyze racemisation and epimerisation

EC 5.2 Catalyze isomerisation of geometric isomers

EC 5.3 Catalyze intramolecular transfer of electrons

EC 5.4 Catalyze intramolecular transfer of functional group

EC 5.5 Catalyze intramolecular breaking of bonds

EC 5.99 Others

Chapter 1: Introduction

34

1.5.6 Ligases

Ligases catalyze the joining of two molecules by forming a new chemical bond

usually accompanied with hydrolysis. In enzyme classification ligases are classified

as EC 6 and are further subdivided into 6 sub classes. Ligases are enzymes, involved

in joining of molecules and not breaking of molecules required for biodegradation of

xenobiotic compounds. Yet, ligases are occasionally involved in biodegradation

pathways. They add some co-factor or group to make the compound susceptible to

degradation by other enzymes. One such example is benzoate-CoA ligase adds CoA

to benzoate and leads to formation of benzoyl-CoA on which later on reductases,

hydratases and others can act for complete biodegradation. From the UM-BBD we

can see that enzymes belonging to sub classes EC 6.2 and 6.4 play a role in

biodegradation pathways (Table 1.7 – Enzyme sub classes playing role in

bioremediation are in bold).

Table 1.7: Description of ligases.

Enzyme class Description

EC 6.1 Form carbon-oxygen bonds

EC 6.2 Form carbon-sulphur bonds

EC 6.3 Form carbon-nitrogen bonds

EC 6.4 Form carbon-carbon bonds

EC 6.5 Form phosphoric ester bonds

EC 6.6 Form nitrogen-metal bonds

1.6 Elucidating the metabolic pathways involved in biodegradation

Hundreds of xenobiotic compounds, hundreds of enzymes, hundreds of reactions,

hundreds of interlinked pathways and add to this environmental conditions (aerobic or

anaerobic, nutrient availability, moisture and other factors) making biodegradation in

principle a complex process but to keep it simple all reactions have one aim to follow

and that is, mineralization into simple components, whatsoever the compound may be.

Thermodynamic feasibility, chemical equilibrium, reaction dynamism and many such

other factors play a vital role in biodegradation mechanism. Many intermediates can

follow two different pathways for their complete mineralization. The best example is

of catechol which can be degraded by ortho cleavage as well as meta cleavage

Chapter 1: Introduction

35

pathway. These differences do reflect some significance at the genetic level.

Generally genes that encode enzymes of the ortho-cleavage pathways mostly reside

on chromosome/genome, whereas those of meta-cleavage pathways are mostly

harboured by plasmids (Houghton and Shanley, 1994). This also establishes the

importance of other factors playing a role in the mechanism. However, there are

certain factors which do tend to favour a particular pathway. Moreover, compounds

belonging to same class or group of compounds, having similar kind of core structure

or functional groups generally are metabolized to common intermediates and

thereafter the pathway is common. Many a times this common intermediates are

shared even by many pathways. In many cases these intermediates interconvert

depending on reaction dynamism. There are many common intermediates such as

catechol, 4- methylcatechol, protocatechuate, 4- hydroxyl benzoate, maleylacetate, 4-

hydroxyl 2- oxopentanoate, homogentisate, glyoxylate, propanoate, cyanuric acid and

others for aerobic pathways and benzoyl-CoA, acetyl-CoA and others for anaerobic

pathways. Certain compounds undergo aerobic biodegradation, in such cases oxygen

is generally the electron acceptor. However, many contaminated environmental sites

are anoxic and in such cases microorganisms can anaerobically oxidize contaminants

with alternative electron acceptors such as nitrate, sulphate, Fe (III) oxides and others.

In certain bioremediation reactions contaminants are electron acceptors rather than

electron donors, the most prominent example is reductive dechlorination (Lovley,

2003). The other important factor that plays a role in degradation of xenobotic

compounds is that it may be that, initial reactions require anaerobic environment and

final reactions may require aerobic environment. These kinds of need make the

biodegradation tricky for certain compounds. The ultimate necessity is that the

intermediates formed must be able to enter any of the central metabolic pathways

such as glycolysis, Kreb‟s cycle or others so that they are completely mineralized and

does not lead to formation of any dead end metabolites.

The work of Salinero et al. (2009) highlights that there is much to be learned

regarding the metabolic capabilities and life-style of microbial species. Metabolic

analysis of the soil microbe Dechloromonas aromatica strain RCB revealed that it did

not harbour the anticipated (previously characterized) enzymes for anaerobic aromatic

Chapter 1: Introduction

36

degradation. On annotations of genes, it was evident that several metabolic pathways

have yet to be observed experimentally.

An increasing amount and variety of xenobiotic compounds are released in

environment. It is vital to know the fate of these chemical compounds. However, to

find that experimentally without any hint will be a very vague approach. Hence, it will

be useful to predict whether a particular compound is biodegradable and it can be

mineralized by which pathways. UM-BBD besides providing information about

enzymes, compounds, reactions, biotransformation rules and microorganisms; it

includes a Biochemical Periodic Table (UM-BPT) and a rule-based Pathway

Prediction System (UM-PPS) (http://umbbd.msi.umn.edu/predict/) that predicts

plausible pathways for microbial degradation of organic compounds (Gao et al.,

2010). Finley et al. (2009) have described a computational framework (called BNICE)

that can be used for the prediction of novel biodegradation pathways of xenobiotics.

BNICE reproduced the biodegradation routes found experimentally, and in addition, it

expanded the biodegradation reaction networks by generating novel pathways to

degrade xenobiotic compounds that are thermodynamically feasible alternatives to

known biodegradation routes and attractive targets for metabolic engineering.

Metagenomics along with complementary „omics‟ technologies based analyses

harbours the most promising opportunities to delineate the community‟s metabolic

pathways responsible for bioremediation of environmental pollutants.

1.7 Scenario and future perspectives

Bioremediation strategies are multidisciplinary involving ecology, geology,

physiology, genetics, biochemistry, engineering and microbiology. The molecular

biology and „omics‟ technologies have led to tremendous progress in exploration of

polluted sites and understanding the catabolic reactions occurring in bioremediation

processes. The data generated by present „omics‟ technologies are huge and the

foremost need that has arisen is how to store this huge information and above all how

to correlate and integrate them for comprehensive analyses. The use of massively

parallel sequencing technologies is expanding the generated data exponentially.

Therefore, the development of novel and sophisticated bioinformatic tools dedicated

Chapter 1: Introduction

37

to bioremediation studies is necessary for analysis and integration of omic data sets

involving identification of genes, phylogenetic affiliation of sequence reads and

assignment of functions to expressed proteins. This integrated „omics‟ technology

based functional analysis from genes via transcripts to proteins and metabolites at

„meta‟ level will answer all the relevant queries and mysteries that have hampered

bioremediation strategies till now. At present there is a dire need for high-throughput

methods for functional screening and characterization of key enzymes and metabolites

involved in stress response metabolic networks. Developments in existing approaches

and innovations in technologies will lead to novel and extensive approaches.

Conversely, the technologies will have to be economical and applicable at high

throughput level for increased access and applications. In future, analyses by „meta‟

approaches will be successfully used in devising site/contamination specific

bioremediation strategies.

1.8 Objectives of the study

The objectives of the research presented in this thesis are as follows:

Standardization of metagenomic DNA isolation from sites contaminated by

industrial discharges.

Contaminated soil metagenomics - library preparation and functional

screening.

Taxonomic profiling of microbial community inhabiting the contaminated

soil.

Sequence-driven analysis of metagenome for characterization of genes

involved in xenobiotic biodegradation.

Community genomics: Development and characterization of azodye

decolourizing bacterial consortia.

Isolation, characterization and expression analysis of gene coding for

azoreductase.

Chapter 1: Introduction

38

1.9 References

Amann, R.I., Ludwig, W., Schleifer, K.H. (1995) Phylogenetic identification and in

situ detection of individual microbial cells without cultivation. Microbiol Rev.

59: 143-169.

Anliker, R. (1979) Ecotoxicology of dyestuffs-a joint effort by industry. Ecotox

Environ Safety. 3: 59-74.

Arora, P.K., Kumar, M., Chauhan, A., Raghava, G.P., Jain, R.K. (2009) OxDBase: a

database of oxygenases involved in biodegradation. BMC Res Notes. 2: 67.

Bakermans, C., Madsen, E.L. (2002) Detection in coal tar waste-contaminated

groundwater of mRNA transcripts related to naphthalene dioxygenase by

fluorescent in situ hybridization with tyramide signal amplification. J Microbiol

Methods. 50: 75-84.

Bansal, A.K. (2005) Bioinformatics in microbial biotechnology - a mini review.

Microb Cell Fact. 4: 19.

Beloqui, A., Pita, M., Polaina, J., Martinez-Arias, A., Golyshina, O.V., Zumarraga,

M., Yakimov, M.M., García-Arellano, H., Alcalde, M., Fernández, V.M.,

Elborough, K., Andreu, J.M., Ballesteros, A., Plou, F.J., Timmis, K.N., Ferrer,

M., Golyshin, P.N. (2006) Novel polyphenol oxidase mined from a metagenome

expression library of bovine rumen: biochemical properties, structural analysis,

and phylogenetic relationships. J Biol Chem. 281: 22933-22942.

Ben Said, O., Goñi-Urriza, M.S., El Bour, M., Dellali, M., Aissa, P., Duran, R. (2008)

Characterization of aerobic polycyclic aromatic hydrocarbon-degrading bacteria

from Bizerte lagoon sediments, Tunisia. J Appl Microbiol. 104: 987-997.

Bentley, D.R., Balasubramanian, S., Swerdlow, H.P., Smith, G.P., Milton, J., Brown,

C.G., Hall, K.P., Evers, D.J., Barnes, C.L., Bignell, H.R., et al. (2008) Accurate

whole human genome sequencing using reversible terminator chemistry. Nature.

456: 53-59.

Bertrand, H., Poly, F., Van, V.T., Lombard, N., Nalin, R., Vogle, T.M., Simonet, P.

(2005) High molecular weight DNA recovery form soils prerequisite for

biotechnological metagenomic library construction. J Microbiol Methods. 62: 1-

11.

Blumel, S., Knackmuss, H.J., Stolz, A. (2002) Molecular cloning and characterization

of the gene coding for the aerobic azoreductase from Xenophilus azovorans

KF46F. Appl Environ Microbiol. 68: 3948-3955.

Bordenava, S., Goni-Urizza, M., Caumette, P., Duran, R. (2009) Differential display

analysis of cDNA involved in microbial mats response after heavy fuel oil

contamination. J Microbial Biochem Technol. 1: 1-4.

Brennerova, M.V., Josefiova, J., Brenner, V., Pieper, D.H., Junca, H. (2009)

Metagenomics reveals diversity and abundance of meta-cleavage pathways in

microbial communities from soil highly contaminated with jet fuel under air-

sparging bioremediation. Environ Microbiol. 11: 2216-2227.

Chapter 1: Introduction

39

Burgmann, H., Pesaro, M., Widmer, F., Zeyer, J. (2001) A strategy for optimizing

quality and quantity of DNA extracted from soil. J Microbiol Methods. 45: 7-20.

Cardenas, E., Tiedje, J.M. (2008) New tools for discovering and characterizing

microbial diversity. Curr Opin Biotechnol. 19: 544-549.

Chen, H., Hopper, S.L., Cerniglia, C.E. (2005) Biochemical and molecular

characterization of an azoreductase from Staphylococcus aureus, a tetrameric

NADPH-dependent flavoprotein. Microbiology. 151: 1433-1441.

Chen, Y., Murrell, J.C. (2010) When metagenomics meets stable-isotope probing:

progress and perspectives. Trends Microbiol. 18: 157-163.

Culley, A., Lang, A., Suttle, C. (2006) Metagenomic analysis of coastal RNA virus

communities. Science. 312: 1795-1798.

Dennis, P., Edwards, E.A., Liss, S.N., Fulthorpe, R. (2003) Monitoring gene

expression in mixed microbial communities by using DNA microarrays. Appl

Environ Microbiol. 69: 769-778.

Desai, C., Madamwar, D. (2007) Extraction of inhibitor-free metagenomic DNA from

polluted sediments, compatible with molecular diversity analysis using

adsorption and ion-exchange treatments. Bioresour Technol. 98: 761-768.

Desai, C., Parikh, R.Y., Vaishnav, T., Shouche, Y.S., Madamwar, D. (2009) Tracking

the influence of long-term chromium pollution on soil bacterial community

structures by comparative analyses of 16S rRNA gene phylotypes. Res

Microbiol. 160: 1-9.

Desai, C., Pathak, H., Madamwar, D. (2010) Advances in molecular and "-omics"

technologies to gauge microbial communities and bioremediation at

xenobiotic/anthropogen contaminated sites. Bioresour Technol. 101: 1558-1569.

Díaz, E. (2004) Bacterial degradation of aromatic pollutants: a paradigm of metabolic

versatility. Int Microbiol. 7: 173-180.

Díaz, E., Prieto, M.A. (2000) Bacterial promoters triggering biodegradation of

aromatic pollutants. Curr Opin Biotechnol. 11: 467-475.

Drmanac, R., Sparks, A.B., Callow, M.J., Halpern, A.L., Burns,

N.L., Kermani, B.G., Carnevali, P., Nazarenko, I., Nilsen, G.B., Yeung, G., et

al. (2010) Human genome sequencing using unchained base reads on self-

assembling DNA nanoarrays. Science. 327: 78-81.

D'Souza-Ticlo, D., Sharma, D., Raghukumar, C. (2009) A thermostable metal-tolerant

laccase with bioremediation potential from a marine-derived fungus. Mar

Biotechnol (NY). 11: 725-737.

Durand, S., Sancelme, M., Besse-Hoggan, P., Combourieu, B. (2010) Biodegradation

pathway of mesotrione: complementarities of NMR, LC-NMR and LC-MS for

qualitative and quantitative metabolic profiling. Chemosphere. 81: 372-380.

Eyers, L., George, I., Schuler, L., Stenuit, B., Agathos, S.N., El Fantroussi, S. (2004)

Environmental genomics: exploring the unmined richness of microbes to

degrade xenobiotics. Appl Microbiol Biotechnol. 66: 123-130.

Chapter 1: Introduction

40

Ferrer, M., Golyshina, O.V., Chernikova, T.N., Khachane, A.N., Reyes-Duarte, D.,

Santos, V.A., Strompl, C., Elborough, K., Jarvis, G., Neef, A., Yakimov, M.M.,

Timmis, K.N., Golyshin, P.N. (2005) Novel hydrolase diversity retrieved from a

metagenome library of bovine rumen microflora. Environ Microbiol. 7: 1996-

2010.

Finley, S.D., Broadbelt, L.J., Hatzimanikatis, V. (2009) Computational framework for

predictive biodegradation. Biotechnol Bioeng. 104: 1086-1097.

Fortin, N., Beaumier, D., Lee, K., Greer, C.W. (2004) Soil washing improves the

recovery of total community DNA from polluted and high organic content

sediments. J Microbiol Methods. 56: 181-191.

Fulekar, M.H., Sharma, J. (2008) Bioinformatics applied in bioremediation. Innov

Romanian Food Biotechnol. 2: 28-36.

Gao, J., Ellis, L.B., Wackett, L.P. (2010) The University of Minnesota

Biocatalysis/Biodegradation Database: improving public access. Nucleic Acids

Res. 38: D488-D491.

Goh, F., Allen, M., Leuko, S., Kawaguchi, T., Decho, A., Burns, B., Neilan, B. (2009)

Determining the specific microbial populations and their spatial distribution

within the stromatolite ecosystem of Shark Bay. ISME J. 3: 383-396.

Graham, R.L.J., Graham, C., McMullan, G. (2007) Microbial proteomics: a mass

spectrometry primer for biologists. Microb Cell Fact. 6: 26.

Ham, J.G., Bonner, J.S. (1997) Environmental site assessments and bioremediation. In

Wild, J.R., Varfolomeyev, S.D., Scozzafava, A. (Eds) Perspectives in

Bioremediation Technologies for Environmental Improvement. Kluwer

Academic Publishers, Netherlands, pp. 1-12.

He, Z., Deng, Y., Van Nostrand, J.D., Tu, Q., Xu, M., Hemme, C.L., Li, X., Wu, L.,

Gentry, T.J., Yin, Y., Liebich, J., Hazen, T.C., Zhou, J. (2010) GeoChip 3.0 as a

high-throughput tool for analyzing microbial community composition, structure

and functional activity. ISME J. 4: 1167-1179.

He, Z., Gentry, T.J., Schadt, C.W., Wu, L., Liebich, J., Chong, S.C., Huang, Z., Wu,

W., Gu, B., Jardine, P., Criddle, C., Zhou, J. (2007) GeoChip: a comprehensive

microarray for investigating biogeochemical, ecological and environmental

processes. ISME J. 1: 67-77.

Holben, W.E. (1994) Isolation and purification of bacterial DNA from soil. In

Weaver, R.W. (Eds) Methods in Soil Analysis, Part 2: Microbiological and

Biochemical Properties. Madison, WI: Soil Science Society of America, pp.

727-751.

Holben, W.E., Jansson, J.K., Chelm, B.K., Tiedje, J.M. (1988) DNA probe method

for the detection of specific microorganisms in the soil bacterial community.

Appl Environ Microbiol. 54: 703-711.

Houghton, J.E., Shanley, M.S. (1994) Catabolic potential of Pseudomonads: a

regulatory perspective. In Chaudhry, G.R. (Eds) Biological Degradation and

Bioremediation of Toxic Chemicals. Dioscorides press, Portland, pp. 11-32.

Chapter 1: Introduction

41

Huson, D., Auch, A., Qi, J., Schuster, S. (2007) MEGAN analysis of metagenomic

data. Genome Res. 17: 377-386.

Jiménez, J.I., Miñambres, B., García, J.L., Díaz, E. (2002) Genomic analysis of the

aromatic catabolic pathways from Pseudomonas putida KT2440. Environ

Microbiol. 4: 824-841.

Kauffmann, I.M., Schmitt, J., Schmid, R.D. (2004) DNA isolation from soil samples

for cloning in different hosts. Appl Microbiol Biotechnol. 64: 665-670.

Keum, Y.S., Seo, J.S., Qing, X.L., Kim, J.H. (2008) Comparative metabolomic

analysis of Sinorhizobium sp. C4 during the degradation of phenanthrene. Appl

Microbiol Biotechnol. 80: 863-872.

Kim, M., Bae, S.S., Seol, M., Lee, J.H., Oh, Y.S. (2008) Monitoring nutrient impact

on bacterial community composition during bioremediation of anoxic PAH-

contaminated sediment. J Microbiol. 46: 615-623.

Kim, S.J., Kweon, O., Cerniglia, C.E. (2009) Proteomic applications to elucidate

bacterial aromatic hydrocarbon metabolic pathways. Curr Opin Microbiol. 12:

301-309.

Kim, Y.H., Cho, K., Yun, S.H., Kim, J.Y., Kwon, K.H., Yoo, J.S., Kim, S.I. (2006)

Analysis of aromatic catabolic pathways in Pseudomonas putida KT 2440 using

a combined proteomic approach: 2DE/MS and cleavable isotope-coded affinity

tag analysis. Proteomics. 6: 1301-1318.

Kim, Y.H., Kwon, E.J., Kim, S.K., Jeong, Y.S., Kim, J., Yun, H.D., Kim, H. (2010)

Molecular cloning and characterization of a novel family VIII alkaline esterase

from a compost metagenomic library. Biochem Biophys Res Commun. 393: 45-

49.

Lane, D.J., Pace, B., Olsen, G.J., Stahl, D.A., Sogin, M.L., Pace, N.R. (1985) Rapid

determination of 16S ribosomal RNA sequences for phylogenetic analyses. Proc

Natl Acad Sci USA. 82: 6955-6959.

Larsen, N., Olsen, G.J., Maidak, B.L., McCaughey, M.J., Overbeek, R., Macke, T.J.,

Marsh, T.L., Woese, C.R. (1993) The ribosomal database project. Nucleic Acids

Res. 21: 3021-3023.

Lee, J., Lee, T.K., Löffler, F.E., Park, J. (2010) Characterization of microbial

community structure and population dynamics of tetrachloroethene-

dechlorinating tidal mudflat communities. Biodegradation. 22: 687-698.

Leff, L.G., Dana, J.R., McArthur, J.V., Shimkets, L.J. (1995) Comparison of methods

of DNA extraction from stream sediments. Appl Environ Microbiol. 61: 1141-

1143.

Leininger, S., Urich, T., Schloter, M., Schwark, L., Qi, J., Nicol, G., Prosser, J.,

Schuster, S., Schepler, C. (2006) Archae predominate among ammonia-

oxidizing prokaryotes in soils. Nature. 442: 806-809.

Lovley, D.R. (2003) Cleaning up with genomics: applying molecular biology to

bioremediation. Nat Rev Microbiol. 1: 35-44.

Chapter 1: Introduction

42

Loy, A., Schulz, C., Lucker, S., Schopfer-Wendels, A., Stoecker, K., Baranyi, C.,

Lehner, A., Wagner, M. (2005) 16S rRNA gene-based oligonucleotide

microarray for environmental monitoring of the betaproteobacterial order

„„Rhodocyclales”. Appl Environ Microbiol. 71: 1373-1386.

Lumini, E., Orgiazzi, A., Borriello, R., Bonfante, P., Bianciotto, V. (2009) Disclosing

arbuscular mycorrhizal fungal biodiversity in soil through a land-use gradient

using a pyrosequencing approach. Environ Microbiol. 12: 2165-2179.

Maphosa, F., de Vos, W.M., Smidt, H. (2010a) Exploiting the ecogenomics toolbox

for environmental diagnostics of organohalide-respiring bacteria. Trends

Biotechnol. 28: 308-316.

Maphosa, F., Smidt, H., de Vos, W.M., Röling, W.F. (2010b) Microbial community-

and metabolite dynamics of an anoxic dechlorinating bioreactor. Environ Sci

Technol. 44: 4884-4890.

Margulies, M., Egholm, M., Altman, W.E., Attiya, S., Bader, J.S., Bemben, L.A.,

Berka, J., Braverman, M.S., Chen, Y.J., Chen, Z., et al. (2005) Genome

sequencing in microfabricated high-density picolitre reactors. Nature. 437: 376-

380.

Mateos, L.M., Ordóñez, E., Letek, M., Gil, J.A. (2006) Corynebacterium glutamicum

as a model bacterium for the bioremediation of arsenic. Int Microbiol. 9: 207-

215.

Mattes, T.E., Alexander, A.K., Richardson, P.M., Munk, C., Han, C.S., Stothard, P.,

Colemans, N.V. (2008) The genome of Polaromonas sp. strain JS666: insights

into the evolution of a hydrocarbon- and xenobiotic-degrading bacterium, and

features of relevance to biotechnology. Appl Environ Microbiol. 74: 6405-6416.

McKernan, K.J., Peckham, H.E., Costa, G.L., McLaughlin, S.F.,

Fu, Y., Tsung, E.F., Clouser, C.R., Duncan, C., Ichikawa, J.K., Lee, C.C., et

al. (2009). Sequence and structural variation in a human genome uncovered by

short-read, massively parallel ligation sequencing using two-base

encoding. Genome Res. 19: 1527-1541.

Meiring, T.L., Bauer, R., Scheepers, I., Ohlhoff, C., Tuffin, M.I., Cowan, D.A. (2011)

Metagenomics and beyond: Current approaches and integration with

complementary technologies. In Marco, D. (Eds) Metagenomics: Current

Innovations and Future Trends. Caister Academic Press, Norfolk, pp. 1-19.

Miller, D.N., Bryant, J.E., Madsen, E.L., Ghiorse, W.C. (1999) Evaluation and

optimization of DNA extraction and purification procedures for soil and

sediment samples. Appl Environ Microbiol. 65: 4715-4724.

Moriya, Y., Shigemizu, D., Hattori, M., Tokimatsu, T., Kotera, M., Goto, S.,

Kanehisa, M. (2010) PathPred: an enzyme-catalyzed metabolic pathway

prediction server. Nucleic Acids Res. 38: 1-6.

Nakanishi, M., Yatome, C., Ishida, N., Kitade, Y. (2001) Putative ACP

Phosphodiesterase gene (acpD) encodes an azoreductase. J Biol Chem. 276:

46394-46399.

Chapter 1: Introduction

43

Nakayama, T., Kimura, T., Kodama, M., Nagata, C. (1983) Generation of hydrogen

peroxide and superoxide anion from active metabolites of naphthylamines and

aminoazo dyes: its possible role in carcinogenesis. Carcinogenesis. 4: 765-769.

Neefs, J., Peer, V., Hendriks, L., Wachter, R. (1990) Compilation of small ribosomal

subunit RNA sequencing. Nucleic Acids Res. 18 (suppl): 2237-2317.

Nelson, K.E., Weinel, C., Paulsen, I.T., Dodson, R.J., Hilbert, H., Martins dos Santos,

P., Fouts, D.E., Gill, S.R., Pop, M., Holmes, M., et al. (2002) Complete genome

sequence and comparative analysis of metabolically versatile Pseudomonas

putida KT2440. Environ Microbiol. 4: 799-808.

Ní Chadhain, S.M., Norman, R.S., Pesce, K.V., Kukor, J.J., Zylstra, G.J. (2006)

Microbial dioxygenase gene population shifts during polycyclic aromatic

hydrocarbon biodegradation. Appl Environ Microbiol. 72:4078-4087.

Ogram A., Sayler G.S., Barkay, T. (1987) The extraction and purification of microbial

DNA from sediments. J Microbiol Methods. 7: 57-66.

Overney, G. (1979) Ueber den aerobe Abbau von Dicarboxyazobenzol durch ein

Flavobacterium sp. Ph.D. thesis ETH 6421. ETH Zurich, Zurich, Switzerland.

Ozsolak, F., Platt, A.R., Jones, D.R., Reifenberger, J.G., Sass, L.E., McInerney, P.,

Thompson, J.F., Bowers, J., Jarosz, M., Milos, P.M. (2009) Direct RNA

sequencing. Nature. 461: 814-818.

Pagga, U., Brown, D. (1986) The degradation of dyestuffs: part II Behaviour of

dyestuffs in aerobic biodegradation tests. Chemosphere. 15: 479-491.

Pazos, F., Guijas, D., Valencia, A., De Lorenzo, V. (2005) MetaRouter:

bioinformatics for bioremediation. Nucleic Acids Res. 33: D588-D592.

Pazos, F., Valencia, A., De Lorenzo, V. (2003). The organization of the microbial

biodegradation network from a systems-biology perspective. EMBO Rep. 4:

994-999.

Picard, C., Ponsonnet, C., Paget, E., Nesme, X., Simonet, P. (1992) Detection and

enumeration of bacteria in soil by direct DNA extraction and polymerase chain

reaction. Appl Environ Microbiol. 58: 2717-2722.

Qin, J., Li, R., Raes, J., Arumugam, M., Burgdof, K.S., Manichanh, C., Nielsen, T.,

Pons, N., Levenez, F., Yamada, T. et al. (2010) A human gut microbial gene

catalogue established by metagenomic sequencing. Nature. 464: 59-65.

Reisch, M.S. (1996) Asian textile dye markers are a growing power in changing

market. Chem Eng News. 15: 10-12.

Rhee, S.K., Liu, X., Wu, L., Chong, S.C., Wan, X., Zhou, J. (2004) Detection of

genes involved in biodegradation and biotransformation in microbial

communities by using 50-mer oligonucleotide microarrays. Appl Environ

Microbiol. 70: 4303-4317.

Rocha, E.P. (2008) The organization of the bacterial genome. Annu Rev Genet. 42:

211-223.

Chapter 1: Introduction

44

Rogers, S.L., McClure, N. (2003) The role of microbiological studies in

bioremediation process optimisation. In Head, I.M., Singleton, I., Milner, M.

(Eds) Bioremediation: A critical review. Horizon Scientific Press, Norfolk, pp.

27-59.

Rogers, S.W., Mooreman, T.B., Onge, S.K. (2007) Fluorescent in situ hybridization

and microautoradiography applied to ecophysiology in soil. Soil Sci Soc Am J.

71: 620-631.

Salinero, K.K., Keller, K., Feil, W.S., Feil, H., Trong, S., Di Bartolo, G., Lapidus, A.

(2009) Metabolic analysis of the soil microbe Dechloromonas aromatica str.

RCB: indications of a surprisingly complex life-style and cryptic anaerobic

pathways for aromatic degradation. BMC Genomics. 10: 351.

Schloss, P.D., Handelsman, J. (2003) Biotechnological prospects from metagenomics.

Curr Opin Biotechnol. 14: 303-310.

Schloss, P.D., Handelsman, J. (2005) Metagenomics for studying unculturable

microorganisms: cutting the Gordian knot. Genome Biol. 6: 229.

Schneegurt, M.A., Kulpa, C.F. (1998) The application of molecular techniques in

environmental biotechnology for monitoring microbial systems. Biotechnol Appl

Biochem. 27: 73-79.

Schoolnik, G.K. (2001) The accelerating convergence of genomics and microbiology.

Genome Biol. 2: REPORTS4009.

Schuster, S.C. (2008) Next-generation sequencing transforms today's biology. Nat

Methods. 5: 16-18.

Shah, V., Jain, K., Desai, C., Madamwar, D. (2011) Metagenomics and integrative “-

omics” technologies in microbial bioremediation: Current trends and potential

applications. In Marco, D. (Eds) Metagenomics: Current Innovations and

Future Trends. Caister Academic Press, Norfolk, pp. 211-240.

Shah, V., Jain, K., Desai, C., Madamwar, D. Molecular analyses of microbial

communities involved in bioremediation. In Satyanarayana, T., Johri B.N. (Eds)

Microbes in Environmental Management and Biotechnology. Springer, New

York, In Press.

Sharan, R., Ideker, T. (2006) Modeling cellular machinery through biological network

comparison. Nat Biotechnol. 24: 427-433.

Sharma, R., Ranjan, R., Kapardar, R.K., Grover, A. (2005) „Unculturable‟ bacterial

diversity: An untapped resource. Curr Sci. 89: 72-77.

Shaul, G.M., Holdsworth, T.J., Dempsey, C.R., Dostal, K.A. (1991) Fate of water

soluble azo dyes in the activated sludge process. Chemosphere. 22: 107-119.

Shendure, J., Ji, H. (2008) Next-generation DNA sequencing. Nat Biotechnol. 26:

1135-1145.

Simon, C., Daniel, R. (2009) Achievements and new knowledge unravelled by

metagenomic approaches. Appl Microbiol Biotechnol. 85: 265-276.

Chapter 1: Introduction

45

Singh, O.V. (2006) Proteomics and metabolomics: the molecular make-up of toxic

aromatic pollutant bioremediation. Proteomics. 6: 5481-5492.

Singh, O.V., Nagaraj, N.S. (2006) Transcriptomics, proteomics and interactomics:

unique approaches to track the insights of bioremediation. Brief Funct Genomic

Proteomic. 4: 355-362.

Steffan, R.J., Goksoyr, J., Bej, A.K., Atlas, R.M. (1988) Recovery of DNA from soils

and sediments. Appl Environ Microbiol. 54: 2908-2915.

Stenuit, B., Eyers, L., Schuler, L., Agathos, S.N., George, I. (2008) Emerging high-

throughput approaches to analyze bioremediation of sites contaminated with

hazardous and/or recalcitrant wastes. Biotechnol Adv. 26: 561-575.

Stolz, A. (1999) Degradation of substituted naphthalenesulfonic acids by

Sphingomonas xenophaga BN6. J Ind Microbiol Biotechnol. 23: 391-399.

Suenaga, H., Ohnuki, T., Miyazaki, K. (2007) Functional screening of a metagenomic

library for genes involved in microbial degradation of aromatic compounds.

Environ Microbiol. 9: 2289-2297.

Suzuki, Y., Yoda, T., Ruhul, A., Sugiura, W. (2001) Molecular cloning and

characterization of the gene coding for azoreductase from Bacillus sp. 0Y1-2

isolated from soil. J Biol Chem. 276: 9059-9065.

Tang, Y.J., Martin, H.G., Dehal, P.S., Deutschbauer, A., Llora, X., Meadows, A.,

Arkin, A., Keasling, J.D. (2009) Metabolic flux analysis of Shewanella spp.

reveals evolutionary robustness in central carbon metabolism. Biotechnol

Bioeng. 102: 1161-1169.

Thompson, D.K., Chourey, K., Wickham, G.S., Thieman, S.B., VerBerkmoes, N.C.,

Zhang, B., McCarthy, A.T., Rudisill, M.A., Shah, M., Hettich, R.L. (2010)

Proteomics reveals a core molecular response of Pseudomonas putida F1 to

acute chromate challenge. BMC Genomics. 11: 311.

Trautwein, K., Kühner, S., Wöhlbrand, L., Halder, T., Kuchta, K., Steinbüchel, A.,

Rabus, R. (2008) Solvent stress response of the denitrifying bacterium

"Aromatoleum aromaticum" strain EbN1. Appl Environ Microbiol. 74: 2267-

2274.

Tyson, G.W., Chapman, J., Hugenholtz, P., Allen, E.E., Ram, R.J., Richardson, P.M.,

Solovyev, V.V., Rubin, E.M., Rokhsar, D.S., Banfield, J.F. (2004) Community

structure and metabolism through reconstruction of microbial genomes from the

environment. Nature. 428: 37-43.

Urbance, J.W., Cole, J., Saxman, P., Tiedje, J.M. (2003) BSD: the Biodegradative

Strain Database. Nucleic Acids Res. 31: 152-155.

Venter, J.C., Remington, K., Heidelberg, J.F., Halpern, A.L., Rusch, D., Eisen, J.A.,

Wu, D., Paulsen, I., Nelson, K.E., Nelson, W., et al. (2004) Environmental

genome shotgun sequencing of the Sargasso Sea. Science. 304: 66-74.

Villas-Boas, S.G., Bruheim, P. (2007) The potential of metabolic tools in

bioremediation studies. OMICS: A Journal of Integrative Biol. 11: 305-313.

Chapter 1: Introduction

46

Walshaw, J., Etherington, G.J., MacLean, D. (2011) Next-generation sequencing

approaches to metagenomics. In Marco, D. (Eds) Metagenomics: Current

Innovations and Future Trends. Caister Academic Press, Norfolk, pp. 63-88.

Wang, H., Xu, R., Li, F., Qiao, J., Zhang, B. (2010) Efficient degradation of lube oil

by a mixed bacterial consortium. J Environ Sci. 22: 381-388.

Ward, N. (2006) New directions and interactions in metagenomics research. FEMS

Microbiol Ecol. 55: 331-338.

Ward, N., Fraser, C.M. (2005) How genomics has affected the concept of

microbiology. Curr Opin Microbiol. 8: 564-571.

Warnecke, F., Luginbuhl, P., Ivanova, N., Ghassemian M., Richardson, T., Stege, J.,

Cayouette, M., Mchardy, A., Djordjevic, G., Aboushadi, N., et al. (2007)

Metagenomic and functional analysis of hindgut microbiota of a wood-feeding

higher termite. Nature. 450: 560-565.

Watanabe, K., Baker, P.W. (2000) Environmentally relevant microorganisms. J Biosci

Bioeng. 89: 1-11.

Wegley, L., Edwards, R., Rodriguez-Brito, B., Liu, H., Rohwer, F. (2007)

Metagenomic analysis of the microbial community associated with the coral

Porites astreoides. Environ Microbiol. 11: 2707-2719.

Weisburg, W.G., Barns, S.M., Pelletier, D.A., Lane, D.J. (1991) 16S ribosomal DNA

amplification for phylogenetic study. J Bacteriol. 173: 697-703.

Wharfe, E.S., Jarvis, R.M., Winder, C.L., Whiteley, A.S., Goodacre, R. (2010)

Fourier transform infrared spectroscopy as a metabolite fingerprinting tool for

monitoring the phenotypic changes in complex bacterial communities capable of

degrading phenol. Environ Microbiol. 12: 3253-3263.

Wiechert, W., Schweissgut, O., Takanaga, H., Frommer W.B. (2007) Fluxomics:

mass spectrometry versus quantitative imaging. Curr Opin Plant Biol. 10: 323-

330.

Wilmes, P., Bond, P.L. (2004) The application of two-dimensional polyacrylamide

gel electrophoresis and downstream analyses to a mixed community of

prokaryotic microorganisms. Environ Microbiol. 6: 911-920.

Wilmes, P., Bond, P.L. (2006) Metaproteomics: studying functional gene expression

in microbial ecosystems. Trends Microbiol. 14: 92-97.

Wilmes, P., Wexler, M., Bond, P.L. (2008) Metaproteomics provides functional

insight into activated sludge wastewater treatment. PLoS One. 12 (e1778): 1-11.

Wu, H.J., Seib, K.L., Srikhanta, Y.N., Edwards, J., Kidd, S.P., Maguire, T.L.,

Hamilton, A., Pan, K.T., Hsiao, H.H., Yao, C.W., Grimmond, S.M., Apicella,

M.A., McEwan, A.G., Wang, A.H., Jennings, M.P. (2010) Manganese

regulation of virulence factors and oxidative stress resistance in Neisseria

gonorrhoeae. J Proteomics. 73: 899-916.

Chapter 1: Introduction

47

Yang, Y., Zhang, N., Xue, M., Lu, S.T., Tao, S. (2011) Effects of soil organic matter

on the development of the microbial polycyclic aromatic hydrocarbons (PAHs)

degradation potentials. Environ Pollut. 159: 591-595.

Zhang, W., Li, F., Nie, L. (2010) Integrating multiple 'omics' analysis for microbial

biology: application and methodologies. Microbiology. 156: 287-301.

Zhou, J., Bruns, M.A., Tiedje, J.M. (1996) DNA recovery from soils of diverse

composition. Appl Environ Microbiol. 62: 316-322.


Recommended