Accepted Manuscript
Determinants of thermostability in the cytochrome P450 fold
Kurt L. Harris, Raine E.S. Thomson, Silja J. Strohmaier,Yosephine Gumulya, Elizabeth M.J. Gillam
PII: S1570-9639(17)30180-2DOI: doi: 10.1016/j.bbapap.2017.08.003Reference: BBAPAP 39984
To appear in:
Received date: 17 May 2017Revised date: 19 July 2017Accepted date: 7 August 2017
Please cite this article as: Kurt L. Harris, Raine E.S. Thomson, Silja J. Strohmaier,Yosephine Gumulya, Elizabeth M.J. Gillam , Determinants of thermostability in thecytochrome P450 fold, (2017), doi: 10.1016/j.bbapap.2017.08.003
This is a PDF file of an unedited manuscript that has been accepted for publication. Asa service to our customers we are providing this early version of the manuscript. Themanuscript will undergo copyediting, typesetting, and review of the resulting proof beforeit is published in its final form. Please note that during the production process errors maybe discovered which could affect the content, and all legal disclaimers that apply to thejournal pertain.
ACC
EPTE
D M
ANU
SCR
IPT
Determinants of thermostability in the cytochrome P450 fold
Kurt L. Harris, Raine E.S. Thomson, Silja J. Strohmaier, Yosephine Gumulya and Elizabeth
M.J. Gillam*
School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia,
4072, Australia
* Author to whom correspondence should be addressed at:
Elizabeth M.J. Gillam
School of Chemistry and Molecular Biosciences,
The University of Queensland, St. Lucia, 4072, Australia
Tel: +61-7-3365-1410
Email: [email protected]
Keywords: Cytochrome P450, thermostability, directed evolution, biocatalysis, extremophile
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Abstract
Cytochromes P450 are found throughout the biosphere in a wide range of environments,
serving a multitude of physiological functions. The ubiquity of the P450 fold suggests that it
has been co-opted by evolution many times, and likely presents a useful compromise between
structural stability and conformational flexibility. The diversity of substrates metabolized and
reactions catalyzed by P450s makes them attractive starting materials for use as biocatalysts
of commercially useful reactions. However, process conditions impose different requirements
on enzymes to those in which they have evolved naturally. Most natural environments are
relatively mild, and therefore most P450s have not been selected in Nature for the ability to
withstand temperatures above ~ 40 °C, yet industrial processes frequently require extended
incubations at much higher temperatures. Thus, there has been considerable interest and
effort invested in finding or engineering thermostable P450 systems. Numerous P450s have
now been identified in thermophilic organisms and analysis of their structures provides
information as to mechanisms by which the P450 fold can be stabilized. In addition, protein
engineering, particularly by directed or artificial evolution, has revealed mutations that serve
to stabilize particular mesophilic enzymes of interest. Here we review the current
understanding of thermostability as it applies to the P450 fold, gleaned from the analysis of
P450s characterized from thermophilic organisms and the parallel engineering of mesophilic
forms for greater thermostability. We then present a perspective on how this information
might be used to design stable P450 enzymes for industrial application.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Introduction
A degree of stability is essential to the ability of enzymes to function as biological catalysts.
The evolution of enzymes represents a trade-off between stability of the protein fold, which
enables enzymes to exist (for the most part at least) in a finite number of stable, predictable
structures, and flexibility, which makes possible the conformational changes that facilitate
stabilization of the transition state of a chemical reaction. Stability to temperature typically
determines both the ability of enzymes to remain folded at elevated temperatures and the
half-life at more moderate temperatures. While the majority of contemporary enzymes are
from mesophilic organisms, inhabiting relatively mild environmental conditions, the ability
of certain enzymes to operate within organisms inhabiting extreme environments, such as hot
springs, can tell us much about the determinants of stability and catalysis.
As well as presenting intriguing case studies for protein structure-function investigations,
enzymes from extremophiles are also often better suited to use in biotechnology than their
mesophilic counterparts. To work efficiently under industrial conditions, enzymes must often
be stable to elevated temperature and organic solvent composition, altered pH, oxidizing
conditions, and the presence of high concentrations of substrates, products or other
chemicals. Thermostability, in particular, enables reactions to be undertaken at higher
temperatures, increasing reaction rates, enhancing substrate solubility (i.e. substrate loading)
and reducing the potential for microbial contamination of biochemical processes. Moreover,
all things equal, a greater total turnover of substrate can be achieved per unit enzyme since
thermostable enzymes are more durable at moderate temperatures as well as tolerating high
temperatures. This has led to considerable interest in the determinants of thermostability in
proteins and to efforts to engineer stability into enzymes of industrial relevance. Throughout
this review we will focus on thermostability and use the terms stability and stabilization to
refer to elevated temperature rather than e.g. solvent or oxidizing conditions.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Cytochrome P450 enzymes (P450s) are highly versatile monooxygenases found throughout
all domains of life, and are believed to have been present in the last universal common
ancestor (LUCA). Thus, it is not unexpected that the genomes of some extremophiles encode
P450s. Indeed, there are numerous examples, mostly of thermophilic archaea, that have been
found to contain functional P450 proteins capable of enduring significantly higher
temperatures than those found in mesophilic organisms. The mere existence of these proteins
is proof of the ability of the P450 fold to be stable, and retain activity, at high temperatures.
Through studying the sequences and structures of P450s from thermophiles, it is possible to
glean some understanding of the structural characteristics required for a P450 to be
thermostable. Moreover, having access to thermostable forms can make possible more
fundamental studies into the chemical biology of these proteins, such as the characterization
of the ultimate oxidant in the P450 catalytic cycle, compound I, which was done using
CYP119A1, an enzyme isolated from a thermophilic organism [1].
This review will examine what is known about the structures of P450s characterized from
thermophilic organisms in order to draw conclusions as to the features that stabilize the P450
fold. We also review the attempts that have been made to date to stabilize mesophilic P450
enzymes by directed evolution, since comparing the results of natural and artificial evolution
of thermostability can reveal complementary solutions to the problem of stabilizing the P450
fold. Finally, we present a perspective on how this information can be used to design P450
proteins for enhanced stability.
The place of thermostability in the evolution and ecology of P450 systems
Recent progress in metagenomics has revealed much about organisms capable of living under
extreme conditions such as high and low temperature, and P450s have been isolated from
organisms living at each extreme [2, 3]. Thermophiles can be defined as those organisms
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
capable of enduring or even thriving under conditions of extreme temperatures (50-121 °C,
usually over 60 °C) [4]. Hyperthermophiles thrive at temperatures of ~ 80 °C or more. At the
other end of the temperature scale, psychrophiles (otherwise known as cryophiles) grow at
temperatures between -20 ºC and 10 ºC. Few P450s have been reported to date from
psychrophiles, purported to be alkane hydroxylases, but none has been characterized in detail
[3]. By contrast a number of P450s have been investigated from thermophilic organisms.
Both types have the potential to inform about the structural adaptations to P450s necessitated
by changes in ambient temperature.
Comparing metagenome analyses of samples from various high thermal habitats indicates
that, in general, there is a decrease in biodiversity with increasing temperature [5, 6].
Microbiota commonly represented in such extreme environments (e.g. hot springs) are the
bacterial taxa Thermotogae (Fervidobacterium), Deinococcus-Thermus, Proteobacteria
(Acidithiobacillus), Aquificae, Dictyoglomi, Nitrospirae (Thermodesulfovibrio), Firmicutes
(Clostridium, Geobacillus) and archaeal taxa such as Crenarchaeota (Pyrobaculum,
Sulfolobus, Stygiolobus), Euryarchaeota (Thermococcus, Methanococcus, Archaeoglobus,
Ferroplasma) and Nanoarchaeota [7, 8]. The abundance of certain microbial taxa in these
extreme environments differs depending on the ambient temperature, pH and availability of
organic materials.
Various genome projects have revealed a remarkable and unexpectedly large number of
P450s in a variety of organisms. Genome and metagenome sequencing has generated large
amounts of sequence information; however, the functional characterization of proteins from
such organisms is mostly lacking. Only five thermophilic P450s have been characterized to
date, namely CYP119A1 from Sulfolobus acidocaldarius [9], CYP119A2 from Sulfolobus
tokodaii strain 7 (Sulfolobus sp. strain 7) [10], CYP175A1 from Thermus thermophilus [11],
CYP231A2 from Picrophilus torridus [12], and CYP154H1 from Thermobifida fusca [13].
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Little is known about the catalytic activity, physiological role or natural substrates of even the
best-characterized forms. CYP119A1 has been shown to carry out H2O2-supported styrene
epoxidation [14-16], ω-1 hydroxylation of lauric acid [17, 18], dehalogenations of
halogenated solvents [19] and the reduction of nitrite or nitrous oxide supported by either
H2O2 or the CYP101A1 (P450cam) redox system (putidaredoxin (Pdx) and putidaredoxin
reductase (PdR)). However, the physiological significance of these is unknown [20]. Even
less is known about the natural function of CYP119A2, which has primarily been studied by
direct electrochemistry, but H2O2-driven styrene epoxidation and ethylbenzene hydroxylation
have been reported [21].
CYP175A1 from T. thermophilus shows some similarity to CYP102A1 (P450BM3), but does
not bind or turn over saturated fatty acids (C10-C20) due to steric clashes with key residues in
the active site [11]. However, H2O2- and Pdx/PdR-supported hydroxylase activity towards
unsaturated monoenoic acids has been reported [22], and MD analysis has revealed that these
substrates adopt a “U-shaped” conformation within the active site, centered around the C=C
double bond. This may suggest that the native substrate for CYP175A1 also assumes such a
conformation [22]. Other substrates turned over by CYP175A1 include β-carotene [23-25],
napthalenes [26], and colorimetric substrates guaiacol and ABTS (2,2′-azino-bis(3-
ethylbenzothiazoline-6-sulfonic acid)) [27].
To date, no information is available about the endogenous substrate or activity of
CYP231A2, whereas CYP154H1 was shown to catalyze the Pdx/PdR-supported conversion
of ethylbenzene, propylbenzene, styrene and organic sulphides [13].
Other potential thermostable P450s are being revealed as the genomes of more thermophiles
become available. Another nine open reading frames (ORFs) encoding potential P450 genes
have been identified in Thermobifida fusca, the source of CYP154H1 [13]. A P450 with
progesterone hydroxylase activity has been identified in Geobacillus thermoglucosidasius
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
(growth range 42-69 °C) [28, 29]. Geobacillus stearothermophilus (growth range 35-60 °C)
has also demonstrated activity towards progesterone and testosterone, and its genome
encodes a potential P450 gene [2, 30, 31].
A screen of the genomes of the thermophilic fungi, Thielavia terrestris (growth range 22-55
°C) [32] and Myceliophthora thermophila (growth range 38-54 °C) [33], identified 79 and 70
putative P450 genes respectively, 14 and 11 of which were identified as likely to be
thermostable [34]. These organisms do not tolerate temperatures as high as bacterial and
archaean thermophiles, so the proteins they produce may be less stable. However other
enzymes from thermophilic fungi have shown temperature optima in the range of 45-70 °C
[34], with one xylanase from T. terrestris demonstrating optimal activity at 85 °C [35].
In addition to these forms, members of other cytochrome families such as CYP107, CYP109
and CYP132 are found in thermophilic organisms. The genome database annotation of these
putative thermophilic P450s revealed their functions as variously cholest-4-en-3-one 26-
monooxygenase activity, steroid hydroxylase activity (at the 15-, 6, 6- 9, and 11-
positions on various steroids), pentalenene oxygenase, erythromycin C-12 hydroxylase, and
2-hydroxy-5-methyl-1-napthoate 7-hydroxylase. However, it is unclear whether these
annotations result from any functional characterization or simply from sequence similarity
with other P450s showing the listed activities. Extrapolation of functional properties between
even closely related P450s is fraught with error as demonstrated by the functional changes
seen in even very closely related isoforms from laboratory animals and humans.
Bioinformatic analyses of “CYPomes” suggest that a large number of thermostable P450s
remain to be explored and that genome mining may lead to the discovery of novel, highly
thermostable P450 biocatalysts. A BLAST search of existing bacterial and archaeal genome
databases for homologs of CYP119 forms, CYP175A1, CYP231A2 and CYP154H1 retrieved
45 P450s from thermophilic organisms within the bacterial phyla Deinococcus-Thermus,
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Firmicutes, Actinobacteria, Acidobacteria, Chloroflexi and the archaeal phyla Crenarchaeota
and Euryarchaeota (Supplementary Table 1). Some of these thermophilic P450s occupy
branches near the base of both bacterial and archaeal domains of the evolutionary tree. In
light of the hypothesis that LUCA was a thermophile or a hyperthermophile [36, 37], we can
speculate that ancestral forms of cytochrome P450 families were thermophilic (or
hyperthermophilic) proteins. However, the existence of thermophilic P450s in many diverse
phyla of the bacterial and archaeal kingdoms may also be a result of convergent evolution,
especially as extremophiles are well-known for participating in rampant lateral gene transfer
[38]. Importantly, this search would not reveal possible thermostable forms that have poor
homology to the existing characterized thermostable P450s. With advances in metagenomics
and single-cell genomics, the number of thermostable P450s found in the so-called
“microbial dark matter” (i.e. unclassified bacteria and archaea including putative
thermophiles) is likely to grow.
P450s from thermophilic organisms
To date, the crystal structures of four “thermophilic P450s” have been solved. Three are from
archaea: CYP119A1 from S. acidocaldarius [39]; CYP119A2 (P450St), a related enzyme
from S. tokedaii strain 7 [40]; and CYP231A2, from Picrophilus torridus [12]. A fourth
enzyme, CYP175A2 comes from a eubacterium, T. thermophilus [11]. An additional
thermostable P450, CYP154H1, was isolated and characterised from the moderately
thermophilic bacterium T. fusca, which has optimum growth conditions of 50-55 °C [13].
The melting temperature (Tm) of this protein was 67 °C but it has not yet been characterized
structurally. (While strictly only organisms, not enzymes, can be described as thermophilic or
mesophilic, these terms will be used for simplicity and since their use in this manner is
widespread in the literature.)
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
CYP119A1
CYP119A1 was first reported to have been isolated from the genome of S. solfataricus by
Kennelly et al. (1996), while attempting to clone a thymidylate synthase gene from the
extremophile [9]. However, more recent reports have revealed that the enzyme was actually
derived from the closely related acidothermophile S. acidocaldarius, and that the source was
misattributed due to contamination of the cell stock (DSM 1616) [12, 14]. This protein was
initially designated CYP119, and then CYP119A1, when related enzymes were characterized
[9, 41].
The optimal growth temperature of S. acidocaldarius is commonly 70-75 °C with some
strains growing at temperatures up to 85 °C [42]. Given the extreme growth conditions of this
organism, the high Tm of CYP119A1 (~88-92°C) is not unexpected [11, 39, 43]. By
comparison, P450s derived from mesophilic microorganisms such as bacterial CYP101A1
show significantly lower Tm values (~54-61°C) [11, 44]. In addition to its high
thermostability, CYP119A1 is able to withstand greater hyperbaric pressure than most
mesophilic P450s, with a P1/2 (pressure required to inactivate half the protein) of 320 MPa at
5 °C compared to ~110-140 MPa for CYP101A1 [45, 46]. A greater stabilizing effect was
observed at increased temperatures, with a P1/2 of 380, 430 and 480 MPa at temperatures of
20, 35 and 50 °C respectively. Furthermore, once returned to normal pressure, CYP119A1
was able to completely revert from the P420 species to the P450 state in the absence of any
stabilizing agents [45].
Overall structural changes in CYP119A1 compared to mesophilic P450s
Crystal structures have been solved for the aqua-ligated enzyme [47] plus the complexes with
imidazole, 4-phenylimidazole (4-PI) [39] and various other phenyl-imidazole derivatives
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[48]. Structurally, CYP119A1 conforms to the typical P450 fold (Figure 1). However, at just
368 residues, it is considerably shorter and more compact than P450s from mesophilic
microorganisms, such as CYP101A1 and CYP107A1 (P450eryF) at 414 and 403 residues
respectively. Much of this difference in length can be attributed to the N-termini of the
proteins (Figure 2). While CYP101A1 and CYP107A1 contain N-terminal sequences prior to
any well-defined secondary structure elements, CYP119A1 begins promptly with -helix A.
Truncations occur at other positions: the β1-1/β1-2 hairpin loop of CYP119A1 (residues 13-
24) is four residues shorter than CYP101A1 (52-66); and five residues are missing from the
β5-turn, situated between helices H and I (191-195 in CYP119A1, 225-234 in CYP101A1).
The latter region is typically involved in redox partner interaction, and it has been shown to
make direct contact with the FMN-binding reductase domain in crystal structures of the
CYP102A1 heme and FMN domains [49]. In CYP119A1 this region closely resembles that
of the self-sufficient nitric oxide reductase, CYP55 (P450nor), which uses NADH without the
assistance of a redox partner [50]. The related CYP119A2 is capable of self-sufficient
catalysis, suggesting that CYP119A1 may do the same [51]. CYP119A1 has also been shown
to have nitrite/nitrous oxide reductase activity, underscoring the similarity to CYP55 [20].
F/G-loop/B’-helix and substrate-dependent conformational changes
A significant structural difference from mesophilic P450s occurs in the repositioning of the
B’-helix and F/G-loop [39, 47]. The B’-helix occurs at residues 63-66 in CYP107A1 and 67-
77 in CYP101A1. However, the cognate region in CYP119A1 does not comprise a full helix.
Instead a helix is found downstream at residues 49-53, corresponding to an insertion in
CYP101A1 between residues 88-89. This alternative B’-helix is preceded by a long loop,
resulting in a shift of the B’-helix away from the active site and towards the surface of the
protein. To accommodate the void typically filled by the B’-helix, the F/G-loop, which is
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
usually oriented away from the heme and towards the surface, instead dips down to fill this
space, resulting in a similar overall solvent accessibility to the heme between the substrate-
free forms of the two proteins (~24 Å2 and ~18 Å2 for CYP119A1 and CYP101A1
respectively) [39].
Molecular dynamics simulations have shown that the F/G-loop of CYP119A1 can move
independently of the remainder of the protein, allowing it to transition in and out of the active
site to form the open (substrate-free) and closed (ligand-bound) forms without affecting the
structure of the rest of the protein[52]. Conformational changes in this area to accommodate
different ligands are not uncommon amongst P450s[53-55], however other P450s that exhibit
such changes, for example CYP102A1 [53, 54], CYP3A4 [56] and CYP2B4 [55] do so via
displacement of the F-and G-helices, requiring interactions with the I-helix to be broken.
Despite large conformational changes in the F/G-loop through unwinding of these helices,
CYP119A1 retains inter-helical contacts between the F/G- and I-helices. The F- and G-
helices of CYP119A1 remain in relatively constant positions, the F/G-loop extending and
retracting via unwinding of the F- and G-helices [12] (Figure 3). This difference from other
P450s may be due to CYP119A1 having more branched chain amino acids in the region,
which form a tight network of inter-helical contacts between the G- and I-helices. This could
allow CYP119A1 to endure harsher temperatures, with the F/G-loop taking on the role of
controlling substrate entry and release [12].
The I-helix and conserved Thr residue
The most highly conserved regions of CYP119A1 with respect to mesophilic P450s are
located near the heme group, namely the conserved Cys thiolate ligand (Cys317), and to a
degree, the I- and L-helices. The CYP119A1 I-helix is slightly unusual in that it contains an
additional two Thr residues following the highly conserved Thr213 (Thr252 in CYP101A1).
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Thr213 residue has been implicated in the transfer of protons during P450 catalysis, and the
prevention of auto-inactivation by H2O2 generated by uncoupling during catalysis [57, 58]. In
CYP101A1, Thr252 donates an H-bond to the peptide oxygen atom of nearby Gly248.
However, the H-bonding network of CYP119A1 is dissimilar, with Thr213 apparently
unbonded and Thr214 instead forming an H-bond with Gly210. This bonding pattern instead
resembles that of CYP102A1. Mutagenesis has revealed that Thr213 is catalytically
important for CYP119A1, while Thr214 may help to control the spin state of the heme and
also play a role in substrate binding [18, 59]. The role of Thr215 has not been investigated.
Despite significant effects on activity and heme coordination states, mutation of Thr213 and
Thr214 had little effect on thermal stability, with the Tm of single mutants remaining within
2.4 °C of the Tm of the WT [59].
Role of the Cys-thiolate ligand and Cys-ligand loop in stabilization of the CYP119A1 fold
The highly conserved thiolate Cys317 and Thr213 residues were mutated in CYP119A1
(C317H/T213A) and the structure was solved [60]. Compared with the wild type (WT) this
mutant only suffered a 1.2 °C loss in 10T50 (the temperature at which half the protein remains
folded after heating for 10 minutes). Mutants of the conserved cysteine to all 19 other amino
acids showed a maximal decrease in 10T50 of ~8 °C, with an average of 6 °C. Considering the
structural importance of this residue to the P450 fold and its conservation throughout the
clade, these decreases in stability can be considered minor [60]. The heme was retained to at
least some degree in all mutants, however the typical P450 Fe(II).CO Soret peak was shifted
suggesting a different coordination environment, which in some cases may have involved
coordination to His315. This is the first example of a P450 Cys-thiolate mutant crystal
structure to be solved, and it is a testament to the stability of CYP119A1 that it can withstand
such a mutation. High temperature molecular dynamics (MD) simulations showed that the
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Cys ligand loop, which unfolds during thermal denaturation of CYP176A1 (P450cin) and
CYP101A1, is stabilized by tight nonpolar interactions between Tyr26 and Leu308. A double
mutant of these two residues (Y26A/L308A) was constructed and exhibited a 16 °C decrease
in Tm compared to the WT [61]. Individual mutants Y26A and L308A resulted in a 12 and 10
°C decrease in Tm respectively [61].
Other features that may augment stability
Many theories have been proposed to explain the stability of proteins from thermophilic
organisms based on the relative contributions of individual amino acids. CYP119A1 contains
a higher proportion of buried isoleucines and fewer alanines than mesophilic P450s,
potentially resulting in an overall increase in hydrophobic interactions [62]. Yano et al.
identified fewer 2-residue salt bridges (8-10) in CYP119A1 than CYP101A1 (19) and
CYP107A1 (15), but more salt-bridged networks involving 3 or more residues (four in
CYP119A1 vs. three for CYP101A1 and one in CYP107A1) [39]. However, a comparison of
the thermophilic structures available currently and representative mesophilic structures
(Supplementary table 2) revealed less of a difference in the number of 2-residue salt bridges
(10 ± 3 for CYP119A1 vs. the mesophilic average of 11 ± 3). Nevertheless, CYP119A1 does
contain almost double the number of salt-bridged networks (8 ± 2) compared to the
mesophilic average (4 ± 1). A greater proportion of the total residues involved in salt bridges
are involved in networks (56 % compared to 44 %). The salt-bridged networks in CYP119
also span greater distances compared to those in the mesophilic forms, potentially
contributing to the compactness of the structure [39]. Mutagenesis of Glu114 which is
involved in a salt-bridged network with Arg363 and Glu342 resulted in a decrease in Tm of
3.8 °C [43]. Similarly, mutation of Arg259 to Lys to disrupt a salt link to the propionate
group of the heme caused a 5.9 °C decrease [43]. However, mutagenesis of Arg154 and
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Glu212 to disrupt a salt bridge formed in the imidazole and water-bound forms of
CYP119A1 (but not present in mesophilic P450s) did not result in a significant decrease in
stability [63].
CYP119A1 has unique clusters of aromatic residues that have been proposed to contribute to
its stability. Two clusters, linked together by the guanidinium group of an Arg residue, form
an aromatic/nonpolar “ladder” spanning a total distance of ~39 Å along the side of the protein
(Figure 4). The first cluster comprises Tyr2, Trp4, Phe5, Phe24, Trp281 and Tyr15 (the latter
via co-association with Phe24 to Met8) spanning ~11.3 Å. The second contains Phe225,
Phe228, Trp231, Trp250, Phe298, Phe334 and Phe338, spanning ~24 Å [39]. In comparison,
CYP107A1 has a single cluster of aromatic residues spanning just ~11 Å. Targeted and
random mutagenesis studies revealed that the mutation of individual aromatic residues
(Phe24Ser, Trp231Ala, Tyr250Ala, Trp281Ala) each resulted in an approximate 10 °C
decrease in Tm, with double mutants Tyr2Ala/Tyr250Ala and Trp4Ala/Trp281Ala resulting
in a 12-15 °C decrease [43, 63]. By contrast, mutation of an aromatic residue (Tyr168) that
was buried to a similar degree but on the other side of the protein caused no change from the
WT Tm value [63]. These results suggest that the extended aromatic clusters of CYP119A1
play an important role in its thermal stability.
The structure of the Phe24Leu mutant revealed no obvious structural deviations from the
WT, however this mutant was found to be only ~4 °C less stable than the WT [47]. Besides
the Phe24Ser involved in aromatic clusters, and Glu114Asp and Arg259Lys involved in salt
bridges, a set of other single and multiple mutants demonstrated Tm decreases in the range of
0.7-8.4 °C: Lys176Arg/Ile329Met (0.7 °C), Asp52Val/Asp72His/Glu273Gly/Lys348Arg (1.5
°C), Ile272Val/Asn367Arg/Glu368Ile (2.6 °C), Arg80Gly (4.6 °C),
Ser40Cys/Thr67Ala/Val118Leu (5.3 °C), Arg235Gly/Ile282Val/Ile299Val/Glu52Lys (7.4
°C), and Gly313Glu (8.5 °C) [43].
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
CYP119A2
CYP119A2 (P450St) was isolated from S. tokodaii (Sulfolobus sp. strain 7), an
acidothermophilic archaeon that is closely related to S. acidocaldarius, with optimal growth
conditions of pH 2-3 and 75-80 °C [10]. CYP119A2 has been shown to remain redox-active
in a didodecyldimethylammonium bromide (DDAB) film at temperatures up to 80 °C [40].
The temperature has been increased to 120 °C when experiments were carried out with the
electrochemical solvent poly(ethylene oxide) (PEO) [64].
Sequence and structure
At 367 residues, CYP119A2 is one residue shorter than CYP119A1, with the two proteins
sharing 64% sequence identity (Figure 2). The crystal structure of CYP119A2 was solved to
3.0 Å [40] and was highly similar to the previously solved CYP119A1 water- and imidazole-
bound structures with respective root mean squared deviations (RMSDs) of 1.3 and 1.4 Å
[39, 47] (Figure 1). The substrate-free structure has a water molecule coordinated to the
heme. A second water molecule forms a bridge between the coordinated water molecule and
residues of the I-helix (Ala210, Thr214), mirroring the H-bonding network found in the
water-bound structure of CYP119A1 [47].
One key difference identified between the structures of CYP119A2 and CYP119A1 occurs in
the F/G-region, which undergoes significant conformational rearrangements in CYP119A1 to
accommodate ligands of different sizes (Figure 3). While the F/G-loop of CYP119A1 appears
to adopt a conformation either enclosing the coordination site (CYP119A1-imidazole), or
directed away from the heme group (CYP119A1-H2O), the loop appears to adopt an
intermediary conformation in the case of CYP119A2-H2O, partially covering the
coordination site of the heme. This discrepancy is due to CYP119A2 containing additional
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
residues in the F-helix and G/H-loop in comparison to CYP119A1, resulting in a relative
displacement of the G-helix by ~4 Å. An interesting difference noted in this region is the
presence of a Cl- anion in CYP119A2 crystal structure at the N-terminal end of the G-helix.
This ion binds to both the side and main-chain atoms of Arg162, preventing the unravelling
of this end of the helix as observed in CYP119A1, and potentially affecting the affinity of the
active site for the ligand. The CYP119A2 structure has not yet been solved in the presence of
a substrate, so it is difficult to interpret its potential conformational changes.
Other features that may augment stability
CYP119A2 contains a total of 13 2-residue salt bridges and 7 salt-bridged networks
compared to the mesophilic averages of 11 ± 3 and 4 ± 1 respectively (Supplementary table
2), an increase which may play a role in the stability of CYP119A2. All 13 aromatic residues
involved in the aromatic clusters of CYP119A1 were conserved in CYP119A2, either directly
or through mutation to another aromatic residue in the case of Phe228Tyr and Tyr250Phe.
However, these mutations do not appear to influence the orientation or alignment of the side
chain, with the two structures superimposing very closely (Figure 4). The only exception to
this is Phe338, which is rotated by ~90° in the CYP119A2 structure due to a change in the
secondary structure of the β3 sheet.
Like CYP119A1, CYP119A2 has shorter β5-turn than most P450s, except for CYP55. When
tested for its ability to turn over substrate in the absence of a redox partner, CYP119A2 was
found to catalyze styrene epoxidation supported only by NADPH or NADH [51].
CYP175A1
CYP175A1 was the second P450 derived from a thermophilic organism to be crystallized,
and the first from a thermophilic eubacterium. T. thermophilus strain HB27 is a gram-
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
negative bacterium with an optimum growth range of 65-72 °C, but which can grow at
temperatures up to 85 °C [65, 66]. CYP175A1 has a Tm of 88 °C, comparable to that of
CYP119A1 [39].
Sequence and structure
The structure of CYP175A1 was solved to 1.8 Å using the heme domain of CYP102A1 as a
molecular replacement model [11] (Figure 1). A crystal structure of the homolog from T.
thermophilus HB8, which differs only by 10 residues, has also been deposited in the PDB
library (1WIY) [67], and its structure superimposes closely with that of the HB27 form.
CYP175A1 exhibits typical P450 structural characteristics, with a conserved heme-binding
motif and thiolate Cys residue at position 336. At 389 residues in length, this protein is again
significantly shorter than mesophilic P450s like the CYP102A1 heme domain (472 residues),
but slightly longer than CYP119A1 (368 residues) [11, 39, 68] (Figure 2). As with
CYP119A1, many of the differences to mesophilic P450s are in loops connecting secondary
structural elements. A difference of seven residues is found in the loop connecting helices E
and F, which spans residues 159-172 in CYP102A1 compared to 146-152 in CYP175A1
(Figure 2). Eight residues are also missing from the β5 turn connecting helices H and I (239-
250 in CYP102A1, 202-205 in CYP175A1), as for the thermophilic CYP119s and the self-
sufficient CYP55 [11, 39, 50, 51]. The G-helix is notably two turns shorter in CYP175A1,
however residues 176-195 align well with 198-217 of CYP102A1 with a RMSD of 1.14 Å. In
comparison to CYP102A1, the F-helix is shifted by an approximate half-turn however the
core region (157-167 in CYP175A1) aligns closely (RMSD = 1.66 Å).
CYP175A1 is highly similar to the CYP119A1 imidazole- and 4-PI bound structures, with an
RMSD of 1.7-1.8 Å. Its N-terminus is slightly longer (by ~8 residues), and contains an
additional A’-helix from residues 5-18, resembling that of the 310-helix in CYP102A1. The
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
start of the A-helix of CYP119A1 corresponds to approximately residue 22 of CYP175A1.
The B’-helix of CYP175A1 is more conventional than in CYP119A1, adopting a similar
conformation to CYP102A1 as the lid of the substrate access channel. Compared to
CYP119A1, the CYP175A1 protein retains a more traditional conserved Thr(225) H-bonding
network, with no additional Thr residues in the subsequent positions.
Other features that may augment stability
CYP175A1 has a higher overall Arg content (12.1 %) and lower Lys content (2.8 %), than
many mesophilic P450s (averaging 6.1 % and 5.3 % respectively), especially compared with
CYP102A1 (4.5 % and 8.1 % respectively, Supplementary table 3). Improved stability is also
sometimes correlated with a decrease in uncharged polar residues (Asn, Glu, Ser, Thr) and an
increase in charged residues (Lys, His, Arg, Glu, Asp) [69, 70]. CYP175A1 shows a decrease
in polar uncharged residues, but only a minor increase in charged residues. Notably,
CYP175A1 does not contain the large network of aromatic residues present in CYP119A1.
The longest network resembling that of CYP119A1 spans only ~13 Å (compared to ~39 Å in
CYP119A1), and so is unlikely to play a significant role in the stability of CYP175A1.
However, CYP175A1 contains more salt-bridged networks than mesophilic P450s; 8 ± 2
compared to the mesophilic average of 4 ± 1 (Supplementary table 2). A greater proportion of
the total residues involved in salt bridges are involved in networks containing more than one
residue: 62 % of the salt-bridged residues in CYP175A1 are in networks, compared to an
average of 41 % in mesophilic P450s.
A thermodynamic analysis of the heat- and urea-driven denaturation of CYP175A1 and
CYP101A1 showed that the increased stability of CYP175A1 is the result of a higher free
folding energy (ΔGAq), due to higher enthalpy (ΔHm) [71]. The increase in enthalpy may be
due to more internal electrostatic and H-bonding interactions, since more hydrophobic
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
interactions would result in entropy-driven stability. This is supported by the fact that both
proteins contain a similar content of hydrophobic residues, but CYP175A1 has more salt-
bridged networks and a greater ratio of Arg to Lys residues than CYP101A1 [71].
CYP231A2
CYP231A2 was one of two P450 genes identified in the acidothermophilic archaeon P.
torridus by a BLAST search of the available genomes of thermophilic organisms [2, 72].
While not quite as thermophilic as Solfolobus or T. thermophilus, this species thrives at
temperatures of ~60°C. It has optimal growth at pH 0.7 and can even grow at pH 0 making it
the one of the most acidophilic organisms identified to date [72]. The intracellular pH of P.
torridus is 4.6, lower than that of S. acidocaldarius at ~5.6 [72, 73].
CYP231A2 has a Tm of 65 °C when ligand free, but the more compact 4-PI-bound form has
higher stability (Tm of 73 °C) [12] . The optimal growth temperature of P. torridus is 60 °C,
so it is presumed that binding of the endogenous substrate of CYP231A2 would cause a
similar increase in stability. While CYP231A2 is more thermostable than mesophilic P450s,
its Tm is not as high as that of the “hyper-thermostable” P450s (CYP119A1/2 and
CYP175A1) from more hyperthermophilic organisms. Whereas CYP101A1 is irreversibly
denatured at pH 4.5, CYP231A2 can undergo a reversible transition to P420, returning to
P450 upon neutralization of pH [12].
Sequence and structure
The structure of CYP231A2 was solved to a resolution of 2.5-3.1 Å using both molecular
replacement, based on CYP119A1, and MAD (Figure 1) to resolution of 2.5-3.1 Å [12].
CYP231A2 shares 38-39% sequence identity with CYP119A1 and CYP119A2. At 352
residues, CYP231A2 is even shorter than previously studied thermophilic proteins, and much
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
shorter than mesophilic P450s, due mostly to an N-terminal truncation. The N-terminal A-
helix present in most P450s, including the thermophilic forms studied to date, is absent and
the protein instead begins with the β1 strand (Figure 2). There is a degree of structural
ambiguity in the B’-helix, specifically from residues 44 to 52, which was attributed to either
lack of a bound substrate to stabilize the structure, or the effects of missing an A-helix on the
B’-helix [12].
Other features that may augment stability
CYP231A2 differs from the other thermophilic P450s, in that it contains fewer salt-bridged
networks (3 ± 1) than mesophilic P450s (4 ± 1) (Supplementary table 2). Additionally, as Ho
et al. point out, salt bridges are unlikely to play a role in the stabilization of CYP231A2, as
many carboxylate groups would be protonated at internal the pH of P. torridus (~ 4.6) [12].
Moreover, there was no significant difference observed in Tm values measured at both pH 7
and 4 [12]. CYP231A2 does not contain aromatic networks on the scale of those seen in
CYP119A1, with the largest network consisting of just five residues (Tyr18, His23, Tyr25,
Phe243, Tyr269) and spanning a total distance of 13.9 Å (Figure 4). Rather, the
thermostability of CYP231A2 was attributed primarily to its small size and hence lower
surface to volume ratio. Its lack of hyper-thermostability compared to the other thermophilic
P450s may be the result of the absence of salt-bridged networks and large aromatic clusters.
Structural insights from thermophilic P450 structures
Determining the structural characteristics that give rise to thermostability is complex, as these
factors can vary between and even within protein families. There have been many attempts to
establish general rules of what yields a thermostable protein, but for every rule there is at
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
least one exception. The thermophilic P450s are no different, appearing to employ a variety
of stabilization mechanisms.
Size and surface turns
The first and most obvious generalization that can be drawn from the sequences and
structures of thermophilic P450s is that these proteins tend to be shorter than those derived
from mesophilic organisms (Figure 2). The thermophilic proteins are ~352-389 residues
whereas their mesophilic relatives average around 403-470 residues, for the commonly
studied mesophilic bacterial P450s: CYP107A1, CYP101A1, CYP176A1, CYP107H1
(P450BioI), CYP108 (P450terp) and the CYP102A1 heme domain). This difference is mostly
due to an N-terminal truncation, but throughout the proteins, several more residues are absent
in connecting loops and surface turns, resulting in a more compact overall structure (Figure
2). The β1-1/β1-2 hairpin loop and the β5 turn, between helices H and I, are each shortened
by 4-8 residues in multiple forms. The β5 turn truncation may be characteristic of the ability
to act independently of a reductase partner. Shortened helices are also a common feature,
with all thermophilic forms having G-helices that are shorter by ~1-3 turns compared to
CYP102A1 and CYP101A1. In the absence of sequence elements that are (evidently) not
essential to establish and maintain the P450 fold, these proteins gain a more compact, robust
structure with a higher surface area to volume ratio.
Amino acid content
Several studies have linked changes in amino acid content with improvements in overall
thermal stability [62, 69, 70, 74]. Examining the thermophilic P450s, while some trends can
be observed, multiple mechanisms appear to be able to achieve the same result
(Supplementary table 3). The one major difference that can be observed across all
thermophilic forms appears to be an increase in charged residues (Lys, His, Arg, Glu, Asp)
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
compared to polar uncharged residues (Asn, Gln, Ser, Thr), with thermophiles showing a
higher charged:polar uncharged ratio (31.2:15.5 %) than the mesophiles (27.1:18.1 %). The
hyperthermophiles (CYP119A1, CYP119A2, CYP175A1) showed a significant increase of
4.7% (P=0.02) in the proportion of charged residues compared to mesophilic organisms and a
decrease of 3.5% (P=0.08, Table 2) in polar uncharged residues. CYP231A2 has an
intermediate ratio (29.7:18.4 %), concordant with its intermediary stability.
Statistical analysis of the amino acid composition of three hyperthermophilic (CYP119A1,
CYP119A2, CYP175A1) vs. representative mesophilic forms (Supplementary table 3)
revealed significant changes in individual amino acid content: higher Glu (P=0.00001), lower
Cys (P=0.02), lower His (P=0.05), lower Met (P=0.05), lower Gln (P=0.02), and lower Thr
content (P=0.03). The changes in Gln, Thr and Glu correlate with alterations in the balance of
charged to polar, uncharged residues [70]. Cys, His and Met content are slightly decreased in
the hyperthermophiles (0.7 - 1.1 %), which is in agreement with generally small decreases in
these residues across a broader comparison of mesophilic and thermophilic proteins [69].
Substitutions of Met for Leu residues have also been observed to correlate with stability [74],
and although not significant, a general increase is also seen in the proportion of Leu residues
in these hyperthermophiles (Supplementary table 3).
CYP175A1 has a higher percentage of hydrophobic residues (54.8 %) compared to all the
other P450s examined (42.9-51.0 %). Increased Ala content was originally thought to
improve stability, due to its propensity for forming helices [74]. However later studies have
associated a slight decrease in Ala content with thermophilic proteins [69]. This seems to be
true for the thermophilic P450s other than CYP175A1 (which has an Ala content of 11.6%),
as they show Ala contents of 3.3-4.9 % compared with 7.2-9.8 % in bacterial mesophiles.
The other thermophiles also generally show higher Ile content (but in CYP175A1, Ile content
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
is lower). This could be an alternative mechanism of stability, resulting in better side-chain
packing in the protein core [62].
CYP175A1 was also distinguished amongst the thermophilic P450s by being the only form to
show a change in Arg/Lys ratio (12.1:2.8 % Arg:Lys) over mesophilic P450s, especially
compared to CYP102A1 (heme domain) which has high Lys content (4.4:7.9 %). However,
comparison with other representative forms (Supplementary table 3) reveals that CYP102A1
appears to be unusual amongst mesophiles, which have an average Arg:Lys ratio of 6.1:5.3%.
Structural interactions
Another common feature of thermostable P450s is the number of salt bridges; in particular,
an increase in the number of salt-bridged networks (i.e., containing more than 2 residues). It
has long been recognized that proteins from thermophilic organisms tend to have an
increased number of charged residues on the surface and more salt bridges [75-78] and
experimental work has linked individual salt bridges and salt-bridged networks with
increased stability across many different protein families [43, 76, 79, 80]. While some models
have suggested that salt bridges should have a negative or neutral effect on stability due to a
high associated desolvation penalty [81, 82], others have suggested that these parameters do
not hold true at high temperatures, such as those experienced by thermophilic organisms [78,
83]. At elevated temperatures, the dielectric content of water decreases and charged side
chains incur significantly lower desolvation penalties [11, 78].
Interestingly, most of the thermophilic P450 structures appear to have a slightly lower
average number of 2-residue salt bridges (8-10) compared to mesophilic forms (11 ± 3),
except for CYP119A2, which contains 13 (Supplementary table 2). However, compared to
the mesophilic average (4 ± 1), there are more salt-bridged networks containing 3 or more
residues in every thermophilic structure (7-8) except for CYP231A2 (3 ± 1). Of the residues
involved in salt bridges, a greater proportion are involved in networks containing more than 2
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
residues (52%) compared with mesophiles (41%). CYP231A2 once again fails to follow this
trend with only 37% of salt-bridged residues belonging to networks. Salt-bridged networks
appear to be distributed more evenly across the surface of the proteins in the thermophiles,
compared to CYP101A1 and CYP102A1 [11].
Most conclusions regarding the relevance of electrostatic interactions are drawn only from
comparisons of relative abundance, but a few studies have tested this hypothesis
experimentally. Disruption of a salt-bridged network in CYP119A1 (Glu114, Arg363,
Glu342) via the mutation Glu114Asp resulted in a 3.8 °C decrease in Tm [43] and a mutation
disrupting the electrostatic interaction between Arg259 and the heme propionate resulted in a
5.9 °C decrease [43] consistent with a stabilizing effect of these salt bridges on the active site
of thermophilic proteins at high temperatures [76]. In contrast, disruption of the unique salt
bridge between surface residues Arg154 and Glu212 did not have a significant effect on
stability [63].
The impact of pH on thermostability has only been considered for CYP231A2 [12], and it
was suggested that salt bridges are unlikely to be important due to the acidic cellular
environment. Similar analyses have not been done for the other thermophilic P450s, but
would be informative concerning the role of charge-charge interactions on thermostability.
A significant factor believed to contribute to the stability of CYP119A1 and CYP119A2 are
extended networks of aromatic residues. This has been experimentally validated for CYP119,
with mutations of several individual residues in this network resulting in a decrease in Tm of
up to 10°C [43, 63]. The aromatic networks of CYP175A1 and CYP231A2 (Figure 4) are
less extensive (with the largest networks spanning ~13 Å compared to ~ 39 Å for CYP119)
and thus far no experimental data is available to test the hypothesis that they stabilize these
proteins.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Conformational dynamics
Another feature identified in CYP119A1 that differs from other P450s is the active site
arrangement and mechanism of rearrangement during substrate binding. Large
conformational changes, depending on the size of the substrate bound, have been identified in
several P450s, including CYP102A1, CYP3A4 and CYP2B4. CYP175A1 and CYP231A2
both exhibit similar conformational movements to the mesophilic examples, with the F- and
G-helices moving together as a unit, breaking and forming new inter-peptidyl bonds with the
I-helix and other parts of the protein [39, 47, 55, 56]. By contrast, the F- and G-helices of
CYP119A1 do not move so much when different substrates are bound, but appear to remain
locked in their positions while the ends of the F- and G-helices unravel to extend and contract
the F/G-loop, enabling it to either point outward away from protein in the case of the water-
bound form, or dip down to enclose the active site varying degrees depending on the size of
the bound substrate. The F/G-loop unravelling may be an adaptation to allow the active site
to retain its conformational flexibility and ability bind different substrates, while more
branched-chain amino acids in the G- and I-helical contacts, resulting in rigidly locked core
helices, may improve the overall rigidity and stability of the core P450 structure.
Artificial evolution of thermostability in P450s
Improving the thermal tolerance of native enzymes by protein engineering has become a
popular objective in recent years with increasing appreciation of the potential to implement
enzymes as biocatalysts. With regard to P450s, attempts to increase thermostability have thus
far focused on two subfamilies: the mammalian, xenobiotic metabolizing CYP2B subfamily;
and the bacterial fatty acid hydroxylases from the CYP102A subfamily, particularly
CYP102A1 (CYP102A1).
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Engineering of CYP102A1
The interest in stabilizing CYP102A1 stems from its potential as a biocatalyst for the
production of fine chemicals and bio-remediation. In directed evolution studies targeting
CYP102A1 thermostability, random mutagenesis has been applied to identify specific
stabilizing residues. Extensive chimeric libraries have also been generated using predictive
statistical methods. All studies have used a laboratory-evolved variant of the isolated
CYP102A1 heme domain, denoted as 21B3, as the ultimate template. The 21B3 mutant is
capable of functioning as a peroxygenase, utilizing hydrogen peroxide in place of NADPH
and O2 [84], eliminating the requirement for NADPH or the reductase domain. The heme
domain alone is more stable (T50 = 57 °C) than the full-length protein (T50 = 43 °C) since the
reductase is the less stable domain of the holoprotein. The 21B3 mutant was found to have a
T50 of ~46 °C, which is less stable than the wild type (WT) heme domain, but more stable
than the full-length protein.
In the first study to report engineering of a P450 to resist thermal denaturation, five cycles of
random mutagenesis were performed starting with 21B3, followed by a round of DNA
shuffling with another heme domain peroxygenase variant containing the mutation Phe87Ala
[85, 86]. From this, a thermostable peroxygenase variant, 5H6 (T50 = 61°C), was identified
that differs from 21B3 in 8 of the total 464 amino acids. Only two non-conservative
mutations, Ser106Arg and Glu442Lys, involved a change in charge. The other six
substitutions, Leu52Ile, Met145Val (a reversion to the WT), Ala184Val, Leu324Ile,
Val340Met and Ile366Val, were conservative mutations but differed in their hydrophobicity
and size.
Mutations from 5H6 have been used for stabilizing other CYP102A1 mutants in subsequent
directed evolution studies. The Ile366Val and Glu442Lys mutations were introduced and four
other mutations reverted to the WT residues (Cys47Arg Ile94Lys Cys205Phe Ser255Arg) in
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
9-10A, a mutant engineered to accommodate bulky substrates and containing 15 mutations
compared to the WT [87]. These changes increased the half-life from 3 minutes to
136 minutes at 50 °C (native CYP102A1 has a half-life of 68 minutes) [88]. Mutation
Ile366Val has also been paired with Leu52Ile and introduced into a dopamine binding mutant
that had been destabilized through the accumulation of 15 mutations (T50 = 43.4 °C) [89].
These substitutions were able to increase the T50 by 5 °C. Likewise, Ile366Val was used to
increase the T50 of a CYP102A1 mutant already containing 6 other mutations [90] from 48 °C
to 52 °C. In addition, variant 5H6 itself was subjected to error-prone PCR (epPCR) and,
although the library was screened primarily for functional improvement, three mutants were
identified with T50 values up to 3 °C higher [91]. In all three variants, a buried Phe was
converted to Leu at position 173; this substitution was not found in any other characterized
mutant.
Analysis of the individual stabilizing effects of mutations Leu52Ile, Leu324Ile, Val340Met,
Ile336Val and Glu442Lys found that each substitution alone, with the exception of
Leu324Ile, was able to increase the stability of the aforementioned dopamine binding mutant
[89] by ~1-4 °C. The Leu324Ile mutation caused a destabilization of 0.3 °C. Furthermore,
double mutants Ile366Val/Glu442Lys and Ile366Val/Val340Met, which individually were
the most stabilizing substitutions, resulted in misfolded proteins.
All these residues are relatively dispersed throughout the protein fold (Figure 5) mostly
located in peripheral regions as opposed to the heme-binding core. According to the crystal
structure of the WT CYP102A1 heme domain [92], Ser106Arg, Leu324Ile, Val340Met,
Ile366Val, and Glu442Lys are located on the protein surface and Leu52Ile, Ala184Val,
Met145Val are buried. The three buried mutations all introduce residues of increased
hydrophobicity, which may act to anchor peripheral loops and turns, preventing their
disruption. Conversely, mutations Val340Met and Ile336Val decrease hydrophobicity but are
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
exposed to solvent, as is Glu442Lys, and these residues may be stabilizing through altered
interactions with the surrounding water matrix.
Chimeras of CYP102A forms have been created by structure-guided recombination, using the
SCHEMA algorithm [93]. This method identifies cross-over points at which parent proteins
can be recombined to minimize disruption (i.e., destabilization) of the three-dimensional fold.
Seventeen double-crossover chimeras were created by swapping fragments between the heme
domains of CYP102A1 and CYP102A2, that had been mutated at Phe87Ala (CYP102A1)
and Phe88Ala (CYP102A2) respectively to enhance peroxygenase activity. None of the
chimeras were more stable than CYP102A1, but more than half were more thermostable than
CYP102A2 (T50 = 44°C) [94]. This study was subsequently expanded with the inclusion of
CYP102A3, and the three CYP102A peroxygenase mutants were recombined using seven
crossover points to create a library of >600 properly folded chimeras [95]. The most stable
chimera identified from this library had a T50 of 62 °C and differed from its closest parent by
84 amino acid substitutions, making it difficult to identify the residues responsible for
stabilization. However, this mutant only differed from the second most stable mutant (T50 =
56 °C) at 30 residues within the N-terminal fragment, which was derived from CYP102A2
(T50 =44 °C) in the more stable variant and CYP102A3 (T50 =49 °C) in the less stable variant.
In a later study, a further 204 chimeras [95] were assessed for their thermostability. This data
was then used as a training set to produce a linear regression model that was used to predict
T50 values for >6000 chimeras, including 30 with T50 ≥ 60 °C. These 30 mutants were
constructed and all displayed T50 values between 58.5 °C and 64.4 °C. The most thermostable
chimera predicted by the model (T50 = 64.4 °C) also happened to be the consensus sequence,
i.e. the sequence that combined the fragments that showed the highest frequency at each
position amongst the folded chimeras. The stable chimeras differed from one another at
between 7 and 99 positions with an average of 46 differences [96]. This illustrates the
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
limitation of chimeragenesis approaches, in that it is difficult to identify the stabilizing
residues and interactions in these mutants since the number of changes from any given parent
can range from tens to hundreds of residues.
Another statistical method that has been employed for predicting stable chimeric P450s is by
modelling fitness landscapes using Gaussian process regression, a Bayesian, machine-
learning approach [96]. A model was trained on sequence information and thermostability
data from all available chimeric P450s [96] and used to predict a total of 34 sequences that
were likely to be more stable than the most stable parent. Of these, 28 had a T50 ≥ 60 °C, 12
had a T50 ≥ 65 °C. The most stable variant displayed a T50 of 69.7 °C, making it the most
stable CYP102 variant identified to date [97]. In this study, the Upper Confidence Bound
(UCB) algorithm was used to iteratively improve the Gaussian process model in regions of
the landscape that were predicted to be highly optimized. These models are successful
because they were trained on experimental data, which implicitly allows consideration of all
factors that contribute to stability, including those that are unknown.
Engineering of CYP2B enzymes
Despite the abundance of industrially relevant eukaryotic P450s, the only forms to have been
the focus of engineering towards thermostability are several from the CYP2B subfamily.
Thermostable CYP2B forms are of interest to the pharmaceutical industry as they play a
significant role in the metabolism of many drugs and xenobiotics (albeit a relatively minor
one in humans compared to CYP3A, CYP2C and CYP2D forms). CYP2B forms show
conformational flexibility and substrate promiscuity while also displaying higher
thermostability than many other native mammalian P450s [98] (Thomson et al., unpublished
data) making them interesting candidates for structural studies. Both rational and random
approaches have been taken to stabilize CYP2B forms.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Residues of interest have been selected for rational mutagenesis based on the stability
differences between native CYP2B forms. CYP2B1, CYP2B4 and CYP2B11 have been
found to have Tm values ~13 C, 10 C and 3 C higher than CYP2B6 respectively, and 25
residues are conserved between the three more stable forms but differ in CYP2B6. Eleven of
these residues that are buried in the available crystal structures were mutated in CYP2B6 to
match the common CYP2B1/CYP2B4/CYP2B11 sequence [98]. The Leu264Phe mutant was
the only variant to show an increase in stability (4 C) and it was hypothesized that this
change may increase the hydrophobicity or integrity of the H-helix.
A similar method was used to improve the thermal stability of CYP2B6 and CYP2B11.
Seven residues in these forms were mutated to match the amino acid found in the relatively
more stable CYP2B1 and CYP2B4. Mutation Pro334Ser was found to improve the Tm of
CYP2B6 by ~7 C and CYP2B11 by ~2 C. The mechanism of this stabilization was
explored with pressure-perturbation spectroscopy and it was found that CYP2B enzymes
containing a serine at position 334 are more compressible than those with a proline at this
position. Therefore, the Pro334Ser may stabilize the structure by increasing conformational
flexibility in the region of the heme pocket [99].
Random mutagenesis has also been used to produce a library >3000 variants of the CYP2B1
peroxygenase mutant, QM (Val183Leu/Phe202Leu/Leu209Ala/Ser334Pro). Two QM
mutants, containing Leu295His and Lys236Ile/Asp257Asn mutations respectively, showed
enhanced tolerance to temperature over QM with an increase in T50 of ~1 C and ~2 C
respectively [100]. However, when these three mutations were combined, the resulting
mutant displayed a T50 ~ 6 C lower than QM. The reason for this apparent deleterious
epistatic effect is not known. Residues 295, 236 and 257 are located in the I-helix, G-helix
and the G/H-loop respectively, and do not appear to interact with each other. This illustrates
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
the difficulty in predicting the effect of mutations in a landscape of unknown epistatic
interactions.
Structural insights from engineering studies
Thus far, attempts to increase the thermostability of P450s using random or site-directed
mutagenesis have identified relatively few beneficial mutations that generally provide
increases in stability of less than 10 °C. The effect of these mutations on divergent forms has
yet to be explored so it is difficult to predict whether any of these changes may have a
generally stabilizing effect on the P450 fold. While chimeragenesis has proven to be a
promising alternative method for producing highly stabilized variants, particularly when
coupled with predictive modelling, the underlying structural determinants of enhancements in
stability are difficult to determine since inter-residue interactions are altered simultaneously.
Chimeragenesis may be a more successful approach for engineering thermostability due to
the increased proportion of properly folded mutants that are produced by recombination of
naturally evolved forms, compared to the proportion produced by random methods, i.e., a
greater number of properly folded variants can be effectively assessed for their stability given
the lower “kill rate” compared to random mutagenesis. Moreover, recombination samples a
broader area of sequence space.
Thermostable redox partners
Ultimately, stabilization of the P450s is only half the challenge in implementing these
monooxygenases as biocatalysts. In order to exploit the full potential of a thermostable P450
it is necessary to couple it with a stabilized redox partner. Most P450 redox systems conform
to one of ten different classes, depending on the composition and topology of the electron
transfer pathway [101]. A minority of P450s undergo direct reduction by NADH (Class IX)
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[102, 103], catalyze isomerizations (Class X) [104-106], or act as peroxygenases, which
involves the direct oxygenation of the heme without the requirement of any redox partner
[107-109]. However most P450 enzymes rely on one or more auxiliary redox partners to
carry out monooxygenations [101]. Therefore, thermostable P450 systems, rather than only
thermostable P450 enzymes, are generally required for biocatalysis. So far only a very
limited number of thermostable P450 redox partners have been identified and the
understanding of what makes them more robust at higher temperatures is poorly understood.
In some cases, the P450 and a reductase domain are assembled on a single polypeptide chain
(Classes VII and VIII) [110-112] with the best characterized being the CYP102 family of
enzymes. As noted above, the reductase domain of CYP102A1 showed lower thermostability
than the heme domain, and a more thermostable holoprotein could be engineered by
replacing the reductase domain with that of the more stable CYP102A3, creating a functional
holoprotein with an optimal temperature of 51 ºC and ten-times longer half-life than
CYP102A1 at 50 ºC [113].
Most prokaryotic P450s and the mitochondrial P450s from eukaryotes belong to Class I,
requiring a flavin-containing ferredoxin reductase (FdR) plus a ferredoxin (Fd), an iron-sulfur
protein which acts as a mediator shuttling the electrons from the FdR to the P450. Fds can be
further classified according to the configuration of the iron–sulfur as [2Fe–2S], [3Fe–4S],
[4Fe–4S], or [3Fe–4S]/[4Fe–4S] [101].
The CYP175A1 redox partners
Native T. thermophilus redox partners for CYP175A1 have been identified in the form of a
[3Fe−4S][4Fe−4S] dicluster Fd (FdTt) and an NADP-selective FdR (FdRTt) [114]. FdTt and
FdRTt supported β-carotene activity with a turnover rate ~54 times higher than the PdX/PdR-
supported system [24]. FdRTt exhibits a 30T50 value of 99 ºC and FdTt shows a thermally
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
induced unfolding temperature of 114 ºC at pH 7.4 [115], making the combination of these
redox partners the most thermostable class I redox system identified so far. A fusion protein
comprised of the FdTt and FdRTt (175RF) showed an additional 25-fold increase in Vmax for
β-carotene hydroxylation at 65 ºC [25]. However the 10T50 of the fusion was lower at 80 ºC.
The crystal structure of FdTt has been solved and compared to other, homologous Fds [116]
such as the mesophilic Fd I from Azotobacter vinelandii, FdAv. The overall structure of FdTt
is defined by two iron-sulfur clusters which are sandwiched between two antiparallel alpha
helices and two beta-sheets, in a typical bacterial dicluster Fd ((βαβ)2 fold. FdTt is mostly
stabilized around the [3Fe−4S] cluster (cluster I). Cluster I is generally considered to be the
functional cluster in seven-iron Fds whereas cluster II plays a more structural role [117, 118].
Polar residues were found to predominate on the protein surface of FdTt, compared with
more negatively charged residues in FdAv, which may contribute to greater thermal stability
due to a reduction in unfavorable clustering of similarly charged groups [116]. Additionally,
the -helices in FdTt are more stable than those in FdAv. One (αA) is stabilized by
electrostatic interactions between an Asn residue and the positively charged end of the helix
macrodipole. The other helix (αB) is stabilized by the presence of alanine residues leading to
a more well defined -helix, compared to the 310-helix present in FdAv [116, 119]. FdTt also
exhibits tighter packing compared to FdAv leading to a general stabilization of secondary
structure elements in FdTt. In particular, it shows less local conformational strain than
FdAdv, which has non-glycine residues in a left handed helical conformation [116, 119]. By
contrast, FdTt and its thermostable homologue, FdBs from B. schlegelii, have glycines in
topologically related positions, which release strain in the protein backbone [116, 119, 120].
Large hydrogen bond networks were found in FdTt, involving side chain:main chain
interactions that stabilize the core structure, with the largest network around cluster I [116].
Bulky, charged, side chains are present around cluster II, which enhance the interaction with
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
the N-terminus and fix its conformation. In other thermostable Fds the N-terminus is also
anchored by large side chains in the proximity of cluster II, so this may be a common
mechanism for stabilization, serving to constrain solvent access to cluster II and thereby
enhance oxidative stability. Additionally, FdTt contains fewer Asn residues than FdAv,
which may be important since peptide bonds involving Asn can be easily hydrolyzed when
exposed to solvent at high temperatures [116, 121].
The lack of a crystal structure for the FdRTt obviates identification of factors that may be
responsible for stabilization of this second component of the redox system. To date, this FdTt
/FdRTt electron transfer system has only been reported to support the activity of CYP175A1
and therefore its potential for use with other P450 enzymes remains untested.
Etp1fd
The most thermostable Fd that has been identified to interact with multiple class I P450s is
the C-terminal domain of an electron transfer protein (Etp1) from the fission yeast,
Schizosaccharomyces pombe, Etp1fd [122-126]. Etp1fd is homologous to vertebrate-type Fds
[122] and has been characterized with respect to its thermal stability as the full length
(Etp1fd(505–631)) and a truncated form (Etp1fd(516–618)) created for crystallization [127].
Both are highly robust, showing thermal transition temperatures, Tm, of 70.5 °C for Etp1fd
(505–631) and 65.6 °C for Etp1fd (516–618). The lower thermal stability of the truncated
version was surprising, since C-terminal truncation of Adx, the mitochondrial sequence
homologue, yields a more thermostable Fd [128]. Both truncated Fd forms, Adx (4-108) and
Etp1fd(516–618) have been crystallized, and revealed a large number of potential stabilizing
factors [127, 129]: five additional stabilizing salt bridges at the protein surface in Etp1fd
structure, compared to Adx; fewer glycines and more prolines in loops, which would
decrease the conformational entropy upon folding [130]; potential stabilization of the helix
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
dipole by the substitution of Val34 in Adx by His546 in Etp1fd [131]; more charged and
hydrophobic residues in Etp1fd in place of uncharged polar side chains in Adx; and a two-
residue shorter loop between residues 550–560 in Etp1fd. Replacement of uncharged polar
side chains and loop shortening are both typical of thermophilic proteins compared to their
mesophilic homologues and may confer enhanced intramolecular interactions and reduced
conformational entropy, respectively [132, 133].
Thermostable diflavin reductases
Class II P450s are supported by the diflavin-containing, cytochrome P450 reductases (CPRs)
that contain an FAD-containing FdR-like domain and a linked FMN-containing flavodoxin
domain. Notably, no diflavin reductase has yet shown comparable thermostability to the class
I redox partners discussed above. The most thermostable putative diflavin reductases reported
to date are the bacterial sulphite reductase, BmCPR from B. megaterium, and CaCPR1,
isolated from Capsicum annuum, with 10T50 values of 54.1 °C and 56.1 ºC respectively.
BmCPR is known to transfer electrons to Etp1fd and various Fds but also supports the
reaction of the microsomal CYP21A2 very efficiently without any Fd mediator [134, 135]. In
contrast, CaCPR1 supports activity with microsomal CYP1A2 only at low rates relative to
the cognate mammalian cytochrome P450 reductase [136]. Given the ability of BmCPR to
support CYP21A2 reactions with high efficiency, BmCPR appears to be a more promising
thermostable redox partner to support other microsomal P450-catalyzed reactions. Neither
BmCPR nor CaCPR1 have been crystallized and therefore the factors responsible for the
stability of these forms have not yet been elucidated. However, in the case of the plant
CaCPR1, it was speculated that conditions of stress during plant development may have
imposed selection pressure to improve the supply electrons to P450s, leading to improved
stability [136].
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
2-Oxoacid:ferredoxin oxidoreductase systems
The thermostable redox systems that support the CYP119 from S. acidocaldarius are unusual
in utilizing coenzyme-A and pyruvic acid rather than NADH or NADPH as the source of
electrons, and are composed of a thermostable Fd and a 2-oxoacid:ferredoxin oxidoreductase
(OFOR) [137, 138]. CYP119 has been reconstituted with a Fd and an OFOR from the
thermophilic organism, S. tokodaii strain 7 (StOFOR) [137-139] and also with the Fd and an
OFOR from the thermophilic S. solfataricus (OFOR-Ss) [137] . The latter system was found
to display a 60T50 ~70 ºC with OFOR-Ss being the least stable protein component. The
reconstituted P450 system derived from S. tokodaii strain 7 withstands temperatures of up to
70 ºC for at least 20 minutes, again with the OFOR assumed to be the temperature-limiting
component [137].
The crystal structure of StOFOR has been reported recently, but factors contributing to its
thermostability were not examined in any detail [140]. By contrast, the crystal structure of the
Fd from S. tokodaii strain 7, which shows a Tm of 109 °C [141], has been solved and has
provided some insight into potentially stabilizing elements [142]. The overall structure is
composed of two parts, a core fold and an N-terminal extension. The core is a (βαβ)2 fold that
is common to bacterial dicluster Fds and so displays a similar architecture to FdTt. However,
the N-terminal extension is distinctive, comprising a 36-residue domain and a tetrahedrally
ligated zinc ion located at the interface between the core and the N-terminal extension. Both
have been proposed to contribute additional stability to the protein, as confirmed by a
subsequent protein engineering study [141, 142]. Mutants lacking zinc (while maintaining the
iron clusters), exhibited a decrease in Tm of 20 °C compared to the WT Fd [141]. The major
role attributed to the zinc was in combining the two β-sheets and mediating indirect
interactions between β-sheets (A′ + A) and β-sheet B, thereby potentially enhancing thermal
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
stability compared to bacterial Fds lacking the zinc. Notably, however, the zinc-deficient
mutants were still highly robust with Tm values of ~89 °C.
Additional thermostabilizing effects have also been attributed to the N-terminus. With zinc
bound, residues 1–11 interacted with C-terminal residues leading to a 9 °C (Tm = 98 °C)
stabilization. Deletion of these 11 residues decreased the Tm by 11 °C, while further
truncation of up to 30 residues failed to reduce the Tm further. Moreover, a similar N-terminal
additional sequence to that of S. tokodaii strain 7, has been reported for two Fds (Fd A and
Fd B) from S. metallicus, which both show similar thermal stability with respect to S.
tokodaii strain 7, despite Fd B lacking zinc [143]. Thus, both, the zinc and the N-terminal
addition may represent common but not essential stabilizing features of Fds.
Prospects for identifying broadly useful thermostable redox partners
None of the thermostable redox systems characterized to date has been studied with more
than a few different P450s. Further work is needed to explore the potential of these systems
to support industrially relevant P450s as biocatalysts. There is also considerable scope for
identifying novel thermostable redox partners. No endogenous redox systems have been
identified for CYP231A2 or CYP154H1, let alone any of the numerous other P450s in
thermophilic organisms for which sequences are now available. For example, a BLAST
search using Sulfolobus sp. Fdx and OFOR sequences identified several proteins with 42-
59% identity in P. torridus, and a homologue to T. thermophilus FdR sharing 35% identity.
Likewise, existing sequence data suggests numerous Fds and FdRs are present in
Thermobifida fusca.
Overall, analysis of the redox systems characterized to date suggests that stabilizing elements
can be diverse and are in some cases quite specific, such as an extended N-terminal or zinc
ligation. Intramolecular, electrostatic and hydrophobic interactions that stabilize helices,
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
reduced local conformational strain by specific amino acids and extended intramolecular
hydrogen-bonding networks are more common features. Thermophilic Fds appear to have
more balanced electrostatic potential at the protein surface, a reduced total cavity volume and
a reduced accessible surface area [116] compared to their mesophilic homologs. The
difference in surface charge distribution may be particularly significant: mesophilic Fds are
very acidic and the surface distribution of Asp and Glu residues is relatively well conserved,
enabling recognition by, and interaction with, electron acceptors [116]. The less negative
surface potential and more balanced surface charge distribution of thermostable Fds may be
advantageous with respect to stability [116] but could limit interaction with P450s. It has
been proposed that the reduced attraction between redox partners in thermophilic systems can
be compensated to some degree by the faster diffusion of the proteins at elevated
temperature, under which conditions the viscosity of the medium is reduced [116].
Accordingly, a potential redox partner for thermostable P450s may not need to be as specific
with respect to their interaction with the P450 compared to redox systems that operate at
lower temperatures. This gives hope for the identification of heterologous thermostable redox
systems that interact with thermostable P450s at enhanced temperatures.
Implications for the engineering of P450 systems
The crystal structures of P450s from thermophilic organisms provide insight into what makes
a P450 thermostable, and the minimum structural elements required to retain the P450 fold.
While some generalizations can be made, there appear to be many alternative mechanisms
within P450 sequence space to generate a thermostable enzyme. Overall, naturally
thermophilic P450s are shorter than those obtained from mesophilic organisms, with large N-
terminal truncations and fewer residues in surface turns and loops which may reduce the
conformational entropy. The fact that these proteins remain active suggests that P450
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
function is not dependent on the presence of these regions. Thermophilic P450s also tend to
have a greater proportion of charged residues and fewer polar uncharged residues in
comparison to mesophilic forms. They also typically have more salt-bridged networks, and
may have extensive aromatic networks, and/or a more rigid active site that can still undergo
the conformational changes required for substrate binding through the unravelling of the F-
and G-helices. The Cys-ligand loop may also be stabilized by hydrophobic interactions. A
greater incidence of internal electrostatic and H-bonding interactions, or an increased ratio of
Arg:Lys residues may also augment stability. However, an important caveat to all these
observations is that they are made based on only a very few, select structures.
As more CYPomes are characterized from thermophilic organisms, we are likely to gain
further insight into how these proteins manage to survive under harsh conditions. Many
putative, thermostable enzymes are emerging from genome sequencing of known
thermophiles and metagenomic sequencing of microbiota sourced from hot environments,
both in terms of P450s and their cognate redox partners. These should provide ample starting
material for the development of biocatalysts that are stable under industrial conditions.
However, the functional characterization of these proteins is the rate-limiting step in their
exploitation.
Directed (or artificial) evolution studies have not been as useful as naturally thermostable
P450s in suggesting means by which to stabilize the P450 fold. Effectively, the sequence
space that has been surveyed to date by directed evolution has been limited to that which is
close to the mesophilic parents used in such experiments. The problem of deconvoluting the
effects of multiple substitutions means modelling and higher-order statistical analyses are
essential to interpreting data from directed evolution. Moreover, while such studies have the
opportunity to introduce a large number of changes into a P450 simultaneously, this has not
yet been done with the aim of testing hypotheses, e.g., assessing whether altering the balance
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
of charged to polar uncharged, or the proportion of hydrophobic residues, improves the
thermostability of P450s derived from mesophiles. Random and rational mutagenesis of
thermophilic P450s, with the aim of decreasing rather than increasing stability, may be more
useful in revealing which residues, interactions and structural features are important
determinants of thermostability.
From the analysis of the few, naturally thermostable forms that have been characterized
structurally, it is possible to suggest some possible “design heuristics” to employ in
stabilizing P450s. Shortening mesophilic proteins at the N-terminus, in the β1-1/β1-2 hairpin
loop and in the β5 turn might be expected to be beneficial, along with shortening the G-helix
by 1-3 turns. Altering the balance of amino acids to include more charged and hydrophobic
residues, at the expense of noncharged, polar and Ala residues respectively, might also
augment stability. Much more challenging modifications could include increasing the number
of complex, salt-bridged networks (i.e., involving 3 or more residues) and introducing
lengthy stacked arrays of aromatic side chains. Truncation of specific regions of the fold
would be a straightforward strategy involving the construction and screening of relatively
few mutants. However, altering the proportion of particular amino acids or attempting to
develop salt-bridged networks or aromatic stacks would be best done using a directed
evolution approach involving the creation and screening of many mutants, given the limited
understanding of structure-function relationships of most P450s and their conformational
dynamics in particular. Fortunately, it is possible to screen hundreds of mutants for the
residual folded P450 after a heat treatment by measuring Fe(II).CO vs Fe(II) difference
spectra in whole cells in microplate format [144, 145].
A major hurdle for the implementation of existing and engineered thermostable P450s in
biocatalysis remains the choice of redox system. However, the ability of CYP119 and
CY175A1 both to interact with the Pdx from Pseudomonas putida [18, 24] underscores the
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
possible interchangeability of redox partners. It appears unlikely that diflavin CPRs (class II
systems) will turn out to be as thermostable overall as Fd/FdR (class I) systems, given the
failure to identify a diflavin reductase in a thermophilic organism to date. One promising
opportunity for the identification of novel thermostable redox systems could be BLAST
searches using the sequences of mesophilic redox partners that are known to interact with a
thermostable P450 from a given thermophilic P450 organism. Alternative options, such as the
linking of a thermostable P450 to a Fd and photosystem I [146] might provide future
opportunities, considering that photosystem I complexes from thermophilic organisms are
available [147-150] and have already successfully been applied in other fields, such as the
light-induced generation of hydrogen in artificial systems [148].
The observation that CYP119A2 can catalyze monooxygenations in the absence of a redox
partner, possibly due to the shorter 5-turn that is a structural feature shared with another
self-sufficient monooxygenase, CYP55, raises the intriguing possibility that P450s could be
engineered to use NAD(P)H directly, obviating the need for a thermostable electron transport
chain.
In summary, while much has been gleaned from the study of thermostable P450 systems to
date, much more remains to be established before we can claim to understand the basis to
thermostability in the P450 fold. However, the combination of genome sequence, data
mining, directed evolution, modelling and high throughput functional analysis should
facilitate this objective and enable P450s to be exploited more fully in the design and
development of biocatalysts that are stable under industrial conditions.
Abbreviations
ABTS, (2,2′-azino-bis(3-ethylbenzothiazoline-6-sulfonic acid; 4-BPI, 4-(4-bromophenyl)-
1H-imidazole; 4-CPI, 4-(4-chlorophenyl))-1H-imidazole; CPO, chloroperoxidase; CPR,
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
NADPH-cytochrome P450 reductase; Fd, ferredoxin; FdR, ferredoxin-NADPH reductase; 4-
FPI, 4-(4-fluorophenyl))-1H-imidazole; 4-PI, 4-phenylimidazole; LUCA, last universal
common ancestor; MAD, multiple wavelength anomalous dispersion; MD, Molecular
dynamics; MeOF, [13C]p-methoxyphenylalanine; NOR, nitric oxide reductase; ORF, open
reading frame; P450, cytochrome P450, heme-thiolate protein P450; RMSD, root mean
squared deviation; T50, temperature at which half the protein is denatured; 10T50, the
temperature at which half the protein remains intact after heating for 10 minutes; Tm, melting
temperature; WT, wild type.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Acknowledgments
This research was supported by Australian Research Council Grant DP160100865, Australian
Postgraduate Research Awards to KLH and REST and an International Postgraduate
Research Scholarship to SJS. The authors gratefully acknowledge the assistance of Julian
Zaugg with the analysis of salt bridge networks in the P450 structures. Thanks are extended
to Prof. P. Hugenholtz for access to the bacterial genome database of the Australian Centre
for Ecogenomics.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
References
[1] J. Rittle, M.T. Green, Cytochrome P450 compound I: Capture, characterization, and C-H
bond activation kinetics, Science, 330 (2010) 933-937.
[2] C.R. Nishida, P.R. Ortiz de Montellano, Thermophilic cytochrome P450 enzymes,
Biochem. Biophys. Res. Comm., 338 (2005) 437–445.
[3] J.S. Bowman, J.W. Deming, Alkane hydroxylase genes in psychrophile genomes and the
potential for cold active catalysis, Bmc Genomics, 15 (2014) 11.
[4] D. Mehta, T. Satyanarayana, Diversity of hot environments and thermophilic microbes,
in: T. Satyanarayana, J. Littlechild, Y. Kawarabayasi (Eds.) Thermophilic microbes in
environmental and industrial biotechnology: Biotechnology of thermophiles, Springer
Netherlands, Dordrecht, 2013, pp. 3-60.
[5] S.R. Miller, C. Williams, A.L. Strong, D. Carvey, Ecological Specialization in a Spatially
Structured Population of the Thermophilic Cyanobacterium Mastigocladus laminosus, Appl.
Environ. Microbiol., 75 (2009) 729-734.
[6] W.P. Inskeep, Z.J. Jay, S.G. Tringe, M.J. Herrgard, D.B. Rusch, Y.N.P.M.P.S. Co, The
YNP metagenome project: environmental parameters responsible for microbial distribution in
the Yellowstone geothermal ecosystem, Front. Microbiol., 4 (2013).
[7] A. Lewin, A. Wentzel, S. Valla, Metagenomics of microbial life in extreme temperature
environments, Curr. Opin. Biotech., 24 (2013) 516-525.
[8] P. Menzel, S.R. Gudbergsdottir, A.G. Rike, L.B. Lin, Q. Zhang, P. Contursi, M. Moracci,
J.K. Kristjansson, B. Bolduc, S. Gavrilov, N. Ravin, A. Mardanov, E. Bonch-Osmolovskaya,
M. Young, A. Krogh, X. Peng, Comparative Metagenomics of Eight Geographically Remote
Terrestrial Hot Springs, Microb. Ecol., 70 (2015) 411-424.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[9] R.L. Wright, K. Harris, B. Solow, R.H. White, P.J. Kennelly, Cloning of a potential
cytochrome P450 from the archaeon Sulfolobus solfataricus, FEBS Lett., 384 (1996) 235-
239.
[10] T. Suzuki, T. Iwasaki, T. Uzawa, K. Hara, N. Nemoto, T. Kon, T. Ueki, A. Yamagishi,
T. Oshima, Sulfolobus tokodaii sp. nov. (f. Sulfolobus sp. strain 7), a new member of the
genus Sulfolobus isolated from Beppu Hot Springs, Japan, Extremophiles, 6 (2002) 39-44.
[11] J.K. Yano, F. Blasco, H. Li, R.D. Schmid, A. Henne, T.L. Poulos, Preliminary
characterization and crystal structure of a thermostable cytochrome P450 from Thermus
thermophilus, J. Biol. Chem., 278 (2003) 608-616.
[12] W.W. Ho, H. Li, C.R. Nishida, P.R. de Montellano, T.L. Poulos, Crystal structure and
properties of CYP231A2 from the thermoacidophilic archaeon Picrophilus torridus,
Biochemistry, 47 (2008) 2071-2079.
[13] A. Schallmey, G. den Besten, I.G.P. Teune, R.F. Kembaren, D.B. Janssen,
Characterization of cytochrome P450 monooxygenase CYP154H1 from the thermophilic soil
bacterium Thermobifida fusca, Appl. Microbiol. Biotech., 89 (2011) 1475-1485.
[14] K.S. Rabe, K. Kiko, C.M. Niemeyer, Characterization of the peroxidase activity of
CYP119, a thermostable P450 From Sulfolobus acidocaldarius, ChemBioChem, 9 (2008)
420-425.
[15] C. Zhang, P.X. Liu, L.Y. Huang, S.P. Wei, L. Wang, S.Y. Yang, X.Q. Yu, L. Pu, Q.
Wang, Engineering P450 peroxygenase to catalyze highly enantioselective epoxidation of
cis-beta-methylstyrenes, Chem. - Eur. J., 22 (2016) 10969-10975.
[16] C. Zhang, J. Li, B. Yang, F. He, S.Y. Yang, X.Q. Yu, Q. Wang, Enhanced turnover rate
and enantioselectivity in the asymmetric epoxidation of styrene by new T213G mutants of
CYP 119, RSC Adv., 4 (2014) 27526-27531.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[17] Y.R. Lim, C.Y. Eun, H.G. Park, S. Han, J.S. Han, K.S. Cho, Y.J. Chun, D. Kim,
Regioselective Oxidation of Lauric Acid by CYP119, an Orphan Cytochrome P450 from
Sulfolobus acidocaldarius, J. Microbiol. Biotechnol., 20 (2010) 574-578.
[18] L.S. Koo, C.E. Immoos, M.S. Cohen, P.J. Farmer, P.R.O. de Montellano, Enhanced
electron transfer and lauric acid hydroxylation by site-directed mutagenesis of CYP119, J.
Am. Chem. Soc., 124 (2002) 5684-5691.
[19] E. Blair, J. Greaves, P.J. Farmer, High-temperature electrocatalysis using thermophilic
P450 CYP119: Dehalogenation of CCl4 to CH4, J. Am. Chem. Soc., 126 (2004) 8632-8633.
[20] C.E. Immoos, J. Chou, M. Bayachou, E. Blair, J. Greaves, P.J. Farmer, Electrocatalytic
reductions of nitrite, nitric oxide, and nitrous oxide by thermophilic cytochrome
P450CYP119 in film-modified electrodes and an analytical comparison of its catalytic
activities with myoglobin, J. Am. Chem. Soc., 126 (2004) 4934-4942.
[21] S. Hayakawa, H. Matsumura, N. Nakamura, M. Yohda, H. Ohno, Identification of the
rate-limiting step of the peroxygenase reactions catalyzed by the thermophilic cytochrome
P450 from Sulfolobus tokodaii strain 7, FEBS J., 281 (2014) 1409-1416.
[22] S. Goyal, S. Banerjee, S. Mazumdar, Oxygenation of monoenoic fatty acids by
CYP175A1, an orphan cytochrome P450 from Thermus thermophilus HB27, Biochemistry,
51 (2012) 7880-7890.
[23] F. Blasco, I. Kauffmann, R.D. Schmid, CYP175A1 from Thermus thermophilus HB27,
the first beta-carotene hydroxylase of the P450 superfamily, Appl. Microbiol. Biotech., 64
(2004) 671-674.
[24] K. Momoi, U. Hofmann, R.D. Schmid, V.B. Urlacher, Reconstitution of β-carotene
hydroxylase activity of thermostable CYP175A1 monooxygenase, Biochem. Biophys. Res.
Comm., 339 (2006) 331-336.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[25] T. Mandai, S. Fujiwara, S. Imaoka, Construction and engineering of a thermostable self-
sufficient cytochrome P450, Biochem. Biophys. Res. Comm., 384 (2009) 61-65.
[26] S. Banerjee, S. Goyal, S. Mazumdar, Role of substituents on the reactivity and product
selectivity in reactions of naphthalene derivatives catalyzed by the orphan thermostable
cytochrome P450, CYP175A1, Bioorg. Chem., 62 (2015) 94-105.
[27] R.K. Behera, S. Goyal, S. Mazumdar, Modification of the heme active site to increase
the peroxidase activity of thermophilic cytochrome P450: A rational approach, J. Inorg.
Biochem., 104 (2010) 1185-1194.
[28] Y. Suzuki, T. Kishigami, K. Inoue, Y. Mizoguchi, N. Eto, M. Takagi, S. Abe, Bacillus
thermoglucosidasius sp. nov., a new species of obligately thermophilic bacilli, System. Appl.
Microbiol., 4 (1983) 487-495.
[29] O. Sideso, R.A.D. Williams, S.G. Welch, K.E. Smith, Progesterone 6-hydroxylation is
catalysed by cytochrome P-450 in the moderate thermophile Bacillus thermoglucosidasius
strain 12060, J. Steroid Biochem. Mol. Biol., 67 (1998) 163-169.
[30] S. Al-Awadi, M. Afzal, S. Oommen, Studies on Bacillus stearothermophilus. Part III.
Transformation of testosterone, 62 (2003) 48-52.
[31] S. Hashizume, T. Sekiguchi, Y. Nosoh, Effect of temperature on the viability of Bacillus
stearothermophilus, Arch. Microbiol., 107 (1976) 75-80.
[32] M. Jack, Studies on the ecology of thermophilic fungi in sun-heated soils, in, ProQuest
Dissertations Publishing, 1976.
[33] A. von Klopotek, Revision der thermophilen sporotrichum-arten: Chrysosporium
thermophilum (apinis) comb. Nov. Und chrysosporium fergusii spec. Nov. = status conidialis
von corynascus thermophilus (fergus und sinden) comb. Nov, Arch. Microbiol., 98 (1974)
365-369.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[34] K. Syed, K. Shale, K.H.M.N.H. Nazir, N. Krasevec, S.S. Mashele, N.S. Pagadala,
Genome-wide identification, annotation and characterization of novel thermostable
cytochrome P450 monooxygenases from the thermophilic biomass-degrading fungi Thielavia
terrestris and Myceliophthora thermophila, Genes Genom., 36 (2014) 321-333.
[35] Y. García-Huante, M. Cayetano-Cruz, A. Santiago-Hernández, C. Cano-Ramírez, R.
Marsch-Moreno, J.E. Campos, G. Aguilar-Osorio, C.G. Benitez-Cardoza, S. Trejo-Estrada,
M.E. Hidalgo-Lara, The thermophilic biomass-degrading fungus Thielavia terrestris
Co3Bag1 produces a hyperthermophilic and thermostable β-1,4-xylanase with exo- and endo-
activity, Extremophiles, 21 (2017) 175-186.
[36] K.O. Stetter, Hyperthermophilic procaryotes, FEMS Microbiol. Rev., 18 (1996) 149-
158.
[37] B.H. Toomey, S. Bello, M.E. Hahn, S. Cantrell, P. Wright, D.E. Tillitt, R.T. Di Giulio,
2,3,7,8-Tetrachlorodibenzo-p-dioxin induces apoptotic cell death and cytochrome P4501A
expression in developing Fundulus heteroclitus embryos, Aquat. Toxicol., 53 (2001) 127-
138.
[38] D.R. Nelson, Cytochrome P450 and the individuality of species, Arch. Biochem.
Biophys., 369 (1999) 1-10.
[39] J.K. Yano, L.S. Koo, D.J. Schuller, H. Li, P.R. Ortiz de Montellano, T.L. Poulos, Crystal
structure of a thermophilic cytochrome P450 from the archaeon Sulfolobus solfataricus, J.
Biol. Chem., 275 (2000) 31086-31092.
[40] Y. Oku, A. Ohtaki, S. Kamitori, N. Nakamura, M. Yohda, H. Ohno, Y. Kawarabayasi,
Structure and direct electrochemistry of cytochrome P450 from the thermoacidophilic
crenarchaeon, Sulfolobus tokodaii strain 7, J. Inorg. Biochem., 98 (2004) 1194-1199.
[41] D.R. Nelson, The cytochrome P450 homepage, Human Genomics, 4 (2009) 59-65.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[42] T.D. Brock, K.M. Brock, R.T. Belly, R.L. Weiss, Sulfolobus: A new genus of sulfur-
oxidizing bacteria living at low pH and high temperature, Archiv für Mikrobiol., 84 (1972)
54-68.
[43] S.A. Maves, S.G. Sligar, Understanding thermostability in cytochrome P450 by
combinatorial mutagenesis, Protein Sci, 10 (2001) 161-168.
[44] M.A. McLean, S.A. Maves, K.E. Weiss, S. Krepich, S.G. Sligar, Characterization of a
cytochrome P450 from the acidothermophilic archaea Sulfolobus solfataricus, Biochem.
Biophys. Res. Comm., 252 (1998) 166-172.
[45] R.A. Tschirret-Guth, L.S. Koo, G. Hui Bon Hoa, P.R. Ortiz de Montellano, Reversible
pressure deformation of a thermophilic cytochrome P450 enzyme (CYP119) and its active-
site mutants, J. Am. Chem. Soc., 123 (2001) 3412-3417.
[46] G. Hui Bon Hoa, C. Di Primo, I. Dondaine, S.G. Sligar, I.C. Gunsalus, P. Douzou,
Conformation changes of cytochromes P-450cam and P-450lin induced by high pressure,
Biochemistry, 28 (1989) 651-656.
[47] S.Y. Park, K. Yamane, S. Adachi, Y. Shiro, K.E. Weiss, S.A. Maves, S.G. Sligar,
Thermophilic cytochrome P450 (CYP119) from Sulfolobus solfataricus: high resolution
structure and functional properties, J. Inorg. Biochem., 91 (2002) 491-501.
[48] D. Basudhar, Y. Madrona, S. Kandel, J.N. Lampe, C.R. Nishida, P.R.O. de Montellano,
Analysis of cytochrome P450 CYP119 ligand-dependent conformational dynamics by two-
dimensional nmr and x-ray crystallography, J. Biol. Chem., 290 (2015) 10000-10017.
[49] I.F. Sevrioukova, H. Li, H. Zhang, J.A. Peterson, T.L. Poulos, Structure of a cytochrome
P450–redox partner electron-transfer complex, Proc. Natl. Acad. Sci. USA, 96 (1999) 1863-
1868.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[50] S.-Y. Park, H. Shimizu, S.-i. Adachi, A. Nakagawa, I. Tanaka, K. Nakahara, H. Shoun,
E. Obayashi, H. Nakamura, T. lizuka, Y. Shiro, Crystal structure of nitric oxide reductase
from denitrifying fungus Fusarium oxysporum, Nat. Struct. Mol. Biol., 4 (1997) 827-832.
[51] H. Matsumura, K. Matsuda, N. Nakamura, A. Ohtaki, H. Yoshida, S. Kamitori, M.
Yohda, H. Ohno, Monooxygenation by a thermophilic cytochrome P450 via direct electron
donation from NADH, Metallomics, 3 (2011) 389-395.
[52] R. Brandman, J.N. Lampe, Y. Brandman, P.R.O. de Montellano, Active-site residues
move independently from the rest of the protein in a 200 ns molecular dynamics simulation
of cytochrome P450 CYP119, Arch. Biochem. Biophys., 509 (2011) 127-132.
[53] H. Li, T.L. Poulos, The structure of the cytochrome P450BM-3 haem domain complexed
with the fatty acid substrate, palmitoleic acid, Nat. Struct. Mol. Biol., 4 (1997) 140-146.
[54] Y.-T. Chang, G.H. Loew, Molecular dynamics simulations of P450 BM3; examination
of substrate-induced conformational change, J. Biomol. Struct. Dyn., 16 (1999) 1189-1203.
[55] E.E. Scott, M.A. White, Y.A. He, E.F. Johnson, C.D. Stout, J.R. Halpert, Structure of
mammalian cytochrome P4502B4 complexed with 4-(4-chlorophenyl) imidazole at 1.9-
angstrom resolution - Insight into the range of P450 conformations and the coordination of
redox partner binding, J. Biol. Chem., 279 (2004) 27294-27301.
[56] M. Ekroos, T. Sjogren, Structural basis for ligand promiscuity in cytochrome P450 3A4,
Proc. Natl. Acad. Sci. USA, 103 (2006) 13682-13687.
[57] I. Schlichting, J. Berendzen, K. Chu, A.M. Stock, S.A. Maves, D.E. Benson, B.M.
Sweet, D. Ringe, G.A. Petsko, S.G. Sligar, The catalytic pathway of cytochrome P450cam at
atomic resolution, Science, 287 (2000) 1615-1622.
[58] Y. Yoshigae, U.M. Kent, P.F. Hollenberg, The role of the highly conserved threonine in
cytochrome P450 2E1: Prevention of H2O2-induced inactivation during electron transfer,
Biochemistry, 52 (2013) 4636-4647.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[59] L.S. Koo, R.A. Tschirret-Guth, W.E. Straub, P. Moënne-Loccoz, T.M. Loehr, P.R. Ortiz
de Montellano, The active site of the thermophilic CYP119 from sulfolobus solfataricus, J.
Biol. Chem., 275 (2000) 14112-14123.
[60] J.A. McIntosh, T. Heel, A.R. Buller, L. Chio, F.H. Arnold, Structural adaptability
facilitates histidine heme ligation in a cytochrome P450, J. Am. Chem. Soc., 137 (2015)
13861-13865.
[61] Y.T. Meharenna, T.L. Poulos, Using molecular dynamics to probe the structural basis
for enhanced stability in thermal stable cytochromes P450, Biochemistry, 49 (2010) 6680-
6686.
[62] Y.-T. Chang, G. Loew, Homology modeling, molecular dynamics simulations, and
analysis of CYP119, a P450 enzyme from extreme acidothermophilic archaeon sulfolobus
solfataricus, Biochemistry, 39 (2000) 2484-2498.
[63] A.V. Puchkaev, L.S. Koo, P.R. Ortiz de Montellano, Aromatic stacking as a determinant
of the thermal stability of CYP119 from Sulfolobus solfataricus, Arch. Biochem. Biophys.,
409 (2003) 52-58.
[64] H. Matsumura, N. Nakamura, M. Yohda, H. Ohno, The electrochemical properties of
thermophilic cytochrome P450 CYP119A2 at extremely high temperatures in poly(ethylene
oxide), Electrochem. Commun., 9 (2007) 361-364.
[65] T. Oshima, K. Imahori, Description of Thermus thermophilus (Yoshida and Oshima)
comb. Nov., a nonsporulating thermophilic bacterium from a Japanese thermal spa, Int. J.
System. Evol. Microbiol., 24 (1974) 102-112.
[66] T. Oshima, K. Imahori, Isolation of an extreme thermophile and thermostability of its
transfer ribonucleic acid and ribosomes, J. Gen. Appl. Microbiol., 17 (1971) 513-517.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[67] S. Hayakawa, H. Matsumura, N. Nakamura, M. Yohda, H. Ohno, Spectroscopic
characterization of the acid–alkaline transition of a thermophilic cytochrome P450, FEBS
Lett., 587 (2013) 94-97.
[68] H.M. Girvan, H.E. Seward, H.S. Toogood, M.R. Cheesman, D. Leys, A.W. Munro,
Structural and spectroscopic characterization of P450 BM3 mutants with unprecedented P450
heme iron ligand sets: New heme ligation states influence conformational equilibria in P450
BM3, J. Biol. Chem., 282 (2007) 564-572.
[69] C. Vieille, G.J. Zeikus, Hyperthermophilic enzymes: Sources, uses, and molecular
mechanisms for thermostability, Microbiol. Mol. Biol. Rev., 65 (2001) 1-43.
[70] C. Cambillau, J.-M. Claverie, Structural and genomic correlates of hyperthermostability,
J. Biol. Chem., 275 (2000) 32383-32386.
[71] R.K. Behera, S. Mazumdar, Thermodynamic basis of the thermostability of CYP175A1
from Thermus thermophilus, Intl. J. Biol. Macromol., 46 (2010) 412-418.
[72] O. Fütterer, A. Angelov, H. Liesegang, G. Gottschalk, C. Schleper, B. Schepers, C.
Dock, G. Antranikian, W. Liebl, Genome sequence of Picrophilus torridus and its
implications for life around pH 0, Proc. Natl. Acad. Sci. USA, 101 (2004) 9091-9096.
[73] M. Lubben, G. Schafer, Chemiosmotic energy conversion of the archaebacterial
thermoacidophile Sulfolobus acidocaldarius: oxidative phosphorylation and the presence of
an F0-related N,N'-dicyclohexylcarbodiimide-binding proteolipid, J. Bacteriol., 171 (1989)
6106-6116.
[74] P. Argos, M.G. Rossmann, U.M. Grau, H. Zuber, G. Frank, J.D. Tratschin, Thermal
stability and protein structure, Biochemistry, 18 (1979) 5698-5703.
[75] S. Fukuchi, K. Nishikawa, Protein surface amino acid compositions distinctively differ
between thermophilic and mesophilic bacteria, J. Mol. Biol., 309 (2001) 835-843.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[76] S. Kumar, C.J. Tsai, R. Nussinov, Factors enhancing protein thermostability, Protein
Eng., 13 (2000) 179-191.
[77] A. Szilagyi, P. Zavodszky, Structural differences between mesophilic, moderately
thermophilic and extremely thermophilic protein subunits: results of a comprehensive survey,
Structure, 8 (2000) 493-504.
[78] A.H. Elcock, The stability of salt bridges at high temperatures: implications for
hyperthermophilic proteins, J. Mol. Biol., 284 (1998) 489-502.
[79] C.-H. Chan, T.-H. Yu, K.-B. Wong, Stabilizing salt-bridge enhances protein
thermostability by reducing the heat capacity change of unfolding, PLoS ONE, 6 (2011)
e21624.
[80] L.B. Jónsdóttir, B.Ö. Ellertsson, G. Invernizzi, M. Magnúsdóttir, S.H. Thorbjarnardóttir,
E. Papaleo, M.M. Kristjánsson, The role of salt bridges on the temperature adaptation of
aqualysin I, a thermostable subtilisin-like proteinase, BBA-Proteins Proteomics, 1844 (2014)
2174-2181.
[81] Z.S. Hendsch, B. Tidor, Do salt bridges stabilize proteins? A continuum electrostatic
analysis, Protein Sci, 3 (1994) 211-226.
[82] Z.S. Hendsch, T. Jonsson, R.T. Sauer, B. Tidor, Protein stabilization by removal of
unsatisfied polar groups: Computational approaches and experimental tests, Biochemistry,
35 (1996) 7621-7625.
[83] B. Folch, M. Rooman, Y. Dehouck, Thermostability of salt bridges versus hydrophobic
interactions in proteins probed by statistical potentials, J. Chem. Inf. Model., 48 (2008) 119-
127.
[84] P.C. Cirino, F.H. Arnold, A self sufficient peroxide driven hydroxylation biocatalyst,
Angew. Chem. Int. Ed., 42 (2003) 3299-3301.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[85] P.C. Cirino, F.H. Arnold, Regioselectivity and activity of cytochrome P450 BM-3 and
mutant F87A in reactions driven by hydrogen peroxide, Adv. Synth. Catal., 344 (2002) 932-
937.
[86] O. Salazar, P.C. Cirino, F.H. Arnold, Thermostabilization of a cytochrome P450
peroxygenase, ChemBioChem, 4 (2003) 891-893.
[87] M.W. Peters, P. Meinhold, A. Glieder, F.H. Arnold, Regio- and enantioselective alkane
hydroxylation with engineered cytochromes P450 BM-3, J. Am. Chem. Soc., 125 (2003)
13442-13450.
[88] J.C. Lewis, S.M. Mantovani, Y. Fu, C.D. Snow, R.S. Komor, C.H. Wong, F.H. Arnold,
Combinatorial alanine substitution enables rapid optimization of cytochrome P450(BM3) for
selective hydroxylation of large substrates, ChemBiochem, 11 (2010) 2502-2505.
[89] R. Fasan, M.M. Chen, N.C. Crook, F.H. Arnold, Engineered alkane-hydroxylating
cytochrome P450(BM3) exhibiting nativelike catalytic properties, Angew Chem Int Ed, 46
(2007) 8414-8418.
[90] M.G. Shapiro, G.G. Westmeyer, P.A. Romero, J.O. Szablowski, B. Kuster, A. Shah,
C.R. Otey, R. Langer, F.H. Arnold, A. Jasanoff, Directed evolution of a magnetic resonance
imaging contrast agent for noninvasive imaging of dopamine, Nat. Biotech., 28 (2010) 264-
U120.
[91] J. Bloom, S. Labthavikul, C. Otey, F. Arnold, Protein stability promotes evolvability,
Proc. Natl. Acad. Sci. USA, 103 (2006) 5869-5874.
[92] K.G. Ravichandran, S.S. Boddupalli, C.A. Hasemann, J.A. Peterson, J. Deisenhofer,
Crystal structure of hemoprotein domain of P450 BM-3, a prototype for microsomal P450s,
Science, 261 (1993) 731-736.
[93] C.A. Voigt, C. Martinez, Z.G. Wang, S.L. Mayo, F.H. Arnold, Protein building blocks
preserved by recombination, Nat. Struct. Biol., 9 (2002) 553-558.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[94] C.R. Otey, J.J. Silberg, C.A. Voigt, J.B. Endelman, G. Bandara, F.H. Arnold, Functional
evolution and structural conservation in chimeric cytochromes P450: Calibrating a structure-
guided approach, Chem. Biol., 11 (2004) 309-318.
[95] C.R. Otey, M. Landwehr, J.B. Endelman, K. Hiraga, J.D. Bloom, F.H. Arnold,
Structure-guided recombination creates an artificial family of cytochromes P450, PLoS.
Biol., 4 (2006) e112.
[96] Y.G. Li, D.A. Drummond, A.M. Sawayama, C.D. Snow, J.D. Bloom, F.H. Arnold, A
diverse family of thermostable cytochrome P450s created by recombination of stabilizing
fragments, Nat. Biotech., 25 (2007) 1051-1056.
[97] P.A. Romero, A. Krause, F.H. Arnold, Navigating the protein fitness landscape with
Gaussian processes, Proc. Natl. Acad. Sci. USA, 110 (2013) E193-E201.
[98] S. Kumar, Y. Zhao, L. Sun, S.S. Negi, J.R. Halpert, B.K. Muralidhara, Rational
engineering of human cytochrome P450 2B6 for enhanced expression and stability:
Importance of a Leu(264)-> Phe substitution, Mol. Pharmacol., 72 (2007) 1191-1199.
[99] J.C. Talakad, P.R. Wilderman, D.R. Davydov, S. Kumar, J.R. Halpert, Rational
engineering of cytochromes P450 2B6 and 2B11 for enhanced stability: Insights into
structural importance of residue 334, Arch. Biochem. Biophys., 494 (2010) 151-158.
[100] S. Kumar, L. Sun, H. Liu, B. Muralidhara, J. Halpert, Engineering mammalian
cytochrome P450 2B1 by directed evolution for enhanced catalytic tolerance to temperature
and dimethyl sulfoxide, Protein Eng. Des. Sel., 19 (2006) 547-554.
[101] F. Hannemann, A. Bichet, K.M. Ewen, R. Bernhardt, Cytochrome P450 systems -
biological variations of electron transport chains, Biochim. Biophys. Acta, 1770 (2007) 330-
344.
[102] A. Dalber, H. Shoun, V. Ullrich, Nitric oxide reductase (P450(nor)) from Fusarium
oxysporum, J. Inorg. Biochem., 99 (2005) 185-193.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[103] R. Oshima, S. Fushinobu, F. Su, L. Zhang, N. Takaya, H. Shoun, Structural evidence
for direct hydride transfer from NADH to cytochrome P450nor, J. Mol. Biol., 342 (2004)
207-217.
[104] L.H. Wang, R.J. Kulmacz, Thromboxane synthase: structure and function of protein
and gene, Prostaglandins Other Lipid Mediat, 68-69 (2002) 409-422.
[105] A.N. Grechkin, Hydroperoxide lyase and divinyl ether synthase, Prostaglandins Other
Lipid Mediat., 68–69 (2002) 457-470.
[106] N. Tijet, A.R. Brash, Allene oxide synthases and allene oxides, 68-9 (2002) 423-431.
[107] I. Matsunaga, E. Kusunose, I. Yano, K. Ichihara, Separation and partial
characterization of soluble fatty acid alpha-hydroxylase from Sphingomonas paucimobilus,
Biochem. Biophys. Res. Comm., 201 (1994) 1554-1560.
[108] I. Matsunaga, A. Ueda, N. Fujiwara, T. Sumimoto, K. Ichihara, Characterization of the
ybdT gene product of Bacillus subtilis: novel fatty acid beta-hydroxylating cytochrome P450,
Lipids, 34 (1999) 841-846.
[109] M. Girhard, S. Schuster, M. Dietrich, P. Durre, V.B. Urlacher, Cytochrome P450
monooxygenase from Clostridium acetobutylicum: a new alpha-fatty acid hydroxylase,
Biochem. Biophys. Res. Comm., 362 (2007) 114-119.
[110] G.A. Roberts, G. Grogan, A. Greter, S.L. Flitsch, N.J. Turner, Identification of a new
class of cytochrome P450 from a Rhodococcus sp, J. Bacteriol., 184 (2002) 3898-3908.
[111] A.J. Warman, O. Roitel, R. Neeli, H.M. Girvan, H.E. Seward, S.A. Murray, K.J.
McLean, M.G. Joyce, H. Toogood, R.A. Holt, D. Leys, N.S. Scrutton, A.W. Munro,
Flavocytochrome P450BM3: an update on structure and mechanism of a biotechnologically
important enzyme, Biochem Soc Trans, 33 (2005) 747-753.
[112] R. De Mot, A.H.A. Parret, A novel class of self-sufficient cytochrome P450
monooxygenases in prokaryotes, Trends Microbiol., 10 (2002) 502-508.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[113] S. Eiben, H. Bartelmas, V.B. Urlacher, Construction of a thermostable cytochrome
P450 chimera derived from self-sufficient mesophilic parents, Appl. Microbiol. Biotech., 75
(2007) 1055–1061.
[114] T. Mandai, S. Fujiwara, S. Imaoka, A novel electron transport system for thermostable
CYP175A1 from Thermus thermophilus HB27, FEBS J., 276 (2009) 2416-2429.
[115] S. Griffin, C.L. Higgins, T. Soulimane, P. Wittung-Stafshede, High thermal and
chemical stability of Thermus thermophilus seven-iron ferredoxin - Linear clusters form at
high pH on polypeptide unfolding, Eur. J. Biochem., 270 (2003) 4736-4743.
[116] S. Macedo-Ribeiro, B.M. Martins, P.J. Pereira, G. Buse, R. Huber, T. Soulimane, New
insights into the thermostability of bacterial ferredoxins: high-resolution crystal structure of
the seven-iron ferredoxin from Thermus thermophilus, J. Biol. Inorg. Chem., 6 (2001) 663-
674.
[117] Y.-S. Jung, V.A. Roberts, C.D. Stout, B.K. Burgess, Complex formation between
Azotobacter vinelandii ferredoxin I and its physiological electron donor NADPH-ferredoxin
reductase, J. Biol. Chem., 274 (1999) 2978-2987.
[118] T. Iwasaki, T. Wakagi, T. Oshima, Ferredoxin-dependent redox system of a
thermoacidophilic archaeon, Sulfolobus sp. strain 7. Purification and characterization of a
novel reduced ferredoxin-reoxidizing iron-sulfur flavoprotein, J. Biol. Chem., 270 (1995)
17878-17883.
[119] C.D. Stout, Refinement of the 7 Fe ferredoxin from Azotobacter vinelandii at 1.9 A
resolution, J. Mol. Biol., 205 (1989) 545-555.
[120] S. Aono, I. Bertini, J.A. Cowan, C. Luchinat, A. Rosato, M.S. Viezzoli, 1 H NMR
studies of the Fe 7 S 8 ferredoxin from Bacillus schlegelii: a further attempt to understand Fe
3 S 4 clusters, J. Biol. Inorg. Chem., 1 (1996) 523-528.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[121] R. Hensel, I. Jakob, H. Scheer, F. Lottspeich, Proteins from hyperthermophilic archaea:
stability towards covalent modification of the peptide chain, Biochemical Society
symposium, 58 (1992) 127-133.
[122] M. Bureik, B. Schiffler, Y. Hiraoka, F. Vogel, R. Bernhardt, Functional expression of
human mitochondrial CYP11B2 in fission yeast and identification of a new internal electron
transfer protein, etp1, Biochemistry, 41 (2002) 2311-2321.
[123] F. Kern, T.K.F. Dier, Y. Khatri, K.M. Ewen, J.-P. Jacquot, D.A. Volmer, R. Bernhardt,
Highly efficient CYP167A1 (EpoK) dependent epothilone B formation and production of 7-
Ketone epothilone D as a new epothilone derivative, Sci Rep, 5 (2015) 14881.
[124] K.M. Ewen, B. Schiffler, H. Uhlmann-Schiffler, R. Bernhardt, F. Hannemann, The
endogenous adrenodoxin reductase-like flavoprotein arh1 supports heterologous cytochrome
P450-dependent substrate conversions in Schizosaccharomyces pombe, FEMS Yeast Res., 8
(2008) 432-441.
[125] S. Brixius-Anderko, L. Schiffer, F. Hannemann, B. Janocha, R. Bernhardt, A
CYP21A2 based whole-cell system in Escherichia coli for the biotechnological production of
premedrol, Microb. Cell Factories, 14 (2015).
[126] S. Janocha, R. Bernhardt, Design and characterization of an efficient CYP105A1-based
whole-cell biocatalyst for the conversion of resin acid diterpenoids in permeabilized
Escherichia coli, Appl. Microbiol. Biotech., 97 (2013) 7639-7649.
[127] J.J. Muller, F. Hannemann, B. Schiffler, K.M. Ewen, R. Kappl, U. Heinemann, R.
Bernhardt, Structural and thermodynamic characterization of the adrenodoxin-like domain of
the electron-transfer protein Etp1 from Schizosaccharomyces pombe, J. Inorg. Biochem., 105
(2011) 957-965.
[128] A.V. Grinberg, F. Hannemann, B. Schiffler, J. Muller, U. Heinemann, R. Bernhardt,
Adrenodoxin: Structure, stability, and electron transfer properties, 40 (2000) 590-612.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[129] J.J. Müller, A. Lapko, G. Bourenkov, K. Ruckpaul, U. Heinemann, Adrenodoxin
reductase-adrenodoxin complex structure suggests electron transfer path in steroid
biosynthesis, J. Biol. Chem., 276 (2001) 2786-2789.
[130] C.N. Pace, U. Heinemann, U. Hahn, W. Saenger, Ribonuclease T1: structure, function,
and stability, Angew. Chem. Int. Ed., 30 (1991) 343-360.
[131] K.M. Armstrong, R.L. Baldwin, Charged histidine affects alpha-helix stability at all
positions in the helix by interacting with the backbone charges, Proc. Natl. Acad. Sci. USA,
90 (1993) 11337-11340.
[132] P.J. Haney, J.H. Badger, G.L. Buldak, C.I. Reich, C.R. Woese, G.J. Olsen, Thermal
adaptation analyzed by comparison of protein sequences from mesophilic and extremely
thermophilic Methanococcus species, Proc. Natl. Acad. Sci. USA, 96 (1999) 3578-3583.
[133] M.J. Thompson, D. Eisenberg, Transproteomic evidence of a loop-deletion mechanism
for enhancing protein thermostability, J. Mol. Biol., 290 (1999) 595-604.
[134] M. Milhim, A. Gerber, J. Neunzig, F. Hannemann, R. Bernhardt, A Novel NADPH-
dependent flavoprotein reductase from Bacillus megaterium acts as an efficient cytochrome
P450 reductase, J. Biotechnol., 231 (2016) 83-94.
[135] M. Milhim, N. Putkaradze, A. Abdulmughni, F. Kern, P. Hartz, R. Bernhardt,
Identification of a new plasmid-encoded cytochrome P450 CYP107DY1 from Bacillus
megaterium with a catalytic activity towards mevastatin, J. Biotechnol., 240 (2016) 68-75.
[136] G.-Y. Lee, H.M. Kim, S.H. Ma, S.H. Park, Y.H. Joung, C.-H. Yun, Heterologous
expression and functional characterization of the NADPH-cytochrome P450 reductase from
Capsicum annuum, Plant Physiol. Biochem., 82 (2014) 116-122.
[137] A.V. Puchkaev, P.R. Ortiz de Montellano, The Sulfolobus solfataricus electron donor
partners of thermophilic CYP119: an unusual non-NAD(P)H-dependent cytochrome P450
system, Arch. Biochem. Biophys., 434 (2005) 169-177.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[138] A.V. Puchkaev, T. Wakagi, P.R. Ortiz de Montellano, CYP119 plus a Sulfolobus
tokodaii strain 7 ferredoxin and 2-oxoacid: ferredoxin oxidoreductase constitute a high-
temperature cytochrome P450 catalytic system, J. Am. Chem. Soc., 124 (2002) 12682-12683.
[139] Q. Zhang, T. Iwasaki, T. Wakagi, T. Oshima, 2-oxoacid: ferredoxin oxidoreductase
from the thermoacidophilic archaeon, Sulfolobus sp. strain 7, J. Biochem., 120 (1996) 587-
599.
[140] Z. Yan, A. Maruyama, T. Arakawa, S. Fushinobu, T. Wakagi, Crystal structures of
archaeal 2-oxoacid: ferredoxin oxidoreductases from Sulfolobus tokodaii, Sci. Rep., 6 (2016)
33061.
[141] K. Kojoh, H. Matsuzawa, T. Wakagi, Zinc and an N-terminal extra stretch of the
ferredoxin from a thermoacidophilic archaeon stabilize the molecule at high temperature, Eur
J Biochem, 264 (1999) 85-91.
[142] T. Fujii, Y. Hata, M. Oozeki, H. Moriyama, T. Wakagi, N. Tanaka, T. Oshima, The
crystal structure of zinc-containing ferredoxin from the thermoacidophilic archaeon
Sulfolobus sp. strain 7, Biochemistry, 36 (1997) 1505-1513.
[143] C.M. Gomes, A. Faria, J.C. Carita, J. Mendes, M. Regalla, P. Chicau, H. Huber, K.O.
Stetter, M. Teixeira, Di-cluster, seven-iron ferredoxins from hyperthermophilic Sulfolobales,
J. Biol. Inorg. Chem., 3 (1998) 499-507.
[144] C.R. Otey, High-throughput carbon monoxide binding assay for cytochromes P450, in:
F.H. Arnold, G. Georgiou (Eds.) Methods Mol. Biol., vol. 230, Humana Press Inc., Totowa,
N.J., 2003, pp. 137-139.
[145] W.A. Johnston, W. Huang, M.A. Hayes, J.J. De Voss, E.M.J. Gillam, Quantitative
whole cell cytochrome P450 measurement suitable for high throughput application, J.
Biomol. Screen., 13 (2008) 135-141.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
[146] S.B. Mellor, A.Z. Nielsen, M. Burow, M.S. Motawia, D. Jakubauskas, B.L. Møller,
P.E. Jensen, Fusion of Ferredoxin and Cytochrome P450 Enables Direct Light-Driven
Biosynthesis, ACS Chem. Biol., 11 (2016) 1862-1869.
[147] K. Sonoike, H. Hatanaka, S. Katoh, S. Itoh, Heat-stability of iron-sulfur centers and P-
700 in photosystem I reaction center complexes isolated from the thermophilic
cyanobacterium Synechococcus elongatus, Plant Cell. Physiol., 31 (1990) 865-870.
[148] I.J. Iwuchukwu, M. Vaughn, N. Myers, H. O'Neill, P. Frymier, B.D. Bruce, Self-
organized photosynthetic nanoparticle for cell-free hydrogen production, Nat. Nano, 5 (2010)
73-79.
[149] A. Balme, M. Hervás, L.A. Campos, J. Sancho, M.A. De la Rosa, J.A. Navarro, A
comparative study of the thermal stability of plastocyanin, cytochrome c6 and photosystem I
in thermophilic and mesophilic cyanobacteria, Photosynth. Res., 70 (2001) 281-289.
[150] S. Todorovic, S.S. Leal, C.A. Salgueiro, I. Zebger, P. Hildebrandt, D.H. Murgida, C.M.
Gomes, A Spectroscopic Study of the Temperature Induced Modifications on Ferredoxin
Folding and Iron−Sulfur Moieties, Biochemistry, 46 (2007) 10733-10738.
[151] E.F. Pettersen, T.D. Goddard, C.C. Huang, G.S. Couch, D.M. Greenblatt, E.C. Meng,
T.E. Ferrin, UCSF Chimera--a visualization system for exploratory research and analysis, J.
Comput. Chem., 25 (2004) 1605-1612.
[152] M. Kumar, V. Thakur, G.P.S. Raghava, COPid: Composition Based Protein
Identification, In Silico Biol., 8 (2008) 121-128.
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Figure legends
Figure 1. Structural comparison of thermophilic and selected mesophilic P450s
A: P450cam (PDBID: 3I61), B: P450BM3-heme domain (1FAG), C: CYP119A1 (1IO7), D:
CYP119A2 (1UE8), E: CYP175A1 (1N97) and F: CYP231A2 (2RFB). The view looking
down the axis of the I-helix is shown. Structures are colored by spectral progression from
blue at the N-terminus to red at the C-terminus. The F-and G-helices are visible at the top of
the structures in green, the A- and A’-helices where present are in deep blue. Dashed lines are
shown where elements of the structure are not visible in the crystal structure. Figures were
generated using Chimera [151].
Figure 2. Multiple sequence alignment of thermophilic and representative mesophilic-
P450s
A multiple sequence alignment of the four thermophilic P450s for which structures have been
solved with representative sequences of mesophilic P450s. Secondary structure elements are
colored as identified in the crystal structures (apart from CYP71A2, as no plant P450 crystal
structure has yet been solved) with -helices shown in orange and -sheets in blue. The
thermophilic sequences are significantly shorter at the N-terminus than mesophilic
counterparts. CYP119A1 and CYP119A2 both lack the A’-helix found in many mesophilic
P450s, beginning immediately with the A-helix, while in CYP231A2 the A-helix is entirely
absent and the structure starts with the β1-1 strand. Compared to the bacterial mesophiles,
deletions can also be seen in the β1-1/β1-2 hairpin loop and the β5 turn of the thermophilic
sequences. PDB structures used to assign secondary structure are as follows: CYP119A1,
1IO7; CYP119A2 (P450St), 1UE8; CYP175A1, 1N97; CYP231A2, 2RFB; CYP107A1
(P450eryF), 1Z8O; CYP101A1 (P450cam), 3L61; CYP108 (P450terp), 1CPT; CYP176A1
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
(P450cin), 4FB2; CYP107H1 (P450BioI), 3EJE; CYP102A1 (P450BM3) heme domain, 1FAG;
CYP51, 4LXJ; CYP2C5, 1DT6; CYP3A4, 1TQN; CYP46A1, 2Q9G. The CYP71A2
sequence was obtained from UniProt (P37118).
Figure 3. Changes in conformation of the F/G-loop in various CYP119 ligand-bound
structures
The F/G-loop region is shown in CYP119 structures solved in the presence of various
ligands: A) 4-PI (PDB ID: 1F4T), B) 4-FPI (4WPD), C) imidazole (1F4U), D) 4-CPI
(4TUV), E) 4-BPI (4TT5), F) H2O (1IO7). The view is down the axis of the I-helix and the F-
and G-helices are shown above and to the top left of the I-helix, respectively. Figures were
generated using Chimera [151]. The side chain positions of Arg154 and Leu155 are shown
for reference. Note that the relative positions of the F- and G-helices change little with
respect to the I-helix, but the ends of the helices unravel, altering the size and position of the
F/G-loop to accommodate different ligands.
Figure 4: Aromatic clusters in CYP119A1, CYP119A2 and CYP231A2
A) Structural alignment of CYP119A1 (blue, PDBID 1IO7) and CYP119A2 (magenta,
1UE8), and B) Structure of CYP231A2 (green, PDBID 2RFB) with side chains of residues
belonging to aromatic clusters shown in yellow. Black brackets indicate the largest aromatic
clusters for each protein. Twelve of thirteen residues identified in the two unique aromatic
clusters of CYP119A1 are conserved in similar positions in CYP119A2, with a single residue
from cluster 2 (Phe338) rotated by ~90° in CYP119A2 due to a change in the β3 sheet. The
largest aromatic cluster of CYP231A2 (B) contains residues Tyr18, His23, Tyr25, Phe243,
Tyr269 and spans a total distance of 13.9 Å. This cluster was identified using the selection
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
criteria of aromatic residues within 4 Å of each other. Figures were generated using Chimera
[151].
Figure 5. Positions in the CYP102A1 heme domain structure (PDB ID: 4ZF6) where
stabilizing substitutions were identified in the thermostable peroxygenase variant 5H6.
Two views are shown looking down the I-helix from opposite sides of the protein. The side
chains of the WT residues corresponding to the following mutations are shown: Leu52Ile
(forest green), Ser106Arg (pink), Met145Val (yellow; a reversion to the WT from a previous
mutant), Ala184Val (dark blue), Ler324Ile (cyan), Ile366Val (purple), Val340Met (orange),
Glu442Lys (lime green).
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Figure 1
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Figure 2
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Figure 3
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Figure 4
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Figure 5
ACCEPTED MANUSCRIPT
ACC
EPTE
D M
ANU
SCR
IPT
Highlights
Thermostability is a desirable feature for using P450s in industrial processes.
The structures of four P450s from thermophiles have been characterized to date.
P450s from thermophiles tend to be shorter overall than those from mesophiles.
Extended salt-bridged and aromatic networks may stabilize the P450 fold.
Directed evolution has been used to increase the thermostability of mesophilic P450s.
ACCEPTED MANUSCRIPT