+ All Categories
Home > Documents > E Ruark, K Snape, et al (2013), Mosaic PPM1D mutations are...

E Ruark, K Snape, et al (2013), Mosaic PPM1D mutations are...

Date post: 28-Jul-2018
Category:
Upload: phungkhue
View: 216 times
Download: 0 times
Share this document with a friend
44
Institute of Cancer Research Repository https://publications.icr.ac.uk Please direct all emails to: [email protected] This is an author produced version of an article that appears in: The internet address for this paper is: NATURE https://publications.icr.ac.uk/12228/ Published text: E Ruark, K Snape, et al (2013), Mosaic PPM1D mutations are associated with predisposition to breast and ovarian cancer, Nature, Vol. 493(7432), 406-410
Transcript

Institute of Cancer Research Repository https://publications.icr.ac.uk

Please direct all emails to: [email protected]

This is an author produced version of an article that appears in:

The internet address for this paper is:

NATURE

https://publications.icr.ac.uk/12228/

Published text:

E Ruark, K Snape, et al (2013), Mosaic PPM1D mutations are associated with predisposition to breast and ovarian cancer, Nature, Vol. 493(7432), 406-410

1

Mosaic PPM1D mutations are associated with predisposition to breast and

ovarian cancer

Elise Ruark1*, Katie Snape1*, Peter Humburg2*, Chey Loveday1, Ilirjana Bajrami3, Rachel Brough3,4, Daniel Nava Rodrigues3, Anthony Renwick1, Sheila Seal1, Emma Ramsay1, Silvana Del Vecchio Duarte1, Manuel A. Rivas2,5, Margaret Warren-Perry1, Anna Zachariou1, Adriana Campion-Flora3, Sandra Hanks1, Anne Murray1, Naser Ansari Pour1, Jenny Douglas1, Lorna Gregory2, Andrew Rimmer2, Neil M. Walker6, Tsun-Po Yang7, Julian W. Adlard8, Julian Barwell9, Jonathan Berg10, Angela F. Brady11, Carole Brewer12, Glen Brice13, Cyril Chapman14, Jackie Cook15, Rosemarie Davidson16, Alan Donaldson17, Fiona Douglas18, Diana Eccles19, D. Gareth Evans20, Lynn Greenhalgh21, Alex Henderson18, Louise Izatt22, Ajith Kumar23, Fiona Lalloo24, Zosia Miedzybrodzka25, Patrick J. Morrison26, Joan Paterson27, Mary Porteous28, Mark T. Rogers29, Susan Shanley30, Lisa Walker31, Martin Gore32, Richard Houlston1, Matthew A. Brown33, Mark J. Caufield34, Panagiotis Deloukas7, Mark I. McCarthy2,35,36, John A. Todd6, The Breast and Ovarian Cancer Susceptibility Collaboration (BOCS)37, Wellcome Trust Case Control Consortium37, Clare Turnbull1,30, Jorge S. Reis-Filho3, Alan Ashworth3, Antonis C. Antoniou38, Christopher J. Lord3, Peter Donnelly2,39& Nazneen Rahman1,30

1Division of Genetics & Epidemiology, The Institute of Cancer Research, Sutton, SM2 5NG, UK 2The Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK 3The Breakthrough Breast Cancer Research Centre, The Institute of Cancer Research, London, SW3 6JB, UK 4Cancer Research UK Gene Function Laboratory, The Institute of Cancer Research, London, SW3 6JB, UK 5Nuffield Department of Clinical Medicine, University of Oxford, Oxford, OX3 7LD, UK 6Juvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, University of Cambridge, Addenbrooke’s Hospital, Cambridge, CB2 0XY, UK 7The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK 8Yorkshire Regional Genetics Service, Chapel Allerton Hospital, Leeds, LS7 4SA, UK 9Leicestershire Genetics Centre, University Hospitals of Leicester NHS Trust, LE1 5WW, UK 10Human genetics, Division of Medical Sciences, University of Dundee, DD1 9SY, UK 11NW Thames Regional Genetics Service, Kennedy Galton Centre, London, HA1 3UJ, UK 12Peninsula Regional Genetics Service, Royal Devon & Exeter Hospital, Exeter, EX1 2ED, UK 13SW Thames Regional Genetics Service, St George's Hospital, London, SW17 0RE, UK 14West Midlands Regional Genetics Service, Birmingham Women's Hospital, Birmingham, B15 2TG, UK 15Sheffield Regional Genetics Service, Sheffield Children's NHS Foundation Trust, S10 2TH, UK 16West of Scotland Regional Genetics Service, Laboratory Medicine, Southern General Hospital, Glasgow, G51 4TF, UK 17South Western Regional Genetics Service, University Hospitals of Bristol NHS Foundation Trust, BS2 8EG, UK

2

18Northern Genetics Service, Newcastle upon Tyne Hospitals NHS Foundation Trust, NE1 3BZ, UK 19Faculty of Medicine, University of Southampton, Southampton University Hospitals NHS Trust, SO16 5YA, UK 20Genetic Medicine, Manchester Academic Health Science Centre, St. Mary's Hospital, Manchester M13 9WL, UK 21 Merseyside and Cheshire Clinical Genetics Service, Liverpool Women's NHS Foundation Trust, Liverpool, L8 7SS, UK 22SE Thames Regional Genetics Service, Guy's and St Thomas NHS Foundation Trust, London, SE1 9RT, UK 23NE Thames Regional Genetics Service, Great Ormond St Hospital, London, WC1N 3JH, UK 24University Dept of Medical Genetics & Regional Genetics Service, St Mary's Hospital, Manchester, M13 9WL, UK 25University of Aberdeen and North of Scotland Clinical Genetics Service, Aberdeen Royal Infirmary, AB25 2ZA, UK 26Northern Ireland Regional Genetics Service, Belfast HSC Trust, Department of Medical Genetics, Queen's University Belfast, BT9 7AB, UK 27 East Anglian Regional Genetics Service, Cambridge University Hospitals NHS Foundation Trust, CB2 0QQ, UK 28South East of Scotland Clinical Genetics Service, Western General Hospital, Edinburgh, EH4 2XU, UK 29All Wales Medical Genetics Service, University Hospital of Wales, Cardiff, CF14 4XW, UK 30Dept of Cancer Genetics, Royal Marsden NHS Foundation Trust, Sutton, SM2 5PT, UK 31Oxford Regional Genetics Service, Oxford University Hospitals NHS Trust, Oxford, OX3 7LJ, UK 32Dept of Gynaecologic Oncology, Royal Marsden NHS Foundation Trust, London, SW3 6JJ, UK 33University of Queensland Diamantina Institute, University of Queensland, Princess Alexandra Hospital, Woolloongabba, Brisbane, 4102, Australia 34Clinical Pharmacology and Barts and The London Genome Centre, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, EC1M 6BQ, UK. 35Oxford Centre for Diabetes, Endocrinology and Medicine, University of Oxford, Churchill Hospital, Oxford, OX3 7LI, UK. 36Oxford NIHR Biomedical Research Centre, Churchill Hospital, Oxford, OX3 7LI, UK. 37The members of these consortia are listed in the Supplementary Material 38Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, CB1 8RN, UK. 39Department of Statistics, University of Oxford, Oxford, OX1 3TG, UK *Joint first author

3

ABSTRACT

Improved sequencing technologies offer unprecedented opportunities for investigating the

role of rare genetic variation in common disease. However, there are considerable challenges

with respect to study design, data analysis and replication. Here, using pooled next-generation

sequencing of 507 genes implicated in the repair of DNA in 1,150 samples, an analytical

strategy focussed on protein truncating variants (PTVs) and a large-scale sequencing case-

control replication experiment in 13,642 individuals, we show that rare PTVs in the p53

inducible protein phosphatase PPM1D are associated with predisposition to breast cancer and

to ovarian cancer. PPM1D PTV mutations were present in 25/7781 cases vs 1/5861 controls;

P = 1.12x10-5, which included 18 mutations in 6,912 individuals with breast cancer; P =

2.42x10-4 and 12 mutations in 1,121 individuals with ovarian cancer; P = 3.10x10-9. Notably,

all the identified PPM1D PTVs were mosaic in lymphocyte DNA and clustered within a 370

bp region in the final exon of the gene, C-terminal to the phosphatase catalytic domain.

Functional studies demonstrated that the mutations result in enhanced suppression of p53 in

response to ionising radiation exposure, suggesting the mutant alleles encode hyperactive

PPM1D isoforms. Thus, although the mutations cause premature protein truncation, they do

not result in the simple loss-of-function typically associated with this class of variant, but

instead likely have a gain-of-function effect. Our results have implications for the detection

and management of breast and ovarian cancer risk. More generally, these data provide new

insights into the role of rare and of mosaic genetic variants in common conditions, and the

utility of sequencing in their identification.

4

TEXT

There is strong evidence that rare genetic variation is important in breast and ovarian cancer

predisposition1,2. In the 1990s, genome-wide linkage analysis and positional cloning led to

the identification of the DNA repair genes BRCA1 and BRCA2, rare mutations of which

confer substantial risks of both diseases1,2. More recently, through case-control resequencing

studies of candidate genes we, and others, have discovered rare variants that confer moderate

risks of breast and/or ovarian cancer3-9. These cancers are therefore exemplars of the rare

variant-common disease hypothesis.

The successful studies to date have focussed on genes encoding proteins involved in DNA

repair such as PALB2, ATM, CHEK2, BRIP1, RAD51C and RAD51D3-9. These genes are

characterised by multiple, very rare, loss-of-function mutations, usually protein truncating

variants (PTVs), which predispose carriers to breast and/or ovarian cancer3-9. To further

investigate the role of DNA repair genes in cancer susceptibility, we sequenced 507 genes

(the ‘DNA repair panel’) in 1,150 individuals with breast cancer from the UK, 69 of whom

also had ovarian cancer (Supplementary Table 1, Supplementary Fig. 1). To maximise time,

sample and cost efficiency we used a pooled approach combining 200 ng of DNA from each

of 24 individuals into a single pool which we hybridised to a custom pulldown containing the

DNA repair panel (Supplementary Table 2). We performed sequencing using an Illumina

HiSeq2000 which generated a minimum coverage per pool of 480x for ≥ 90% of the target

region (Supplementary Fig. 2). Sequence variants were called using Syzygy10, the

performance of which was evaluated using previously generated data in a subset of the

samples. The sensitivity of base substitution calling was 99.6% (439/439 common variants

and 24/26 rare variants that were present in 1/24 individuals in a pool). The sensitivity of

insertion/deletion calling was 94.4% (51/54 rare insertion/deletions present in 1/24

individuals in a pool, Supplementary Table 3).

We next considered the 34,564 sequence variants called by Syzygy. We first focussed on

PTVs because of the strong association of this class of mutation with disease. In total, 1,044

PTVs were called by Syzygy and we used a ‘PTV prioritisation method’ to stratify the genes

according to the number of different, rare truncating mutations present within the samples11.

PPM1D showed the strongest signal in this analysis, and we confirmed by Sanger sequencing

5

that five individuals carried different PPM1D PTVs. Two of these individuals had ovarian

cancer in addition to breast cancer.

To further explore the role of PPM1D in breast and ovarian cancer susceptibility we next

performed a case-control Sanger sequencing analysis of PPM1D in a total of 13,642

individuals; 7,781 unrelated individuals with breast and/or ovarian cancer and 5,861

population controls (Supplementary Table 1). We initially sequenced all PPM1D exons and

intron-exon boundaries but after completing this analysis in 3,803 samples we noted that all

10 PTV mutations identified occurred within the last exon of PPM1D, and this clustering was

highly significant (P = 8.2x10-6). We thus analysed the remaining 9,839 samples for this

mutation cluster region (MCR), identifying a further 16 PTVs (Supplementary Table 1, Fig.

1). In total we identified 25 PPM1D PTVs in individuals with breast and/or ovarian cancer

and 1 in controls (P = 1.12x10-5, Fig. 1, Fig. 2a and Supplementary Table 4). This included 18

mutations in 6,912 individuals with breast cancer (P = 2.42x10-4) and 12 mutations in 1,121

individuals with ovarian cancer (P = 3.10x10-9). The histological features of the cancers in

PPM1D mutation carriers were diverse, and five individuals had both breast and ovarian

cancer (Supplementary Table 5). The case series included 773 individuals with mutations in

BRCA1 or BRCA2 (termed ‘BRCA1/2 mutation carriers’), four of whom also carried PTVs in

PPM1D (4/773 vs 1/5861 controls, P = 8.30x10-4). We also identified a total of 16 non-

synonymous, 14 synonymous and one intronic variant across the cases and controls; there

was no evidence for an association with cancer for these variant classes (Supplementary

Table 6).

The Sanger sequencing chromatograms for the PPM1D PTVs were unusual for heterozygous

mutations as the mutant allele was considerably and consistently lower than the wildtype

allele, suggesting the mutations were mosaic in lymphocyte DNA (Fig. 2a and Supplementary

Fig. 3). This contrasted with the non-truncating variants which all had normal sequencing

profiles. DNA from saliva was available for two individuals and the PTVs were present at

similar amplitude to that identified in the corresponding blood derived DNA (Supplementary

Fig 3). To further confirm the PTV mutations were bona fide we used two additional

mutation detection methods; deep PCR amplicon sequencing12 (Fig. 2b, Supplementary Fig. 4

and Supplementary Table 4) and multiplex ligation-dependent probe amplification (MLPA)13

(Supplementary Fig. 5 and Supplementary Table 7). For the deep PCR amplicon sequencing

we generated Nextera libraries of pooled PCR products covering BRCA1, BRCA2 and the

6

PPM1D mutation, which we sequenced using an Illumina MiSeq generating a median

coverage of 3387x across the PPM1D mutation (Supplementary Fig. 4 and Supplementary

Table 4). This confirmed the PPM1D PTVs were present at a lower proportion than

heterozygous polymorphisms in BRCA1 and BRCA2, with a median mutant read percentage

of 16% (range 5-34%). Additionally, we sequenced the original DNA repair panel in six cases

individually (i.e. unpooled), which again confirmed the mutations were present, but mosaic

(Supplementary Fig. 6 and Supplementary Table 4). For three samples we had data from both

the deep PCR amplicon sequencing and the DNA repair panel which gave identical mutation

percentage results (Supplementary Table 4). Finally, family studies were also consistent with

mosaicism; none of 14 relatives carried the PPM1D mutation identified in the proband. Most

compellingly, for each of probands 17 and 24, we identified two offspring that had inherited

different maternal haplotypes at the PPM1D locus, but neither offspring carried the relevant

maternal PPM1D mutation, demonstrating that the mutations were either not present, or

mosaic in the germline of the probands (Fig. 2c).

PPM1D (protein phosphatase, Mg2+/Mn2+dependent 1D), also known as WIP1 (Wild-type

p53-induced phosphatase 1) was first identified in a screen for p53 target genes induced by

ionising radiation14. PPM1D encodes a 605 amino acid protein with a N-terminal phosphatase

catalytic domain and a C-terminal domain that contains a putative nuclear localisation signal

(Fig. 1)15. PPM1D transcription is upregulated in response to various types of DNA damage

in a p53-dependent manner. Once upregulated, PPM1D has been shown to dephosphorylate

and downregulate several targets, particularly proteins associated with the ATM/ATR-

initiated DNA damage response (DDR) and including tumour suppressors with a proven role

in cancer susceptibility such as p5316, ATM17 and CHK218. Thus it has been proposed that a

primary role of PPM1D is as a homeostatic regulator of the DDR, facilitating return of cells

to their normal state after repair of damaged DNA16. There is also accumulating evidence that

PPM1D is involved in oncogenesis15. PPM1D amplification and overexpression has been

demonstrated in multiple human tumours19, including breast cancers20 and ovarian clear cell

carcinoma21, and is a promising therapeutic target21-23.

The clustering of PTVs within the 370 bp region corresponding to amino acids 420-546,

which is downstream of the phosphatase catalytic domain but precedes or disrupts the nuclear

localisation signal,24 suggests the PTVs are not acting as simple loss-of-function mutations

(Fig. 1). Moreover, all the PTVs were in the last exon and thus predicted to evade nonsense-

7

mediated RNA decay and to result in a truncated protein that retains the phosphatase catalytic

domain, rather than in haploinsufficiency24,25. We confirmed this experimentally for three

mutations (Fig 2a). To investigate the effect of PPM1D PTVs we generated cDNA expression

constructs representing two mutant alleles (PPM1D c.1384C>T; case 6 and PPM1D

c.1420delC; case 7) and tested their ability to suppress p53 activation in response to ionising

radiation (IR) exposure. As expected, the normal elevation of p53 levels after IR exposure

was moderately suppressed in human U2-OS tumour cells transfected with a wildtype

PPM1D expression construct, matching previous observations15,16 (Fig. 3). The suppression

of p53 was enhanced in cells transfected with the mutant PPM1D expression constructs

suggesting that each of these alleles encodes a hyperactive PPM1D isoform, i.e. consistent

with a gain-of-function rather than a loss-of-function effect (Fig. 3). Similar effects were also

observed in HeLa and 293 cells (Supplementary Fig. 7).

To investigate the mechanism of oncogenesis in PPM1D PTV mutation carriers we analysed

eight tumours from five individuals. Intriguingly, the PPM1D mutations were not detectable

in any of the tumours by Sanger sequencing or MLPA (Supplementary Fig. 8). Through

microsatellite analysis we confirmed that the tumours were from the correct individuals and

demonstrated loss of heterozygosity at the PPM1D locus in seven of eight tumours, though

there was no evidence of PPM1D copy number alteration (Supplementary Fig. 8 and

Supplementary Table 8). We microdissected stromal tissue from the ovarian tumour in four

cases and undertook deep sequencing across the PPM1D PTV in blood, tumour and stromal

DNA. Each mutation was present in the blood, at similar level to that detected previously,

absent from the tumour and either absent (two cases) or present at very low level (5/915 reads

and 4/5793 reads) in the stroma, consistent with lymphocyte contamination (Supplementary

Fig. 8 and Supplementary Table 5).

These data are intriguing and strongly suggest the mechanism of cancer association in

PPM1D mutation carriers differs from that in carriers of mutations in other DNA repair genes

associated with predisposition to these cancers. There are several potential explanations. It is

possible the mutation was present in the cell of cancer origin but was subsequently lost,

perhaps because a PPM1D mutation can act as a driver to initiate oncogenesis, but is not

required, or is detrimental to the progression of the resulting cancer. The allele loss we

observed at the PPM1D locus could be interpreted as supportive of this hypothesis, but it

should be noted that it is not known if the lost allele carried the PPM1D PTV, and loss in this

8

region of 17q is common in these cancers. Alternatively, the absence of the PPM1D mutation

in the tumour could be because oncogenesis is being driven by the mutation in circulating

blood cells. Another possibility is that the PPM1D mutations are not directly involved in

causing breast or ovarian cancer. For example, they could be a separate manifestation of an

underlying lesion, perhaps one that causes genomic instability, which can lead to selection

and clonal expansion of cells with PPM1D PTVs and also to cancers in other tissues. Clearly,

further studies will be required to explain the mechanism of oncogenesis in PPM1D mutation

carriers.

Irrespective of the mechanism of the association, our data demonstrate that individuals with

mosaic PPM1D PTVs in the mutation cluster region are at increased risk of cancer. The

association is not explicable by increasing age, unlike recently reported mosaic chromosomal

abnormalities (Supplementary Table 5)26,27. To estimate the cancer risks we undertook a

retrospective cohort analysis using information on breast and ovarian cancer occurrence in the

6,577 unrelated individuals negative for BRCA1/2 mutations and controls, by modelling the

retrospective likelihood of the observed mutation status conditional on the disease phenotype,

as previously described8,28. This approach adjusts for our ascertainment of cases with more

extreme phenotypes such as young age of onset or bilateral breast cancer, which we utilise to

empower gene discovery3-6,8,9,29. The relative risk of breast cancer for PPM1D PTV carriers

was estimated to be 2.7 (95% CI: 1.3-5.3; P = 5.38x10-3), which translates to approximately

23% cumulative risk by age 80. The relative risk of ovarian cancer was estimated to be 11.5

(95% CI: 4.3-30.4; P = 9.95x10-7), which translates to approximately 18% cumulative risk by

age 80. It is noteworthy that we included an unselected hospital-based series of 322 ovarian

cancer patients in whom we identified five PPM1D PTVs, suggesting that 1-2% of ovarian

cancer patients may harbour mosaic PPM1D mutations.

The frequency of PPM1D PTVs in BRCA1/2 mutation carriers with breast and/or ovarian

cancer was also significantly different from population controls (4/773 vs 1/5861; P =

8.30x10-4) and similar to that in cases of breast and/or ovarian cancer without BRCA1/2

mutations (4/773 vs 21/6634; P = 0.56), suggesting that PPM1D PTVs are also associated

with increased risks of cancer in BRCA1/2 mutation carriers. Studies of unselected,

population-based cancer patients and of larger series of BRCA1/2 mutation carriers would be

of value to extend our observations, and to further explore the prevalence and cancer risks

associated with PPM1D mutations.

9

These data provide new insights into ovarian and breast cancer, potentially identifying a

novel class of genetic defect that lies somewhere between classic germline genetic

predisposition mutations and tumour-specific somatic events. It is also highly plausible that

PPM1D mutations are associated with other cancers, and broad evaluation of individuals with

other tumour types would be of interest. More generally, the clinical implications of a mosaic

cancer predisposition marker that is genetic, but not hereditary, and that is detectable in the

blood but not the tumour(s) it is associated with are rather profound, particularly if this

phenomenon is observed in other genes/contexts.

Our results also provide insights into genetic variation, particularly in relation to the nature

and impact of rare gene mutations associated with disease. Given the truncating mutations we

report likely have a gain-of-function effect, the widespread interchangeable use of the terms

‘truncating mutation’ and ‘loss-of-function mutation’ is inappropriate. We believe a more

descriptive term such as ‘protein truncating variant’ (PTV), which does not imply the

functional consequence of the mutation, is preferable. We also provide evidence that mosaic

mutations can have relevance to common disease. Such variants are challenging to detect by

Sanger sequencing, but are detectable by next-generation sequencing approaches. It is

therefore likely that further examples of mosaic disease-associated mutations will be

forthcoming, though studies to define the frequency and characteristics of mosaic mutations

in control individuals will be essential, to ensure the implications of such variants in case

series are correctly interpreted. Finally, although newer sequencing technologies are making

large-scale whole-genome sequencing experiments ever more feasible, it is likely that

focussed sequencing experiments with tailored design and analytical prioritisation strategies,

such as those employed here, will have utility over the next few years.

Methods Summary

Lymphocyte DNA from 8,046 individuals affected with breast and/or ovarian cancer and

5,861 population-based controls were included. A custom pulldown that included 507 genes

(DNA repair panel) was designed using the Agilent SureSelect Target Enrichment system and

sequenced in samples from 1,150 women, in pools of 24 samples, with an Illumina

HiSeq2000. Sequence reads were mapped to the human reference genome (hg19) using BWA

(version 0.5.6). Variant calling was undertaken with Syzygy (version 1.2.4)10. Primers for

PPM1D Sanger sequencing were designed using Exon-Primer. Amplicons were sequenced

10

using the BigDye Terminator Cycle sequencing kit and an ABI3730 automated sequencer

(ABI PerkinElmer). Deep PCR amplicon sequencing was undertaken by amplifying the

PPM1D mutation cluster region, BRCA1 and BRCA2 using a Multiplex PCR Kit (Qiagen),

preparing libraries with Nextera technology11 and sequencing on an Illumina MiSeq.

Sequencing of the DNA repair panel in indexed samples from six PPM1D PTV carriers was

undertaken using Illumina TruSeq kits for library preparation and an Illumina HiSeq2000. For

these latter experiments, sequence reads were mapped using Stampy version 1.0.14 and

variants were called with Platypus (http://www.well.ox.ac.uk/platypus). MLPA was

undertaken using the SALSA MLPA probe mix P200 (MRC Holland). Microsatellite analysis

was undertaken with 5’6-FAM tagged primer pairs. PCR products were run on a 3730xL

genetic analyser (ABI PerkinElmer) and data were analysed using GeneMarker v1.51

(SoftGenetics). The U2OS, HeLa and HEK293 (p53 wildtype) cell lines were transfected

with a plasmid containing full-length wildtype PPM1D cDNA and the PPM1D open reading

frame subcloned into pCMV6-AN-HA (Origene), generating a construct that could express a

PPM1D-N-terminal HA epitope fusion protein. Mutations were introduced using the

QuickChange II XL Site-Directed Mutagenesis Kit (Stratagene). Statistical analyses were

performed using the stats package in R. Cancer risks were estimated within a retrospective

cohort analysis framework8,28.

References

1 Turnbull, C. & Rahman, N. Genetic predisposition to breast cancer: past, present, and future. Annu Rev Genomics Hum Genet 9, 321-345 (2008).

2 Gayther, S. A. & Pharoah, P. D. The inherited genetics of ovarian and endometrial cancer. Curr Opin Genet Dev 20, 231-238 (2010).

3 Rahman, N. et al. PALB2, which encodes a BRCA2-interacting protein, is a breast cancer susceptibility gene. Nat Genet 39, 165-167 (2007).

4 Renwick, A. et al. ATM mutations that cause ataxia-telangiectasia are breast cancer susceptibility alleles. Nat Genet 38, 873-875 (2006).

5 Seal, S. et al. Truncating mutations in the Fanconi anemia J gene BRIP1 are low-penetrance breast cancer susceptibility alleles. Nat Genet 38, 1239-1241 (2006).

6 Meijers-Heijboer, H. et al. Low-penetrance susceptibility to breast cancer due to CHEK2(*)1100delC in noncarriers of BRCA1 or BRCA2 mutations. Nat Genet 31, 55-59 (2002).

7 Meindl, A. et al. Germline mutations in breast and ovarian cancer pedigrees establish RAD51C as a human cancer susceptibility gene. Nat Genet 42, 410-414 (2010).

8 Loveday, C. et al. Germline RAD51C mutations confer susceptibility to ovarian cancer. Nat Genet 44, 475-476 (2012).

11

9 Loveday, C. et al. Germline mutations in RAD51D confer susceptibility to ovarian cancer. Nat Genet 43, 879-882 (2011).

10 Rivas, M. A. et al. Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease. Nat Genet 43, 1066-1073 (2011).

11 Snape, K. et al. Predisposition gene identification in common cancers by exome sequencing: insights from familial breast cancer. Breast Cancer Res Treat 134, 429-433 (2012).

12 Caruccio, N. Preparation of next-generation sequencing libraries using Nextera technology: simultaneous DNA fragmentation and adaptor tagging by in vitro transposition. Methods Mol Biol 733, 241-255 (2011).

13 Schouten, J. P. et al. Relative quantification of 40 nucleic acid sequences by multiplex ligation-dependent probe amplification. Nucleic Acids Res 30, e57 (2002).

14 Fiscella, M. et al. Wip1, a novel human protein phosphatase that is induced in response to ionizing radiation in a p53-dependent manner. Proc Natl Acad Sci U S A 94, 6048-6053 (1997).

15 Lu, X. et al. The type 2C phosphatase Wip1: an oncogenic regulator of tumor suppressor and DNA damage response pathways. Cancer Metastasis Rev 27, 123-135 (2008).

16 Lu, X., Nguyen, T. A. & Donehower, L. A. Reversal of the ATM/ATR-mediated DNA damage response by the oncogenic phosphatase PPM1D. Cell Cycle 4, 1060-1064 (2005).

17 Shreeram, S. et al. Regulation of ATM/p53-dependent suppression of myc-induced lymphomas by Wip1 phosphatase. J Exp Med 203, 2793-2799 (2006).

18 Fujimoto, H. et al. Regulation of the antioncogenic Chk2 kinase by the oncogenic Wip1 phosphatase. Cell Death Differ 13, 1170-1180 (2006).

19 Bulavin, D. V. et al. Amplification of PPM1D in human tumors abrogates p53 tumor-suppressor activity. Nat Genet 31, 210-215 (2002).

20 Natrajan, R. et al. Tiling path genomic profiling of grade 3 invasive ductal breast cancers. Clin Cancer Res 15, 2711-2722 (2009).

21 Tan, D. S. et al. PPM1D is a potential therapeutic target in ovarian clear cell carcinomas. Clin Cancer Res 15, 2269-2280 (2009).

22 Rayter, S. et al. A chemical inhibitor of PPM1D that selectively kills cells overexpressing PPM1D. Oncogene 27, 1036-1044 (2008).

23 Hayashi, R. et al. Optimization of a cyclic peptide inhibitor of Ser/Thr phosphatase PPM1D (Wip1). Biochemistry 50, 4537-4549 (2011).

24 Chuman, Y. et al. PPM1D430, a novel alternative splicing variant of the human PPM1D, can dephosphorylate p53 and exhibits specific tissue expression. J Biochem 145, 1-12 (2009).

25 Silva, A. L., Ribeiro, P., Inacio, A., Liebhaber, S. A. & Romao, L. Proximity of the poly(A)-binding protein to a premature termination codon inhibits mammalian nonsense-mediated mRNA decay. RNA 14, 563-576 (2008).

12

26 Laurie CC. et al. Detectable clonal mosaicism from birth to old age and its relationship to cancer. Nat Genet. 44, 642-650 (2012).

27 Jacobs KB et al.. Detectable clonal mosaicism and its relationship to aging and cancer. Nat Genet 44, 651-658 (2012).

28 Barnes, D. et al. Evaluation of association methods for analysing modifiers of disease risk in carriers of high risk mutations. Genet. Epidemiol. 36, 274-291 (2012).

29 Antoniou, A. C. & Easton, D. F. Polygenic inheritance of breast cancer: Implications for design of association studies. Genet Epidemiol 25, 190-202 (2003).

Acknowledgements We thank all the subjects and families that participated in the research and D. Dudakia, J. Bull, R. Linger for their assistance in recruitment. We are indebted to Mike Stratton for discussions of the data and to Ann Strydom for extensive editorial assistance. We thank the High-Throughput Genomics Group at the Wellcome Trust Centre for Human Genetics, Oxford (funded by Wellcome Trust grant reference 090532/Z/09/Z and MRC Hub grant G0900747 91070) for the generation of the Phase 1 Sequencing data. This work was funded by the Institute of Cancer Research, The Wellcome Trust, Cancer Research UK and Breakthrough Breast Cancer. We acknowledge support by the RMH-ICR National Institute for Health Research (NIHR) Specialist Biomedical Research Centre for Cancer. We acknowledge the use of DNA from the British 1958 Birth Cohort collection funded by the MRC grant G0000934 and the Wellcome Trust grant 068545/Z/02. A.C.A. is a Cancer Research UK Senior Cancer Research Fellow (C12292/A11174). P.D. is supported by a Wolfson-Royal Society Merit Award. K.S. is supported by the Michael and Betty Kadoorie Cancer Genetics Research Programme. Author Contributions E.R., K.S., P.H., N.W., T-P.Y., M.B., M.C., C.T., J.T., M.Mc., P.De., P.Do. and N.R (chair) are the WTCCC exon-resequencing group who devised and funded Phase 1. J.W.A., J.Ba., J.Be., A.F.B., C.B., G.B., C.C., J.C., R.D., A.D., F.D., D.E., D.G.E., L.G., A.H., L.I., A.K., F.L., Z.M., P.J.M., J.P., M.P., M.T.R., S.Sh., L.W. and N.R. are centre leads of the Breast and Ovarian Cancer Susceptibility Collaboration, which is coordinated by M.W.P and A.Z. A full list of the WTCCC and BOCS consortia are in the Supplementary Material. R.H and M.G assembled the unselected ovarian cancer series. L.G. coordinated Phase 1 sequencing. P.H, M.R. and P.Do. undertook analysis of the pooled DNA repair panel. J.D., A.M., S.Se., S.H. and E.R. undertook NGS sequencing and analysis. S.Se. E.R., S.DVS., N.A.P., A.Re., K.S., C.L. and J.D. undertook Sanger sequencing, MLPA, and tumour microsatellite analysis. E.R. undertook sample selection and data analyses with C.T. A.C.A. wrote the risk analysis software and oversaw the penetrance analysis which was performed by E.R. A.Ri. provided and optimised Platypus. D.N R., A.C-F. and J.S.R-F. undertook histopathological analyses and performed microdissections. I.B., R.B., C.J.L. and A.A. undertook functional analyses. E.R., K.S. and N.R. managed and oversaw all aspects of the study and wrote the manuscript. Author Information Reprints and permissions information is available at www.nature.com/reprints. There are no competing financial interests for any of the authors. Correspondence and requests for materials should be addressed to Nazneen Rahman at [email protected]

13

Fig. 1. Clustering of cancer predisposing mutations in PPM1D.

a, PPM1D mutations and cancer phenotype. b, PPM1D gene with region targeted by

mutations (mutation cluster region) in blue; c, PPM1D protein showing position of mutation

cluster region downstream of the phosphatase domain and upstream/overlapping the nuclear

localisation signal (NLS); d, mutation cluster region showing position of mutations. The

numbers above give the position of the mutations and correspond to the IDs in panel (a). Ov

ca, ovarian cancer; br ca, breast cancer; bil br ca, bilateral breast cancer.

Fig. 2. PPM1D mutations are mosaic in lymphocyte DNA.

a, Sanger sequencing traces showing mutant allele is lower in genomic DNA extracted from

peripheral blood lymphocytes (gDNA) than typical for heterozygous mutations. The cDNA

analysis demonstrates the mutations lead to truncated products not nmRNA decay. b, deep

PCR amplicon sequencing showing heterozygous BRCA1/2 variants at 50% (open dots)

whereas the PPM1D mutation is present at a lower percentage (red dots). c, Haplotype

analysis in two families. The offspring of PPM1D mutation carriers have different maternal

haplotypes spanning PPM1D (highlighted), but neither carry the mutation, indicating that it is

either not present, or mosaic in the germline of the proband.

Fig. 3. The effect of mutant PPM1D isoforms on p53 activation.

p53 wildtype U2-0S human osteosarcoma cells were transfected with PPM1D cDNA

expression constructs and exposed to ionising irradiation (5 grays). At 30 minute and 4h

intervals whole cell lysates were western blotted to estimate the IR-induced activation of p53.

Western blots showing p53 and actin (loading control) protein levels at different times (in

hours) after IR exposure are shown. ‘Empty’: transfected with empty construct, ‘PPM1D

WT’: transfected with wildtype PPM1D construct, ‘PPM1D c.1384C>T’ and ‘PPM1D

c.1420delC’: transfected with mutant PPM1D constructs. Suppression of p53 was enhanced

in cells transfected with mutant constructs suggesting these alleles encode hyperactive

PPM1D isoforms.

14

ONLINE METHODS Patients and Samples

Cases

We used lymphocyte DNA from 8,046 individuals affected with breast and/or ovarian cancer

that were recruited via two studies. 7,724 cases were recruited through 24 genetics centres in

the UK via the Breast and Ovarian Cancer Susceptibility study (BOCS), which recruits

women ≥18 years who have had breast cancer and/or ovarian cancer and have a family

history of breast cancer and/or ovarian cancer. Each proband was screened for BRCA1 and

BRCA2 mutations (by Sanger sequencing and/or heteroduplex analysis) and large

rearrangements (by MLPA). The remaining 322 cases are an unselected hospital-based series

of women with ovarian cancer who were recruited during treatment for ovarian cancer at the

Royal Marsden Hospital. The DNA was extracted from peripheral blood samples except in 11

cases, for whom DNA was extracted from a lymphoblastoid cell line (NB all the PPM1D

mutations were identified in peripheral blood-derived DNA). At least 97% of families were of

European ancestry, i.e. comparable to the controls. Informed consent was obtained from all

participants. The research was approved by the London Multicentre Research Ethics

Committee (MREC/01/2/18).

For the Phase 1 pooled DNA repair panel experiment we used lymphocyte DNA from 1,150

women with breast cancer, 69 also had ovarian cancer. 78 of these individuals had one

mutation, and one individual had two mutations, in known cancer predisposition genes. These

were included as ‘positive controls’ to evaluate variant calling (see below). For the PPM1D

case-control sequencing experiment we used 7,781 individuals with breast and/or ovarian

cancer. We did not use the case data from the pooled DNA repair panel experiment in the

case-control analysis, firstly because the mutation status of individuals cannot be definitively

obtained from the pooled experiment as one cannot be certain that every sample is equally

represented in a pool, and secondly because the mutation detection method was different to

that utilised in the case-control experiment. We used our standard case and control sample

trays for the case-control PPM1D sequencing experiment and the sample selection was blind

to the pooled DNA repair panel experiment. 885 individuals were part of both experiments.

Samples and pathology information from mutation-positive families

15

For families in which a PPM1D mutation was detected, we sought DNA samples from

relatives. We also requested tumour material, histopathology information, and

immunohistochemical profiles, including hormone receptor receptor and HER2 status for

patients with breast cancer, in probands from the hospitals where they had been treated.

Representative tumour blocks were retrieved where possible and examined by two

histopathologists (DNR & JSR-F) and classified and graded according to the World Health

Organisation 2003 classification1,2. Tumours were microdissected under a stereomicroscope

and genomic DNA was extracted from tumour and, where possible, stroma using the DNeasy

kit (Qiagen) as previously described2.

Controls We used lymphocyte DNA from 5,861 population-based controls obtained from the 1958

Birth Cohort Collection, an ongoing follow-up of persons born in Great Britain in one week

in 1958. Biomedical assessment was undertaken during 2002-2004 at which blood samples

and informed consent were obtained for creation of a genetic resource but phenotype data for

these individuals is not available. At least 97% of the controls were of European ancestry.

(http://www.cls.ioe.ac.uk/studies.asp?section=000100020003).

Sequencing

DNA repair panel sequencing

We identified genes for inclusion on the DNA repair panel from

http://www.geneontology.org/ using the search term “DNA repair” (GO:0006281) and from

http://string-db.org/ by identifying all genes interacting with ATM, BRCA1, BRCA2, BRIP1,

CHEK2 and PALB2 with highest confidence (≥ 0.9). This dataset was manually curated to

remove duplicate genes and pseudogenes. CCDS transcripts for the remaining genes were

retrieved from UCSC Genome Browser (http://genome.ucsc.edu/ from November 2010)

(Supplementary Table 2). Genomic coordinates for all coding exons were identified and

targeted in a custom pulldown designed using the Agilent SureSelect Target Enrichment

system (Agilent)3. We created 48 pools of DNA that each included 4 µl of 50 ng/µl = 200ng

of DNA from 24 individuals. We sheared 80 µl of the pooled DNA using Covaris technology.

We prepared libraries without gel size selection or PCR enrichment using the Illumina

Genomic PE Sample Prep Kit (Ilumina) and performed target enrichment according to the

Agilent SureSelect protocol. Sequencing was performed by the WTCHG High-throughput

DNA sequencing and MRC hub in Oxford on an Illumina HiSeq2000 (v2 flow cell, one lane

16

of sequencing per pool) generating 2x100 bp reads. Sequence reads for each pool were

mapped to the human reference genome (hg19) using BWA (version 0.5.6)4. Mapped reads

were filtered to remove ambiguous alignments with a quality score of 0 and bases with a call

quality below 22 were masked. Of the remaining reads for each pool 50-60% fell within the

target regions, except for Pool 21 where the on target percentage was significantly lower.

Median coverage for each pool achieved for target regions after filtering was between 2849x

and 5545x. This corresponded to an average coverage of 119x-231x per sample. All pools

had 90% of the target covered at a minimum of 480x. Target regions within the MHC

achieved substantially lower coverage and were excluded from further analysis.

We also sequenced the DNA repair panel in six PPM1D PTV positive individuals using

Illumina TruSeq kits for library preparation to enable sample indexing. Genomic DNA (1.5

µg) was fragmented and the libraries prepared using the Illumina TruSeq Sample Preparation

Kit (index set A). One pool of six libraries (500 ng each) was enriched as before but with the

addition of extra blocking primers targeted against the TruSeq index adapter sequences.

Sequencing was performed at ICR with an Illumina HiSeq2000 (v3 flowcell, one lane)

generating 2x100 bp reads. Mapped reads were filtered to remove ambiguous alignments with

a quality score of 0 and bases with a call quality below 22 were masked. Of the remaining

reads, 41-43% fell within the target region for each individual. Median coverage of the target

for each individual after filtering was between 602x and 690x. All individuals had 90% of the

target covered at a minimum of 50x.

PPM1D Sanger sequencing

We designed primers to PCR amplify and Sanger sequence PPM1D using Exon-Primer from

UCSC Genome Browser (http://genome.ucsc.edu/ from November 2010). Primers and

conditions are available on request. PCR reactions were performed using the QIAGEN

Multiplex PCR Kit (Qiagen). Amplicons were unidirectionally sequenced using the BigDye

Terminator Cycle sequencing kit and an ABI3730 automated sequencer (ABI PerkinElmer).

We analysed the full coding sequence in 2,456 cases and 1,347 controls. As all the mutations

identified in these samples were restricted to exon 6 we sequenced the mutation cluster region

(c.1261-20-c.1695), but not the rest of the gene, in the remaining 5,325 cases and 4,514

controls. We also sequenced the mutation cluster region in all available samples from

relatives of PPM1D PTV positive probands. All sequencing traces were independently

analyzed by two individuals who were blind to the others analysis. Each individual analysed

17

the sequencing with both automated software (Mutation Surveyor, SoftGenetics) and manual

visual inspection. All putative mutations were confirmed by bidirectional sequencing from a

fresh aliquot of the stock DNA. We also undertook Sanger sequencing of the PPM1D cluster

region, in triplicate, in DNA from eight tumour samples and four ovarian stromal samples.

For the cDNA sequencing we established lymphoblastoid cell lines from three individuals

with PPM1D PTVs (cases 20, 23 and 24). RNA was extracted using RNeasy Minikit

(Qiagen) and cDNA synthesised using the ThermoScript RT-PCR system (Invitrogen),

employing standard protocols. We amplified the mutation cluster region using a cDNA-

specific primer, [Forward_ACCACCAGTCAAGTCACTGG;

Reverse_TCTTTCGCTGTGAGGTTGTG] which we sequenced as described above.

Deep PCR amplicon sequencing

In lymphocyte DNA we amplified the PPM1D mutation cluster region and the full coding

sequence and intron-exon boundaries of BRCA1 and BRCA2 using the Multiplex PCR Kit

(Qiagen). We prepared indexed libraries of the PCR products using Nextera technology

(Ilumina)5. We created two pools of 24 indexed libraries which we sequenced using an

Illumina MiSeq, generating 2x150 bp reads. Data from 20 individuals passed quality control

coverage metrics, generating median coverage greater than 500x across the PPM1D cluster

region (average median coverage 3384x).

For the tumour analyses we amplified the mutation cluster region in tumour, stroma and

blood DNA using an Illumina Nextera XT library preparation kit and supplied protocol

(Illumina). To attain the required 1ng input for tagmentation we also amplified BRCA1 in 24

samples as described above and we then created one pool of 24 indexed libraries which we

sequenced using an Illumina MiSeq, generating 2x150bp reads. We visually inspected all

sequencing reads present at the mutation site after alignment with Stampy to determine if the

PPM1D mutation was present.

NGS data analysis

DNA repair panel data

For the pooled DNA repair panel analysis, variant calling was undertaken with Syzygy

(version 1.2.4)6. 402/439 previously validated SNPs with a MAF>5% genotyped through a

breast cancer GWAS7 were successfully identified with high confidence and the remaining 37

18

SNPs were detected at lower confidence. Syzygy also detected 75/80 rare variants

(MAF<1%) included in the study as positive controls (24/26 base substitutions, 14/14

insertions, 30/32 deletions and 7/8 complex indels, Supplementary Table 3). Thus sensitivity

was 99.6% for base substitutions and 94.4% for rare indels. Frequency estimation for rare

variants was assessed by evaluation of 39 BRCA1 and BRCA2 variants at a frequency of one

per pool. Syzygy correctly estimated the frequency in 33 of the 35 variants it detected,

incorrectly estimating the frequency at two per pool for the remaining two variants.

Deep PCR amplicon sequencing data

For the deep PCR amplicon sequencing and the indexed DNA repair panel sequencing in six

individuals, sequence reads were mapped to the human reference genome (hg19) using

Stampy version 1.0.148. Duplicate reads were flagged using Picard version 1.60

(http://picard.sourceforge.net). Variant calling was performed with Platypus version 0.1.9

(http://www.well.ox.ac.uk/platypus)9. The mutant read percentage was calculated as the

proportion of total reads at the variant location that contained the variant, with a minimum

mutant read percentage threshold of 5%.

Variant Annotation

Annotation for all experiments was undertaken with reference to CCDS transcripts from

EnsEMBL version 65 identified using a custom Perl script (Supplementary Table 2). Variant

calls were annotated for changes with respect to the chosen transcript and assigned a

consequence type from the list used by EnsEMBL.

PTV Prioritisation Method

This is a gene-based (rather than the more typical variant-based) strategy that aims to

prioritise potential disease-associated genes for follow-up by leveraging two properties of

protein truncating variants: (1) the strong association of rare truncating variants with disease,

and (2) collapsibility; different PTVs within a gene typically result in the same functional

effect and can be combined equally. We implemented the method using the stats package in

R. We first outputted all the predicted protein truncating variants: stop gains, coding

frameshifts and essential splice site variants (-2, -1, +1, +2, +5). For this experiment we

defined ‘rare’ as PTVs that were seen only once in the DNA repair panel data. We next

stratified the genes according to the number of different, rare singleton PTVs called. We

excluded genes for which samples had been included as positive controls (Supplementary

19

Table 3). PPM1D was the top gene in this analysis. We are undertaking further analyses and

follow-up of the DNA repair panel data which we aspire to present in a separate publication.

MLPA

We designed 22 probe pairs targeting PPM1D PTVs (n=18), wildtype PPM1D (n=2),

wildtype BRCA1 (n=1) and wildtype CEP112 (n=1) (Supplementary Table 7). We added the

synthetic probes to the SALSA MLPA probe mix P200 (MRC Holland). MLPA reactions

were performed in triplicate according to the manufacturer’s instructions. MLPA was

undertaken in lymphocyte DNA from 17 probands and in eight tumour DNA samples (from

five individuals). In brief, probes were hybridised to 150 ng of denatured DNA, amplified by

PCR, and separated on an ABI 3130 Genetic Analyzer (Applied Biosystems). Data were

analysed using GeneMarker v1.51 software (SoftGenetics).

Microsatellite analysis

We used 5’6- FAM tagged primer pairs and PCR conditions for 17q microsatellite analysis as

listed in Supplementary Table 8. 10 μl of a mastermix of 30 μl ROX size standard and 1ml

HiDi formamide were added to each reaction post PCR, denatured at 95ºC for 5 minutes, and

cooled at -20ºC for 5 minutes. Reactions were run on a 3730xL genetic analyser (Applied

Biosystems) under the fragment analysis protocol. Data were analysed using GeneMarker

v1.51 software (SoftGenetics). Microsatellite analysis was undertaken in lymphocyte DNA

from 13 individuals from eight families, and in eight tumour DNA samples and four stroma

DNA samples from five individuals. Of note, one of these cases (17) harbours both BRCA1

and PPM1D mutations. Both genes are located at chromosome 17q and it is the wild-type

BRCA1 allele that is reduced in the tumours and therefore the relevance of the loss of

heterozygosity with respect to PPM1D is difficult to deduce.

Cell line and plasmid constructs

The U2OS, HeLa and HEK293 (all p53 wildtype) cell lines were obtained from the American

Type Culture Collection (ATCC). Cells were cultured and maintained according to the

supplier’s instructions. Cells were transfected with plasmid DNA using Lipofectamine 2000

(Invitrogen). A plasmid containing full-length wildtype PPM1D cDNA (pCMV6 entry-

PPM1D) was obtained from Origene, and the PPM1D open reading frame (ORF) subcloned

into pCMV6-AN-HA (Origene), generating a construct that could express a PPM1D - N-

terminal HA epitope fusion protein. Truncating mutations were introduced into the PPM1D

20

ORF of this construct using the QuickChange II XL Site-Directed Mutagenesis Kit

(Stratagene). To generate the following mutants, we used the following DNA amplification

primers:

PPM1D mutant 1 (c.1384C>T),

forward primer GAGAGAATGTCTAAGGTGTAGTC,

reverse primer GACTACACCTTAGACATTCTCTC,

PPM1D mutant 2 (c.1420delC),

forward primer GATCCAGAACCATTGAAG,

reverse primer CTTCAATGGTTCTGGATC.

Western Blot Analysis of P53 levels

U2OS, HeLa and HEK293 cells were transfected with PPM1D expression constructs and 24

hours after transfection, cells were exposed to gamma irradiation (5 Gy) from an X ray

source. Whole cell lysates were generated from transfected cells after irradiation (at 30

minute and four hour time points) and subjected to protein electrophoresis. Immunoblotting

of electrophoresed lysates was performed using antibodies specific for p53 (9282S - Cell

Signaling Technology) and actin (sc-1616, Santa Cruz Biotech).

Frequency and Risk Estimation

Statistical analyses were performed using the stats package in R. The significance of mutation

clustering was modelled under a binomial distribution where the probability of observing a

mutation in the last exon, which comprises 31% of the coding sequence, was 0.31. The

frequency in BRCA1/BRCA2 carriers and non-carriers was compared using a two-sided test of

proportions. Risk estimation was implemented using a competing risks retrospective

likelihood model incorporating age at onset according to a proportional hazards model. Since

individuals screened for PPM1D mutations were selected on the basis of both personal and

family history of breast or ovarian cancer, standard methods of analysis that ignore the

sampling frame would yield biased estimates of the risk ratios. To address this, we analysed

data within a retrospective cohort approach by modelling the conditional likelihood of the

observed genotypes given the disease phenotypes, using information on breast and ovarian

cancer occurrence in the set of 6,577 unrelated individuals negative for BRCA1/2 mutations

(BRCA1/2 mutation-positive individuals from the BOCS series and all the unselected ovarian

case series were excluded) and controls. Male controls were included in the analysis, but were

not considered to be at risk of developing breast or ovarian cancer. We assumed a competing

21

risks model, under which, each individual was at risk of developing breast or ovarian cancer.

This has been shown to provide unbiased estimates of the risk ratios for breast and ovarian

cancer where a genetic variant may be associated with one or both of the diseases10. We

estimated the PPM1D mutation carrier frequency in the population and breast and ovarian

cancer risk ratios simultaneously. Since mutation screened probands may have been selected

on the basis of bilateral breast cancer diagnosis or on the basis of both breast and ovarian

cancer diagnosis we allowed for the risks of breast or ovarian cancer diagnosis after the first

cancer diagnosis, including the risk of contralateral breast cancer. This model assumes that

the increased breast cancer (including contralateral) or ovarian cancer risk after the first

cancer diagnosis is entirely due to the susceptibility as defined by the model, with no

additional variation in risk. Site-specific cancer risks were assumed to be independent

conditional on genotype. Therefore the incidence of cancer at the second site was assumed to

be the same as if the preceding cancer had not occurred, with the exception of contralateral

breast cancer incidence after the first breast cancer, which was assumed to be half the overall

breast cancer incidence, since only one breast was at risk. In all models females were

censored at age 80 years. We assumed that the breast and ovarian cancer incidences depend

on the underlying PPM1D genotype through models of the form: λ(t) = λ0(t)exp(βx)where

λ0(t) is the baseline incidence at age t in non-mutation carriers, β is the log risk ratio

associated with the mutation and x takes value 0 for non-mutation carriers and 1 for mutation

carriers. The overall breast and ovarian cancer incidences, over all genotypes, were

constrained to agree with the population incidences for England and Wales in the period of

1993-199711, as described previously12.13. The models were parameterised in terms of the

mutation frequencies and log-risk ratios for breast and ovarian cancer. Parameters were

estimated using maximum likelihood estimation and were implemented in the pedigree

analysis software MENDEL14. The variances of the parameters were obtained by inverting

the observed information matrix. To obtain confidence intervals for the risk ratios and

perform hypothesis testing, log risk ratios were assumed to be normally distributed. A Wald

test-statistic was used to test the null hypothesis that β=0 for both breast and ovarian cancer.

Since PPM1D mutations were not found to segregate within families, we did not take into

account precise family histories or pedigree information and therefore did not incorporate the

effects of other susceptibility genes.

22

References for Online Methods 1 Tavassoli, F. A. & Devilee, P. in Pathology and Genetics of Tumours of the Breast and

Female Genital Organs (IARC Press, Lyon, France, 2003).

2 Hernandez, L. et al. Genomic and mutational profiling of ductal carcinomas in situ and matched adjacent invasive breast cancers reveals intra-tumour genetic heterogeneity and clonal selection. J. Pathol. 227, 42-52 (2012).

3 Gnirke, A. et al. Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing. Nat. Biotechnol. 27, 182-189 (2009).

4 Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754-1760 (2009).

5 Caruccio, N. Preparation of next-generation sequencing libraries using Nextera technology: simultaneous DNA fragmentation and adaptor tagging by in vitro transposition. Methods Mol. Biol. 733, 241-255 (2011).

6 Rivas, M. A. et al. Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease. Nat. Genet. 43, 1066-1073 (2011).

7 Turnbull, C. et al. Genome-wide association study identifies five new breast cancer susceptibility loci. Nat. Genet. 42, 504-507 (2010).

8 Lunter, G. & Goodson, M. Stampy: a statistical algorithm for sensitive and fast mapping of Illumina sequence reads. Genome Res. 21, 936-939 (2011).

9 Rimmer, A., Mathieson, I., Lunter, G. & McVean, G. Platypus: An Integrated Variant Caller (www.well.ox.ac.uk/platypus) (2012).

10 Barnes, D. et al. Evaluation of association methods for analysing modifiers of disease risk in carriers of high risk mutations. Genet. Epidemiol. 36, 274-291 (2012).

11 Cancer incidence in five continents. Volume VIII IARC Sci. Publ. 1-781 (2002).

12 Antoniou, A. C. et al. Evidence for further breast cancer susceptibility genes in addition to BRCA1 and BRCA2 in a population-based study. Genet. Epidemiol. 21, 1-18 (2001).

13 Antoniou, A. C. & Easton, D. F. Polygenic inheritance of breast cancer: Implications for design of association studies. Genet. Epidemiol. 25, 190-202 (2003).

14 Lange, K., Weeks, D. & Boehnke, M. Programs for Pedigree Analysis: MENDEL, FISHER, and dGENE. Genet. Epidemiol. 5, 471-472 (1988).

ID PPM1D mutation Cancer (age in yrs)

1a c.1270_1363dup94 Ov ca (64), Bil br ca (43,56)

2 c.1272delGGinsC Br ca (34)

3 a c.1337C>G_p.S446X Ov ca (43), Bladder ca (55)

4 a c.1340delA Br ca (46)

5 c.1340delA Br ca (65)

6 c.1384C>T_p.Q462X Br ca (59)

7 c.1420delC Ov ca (68), Br ca (71)

8 c.1430delA Br ca (44)

9 c.1434C>A_p.C478X Br ca (40)

10 c.1448delC Br ca (41)

11 c.1451delT Ov ca (67)

12 c.1451delT Bil br ca (61,76)

13 c.1451T>G_p.L484X Br ca (65)

14 c.1455_1456delGA Br ca (70)

15 c.1465delT Ov ca (60), Bil br ca (50,55)

16 c.1518delT Ov ca (69)

17 b c.1519delG Ov ca (40), Bil br ca (36,40)

18 c.1535delA Br ca (46)

19 c.1536insG Ov ca (47)

20 c.1538delT Ov ca (60) Br ca (55)

21 c.1538_1551del14 Ov ca (41)

22 c.1589delC Ov ca (69) Colorectal (69)

23 c.1600_1601delTT Br ca (62)

24 c.1613T>A_p.L538X Br ca (63)

25 c.1637_1638dupTG Ov ca (76)

26 c.1412delC control

a PPM1D mutations

b PPM1D gene

c PPM1D protein

d PPM1D mutation cluster region

420 546 Mutation Cluster Region

AAA 98

Phosphatase domain NLS

Exon1 Exon 2 Exon 6 Exon 3 Exon 4 Exon 5

Mutation Cluster Region

1261

Case 24: c.1613T>A_p.L538X

Case 6 Case 15 Case 17

Case 20 Case 22 Case 24

b

a Case 20: c.1538delT Case 23: c.1600_1601delTT

c Case 17 Case 24

293T

p53

yH2AX

Actin

52kDA —

14kDA —

38kDA —

EMPTY PPM1D

WT PPM1D

c.1384 C>T PPM1D

c.1420 delC

-IR 0.5 4 -IR 0.5 4 -IR 0.5 4 -IR 0.5 4

a)

Method PPM1D full gene Sanger sequencing

Samples 2456 cases 1347 controls

PPM1D PTVs 10 cases 0 control

b)

Method PPM1D mutation cluster region (MCR) Sanger sequencing

Samples 5325 cases 4514 controls

PPM1D PTVs 15 cases 1 control

Phase 2— case-control PPM1D sequencing

Method NGS of custom pulldown including 507 DNA repair genes

Samples 1150 cases (in 48 pools of 24 samples)

PPM1D PTVs 5 cases

Phase 1— case only DNA repair panel sequencing

Supplementary Figure 1. Samples, sequencing methods and PPM1D PTVs identified in different phases of the experiment

Supplementary Figures 1 - 8 Page 1 of 14

Supplementary Figure 2: Coverage of custom pulldown by pool

 

Bars indicate the absolute (left) and relative (right) number of reads obtained from sequencing the total number of reads (total), mapped to the reference (mapped), remaining after application of quality filters (filtered) and high quality reads mapped to the target regions (target). Mapped reads were filtered to remove ambiguous alignments with a quality score of 0 and bases with a call quality below 22 were masked. Of the remaining reads of each pool 50-60% fell within the target regions, except for Pool 21, where the on target percentage was significantly lower.

Supplementary Figures 1 - 8 Page 2 of 14

CASE 2:  c.1272delGGinsC 

Mutant 

Wildtype 

CASE 1:  c.1270_c.1363dup94 

CASE 3:  c.1337C>G_p.S446X 

CASE 4:  c.1340delA 

Mutant 

Wildtype 

CASE 5:  c.1340delA 

Mutant 

Wildtype 

CASE 8:  c.1430delA* 

CASE 7:  c.1420delC 

CASE 12:  c.1451delT* 

CASE 6:  c.1384C>T_p.Q462X 

CASE 9:  c.1434C>A_p.C478X 

CASE 10:  c.1448delC* 

CASE 11:  c.1451delT* 

Supplementary Figure 3. Sanger sequencing chromatograms for 26 PPM1D mutations

Supplementary Figures 1 - 8 Page 3 of 14

Mutant 

Wildtype 

Mutant 

Wildtype 

Mutant 

Wildtype 

CASE16:  c.1518delT 

CASE13:  c.1451T>G_p.L484X 

CASE14:  c.1455_1456delGA 

CASE15:  c.1465delT 

CASE17:  c.1519delG* 

CASE18:  c.1535delA* 

CASE19:  c.1536insG 

CASE20: blood c.1538delT 

CASE21:  c.1538_1551del14 

CASE22:  c.1589delC 

CASE20: saliva c.1538delT   

CASE23:  c.1600_1601delTT 

Supplementary Figures 1 - 8 Page 4 of 14

Mutant 

Wildtype 

The mutant allele is lower than typical of heterozygous mutations, consistent with mosaicism. *indicates that the reverse sequencing trace is presented.

26: Control c.1412delC 

CASE25:  c.1637_1638dupTG  

CASE24: blood c.1613T>A_p.L538X 

CASE24: saliva c.1613T>A_p.L538X 

Supplementary Figures 1 - 8 Page 5 of 14

Case 1 Case 2 Case 3 Case 4M

utan

t rea

d pe

rcen

tage

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Coverage Coverage Coverage Coverage

Case 5 Case 6 Case 8 Case 9

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Coverage Coverage Coverage Coverage

Case 11 Case 12 Case 13 Case 15

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

geCoverage Coverage Coverage Coverage

Case 17 Case 18 Case 20 Case 21

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Coverage Coverage Coverage Coverage

Case 22 Case 23 Case 24 Case 26

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Coverage Coverage Coverage Coverage

Case 25

Mutant read percentage is calculated as the proportion of reads containing the variant. The red dot indicates the PPM1D mutation. In case 2, the complex indel was called as two different mutations and thus two red dots. Variants were censored at 5%. All mutations have a consistently lower mutant read percentage, indicating mosaicism. Open dots represent variants in BRCA1 or BRCA2.

Supplementary Figure 4. Deep PCR amplicon sequencing of BRCA1, BRCA2 and PPM1D cluster region showing mosaic mutations.

Supplementary Figures 1 - 8 Page 6 of 14

CASE 1: c.1270_1363dup94 CASE 6: c.1384C>T_p.Q462X

CASE 7: c.1420delC CASE 9: c.1434C>A_p.C478X

CASE 11: c.1451delT CASE 13: c.1451T>G_p.L484X

CASE 14: c.1455_1456delGA CASE 15: c.1465delT

CASE 16: c.1518delT CASE 17: c.1519delG

CASE 18: c.1535delA CASE 19: c.1536insG

CASE 20: c.1538delT CASE 21: c.1538_1551del14

CASE 22: c.1589delC CASE 24: c.1613T>A_p.L538X

CASE 25: c.1637_1638dupTG

Supplementary Figure 5. MLPA profiles showing PPM1D mutations.

Supplementary Figures 1 - 8 Page 7 of 14

Case 7 Case 14

Case 15 Case 16

Case 20 Case 21

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Mut

ant r

ead

perc

enta

ge

Coverage Coverage

Coverage Coverage

Coverage Coverage

Supplementary Figure 6. DNA repair panel individual sequencing in six PPM1D mutation carriers.

Mutant read percentage was calculated as in Supplementary Figure 4. Read coverage was recorded at all variant sites, counting only bases within reads with a mapping quality of at least 20 and a base quality of at least 22. A window around the mutation containing at least 50 variants with similar coverage was identified. The red dot indicates the PPM1D mutation. Open dots represent other variants in the custom pulldown which were not validated. All PPM1D mutations were consistently lower, indicating mosaicism. Mutant read percentages for cases 15, 20 and 21 matched those in Supplementary Figure 4.

Supplementary Figures 1 - 8 Page 8 of 14

Supplementary Figure 7: The effect of mutant PPM1D isoforms on p53 activation

293T

EMPTY

-IR 0.5 4 ‐IR 0.5 4 ‐IR 0.5 4 ‐IR 0.5 4

PPM1Dc.1420delC

PPM1Dc. 1384C>T

PPM1DWT

p53Panel 1

Hela

ActinPanel 2

Panel 2

EMPTY

-IR 0.5 4 ‐IR 0.5 4 ‐IR 0.5 4 ‐IR 0.5 4

PPM1Dc.1420delC

PPM1Dc. 1384C>T

PPM1DWT

Actin

p53 Panel 1

p53 wild type HeLa and HEK293 cells were transfected with PPM1D cDNA expression constructs and exposed to ionising irradiation (5 Grays). At 30 minute and four hour intervals after IR exposure whole cell lysates were generated and western blotted to estimate the IR induced activation of p53. Western blots showing p53 and actin(loading control) protein levels at different times (in hours) after IR exposure are shown. ‘Empty’ represents cells transfected with an empty expression construct, ‘PPM1D WT’ represents cells transfected with a wild type PPM1D cDNA expression construct and ‘PPM1D c.1384C>T’ and ‘PPM1D c.1420delC’ represent cells transfected with mutant PPM1D cDNA constructs. The suppression of p53 was enhanced in cells transfected with the mutant constructs suggesting these alleles encode hyperactive PPM1D isoforms.

Supplementary Figures 1 - 8 Page 9 of 14

Microsatellite intensities D17S890 Lymphocyte DNA

Ovarian stroma

Ovarian tumour

D17S1838 Lymphocyte DNA

Ovarian stroma

Ovarian tumour Ovarian tumour

D17S916 Lymphocyte DNA

Ovarian stroma

Sanger sequencing chromatograms

Lymphocyte DNA

Ovarian tumour

Ovarian stroma

MLPA intensities MLPA dosage plots

Lymphocyte DNA

Ovarian tumour

M

M

Supplementary Figure 8. Tumour haplotype analysis, Sanger sequencing and MLPA analysis

Case 11 c.1451delT

Supplementary Figures 1 - 8 Page 10 of 14

Case 15: c.1465delT

Microsatellite intensities

D17S890 Lymphocyte DNA

Ovarian stroma

Ovarian tumour

D17S1838 Lymphocyte DNA

Ovarian stroma

Ovarian tumour Ovarian tumour

D17S916 Lymphocyte DNA

Ovarian stroma

Sanger sequencing chromatograms

Lymphocyte DNA

Ovarian tumour

Ovarian stroma

MLPA intensities MLPA dosage plots

Lymphocyte DNA

Ovarian tumour

M

M

Supplementary Figures 1 - 8 Page 11 of 14

Case 17: c.1519delG

Microsatellite intensities D17S1838 D17S916

Sanger sequencing chromatograms Lymphocyte DNA

Ovarian tumour

Ovarian stroma

MLPA intensities MLPA dosage plots Lymphocyte DNA

Ovarian tumour

Breast tumour (left)

Breast tumour (right)

Breast tumour (left)

Breast tumour (right)

D17S890 Lymphocyte DNA

Ovarian stroma

Ovarian tumour

Breast tumour (left)

Breast tumour (right)

M

M

M

M

Lymphocyte DNA

Ovarian stroma

Ovarian tumour

Breast tumour (left)

Breast tumour (right)

Lymphocyte DNA

Ovarian stroma

Ovarian tumour

Breast tumour (left)

Breast tumour (right)

Supplementary Figures 1 - 8 Page 12 of 14

Case 20: c.1538delT

Microsatellite intensities D17S1838 D17S916

Sanger sequencing chromatograms

Lymphocyte DNA

Ovarian tumour

Ovarian stroma

MLPA intensities MLPA dosage plots

Lymphocyte DNA

Ovarian tumour

Breast tumour

Breast tumour

D17S890 Lymphocyte DNA

Ovarian stroma

Ovarian tumour

Breast tumour

Lymphocyte DNA

Ovarian stroma

Ovarian tumour

Breast tumour

Lymphocyte DNA

Ovarian stroma

Ovarian tumour

Breast tumour

M

M

M

Supplementary Figures 1 - 8 Page 13 of 14

Case 1 c.1270_1363dup94

Microsatellite intensities

D17S890 Lymphocyte DNA

Ovarian tumour

D17S1838 Lymphocyte DNA

Ovarian tumour Ovarian tumour

D17S916 Lymphocyte DNA

Sanger sequencing chromatograms

Lymphocyte DNA

Ovarian tumour

MLPA intensities MLPA dosage plots

Lymphocyte DNA

Ovarian tumour

M

M

Microsatellite data demonstrates two alleles in DNA from lymphocytes and stroma and loss of heterozygosity in tumours in cases 1, 11, 17 and 20. There is no loss of heterozygosity in the tumour DNA from case 15. The Sanger sequencing and MLPA data demonstrate that the mosaic mutations in lymphocyte DNA are not detectable in stromal or tumour DNA. The data from deep PCR amplicon sequencing of tumour and stromal DNA is presented in Supplementary Table 5.

Supplementary Figures 1 - 8 Page 14 of 14

The Wellcome Trust Case Control Consortium

The following individuals are part of the Wellcome Trust Case Control Consortium+ Jan Aerts1, Tariq Ahmad2, Hazel Arbury1, Anthony Attwood1,3,4, Adam Auton5, Stephen G Ball6,

Anthony J Balmforth6, Chris Barnes1, Jeffrey C Barrett1, Inês Barroso1, Anne Barton7, Amanda J Bennett8, Sanjeev Bhaskar1, Katarzyna Blaszczyk9, John Bowes7, Oliver J Brand8,10, Peter S Braund11, Francesca Bredin12, Gerome Breen13,14, Morris J Brown15, Ian N Bruce7, Jaswinder Bull16, Oliver S Burren17, John Burton1, Jake Byrnes18, Sian Caesar19, Niall Cardin5, Chris M Clee1, Alison J Coffey1, John MC Connell20, Donald F Conrad1, Jason D Cooper17, Anna F Dominiczak20, Kate Downes17, Hazel E Drummond21, Darshna Dudakia16, Andrew Dunham1, Bernadette Ebbs16, Diana Eccles22, Sarah Edkins1, Cathryn Edwards23, Anna Elliot16, Paul Emery24, David M Evans25, Gareth Evans26, Steve Eyre7, Anne Farmer14, I Nicol Ferrier27,

Edward Flynn7, Alistair Forbes28, Liz Forty29, Jayne A Franklyn10,30, Timothy M Frayling2, Rachel M Freathy2, Eleni Giannoulatou5, Paul Gilbert7, Katherine Gordon-Smith19,29, Emma Gray1, Elaine Green29, Chris J Groves8, Detelina Grozeva29, Rhian Gwilliam1, Naomi Hammond1,

Matt Hardy17, Pile Harrison31, Neelam Hassanali8, Husam Hebaishi1, Sarah Hines16, Anne Hinks7,

Graham A Hitman32, Lynne Hocking33, Chris Holmes5, Eleanor Howard1, Philip Howard34,

Joanna MM Howson17, Debbie Hughes16, Sarah Hunt1, John D Isaacs35, Mahim Jain18, Derek P Jewell36, Toby Johnson34, Jennifer D Jolley3,4, Ian R Jones29, Lisa A Jones19, George Kirov29,

Cordelia F Langford1, Hana Lango-Allen2, G Mark Lathrop37, James Lee12, Kate L Lee34, Charlie Lees21, Kevin Lewis1, Cecilia M Lindgren8,18, Meeta Maisuria-Armer17, Julian Maller18, John Mansfield38, Jonathan L Marchini5, Paul Martin7, Dunecan CO Massey12, Wendy L McArdle39,

Peter McGuffin14, Kirsten E McLay1, Gil McVean5,18, Alex Mentzer40, Michael L Mimmack1,

Ann E Morgan41, Andrew P Morris18, Craig Mowat42, Patricia B Munroe34, Simon Myers18,

William Newman26, Elaine R Nimmo21, Michael C O'Donovan29, Abiodun Onipinla34, Nigel R Ovington17, Michael J Owen29, Kimmo Palin1, Aarno Palotie1, Kirstie Parnell2, Richard Pearson8, John RB Perry2,18, Anne Phillips42, Vincent Plagnol17, Natalie J Prescott9, Inga Prokopenko8,18,

Michael A Quail1, Suzanne Rafelt11, Nigel W Rayner8,18, David M Reid33, Anthony Renwick16,

Susan M Ring39, Neil Robertson8,18, Samuel Robson1, Ellie Russell29, David St Clair13, Jennifer G Sambrook3,4, Jeremy D Sanderson40, Stephen J Sawcer43, Helen Schuilenburg17, Carol E Scott1,

Sheila Seal16, Sue Shaw-Hawkins34, Beverley M Shields2, Matthew J Simmonds8,10, Debbie J Smyth17, Elilan Somaskantharajah1, Katarina Spanova16, Sophia Steer44, Jonathan Stephens3,4,

Helen E Stevens17, Kathy Stirrups1, Millicent A Stone45,46, David P Strachan47, Zhan Su5,

Deborah PM Symmons7, John R Thompson48, Wendy Thomson7, Martin D Tobin48, Mary E Travers8, Clare Turnbull16, Damjan Vukcevic18, Louise V Wain48, Mark Walker49, Neil M Walker17, Chris Wallace17, Margaret Warren-Perry16, Nicholas A Watkins3,4, John Webster50,

Michael N Weedon2, Anthony G Wilson51, Matthew Woodburn17, B Paul Wordsworth52, Chris Yau5, Allan H Young27,53, Eleftheria Zeggini1, Matthew A Brown52,54, Paul R Burton48, Mark J Caulfield34, Alastair Compston43, Martin Farrall55, Stephen CL Gough8,10,30, Alistair S Hall6, Andrew T Hattersley2,56, Adrian VS Hill18, Christopher G Mathew9, Marcus Pembrey57, Jack Satsangi21, Michael R Stratton1,16, Jane Worthington7, Matthew E Hurles1, Audrey Duncanson58, Willem H Ouwehand1,3,4, Miles Parkes12, Nazneen Rahman16, John A Todd17, Nilesh J Samani11,59, Dominic P Kwiatkowski1,18, Mark I McCarthy8,18,60, Nick Craddock29, Panos Deloukas1, Peter Donnelly5,18.

1 The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA UK. 2 Genetics of Complex Traits, Peninsula College of Medicine and Dentistry University of Exeter, EX1 2LU, UK. 3 Department of Haematology, University of Cambridge, Long Road, Cambridge, CB2 0PT, UK. 4 National Health Service Blood and Transplant, Cambridge Centre, Long Road, Cambridge CB2 0PT, UK. 5 Department of Statistics, University of Oxford, 1 South Parks Road, Oxford, OX1 3TG, UK. 6 Multidisciplinary Cardiovascular Research Centre (MCRC), Leeds Institute of Genetics, Health and

Therapeutics (LIGHT), University of Leeds, Leeds, LS2 9JT, UK. 7 arc Epidemiology Unit, Stopford Building, University of Manchester, Oxford Road, Manchester, M13 9PT,

UK. 8 Oxford Centre for Diabetes, Endocrinology and Medicine, University of Oxford, Churchill Hospital, Oxford

OX3 7LJ, UK. 9 Department of Medical and Molecular Genetics, King’s College London School of Medicine, 8th Floor Guy’s

Tower, Guy’s Hospital, London, SE1 9RT, UK. 10 Centre for Endocrinology, Diabetes and Metabolism, Institute of Biomedical Research, University of

Birmingham, Birmingham, B15 2TT, UK. 11 Department of Cardiovascular Sciences, University of Leicester, Glenfield Hospital, Groby Road, Leicester

LE3 9QP, UK. 12 IBD Genetics Research Group, Addenbrooke's Hospital, Cambridge, CB2 0QQ, UK. 13 University of Aberdeen, Institute of Medical Sciences, Foresterhill, Aberdeen AB25 2ZD, UK. 14 SGDP, The Institute of Psychiatry, King's College London, De Crespigny Park, Denmark Hill, London SE5

8AF, UK. 15 Clinical Pharmacology Unit, University of Cambridge, Addenbrookes Hospital, Hills Road, Cambridge CB2

2QQ, UK. 16 Section of Cancer Genetics, Institute of Cancer Research, 15 Cotswold Road, Sutton SM2 5NG, UK. 17 Juvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, Department

of Medical Genetics, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Cambridge CB2 0XY, UK.

18 The Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Oxford OX3 7BN, UK.

19 Department of Psychiatry, University of Birmingham, National Centre for Mental Health, 25 Vincent Drive, Birmingham, B15 2FG, UK.

20 BHF Glasgow Cardiovascular Research Centre, University of Glasgow, 126 University Place, Glasgow, G12 8TA, UK.

21 Gastrointestinal Unit, Division of Medical Sciences, School of Molecular and Clinical Medicine, University of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UK.

22 Faculty of Medicine, University of Southampton and Wessex Clinical Genetics Service, UHSFT, Southampton, UK. SO16 5YA

23 Endoscopy Regional Training Unit, Torbay Hospital, Torbay TQ2 7AA, UK. 24 Academic Unit of Musculoskeletal Disease, University of Leeds, Chapel Allerton Hospital, Leeds, West

Yorkshire LS7 4SA, UK. 25 MRC Centre for Causal Analyses in Translational Epidemiology, Department of Social Medicine, University

of Bristol, Bristol, BS8 2BN, UK. 26 Department of Medical Genetics, Manchester Academic Health Science Centre (MAHSC), University of

Manchester, Manchester M13 0JH, UK. 27 School of Neurology, Neurobiology and Psychiatry, Royal Victoria Infirmary, Queen Victoria Road,

Newcastle upon Tyne, NE1 4LP, UK. 28 Institute for Digestive Diseases, University College London Hospitals Trust, London NW1 2BU, UK. 29 MRC Centre for Neuropsychiatric Genetics and Genomics, School of Medicine, Cardiff University, Heath

Park, Cardiff, CF14 4XN, UK. 30 University Hospital Birmingham NHS Foundation Trust, Birmingham, B15 2TT, UK. 31 University of Oxford, Institute of Musculoskeletal Sciences, Botnar Research Centre, Oxford, OX3 7LD, UK. 32 Centre for Diabetes and Metabolic Medicine, Barts and The London, Royal London Hospital, Whitechapel,

London, E1 1BB, UK. 33 Bone Research Group, Department of Medicine and Therapeutics, University of Aberdeen, Aberdeen, AB25

2ZD, UK.

34 Clinical Pharmacology and Barts and The London Genome Centre, William Harvey Research Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, Charterhouse Square, London EC1M 6BQ, UK.

35 Institute of Cellular Medicine, Musculoskeletal Research Group, 4th Floor, Catherine Cookson Building, The Medical School, Framlington Place, Newcastle upon Tyne, NE2 4HH, UK.

36 Gastroenterology Unit, Radcliffe Infirmary, University of Oxford, Oxford, OX2 6HE, UK. 37 Centre National de Genotypage, 2, Rue Gaston Cremieux, Evry, Paris 91057, France. 38 Department of Gastroenterology & Hepatology, University of Newcastle upon Tyne, Royal Victoria Infirmary,

Newcastle upon Tyne NE1 4LP, UK. 39 ALSPAC Laboratory, Department of Social Medicine, University of Bristol, BS8 2BN, UK. 40 Division of Nutritional Sciences, King's College London School of Biomedical and Health Sciences, London

SE1 9NH, UK. 41 NIHR-Leeds Musculoskeletal Biomedical Research Unit, University of Leeds, Chapel Allerton Hospital,

Leeds, West Yorkshire LS7 4SA, UK. 42 Department of General Internal Medicine, Ninewells Hospital and Medical School, Ninewells Avenue, Dundee

DD1 9SY, UK. 43 Department of Clinical Neurosciences, University of Cambridge, Addenbrooke's Hospital, Hills Road,

Cambridge, CB2 2QQ, UK. 44 Clinical and Academic Rheumatology, Kings College Hosptal National Health Service Foundation Trust,

Denmark Hill, London SE5 9RS, UK. 45 University of Toronto, St. Michael's Hospital, 30 Bond Street, Toronto, Ontario M5B 1W8, Canada. 46 University of Bath, Claverdon, Norwood House, Room 5.11a Bath Somerset BA2 7AY, UK. 47 Division of Community Health Sciences, St George's, University of London, London SW17 0RE, UK. 48 Departments of Health Sciences and Genetics, University of Leicester, 217 Adrian Building, University Road,

Leicester, LE1 7RH, UK. 49 Diabetes Research Group, School of Clinical Medical Sciences, Newcastle University, Framlington Place,

Newcastle upon Tyne NE2 4HH, UK. 50 Medicine and Therapeutics, Aberdeen Royal Infirmary, Foresterhill, Aberdeen, Grampian AB9 2ZB, UK. 51 School of Medicine and Biomedical Sciences, University of Sheffield, Sheffield, S10 2JF, UK. 52 Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic

Centre, University of Oxford, Windmill Road, Headington, Oxford, OX3 7LD, UK. 53 UBC Institute of Mental Health, 430-5950 University Boulevard Vancouver, British Columbia, V6T 1Z3,

Canada. 54 Diamantina Institute of Cancer, Immunology and Metabolic Medicine, Princess Alexandra Hospital, University

of Queensland, Ipswich Road, Woolloongabba, Brisbane, Queensland, 4102, Australia. 55 Cardiovascular Medicine, University of Oxford, Wellcome Trust Centre for Human Genetics, Roosevelt Drive,

Oxford OX3 7BN, UK. 56 Genetics of Diabetes, Peninsula College of Medicine and Dentistry, University of Exeter, Barrack Road,

Exeter, EX2 5DW, UK. 57 Clinical and Molecular Genetics Unit, Institute of Child Health, University College London, 30 Guilford

Street, London WC1N 1EH, UK. 58 The Wellcome Trust, Gibbs Building, 215 Euston Road, London NW1 2BE, UK. 59 Leicester NIHR Biomedical Research Unit in Cardiovascular Disease, Glenfield Hospital, Leicester, LE3 9QP,

UK. 60 Oxford NIHR Biomedical Research Centre, Churchill Hospital, Oxford, OX3 7LJ, UK.

The Breast and Ovarian Cancer Susceptibility Collaboration (BOCS)

The following individuals are part of Breast and Ovarian Cancer Susceptibility Collaboration (BOCS)

A. Ardern-Jones, J. Adlard, M. Ahmed, G. Attard, K. Bailey, E. Bancroft, C. Bardsley, D. Barton, J. Barwell, L. Baxter, R. Belk, J. Berg, B. Bernhard, T. Bishop, L. Boyes, N. Bradshaw, A.F. Brady, S. Brant, C. Brewer, G. Brice, G. Bromilow, C. Brooks, A. Bruce, B. Bulman, L. Burgess, J. Campbell, N. Canham, B. Castle, R. Cetnarskyj, C. Chapman, O. Claber, N. Coates, T. Cole, A. Collins, J. Cook, S. Coulson, G. Crawford, D. Cruger, C. Cummings, L. D’Mello, R. Davidson, L. Day, L. de Silva, B. Dell, C. Dolling, A. Donaldson, H. Dorkins, F. Douglas, S. Downing, S. Drummond, C. Dubras, J. Dunlop, S. Durrell, D. Eccles, C. Eddy, M. Edwards, E. Edwards, J. Edwardson, R. Eeles, I. Ellis, F. Elmslie, G. Evans, B. Gibbens, C. Gardiner, N. Ghali, C. Giblin, S. Gibson, S. Goff, S. Goodman, D. Goudie, L. Greenhalgh, J. Greer, H. Gregory, D. Halliday, R. Hardy, C. Hartigan, T. Heaton, A. Henderson, C. Higgins, S. Hodgson, T. Holt, T. Homfray, D. Horrigan, C. Houghton, R.S. Houlston, L. Hughes, V. Hunt, L. Irvine, L. Izatt, C. Jacobs, S. James, M. James, L. Jeffers, I. Jobson, W. Jones, M.J. Kennedy, S. Kenwrick, C. Kightley, C. Kirk, L. Kirk, E. Kivuva, K. Kohut, M. Kosicka-Slawinska, A. Kulkarni, A. Kumar, F. Lalloo, N. Lambord, C. Langman, P. Leonard, S. Levene, S. Locker, P. Logan, M. Longmuir, A. Lucassen, V. Lyus, A. Magee, A. Male, S. Mansour, D. McBride, E. McCann, V. McConnell, M. McEntagart, C. McKeown, L. McLeish, D. McLeod, A. Melville, L. Mercer, C. Mercer, Z. Miedzybrodzka, A. Mitra, P. J. Morrison, V. Murday, A. Murray, K. Myhill, J. Myring, E. O'Hara, J. Paterson, P. Pearson, G. Pichert, K. Platt, M. Porteous, C. Pottinger, S. Price, L. Protheroe, S. Pugh, O. Quarrell, K. Randhawa, C. Riddick, L. Robertson, A. Robinson, V. Roffey-Johnson, M. Rogers, S. Rose, S. Rowe, A. Schofield, N. Rahman, S. Saya, G. Scott, J. Scott, A. Searle, S. Shanley, S. Sharif, A. Shaw, J. Shaw, J. Shea-Simonds, L. Side, J. Sillibourne, K. Simon, S. Simpson, S. Slater, S. Smalley, K. Smith, L. Snadden, K. Snape, J. Soloway, Y. Stait, B. Stayner, M. Steel, C. Steel, H. Stewart, D. Stirling, M. Thomas, S. Thomas, S. Tomkins, H. Turner, E. Tyler, A. Vandersteen, E. Wakeling, F. Waldrup, L. Walker, C. Watt, S. Watts, A. Webber, C. Whyte, J. Wiggins, E. Williams, L. Winchester.


Recommended