+ All Categories
Home > Documents > Protein Metrics Inc. - High-Throughput …...High-Throughput Characterization of Complex Crosslinked...

Protein Metrics Inc. - High-Throughput …...High-Throughput Characterization of Complex Crosslinked...

Date post: 31-Aug-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
1
High-Throughput Characterization of Complex Crosslinked Proteins using Byonic and Byologic Chris Becker 1 ; Yong Kil 1 ; Pierre Allemand 1 , St John Skilton 1 , Eric Carlson 1 ; Ryan D. Leib 2 ; Christopher M. Adams 2 1 Protein Metrics Inc. 2 Stanford University Mass Spectrometry, Stanford, CA Introduction Crucial structural information about protein-protein interactions can be obtained using chemical crosslinking methods in mass spectrometry. In practice, these data are incredibly complex to analyze and quantify, requiring extensive and time consuming validation to eliminate false positives or low confidence assignments. Here, we present a new rapid approach to analyze crosslink data using Byonic to identify crosslinked peptides and Byologic to facilitate data validation. This approach reduces the validation time for typical protein-protein systems from hours to minutes, and provides portable and robust data output to aid future investigations. Using this strategy, we have investigated multiple cross linked protein-protein interactions to determine binding interfaces, stoichiometry, and replicability. The Byonic search produces lists of possible peptide linkages containing thousands of potential cross links. Byologic readily condenses these data into related peptide groups, e.g., cross linked peptides with their non-crosslinked homologs, even across replicate experiments or across experimental conditions. This facilitates rapid analysis for cross-link validation at both MS1 and MS2 levels, and easy label-free quantification using XIC analysis. We propose empirical rules, such as peptide length, number of observations, peptide assignment scores, and chromatographic profiles, and demonstrate a method to assign and filter data using ‘comment’ labels to define classes based on analyst confidence. Additionally, data that are not relevant to the current analysis (such as peptides without cross links) can be filtered without loss and investigated at later times based on new developments. Once classifications are made, Byologic produces permanent and flexible report outputs for information transfer between researchers. Acknowledgment Support is gratefully acknowledged from NIH, grant GM100634 and the Stanford Dean of Research. Byologic ® Features Workflow reduces lists of unsorted cross links that number in the 100s to a manageable list of crosslinks of links in the 10s to produce a list of solid leads for structural assignment. The team has demonstrated one can integrate data from multiple linker types and multiple digest types. Based on a list of strong leads, analysts can derive initial structure, and can validate either across linkers or with the ‘uncertain’ matches which fit the alignment. Our integrated software workflow allows exhaustive analysis of these proteins. To date, these analyses have proven to be about an order of magnitude faster for interpretation (ca 1-2 hours as opposed to about ca 2 days with a traditional data analysis approach). [email protected] Methods Cross linked peptides are categorized into three groups: true-positives, uncertain, and false-positives, using a set of empirical criteria detailed below. “True positives” succeed on all of these rules, while “False Positives” fail least three of these rules. This strategy does not guarantee that a given cross link is-or-is-not ‘real,’ but provides an easy transferable, and repeatable framework for analysis. True positives are then used to anchor structural determination, with uncertain assignments filling in based on 3D structural limitations. Thus, only the most reliable mass spectral data is used to complement other structural measurements. Sequence FASTA file MS2 Identifications LC-MS/MS data Inspection and quantification Peptide-Centric sensitive crosslink analysis Search engine Comprehensive identifications Report Crosslink Data Analysis in One Step Rapid Multidimensional Validation and Classification Unreliable Chromatography Data analysis workflow: Results Dashboard Both Crosslink and Native Peptide, Above-&-Below Rapidly group, examine, and validate related variants Discussion and Conclusions Project Window Protein Coverage MS1 isotopes XICs MS2 annotation and m/z errors Peptide Windows Rules-Based Assignments, Across Range of Linkers Co-isolated Precursors MS/MS Alone Looks Reasonable… BUT MS/MS Spectrum Fragmentation Quality XIC Chromatographic Behavior Minimum Peptide and Crosslink Partner Length Observation across replicates (Both MS1 & MS/MS) MS1 Coelution and Possible MS/MS Contamination Auto-populate likely crosslinked peptides Structural Assignment Summary tables are used to efficiently parse the True Positives, False Positives, and Uncertains for review, assignment, and reporting Prot-1 Pr-1 Pr-2 Pr-1 Prot-2 Prot-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1 Pr-1
Transcript
Page 1: Protein Metrics Inc. - High-Throughput …...High-Throughput Characterization of Complex Crosslinked Proteins using Byonic and Byologic Chris Becker1; Yong Kil1; Pierre Allemand1,

High-Throughput Characterization of Complex Crosslinked Proteins using Byonic and Byologic

Chris Becker1; Yong Kil1; Pierre Allemand1, St John Skilton1, Eric Carlson1; Ryan D. Leib2; Christopher M. Adams2

1Protein Metrics Inc. 2Stanford University Mass Spectrometry, Stanford, CA

Introduction

Crucial structural information about protein-protein interactions can be obtained using chemical crosslinking methods in mass spectrometry. In practice, these data are incredibly complex to analyze and quantify, requiring extensive and time consuming validation to eliminate false positives or low confidence assignments. Here, we present a new rapid approach to analyze crosslink data using Byonic to identify crosslinkedpeptides and Byologic to facilitate data validation. This approach reduces the validation time for typical protein-protein systems from hours to minutes, and provides portable and robust data output to aid future investigations.

Using this strategy, we have investigated multiple cross linked protein-protein interactions to determine binding interfaces, stoichiometry, and replicability. The Byonicsearch produces lists of possible peptide linkages containing thousands of potential cross links. Byologic readily condenses these data into related peptide groups, e.g., cross linked peptides with their non-crosslinked homologs, even across replicate experiments or across experimental conditions. This facilitates rapid analysis for cross-link validation at both MS1 and MS2 levels, and easy label-free quantification using XIC analysis. We propose empirical rules, such as peptide length, number of observations, peptide assignment scores, and chromatographic profiles, and demonstrate a method to assign and filter data using ‘comment’ labels to define classes based on analyst confidence. Additionally, data that are not relevant to the current analysis (such as peptides without cross links) can be filtered without loss and investigated at later times based on new developments. Once classifications are made, Byologic produces permanent and flexible report outputs for information transfer between researchers.

This new approach to cross link analysis is rapid, flexible, and extensible to other types of analyses, such as protein-ligand interactions, surface acc

Acknowledgment

Support is gratefully acknowledged from NIH, grant GM100634 and the Stanford Dean of Research.

Byologic® Features

Workflow reduces lists of unsorted cross links that number in the 100s to a manageable list of crosslinks of links in the 10s to produce a list of solid leads for structural assignment.

The team has demonstrated one can integrate data from multiple linker types and multiple digest types.

Based on a list of strong leads, analysts can derive initial structure, and can validate either across linkers or with the ‘uncertain’ matches which fit the alignment.

Our integrated software workflow allows exhaustive analysis of these proteins. To date, these analyses have proven to be about an order of magnitude faster for interpretation (ca 1-2 hours as opposed to about ca 2 days with a traditional data analysis approach).

[email protected]

Methods

Cross linked peptides are categorized into three groups: true-positives, uncertain, and false-positives, using a set of empirical criteria detailed below. “True positives” succeed on all of these rules, while “False Positives” fail least three of these rules. This strategy does not guarantee that a given cross link is-or-is-not ‘real,’ but provides an easy transferable, and repeatable framework for analysis. True positives are then used to anchor structural determination, with uncertain assignments filling in based on 3D structural limitations. Thus, only the most reliable mass spectral data is used to complement other structural measurements.

Sequence

FASTA file

MS2 Identifications

LC-MS/MS data

Inspection and quantificationPeptide-Centric sensitive crosslink analysis

Search engine Comprehensive identifications

Report

Crosslink Data Analysis in One Step

Rapid Multidimensional Validation and Classification

Unreliable Chromatography

Data analysis workflow:

Results Dashboard

Both Crosslink and Native Peptide, Above-&-BelowRapidly group, examine, and validate related variants

Discussion and Conclusions

Project Window

Protein Coverage

MS1 isotopes

XICs

MS2 annotation and m/z errors

Peptide Windows

Rules-Based Assignments, Across Range of Linkers

Co-isolated PrecursorsMS/MS Alone Looks Reasonable…

BUT

• MS/MS Spectrum Fragmentation Quality

• XIC Chromatographic Behavior

• Minimum Peptide and Crosslink Partner Length

• Observation across replicates (Both MS1 & MS/MS)

• MS1 Coelution and Possible MS/MS Contamination

Auto-populate likely crosslinked peptides

Structural Assignment

Summary tables are used to efficiently parse the True Positives, False Positives, and Uncertains for review, assignment, and reporting

Prot-1

Prot-1

Prot-1

Prot-1

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-2

Prot-1

Pr-1

Pr-2

Pr-1

Prot-2 Prot-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Pr-1

Recommended