This journal is © The Royal Society of Chemistry 2020 Mol. Omics, 2020, 16, 147--155 | 147 Cite this: Mol. Omics, 2020, 16, 147 The effectiveness of filtering glycopeptide peak list files for Y ions Robert J. Chalkley, * a Katalin F. Medzihradszky, ab Zsuzsanna Darula, b Adam Pap bc and Peter R. Baker a Intact glycopeptide analysis is becoming more common with developments in mass spectrometry instrumentation and fragmentation approaches. In particular, collision-based fragmentation approaches such as higher energy collisional dissociation (HCD) and radical-driven fragmentation approaches such as electron transfer dissociation (ETD) provide complementary information, but bioinformatic strategies to utilize this combined information are currently lacking. In this work we adapted a software tool, MS-Filter, to search HCD peak list files for predicted Y ions based on matched EThcD results to propose additional glycopeptide assignments. The strategy proved to be extremely powerful for O-glycopeptide data, and also of benefit for N-linked data, where it allowed rescue of low confidence results from database searching. Introduction Intact glycopeptide analysis using mass spectrometry presents significant challenges due to the differing behavior of the peptide and glycan components in the hybrid species. Nevertheless, when analyzing a single protein, collision induced dissociation (CID) strategies have been very effective. In the characteristic fragmenta- tion of glycopeptides using these approaches one of the most prominent fragments in O-linked glycopeptide spectra is typically the gas-phase-deglycosylated peptide (Y 0 ), whereas for N-linked glycopeptides a similarly abundant ion is commonly the peptide retaining the core GlcNAc (Y 1 ) [nomenclature: 1 ]. When analyzing an isolated protein, the masses of the unmodified versions of potentially glycosylated peptides can be calculated, so it is straight- forward to use the smallest Y ion mass to identify the peptide, one can infer the mass of the glycan based on the mass difference between the observed precursor and the Y ion, then the CID fragmentation provides information on the glycan(s) composition. Resonance activation CID in ion trap yields B and Y fragments via single bond cleavages, identifying terminal groups, thus providing knowledge about branching in the oligosaccharide(s). 2,3 Beam- type CID (HCD) can produce multiple bond cleavages, and thus provides some information on the direct connection of certain sugar units in form of internal oxonium ions. 3,4 Analysis of complex mixtures of glycoproteins is currently mostly performed by enzymatic release of glycan species then analysis of peptides and glycans separately. However, intact glycopeptide analysis for these types of samples is starting to become more common due to the development of improved fragmentation methods in mass spectrometers that allow formation of fragments from both peptide and glycan compo- nents in the same spectrum, namely EThcD and stepped HCD fragmentation. 5–7 A few datasets identifying hundreds of unique glycopeptides have now been published, although there are still challenges with controlling the reliability of results from software doing these analyses. 5–9 In this work we sought to investigate software strategies that make use of small Y ions to (a) increase the number of glycopeptide identifications; and (b) improve the reliability of reported results in analyses of complex glycopeptide datasets. Using Y ions to try to identify glycopeptides from CID-type fragmentation spectra is not novel. However, using EThcD data to derive the list of glycosylated peptides for querying HCD data is a new approach. Prior strategies have all employed PNGase F to deglycosylate a fraction of the sample, then analyzed this to derive a list of potential glycopeptides to consider in the intact glycopeptide analysis. Some researchers additionally performed glycomic analysis to derive a list of glycosylations to consider, 10,11 whereas others have used a database of known glycans as a reference. 12 These approaches require significantly more sample handling and acquisition and can only be used for N-glycosylation analysis. We developed new features in the MS-Filter tool within Protein Prospector, then tested these on intact O-linked and N-linked glycopeptide datasets to see the effect of different filtering para- meters on the resulting glycopeptide results. a Department of Pharmaceutical Chemistry, School of Pharmacy, University of California San Francisco, USA. E-mail: chalkley@cgl.ucsf.edu b Laboratory of Proteomics Research, Biological Research Centre, Temesvari krt. 62, H-6726 Szeged, Hungary c Doctoral School in Biology, Faculty of Science and Informatics, University of Szeged, Kozep fasor 52, H-6726 Szeged, Hungary Electronic supplementary information (ESI) available. See DOI: 10.1039/ c9mo00178f Received 9th December 2019, Accepted 6th February 2020 DOI: 10.1039/c9mo00178f rsc.li/molomics Molecular Omics RESEARCH ARTICLE Published on 06 February 2020. Downloaded on 9/29/2021 8:28:46 AM. View Article Online View Journal | View Issue