Community evaluation of glycoproteomics informatics solutions reveals high-performance search strategies for serum glycopeptide analysis

Rebeca Kawahara, Anastasia Chernykh, Kathirvel Alagesan, Marshall Bern, Weiqian Cao, Robert J. Chalkley, Kai Cheng, Matthew S. Choo, Nathan Edwards, Radoslav Goldman, Marcus Hoffmann, Yingwei Hu, Yifan Huang, Jin Young Kim, Doron Kletter, Benoit Liquet, Mingqi Liu, Yehia Mechref, Bo Meng, Sriram NeelameghamTerry Nguyen-Khuong, Jonas Nilsson, Adam Pap, Gun Wook Park, Benjamin L. Parker, Cassandra L. Pegg, Josef M. Penninger, Toan K. Phung, Markus Pioch, Erdmann Rapp, Enes Sakalli, Miloslav Sanda, Benjamin L. Schulz, Nichollas E. Scott, Georgy Sofronov, Johannes Stadlmann, Sergey Y. Vakhrushev, Christina M. Woo, Hung Yi Wu, Pengyuan Yang, Wantao Ying, Hui Zhang, Yong Zhang, Jingfu Zhao, Joseph Zaia, Stuart M. Haslam, Giuseppe Palmisano, Jong Shin Yoo, Göran Larson, Kai Hooi Khoo, Katalin F. Medzihradszky, Daniel Kolarich, Nicolle H. Packer, Morten Thaysen-Andersen*

*Corresponding author af dette arbejde

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

107 Citationer (Scopus)
23 Downloads (Pure)

Abstract

Glycoproteomics is a powerful yet analytically challenging research tool. Software packages aiding the interpretation of complex glycopeptide tandem mass spectra have appeared, but their relative performance remains untested. Conducted through the HUPO Human Glycoproteomics Initiative, this community study, comprising both developers and users of glycoproteomics software, evaluates solutions for system-wide glycopeptide analysis. The same mass spectrometrybased glycoproteomics datasets from human serum were shared with participants and the relative team performance for N- and O-glycopeptide data analysis was comprehensively established by orthogonal performance tests. Although the results were variable, several high-performance glycoproteomics informatics strategies were identified. Deep analysis of the data revealed key performance-associated search parameters and led to recommendations for improved ‘high-coverage’ and ‘high-accuracy’ glycoproteomics search solutions. This study concludes that diverse software packages for comprehensive glycopeptide data analysis exist, points to several high-performance search strategies and specifies key variables that will guide future software developments and assist informatics decision-making in glycoproteomics.

OriginalsprogEngelsk
TidsskriftNature Methods
Vol/bind18
Udgave nummer11
Sider (fra-til)1304-1316
Antal sider13
ISSN1548-7091
DOI
StatusUdgivet - 2021

Bibliografisk note

Funding Information:
Rosa Viner and Sergei Snovida (Thermo Fisher Scientific) are thanked for providing high-quality LC-MS/MS data. Krishnatej Nishtala is thanked for aiding the data analysis. Catherine Hayes, Julien Mariethoz and Frederique Lisacek are thanked for informatics assistance. RK was supported by an Early Career Fellowship (Cancer Institute NSW ECF181259). DK was supported by an Australian Research Council Future Fellowship (FT160100344). GP was funded by FAPESP (n° 2018/15549-1). MT-A was supported by a Macquarie University Safety Net Grant. DK and NHP were supported by the Australian Research Council Centre of Excellence in Nanoscale Biophotonics (CE140100003).

Publisher Copyright:
© 2021, The Author(s).

Citationsformater