Abstract
Biomarkers are of central importance for assessing the health state and to guide medical interventions and their efficacy; still, they are lacking for most diseases. Mass spectrometry (MS)-based proteomics is a powerful technology for biomarker discovery but requires sophisticated bioinformatics to identify robust patterns. Machine learning (ML) has become a promising tool for this purpose. However, it is sometimes applied in an opaque manner and generally requires specialized knowledge. To enable easy access to ML for biomarker discovery without any programming or bioinformatics skills, we developed "OmicLearn"(http://OmicLearn.org), an open-source browser-based ML tool using the latest advances in the Python ML ecosystem. Data matrices from omics experiments are easily uploaded to an online or a locally installed web server. OmicLearn enables rapid exploration of the suitability of various ML algorithms for the experimental data sets. It fosters open science via transparent assessment of state-of-the-art algorithms in a standardized format for proteomics and other omics sciences.
Originalsprog | Engelsk |
---|---|
Tidsskrift | Journal of Proteome Research |
Vol/bind | 22 |
Udgave nummer | 2 |
Sider (fra-til) | 359-367 |
ISSN | 1535-3893 |
DOI | |
Status | Udgivet - 2023 |
Bibliografisk note
Funding Information:The work carried out in this project was funded by OmicEra Diagnostics GmbH and partially supported by the German Federal Ministry of Education and Research (BMBF) project ProDiag (Grant No. 01KI20377B) and the Michael J. Fox Foundation MJFF-019273. M.T.S. is supported financially by the Novo Nordisk Foundation (Grant Agreement NNF14CC0001). We thank Halil I. Bilgin for helpful discussions on OmicLearn. We thank our former colleagues at the Max Planck Institute of Biochemistry, Jakob M. Bader and Ozge Karayel, for initial discussions related to machine learning.
Publisher Copyright:
© 2022 The Authors. Published by American Chemical Society.