Assessment of subvisible particles in biopharmaceuticals with image feature extraction and machine learning

Ravi Maharjan*, Jae Chul Lee*, Johan Peter Bøtker*, Ki Hyun Kim*, Nam Ah Kim*, Seong Hoon Jeong*, Jukka Rantanen*

*Corresponding author af dette arbejde

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

1 Citationer (Scopus)

Abstract

An image classification tool was developed to classify subvisible particles, namely silicone oil (SO) and non-silicone oil (NSO; protein aggregate, rubber closure, and air bubble) particles, present in biopharmaceuticals using feature extraction in FlowCam® images, and the outcomes were validated with six machine learning (ML) classifiers. The image classification tool set at specific configuration: condition 1 − CDEP {(compactness, diameter, elongation, perimeter) = (8.06, 10.44, 13.30, 27.20)} identified SO, while the configuration set at condition 2 – CDEL {(compactness, diameter, elongation, length) = (3.14, 24.48, 1.97, 4.28)} detected NSO. The classification tool was particularly useful in detecting the release of SO after exposure to the stress sources. Additionally, the morphological features-based classification tool (p < 0.05) enhanced the predictive accuracy of the ML classification tools (≥97.2 %). Specifically, CNN (100 %) outperformed naïve Bayes (99.3 %), linear discriminant analysis (98.4 %), artificial neural network (98.1 %), support vector machine (SVM 97.2 %). Bootstrap forest was excluded because it failed to classify SO in a large dataset. The developed classification tool could be an alternative in classifying the image datasets without the burden of complex ML tools. Such image-based classification tool can be computationally economical solution in quality control of the protein formulations.

OriginalsprogEngelsk
Artikelnummer105061
TidsskriftChemometrics and Intelligent Laboratory Systems
Vol/bind245
Antal sider10
ISSN0169-7439
DOI
StatusUdgivet - 2024

Bibliografisk note

Funding Information:
This work was partially supported by the National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIT) ( NRF-2018R1A5A2023127 and NRF-2019R1A2C1083911) . It was also partially supported by the Dongguk University Research Fund of 2023 ( S-2023-G0001-00099 ).

Publisher Copyright:
© 2024 Elsevier B.V.

Citationsformater