Challenge of missing data in observational studies: investigating cross-sectional imputation methods for assessing disease activity in axial spondyloarthritis

Stylianos Georgiadis*, Marion Pons, Simon Rasmussen, Merete Lund Hetland, Louise Linde, Daniela di Giuseppe, Brigitte Michelsen, Johan K. Wallman, Tor Olofsson, Jakub Zavada, Bente Glintborg, Anne G. Loft, Catalin Codreanu, Daniel Melim, Diogo Almeida, Sella Aarrestad Provan, Tore K. Kvien, Vappu Rantalaiho, Ritva Peltomaa, Bjorn GudbjornssonOlafur Palsson, Ovidiu Rotariu, Ross MacDonald, Ziga Rotar, Katja Perdan Pirkmajer, Karin Lass, Florenzo Iannone, Adrian Ciurea, Mikkel Østergaard, L. M. Ørnbjerg

*Corresponding author af dette arbejde

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Abstract

Objectives We aimed to compare various methods for imputing disease activity in longitudinally collected observational data of patients with axial spondyloarthritis (axSpA).

Methods We conducted a simulation study on data from 8583 axSpA patients from ten European registries. Disease activity was assessed by the Axial Spondyloarthritis Disease Activity Score (ASDAS) and the corresponding low disease activity (LDA; ASDAS<2.1) state at baseline, 6 and 12 months. We focused on cross-sectional methods which impute missing values of an individual at a particular time point based on the available information from other individuals at that time point. We applied nine single and five multiple imputation methods, covering mean, regression and hot deck methods. The performance of each imputation method was evaluated via relative bias and coverage of 95% confidence intervals for the mean ASDAS and the derived proportion of patients in LDA.

Results Hot deck imputation methods outperformed mean and regression methods, particularly when assessing LDA. Multiple imputation procedures provided better coverage than the corresponding single imputation ones. However, none of the evaluated methods produced unbiased estimates with adequate coverage across all time points, with performance for missing baseline data being worse than for missing follow-up data. Predictive mean and weighted predictive mean hot deck imputation procedures consistently provided results with low bias.

Conclusions This study contributes to the available methods for imputing disease activity in observational research. Hot deck imputation using predictive mean matching exhibited the highest robustness and is thus our suggested approach.
OriginalsprogEngelsk
Artikelnummere004844
TidsskriftRMD Open
Vol/bind11
Udgave nummer1
Antal sider14
ISSN2056-5933
DOI
StatusUdgivet - 2025

Bibliografisk note

Funding Information:
The EuroSpA collaboration has been supported by Novartis Pharma AG since 2017 and UCB Biopharma SRL since 2022. This EuroSpA study was financially supported by UCB. No financial sponsors had any influence on the data collection, statistical analyses, manuscript preparation or decision to submit. This work was supported by UCB. UCB had no influence on the data collection, statistical analyses, manuscript preparation or decision to submit the manuscript.

Funding Information:
This work was supported by UCB. UCB had no influence on the data collection, statistical analyses, manuscript preparation or decision to submit the manuscript.

Publisher Copyright:
© Author(s) (or their employer(s)) 2025.

Citationsformater