Data imputation and body weight variability calculation using linear and nonlinear methods in data collected from digital smart scales: Simulation and validation study

Jake Turicchi*, Ruairi O'Driscoll, Graham Finlayson, Cristiana Duarte, A. L. Palmeira, Sofus C. Larsen, Berit L. Heitmann, R. James Stubbs

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

14 Citations (Scopus)
23 Downloads (Pure)

Abstract

Background: Body weight variability (BWV) is common in the general population and may act as a risk factor for obesity or diseases. The correct identification of these patterns may have prognostic or predictive value in clinical and research settings. With advancements in technology allowing for the frequent collection of body weight data from electronic smart scales, new opportunities to analyze and identify patterns in body weight data are available. Objective: This study aims to compare multiple methods of data imputation and BWV calculation using linear and nonlinear approaches Methods: In total, 50 participants from an ongoing weight loss maintenance study (the NoHoW study) were selected to develop the procedure. We addressed the following aspects of data analysis: cleaning, imputation, detrending, and calculation of total and local BWV. To test imputation, missing data were simulated at random and using real patterns of missingness. A total of 10 imputation strategies were tested. Next, BWV was calculated using linear and nonlinear approaches, and the effects of missing data and data imputation on these estimates were investigated. Results: Body weight imputation using structural modeling with Kalman smoothing or an exponentially weighted moving average provided the best agreement with observed values (root mean square error range 0.62%-0.64%). Imputation performance decreased with missingness and was similar between random and nonrandom simulations. Errors in BWV estimations from missing simulated data sets were low (2%-7% with 80% missing data or a mean of 67, SD 40.1 available body weights) compared with that of imputation strategies where errors were significantly greater, varying by imputation method. Conclusions: The decision to impute body weight data depends on the purpose of the analysis. Directions for the best performing imputation methods are provided. For the purpose of estimating BWV, data imputation should not be conducted. Linear and nonlinear methods of estimating BWV provide reasonably accurate estimates under high proportions (80%) of missing data.

Original languageEnglish
Article numbere17977
JournalJMIR mHealth and uHealth
Volume8
Issue number9
ISSN2291-5222
DOIs
Publication statusPublished - 2020

Keywords

  • Body weight
  • Digital tracking
  • Energy balance
  • Imputation
  • Smart scales
  • Validation
  • Weight cycling
  • Weight fluctuation
  • Weight instability
  • Weight variability

Cite this