Missing-value imputation using the robust singular-value decomposition: Proposals and numerical evaluation

Marisol García-Peña, Sergio Arciniegas-Alarcón, Wojtek J. Krzanowski, Diego Duarte

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

A common problem in the analysis of data from multi-environment trials is imbalance caused by missing observations. To get around this problem, Yan proposed a method for imputing the missing values based on the singular-value decomposition (SVD) of a matrix. However, this SVD can be affected by outliers and produce low quality imputations. In this article, we propose four extensions of the Yan method that are resistant to outliers, replacing the standard SVD method with four robust SVD extensions. We evaluate these methods, using exclusively numerical criteria in a simulation study and in a cross-validation study based on real data. We conclude that in the presence of outliers, the standard SVD method should not be used; instead, the best alternatives are the robust SVD methods based on sub-sampling when the percentage of contamination is less than 2% following a completely random missing data mechanism. In any other case, methods that either minimize the L2 norm or that involve L1 regressions are preferable.

Original languageEnglish
Pages (from-to)3288-3300
Number of pages13
JournalCrop Science
Volume61
Issue number5
DOIs
StatePublished - 01 Sep 2021

Fingerprint

Dive into the research topics of 'Missing-value imputation using the robust singular-value decomposition: Proposals and numerical evaluation'. Together they form a unique fingerprint.

Cite this