Missing-value imputation using the robust singular-value decomposition: Proposals and numerical evaluation

Marisol García-Peña, Sergio Arciniegas-Alarcón, Wojtek J. Krzanowski, Diego Duarte

Producción: Contribución a una revistaArtículorevisión exhaustiva

10 Citas (Scopus)

Resumen

A common problem in the analysis of data from multi-environment trials is imbalance caused by missing observations. To get around this problem, Yan proposed a method for imputing the missing values based on the singular-value decomposition (SVD) of a matrix. However, this SVD can be affected by outliers and produce low quality imputations. In this article, we propose four extensions of the Yan method that are resistant to outliers, replacing the standard SVD method with four robust SVD extensions. We evaluate these methods, using exclusively numerical criteria in a simulation study and in a cross-validation study based on real data. We conclude that in the presence of outliers, the standard SVD method should not be used; instead, the best alternatives are the robust SVD methods based on sub-sampling when the percentage of contamination is less than 2% following a completely random missing data mechanism. In any other case, methods that either minimize the L2 norm or that involve L1 regressions are preferable.

Idioma originalInglés
Páginas (desde-hasta)3288-3300
Número de páginas13
PublicaciónCrop Science
Volumen61
N.º5
DOI
EstadoPublicada - 01 sep. 2021

Huella

Profundice en los temas de investigación de 'Missing-value imputation using the robust singular-value decomposition: Proposals and numerical evaluation'. En conjunto forman una huella única.

Citar esto