Fitting Penalized Estimator for Sparse Covariance Matrix with Left-Censored Data by the EM Algorithm

Shanyi Lin, Qian-Zhen Zheng, Laixu Shang, Ping-Feng Xu, Man-Lai Tang

Research output: Contribution to journalArticlepeer-review

1 Downloads (Pure)

Abstract

Estimating the sparse covariance matrix can effectively identify important features and patterns, and traditional estimation methods require complete data vectors on all subjects. When data are left-censored due to detection limits, common strategies such as excluding censored individuals or replacing censored values with suitable constants may result in large biases. In this paper, we propose two penalized log-likelihood estimators, incorporating the (Formula presented.) penalty and SCAD penalty, for estimating the sparse covariance matrix of a multivariate normal distribution in the presence of left-censored data. However, the fitting of these penalized estimators poses challenges due to the observed log-likelihood involving high-dimensional integration over the censored variables. To address this issue, we treat censored data as a special case of incomplete data and employ the Expectation Maximization algorithm combined with the coordinate descent algorithm to efficiently fit the two penalized estimators. Through simulation studies, we demonstrate that both penalized estimators achieve greater estimation accuracy compared to methods that replace censored values with constants. Moreover, the SCAD penalized estimator generally outperforms the (Formula presented.) penalized estimator. Our method is used to analyze the proteomic datasets.

Original languageEnglish
Article number423
Pages (from-to)1-17
Number of pages17
JournalMathematics
Volume13
Issue number3
Early online date27 Jan 2025
DOIs
Publication statusPublished - 28 Feb 2025

Keywords

  • 62-08
  • penalized estimator
  • left-censored data
  • 62H12
  • Expectation Maximization algorithm
  • sparse covariance matrix

Fingerprint

Dive into the research topics of 'Fitting Penalized Estimator for Sparse Covariance Matrix with Left-Censored Data by the EM Algorithm'. Together they form a unique fingerprint.

Cite this