Date: 2023-03-10

Time: 15:30-16:30 (Montreal time)

Hybrid: In person / Zoom

Location: Burnside Hall 1104

https://mcgill.zoom.us/j/83436686293?pwd=b0RmWmlXRXE3OWR6NlNIcWF5d0dJQT09

Meeting ID: 834 3668 6293

Passcode: 12345

Abstract:

Principal component analysis (PCA) is one of the most commonly used techniques for dimension reduction and feature extraction. Though it has been well-studied for high-dimensional sparse PCA, little is known when the noise is heteroskedastic, which turns out to be ubiquitous in many scenarios, like biological sequencing data and information network data. We propose an iterative algorithm for sparse PCA in the presence of heteroskedastic noise, which alternatively updates the estimates of the sparse eigenvectors using the power method with adaptive thresholding in one step, and imputes the diagonal values of the sample covariance matrix to reduce the estimation bias due to heteroskedasticity in the other step. Our procedure is computationally fast and provably optimal under the generalized spiked covariance model, assuming the leading eigenvectors are sparse. A comprehensive simulation study demonstrates its robustness and effectiveness in various settings.

Speaker

Dr. Zhao Ren is an Associate Professor of Statistics in Dietrich School of Arts and Sciences at the University of Pittsburgh. His research focuses on high dimensional statistical inference, graphical models and statistical machine learning, nonparametric function estimation, and applications in Statistical Genomics.