/tags/2026-winter/index.xml 2026 Winter - McGill Statistics Seminars
  • Survival analysis of extreme events with missing observations

    Date: 2026-01-23

    Time: 15:30-16:30 (Montreal time)

    Location: In person, Burnside 1104

    https://mcgill.zoom.us/j/82195728045

    Meeting ID: 821 9572 8045

    Passcode: None

    Abstract:

    The analysis of extreme wave surge heights in Atlantic Canada is key in determining areas that are subject to flooding or at risk of severe damage from intense storms. One method for modelling extreme events is through the block maxima approach which divides a series of observations into equal-sized blocks to extract the maxima after which inference is conducted on the generalized extreme value (GEV) distribution. When observations at the series level are missing, the observed block maxima may not correspond to the true block maxima. In this presentation, we introduce this missing data problem in the context of an extreme value analysis and explain how concepts from survival analysis can be used to improve inferences on the GEV distribution using the observed block maxima.

  • A General Framework for Testing Clustering Significance and Variable-Level Inference in High-Dimensional Data

    Date: 2026-01-16

    Time: 15:30-16:30 (Montreal time)

    Location: In person, Burnside 1104

    https://mcgill.zoom.us/j/89692052783

    Meeting ID: 896 9205 2783

    Passcode: None

    Abstract:

    Clustering is a fundamental tool for uncovering heterogeneity in data, yet a longstanding challenge lies in assessing whether detected clusters represent genuine structure or arise from sampling variability, and in determining which variables drive the clustering structure. Statistical significance clustering (SigClust; Liu et al. (2008)) addresses the first challenge by testing the cluster index under a Gaussian null, estimating its distribution via Monte Carlo simulation in high dimensions. We propose SigClust-DE, which builds on recent advances in high-dimensional covariance estimation to improve the accuracy of SigClust and extends it to variable-level inference. In particular, SigClust-DE unifies clustering significance testing and differential expression (DE) analysis, a central task in RNA-seq studies. By leveraging the Monte Carlo framework, our method controls type I error while maintaining high power for variable selection. Through extensive simulations and an application to RNA-seq data, we show that SigClust-DE achieves more accurate covariance estimation, effectively controls false discoveries, and substantially improves power in detecting differentially expressed variables, providing a general framework for clustering significance and variable-level inference in high-dimensional data.