Date: 2013-02-01
Time: 14:30-15:30
Location: BURN 1205
Abstract:
I will consider the task of estimating high-dimensional Gaussian graphical models (or networks) corresponding to a single set of features under several distinct conditions. In other words, I wish to estimate several distinct but related networks. I assume that most aspects of the networks are shared, but that there are some structured differences between them. The goal is to exploit the similarity among the networks in order to obtain more accurate estimates of each individual network, as well as to identify the differences between the networks.
To begin, I will assume that network differences arise from edge perturbations. In this case, estimating the networks by maximizing the log likelihood subject to fused lasso or group lasso penalties on the differences between the precision matrices can lead to very good results. Next, I will discuss a more structured type of network difference that arises from node (rather than edge) perturbations. In order to estimate networks in this setting, I will present the “row-column overlap norm penalty”, a type of overlapping group lasso penalty.
Finally, I will present an application of these network estimation techniques to a gene expression data set, in which the goal is to identify genes whose regulatory patterns are perturbed across various subtypes of brain cancer.
This is joint work with Pei Wang, Su-In Lee, Maryam Fazel, and others.
Speaker
Daniela Witten is an Assistant Professor of Biostatistics at the University of Washington.