- Series
- Job Candidate Talk
- Time
- Thursday, January 20, 2011 - 3:00pm for 1 hour (actually 50 minutes)
- Location
- Skiles 005
- Speaker
- Jelena Bradic – Princeton University
- Organizer
- Liang Peng
High throughput genetic sequencing arrays with thousands of
measurements
per sample and a great amount of related censored clinical data have
increased demanding need for better measurement specific model
selection.
In this paper we establish strong oracle properties of non-concave
penalized methods for non-polynomial (NP) dimensional data with
censoring in the framework of Cox's proportional hazards model.
A class of folded-concave penalties are employed and both LASSO and
SCAD are discussed specifically. We unveil the question under which
dimensionality and correlation
restrictions can an oracle estimator be constructed and grasped. It is
demonstrated that non-concave penalties lead to significant reduction
of the "irrepresentable condition" needed for LASSO model selection
consistency.
The large deviation result for martingales, bearing interests of its
own, is developed for characterizing the strong oracle property.
Moreover, the non-concave regularized estimator, is shown to achieve
asymptotically the information bound of the oracle estimator. A
coordinate-wise algorithm is developed for finding the grid of
solution paths for penalized hazard regression problems, and its
performance is evaluated on simulated and gene association study
examples.