STEPWISE GROUP SPARSE REGRESSION (SGSR): GENE-SET-BASED PHARMACOGENOMIC PREDICTIVE MODELS WITH STEPWISE SELECTION OF FUNCTIONAL PRIORS1
文献类型: 会议论文
第一作者: IN SOCK JANG
作者: IN SOCK JANG 1 ; RODRIGO DIENSTMANN 1 ; ADAM A.MARGOLIN 2 ; JUSTIN GUINNEY 1 ;
作者机构: 1.Sage Bionetworks 1100 Fairview Ave.N Seattle, WA 98109, USA
2.Oregon Health & Science University 3181 S.W.Sam Jackson Park Rd, Portland, OR 97239, USA
关键词: STEPWISE;SPARSE;PHARMACOGENOMIC
会议名称: Pacific Symposium on Biocomputing
主办单位:
页码: 32-43
摘要: Complex mechanisms involving genomic aberrations in numerous proteins and pathways are believed to be a key cause of many diseases such as cancer. With recent advances in genomics, elucidating the molecular basis of cancer at a patient level is now feasible, and has led to personalized treatment strategies whereby a patient is treated according to his or her genomic profile. However, there is growing recognition that existing treatment modalities are overly simplistic,and do not fully account for the deep genomic complexity associated with sensitivity or resistance to cancer therapies. To overcome these limitations, large-scale pharmacogenomic screens of cancer cell lines - in conjunction with modern statistical learning approaches - have been used to explore the genetic underpinnings of drug response. While these analyses have demonstrated the ability to infer genetic predictors of compound sensitivity, to date most modeling approaches have been data-driven, i.e. they do not explicitly incorporate domain-specific knowledge (priors) in the process of learning a model. While a purely data-driven approach offers an unbiased perspective of the data - and may yield unexpected or novel insights - this strategy introduces challenges for both model interpretability and accuracy. In this study, we propose a novel prior-incorporated sparse regression model in which the choice of informative predictor sets is carried out by knowledge-driven priors (gene sets) in a stepwise fashion. Under regularization in a linear regression model, our algorithm is able to incorporate prior biological knowledge across the predictive variables thereby improving the interpretability of the final model with no loss - and often an improvement - in predictive performance. We evaluate the performance of our algorithm compared to well-known regularization methods such as LASSO, Ridge and Elastic net regression in the Cancer Cell Line Encyclopedia (CCLE) and Genomics of Drug Sensitivity in Cancer (Sanger) pharmacogenomics datasets, demonstrating that incorporation of the biological priors selected by our model confers improved predictability and interpretability, despite much fewer predictors, over existing state-of-the-art methods.
分类号: Q811
- 相关文献
作者其他论文 更多>>
-
PERSONALIZED HYPOTHESIS TESTS FOR DETECTING MEDICATION RESPONSE IN PARKINSON DISEASE PATIENTS USING iPHONE SENSOR DATA
作者:ELIAS CHAIBUB NETO;BRIAN M. BOT;THANNEER PERUMAL;LARSSON OMBERG;JUSTIN GUINNEY;MIKE KELLEN;ARNO KLEIN;STEPHEN H. FRIEND;ANDREW D. TRISTER
关键词:personalized medicine;hypothesis tests;sensor data;remote monitoring;Parkinson
-
THE STREAM ALGORITHM: COMPUTATIONALLY EFFICIENT RIDGE-REGRESSION VIA BAYESIAN MODEL AVERAGING, AND APPLICATIONS TO PHARMACOGENOMIC PREDICTION OF CANCER CELL LINE SENSITIVITY
作者:IN SOCK JANG;STEPHEN H. FRIEND;ADAM A. MARGOLIN;ELIAS CHAIBUB NETO
关键词:ridge-regression;Bayesian model averaging;predictive modeling;machine learning;cancer cell lines;pharmacogenomic screens
-
SYSTEMATIC ASSESSMENT OF ANALYTICAL METHODS FOR DRUG SENSITIVITY PREDICTION FROM CANCER CELL LINE DATA
作者:IN SOCK JANG;ELIAS CHAIBUB NETO;JUSTIN GUINNEY;STEPHEN H. FRIEND;ADAM A. MARGOLIN
关键词:Cancer cell lines;pharmacogenomics;machine learning;predictive modeling