THE STREAM ALGORITHM: COMPUTATIONALLY EFFICIENT RIDGE-REGRESSION VIA BAYESIAN MODEL AVERAGING, AND APPLICATIONS TO PHARMACOGENOMIC PREDICTION OF CANCER CELL LINE SENSITIVITY
文献类型: 会议论文
第一作者: IN SOCK JANG
作者: IN SOCK JANG 1 ; STEPHEN H. FRIEND 1 ; ADAM A. MARGOLIN 1 ; ELIAS CHAIBUB NETO 1 ;
作者机构: 1.Sage Bionetworks, 1100 Faiuiiew Avenue North, Seattle, Washington 08109, USA
关键词: ridge-regression;Bayesian model averaging;predictive modeling;machine learning;cancer cell lines;pharmacogenomic screens
会议名称: Pacific Symposium on Biocomputing
主办单位:
页码: 27-38
摘要: Computational efficiency is important for learning algorithms operating in the "large p, small n" setting. In computational biology, the analysis of data sets containing tens of thousands of features ("large p"), but only a few hundred samples ("small n"), is nowadays routine, and regularized regression approaches such as ridge-regression, lasso, and elastic-net are popular choices. In this paper we propose a novel and highly efficient Bayesian inference method for fitting ridge-regression. Our method is fully analytical, and bypasses the need for expensive tuning parameter optimization, via cross-validation, by employing Bayesian model averaging over the grid of tuning parameters. Additional computational efficiency is achieved by adopting the singular value decomposition re-parametrization of the ridge-regression model, replacing computationally expensive inversions of large p×p matrices by efficient inversions of small and diagonal n×n matrices. We show in simulation studies and in the analysis of two large cancer cell line data panels that our algorithm achieves slightly better predictive performance than cross-validated ridge-regression while requiring only a fraction of the computation time. Furthermore, in comparisons based on the cell line data sets, our algorithm systematically out-performs the lasso in both predictive performance and computation time, and shows equivalent predictive performance, but considerably smaller computation time, than the elastic-net.
分类号: q811
- 相关文献
[1]SYSTEMATIC ASSESSMENT OF ANALYTICAL METHODS FOR DRUG SENSITIVITY PREDICTION FROM CANCER CELL LINE DATA. IN SOCK JANG,ELIAS CHAIBUB NETO,JUSTIN GUINNEY,STEPHEN H. FRIEND,ADAM A. MARGOLIN. 2014
[2]Validation of a workflow based on Sentinel-2, Sentinel-1 and meteorological data predicting biomass on pastures. Nickmilder C,Tedde A,Dufrasne I.. 2021
[3]A Machine Learning Approach to Diffusion MRI Partial Volume Estimation. Emmanuel Vallee,Wenchuan Wu,Francesca Galassi,Saad Jbabdi,Stephen Smith. 2018
[4]Machine Learning-Based Fraud Detection System for Insurance Claims in IoT Environment. Bediga Sharan,Mohammad Hassan,V Divya Vani,Vijilius Helena Raj,GinniNijhawan,Priyanka Prabhakar Pawar. 2024
[5]Evaluation of Solar Energy Generation and Radiation Prediction Using Machine Learning. Santosh S Raghuwanshi,Jagrati Gupta,Gourav Shrivastava. 2023
[6]Deployment of models to predict compressed sward height at a large scale: results and feedback. Nickmilder C.,Tedde A,Dufrasne I.. 2022
[7]Prediction of the Risk of Heart Attack Using Machine Learning Techniques. Pinaki Ghosh,Umesh Kumar Lilhore,Sarita Simaiya,Atul Garg,Devendra Prasad,Ajay Kumar. 2022
[8]MGRFormer: A Multimodal Transformer Approach for Surgical Gesture Recognition. Kevin Feghoul,Deise Santana Maia,Mehdi El Amrani,Mohamed Daoudi,Ali Amad. 2024
[9]Blockchain Cloud Computing: Comparative study on DDoS, MITM and SQL Injection Attack. Mamta Swarnkar,Shekh Kulsum Almas,Nagendra Singh,Harsh Pratap Singh,Anuprita Mishra,Anula Khare. 2024
作者其他论文 更多>>
-
Causality-aware counterfactual confounding adjustment as an alternative to linear residualization in anticausal prediction tasks based on linear learners
作者:Elias Chaibub Neto
关键词:
-
PERSONALIZED HYPOTHESIS TESTS FOR DETECTING MEDICATION RESPONSE IN PARKINSON DISEASE PATIENTS USING iPHONE SENSOR DATA
作者:ELIAS CHAIBUB NETO;BRIAN M. BOT;THANNEER PERUMAL;LARSSON OMBERG;JUSTIN GUINNEY;MIKE KELLEN;ARNO KLEIN;STEPHEN H. FRIEND;ANDREW D. TRISTER
关键词:personalized medicine;hypothesis tests;sensor data;remote monitoring;Parkinson
-
STEPWISE GROUP SPARSE REGRESSION (SGSR): GENE-SET-BASED PHARMACOGENOMIC PREDICTIVE MODELS WITH STEPWISE SELECTION OF FUNCTIONAL PRIORS1
作者:IN SOCK JANG;RODRIGO DIENSTMANN;ADAM A.MARGOLIN;JUSTIN GUINNEY
关键词:STEPWISE;SPARSE;PHARMACOGENOMIC
-
Kernelized Bayesian Transfer Learning
作者:Mehmet Gonen;Adam A. Margolin
关键词:
-
SYSTEMATIC ASSESSMENT OF ANALYTICAL METHODS FOR DRUG SENSITIVITY PREDICTION FROM CANCER CELL LINE DATA
作者:IN SOCK JANG;ELIAS CHAIBUB NETO;JUSTIN GUINNEY;STEPHEN H. FRIEND;ADAM A. MARGOLIN
关键词:Cancer cell lines;pharmacogenomics;machine learning;predictive modeling