Three-step hybrid strategy towards efficiently selecting variables in multivariate calibration of near-infrared spectra
文献类型: 外文期刊
第一作者: Yu, Hai-Dong
作者: Yu, Hai-Dong;Yun, Yong-Huan;Zhang, Weimin;Chen, Haiming;Liu, Dongli;Zhong, Qiuping;Chen, Wenxue;Chen, Weijun;Yun, Yong-Huan
作者机构:
关键词: Variable selection; Near-infrared spectra; Multivariate calibration; Hybrid strategy; Variable space
期刊名称:SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY ( 影响因子:4.098; 五年影响因子:3.464 )
ISSN: 1386-1425
年卷期: 2020 年 224 卷
页码:
收录情况: SCI
摘要: Variable (feature or wavelength) selection is a critical step in multivariate calibration of near-infrared (NIR) spectra. The high-resolution NIR or its imaging instruments usually generate hundreds or thousands of wavelengths, which make the variable selection methods tend to appear a high risk of over-fitting, low efficiency, or requiring large computational abilities. Thus, it is a great challenge to efficiently select informative variables and obtain an optimal variable combination in a huge variable space. We propose a hybrid strategy for efficiently selecting variables based on three steps including rough selection, fine selection and optimal selection. The strong interpretability method like wavelength interval selection method (interval partial least squares, iPLS) was first used to roughly select informative intervals and shrink the variable space. Wavelength point selection methods such as variable importance in projection (VIP) and modified variable combination population analysis (mVCPA) were used to continuingly shrink the variable space from large to small in order to remain the very important variables. In the third step, applying some optimization methods such as iteratively retaining informative variables (IRIV) and genetic algorithm (GA) is to find an optimal variable combination from the remaining variables. It makes full use of the advantages of various involved methods and makes up for their disadvantages when facing high dimensional data. Two NIR datasets were employed to investigate the performance of the three-step hybrid strategy. It can significantly improve the prediction performance of the models built when compared with other single or hybrid methods (iPLS, VIP, iPLS-VIP, iPLS-VCPA, iPLS-mVCPA, VIP-GA, VIP-IRIV, mVCPA-GA, mVCPA-IRIV), indicating that the three-step hybrid strategy, including iPLS-VIP-IRIV, iPLS-VIP-GA, iPLS-mVCPA-GA and iPLS-mVCPA-IRIV, could efficiently select informative variables. Therefore, the three-step hybrid strategy is a good alternative for variable selection methods in the face of high dimensional NIR spectral data. (c) 2019 Elsevier B.V. All rights reserved.
分类号:
- 相关文献
作者其他论文 更多>>
-
Large-scale genomic rearrangements boost SCRaMbLE in Saccharomyces cerevisiae
作者:Cheng, Li;Zhao, Shijun;Li, Tianyi;Hou, Sha;Luo, Zhouqing;Yu, Wenfei;Jiang, Shuangying;Ma, Yingxin;Cai, Yizhi;Dai, Junbiao;Zhao, Shijun;Yu, Wenfei;Dai, Junbiao;Luo, Zhouqing;Xu, Jinsheng;Monti, Marco;Schindler, Daniel;Cai, Yizhi;Zhang, Weimin;Boeke, Jef D.;Zhang, Weimin;Boeke, Jef D.;Hou, Chunhui;Boeke, Jef D.;Dai, Junbiao;Dai, Junbiao;Li, Tianyi
关键词:
-
Study on multiscale structures and digestibility of cassava starch and medium-chain fatty acids complexes using molecular simulation techniques
作者:Shang, Wenting;Li, Xin;Du, Jinyu;Guo, Yuxin;Fu, Dekun;He, Yanfu;Zhang, Weimin;Shang, Wenting;Li, Xin;Du, Jinyu;Guo, Yuxin;Fu, Dekun;He, Yanfu;Zhang, Weimin;Shang, Wenting;Li, Xin;Du, Jinyu;Guo, Yuxin;Fu, Dekun;He, Yanfu;Zhang, Weimin;Pan, Fei;Zhou, Zhongkai
关键词:Cassava starch -fatty acids complexes; Structural characteristics; Starch in vitro digestibility; Molecular simulation
-
Physicochemical properties and oil-water interfacial behavior of subcritical water-treated coconut (Cocos nucifera L.) globulins
作者:Ma, Jingrong;He, Rongrong;Chen, Weijun;Pei, Jianfei;Zhong, Qiuping;Chen, Haiming;Chen, Wenxue;Pan, Chuang
关键词:Coconut globulins; Subcritical water treatment; Conformational flexibility; Interfacial properties; Emulsion stability
-
Molecular interaction of soybean protein and piperine by computational docking analyses
作者:Zhang, Chaohua;Ding, Yunshuang;Wu, Guiping;Gu, Fenglin;Niu, Zhiqiang;He, Zhiliang;Hu, Weicheng;Zhang, Chaohua;Ding, Yunshuang;Gu, Fenglin;Zhang, Chaohua;Ding, Yunshuang;Chen, Weijun;Wu, Guiping;Dong, Conghui;Ye, Zan;Gu, Fenglin;Wu, Haifeng
关键词:Piperine; Soybean protein; Structure; Molecular dynamics simulations; Docking study
-
Partridge tea polyphenols alleviated STZ-induced diabetic nephropathy by regulating Keap1/Nrf2/ARE signaling pathway in C57BL/6 mice
作者:Zhao, Mantong;Meng, Keke;Zhao, Meihui;Shi, Haohao;Liu, Zhongyuan;Yun, Yonghuan;Zhang, Weimin;Xia, Guanghua;Duan, Zhouwei
关键词:PTP; Diabetic nephropathy; Oxidative stress; Keap1/Nrf2/ARE signal pathway
-
Structural variations of a new fertility restorer gene, Rf20, , underlie the restoration of wild abortive-type cytoplasmic male sterility in rice
作者:Song, Shufeng;Li, Yixing;Qiu, Mudan;Xu, Na;Li, Bin;Chen, Weijun;Wang, Tiankang;Yu, Dong;Pan, Yi;Yuan, Dingyang;Li, Li;Qiu, Mudan;Gong, Mengmeng;Xia, Siqi;Xu, Na;Zhang, Longhui;Li, Lei;Li, Jinglei;Qiu, Yingxin;Dong, Hao;Yuan, Dingyang;Li, Li
关键词:COX11; cytoplasmic male sterility; rf20; WA352; wild abortion
-
Characterization of methyltetrahydrophthalic anhydride esterified corn starch and their ability in stabilizing Pickering emulsion
作者:Zhong, Yang;Yang, Mingxing;Liu, Dayu;Liu, Wenlong;Chen, Weijun;Lin, Yi;Zeng, Xiaodan
关键词:Methyltetrahydrophthalic anhydride; Esterified starch; Pickering emulsion; Stability