Three-step hybrid strategy towards efficiently selecting variables in multivariate calibration of near-infrared spectra
文献类型: 外文期刊
作者: Yu, Hai-Dong 1 ; Yun, Yong-Huan 1 ; Zhang, Weimin 1 ; Chen, Haiming 1 ; Liu, Dongli 1 ; Zhong, Qiuping 1 ; Chen, Wenxu 1 ;
作者机构: 1.Hainan Univ, Coll Food Sci & Engn, 58 Renmin Rd, Haikou 570228, Hainan, Peoples R China
2.Chinese Acad Trop Agr Sci, Inst Environm & Plant Protect, Haikou 571101, Hainan, Peoples R China
关键词: Variable selection; Near-infrared spectra; Multivariate calibration; Hybrid strategy; Variable space
期刊名称:SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY ( 影响因子:4.098; 五年影响因子:3.464 )
ISSN: 1386-1425
年卷期: 2020 年 224 卷
页码:
收录情况: SCI
摘要: Variable (feature or wavelength) selection is a critical step in multivariate calibration of near-infrared (NIR) spectra. The high-resolution NIR or its imaging instruments usually generate hundreds or thousands of wavelengths, which make the variable selection methods tend to appear a high risk of over-fitting, low efficiency, or requiring large computational abilities. Thus, it is a great challenge to efficiently select informative variables and obtain an optimal variable combination in a huge variable space. We propose a hybrid strategy for efficiently selecting variables based on three steps including rough selection, fine selection and optimal selection. The strong interpretability method like wavelength interval selection method (interval partial least squares, iPLS) was first used to roughly select informative intervals and shrink the variable space. Wavelength point selection methods such as variable importance in projection (VIP) and modified variable combination population analysis (mVCPA) were used to continuingly shrink the variable space from large to small in order to remain the very important variables. In the third step, applying some optimization methods such as iteratively retaining informative variables (IRIV) and genetic algorithm (GA) is to find an optimal variable combination from the remaining variables. It makes full use of the advantages of various involved methods and makes up for their disadvantages when facing high dimensional data. Two NIR datasets were employed to investigate the performance of the three-step hybrid strategy. It can significantly improve the prediction performance of the models built when compared with other single or hybrid methods (iPLS, VIP, iPLS-VIP, iPLS-VCPA, iPLS-mVCPA, VIP-GA, VIP-IRIV, mVCPA-GA, mVCPA-IRIV), indicating that the three-step hybrid strategy, including iPLS-VIP-IRIV, iPLS-VIP-GA, iPLS-mVCPA-GA and iPLS-mVCPA-IRIV, could efficiently select informative variables. Therefore, the three-step hybrid strategy is a good alternative for variable selection methods in the face of high dimensional NIR spectral data. (c) 2019 Elsevier B.V. All rights reserved.
- 相关文献
作者其他论文 更多>>
-
Molecular interaction of soybean protein and piperine by computational docking analyses
作者:Zhang, Chaohua;Ding, Yunshuang;Wu, Guiping;Gu, Fenglin;Niu, Zhiqiang;He, Zhiliang;Hu, Weicheng;Zhang, Chaohua;Ding, Yunshuang;Gu, Fenglin;Zhang, Chaohua;Ding, Yunshuang;Chen, Weijun;Wu, Guiping;Dong, Conghui;Ye, Zan;Gu, Fenglin;Wu, Haifeng
关键词:Piperine; Soybean protein; Structure; Molecular dynamics simulations; Docking study
-
Mechanism of membrane damage to Shigella sonnei by linalool from plant essential oils: A driver of oxidative stress
作者:He, Rongrong;Wu, Hao;Liu, Jicai;Chen, Wenxue;Chen, Weijun;Chen, Haiming;Zhong, Qiuping;Zhang, Ming;Chen, Wenxue;Gu, Fenglin;Gu, Fenglin
关键词:Linalool; Shigella sonnei; Membrane damage; Oxidative stress; Enzyme activity
-
Active Navigation System for a Rubber-Tapping Robot Based on Trunk Detection
作者:Fang, Jiahao;Zhang, Weimin;Fang, Jiahao;Sun, Yao;Shi, Yongliang;Cao, Jianhua
关键词:active navigation; pose tracking; factor graph; trunk detection; hybrid map
-
Potential of Near-Infrared Spectroscopy (NIRS) for Efficient Classification Based on Postharvest Storage Time, Cultivar and Maturity in Coconut Water
作者:Shen, Xiaojun;Li, Xin;Deng, Fuming;Niu, Xiaoqing;Wang, Yuanyuan;Kan, Jintao;Shen, Xiaojun;Wei, Jingyi;Chen, Fusheng;Wang, Tao;Zhang, Weimin;Yun, Yong-Huan;Niu, Xiaoqing;Yun, Yong-Huan
关键词:Cocos nucifera L; liquid endosperm; dwarfs; non-destructive analysis; discrimination
-
Effect of extraction technique on chemical compositions and antioxidant activities of freeze-dried green pepper
作者:Zhang, Chaohua;Chen, Weijun;Zhang, Chaohua;Gu, Fenglin;Zhang, Chaohua;Gu, Fenglin;Wu, Guiping;Dong, Conghui;Gu, Fenglin;Gu, Fenglin;Wu, Guiping;Hu, Weicheng;Niu, Zhiqiang
关键词:freeze-dried pepper; pepper oleoresin; ultrasonic-microwave extraction; antioxidant activity; extraction kinetics
-
Antibacterial mechanism of linalool emulsion against Pseudomonas aeruginosa and its application to cold fresh beef
作者:He, Rongrong;Zhang, Zhengke;Xu, Lilan;Chen, Weijun;Zhang, Ming;Zhong, Qiuping;Chen, Haiming;Chen, Wenxue;Chen, Wenxue
关键词:Linalool emulsion; Pseudomonas aeruginosa; Antibacterial; Molecular docking; Cold fresh beef
-
Revealing informative metabolites with random variable combination based on model population analysis for metabolomics data
作者:Yun, Yong-Huan;Zhang, Jiachao;Chen, Haiming;Chen, Wenxue;Zhong, Qiuping;Zhang, Weimin;Chen, Weijun;Yun, Yong-Huan
关键词:Metabolomics; Variable selection; Biomarker discovery; Informative metabolites; Variable combination; Model population analysis