To improve prediction performance and reduce artifacts in Raman spectra, we developed an eXtreme Gradient Boosting (XGBoost) preprocessing method to preprocess the Raman spectra of glucose, glycerol and ethanol mixtures. To ensure the robustness and reliability of the XGBoost preprocessing method, an X-LR model (which combined XGBoost preprocessing and a linear regression (LR) model) and a X-MLP model (which combined XGBoost preprocessing and a multilayer perceptron (MLP) model) were developed. These two models were used to quantitatively analyze the concentrations of glucose, glycerol and ethanol in the Raman spectra of mixed solutions. The proportion map of hyperparameters was firstly used to narrow down the search range of hyperparameters in the X-LR and the X-MLP models. Then the correlation coefficients (R2), root mean square of calibration (RMSEC), and root mean square error of prediction (RMSEP) were used to evaluate the models' performance. Experimental results indicated that the XGBoost preprocessing method achieved higher accuracy and generalization capability, with better performance than those of other preprocessing methods for both LR and MLP models.
XGBoost algorithm assisted multi-component quantitative analysis with Raman spectroscopy.
Qiaoyun Wang,Xin Zou,Yinji Chen,Ziheng Zhu,Chongyue Yan,Peng Shan,Shuyu Wang,Yongqing Fu
Published 2024 in Spectrochimica Acta Part A - Molecular and Biomolecular Spectroscopy
ABSTRACT
PUBLICATION RECORD
- Publication year
2024
- Venue
Spectrochimica Acta Part A - Molecular and Biomolecular Spectroscopy
- Publication date
2024-07-31
- Fields of study
Medicine, Materials Science, Chemistry
- Identifiers
- External record
- Source metadata
Semantic Scholar, PubMed
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-42 of 42 references · Page 1 of 1
CITED BY
Showing 1-28 of 28 citing papers · Page 1 of 1