Abstract Using proper diesel brand is the key to ensure the normal operation of diesel engine. It is even more important to identify the brands of diesel oil effectively. This paper presented a new model of near infrared spectroscopy (NIRS) identification of diesel oil brands that combined Tree-based feature selection, Synthetic Minority Oversampling Technique (SMOTE) and Extreme Gradient Boosting (XGBoost) ensemble learning in order to achieve the goal of high accuracy and rapidity. To further demonstrate the practical effect of the proposed ensemble method, it was compared with a single decision tree (DT) classifier based on classification and regression tree (CART) algorithm. As a result, the recognition rate of Tree-SMOTE-XGBoost model proposed in this paper was 19.33% higher than that of XGBoost model, and 9.25% higher than that of Tree-SMOTE-DT model. More importantly, it can ensure the accuracy of each class under the premise of serious imbalance of classes. The proposed method saves manpower and material resources, and provides a new alternative approach for diesel brands identification.
A new method of diesel fuel brands identification: SMOTE oversampling combined with XGBoost ensemble learning
Shutao Wang,Shiyu Liu,Jingkun Zhang,Xiange Che,Yuanyuan Yuan,Zhifang Wang,D. Kong
Published 2020 in Fuel
ABSTRACT
PUBLICATION RECORD
- Publication year
2020
- Venue
Fuel
- Publication date
2020-12-15
- Fields of study
Materials Science, Computer Science, Engineering
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-36 of 36 references · Page 1 of 1
CITED BY
Showing 1-58 of 58 citing papers · Page 1 of 1