ABSTRACT In this paper, a data mining based methodology for process identification from historical data was proposed. Thereon, it considers the phases of process understanding, data collection, data preparation, data modeling, and model evaluation. As some parts of historical data are irrelevant, a data selection step, based on the Gaussian Mixture Model (GMM) clustering algorithm, was considered. Additionally, the methodology includes a data informativity step to study the richness of data. In this regard, the condition number (CN) and the extended CN for ridge regression (RR CN) were used. To evaluate the approach, 2 years of industrial thickener historical data were used. Thereafter, data were prepared and an ARX (Auto-Regressive with eXogenous inputs) model structure was adopted to identify the model. To estimate input delays, Granger causality was used. As for fit criteria, least square regression was tested and compared to ridge regression as a less sensitive method to multicollinearity. The results were then evaluated based on the 20-step ahead prediction and compared to existing methods. In this context, the proposed approach gave the best results with an R 2 of 98.11% and 62.70% for 1 and 20-step ahead predictions, respectively.
A data mining based approach for process identification using historical data
Ridouane Oulhiq,K. Benjelloun,Y. Kali,M. Saad
Published 2021 in International Journal of Modelling and Simulation
ABSTRACT
PUBLICATION RECORD
- Publication year
2021
- Venue
International Journal of Modelling and Simulation
- Publication date
2021-05-03
- Fields of study
Computer Science, Engineering
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-37 of 37 references · Page 1 of 1
CITED BY
Showing 1-7 of 7 citing papers · Page 1 of 1