TY - JOUR
T1 - Electric energy consumption predictions for residential buildings
T2 - Impact of data-driven model and temporal resolution on prediction accuracy
AU - Kim, Jiwon
AU - Kwak, Younghoon
AU - Mun, Sun Hye
AU - Huh, Jung Ho
N1 - Publisher Copyright:
© 2022 Elsevier Ltd
PY - 2022/12/15
Y1 - 2022/12/15
N2 - This study aims to investigate the measurement parameters for predicting the electric energy consumption of residential buildings using a data-driven model. Herein, the temporal resolution of data and algorithms that can improve prediction accuracy are comparatively investigated. For the investigation, the time units of the data collected from the monitoring system of an actual residential building are set as 10 min and 1 h. Further, algorithms such as multiple linear regression (MLR), multilayer perceptron (MLP), support vector machine (SVM), and random forest (RF) are employed to predict the electric energy consumption of the building. The parameters of the data collection include electric energy consumption based on the usage type, occupancy information, and indoor environmental information. The model is validated using a K-fold technique, and the prediction accuracy is compared using R2 and the t-value. Analyses using seven input variables reveal that the prediction accuracy for the 1 h interval data is better than that for the 10 min interval data, based on the temporal resolution of the data. In addition, the results of the algorithms reveal that the prediction accuracy is the highest when the MLR algorithm is used, followed by those when using the RF, MLP, and SVM algorithms. A relatively simple statistical method and low-resolution data rather than a complex machine learning algorithm or high-resolution data achieved the best prediction accuracy. These results are expected to facilitate high-accuracy predictions of the electric energy consumption of residential buildings.
AB - This study aims to investigate the measurement parameters for predicting the electric energy consumption of residential buildings using a data-driven model. Herein, the temporal resolution of data and algorithms that can improve prediction accuracy are comparatively investigated. For the investigation, the time units of the data collected from the monitoring system of an actual residential building are set as 10 min and 1 h. Further, algorithms such as multiple linear regression (MLR), multilayer perceptron (MLP), support vector machine (SVM), and random forest (RF) are employed to predict the electric energy consumption of the building. The parameters of the data collection include electric energy consumption based on the usage type, occupancy information, and indoor environmental information. The model is validated using a K-fold technique, and the prediction accuracy is compared using R2 and the t-value. Analyses using seven input variables reveal that the prediction accuracy for the 1 h interval data is better than that for the 10 min interval data, based on the temporal resolution of the data. In addition, the results of the algorithms reveal that the prediction accuracy is the highest when the MLR algorithm is used, followed by those when using the RF, MLP, and SVM algorithms. A relatively simple statistical method and low-resolution data rather than a complex machine learning algorithm or high-resolution data achieved the best prediction accuracy. These results are expected to facilitate high-accuracy predictions of the electric energy consumption of residential buildings.
KW - Data-driven model
KW - Electric energy consumption
KW - Prediction accuracy
KW - Residential building
KW - Temporal resolution
UR - http://www.scopus.com/inward/record.url?scp=85139285254&partnerID=8YFLogxK
U2 - 10.1016/j.jobe.2022.105361
DO - 10.1016/j.jobe.2022.105361
M3 - Article
AN - SCOPUS:85139285254
SN - 2352-7102
VL - 62
JO - Journal of Building Engineering
JF - Journal of Building Engineering
M1 - 105361
ER -