학술논문

Transformer-Based Forecasting for Sustainable Energy Consumption Toward Improving Socioeconomic Living: AI-Enabled Energy Consumption Forecasting
Document Type
Periodical
Source
IEEE Systems, Man, and Cybernetics Magazine IEEE Syst. Man Cybern. Mag. Systems, Man, and Cybernetics Magazine, IEEE. 10(2):52-60 Apr, 2024
Subject
Signal Processing and Analysis
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
General Topics for Engineers
Power, Energy and Industry Applications
Training
Energy consumption
Load forecasting
Transfer learning
Predictive models
Transformers
Data models
Language
ISSN
2380-1298
2333-942X
Abstract
Smart energy management encompasses energy consumption prediction and energy data analytics. Energy consumption prediction or electric load forecasting leverages autoregressive and moving-average models. Recently, there has been a lot of traction in data-driven models for energy consumption prediction. In this article, a self-attention-based Transformer model is proposed. The deep-learning model captures long-term dependencies in the data sequence and can be used for long-term prediction. The proposed model is compared with autoregressive integrated moving average (ARIMA) and long short-term memory network (LSTM) models. The different models were applied to the load consumption data from a house located in Sceaux, Paris, France. The prediction windows of 24, 100, and 200 h were considered. To evaluate the performance, the mean absolute prediction error (MAPE) and root-mean-square error (RMSE) were considered as the metrics. The Transformer and LSTM models performed significantly better than ARIMA. Even though Transformer and LSTM performed on par, the load forecasted using Transformer was closer to the previous data in the dataset, which proves the better efficiency of the model. Since the Transformer model has transfer learning ability, it can be used as a pretrained model for training of other time-series datasets and, hence, the model can be potentially applied in other related prediction scenarios also, where time-series data are involved.