Predictive Analytics for Sales Forecasting in Emerging Market Retail: An Ensemble Learning and Time-Series Approach for A&Z Supermarket

Authors

  • Idowu Olugbenga Adewumi Department of Computer and Information Science, Faculty of Natural and Applied Science, Lead City University, Nigeria

DOI:

https://doi.org/10.57260/csdj.2026.286583

Keywords:

Sales forecasting, Ensemble learning, Time-series decomposition, LSTM neural networks, XGBoost regressor

Abstract

Evaluation of forecasting models with 5000 daily sales records (2020-2023) from A&Z Supermarket located at Oyo State, Nigeria. Ten (10) product IDs from five (5) stores. The dataset incorporates a total of 11 predictive variables. This includes price, competitor price, macroeconomic index (simulated), promo indicators, holiday flags and rolling averages. The forecast models are assessed through a train/test split with a 70/30 ratio. They are forecasting for a 7-day horizon. Further, rolling window validation prevents leakage. According to descriptive analysis, the average sales quantity resulted in 26.49 sales units with a standard deviation of 11.58. The day of sales displays cumulative spikes, mainly during weekends as well as holidays.  The research compares various traditional models such as ARIMA, SARIMA, and Linear Regression with machine learning models like Random Forest, XGBoost, LSTM, and GRU and has a RMSE ranging from 10.55 to 14.36 and R² values ranging from %u221A0.740 to 0.884. Bagging, boosting, stacking techniques are used to improve the performance of predictive models. The stacking ensemble achieved the lowest RMSE (8.14). The performance of LSTM was observed to be best as depicted by (RMSE= 8.75, R²= 0.91) as it outperformed Prophet and XGBoost. The lack of generalizability can be attributed to a narrow dataset that incorporates only a single retailer and a simulated macroeconomic index. An analysis of the business KPIs indicates a decline in inventory costs of 17.8% and of stockouts by 15.4%. Such improvements along with a 9.5% increase in ROI are encouraged through the use of LSTM (Long-Short Term Memory) investments. The results indicate that hybrid ensemble-time-series models are superior to conventional methods and offer important practical conveniences in optimizing inventory and forecasting demand in retail settings in emerging markets.

Downloads

Download data is not yet available.

References

Ahaggach, H., Abrouk, L., & Lebon, E. (2024). Systematic mapping study of sales forecasting: Methods, trends, and future directions. Forecasting, 6(3), 502-532. https://doi.org/10.3390/forecast6030028

Giannopoulos, P. G., Dasaklis, T. K., Tsantilis, L., & Patsakis, C. (2025). Machine learning algorithms in intermittent demand forecasting: a review. International Journal of Production Research, 1–43. https://doi.org/10.1080/00207543.2025.2578701

Hobor, L., Brcic, M., Polutnik, L., & Kapetanovic, A. (2025). Comparative Analysis of Modern Machine Learning Models for Retail Sales Forecasting. https://doi.org/10.48550/arXiv.2506.05941

Hu, H., Ran, D., Thaichon, P., Wang, B., & Zhu, Z. (2025). Grid based market sales forecasting for retail businesses using automated machine learning and geospatial intelligence. Expert Systems with Applications, 284, 127869. https://doi.org/10.1016/j.eswa.2025.127869

Kizgin, K. T., Alp, S., Aydin, N., & Yu, H. (2025). Machine learning-based sales forecasting during crises: Evidence from a Turkish women's clothing retailer. Mathematical Problems in Engineering, 108(1). https://doi.org/10.1177/00368504241307719

Kristian, D., & Suri, P. A. (2025). Comparative study of AutoML and machine learning in retail forecasting.. Procedia Computer Science, 269, 882-891. https://doi.org/10.1016/j.procs.2025.09.031

Makridakis, S., Spiliotis, E., & Assimakopoulos, V. (2022). Predicting/hypothesizing the findings of the M5 competition. International Journal of Forecasting, 38(4), 1337-1345. https://doi.org/10.1016/j.ijforecast.2021.09.014

Mansur, S., Nur, S. B., & Rahman, M. M. (2025). Sales forecasting for retail stores using hybrid neural networks and machine learning models. PeerJ Computer Science, 11, e3058. https://doi.org/10.7717/peerj-cs.3058

Mejía, S., & Aguilar, J. (2024). A demand forecasting system of product categories defined by their time series using a hybrid approach of ensemble learning with feature engineering. Computing, 106, 3945–3965. https://doi.org/10.1007/s00607-024-01320-y

Mustapha, O. O., & Sithole, T. (2025). Forecasting retail sales using machine learning models. American Journal of Statistics and Actuarial Sciences, 6(1), 35–67. https://doi.org/10.47672/ajsas.2679

Pattnaik, M., Padhi, S. K., Panda, L., Naushad, U., Behera, R. K., Mishra, A., & Pattnaik, A. (2025). AI-driven sales forecasting in retail: insights from Big Bazaar’s sales data, International Journal of Management Science and Engineering Management , 21, 1-14. https://doi.org/10.1080/17509653.2025.2499076

Reddy, K. M., Ravikanth, K., Penjarla, N. K., Poola, S., Patha, S. (2025). Sales Forecasting using Predictive Analytics: AMachine LearningandTime-Series Approach . International Journal of Research PublicationandReviews, 6(8), 5476-5483. https://doi.org/10.55248/gengpi.6.0825.3193

Samal, T, & Ghosh, A. (2026). Ensemble based predictive analytics for demand forecasting in multi channel retail. Expert Systems with Applications, 299, 130212. https://doi.org/10.1016/j.eswa.2025.130212

Sattar, M. U., Dattana, V., Hasan, R., Mahmood, S., Khan, H. W., & Hussain, S. (2025). Enhancing supply chain management: A comparative study of machine learning models for forecasting and ESG performance. Sustainability, 17(13), 5772. https://doi.org/10.3390/su17135772

Sun, W., Li, M., Chen, X. H., & Wang, Y. (2024). Enhancing exchange rate prediction and risk management under uncertainty shocks: an AI-driven ensemble prediction model based on metaheuristic optimization. Ann Oper Res, 1-39. https://doi.org/10.1007/s10479-024-06319-4

Theodoridis, G., & Tsadiras, A. (2025). Retail demand forecasting: A comparative analysis of deep neural networks and the proposal of LSTMixer, a linear model extension. Information, 16(7), 596. https://doi.org/10.3390/info16070596

Udodiugwu, M. I., Eneremadu, K. E., Onunkwo, A. R., Onyia, M. K., & Gloria, O. C. (2024). The Role of Artificial Intelligence in Enhancing the Performance of Banks in Nigeria. Arabian Journal of Business and Management Review (Oman Chapter), 11(2), 27–34. https://j.arabianjbmr.com/index.php/ocAJBMR/article/view/1236

Wu, X. (2025). Forecasting Supply Chain Sales and Profits Using Machine Learning: A Comparative Study on Retail Data. Conference: ICMML 2025: 2025 3rd International Conference on Mathematics and Machine Learning, 136-141. https://doi.org/10.1145/3783779.3783802

Zhang, x., Kang, Y., Li., C., Wang, W., & Wang, K. (2025). LSTM-conformal forecasting-based bitcoin forecasting method for enhancing reliability. PLOS ONE, 20(5), e0319008. https://doi.org/10.1371/journal.pone.0319008

Downloads

Published

2026-02-18

How to Cite

Adewumi, I. O. (2026). Predictive Analytics for Sales Forecasting in Emerging Market Retail: An Ensemble Learning and Time-Series Approach for A&Z Supermarket. Community and Social Development Journal, 27(1), 95–114. https://doi.org/10.57260/csdj.2026.286583

Issue

Section

บทความวิจัย (RESEARCH ARTICLE)

Categories