Customer Churn Prediction Using the RFM Approach and Extreme Gradient Boosting for Company Strategy Recommendation

Authors

  • Mohammad Isa Irawan Institut Teknologi Sepuluh Nopember
  • Nadhifa Afrinia Dwi Putris Institut Teknologi Sepuluh Nopember
  • Noryanti binti Muhammad Universiti Malaysia Pahang

DOI:

https://doi.org/10.26594/register.v10i2.4004

Keywords:

Customer Churn, Customer, Companies, XGBoost

Abstract

Customers are vital assets in the growth and sustainability of business  organizations. However, customers may discontinue their engagement with a company and switch to competitors’ products or services for various reasons. This event referred to as customer churn. Losing customers significantly impacts a company's revenue, often resulting in financial decline. Churn events, which are subject to dynamic monthly changes, are further influenced by intense competition and rapid technological advancements. Analyzing customer characteristics is crucial to understanding customer behavior, with metrics such as recency, frequency, monetary (RFM) serving as key indicators of subscription and transaction patterns. The Extreme Gradient Boosting method is applied to address the challenge of classifying churn and non-churn customers. The prescriptive analytics process is carried out to identify the features most influential in prediction outcomes, enabling the formulation of strategic recommendations to mitigate churn problems. The integration of RFM analysis with the XGBoost method provides optimal results, particularly in the third segmentation, achieving an accuracy of = 0.98833, precession = 0.98768, recall = 0.98899, and f1-score = 0.98833. The prescriptive analytics process highlights three critical features, namely city factor, GMV generation, and total customer transaction generation. This findings demonstrate that the segmentation characteristics, data representation, and behavioral approach with RFM analysis have an effect on improving the performance of the model in churn prediction.

Author Biographies

Mohammad Isa Irawan, Institut Teknologi Sepuluh Nopember

Department of Mathematics

Nadhifa Afrinia Dwi Putris , Institut Teknologi Sepuluh Nopember

Department of Mathematics

Noryanti binti Muhammad, Universiti Malaysia Pahang

Centre for Mathematical Sciences

References

B. Indonesia. "Profil Perusahaan Tercatat." BEI. https://www.idx.co.id/id/perusahaan-tercatat/ profil-perusahaan-tercatat/ (accessed Jan. 21, 2023).

K. UKM. "SATU DATA KUMKM TERINTEGRASI." KEMENKOPUKM. https://satudata. kemenkopukm.go.id/kumkm_dashboard/ (accessed Jan. 20, 2023)

Hsiao-Ting Tseng, "Customer-centered data power: Sensing and responding capability in big data analytics", Journal of Business Research, Volume 158, March 2023, 113689. [Online serial]. Available: https://doi.org/10.1016/j.jbusres.2023.113689. [Accessed Nov 10, 2024]

A. Perisic, D. Jung, and M. Pahor, "Churn in the Mobile Gaming Field: Establishing Churn Definitions and Measuring Classification Similarities," Expert Systems with Applications, vol. 191, April, 2022. [Online serial]. Available: https://www.sciencedirect.com/science/article/pii/S09574174210 15852. [Accessed Jan. 20, 2023]

A. Wicaksono, A. Anita, and T. Padilah, "Uji Performa Teknik Klasifikasi untuk Memprediksi Customer Churn," Bianglala Informatika, vol. 9, no. 1, 2021. [Online serial]. Available: https://ejournal.bsi.ac.id/ejurnal/index.php/Bianglala/article/view/9992. [Accessed Jan. 21, 2023]

I. Kabasakal, "Customer Segmentation Based On Recency Frequency Monetary Model: A Case Study in E-Retailing," Bilisim Teknolojileri Dergisi, vol. 13, no. 1, January, 2020. [Online serial]. Available: https://www.researchgate.net/publication/338961311_Customer_Segmentation_Based_On_Recency_Frequency_Monetary_Model_A_Case_Study_in_E-Retailing. [Accessed Jan. 21, 2023]

K. Liu, Supply Chain Analytics: Concepts, Techniques and Applications, 1st ed. Swiss: Palgrave Macmillan Cham, 2022. [E-book] Available: https://doi.org/10.1007/978-3-030-92224-5.

T. Chen and C. Guestrin, "XGBoost: A Scalable Tree Boosting System," Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August, 2016. [Online serial]. Available: https://doi.org/10.1145/2939672.2939785. [Accessed Jan. 26, 2023]

M. Herawati, I. Wibowo, and I. Mukhlash, "Prediksi Customer Churn Menggunakan Algoritma Fuzzy Iterative Dichotomiser 3," Limits: Journal of Mathematics and Its Applications, vol. 13, no. 1, 2016. [Online serial]. Available: http://dx.doi.org/10.12962/j1829605X.v13i1.1913. [Accessed Feb. 07, 2023]

S. Shrestha and A. Shakya, "A Customer Churn Prediction Model using XGBoost for the Telecommunication Industry in Nepal," Procedia Computer Science, vol. 215, 2022. [Online serial]. Available: https://doi.org/10.1016/j.procs.2022.12.067. [Accessed Jan. 16, 2023]

C. Mena, A. Caigny, K. Coussement, K. Bock, and S. Lessmann, "Churn prediction with sequential data and deep neural networks. a comparative analysis," Computer Science arXiv, September, 2019. [Online serial]. Available: https://doi.org/10.48550/arXiv.1909.11114

H. Zhang and W. Zhang, "Application of GWO-attention-ConvLSTM model in customer churn prediction and satisfaction analysis in customer relationship management". Heliyon, September 4, 2024 [Online serial]. Available: https://doi.org/10.1016/j.heliyon.2024.e37229 [Accessed Nov 20, 2024]

S.S. Poudel, S. Pokharel, M. Timilsina, "Explaining customer churn prediction in telecom industry using tabular machine learning models", Machine Learning with Applications, June 24, 2024 [online serial]. Available: https://doi.org/10.1016/j.mlwa.2024.100567 [Accessed Nov 15, 2024]

F.E. Usman-Hamza, L.F. Capretz , A.O. Balogun , H.A. Mojeed , R.T. Amosa , S. A. Salihu, A.G. Akintola, and M.A. Mabayoje, "Sampling-based novel heterogeneous multi-layer stacking

ensemble method for telecom customer churn prediction ", Scientific African, May 3, 2024 [Online serial]. Available: https://doi.org/10.1016/j.sciaf.2024.e02223 [Accessed No 18, 2024]

G. Meiselwitz, Social Computing and Social Media. Communication and Social Communities, 1st ed. Jerman: Springer Cham, 2019. [E-book] Available: https://doi.org/10.1007/978-3-030-21905-5.

S. Yulianti, O. Soesanto, and Y. Sukmawaty, "Penerapan Metode Extreme Gradient Boosting (XGBOOST) pada Klasifikasi Nasabah Kartu Kredit," Journal of Mathematics: Theory and Applications, 2022. [Online serial]. Available: https://doi.org/10.31605/jomta.v4i1.1792. [Accessed Jan. 18, 2023]

V. Bewick, L. Cheek, J. Ball, "Statistics review 14: Logistic regression," Critical care, vol. 9, no. 112, January, 2005. [Online serial]. Available: https://doi.org/10.1186/cc3045. [Accessed May. 24, 2023]

B. Pratiwi, A. Handayani, and S. Sarjana, "Pengukuran Kinerja Sistem Kualitas Udara Dengan Teknologi Wsn Menggunakan Confusion Matrix," Jurnal Informatika Upgris, vol. 6, no. 2, December, 2020. [Online serial]. Available: https://doi.org/10.26877/jiu.v6i2.6552. [Accessed Jan. 26, 2023]

Downloads

Published

2024-12-22

How to Cite

[1]
M. I. Irawan, N. A. D. Putris, and N. binti Muhammad, “Customer Churn Prediction Using the RFM Approach and Extreme Gradient Boosting for Company Strategy Recommendation”, Register: Jurnal Ilmiah Teknologi Sistem Informasi, vol. 10, no. 2, pp. 127–140, Dec. 2024.

Issue

Section

Article