Customer Churn Prediction Using the RFM Approach and Extreme Gradient Boosting for Company Strategy Recommendation
DOI:
https://doi.org/10.26594/register.v10i2.4004Keywords:
Customer Churn, Customer, Companies, XGBoostAbstract
Customers are vital assets in the growth and sustainability of business organizations. However, customers may discontinue their engagement with a company and switch to competitors’ products or services for various reasons. This event referred to as customer churn. Losing customers significantly impacts a company's revenue, often resulting in financial decline. Churn events, which are subject to dynamic monthly changes, are further influenced by intense competition and rapid technological advancements. Analyzing customer characteristics is crucial to understanding customer behavior, with metrics such as recency, frequency, monetary (RFM) serving as key indicators of subscription and transaction patterns. The Extreme Gradient Boosting method is applied to address the challenge of classifying churn and non-churn customers. The prescriptive analytics process is carried out to identify the features most influential in prediction outcomes, enabling the formulation of strategic recommendations to mitigate churn problems. The integration of RFM analysis with the XGBoost method provides optimal results, particularly in the third segmentation, achieving an accuracy of = 0.98833, precession = 0.98768, recall = 0.98899, and f1-score = 0.98833. The prescriptive analytics process highlights three critical features, namely city factor, GMV generation, and total customer transaction generation. This findings demonstrate that the segmentation characteristics, data representation, and behavioral approach with RFM analysis have an effect on improving the performance of the model in churn prediction.
References
B. Indonesia. "Profil Perusahaan Tercatat." BEI. https://www.idx.co.id/id/perusahaan-tercatat/ profil-perusahaan-tercatat/ (accessed Jan. 21, 2023).
K. UKM. "SATU DATA KUMKM TERINTEGRASI." KEMENKOPUKM. https://satudata. kemenkopukm.go.id/kumkm_dashboard/ (accessed Jan. 20, 2023)
Hsiao-Ting Tseng, "Customer-centered data power: Sensing and responding capability in big data analytics", Journal of Business Research, Volume 158, March 2023, 113689. [Online serial]. Available: https://doi.org/10.1016/j.jbusres.2023.113689. [Accessed Nov 10, 2024]
A. Perisic, D. Jung, and M. Pahor, "Churn in the Mobile Gaming Field: Establishing Churn Definitions and Measuring Classification Similarities," Expert Systems with Applications, vol. 191, April, 2022. [Online serial]. Available: https://www.sciencedirect.com/science/article/pii/S09574174210 15852. [Accessed Jan. 20, 2023]
A. Wicaksono, A. Anita, and T. Padilah, "Uji Performa Teknik Klasifikasi untuk Memprediksi Customer Churn," Bianglala Informatika, vol. 9, no. 1, 2021. [Online serial]. Available: https://ejournal.bsi.ac.id/ejurnal/index.php/Bianglala/article/view/9992. [Accessed Jan. 21, 2023]
I. Kabasakal, "Customer Segmentation Based On Recency Frequency Monetary Model: A Case Study in E-Retailing," Bilisim Teknolojileri Dergisi, vol. 13, no. 1, January, 2020. [Online serial]. Available: https://www.researchgate.net/publication/338961311_Customer_Segmentation_Based_On_Recency_Frequency_Monetary_Model_A_Case_Study_in_E-Retailing. [Accessed Jan. 21, 2023]
K. Liu, Supply Chain Analytics: Concepts, Techniques and Applications, 1st ed. Swiss: Palgrave Macmillan Cham, 2022. [E-book] Available: https://doi.org/10.1007/978-3-030-92224-5.
T. Chen and C. Guestrin, "XGBoost: A Scalable Tree Boosting System," Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August, 2016. [Online serial]. Available: https://doi.org/10.1145/2939672.2939785. [Accessed Jan. 26, 2023]
M. Herawati, I. Wibowo, and I. Mukhlash, "Prediksi Customer Churn Menggunakan Algoritma Fuzzy Iterative Dichotomiser 3," Limits: Journal of Mathematics and Its Applications, vol. 13, no. 1, 2016. [Online serial]. Available: http://dx.doi.org/10.12962/j1829605X.v13i1.1913. [Accessed Feb. 07, 2023]
S. Shrestha and A. Shakya, "A Customer Churn Prediction Model using XGBoost for the Telecommunication Industry in Nepal," Procedia Computer Science, vol. 215, 2022. [Online serial]. Available: https://doi.org/10.1016/j.procs.2022.12.067. [Accessed Jan. 16, 2023]
C. Mena, A. Caigny, K. Coussement, K. Bock, and S. Lessmann, "Churn prediction with sequential data and deep neural networks. a comparative analysis," Computer Science arXiv, September, 2019. [Online serial]. Available: https://doi.org/10.48550/arXiv.1909.11114
H. Zhang and W. Zhang, "Application of GWO-attention-ConvLSTM model in customer churn prediction and satisfaction analysis in customer relationship management". Heliyon, September 4, 2024 [Online serial]. Available: https://doi.org/10.1016/j.heliyon.2024.e37229 [Accessed Nov 20, 2024]
S.S. Poudel, S. Pokharel, M. Timilsina, "Explaining customer churn prediction in telecom industry using tabular machine learning models", Machine Learning with Applications, June 24, 2024 [online serial]. Available: https://doi.org/10.1016/j.mlwa.2024.100567 [Accessed Nov 15, 2024]
F.E. Usman-Hamza, L.F. Capretz , A.O. Balogun , H.A. Mojeed , R.T. Amosa , S. A. Salihu, A.G. Akintola, and M.A. Mabayoje, "Sampling-based novel heterogeneous multi-layer stacking
ensemble method for telecom customer churn prediction ", Scientific African, May 3, 2024 [Online serial]. Available: https://doi.org/10.1016/j.sciaf.2024.e02223 [Accessed No 18, 2024]
G. Meiselwitz, Social Computing and Social Media. Communication and Social Communities, 1st ed. Jerman: Springer Cham, 2019. [E-book] Available: https://doi.org/10.1007/978-3-030-21905-5.
S. Yulianti, O. Soesanto, and Y. Sukmawaty, "Penerapan Metode Extreme Gradient Boosting (XGBOOST) pada Klasifikasi Nasabah Kartu Kredit," Journal of Mathematics: Theory and Applications, 2022. [Online serial]. Available: https://doi.org/10.31605/jomta.v4i1.1792. [Accessed Jan. 18, 2023]
V. Bewick, L. Cheek, J. Ball, "Statistics review 14: Logistic regression," Critical care, vol. 9, no. 112, January, 2005. [Online serial]. Available: https://doi.org/10.1186/cc3045. [Accessed May. 24, 2023]
B. Pratiwi, A. Handayani, and S. Sarjana, "Pengukuran Kinerja Sistem Kualitas Udara Dengan Teknologi Wsn Menggunakan Confusion Matrix," Jurnal Informatika Upgris, vol. 6, no. 2, December, 2020. [Online serial]. Available: https://doi.org/10.26877/jiu.v6i2.6552. [Accessed Jan. 26, 2023]
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Mohammad Isa Irawan, Nadhifa Afrinia Dwi Putris , Noryanti binti Muhammad
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Please find the rights and licenses in Register: Jurnal Ilmiah Teknologi Sistem Informasi. By submitting the article/manuscript of the article, the author(s) agree with this policy. No specific document sign-off is required.
1. License
The non-commercial use of the article will be governed by the Creative Commons Attribution license as currently displayed on Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
2. Author(s)' Warranties
The author warrants that the article is original, written by stated author(s), has not been published before, contains no unlawful statements, does not infringe the rights of others, is subject to copyright that is vested exclusively in the author and free of any third party rights, and that any necessary written permissions to quote from other sources have been obtained by the author(s).
3. User/Public Rights
Register's spirit is to disseminate articles published are as free as possible. Under the Creative Commons license, Register permits users to copy, distribute, display, and perform the work for non-commercial purposes only. Users will also need to attribute authors and Register on distributing works in the journal and other media of publications. Unless otherwise stated, the authors are public entities as soon as their articles got published.
4. Rights of Authors
Authors retain all their rights to the published works, such as (but not limited to) the following rights;
Copyright and other proprietary rights relating to the article, such as patent rights,
The right to use the substance of the article in own future works, including lectures and books,
The right to reproduce the article for own purposes,
The right to self-archive the article (please read out deposit policy),
The right to enter into separate, additional contractual arrangements for the non-exclusive distribution of the article's published version (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal (Register: Jurnal Ilmiah Teknologi Sistem Informasi).
5. Co-Authorship
If the article was jointly prepared by more than one author, any authors submitting the manuscript warrants that he/she has been authorized by all co-authors to be agreed on this copyright and license notice (agreement) on their behalf, and agrees to inform his/her co-authors of the terms of this policy. Register will not be held liable for anything that may arise due to the author(s) internal dispute. Register will only communicate with the corresponding author.
6. Royalties
Being an open accessed journal and disseminating articles for free under the Creative Commons license term mentioned, author(s) aware that Register entitles the author(s) to no royalties or other fees.
7. Miscellaneous
Register will publish the article (or have it published) in the journal if the article’s editorial process is successfully completed. Register's editors may modify the article to a style of punctuation, spelling, capitalization, referencing and usage that deems appropriate. The author acknowledges that the article may be published so that it will be publicly accessible and such access will be free of charge for the readers as mentioned in point 3.