Handling class imbalance in health laboratory data with the Majority Weighted Minority Oversampling Technique
Keywords: classification, health laboratory data, imbalanced data, MWMOTE
A diagnosis is more likely to be accurate when it is supported by a sequence of examinations, from the initial assessment (anamnesis) through to laboratory tests. Laboratory results carry information about many diseases, but some of those diseases have a low prevalence. Low-prevalence diseases still affect how patients are subsequently treated. Because the class ratio in such laboratory data is unbalanced, classification accuracy suffers, and so does the handling of the disease. The Majority Weighted Minority Oversampling Technique (MWMOTE) is one way to address class imbalance. This study aims to handle the imbalance in health laboratory data in order to classify diseases with higher accuracy. The results show that MWMOTE improved accuracy on the imbalanced data by 3.13%.
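To make the idea of minority oversampling concrete, the following is a minimal sketch, not the method used in this study. It implements a simplified SMOTE-style interpolation: each synthetic point is placed on the line segment between a minority sample and one of its nearest minority neighbours. MWMOTE goes further by weighting minority samples according to their proximity to the majority class and by generating synthetic samples within minority clusters; those steps are omitted here. The function name `oversample_minority`, its parameters, and the toy data are all hypothetical and chosen for illustration only.

```python
import numpy as np


def oversample_minority(X_min, n_new, k=3, seed=0):
    """Return n_new synthetic points interpolated between minority samples.

    Simplified SMOTE-style sketch (NOT MWMOTE): no sample weighting
    and no clustering of the minority class.
    """
    rng = np.random.default_rng(seed)
    X_min = np.asarray(X_min, dtype=float)
    n = len(X_min)
    if n < 2:
        raise ValueError("need at least two minority samples to interpolate")
    k = min(k, n - 1)

    # Pairwise Euclidean distances among the minority samples.
    dists = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)  # a sample is not its own neighbour
    neighbours = np.argsort(dists, axis=1)[:, :k]

    # Pick a random base sample and one of its k nearest minority neighbours.
    base = rng.integers(0, n, size=n_new)
    picked = neighbours[base, rng.integers(0, k, size=n_new)]

    # Place each synthetic point at a random position on the segment.
    gap = rng.random((n_new, 1))
    return X_min[base] + gap * (X_min[picked] - X_min[base])


# Toy data (assumed for illustration): 95 majority vs. 5 minority samples.
data_rng = np.random.default_rng(42)
X_major = data_rng.normal(0.0, 1.0, size=(95, 2))
X_minor = data_rng.normal(3.0, 0.5, size=(5, 2))

# Generate enough synthetic minority points to balance the two classes.
X_new = oversample_minority(X_minor, n_new=len(X_major) - len(X_minor))
X_balanced = np.vstack([X_major, X_minor, X_new])
y_balanced = np.array([0] * len(X_major) + [1] * (len(X_minor) + len(X_new)))
```

In practice the synthetic samples should be generated from the training split only, so that the test set keeps the original class ratio and the reported accuracy reflects real data.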
The rights and licenses for this article are set out in Register: Jurnal Ilmiah Teknologi Sistem Informasi. By submitting the manuscript, the author(s) agree to this policy. No separate document sign-off is required.
Non-commercial use of the article is governed by the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.