Ambiguitas Machine Translation pada Cross Language Chatbot Bea Cukai
DOI:
https://doi.org/10.26594/register.v5i1.1387Keywords:
ambiguitas, ambiguity, chatbot, Cross Language, machine translation, mesin translasi, POS Tagging,Abstract
Sistem Information Retrieval (IR) maupun chatbot semakin banyak dikembangkan. Salah satu bagian yang banyak diteliti adalah cross language. Masalah pada pengembangan cross language yaitu terjadinya kesalahan pada hasil terjemahan mesin translasi yang memberikan arti tidak sesuai dengan bahasa natural, sehingga pengguna tidak mendapatkan jawaban yang semestinya, bahkan tidak jarang pula pengguna tidak menemukan jawaban. Penelitian ini mengusulkan skema baru mesin translasi yang bertujuan meningkatkan performa dalam masalah ambiguitas. Mesin translasi bekerja dengan cek kebenaran kata kunci, kemudian melakukan Part-of-Speech (POS) Tagging pada kata benda (noun). Kemudian, setiap kata benda yang terdeteksi akan dicari sinonimnya. Lalu, sinonim yang didapatkan akan ditambahkan dan menjadi alternatif kueri baru. Kueri yang mempunyai nilai confident tertinggi diasumsikan sebagai kueri yang paling sesuai. Pada hasil yang didapatkan setelah dilakukan uji coba, melalui penambahan metode yang kami usulkan pada machine translation, dapat meningkatkan akurasi chatbot dibandingkan tanpa menggunakan skema yang diusulkan. Hasil akurasi bertambah 5%, dari yang semula 73% menjadi 77%.
Information retrieval and chatbot systems are increasingly being developed with its language part mostly studied. However, the problem associated with its development is the occurrence of errors in the translation machine resulting in inaccurate answers not in accordance with the natural language, thereby providing users with wrong answers. This study proposes a new translation machine scheme that aims to improve performance while translating ambiguous terms. Translation machines functions by checking the correctness of keywords, and carrying out Part-of-Speech (POS) Tagging on nouns (noun). The synonyms of any detected noun are searched for and obtained added to become alternative new queries. Those with the highest confident value are assumed to be the most appropriate. The results obtained after testing, through the addition of the method proposed in machine translation, can improve the accuracy of the chatbot compared to not using the proposed scheme. The results of the accuracy increased from the original 73% to 77%.
References
Asim, M. N., Wasim, M., Khan, M. U., Mahmood, N., & Mahmood, W. (2019). The Use of Ontology in Retrieval: A Study on Textual, Multilingual, and Multimedia Retrieval. IEEE Access, 7, 21662-21686.
Feng, X., Huang, L., Qin, B., Lin, Y., Ji, H., & Liu, T. (2017). Multi-level cross-lingual attentive neural architecture for low resource name tagging. Tsinghua Science and Technology, 22(6), 633-645.
Fryer, L. K., Ainley, M., Thompson, A., Gibson, A., & Sherlock, Z. (2017). Stimulating and sustaining interest in a language course: An experimental comparison of Chatbot and Human task partners. Computers in Human Behavior, 75, 461-468.
Jovita, J., Linda, L., Hartawan, A., & Suhartono, D. (2015). Using Vector Space Model in Question Answering System. Procedia Computer Science, 59(2015), 305-311.
Kumar, M. A., Rajendran, S., & Soman, K. P. (2015). Cross-Lingual Preposition Disambiguation for Machine Translation. Procedia Computer Science, 54(2015), 291-300.
Madankar, M., Chandak, M. B., & Chavhan, N. (2016). Information Retrieval System and Machine Translation: A Review. Procedia Computer Science, 78(2016), 845-850.
Meade, G., Midgley, K. J., Dijkstra, T., & Holcomb, P. J. (2018). Cross-language Neighborhood Effects in Learners Indicative of an Integrated Lexicon. Journal of Cognitive Neuroscience, 30(1), 70-85.
Oard, D., & Wang, J. (1999). Effects of term segmentation on Chinese/English cross-language information retrieval. 6th International Symposium on String Processing and Information Retrieval. Cancun, Mexico, Mexico: IEEE.
Qu, J., Nguyen, L. M., & Shimazu, A. (2016). Cross-language information extraction and auto evaluation for OOV term translations. China Communications, 13(12), 277-296.
Sharma, V. K., & Mittal, N. (2016). Exploiting Wikipedia API for Hindi-english Cross-language Information Retrieval. Procedia Computer Science, 89(2016), 434-440.
Tran, K.-N., & Christen, P. (2015). Cross-Language Learning from Bots and Users to Detect Vandalism on Wikipedia. IEEE Transactions on Knowledge and Data Engineering, 27(3), 673-685.
Zhang, J., Zhou, Y., & Zong, C. (2016). Abstractive Cross-Language Summarization via Translation Model Enhanced Predicate Argument Structure Fusing. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(10), 1842-1853.
Zhou, G., Xie, Z., He, T., Zhao, J., & Hu, X. T. (2016). Learning the Multilingual Translation Representations for Question Retrieval in Community Question Answering via Non-negative Matrix Factorization. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 24(7), 1305-1314.
Downloads
Published
How to Cite
Issue
Section
License
Please find the rights and licenses in Register: Jurnal Ilmiah Teknologi Sistem Informasi. By submitting the article/manuscript of the article, the author(s) agree with this policy. No specific document sign-off is required.
1. License
The non-commercial use of the article will be governed by the Creative Commons Attribution license as currently displayed on Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
2. Author(s)' Warranties
The author warrants that the article is original, written by stated author(s), has not been published before, contains no unlawful statements, does not infringe the rights of others, is subject to copyright that is vested exclusively in the author and free of any third party rights, and that any necessary written permissions to quote from other sources have been obtained by the author(s).
3. User/Public Rights
Register's spirit is to disseminate articles published are as free as possible. Under the Creative Commons license, Register permits users to copy, distribute, display, and perform the work for non-commercial purposes only. Users will also need to attribute authors and Register on distributing works in the journal and other media of publications. Unless otherwise stated, the authors are public entities as soon as their articles got published.
4. Rights of Authors
Authors retain all their rights to the published works, such as (but not limited to) the following rights;
Copyright and other proprietary rights relating to the article, such as patent rights,
The right to use the substance of the article in own future works, including lectures and books,
The right to reproduce the article for own purposes,
The right to self-archive the article (please read out deposit policy),
The right to enter into separate, additional contractual arrangements for the non-exclusive distribution of the article's published version (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal (Register: Jurnal Ilmiah Teknologi Sistem Informasi).
5. Co-Authorship
If the article was jointly prepared by more than one author, any authors submitting the manuscript warrants that he/she has been authorized by all co-authors to be agreed on this copyright and license notice (agreement) on their behalf, and agrees to inform his/her co-authors of the terms of this policy. Register will not be held liable for anything that may arise due to the author(s) internal dispute. Register will only communicate with the corresponding author.
6. Royalties
Being an open accessed journal and disseminating articles for free under the Creative Commons license term mentioned, author(s) aware that Register entitles the author(s) to no royalties or other fees.
7. Miscellaneous
Register will publish the article (or have it published) in the journal if the article’s editorial process is successfully completed. Register's editors may modify the article to a style of punctuation, spelling, capitalization, referencing and usage that deems appropriate. The author acknowledges that the article may be published so that it will be publicly accessible and such access will be free of charge for the readers as mentioned in point 3.