Clickbait detection: A literature review of the methods used
DOI:
https://doi.org/10.26594/register.v6i1.1561Keywords:
clikbait, deep learning, literature review, machine learning, word embeddingAbstract
Online news portals are currently one of the fastest sources of information used by people. Its impact is due to the credibility of the news produced by actors from the media industry, which is sometimes questioned. However, one of the problems associated with this medium used to obtain information is clickbait. This technique aims to attract users to click hyperbolic headlines with content that often disappoints the reader. This study was, therefore, conducted to determine: 1) existing dataset available. 2) The method used in clickbait detection which consists of data preprocessing, analysis of features, and classification. 3) Difference steps from the method used.References
A. Anand, T. Chakraborty and N. Park, "We Used Neural Networks to Detect Clickbaits: You Won’t Believe What Happened Next!," in 39th European Conference on IR Research, Cham, 2017.
A. Agrawal, "Clickbait detection using deep learning," in 2016 2nd International Conference on Next Generation Computing Technologies (NGCT), Dehradun, India, 2016.
G. Loewenstein, "The psychology of curiosity: A review and reinterpretation," Psychological Bulletin, vol. 116, no. 1, pp. 75-98, 1994.
M. Potthast, S. Köpsel, B. Stein and M. Hagen, "Clickbait Detection," in European Conference on Information Retrieval ECIR, Cham, 2016.
C. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. Bethard and D. McClosky, "The Stanford CoreNLP Natural Language Processing Toolkit," in Proceedings of 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Baltimore, Maryland, USA, 2014.
A. Chakraborty, B. Paranjape, S. Kakarla and N. Ganguly, "Stop Clickbait: Detecting and preventing clickbaits in online news media," in IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), San Francisco, CA, USA, 2016.
M. M. U. Rony, N. Hassan and M. Yousuf, "Diving Deep into Clickbaits: Who Use Them to What Extents in Which Topics with What Effects?," in The 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM '17), Sydney, Australia, 2017.
M. Dong, L. Yao, X. Wang, B. Benatallah and C. Huang, "Similarity-Aware Deep Attentive Model for Clickbait Detection," in Pacific-Asia Conference on Knowledge Discovery and Data Mining PAKDD 2019: Advances in Knowledge Discovery and Data Mining, Macau, China.
R. Maulidi, M. F. Ayilillahi, L. Isyiriyah and J. F. Palandi, "Penerapan Neural Network Backprogpagation Untuk Klasifikasi Artikel Clickbait," in Seminar Nasional FST 2018, Malang, 2018.
A. F. Yavi, "Klasifikasi Artikel Berbahasa Indonesia untuk Mendeteksi Clickbait menggunakan Metode Naïve Bayes," J-INTECH Journal of Information and Technology, vol. 6, no. 1, pp. 141-147, 2018.
B. Kitchenham and S. Charters, "Guidelines for performing Systematic Literature Reviews in Software Engineering," 2007.
P. Biyani, K. Tsioutsiouliklis and J. Blackmer, ""8 amazing secrets for getting more clicks": detecting clickbaits in news streams using article informality," in AAAI'16 Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, Phoenix, Arizona, 2016.
X. Cao, T. Le, J. (. Zhang and D. Lee, "Machine Learning Based Detection of Clickbait Posts in Social Media," 2017.
P. K. Dimpas, R. V. Po and M. J. Sabellano, "Filipino and english clickbait detection using a long short term memory recurrent neural network," in International Conference on Asian Language Processing (IALP), Singapore, Singapore, 2017.
J. Fu, L. Liang, X. Zhou and J. Zheng, "A Convolutional Neural Network for Clickbait Detection," in 2017 4th International Conference on Information Science and Control Engineering (ICISCE), Changsha, China, 2017.
V. Kumar, D. Khattar, S. Gairola, Y. K. Lal and V. Varma, "Identifying Clickbait: A Multi-Strategy Approach Using Neural Networks," in SIGIR '18 The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval, Ann Arbor, MI, USA, 2018.
S. Manjesh, T. Kanakagiri, P. Vaishak, V. Chettiar and G. Shobha, "Clickbait Pattern Detection and Classification of News Headlines Using Natural Language Processing," in 2nd International Conference on Computational Systems and Information Technology for Sustainable Solution (CSITSS), Bangalore, India, 2017.
M. M. U. Rony, N. Hassan and M. Yousuf, "BaitBuster: A Clickbait Identification Framework," in The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), New Orleans, Louisiana, USA, 2018.
A. Geçkil, A. A. Müngen, E. Gündogan and M. Kaya, "A Clickbait Detection Method on News Sites," in 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), Barcelona, Spain, 2018.
P. Klairith and S. Tanachutiwat, "Thai Clickbait Detection Algorithms Using Natural Language Processing with Machine Learning Techniques," in International Conference on Engineering, Applied Sciences, and Technology (ICEAST), Phuket, Thailand, 2018.
S. Pandey and G. Kaur, "Curious to Click It?-Identifying Clickbait using Deep Learning and Evolutionary Algorithm," in International Conference on Advances in Computing, Communications and Informatics (ICACCI), Bangalore, India, 2018.
K. Shu, S. Wang, T. Le, D. Lee and H. Liu, "Deep Headline Generation for Clickbait Detection," in IEEE International Conference on Data Mining (ICDM), Singapore, Singapore, 2018.
N. Wongsap, T. Prapphan, L. Lou, S. Kongyoung, S. Jumun and N. Kaothanthong, "Thai Clickbait Headline News Classification and its Characteristic," in International Conference on Embedded Systems and Intelligent Technology & International Conference on Information and Communication Technology for Embedded Systems (ICESIT-ICICTES), Khon Kaen, Thailand, 2018.
H.-T. Zheng, J.-Y. Chen, X. Yao, A. K. Sangaiah, Y. Jiang and C.-Z. Zhao, "Clickbait Convolutional Neural Network," symmetry, vol. 10, no. 5, 2018.
M. Potthast, T. Gollub, M. Hagen and B. Stein, "The Clickbait Challenge 2017: Towards a Regression Model for Clickbait Strength," 2017.
S. García, J. Luengo and F. Herrera, Data Preprocessing in Data Mining, New York: Springer, 2015.
Y. Li, L. Xu, F. Tian, L. Jiang, X. Zhong and E. Chen, "Word Embedding Revisited: A New Representation Learning and Explicit Matrix Factorization Perspective," in Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 2015), Buenos Aires, Argentina, 2015.
T. Mikolov, K. Chen, G. Corrado and J. Dean, "Efficient Estimation of Word Representations in Vector Space," 2013.
T. Mikolov, I. Sutskever, I. Sutskever, G. Corrado and J. Dean, "Distributed Representations of Words and Phrases and their Compositionality," in Advances in Neural Information Processing Systems 26 (NIPS 2013), Lake Tahoe, Nevada, United States, 2013.
J. Pennington, R. Socher and C. D. Manning, "GloVe: Global Vectors for Word Representation," in In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), Doha, Qatar, 2014.
A. Géron, Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems, 1st ed., Gravenstein Highway North, Sebastopol: O'Reilly, 2017.
V. Kotu and B. Deshpande, Data Science: Concepts and Practice, 2nd ed., Cambridge, United States: Morgan Kaufmann, 2019.
M. Langarizadeh and F. Moghbeli, "Applying Naive Bayesian Networks to Disease Prediction: a Systematic Review," Acta Inform Med, vol. 24, no. 5, p. 364–369, 2016.
M. F. Hornick, E. Marcadé and S. Venkayala, Java Data Mining Strategy, Standard, and Practice: A Practical Guide for Architecture, Design, and Implementation, San Francisco: Morgan Kaufmann, 2007.
R. Pal, "Predictive modeling based on multivariate random forests," in Predictive Modeling of Drug Sensitivity, Elsevier, 2017, pp. 189-218.
M. Binkhonain and L. Zhao, "A review of machine learning algorithms for identification and classification of non-functional requirements," Expert Systems with Applications: X, vol. 1, no. April, p. 100001, 2019.
J. Han, M. Kamber and J. Pei, Data Mining Concepts and Techniques, Third Edition ed., Wyman Street, Waltham: Morgan Kaufmann, 2012.
G. Shi, "Decision Trees," in Data Mining and Knowledge Discovery for Geoscientists, Elsevier, 2014, pp. 111-138.
J. H. Friedman, "Stochastic gradient boosting," Computational Statistics & Data Analysis, vol. 38, no. 4, pp. 367-378, 2002.
X.-S. Yang, "Neural networks and deep learning," in Introduction to Algorithms for Data Mining and Machine Learning, Elsevier, 2019, pp. 139-161.
R. Hecht-Nielsen, "Theory of the Backpropagation Neural Network," in Neural Networks for Perception Computation, Learning, and Architectures, Elsevier, 1992, pp. 65-93.
R. Nisbet, G. Miner and K. Yale, "Deep Learning," in Handbook of Statistical Analysis and Data Mining Applications, 2nd ed., Elsevier, 2018, pp. 741-751.
E. Fathi and B. M. Shoja, "Deep Neural Networks for Natural Language Processing," in Handbook of Statistics, vol. 38, Elsevier, 2019, pp. 229-316.
S. Theodoridis, "Neural Networks and Deep Learning," in Machine Learning A Bayesian and Optimization Perspective, Elsevier, 2015, pp. 875-936.
J.-T. Chien, "Deep Neural Network," in Source Separation and Machine Learning, Elsevier, 2019, pp. 259-320.
F. C. Pereira and S. S. Borysov, "Machine Learning Fundamentals," in Mobility Patterns, Big Data and Transport Analytics Tools and Applications for Modeling, Elsevier, 2019, pp. 9-29.
Downloads
Published
How to Cite
Issue
Section
License
Please find the rights and licenses in Register: Jurnal Ilmiah Teknologi Sistem Informasi. By submitting the article/manuscript of the article, the author(s) agree with this policy. No specific document sign-off is required.
1. License
The non-commercial use of the article will be governed by the Creative Commons Attribution license as currently displayed on Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
2. Author(s)' Warranties
The author warrants that the article is original, written by stated author(s), has not been published before, contains no unlawful statements, does not infringe the rights of others, is subject to copyright that is vested exclusively in the author and free of any third party rights, and that any necessary written permissions to quote from other sources have been obtained by the author(s).
3. User/Public Rights
Register's spirit is to disseminate articles published are as free as possible. Under the Creative Commons license, Register permits users to copy, distribute, display, and perform the work for non-commercial purposes only. Users will also need to attribute authors and Register on distributing works in the journal and other media of publications. Unless otherwise stated, the authors are public entities as soon as their articles got published.
4. Rights of Authors
Authors retain all their rights to the published works, such as (but not limited to) the following rights;
Copyright and other proprietary rights relating to the article, such as patent rights,
The right to use the substance of the article in own future works, including lectures and books,
The right to reproduce the article for own purposes,
The right to self-archive the article (please read out deposit policy),
The right to enter into separate, additional contractual arrangements for the non-exclusive distribution of the article's published version (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal (Register: Jurnal Ilmiah Teknologi Sistem Informasi).
5. Co-Authorship
If the article was jointly prepared by more than one author, any authors submitting the manuscript warrants that he/she has been authorized by all co-authors to be agreed on this copyright and license notice (agreement) on their behalf, and agrees to inform his/her co-authors of the terms of this policy. Register will not be held liable for anything that may arise due to the author(s) internal dispute. Register will only communicate with the corresponding author.
6. Royalties
Being an open accessed journal and disseminating articles for free under the Creative Commons license term mentioned, author(s) aware that Register entitles the author(s) to no royalties or other fees.
7. Miscellaneous
Register will publish the article (or have it published) in the journal if the article’s editorial process is successfully completed. Register's editors may modify the article to a style of punctuation, spelling, capitalization, referencing and usage that deems appropriate. The author acknowledges that the article may be published so that it will be publicly accessible and such access will be free of charge for the readers as mentioned in point 3.