Retweet Prediction Based on User-Based, Content-Based, and Time-Based Features Using ANN Optimized with GWO
Keywords:Social Media; Twitter; Retweet; Prediction; Imbalance Class; Undersampling; Oversampling; Artificial Neural Network
Social media has emerged as immensely popular and favored platforms among the masses today. Twitter, being one of the most renowned social media platforms, allows users to express themselves through tweet postings. Retweeting is a crucial feature on Twitter, enabling users to disseminate tweets authored by others. In this context, this research aims to predict retweet behavior using User-Based, Content-Based, and Time-Based features, coupled with an Artificial Neural Network classifier optimized with Grey Wolf Optimization. One of the challenges in retweet prediction lies in class imbalance, where the number of retweets on certain tweets is significantly disproportionate compared to others. To address this issue, this study implements undersampling and oversampling techniques. Undersampling reduces the number of samples from the majority class, whereas oversampling involves duplicating or synthesizing samples from the minority class, thereby creating class balance. The research successfully achieves promising results in retweet prediction. After applying oversampling techniques, the classification process attains an accuracy of 85.58%, precision of 87.77%, recall of 83.92%, and F1-score of 85.80%. These results demonstrate the effectiveness of the proposed method in retweet prediction and handling class imbalance issues
D. Henry, E. Stattner, and M. Collard, “Social media, diffusion under influence of parameters: survey and perspectives,” Procedia Comput Sci, vol. 109, pp. 376–383, 2017.
M. Broersma and T. Graham, “Twitter as a news source: How Dutch and British newspapers used tweets in their news coverage, 2007–2011,” Journalism practice, vol. 7, no. 4, pp. 446–464, 2013.
I. Daga, A. Gupta, R. Vardhan, and P. Mukherjee, “Prediction of likes and retweets using text information retrieval,” Procedia Comput Sci, vol. 168, pp. 123–128, 2020.
B. Suh, L. Hong, P. Pirolli, and E. H. Chi, “Want to be retweeted? large scale analytics on factors impacting retweet in twitter network,” in 2010 IEEE second international conference on social computing, IEEE, 2010, pp. 177–184.
A. Kupavskii et al., “Prediction of retweet cascade size over time,” in Proceedings of the 21st ACM international conference on Information and knowledge management, 2012, pp. 2335–2338.
D. Boyd, S. Golder, and G. Lotan, “Tweet, tweet, retweet: Conversational aspects of retweeting on twitter,” in 2010 43rd Hawaii international conference on system sciences, IEEE, 2010, pp. 1–10.
T. B. N. Hoang and J. Mothe, “Predicting information diffusion on Twitter–Analysis of predictive features,” J Comput Sci, vol. 28, pp. 257–264, 2018.
R. Rakes, J. Jondri, and K. M. Lhaksamana, “Prediksi Retweet Berdasarkan Feature User-based Menggunakan Metode Klasifikasi Support Vector Machine,” eProceedings of Engineering, vol. 8, no. 5, 2021.
S. Kakar, D. Dhaka, and M. Mehrotra, “Value-Based Retweet Prediction on Twitter,” Informatica, vol. 45, no. 2, 2021.
S. B. Maind and P. Wankar, “Research paper on basic of artificial neural network,” International Journal on Recent and Innovation Trends in Computing and Communication, vol. 2, no. 1, pp. 96–100, 2014.
A. Fong, C. Sibley, A. Cole, C. Baldwin, and J. Coyne, “A comparison of artificial neural networks, logistic regressions, and classification trees for modeling mental workload in real-time,” in Proceedings of the Human Factors and Ergonomics Society Annual Meeting, SAGE Publications Sage CA: Los Angeles, CA, 2010, pp. 1709–1712.
Y. Chen, L. Song, Y. Liu, L. Yang, and D. Li, “A review of the artificial neural network models for water quality prediction,” Applied Sciences, vol. 10, no. 17, p. 5776, 2020.
R. Dastres and M. Soori, “Artificial Neural Network Systems,” International Journal of Imaging and Robotics, vol. 21, pp. 13–25, Mar. 2021.
B. Eftekhar, K. Mohammad, H. E. Ardebili, M. Ghodsi, and E. Ketabchi, “Comparison of artificial neural network and logistic regression models for prediction of mortality in head trauma based on initial clinical data,” BMC Med Inform Decis Mak, vol. 5, no. 1, pp. 1–8, 2005.
A. J. Paulson, R. A. Prabhavathy, S. Rekh, and E. Brindha, “Application of neural network for prediction of compressive strength of silica fume concrete,” Int. J. Civ. Eng. Technol, vol. 10, no. 2, pp. 1859–1867, 2019.
G. Negi, A. Kumar, S. Pant, and M. Ram, “Optimization of complex system reliability using hybrid grey wolf optimizer,” Decision Making: Applications in Management and Engineering, vol. 4, no. 2, pp. 241–256, 2021.
Y. Li, X. Lin, and J. Liu, “An improved gray wolf optimization algorithm to solve engineering problems,” Sustainability, vol. 13, no. 6, p. 3208, 2021.
S. Mirjalili, S. M. Mirjalili, and A. Lewis, “Grey wolf optimizer,” Advances in engineering software, vol. 69, pp. 46–61, 2014.
Q. Al-Tashi, H. Md Rais, S. J. Abdulkadir, S. Mirjalili, and H. Alhussian, “A review of grey wolf optimizer-based feature selection methods for classification,” Evolutionary Machine Learning Techniques: Algorithms and Applications, pp. 273–286, 2020.
E. T. Arifin, J. Jondri, and I. Indwiarti, “Prediction Retweet Using User-Based and Content-Based with ANN-GA Classification Method,” Building of Informatics, Technology and Science (BITS), vol. 4, no. 2, pp. 522–528, 2022.
Bila bermanfaat silahkan share artikel ini
Berikan Komentar Anda terhadap artikel Retweet Prediction Based on User-Based, Content-Based, and Time-Based Features Using ANN Optimized with GWO
Copyright (c) 2023 Irgi Aditya Rachman, Jondri Jondri, Kemas Muslim L
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under Creative Commons Attribution 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (Refer to The Effect of Open Access).