Retweet Prediction Based on User-Based, Content-Based, and Time-Based Features Using ANN Optimized with GWO


Authors

  • Irgi Aditya Rachman Telkom University, Bandung, Indonesia
  • Jondri Jondri Telkom University, Bandung, Indonesia
  • Kemas Muslim L Telkom University, Bandung, Indonesia

DOI:

https://doi.org/10.30865/klik.v4i1.1067

Keywords:

Social Media; Twitter; Retweet; Prediction; Imbalance Class; Undersampling; Oversampling; Artificial Neural Network

Abstract

Social media has emerged as immensely popular and favored platforms among the masses today. Twitter, being one of the most renowned social media platforms, allows users to express themselves through tweet postings. Retweeting is a crucial feature on Twitter, enabling users to disseminate tweets authored by others. In this context, this research aims to predict retweet behavior using User-Based, Content-Based, and Time-Based features, coupled with an Artificial Neural Network classifier optimized with Grey Wolf Optimization. One of the challenges in retweet prediction lies in class imbalance, where the number of retweets on certain tweets is significantly disproportionate compared to others. To address this issue, this study implements undersampling and oversampling techniques. Undersampling reduces the number of samples from the majority class, whereas oversampling involves duplicating or synthesizing samples from the minority class, thereby creating class balance. The research successfully achieves promising results in retweet prediction. After applying oversampling techniques, the classification process attains an accuracy of 85.58%, precision of 87.77%, recall of 83.92%, and F1-score of 85.80%. These results demonstrate the effectiveness of the proposed method in retweet prediction and handling class imbalance issues

Downloads

Download data is not yet available.

References

D. Henry, E. Stattner, and M. Collard, “Social media, diffusion under influence of parameters: survey and perspectives,” Procedia Comput Sci, vol. 109, pp. 376–383, 2017.

M. Broersma and T. Graham, “Twitter as a news source: How Dutch and British newspapers used tweets in their news coverage, 2007–2011,” Journalism practice, vol. 7, no. 4, pp. 446–464, 2013.

I. Daga, A. Gupta, R. Vardhan, and P. Mukherjee, “Prediction of likes and retweets using text information retrieval,” Procedia Comput Sci, vol. 168, pp. 123–128, 2020.

B. Suh, L. Hong, P. Pirolli, and E. H. Chi, “Want to be retweeted? large scale analytics on factors impacting retweet in twitter network,” in 2010 IEEE second international conference on social computing, IEEE, 2010, pp. 177–184.

A. Kupavskii et al., “Prediction of retweet cascade size over time,” in Proceedings of the 21st ACM international conference on Information and knowledge management, 2012, pp. 2335–2338.

D. Boyd, S. Golder, and G. Lotan, “Tweet, tweet, retweet: Conversational aspects of retweeting on twitter,” in 2010 43rd Hawaii international conference on system sciences, IEEE, 2010, pp. 1–10.

T. B. N. Hoang and J. Mothe, “Predicting information diffusion on Twitter–Analysis of predictive features,” J Comput Sci, vol. 28, pp. 257–264, 2018.

R. Rakes, J. Jondri, and K. M. Lhaksamana, “Prediksi Retweet Berdasarkan Feature User-based Menggunakan Metode Klasifikasi Support Vector Machine,” eProceedings of Engineering, vol. 8, no. 5, 2021.

S. Kakar, D. Dhaka, and M. Mehrotra, “Value-Based Retweet Prediction on Twitter,” Informatica, vol. 45, no. 2, 2021.

S. B. Maind and P. Wankar, “Research paper on basic of artificial neural network,” International Journal on Recent and Innovation Trends in Computing and Communication, vol. 2, no. 1, pp. 96–100, 2014.

A. Fong, C. Sibley, A. Cole, C. Baldwin, and J. Coyne, “A comparison of artificial neural networks, logistic regressions, and classification trees for modeling mental workload in real-time,” in Proceedings of the Human Factors and Ergonomics Society Annual Meeting, SAGE Publications Sage CA: Los Angeles, CA, 2010, pp. 1709–1712.

Y. Chen, L. Song, Y. Liu, L. Yang, and D. Li, “A review of the artificial neural network models for water quality prediction,” Applied Sciences, vol. 10, no. 17, p. 5776, 2020.

R. Dastres and M. Soori, “Artificial Neural Network Systems,” International Journal of Imaging and Robotics, vol. 21, pp. 13–25, Mar. 2021.

B. Eftekhar, K. Mohammad, H. E. Ardebili, M. Ghodsi, and E. Ketabchi, “Comparison of artificial neural network and logistic regression models for prediction of mortality in head trauma based on initial clinical data,” BMC Med Inform Decis Mak, vol. 5, no. 1, pp. 1–8, 2005.

A. J. Paulson, R. A. Prabhavathy, S. Rekh, and E. Brindha, “Application of neural network for prediction of compressive strength of silica fume concrete,” Int. J. Civ. Eng. Technol, vol. 10, no. 2, pp. 1859–1867, 2019.

G. Negi, A. Kumar, S. Pant, and M. Ram, “Optimization of complex system reliability using hybrid grey wolf optimizer,” Decision Making: Applications in Management and Engineering, vol. 4, no. 2, pp. 241–256, 2021.

Y. Li, X. Lin, and J. Liu, “An improved gray wolf optimization algorithm to solve engineering problems,” Sustainability, vol. 13, no. 6, p. 3208, 2021.

S. Mirjalili, S. M. Mirjalili, and A. Lewis, “Grey wolf optimizer,” Advances in engineering software, vol. 69, pp. 46–61, 2014.

Q. Al-Tashi, H. Md Rais, S. J. Abdulkadir, S. Mirjalili, and H. Alhussian, “A review of grey wolf optimizer-based feature selection methods for classification,” Evolutionary Machine Learning Techniques: Algorithms and Applications, pp. 273–286, 2020.

E. T. Arifin, J. Jondri, and I. Indwiarti, “Prediction Retweet Using User-Based and Content-Based with ANN-GA Classification Method,” Building of Informatics, Technology and Science (BITS), vol. 4, no. 2, pp. 522–528, 2022.


Bila bermanfaat silahkan share artikel ini

Berikan Komentar Anda terhadap artikel Retweet Prediction Based on User-Based, Content-Based, and Time-Based Features Using ANN Optimized with GWO

ARTICLE HISTORY


Published: 2023-08-10
Abstract View: 30 times
PDF Download: 18 times