ANALISIS SENTIMEN TERHADAP TIMNAS INDONESIA DI PIALA ASIA 2023 DENGAN MODEL TRANSFORMER BERBAHASA INDONESIA

Authors

  • Muhammad Irfan Abidin Universitas Muhammadiyah Surakarta
  • Endang Wahyu Pamungkas Universitas Muhammadiyah Surakarta

DOI:

https://doi.org/10.36341/rabit.v10i2.6142

Keywords:

Sentiment Analysis, Transformer, IndoBERT, IndoRoBERTa, DistilBERT Multilingual

Abstract

This study aims to analyze public sentiment towards the Indonesian National Team during the 2023 Asian Cup through Instagram comments, using the Indonesian-language Transformer model. A total of 21,045 comments were collected from the official @timnasindonesia account and filtered into 17,829 comments worthy of analysis after going through preprocessing processes such as text cleaning, case folding, normalization, tokenization, stopword removal, and stemming. Comments were then automatically classified into three sentiment classes, namely positive, negative, and neutral, using the Indonesian Sentiment Lexicon (InSet). Three Transformer models were used, namely IndoBERT, IndoRoBERTa, and DistilBERT Multilingual, and compared to the baseline SVM + TF-IDF. The evaluation used accuracy, precision, recall, and F1-score metrics. The results showed that IndoBERT (learning rate 5e-5) gave the best performance with an accuracy of 0.8897 and an F1-score of 0.8859, outperforming other models. The analysis was conducted by considering the typical challenges of Instagram comments such as slang, abbreviations, emojis, and mixed use of Indonesian-English. These findings validate the effectiveness of the monolingual Transformer model for Indonesian, which is still rarely compared systematically in the context of social media. These findings can also be used by the management of the Indonesian National Team or sports policy makers to evaluate public responses in real-time, as well as being a reference in developing an adaptive and contextual public opinion analysis system.

Downloads

Download data is not yet available.

References

M. Kholilullah, M. Martanto, and U. Hayati, “ANALISIS SENTIMEN PENGGUNA TWITTER(X) TENTANG PIALA DUNIA USIA 17 MENGGUNAKAN METODE NAIVE BAYES,” JATI (Jurnal Mahasiswa Teknik Informatika), vol. 8, no. 1, pp. 392–398, Feb. 2024, doi: 10.36040/JATI.V8I1.8378.

R. H. Muhammadi, T. G. Laksana, and A. B. Arifa, “Combination of Support Vector Machine and Lexicon-Based Algorithm in Twitter Sentiment Analysis,” Khazanah Informatika : Jurnal Ilmu Komputer dan Informatika, vol. 8, no. 1, pp. 59–71, Mar. 2022, doi: 10.23917/KHIF.V8I1.15213.

A. Agustin, S. Andrean, S. Susanti, R. Rahmiati, and H. Hamdani, “REVIEW APLIKASI KREDIVO MENGGUNAKAN ANALISIS SENTIMEN DENGAN ALGORITMA SUPPORT VECTOR MACHINE,” Rabit : Jurnal Teknologi dan Sistem Informasi Univrab, vol. 9, no. 1, pp. 39–49, Dec. 2024, doi: 10.36341/RABIT.V9I1.4107.

H. M. Rorong, K. Santa, and V. Peggie Rantung, “Sentimen Analisis U-17 Pada Media Sosial X Dengan Metode Support Vector Machine: Sentiment Analysis of U-17 on Social Media X Using the Support Vector Machine Method,” JOURNAL OF INFORMATICS, BUSINESS, EDUCATION AND INNOVATION TECHNOLOGY, vol. 10, no. 9, pp. 94–100, Nov. 2024, Accessed: Mar. 05, 2025. [Online]. Available: https://jibeit.teknikinformatika.org/index.php/jibeit/article/view/186

Abd. C. Fauzan and K. Hikmah, “IMPLEMENTASI ALGORITMA NAIVE BAYES DALAM ANALISIS POLARISASI OPINI MASYARAKAT TERKAIT VAKSIN COVID-19,” Rabit : Jurnal Teknologi dan Sistem Informasi Univrab, vol. 7, no. 2, pp. 122–128, Jul. 2022, doi: 10.36341/RABIT.V7I2.2403.

S. P. Tanzil and M. R. Pribadi, “Analisis Sentimen Pengguna Instagram terhadap Timnas Indonesia U-23 pada Piala AFC menggunakan Algoritma K-Nearest Neighbor (K-NN) dengan SMOTE,” Telekontran : Jurnal Ilmiah Telekomunikasi, Kendali dan Elektronika Terapan, vol. 12, no. 1, pp. 68–80, Jul. 2024, doi: 10.34010/TELEKONTRAN.V12I1.12869.

R. A. Laksono, K. R. Sungkono, R. Sarno, and C. S. Wahyuni, “Sentiment analysis of restaurant customer reviews on tripadvisor using naïve bayes,” Proceedings of 2019 International Conference on Information and Communication Technology and Systems, ICTS 2019, pp. 49–54, Jul. 2019, doi: 10.1109/ICTS.2019.8850982.

G. Z. Nabiilah, S. Y. Prasetyo, Z. N. Izdihar, and A. S. Girsang, “BERT base model for toxic comment analysis on Indonesian social media,” Procedia Comput Sci, vol. 216, pp. 714–721, Jan. 2023, doi: 10.1016/J.PROCS.2022.12.188.

Y. Asri, D. Kuswardani, L. F. M. Horhoruw, and S. A. Ramadhana, MACHINE LEARNING & DEEP LEARNING: Analisis Sentiment Menggunakan Ulasan Pengguna Aplikasi. Jawa Timur: Uwais Inspirasi Indonesia, 2024.

H. Imaduddin, F. Y. A’la, and Y. S. Nugroho, “Sentiment Analysis in Indonesian Healthcare Applications using IndoBERT Approach,” International Journal of Advanced Computer Science and Applications, vol. 14, no. 8, pp. 113–117, 2023, doi: 10.14569/IJACSA.2023.0140813.

R. Kusnadi, Y. Yusuf, A. Andriantony, R. A. Yaputra, and M. Caintan, “ANALISIS SENTIMEN TERHADAP GAME GENSHIN IMPACT MENGGUNAKAN BERT,” Rabit : Jurnal Teknologi dan Sistem Informasi Univrab, vol. 6, no. 2, pp. 122–129, Jul. 2021, doi: 10.36341/RABIT.V6I2.1765.

A. Jannani, T. Amzil, N. Sael, and S. Bouhsissin, “Sentiment-Annotated Hibapress: A Moroccan News Arabic Dataset (SAHMNAD) predicted using Fine-Tuned Arabic Language Models and Zero-Shot LLMs,” 2025 5th International Conference on Innovative Research in Applied Science, Engineering and Technology, IRASET 2025, 2025, doi: 10.1109/IRASET64571.2025.11008106.

F. F. Rachman and S. Pramana, “Analisis Sentimen Pro dan Kontra Masyarakat Indonesia tentang Vaksin COVID-19 pada Media Sosial Twitter,” Indonesian of Health Information Management Journal (INOHIM), vol. 8, no. 2, pp. 100–109, Dec. 2020, doi: 10.47007/INOHIM.V8I2.223.

T. Baldwin and Y. Li, “An In-depth Analysis of the Effect of Text Normalization in Social Media,” NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, pp. 420–429, 2015, doi: 10.3115/V1/N15-1045.

F. Koto and G. Y. Rahmaningtyas, “Inset lexicon: Evaluation of a word list for Indonesian sentiment analysis in microblogs,” Proceedings of the 2017 International Conference on Asian Language Processing, IALP 2017, vol. 2018-January, pp. 391–394, Jul. 2017, doi: 10.1109/IALP.2017.8300625.

B. Wilie et al., “IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding,” Sep. 2020, Accessed: Feb. 27, 2025. [Online]. Available: https://arxiv.org/abs/2009.05387v3

F. Koto, A. Rahimi, J. H. Lau, and T. Baldwin, “IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP,” COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Conference, pp. 757–770, Nov. 2020, doi: 10.18653/v1/2020.coling-main.66.

J. Devlin, M.-W. Chang, K. Lee, K. T. Google, and A. I. Language, “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding,” Proceedings of the 2019 Conference of the North, pp. 4171–4186, 2019, doi: 10.18653/V1/N19-1423.

Y. Liu et al., “RoBERTa: A Robustly Optimized BERT Pretraining Approach,” Jul. 2019, Accessed: Apr. 14, 2025. [Online]. Available: https://arxiv.org/abs/1907.11692v1

B. Richardson and A. Wicaksana, “Comparison of IndoBERT-lite and RoBERTa in Text Mining for Indonesian Language Question Answering Application,” International Journal of Innovative Computing, Information and Control, vol. 18, no. 06, pp. 1719-, Dec. 2022, doi: 10.24507/IJICIC.18.06.1719.

V. Sanh, L. Debut, J. Chaumond, and T. Wolf, “DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter,” Oct. 2019, Accessed: Apr. 14, 2025. [Online]. Available: https://arxiv.org/abs/1910.01108v4

T. Liu, S. Li, Y. Dong, Y. Mo, and S. He, “Spam Detection and Classification Based on DistilBERT Deep Learning Algorithm,” Applied Science & Engineering Journal for Advanced Research Peer Reviewed and Refereed Journal ISSN, pp. 6–10, 2024, doi: 10.5281/zenodo.11180575.

F. Barbieri, L. E. Anke, and J. Camacho-Collados, “XLM-T: Multilingual Language Models in Twitter for Sentiment Analysis and Beyond,” 2022. Accessed: Jun. 26, 2025. [Online]. Available: https://aclanthology.org/2022.lrec-1.27/

Published

2025-07-10

How to Cite

[1]
M. I. Abidin and E. W. Pamungkas, “ANALISIS SENTIMEN TERHADAP TIMNAS INDONESIA DI PIALA ASIA 2023 DENGAN MODEL TRANSFORMER BERBAHASA INDONESIA”, rabit, vol. 10, no. 2, pp. 482–496, Jul. 2025.

Issue

Section

Articles

Similar Articles

> >> 

You may also start an advanced similarity search for this article.