SENTIMENT ANALYSIS OF THE INDONESIAN NATIONAL TEAM AT THE 2023 ASIAN CUP USING INDONESIAN-LANGUAGE TRANSFORMER MODELS
DOI:
https://doi.org/10.36341/rabit.v10i2.6142
Keywords:
Sentiment Analysis, Transformer, IndoBERT, IndoRoBERTa, DistilBERT Multilingual
Abstract
This study analyzes public sentiment towards the Indonesian National Team during the 2023 Asian Cup, based on Instagram comments and Indonesian-language Transformer models. A total of 21,045 comments were collected from the official @timnasindonesia account; 17,829 of them were retained for analysis after preprocessing, which consisted of text cleaning, case folding, normalization, tokenization, stopword removal, and stemming. The comments were then labeled automatically as positive, negative, or neutral using the Indonesian Sentiment Lexicon (InSet). Three Transformer models (IndoBERT, IndoRoBERTa, and DistilBERT Multilingual) were fine-tuned and compared against an SVM + TF-IDF baseline, with accuracy, precision, recall, and F1-score as evaluation metrics. IndoBERT with a learning rate of 5e-5 performed best, reaching an accuracy of 0.8897 and an F1-score of 0.8859 and outperforming the other models. The analysis takes into account the typical challenges of Instagram comments, such as slang, abbreviations, emojis, and mixed Indonesian-English usage. The findings support the effectiveness of monolingual Transformer models for Indonesian, which have rarely been compared systematically on social media data. They can also help the management of the Indonesian National Team and sports policy makers evaluate public responses in real time, and can serve as a reference for developing adaptive, context-aware public opinion analysis systems.
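The lexicon-based labeling and the evaluation metrics mentioned in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: `TOY_LEXICON` is a hypothetical six-word list and not the actual InSet entries or weights, the sign-of-sum decision rule is one common way to turn lexicon scores into three classes, and the macro-averaged F1 is only one possible averaging scheme (the abstract does not state which one was used). The function names `preprocess`, `label_comment`, `accuracy`, and `macro_f1` are likewise hypothetical.

```python
import re

# Hypothetical toy lexicon for illustration only; NOT the real InSet entries or weights.
TOY_LEXICON = {
    "bagus": 3, "hebat": 4, "menang": 3,
    "jelek": -3, "kalah": -3, "kecewa": -4,
}


def preprocess(comment):
    """Case-fold, replace non-letter characters with spaces, and tokenize on whitespace."""
    text = comment.lower()
    text = re.sub(r"[^a-z\s]", " ", text)
    return text.split()


def label_comment(comment, lexicon=TOY_LEXICON):
    """Assign a sentiment class from the sign of the summed word scores (assumed rule)."""
    score = sum(lexicon.get(token, 0) for token in preprocess(comment))
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the reference labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred, classes=("positive", "negative", "neutral")):
    """Unweighted mean of per-class F1 scores (macro averaging is an assumption)."""
    f1_scores = []
    for c in classes:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        precision = tp / (tp + fp) if (tp + fp) else 0.0
        recall = tp / (tp + fn) if (tp + fn) else 0.0
        if precision + recall:
            f1_scores.append(2 * precision * recall / (precision + recall))
        else:
            f1_scores.append(0.0)
    return sum(f1_scores) / len(f1_scores)


if __name__ == "__main__":
    comments = ["Timnas HEBAT, menang!!!", "kecewa, kalah lagi", "pertandingan jam berapa?"]
    labels = [label_comment(c) for c in comments]
    print(labels)
```

In a setup like the one described, labels produced this way would serve as training targets for the classifiers, and the metric functions would be applied to model predictions on a held-out split.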
License
Copyright (c) 2025 Rabit : Jurnal Teknologi dan Sistem Informasi Univrab

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright Notice
The copyright of the received article shall be assigned to the publisher of the journal. The intended copyright includes the right to publish the article in various forms (including reprints). The journal maintains the publishing rights to published articles. Therefore, the author must submit a statement of the Copyright Transfer Agreement.
In line with the license, authors and any users (readers and other researchers) are allowed to share and adapt the material for non-commercial purposes only. In addition, they must give appropriate credit, provide a link to the license, and indicate if changes were made. If authors remix, transform, or build upon the material, they must distribute their contributions under the same license as the original.
The rights and licenses that apply to RABIT : Jurnal Teknologi dan Sistem Informasi Univrab are set out below. By submitting an article or manuscript, the author(s) accept this policy.
1. License
Non-commercial use of the article is governed by the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
2. Author’s Warranties
The author warrants that the article is original, written by stated author(s), has not been published before, contains no unlawful statements, does not infringe the rights of others, is subject to copyright that is vested exclusively in the author and free of any third party rights, and that any necessary written permissions to quote from other sources have been obtained by the author(s).
3. User Rights
RABIT aims to disseminate published articles as freely as possible. Under the Creative Commons license, RABIT permits users to copy, distribute, display, and perform the work for non-commercial purposes only. Users must also attribute the authors and RABIT when distributing works from the journal.
4. Rights of Authors
Authors retain all their rights to the published works, including (but not limited to) the following:
- Copyright and other proprietary rights relating to the article, such as patent rights,
- The right to use the substance of the article in the author's own future works, including lectures and books,
- The right to reproduce the article for the author's own purposes,
- The right to self-archive the article,
- The right to enter into separate, additional contractual arrangements for the non-exclusive distribution of the article's published version (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal (RABIT : Jurnal Teknologi dan Sistem Informasi Univrab).
5. Co-Authorship
If the article was prepared jointly with other authors, the author submitting the manuscript warrants that he/she has been authorized by all co-authors to agree to this copyright and license notice (agreement) on their behalf, and agrees to inform his/her co-authors of the terms of this policy. RABIT will not be held liable for anything that may arise from internal disputes among the authors. RABIT will communicate only with the corresponding author.
6. Royalties
This agreement entitles the author to no royalties or other fees. To such extent as legally permissible, the author waives his or her right to collect royalties relative to the article in respect of any use of the article by RABIT.
7. Miscellaneous
RABIT will publish the article (or have it published) in the journal if the article’s editorial process is successfully completed. RABIT's editors may modify the article to conform to a style of punctuation, spelling, capitalization, referencing, and usage that they deem appropriate. The author acknowledges that the article may be published so that it is publicly accessible, and that such access will be free of charge for readers, as mentioned in point 3.