Comparison of Term Weighting Methods in Sentiment Analysis of the New State Capital of Indonesia with the SVM Method

Authors

  • Muhammad Kiko Aulia Reiki Telkom University
  • Yuliant Sibaroni Telkom University
  • Erwin Budi Setiawan Telkom University

DOI:

https://doi.org/10.21108/ijoict.v8i2.681

Keywords:

term weighting, nusantara, Indonesian state capital, sentiment analysis

Abstract

The relocation of the State Capital to “Nusantaraâ€, which was inaugurated with the enactment of UU No. 3 of 2022, is a significant project that has sparked polemics among Indonesian citizens. Many people expressed their opinions and thoughts regarding the relocation of the State Capital on Twitter. This tendency of public opinion needs to be identified with sentiment analysis. In sentiment analysis, term weighting is an essential component to obtain optimal accuracy. Various people are trying to modify the existing term weighting to increase the performance and accuracy of the model. One of them is icf-based or tf-bin.icf, which combines inverse category frequency (ICF) and relevance frequency (RF). This study compares the tf-idf, tf-rf, and tf-bin.icf term weighting with the SVM classification method on the new State Capital of Indonesia topic. The tf-idf weighting results are still the best compared to the tf-bin.icf and tf-rf term weights, with an accuracy score of 88.0% a 1,3% difference with tf-bin.icf term weighting.

Downloads

Download data is not yet available.

References

[1] M. Safiullah, P. Pathak, S. Singh, and A. Anshul, “Social media as an upcoming tool for political marketing effectiveness,” Asia Pacific Management Review, vol. 22, no. 1, pp. 10–15, Mar. 2017, doi: 10.1016/j.apmrv.2016.10.007.
[2] D. Neu, G. Saxton, A. Rahaman, and J. Everett, “Twitter and social accountability: Reactions to the Panama Papers,” Critical Perspectives on Accounting, vol. 61, pp. 38–53, Jun. 2019, doi: 10.1016/j.cpa.2019.04.003.
[3] Presiden Republik Indonesia, “Undang-undang Republik Indonesia Nomor 3 Tahun 2022 Tentang Ibu Kota Negara,” 2022.
[4] V. Effendy, A. Novantirani, and M. K. Sabariah, “Sentiment Analysis on Twitter about the Use of City Public Transportation Using Support Vector Machine Method,” 2016. doi: https://doi.org/10.21108/IJOICT.2016.21.85.
[5] B. Liu, “Sentiment Analysis and Opinion Mining,” Morgan & Claypool Publishers, 2012.
[6] B. Liu, Sentiment Analysis : Mining Opinions, Sentiments, and Emotions. 2015. doi: doi:10.1162/COLIr00259595.
[7] P. Arsi and R. Waluyo, “Analisis Sentimen Wacana Pemindahan Ibu Kota Indonesia Menggunakan Algoritma Support Vector Machine (SVM),” vol. 8, no. 1, pp. 147–156, 2021, doi: 10.25126/jtiik.202183944.
[8] A. Alsaeedi, “A survey of term weighting schemes for text classification,” 2020. doi: https://doi.org/10.1504/IJDMMM.2020.106741.
[9] D. Wang and H. Zhang, “Inverse-Category-Frequency based Supervised Term Weighting Schemes for Text Categorization,” XXX-XXX, 2010. doi: https://doi.org/10.48550/arXiv.1012.2609.
[10] Z. Liu, X. Lv, K. Liu, and S. Shi, “Study on SVM compared with the other text classification methods,” in 2nd International Workshop on Education Technology and Computer Science, ETCS 2010, 2010, vol. 1, pp. 219–222. doi: 10.1109/ETCS.2010.248.
[11] K. Sailunaz and R. Alhajj, “Emotion and sentiment analysis from Twitter text,” J Comput Sci, vol. 36, Sep. 2019, doi: 10.1016/j.jocs.2019.05.009.
[12] P. Gandhi, S. Bhatia, and N. Alkhaldi, “Sentiment Analysis Using Deep Learning,” IET, 2021, pp. 204–211.
[13] A. P. Natasuwarna STMIK Pontianak Jurusan Sistem Informasi, “Analisis Sentimen Keputusan Pemindahan Ibu Kota Negara Menggunakan Klasifikasi Naive Bayes,” SEMINAR NASIONAL SISTEM INFORMASI dan TEKNIK INFORMATIKA, pp. 47–54, 2019.
[14] G. Domeniconi, G. Moro, R. Pasolini, and C. Sartori, “A Comparison of Term Weighting Schemes for Text Classification and Sentiment Analysis with a Supervised Variant of tf.idf,” Communications in Computer and Information Science, vol. 584, p. v, 2016, doi: 10.1007/978-3-319-30162-4.
[15] E. Uwiragiye and K. L. Rhinehardt, “TFIDF-Random Forest: Prediction of Aptamer-Protein Interacting Pairs,” IEEE/ACM Trans Comput Biol Bioinform, 2021, doi: 10.1109/TCBB.2021.3098709.
[16] B. Trstenjak, S. Mikac, and D. Donko, “KNN with TF-IDF based framework for text categorization,” in Procedia Engineering, 2014, vol. 69, pp. 1356–1364. doi: 10.1016/j.proeng.2014.03.129.
[17] S. Fransiska and A. Irham Gufroni, “Sentiment Analysis Provider by.U on Google Play Store Reviews with TF-IDF and Support Vector Machine (SVM) Method,” Scientific Journal of Informatics, vol. 7, no. 2, pp. 2407–7658, 2020, [Online]. Available: http://journal.unnes.ac.id/nju/index.php/sji
[18] S. Styawati and K. Mustofa, “A Support Vector Machine-Firefly Algorithm for Movie Opinion Data Classification,” IJCCS (Indonesian Journal of Computing and Cybernetics Systems), vol. 13, no. 3, p. 219, Jul. 2019, doi: 10.22146/ijccs.41302.
[19] A. Kowalczyk, “Support vector machines succinctly,” Syncfusion Inc, 2017.
[20] M. Jamaluddin and A. D. Wibawa, “Patient Diagnosis Classification based on Electronic Medical Record using Text Mining and Support Vector Machine,” in 2021 International Seminar on Application for Technology of Information and Communication (iSemantic), 2021, pp. 243–248. doi: 10.1109/iSemantic52711.2021.9573178.
[21] S. Patnaik and X. Li, “Proceedings of International Conference on Computer Science and Information Technology,” 2013. [Online]. Available: http://www.springer.com/series/11156
[22] F. Colas and P. Brazdil, “Comparison of SVM and Some Older Classification Algorithms in Text Classification Tasks,” 2006.
[23] E. Haddi, X. Liu, and Y. Shi, “The role of text pre-processing in sentiment analysis,” in Procedia Computer Science, 2013, vol. 17, pp. 26–32. doi: 10.1016/j.procs.2013.05.005.

Downloads

Published

2023-01-03

How to Cite

Muhammad Kiko Aulia Reiki, Sibaroni, Y., & Setiawan, E. B. (2023). Comparison of Term Weighting Methods in Sentiment Analysis of the New State Capital of Indonesia with the SVM Method . International Journal on Information and Communication Technology (IJoICT), 8(2), 53–65. https://doi.org/10.21108/ijoict.v8i2.681

Issue

Section

Data Science