Comparison of Term Weighting Methods in Sentiment Analysis of the New State Capital of Indonesia with the SVM Method
DOI:
https://doi.org/10.21108/ijoict.v8i2.681Keywords:
term weighting, nusantara, Indonesian state capital, sentiment analysisAbstract
The relocation of the State Capital to “Nusantaraâ€, which was inaugurated with the enactment of UU No. 3 of 2022, is a significant project that has sparked polemics among Indonesian citizens. Many people expressed their opinions and thoughts regarding the relocation of the State Capital on Twitter. This tendency of public opinion needs to be identified with sentiment analysis. In sentiment analysis, term weighting is an essential component to obtain optimal accuracy. Various people are trying to modify the existing term weighting to increase the performance and accuracy of the model. One of them is icf-based or tf-bin.icf, which combines inverse category frequency (ICF) and relevance frequency (RF). This study compares the tf-idf, tf-rf, and tf-bin.icf term weighting with the SVM classification method on the new State Capital of Indonesia topic. The tf-idf weighting results are still the best compared to the tf-bin.icf and tf-rf term weights, with an accuracy score of 88.0% a 1,3% difference with tf-bin.icf term weighting.
Downloads
References
[2] D. Neu, G. Saxton, A. Rahaman, and J. Everett, “Twitter and social accountability: Reactions to the Panama Papers,†Critical Perspectives on Accounting, vol. 61, pp. 38–53, Jun. 2019, doi: 10.1016/j.cpa.2019.04.003.
[3] Presiden Republik Indonesia, “Undang-undang Republik Indonesia Nomor 3 Tahun 2022 Tentang Ibu Kota Negara,†2022.
[4] V. Effendy, A. Novantirani, and M. K. Sabariah, “Sentiment Analysis on Twitter about the Use of City Public Transportation Using Support Vector Machine Method,†2016. doi: https://doi.org/10.21108/IJOICT.2016.21.85.
[5] B. Liu, “Sentiment Analysis and Opinion Mining,†Morgan & Claypool Publishers, 2012.
[6] B. Liu, Sentiment Analysis : Mining Opinions, Sentiments, and Emotions. 2015. doi: doi:10.1162/COLIr00259595.
[7] P. Arsi and R. Waluyo, “Analisis Sentimen Wacana Pemindahan Ibu Kota Indonesia Menggunakan Algoritma Support Vector Machine (SVM),†vol. 8, no. 1, pp. 147–156, 2021, doi: 10.25126/jtiik.202183944.
[8] A. Alsaeedi, “A survey of term weighting schemes for text classification,†2020. doi: https://doi.org/10.1504/IJDMMM.2020.106741.
[9] D. Wang and H. Zhang, “Inverse-Category-Frequency based Supervised Term Weighting Schemes for Text Categorization,†XXX-XXX, 2010. doi: https://doi.org/10.48550/arXiv.1012.2609.
[10] Z. Liu, X. Lv, K. Liu, and S. Shi, “Study on SVM compared with the other text classification methods,†in 2nd International Workshop on Education Technology and Computer Science, ETCS 2010, 2010, vol. 1, pp. 219–222. doi: 10.1109/ETCS.2010.248.
[11] K. Sailunaz and R. Alhajj, “Emotion and sentiment analysis from Twitter text,†J Comput Sci, vol. 36, Sep. 2019, doi: 10.1016/j.jocs.2019.05.009.
[12] P. Gandhi, S. Bhatia, and N. Alkhaldi, “Sentiment Analysis Using Deep Learning,†IET, 2021, pp. 204–211.
[13] A. P. Natasuwarna STMIK Pontianak Jurusan Sistem Informasi, “Analisis Sentimen Keputusan Pemindahan Ibu Kota Negara Menggunakan Klasifikasi Naive Bayes,†SEMINAR NASIONAL SISTEM INFORMASI dan TEKNIK INFORMATIKA, pp. 47–54, 2019.
[14] G. Domeniconi, G. Moro, R. Pasolini, and C. Sartori, “A Comparison of Term Weighting Schemes for Text Classification and Sentiment Analysis with a Supervised Variant of tf.idf,†Communications in Computer and Information Science, vol. 584, p. v, 2016, doi: 10.1007/978-3-319-30162-4.
[15] E. Uwiragiye and K. L. Rhinehardt, “TFIDF-Random Forest: Prediction of Aptamer-Protein Interacting Pairs,†IEEE/ACM Trans Comput Biol Bioinform, 2021, doi: 10.1109/TCBB.2021.3098709.
[16] B. Trstenjak, S. Mikac, and D. Donko, “KNN with TF-IDF based framework for text categorization,†in Procedia Engineering, 2014, vol. 69, pp. 1356–1364. doi: 10.1016/j.proeng.2014.03.129.
[17] S. Fransiska and A. Irham Gufroni, “Sentiment Analysis Provider by.U on Google Play Store Reviews with TF-IDF and Support Vector Machine (SVM) Method,†Scientific Journal of Informatics, vol. 7, no. 2, pp. 2407–7658, 2020, [Online]. Available: http://journal.unnes.ac.id/nju/index.php/sji
[18] S. Styawati and K. Mustofa, “A Support Vector Machine-Firefly Algorithm for Movie Opinion Data Classification,†IJCCS (Indonesian Journal of Computing and Cybernetics Systems), vol. 13, no. 3, p. 219, Jul. 2019, doi: 10.22146/ijccs.41302.
[19] A. Kowalczyk, “Support vector machines succinctly,†Syncfusion Inc, 2017.
[20] M. Jamaluddin and A. D. Wibawa, “Patient Diagnosis Classification based on Electronic Medical Record using Text Mining and Support Vector Machine,†in 2021 International Seminar on Application for Technology of Information and Communication (iSemantic), 2021, pp. 243–248. doi: 10.1109/iSemantic52711.2021.9573178.
[21] S. Patnaik and X. Li, “Proceedings of International Conference on Computer Science and Information Technology,†2013. [Online]. Available: http://www.springer.com/series/11156
[22] F. Colas and P. Brazdil, “Comparison of SVM and Some Older Classification Algorithms in Text Classification Tasks,†2006.
[23] E. Haddi, X. Liu, and Y. Shi, “The role of text pre-processing in sentiment analysis,†in Procedia Computer Science, 2013, vol. 17, pp. 26–32. doi: 10.1016/j.procs.2013.05.005.
Downloads
Published
How to Cite
Issue
Section
License
Manuscript submitted to IJoICT has to be an original work of the author(s), contains no element of plagiarism, and has never been published or is not being considered for publication in other journals. Author(s) shall agree to assign all copyright of published article to IJoICT. Requests related to future re-use and re-publication of major or substantial parts of the article must be consulted with the editors of IJoICT.