Evaluation of the Impact of Labeling Quality and Class Imbalance on Sentiment Classification of the Palestine–Israel Conflict

Salvia Devi Muhshanah, Evi Maria

Abstract


This study aims to evaluate the performance of sentiment classification on social media data related to the Palestine–Israel conflict, with a particular emphasis on the role of labeling quality and data distribution. The proposed approach combines TF-IDF text representation with lexicon-based labeling using InSet, along with two classification algorithms: Support Vector Machine (SVM) and Random Forest. The dataset was collected from the social media platform X and consisted of 2,831 Indonesian-language tweets that had undergone preprocessing. The results indicate that the sentiment distribution was dominated by the negative class (39.35%), followed by neutral (38.43%) and positive (22.21%) classes, indicating the presence of class imbalance. The labeling validity evaluation produced a Cohen’s Kappa value of 0.0175, indicating a low level of agreement between automatic labeling and manual annotation. The SVM model achieved an accuracy of 0.69 and a weighted F1-score of 0.68. However, both models demonstrated poor performance on the positive class as the minority class. These findings suggest that the limitations in model performance are not solely caused by the classification algorithms themselves, but are also significantly influenced by labeling quality and data distribution characteristics. This study contributes by emphasizing the importance of comprehensive evaluation throughout the sentiment analysis pipeline, particularly when dealing with complex and uncontrolled data sources such as social media.

Keywords


class imbalance; lexicon-based labeling; sentiment analysis; TF-IDF; text classification

Full Text:

PDF

References


A. Giachanou and F. Crestani, “Like it or not: A Survey of Twitter Sentiment Analysis Methods,” ACM Comput. Surv., Vol. 49, No. 2, pp. 1–41, Jun. 2017, DOI: 10.1145/2938640.

Y. Mao, Q. Liu, and Y. Zhang, “Sentiment Analysis Methods, Applications, and Challenges: A Systematic Literature Review,” J. King Saud Univ. - Comput. Inf. SCI., Vol. 36, No. 4, p. 102048, Apr. 2024, DOI: 10.1016/j.jksuci.2024.102048.

R. Christie, G. Suha Ma’rifa, and J. A. Priliska, “Analisis Konflik Israel dan Palestina terhadap Pelanggaran Hak Asasi Manusia dalam Perspektif Hukum Internasional,” J. Kewarganegaraan, Vol. 8, No. 1, pp. 349–350, 2024.

B. Liu, Sentiment Analysis and Opinion Mining. Morgan & Claypool Publishers, 2012.

S. M. Mohammad, “Sentiment Analysis: Detecting Valence, Emotions, and Other Affectual States from Text,” in Emotion Measurement, H. L. Meiselman, Ed., Elsevier, 2016, pp. 201–237. DOI: 10.1016/B978-0-08-100508-8.00009-6.

Haibo He and E. A. Garcia, “Learning from Imbalanced Data,” IEEE Trans. Knowl. Data Eng., Vol. 21, No. 9, pp. 1263–1284, Sep. 2009, DOI: 10.1109/TKDE.2008.239.

Y. Zhang, R. Jin, and Z.-H. Zhou, “Understanding Bag-of-Words Model: A Statistical Framework,” Int. J. Mach. Learn. Cybern., Vol. 1, No. 1–4, pp. 43–52, Dec. 2010, DOI: 10.1007/s13042-010-0001-0.

M. Taboada, J. Brooke, M. Tofiloski, K. Voll, and M. Stede, “Lexicon-based Methods for Sentiment Analysis,” Comput. Linguist., Vol. 37, No. 2, pp. 267–307, Jun. 2011, DOI: 10.1162/COLI_a_00049.

B. Frenay and M. Verleysen, “Classification in the Presence of Label Noise: A Survey,” IEEE Trans. Neural Networks Learn. Syst., Vol. 25, No. 5, pp. 845–869, May 2014, DOI: 10.1109/TNNLS.2013.2292894.

A. A. Syam, G. Hardy M, A. Salim, D. F. Surianto, and M. Fajar B, “Analisis Teknik Preprocessing pada Sentimen Masyarakat terkait Konflik Israel-Palestina menggunakan Support Vector Machine,” JIPI (Jurnal Ilm. Penelit. dan Pembelajaran Inform., Vol. 9, No. 3, pp. 1465–1467, Aug. 2024, DOI: 10.29100/jipi.v9i3.5527.

D. Deltania, Garno, and A. Jamaludin, “Analisis Sentimen Publik terhadap Invasi Zionis kepada HAMAS menggunakan Support Vector Machine (SVM),” J. Mhs. Tek. Inform., Vol. 8, No. Vol. 8 No. 4 (2024): JATI Vol. 8 No. 4, pp. 4465–4466, Aug. 2024, DOI: https://doi.org/10.36040/jati.v8i4.9959.

F. M. Carina, Admi Salma, Dony Permana, and Zamahsary Martha, “Sentiment Analysis of X Application Users on the Conflict between Israel and Palestine using Support Vector Machine Algorithm,” UNP J. Stat. Data SCI., Vol. 2, No. 2, p. 204, May 2024, DOI: 10.24036/ujsds/vol2-iss2/170.

L. Breiman, “Random Forests,” Mach. Learn., Vol. 45, No. 1, pp. 5–32, Oct. 2001, DOI: 10.1023/A:1010933404324.

T. Wahyudi et al., “Klasifikasi Sentimen X-Twitter Perihal Pemindahan Ibu Kota Indonesia menggunakan Ekstrasi Fitur TF-IDF dan Metode Support Vector Machine (SVM),” J. Keilmuan dan Apl. Bid. Tek. Inform., Vol. 18, No. 2, p. 191, Aug. 2024, DOI: 10.47111/JTI.

F. D. Ananda and Y. Pristyanto, “Analisis Sentimen Pengguna Twitter terhadap Layanan Internet Provider menggunakan Algoritma Support Vector Machine,” MATRIK J. Manajemen, Tek. Inform. dan Rekayasa Komput., Vol. 20, No. 2, p. 410, May 2021, DOI: 10.30812/matrik.v20i2.1130.

R. Azhar and M. F. Wijayanto, “Analisis Sentimen di Twitter: Mengungkap Persepsi dan Emosi Publik Seputar Konflik Palestina-Israel,” Stain. (Seminar Nas. Teknol. Sains), Vol. 3, No. Vol. 3 No. 1 (2024): STAINS (Seminar Nasional Teknologi & Sains), p. 120, 2024, DOI: https://doi.org/10.29407/stains.v3i1.4132.

F. Koto and G. Y. Rahmaningtyas, “Inset Lexicon: Evaluation of a Word List for Indonesian Sentiment Analysis in Microblogs,” Singapura, 2017. DOI: 10.1109/IALP.2017.8300625.

D. Ananda Efraim, “Analisis Sentimen pada Sosial Media Instagram menggunakan Algoritma Naive Bayes (Studi Kasus : Timnas Futsal Indonesia),” Jakarta, Aug. 2023.

J. R. Landis and G. G. Koch, “The Measurement of Observer Agreement for Categorical Data,” Biometrics, Vol. 33, No. 1, pp. 159–174, Mar. 1977, DOI: 10.2307/2529310.

M. R. Adrian, M. P. Putra, M. H. Rafialdy, and N. A. Rakhmawati, “Perbandingan Metode Klasifikasi Random Forest dan SVM pada Analisis Sentimen PSBB,” J. Inform. UPGRIS, Vol. 7, p. 39, Jun. 2021, Accessed: Mar. 29, 2026. [Online]. Available: https://journal.upgris.ac.id/index.php/JIU/article/view/7099/4309

I. Afdhal et al., “Penerapan Algoritma Random Forest untuk Analisis Sentimen Komentar di YouTube tentang Islamofobia,” J. Nas. Komputasi dan Teknol. Inf., Vol. 5, No. 1, p. 124, Feb. 2022, Accessed: Mar. 29, 2026. [Online]. Available: https://repository.uin-suska.ac.id/59747/1/Jurnal Ibnu Afdhal.pdf

P. Kumala Sari and R. Randy Suryono, “Komparasi Algoritma Support Vector Machine dan Random Forest untuk Analisis Sentimen Metaverse,” J. Mnemon., Vol. 7, No. 1, pp. 32–33, Feb. 2024, Accessed: Mar. 29, 2026. [Online]. Available: https://www.ejournal.itn.ac.id/mnemonic/article/view/8977




DOI: https://doi.org/10.32520/stmsi.v15i5.6304

Article Metrics

Abstract view : 0 times
PDF - 0 times

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.