COMPARISON OF RANDOM FOREST AND NAÏVE BAYES CLASSIFIER METHODS IN SENTIMENT ANALYSIS ON CLIMATE CHANGE ISSUE

  • Fatkhurokhman Fauzi Department of Statistics, Faculty of Mathematics and Natural Science, Universitas Muhammadiyah Semarang, Indonesia
  • Wiwik Setiayani Department of Statistics, Faculty of Mathematics and Natural Science, Universitas Muhammadiyah Semarang, Indonesia
  • Tiani Wahyu Utami Department of Statistics, Faculty of Mathematics and Natural Science, Universitas Muhammadiyah Semarang, Indonesia
  • Eko Yuliyanto Department of Chemistry Education, Faculty of Mathematics and Natural Science, Universitas Muhammadiyah Semarang, Indonesia
  • Iis Widya Harmoko Meteorology Climatology and Geophysics, Indonesia
Keywords: Climate Change, naïve bayes classifier, random forest, sentiment analysis, text mining, twitter

Abstract

The last decade was recorded as a decade with a bad record on the issue of disasters in the world due to climate change. Measuring public opinion is one of the steps to mitigate the impact of climate change. Twitter is a popular social media for channeling opinions. Twitter provides a great source of data for understanding public opinion and the perceived risk of an issue. In recent decades, when discussing climate change, there are those who agree and those who oppose it. Sentiment analysis is a branch of learning in the realm of text mining that is used as a solution to see opinions on a problem, one of which is climate change. In this study, we will try to analyze opinions on climate change issues using the Random Forest and Naïve Bayes classifier methods. Data were obtained from Twitter for the period January 2022-June 2022. The training data used in this research is 80%:20%. There are slightly more positive statements than negative ones. The results obtained with the Naïve Bayes classifier method are an accuracy of 76.25%, an F-1 score of 78%, and a recall of 80%. While the results of the random forest method are 70.6% accuracy, 69% F-1 score, and 63% recall. The Nive Bayes method is better than the Random Forest method for classifying climate change opinions with an accuracy of 76.25%.

Downloads

Download data is not yet available.

References

Y. Chen, J. Li, and A. Chen, “Does high risk mean high loss: Evidence from flood disaster in southern China,” Science of The Total Environment, vol. 785, p. 147127, 2021, doi: https://doi.org/10.1016/j.scitotenv.2021.147127.

B. A. Revich, “Toward the assessment of the Russian population mortality risk factors and the feasibility of their reduction: Comments on the World Bank Report ‘dying Too Young,’” Stud Russ Econ Dev, vol. 17, no. 6, pp. 643–654, Dec. 2006, doi: 10.1134/S1075700706060104.

S. Solomon, Climate change 2007 : the physical science basis : contribution of Working Group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge University Press, 2007.

Y. Suryadi, D. N. Sugianto, and Hadiyanto, “Climate Change in Indonesia (Case Study: Medan, Palembang, Semarang),” E3S Web of Conferences, vol. 31, pp. 3–8, 2018, doi: 10.1051/e3sconf/20183109017.

Supari, F. Tangang, L. Juneng, and E. Aldrian, “Observed changes in extreme temperature and precipitation over Indonesia,” International Journal of Climatology, vol. 37, no. 4, pp. 1979–1997, 2017, doi: https://doi.org/10.1002/joc.4829.

B. Dahal, S. A. P. Kumar, and Z. Li, “Topic modeling and sentiment analysis of global climate change tweets,” Soc Netw Anal Min, vol. 9, no. 1, p. 24, 2019, doi: 10.1007/s13278-019-0568-8.

B. Zhang Lei and Liu, “Sentiment Analysis and Opinion Mining,” in Encyclopedia of Machine Learning and Data Mining, G. I. Sammut Claude and Webb, Ed., Boston, MA: Springer US, 2017, pp. 1152–1161. doi: 10.1007/978-1-4899-7687-1_907.

Bahrawi, “SENTIMENT ANALYSIS USING RANDOM FOREST ALGORITHM-ONLINE SOCIAL MEDIA BASED.”

P. Liu, H. han Zhao, J. yu Teng, Y. yan Yang, Y. feng Liu, and Z. wei Zhu, “Parallel naive Bayes algorithm for large-scale Chinese text classification based on spark,” J Cent South Univ, vol. 26, no. 1, pp. 1–12, Jan. 2019, doi: 10.1007/s11771-019-3978-x.

H. Annur, “KLASIFIKASI MASYARAKAT MISKIN MENGGUNAKAN METODE NAÏVE BAYES,” ILKOM Jurnal Ilmiah, vol. 10, no. 2, pp. 160–165, 2018.

C. J. Hinde, R. Stone, D. Xhemali, C. J. Hinde, and R. G. Stone, “Naïve Bayes vs. Decision Trees vs. Neural Networks in the Classification of Training Web Pages,” IJCSI International Journal of Computer Science Issues, vol. 4, no. 1, 2009, [Online]. Available: https://hdl.handle.net/2134/5394.

A. Primajaya and B. N. Sari, “Random Forest Algorithm for Prediction of Precipitation,” Indonesian Journal of Artificial Intelligence and Data Mining (IJAIDM), vol. 1, no. 1, pp. 27–31, 2018.

R. Supriyadi, W. Gata, N. Maulidah, A. Fauzi, I. Komputer, and S. Nusa Mandiri Jalan Margonda Raya No, “Penerapan Algoritma Random Forest Untuk Menentukan Kualitas Anggur Merah,” JURNAL ILMIAH EKONOMI DAN BISNIS, vol. 13, no. 2, pp. 67–75, 2020, [Online]. Available: http://journal.stekom.ac.id/index.php/E-Bisnis■page67

M. Lydiri, Y. el Habouz, and H. Zougagh, “Sentiment Analysis Decision System for Tracking Climate Change Opinion in Twitter,” in Business Intelligence, M. Fakir, M. Baslam, and R. el Ayachi, Eds., Cham: Springer International Publishing, 2022, pp. 188–196.

B. Dahal, S. A. P. Kumar, and Z. Li, “Topic modeling and sentiment analysis of global climate change tweets,” Soc Netw Anal Min, vol. 9, no. 1, Dec. 2019, doi: 10.1007/s13278-019-0568-8.

A. Sabika, “SENTIRESEARCH: LEXICON-BASED WEB APPLICATION FOR INDONESIAN SENTIMENT ANALYSIS,” 2020.

P. M. Dikky and A. Indriani, “IMPLEMENTASI TEXT MINING DALAM KLASIFIKASI BUKU DENGAN METODE NAÏVE BAYES CLASSIFIER STUDI KASUS PADA PERPUSTAKAAN STMIK PPKIA TARAKANITA RAHMAWATI,” in Seminar Nasional Inovasi dan Tren, 2014, pp. 243–247.

V. F. Rodriguez-Galiano, B. Ghimire, J. Rogan, M. Chica-Olmo, and J. P. Rigol-Sanchez, “An assessment of the effectiveness of a random forest classifier for land-cover classification,” ISPRS Journal of Photogrammetry and Remote Sensing, vol. 67, pp. 93–104, 2012, doi: https://doi.org/10.1016/j.isprsjprs.2011.11.002.

Published
2023-09-30
How to Cite
[1]
F. Fauzi, W. Setiayani, T. Utami, E. Yuliyanto, and I. Harmoko, “COMPARISON OF RANDOM FOREST AND NAÏVE BAYES CLASSIFIER METHODS IN SENTIMENT ANALYSIS ON CLIMATE CHANGE ISSUE”, BAREKENG: J. Math. & App., vol. 17, no. 3, pp. 1439-1448, Sep. 2023.