A comparative study of classification algorithms for sentiment analysis of COVID-19 vaccine opinions using machine learning

[ X ]

Tarih

2024

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Altınbaş Üniversitesi / Lisansüstü Eğitim Enstitüsü

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

The task of analyzing textual data and classifying them into positive, negative, or neutral emotions within the domain of natural language processing is a multifaceted undertaking. The primary objective of this study is to employ machine learning algorithms in order to classify opinions pertaining to the coronavirus disease and vaccines. In this study, four algorithms were employed, namely Random Forest (RF), Gradient Boosting Classifier (GBC), Logistic Regression (LR), and Decision Tree (DT). The RF and GBC algorithms demonstrated a commendable accuracy rate of 89%, while the LR and DT algorithms yielded a slightly lower accuracy rate of 87%. The findings derived from this research can provide valuable guidance to policymakers in effectively addressing potential barriers that may impede the successful execution of vaccination campaigns. The analysis of the Kaggle data, which encompasses a wide range of commentaries related to the pandemic and vaccines, underscores the urgent need for prompt measures to attain herd immunity against Covid-19. This imperative objective holds significant importance in effectively managing the transmission of the virus and mitigating its adverse consequences on the well-being of the general population. The task at hand necessitates the acknowledgment and resolution of public apprehensions, as well as the establishment of trust and assurance in the vaccination initiative. This study presents an analysis of the machine learning techniques employed and conducts a comparative evaluation of their significance. The forthcoming research endeavors to create an application that will be capable of categorizing sentiments and opinions pertaining to diseases and vaccines. It is imperative for governments and organizations to comprehend the obstacles linked to the worldwide COVID-19 vaccination endeavor in order to develop efficacious strategies. Nevertheless, it is crucial to acknowledge that the scope of the analysis was restricted to tweets written in the English language. This limitation may potentially undermine the credibility and generalizability of the findings pertaining to overall sentiment. Additional investigation could be conducted to examine more extensive Twitter datasets employing deep learning models in order to gain a deeper comprehension of the general public's attitudes towards COVID-19 vaccines.

Açıklama

Anahtar Kelimeler

Twitter Sentiment Analysis, NLP, RF, DT, LR, GBC, Covid-19 Vaccines

Kaynak

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Zainulabdeen, D. S. Z. (2024). A comparative study of classification algorithms for sentiment analysis of COVID-19 vaccine opinions using machine learning. (Yayınlanmamış yüksek lisans tezi). Altınbaş Üniversitesi, Lisansüstü Eğitim Enstitüsü, İstanbul.

Koleksiyon