Publicación:
Model Proposal for the Detection of False Information About COVID-19 Using Machine Learning and Natural Language Processing Techniques

dc.contributor.authorBolaños, Yair Andrey Salinas
dc.contributor.authorTicona, Wilfredo
dc.date.accessioned2025-08-11T16:44:01Z
dc.date.issued2023
dc.description.abstractOne of the main problems that arose as a result of this health emergency was the circulation of false information on COVID-19. Therefore, the study carried out aimed to find the best classifier of false information on COVID-19 in the Peruvian context. For this, 2022 information records related to COVID-19 were collected through web scraping of websites, Facebook and Twitter, which were manually labeled as True or False and then validated. Natural Language Processing techniques such as Bag of Words, TF-IDF, Word2Vec and FastText were used for feature extraction. Finally, different Machine Learning model were developed using KNN, Decision Tree, Naive Bayes, SVM, Logistic Regression and MLP. The results were evaluated according to the Accuracy, Precision, Recall and F1-score metrics. The best model resulted from the combination of the SVM algorithm (C (0.5), gamma (1) and kernel (rbf)) with TF - IDF of dimension 300 and n-grams from 1 to 2, whose metrics were superior to the others with 87.41% Accuracy, 88.63% Precision, 87.39% Recall and 88% F1-score. © 2023 Latin American and Caribbean Consortium of Engineering Institutions. All rights reserved.
dc.identifier.scopus2-s2.0-85172331361
dc.identifier.urihttps://cris.esan.edu.pe/handle/20.500.12640/709
dc.identifier.uuid12ae075b-2a0e-47d7-9751-f5fcbe6db5e1
dc.language.isoen
dc.publisherLatin American and Caribbean Consortium of Engineering Institutions
dc.relation.ispartofProceedings of the LACCEI international Multi-conference for Engineering, Education and Technology
dc.rightshttp://purl.org/coar/access_right/c_14cb
dc.subjectAccuracy
dc.subjectCOVID-19
dc.subjectFalse information
dc.subjectMachine Learning
dc.subjectNatural Language Processing
dc.titleModel Proposal for the Detection of False Information About COVID-19 Using Machine Learning and Natural Language Processing Techniques
dc.typehttp://purl.org/coar/resource_type/c_5794
dspace.entity.typePublication

Archivos

Colecciones