Model Proposal for the Detection of False Information About COVID-19 Using Machine Learning and Natural Language Processing Techniques

Bolaños, Yair Andrey Salinas; Ticona, Wilfredo

Publicación:
Model Proposal for the Detection of False Information About COVID-19 Using Machine Learning and Natural Language Processing Techniques

dc.contributor.author	Bolaños, Yair Andrey Salinas
dc.contributor.author	Ticona, Wilfredo
dc.date.accessioned	2025-08-11T16:44:01Z
dc.date.issued	2023
dc.description.abstract	One of the main problems that arose as a result of this health emergency was the circulation of false information on COVID-19. Therefore, the study carried out aimed to find the best classifier of false information on COVID-19 in the Peruvian context. For this, 2022 information records related to COVID-19 were collected through web scraping of websites, Facebook and Twitter, which were manually labeled as True or False and then validated. Natural Language Processing techniques such as Bag of Words, TF-IDF, Word2Vec and FastText were used for feature extraction. Finally, different Machine Learning model were developed using KNN, Decision Tree, Naive Bayes, SVM, Logistic Regression and MLP. The results were evaluated according to the Accuracy, Precision, Recall and F1-score metrics. The best model resulted from the combination of the SVM algorithm (C (0.5), gamma (1) and kernel (rbf)) with TF - IDF of dimension 300 and n-grams from 1 to 2, whose metrics were superior to the others with 87.41% Accuracy, 88.63% Precision, 87.39% Recall and 88% F1-score. © 2023 Latin American and Caribbean Consortium of Engineering Institutions. All rights reserved.
dc.identifier.scopus	2-s2.0-85172331361
dc.identifier.uri	http://hdl.handle.net/20.500.14929/709
dc.identifier.uuid	12ae075b-2a0e-47d7-9751-f5fcbe6db5e1
dc.language.iso	en
dc.publisher	Latin American and Caribbean Consortium of Engineering Institutions
dc.relation.ispartof	Proceedings of the LACCEI international Multi-conference for Engineering, Education and Technology
dc.rights	http://purl.org/coar/access_right/c_14cb
dc.subject	Accuracy
dc.subject	COVID-19
dc.subject	False information
dc.subject	Machine Learning
dc.subject	Natural Language Processing
dc.subject.ocde	https://purl.org/pe-repo/ocde/ford#2.02.03
dc.title	Model Proposal for the Detection of False Information About COVID-19 Using Machine Learning and Natural Language Processing Techniques
dc.type	http://purl.org/coar/resource_type/c_5794
dspace.entity.type	Publication

Colecciones

Publicaciones

Publicación: Model Proposal for the Detection of False Information About COVID-19 Using Machine Learning and Natural Language Processing Techniques

Archivos

Colecciones

Publicación:
Model Proposal for the Detection of False Information About COVID-19 Using Machine Learning and Natural Language Processing Techniques