You are in:Home/Publications/Performance Evaluation of Statistical and Machine Learning Models

Assist. Ayman Samy Mahmoud Hegazy :: Publications:

Title:
Performance Evaluation of Statistical and Machine Learning Models
Authors: M M Ramadan; D S El-telbany; Ayman S. Hegazy
Year: 2023
Keywords: Machine Learning; Hyperparameter Tuning; Cross-validation; Whitening Transformation
Journal: مجلة الدراسات والبحوث التجارية
Volume: Not Available
Issue: Not Available
Pages: 10
Publisher: Not Available
Local/International: Local
Paper Link: Not Available
Full paper Ayman Samy Mahmoud Hegazy_My Paper.pdf
Supplementary materials Not Available
Abstract:

Machine learning algorithms have gained popularity in recent years in many fields due to their promising results in predictive performance of classification problems. The application of machine-learning algorithms has also been highly simplified in the last years due to their well-documented integration in commonly used statistical programming languages (such as R or Python). Machine learning is a subsection of Artificial Intelligence (AI), it is one of the most promising tools in classification and it a model that aims to discover the unknown function, dependence, or structure between input and output variables. This study proposes statistical and machine learning models to diagnose anemia disease. Some machine learning techniques have been used in this work to avoid overfitting, pre-process the data and adjust the outliers to give better results. Three classifiers, including Logistic Regression, k-Nearest Neighbor and Decision Tree are implemented in this work. The performance of the models is evaluated based confusion matrix, recall, precision, f1-score, accuracy, Matthews correlation coefficient and ROC curve to compute area under the curve (AUC). The results show the logistic regression has the highest accuracy of 99.57%, with recall values of 99.41%, precision values of 99.61, f1-score of 99.51% and Matthews correlation coefficient values of 99.13%. Decision tree has the second highest accuracy of 98.64%, with recall values of 99.02%, precision values of 97.87, f1-score of 98.44%, and Matthews correlation coefficient values of 97.23%.

Google ScholarAcdemia.eduResearch GateLinkedinFacebookTwitterGoogle PlusYoutubeWordpressInstagramMendeleyZoteroEvernoteORCIDScopus