Comparative Evaluation and Comprehensive Analysis of Machine Learning Models for Regression Problems

Authors: Boran Sekeroglu ^1,5, Yoney Kirsal Ever ^2,5, Kamil Dimililer ^3,5, Fadi Al-Turjman ^4,5
Affiliations:

1. Information Systems Engineering Department, Near East University, Nicosia, Cyprus, Mersin 10, Turkey

2. Software Engineering Department, Near East University, Nicosia, Cyprus, Mersin 10, Turkey

3. Electrical and Electronic Engineering Department, Near East University, Nicosia, Cyprus, Mersin 10, Turkey

4. Artificial Intelligence Engineering Department, Near East University, Nicosia, Cyprus, Mersin 10, Turkey

5. Research Centre for AI and IoT, Near East University, Nicosia, Cyprus, Mersin 10, Turkey
Corresponding author: Boran Sekeroglu, Email: boran.sekeroglu@neu.edu.tr
Submitted: 2022-11-28 21:54:58

Abstract: Artificial intelligence and machine learning applications are important in almost every field of human life, where they solve problems or support human experts. However, determining which machine learning model will achieve superior results for a particular problem across the wide range of real-life application areas remains a challenging task for researchers. The success of a model can be affected by several factors, such as dataset characteristics, training strategy, and model responses. Therefore, a comprehensive analysis is required to determine model ability and the efficiency of the considered strategies. This study implemented ten benchmark machine learning models on seventeen varied datasets. Experiments were performed using four training strategies: 60:40, 70:30, and 80:20 hold-out splits and five-fold cross-validation. Three evaluation metrics were used to assess the experimental results: mean squared error, mean absolute error, and the coefficient of determination (R2 score). The considered models are analyzed, and each model's advantages, disadvantages, and data dependencies are indicated. Across the large number of experiments performed, the deep Long Short-Term Memory (LSTM) neural network outperformed the other considered models, namely decision tree, linear regression, support vector regression with linear and radial basis function kernels, random forest, gradient boosting, extreme gradient boosting, shallow neural network, and deep neural network. The results also show that cross-validation has a tremendous impact on experimental results and should be considered for model evaluation in regression studies where data mining or selection is not performed.
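
The evaluation protocol described in the abstract (three hold-out ratios plus five-fold cross-validation, each scored with MSE, MAE, and R2) can be illustrated with a minimal sketch. This is not the authors' code. It assumes synthetic one-dimensional data and a simple least-squares line as a stand-in for the ten models, and every function name below (`fit_line`, `hold_out`, `k_fold`, `run_experiment`) is hypothetical. Averaging the per-fold metrics is one common convention and is also an assumption here.

```python
import random


def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept (stand-in regressor)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def score(train_idx, test_idx, xs, ys):
    """Train on train_idx, then report all three metrics on test_idx."""
    slope, intercept = fit_line([xs[i] for i in train_idx],
                                [ys[i] for i in train_idx])
    y_true = [ys[i] for i in test_idx]
    y_pred = [slope * xs[i] + intercept for i in test_idx]
    return {"mse": mse(y_true, y_pred),
            "mae": mae(y_true, y_pred),
            "r2": r2(y_true, y_pred)}


def hold_out(n, train_fraction, rng):
    """Shuffle indices once and cut them into train and test parts."""
    idx = list(range(n))
    rng.shuffle(idx)
    cut = round(n * train_fraction)
    return idx[:cut], idx[cut:]


def k_fold(n, k, rng):
    """Yield (train, test) index lists; each sample is tested exactly once."""
    idx = list(range(n))
    rng.shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i, test_idx in enumerate(folds):
        train_idx = [j for m, fold in enumerate(folds) if m != i for j in fold]
        yield train_idx, test_idx


def run_experiment(seed=0, n=100):
    """Score the stand-in model under all four training strategies."""
    rng = random.Random(seed)
    # Synthetic data (an assumption, not one of the paper's datasets).
    xs = [rng.uniform(0.0, 10.0) for _ in range(n)]
    ys = [2.0 * x + 1.0 + rng.gauss(0.0, 1.0) for x in xs]
    results = {}
    for name, fraction in (("60:40", 0.6), ("70:30", 0.7), ("80:20", 0.8)):
        train_idx, test_idx = hold_out(n, fraction, rng)
        results[name] = score(train_idx, test_idx, xs, ys)
    fold_scores = [score(tr, te, xs, ys) for tr, te in k_fold(n, 5, rng)]
    # Average each metric over the five folds.
    results["5-fold CV"] = {
        m: sum(s[m] for s in fold_scores) / len(fold_scores)
        for m in ("mse", "mae", "r2")
    }
    return results


if __name__ == "__main__":
    for strategy, metrics in run_experiment().items():
        print(strategy, {k: round(v, 3) for k, v in metrics.items()})
```

A hold-out score depends on a single random split, whereas the cross-validated score averages over five disjoint test folds that together cover every sample. That difference is one plausible reason a choice of validation strategy can change which model appears best.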

Keywords: Machine learning; Regression; Comparative evaluation; Analysis; Validation

Journal: DATA INTELLIGENCE
Category: Computer Science >> Integrated Theory of Computer Science
Cite as: ChinaXiv:202211.00424 (or this version ChinaXiv:202211.00424V1)
DOI: 10.1162/dint_a_00155
CSTR:32003.36.ChinaXiv.202211.00424.V1
Recommended citation: Boran Sekeroglu, Yoney Kirsal Ever, Kamil Dimililer, Fadi Al-Turjman. (2022). Comparative Evaluation and Comprehensive Analysis of Machine Learning Models for Regression Problems. DATA INTELLIGENCE. doi: 10.1162/dint_a_00155

Version history: [V1] 2022-11-28 21:54:58, ChinaXiv:202211.00424V1
